BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 024104
         (272 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|118485488|gb|ABK94599.1| unknown [Populus trichocarpa]
          Length = 268

 Score =  427 bits (1097), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 204/263 (77%), Positives = 234/263 (88%), Gaps = 1/263 (0%)

Query: 11  LFFQLLTL-QFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC 69
           L  QLL L QF SIISAIRKDIGF+++  C+ TVQGRYLLSDDNGYVCDA+S+DP SRCC
Sbjct: 6   LSLQLLILFQFSSIISAIRKDIGFEQSRSCRNTVQGRYLLSDDNGYVCDALSVDPPSRCC 65

Query: 70  PEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDF 129
           PE G +FSCQGCNL+SQCCNSYE+CVSCC++P+RT + QVL VKIAKP+TA +Y SVFDF
Sbjct: 66  PENGLRFSCQGCNLVSQCCNSYEFCVSCCLHPSRTQEAQVLKVKIAKPSTAVSYTSVFDF 125

Query: 130 CAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDS 189
           CAGRCRHNSESVVHENAY SDFHHCFS+PSN+SGA  TQLE RL GI+V+IG+QGESCDS
Sbjct: 126 CAGRCRHNSESVVHENAYRSDFHHCFSLPSNSSGANYTQLEARLGGINVVIGKQGESCDS 185

Query: 190 VCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLY 249
           VCKS+GQSCVLNKL++LNQC+++QKYMSCK GCLAS+G DQPAEV +DAP +LNP ACLY
Sbjct: 186 VCKSNGQSCVLNKLLVLNQCDVMQKYMSCKGGCLASIGTDQPAEVVEDAPRHLNPGACLY 245

Query: 250 TRIQSMLSCDGSHRHTRRLCPCA 272
           TR QS+LSCDGS  HTRRLCPCA
Sbjct: 246 TRTQSLLSCDGSLWHTRRLCPCA 268


>gi|297743774|emb|CBI36657.3| unnamed protein product [Vitis vinifera]
          Length = 273

 Score =  410 bits (1053), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 199/273 (72%), Positives = 227/273 (83%), Gaps = 1/273 (0%)

Query: 1   MPKSRLNLLFLFFQLLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAV 60
           M  SRL L  LF  +L +Q LS +SAIRKD    E   C+TTVQGRYL +DDNG+VCDA+
Sbjct: 1   MSNSRLILSLLFRFILIIQILSRVSAIRKDPNLLEKRICRTTVQGRYLRTDDNGHVCDAL 60

Query: 61  SMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTA 120
           SMDPQS CC  K E+FSC GCNL+SQCCNSYE+CVSCC+NPART KE  L VK+AKP TA
Sbjct: 61  SMDPQSGCCFGKREQFSCHGCNLISQCCNSYEFCVSCCLNPARTQKELALKVKMAKPVTA 120

Query: 121 GTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQ-LEGRLSGISVI 179
           G Y SVFDFC GRCRHNSESVVHENAYLSDFHHCFS+PSN++  GV +  E RL+GI V+
Sbjct: 121 GAYASVFDFCTGRCRHNSESVVHENAYLSDFHHCFSLPSNSTAGGVDRNPELRLNGIHVV 180

Query: 180 IGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAP 239
           +GRQGESCDSVCKS+GQSCV +KL +LNQCEI+QKYMSCK  CLAS+GADQPAEV DDAP
Sbjct: 181 VGRQGESCDSVCKSNGQSCVPSKLSVLNQCEIMQKYMSCKGACLASIGADQPAEVVDDAP 240

Query: 240 SNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
            +LNP ACLYTR Q++LSCDGSH+HTRRLCPCA
Sbjct: 241 RHLNPGACLYTRTQTILSCDGSHQHTRRLCPCA 273


>gi|359478124|ref|XP_002264803.2| PREDICTED: UPF0454 protein C12orf49 homolog [Vitis vinifera]
          Length = 279

 Score =  409 bits (1052), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 199/273 (72%), Positives = 227/273 (83%), Gaps = 1/273 (0%)

Query: 1   MPKSRLNLLFLFFQLLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAV 60
           M  SRL L  LF  +L +Q LS +SAIRKD    E   C+TTVQGRYL +DDNG+VCDA+
Sbjct: 7   MSNSRLILSLLFRFILIIQILSRVSAIRKDPNLLEKRICRTTVQGRYLRTDDNGHVCDAL 66

Query: 61  SMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTA 120
           SMDPQS CC  K E+FSC GCNL+SQCCNSYE+CVSCC+NPART KE  L VK+AKP TA
Sbjct: 67  SMDPQSGCCFGKREQFSCHGCNLISQCCNSYEFCVSCCLNPARTQKELALKVKMAKPVTA 126

Query: 121 GTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQ-LEGRLSGISVI 179
           G Y SVFDFC GRCRHNSESVVHENAYLSDFHHCFS+PSN++  GV +  E RL+GI V+
Sbjct: 127 GAYASVFDFCTGRCRHNSESVVHENAYLSDFHHCFSLPSNSTAGGVDRNPELRLNGIHVV 186

Query: 180 IGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAP 239
           +GRQGESCDSVCKS+GQSCV +KL +LNQCEI+QKYMSCK  CLAS+GADQPAEV DDAP
Sbjct: 187 VGRQGESCDSVCKSNGQSCVPSKLSVLNQCEIMQKYMSCKGACLASIGADQPAEVVDDAP 246

Query: 240 SNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
            +LNP ACLYTR Q++LSCDGSH+HTRRLCPCA
Sbjct: 247 RHLNPGACLYTRTQTILSCDGSHQHTRRLCPCA 279


>gi|224112759|ref|XP_002332721.1| predicted protein [Populus trichocarpa]
 gi|222832884|gb|EEE71361.1| predicted protein [Populus trichocarpa]
          Length = 245

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 191/245 (77%), Positives = 221/245 (90%)

Query: 28  RKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQC 87
           RKDIGF+++  C+ TVQGRYLLSDDNGYVCDA+S+DP SRCCPE G +FSCQGCNL+SQC
Sbjct: 1   RKDIGFEQSRSCRNTVQGRYLLSDDNGYVCDALSVDPPSRCCPENGLRFSCQGCNLVSQC 60

Query: 88  CNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           CNSYE+CVSCC++P+RT + QVL VKIAKP+TA +Y SVFDFCAGRCRHNSESVVHENAY
Sbjct: 61  CNSYEFCVSCCLHPSRTQEAQVLKVKIAKPSTAVSYTSVFDFCAGRCRHNSESVVHENAY 120

Query: 148 LSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLN 207
            SDFHHCFS+PSN+SGA  TQLE RL GI+V+IG+QGESCDSVCKS+GQSCVLNKL++LN
Sbjct: 121 RSDFHHCFSLPSNSSGANYTQLEARLGGINVVIGKQGESCDSVCKSNGQSCVLNKLLVLN 180

Query: 208 QCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRR 267
           QC+++QKYMSCK GCLAS+G DQPAEV +DAP +LNP ACLYTR QS+LSCDGS  HTRR
Sbjct: 181 QCDVMQKYMSCKGGCLASIGTDQPAEVVEDAPRHLNPGACLYTRTQSLLSCDGSLWHTRR 240

Query: 268 LCPCA 272
           LCPCA
Sbjct: 241 LCPCA 245


>gi|356538903|ref|XP_003537940.1| PREDICTED: uncharacterized protein LOC100782588 [Glycine max]
          Length = 281

 Score =  407 bits (1046), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 192/258 (74%), Positives = 221/258 (85%)

Query: 15  LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE 74
           L+   F S I+AIRKD+GFQ    CKTTVQGRYLLSDDNGYVC+A S++ +SRCCP++GE
Sbjct: 24  LVIFHFSSTITAIRKDVGFQIIHPCKTTVQGRYLLSDDNGYVCNAASVNSRSRCCPQRGE 83

Query: 75  KFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRC 134
           KFSC GCNLLSQCCNSYEYCVSCC+NPA   KEQVL +K+AKPTTA TY SVFD+CAGRC
Sbjct: 84  KFSCHGCNLLSQCCNSYEYCVSCCLNPALISKEQVLKMKVAKPTTARTYSSVFDYCAGRC 143

Query: 135 RHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSS 194
           RH+SESVVHENAY+SDFHHCFS+ SN+SG   T  E RLSGI+V++GRQGESC+SVCKS 
Sbjct: 144 RHSSESVVHENAYISDFHHCFSLSSNSSGKNSTLTEARLSGINVVVGRQGESCNSVCKSR 203

Query: 195 GQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQS 254
           GQSCV NKL++LN C+IIQKYMSCK  CLAS+GADQPAEV  DAP +LNP +CLYT  QS
Sbjct: 204 GQSCVPNKLVVLNSCDIIQKYMSCKGSCLASVGADQPAEVVYDAPVHLNPGSCLYTETQS 263

Query: 255 MLSCDGSHRHTRRLCPCA 272
           +LSCDG H+HTRRLCPCA
Sbjct: 264 ILSCDGLHQHTRRLCPCA 281


>gi|297817236|ref|XP_002876501.1| hypothetical protein ARALYDRAFT_486406 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322339|gb|EFH52760.1| hypothetical protein ARALYDRAFT_486406 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 272

 Score =  390 bits (1003), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 190/272 (69%), Positives = 224/272 (82%)

Query: 1   MPKSRLNLLFLFFQLLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAV 60
           M  S  +L      L+   F   +SAIRKDIGF E   C+TTVQGRYL+SDD G VCD +
Sbjct: 1   MSTSGFSLFLAVLHLMGFLFPLRVSAIRKDIGFLEERSCRTTVQGRYLISDDEGNVCDVL 60

Query: 61  SMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTA 120
           S++ ++RCCP KGE+FSC GCNLLSQCCNSYE+CVSCC+NP+RTL E+V+ VK+AKP TA
Sbjct: 61  SLESRTRCCPWKGERFSCHGCNLLSQCCNSYEFCVSCCLNPSRTLLEKVVKVKVAKPATA 120

Query: 121 GTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVII 180
           GTY SVFDFCAGRCRHNSESVVHENAY S+FHHCFS+ SNASGA +TQ+E RL GI VI+
Sbjct: 121 GTYKSVFDFCAGRCRHNSESVVHENAYHSEFHHCFSLTSNASGANLTQVETRLLGIDVIV 180

Query: 181 GRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPS 240
           G QG+SCD+VCKS GQ CV+NKL LLNQC+++++YM+CK  CLAS GADQPAEV +DAP 
Sbjct: 181 GSQGDSCDAVCKSRGQLCVMNKLSLLNQCDVMKRYMTCKGSCLASAGADQPAEVVEDAPR 240

Query: 241 NLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
           +L P ACLYTR QSMLSCDGSH+HTRRLCPCA
Sbjct: 241 DLYPGACLYTRTQSMLSCDGSHQHTRRLCPCA 272


>gi|388496984|gb|AFK36558.1| unknown [Lotus japonicus]
          Length = 272

 Score =  389 bits (999), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 184/255 (72%), Positives = 214/255 (83%), Gaps = 1/255 (0%)

Query: 19  QFLSIISAIRKDIGFQET-PFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFS 77
            F S I+AIRKDIG Q   P CKTTVQGRY L+DDNG+VC A+S+D  SRCCP+  EKFS
Sbjct: 18  HFSSPITAIRKDIGLQSIIPTCKTTVQGRYHLTDDNGFVCSALSVDSMSRCCPQSREKFS 77

Query: 78  CQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHN 137
           C GCN+LSQCCNSYEYCVSCC+NPA T KEQVL +KI+KP TA TY +VFD+CA RCRH+
Sbjct: 78  CHGCNVLSQCCNSYEYCVSCCLNPALTSKEQVLKMKISKPVTARTYENVFDYCAARCRHS 137

Query: 138 SESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQS 197
           SESVVHENAYLSDFHHCFS+PSN+SG  VT  E RL+GI+V++GRQGESC SVCK  GQS
Sbjct: 138 SESVVHENAYLSDFHHCFSLPSNSSGTNVTLTEARLNGINVVVGRQGESCSSVCKLRGQS 197

Query: 198 CVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLS 257
           CV NKL++LN C+IIQ+YMSCK  CLAS+G DQPAEV  DAP +LNP +CLYT  +S+LS
Sbjct: 198 CVPNKLVVLNNCDIIQRYMSCKGSCLASVGTDQPAEVVYDAPKDLNPGSCLYTETESVLS 257

Query: 258 CDGSHRHTRRLCPCA 272
           CDGSH+HT+RLCPCA
Sbjct: 258 CDGSHQHTKRLCPCA 272


>gi|255585501|ref|XP_002533442.1| conserved hypothetical protein [Ricinus communis]
 gi|223526704|gb|EEF28938.1| conserved hypothetical protein [Ricinus communis]
          Length = 251

 Score =  382 bits (981), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 179/242 (73%), Positives = 206/242 (85%)

Query: 1   MPKSRLNLLFLFFQLLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAV 60
           M K  LN   + F L  LQFL +I +IR+DIGF E   C+TTVQGR+LLSDDNG+VCDA+
Sbjct: 4   MSKFGLNAPCILFPLFILQFLPVIFSIRRDIGFHEMKICRTTVQGRFLLSDDNGHVCDAL 63

Query: 61  SMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTA 120
           S+DPQSRCCP +GEKFSC GCNLLS CCNSYE+CVSCC+NPART +E+V+ +KIAKP TA
Sbjct: 64  SLDPQSRCCPVRGEKFSCHGCNLLSHCCNSYEFCVSCCLNPARTQEEEVMKLKIAKPATA 123

Query: 121 GTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVII 180
           GTY S FDFCAGRCRHNSESVVHENAYLSDFHHCFS+PSN++GA   QLE RL GI+VII
Sbjct: 124 GTYTSTFDFCAGRCRHNSESVVHENAYLSDFHHCFSLPSNSTGANYRQLEARLVGINVII 183

Query: 181 GRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPS 240
           GRQGESCDS CKS+GQSCVLNKL++LNQC+I+QKYMSC+  CLASMG DQPAEV D AP 
Sbjct: 184 GRQGESCDSACKSNGQSCVLNKLLVLNQCDIMQKYMSCRGACLASMGTDQPAEVVDIAPK 243

Query: 241 NL 242
           +L
Sbjct: 244 HL 245


>gi|30694985|ref|NP_191498.2| uncharacterized protein [Arabidopsis thaliana]
 gi|42572729|ref|NP_974460.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332646394|gb|AEE79915.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332646395|gb|AEE79916.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 273

 Score =  379 bits (974), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 183/258 (70%), Positives = 221/258 (85%)

Query: 15  LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE 74
           ++   F   +SAIRKDIGF E   C+TTVQGRYL+SDD G VCDA+S++ ++RCCP KGE
Sbjct: 16  IMGFLFPLRVSAIRKDIGFLEERSCRTTVQGRYLISDDEGNVCDALSLESRTRCCPWKGE 75

Query: 75  KFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRC 134
           +FSC GCN+LSQCCNSYE+CVSCC+NP++TL E+V+ VK+AKP T+GTY SVFDFCAGRC
Sbjct: 76  RFSCHGCNILSQCCNSYEFCVSCCLNPSQTLLEKVVKVKVAKPATSGTYKSVFDFCAGRC 135

Query: 135 RHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSS 194
           RHNSESVVHENAY S+FHHCFS+ SNASGA +TQ+E RL GI VI+G QG+SCD+VCKS 
Sbjct: 136 RHNSESVVHENAYHSEFHHCFSLTSNASGANLTQVETRLLGIDVIVGSQGDSCDAVCKSR 195

Query: 195 GQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQS 254
           GQ CV+NKL LLNQC+++++YM+CK  CLAS GADQPAEV +DAP +L P ACLYTR QS
Sbjct: 196 GQLCVMNKLSLLNQCDVMKRYMNCKGSCLASAGADQPAEVVEDAPRDLYPGACLYTRTQS 255

Query: 255 MLSCDGSHRHTRRLCPCA 272
           +LSCDGSH+HTRRLCPCA
Sbjct: 256 LLSCDGSHQHTRRLCPCA 273


>gi|242084782|ref|XP_002442816.1| hypothetical protein SORBIDRAFT_08g003310 [Sorghum bicolor]
 gi|241943509|gb|EES16654.1| hypothetical protein SORBIDRAFT_08g003310 [Sorghum bicolor]
          Length = 276

 Score =  354 bits (909), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 163/253 (64%), Positives = 203/253 (80%)

Query: 20  FLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQ 79
           F + +S IRKDIGF E   C++TVQGR+L+SDDNGYVC A+S++P S CCP  G++FSCQ
Sbjct: 24  FPNPVSGIRKDIGFIEPIVCRSTVQGRHLISDDNGYVCSALSINPWSHCCPTTGDRFSCQ 83

Query: 80  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 139
           GC L  QCCNSYEYCVSCC+NP++  KE VL +K+AKP TAGTY +VFDFC GRCRH+S 
Sbjct: 84  GCKLDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143

Query: 140 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 199
           SVVHENAY SDFHHCFS+  N+SG+  + +  +L GI+V++GR GESC  VCK  GQSCV
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSVVSKLLGINVVVGRPGESCSLVCKVRGQSCV 203

Query: 200 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 259
            ++L +LN+CEI+QKYM CK GC  S+G DQPAEV D+AP++LNP ACLY ++   L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCD 263

Query: 260 GSHRHTRRLCPCA 272
           GSH+HTRRLCPCA
Sbjct: 264 GSHQHTRRLCPCA 276


>gi|218185557|gb|EEC67984.1| hypothetical protein OsI_35752 [Oryza sativa Indica Group]
          Length = 276

 Score =  348 bits (893), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 164/276 (59%), Positives = 212/276 (76%), Gaps = 4/276 (1%)

Query: 1   MPKSRLNLLFLFFQLLTLQFLS----IISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYV 56
           M  + L LL L  +++   F +    +++ IRKDIG      C++TVQGR+L+SDDNGYV
Sbjct: 1   MATTALPLLHLLPRIVCAAFFACLPGLVAGIRKDIGLAAPIMCRSTVQGRHLISDDNGYV 60

Query: 57  CDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAK 116
           C ++S+DP SRCCP  G +FSCQGC L  QCC+SYEYCVSCC+NP+RT +  VL +K+A+
Sbjct: 61  CSSLSIDPWSRCCPRTGSRFSCQGCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVAR 120

Query: 117 PTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGI 176
           P T+GTY +VFDFC GRCRH+S SVVHENAY SDFHHCF +  N+SG+  +    RL GI
Sbjct: 121 PVTSGTYRNVFDFCMGRCRHSSASVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGI 180

Query: 177 SVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVAD 236
           ++ IGR+GESC SVC++ GQSCV ++L +LN+CEI+QKYM CK GC +S+G DQPA+V D
Sbjct: 181 NISIGRRGESCSSVCRAKGQSCVPSRLSVLNKCEILQKYMRCKSGCFSSLGPDQPAQVVD 240

Query: 237 DAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
           +APSNLNP ACLY ++   L+CDGSH+HTRRLCPCA
Sbjct: 241 EAPSNLNPGACLYMQMDERLTCDGSHQHTRRLCPCA 276


>gi|222615817|gb|EEE51949.1| hypothetical protein OsJ_33587 [Oryza sativa Japonica Group]
          Length = 276

 Score =  347 bits (890), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 163/276 (59%), Positives = 212/276 (76%), Gaps = 4/276 (1%)

Query: 1   MPKSRLNLLFLFFQLLTLQFLS----IISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYV 56
           M  + L LL L  +++   F +    +++ IRKDIG      C++TVQGR+L+SDDNGYV
Sbjct: 1   MATTALPLLHLLPRIVCAAFFACLPGLVAGIRKDIGLAAPIMCRSTVQGRHLISDDNGYV 60

Query: 57  CDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAK 116
           C ++S+DP SRCCP  G +FSCQGC L  QCC+SYEYCVSCC+NP+RT +  VL +K+A+
Sbjct: 61  CSSLSIDPWSRCCPRTGSRFSCQGCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVAR 120

Query: 117 PTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGI 176
           P T+GTY +VFDFC GRCRH+S SVVHENAY SDFHHCF +  N+SG+  +    RL GI
Sbjct: 121 PVTSGTYRNVFDFCMGRCRHSSASVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGI 180

Query: 177 SVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVAD 236
           ++ IGR+GESC SVC++ GQSCV ++L +LN+CEI+QKYM CK GC +++G DQPA+V D
Sbjct: 181 NISIGRRGESCSSVCRAKGQSCVPSRLSVLNKCEILQKYMRCKSGCFSNLGPDQPAQVVD 240

Query: 237 DAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
           +APSNLNP ACLY ++   L+CDGSH+HTRRLCPCA
Sbjct: 241 EAPSNLNPGACLYMQMDERLTCDGSHQHTRRLCPCA 276


>gi|357156916|ref|XP_003577619.1| PREDICTED: uncharacterized protein LOC100836694 [Brachypodium
           distachyon]
          Length = 275

 Score =  345 bits (885), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 156/249 (62%), Positives = 196/249 (78%)

Query: 24  ISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNL 83
           +SAIRK+ GF     C+ TVQGR+L+SDDNGYVC ++S+DP S CCP  G +FSCQGC L
Sbjct: 27  VSAIRKETGFAAPIMCRRTVQGRHLISDDNGYVCSSLSIDPWSHCCPRTGARFSCQGCKL 86

Query: 84  LSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVH 143
             QCCNSYEYCVSCC++P+RT +  VL +K+AKP TAGTY ++FDFC GRCRH+S SVVH
Sbjct: 87  DLQCCNSYEYCVSCCLDPSRTKERDVLKLKVAKPVTAGTYANIFDFCMGRCRHSSASVVH 146

Query: 144 ENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKL 203
           ENAY+SDFHHCF +  N SG+  +    RL+GISVI+GRQGE C S C++ GQSCV ++L
Sbjct: 147 ENAYVSDFHHCFMVQQNLSGSTDSNYGSRLAGISVIVGRQGEPCSSTCRAKGQSCVPSRL 206

Query: 204 ILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHR 263
             LN+C+I+QKYM C+ GC  S+G DQPAEV D+AP++LNP ACLY +    L+CDGSH+
Sbjct: 207 SELNKCQILQKYMRCRSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQTDERLTCDGSHQ 266

Query: 264 HTRRLCPCA 272
           HTRR+CPCA
Sbjct: 267 HTRRVCPCA 275


>gi|449469174|ref|XP_004152296.1| PREDICTED: uncharacterized protein LOC101217783 [Cucumis sativus]
          Length = 252

 Score =  343 bits (881), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 170/251 (67%), Positives = 195/251 (77%), Gaps = 19/251 (7%)

Query: 22  SIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGC 81
           S IS+IR +IG      C++TVQGRYLLSDDNGYVCDA+S+DP SRCC  KGEKF C GC
Sbjct: 21  STISSIRTNIGLAVNSSCRSTVQGRYLLSDDNGYVCDALSVDPLSRCCHGKGEKFPCHGC 80

Query: 82  NLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESV 141
           NL+ QCCNSYEYCVSCC NP+RT +EQ+L +KIAKP TAG                    
Sbjct: 81  NLVLQCCNSYEYCVSCCQNPSRTKREQILKIKIAKPATAGK------------------- 121

Query: 142 VHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLN 201
           VHENAY S FHHCFS+PS +SG   TQLE RLSGI+VIIGRQGESCDSVCKS+GQSCV N
Sbjct: 122 VHENAYHSAFHHCFSLPSYSSGDNSTQLEFRLSGINVIIGRQGESCDSVCKSNGQSCVPN 181

Query: 202 KLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGS 261
           K+++LN C+I+QKY+SCK GCLAS+G DQPAEVA DAP  LNP ACLY+  QSM+SCDGS
Sbjct: 182 KILVLNHCDIMQKYLSCKGGCLASVGTDQPAEVAYDAPKELNPGACLYSSAQSMVSCDGS 241

Query: 262 HRHTRRLCPCA 272
           H HT+RLCPCA
Sbjct: 242 HPHTKRLCPCA 252


>gi|226499180|ref|NP_001143347.1| uncharacterized protein LOC100275960 precursor [Zea mays]
 gi|195618642|gb|ACG31151.1| hypothetical protein [Zea mays]
          Length = 276

 Score =  340 bits (872), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 161/253 (63%), Positives = 198/253 (78%)

Query: 20  FLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQ 79
           F S +S IRK IGF +   C++TVQGR+L+SDDNGYVC A+S++P S CCP  G++FSCQ
Sbjct: 24  FPSPVSGIRKGIGFIQPIVCRSTVQGRHLISDDNGYVCSALSINPWSHCCPTTGDRFSCQ 83

Query: 80  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 139
           GC L  QCCNSYEYCVSCC NP++  KE VL +K+AKP TAGTY +VFDFC GRCRH+S 
Sbjct: 84  GCELDLQCCNSYEYCVSCCFNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143

Query: 140 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 199
           SVVHENAY SDFHHCFS+  N+SG+  +    +L GI+V++GR  ESC  VCK  GQSCV
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVGRPRESCSLVCKVRGQSCV 203

Query: 200 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 259
            ++L +LN+CEI+QKYM CK GC  S+G DQPAEV D+AP++LNP ACLY ++   L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCD 263

Query: 260 GSHRHTRRLCPCA 272
           GSH HTRRLCPCA
Sbjct: 264 GSHHHTRRLCPCA 276


>gi|219887191|gb|ACL53970.1| unknown [Zea mays]
 gi|413916125|gb|AFW56057.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
 gi|413916126|gb|AFW56058.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
 gi|413916127|gb|AFW56059.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
          Length = 276

 Score =  338 bits (867), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 160/253 (63%), Positives = 198/253 (78%)

Query: 20  FLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQ 79
           F S +S IRK I F +   C++TVQGR+L+SDDNGYVC A+S++P S CCP  G++FSCQ
Sbjct: 24  FPSPVSGIRKGIDFIQPIVCRSTVQGRHLISDDNGYVCSALSINPWSHCCPTTGDRFSCQ 83

Query: 80  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 139
           GC L  QCCNSYEYCVSCC+NP++  KE VL +K+AKP TAGTY +VFDFC GRCRH+S 
Sbjct: 84  GCELDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143

Query: 140 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCV 199
           SVVHENAY SDFHHCFS+  N+SG+  +    +L GI+V++GR  ESC  VCK  GQSCV
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVGRPRESCSLVCKVRGQSCV 203

Query: 200 LNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCD 259
            ++L +LN+CEI+QKYM CK GC  S+G DQPAEV D+AP++LNP ACLY ++   L+CD
Sbjct: 204 PSRLSVLNKCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCD 263

Query: 260 GSHRHTRRLCPCA 272
           GSH HTRRLCPCA
Sbjct: 264 GSHHHTRRLCPCA 276


>gi|413916124|gb|AFW56056.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
          Length = 258

 Score =  330 bits (847), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 156/245 (63%), Positives = 193/245 (78%)

Query: 28  RKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQC 87
           RK I F +   C++TVQGR+L+SDDNGYVC A+S++P S CCP  G++FSCQGC L  QC
Sbjct: 14  RKGIDFIQPIVCRSTVQGRHLISDDNGYVCSALSINPWSHCCPTTGDRFSCQGCELDLQC 73

Query: 88  CNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           CNSYEYCVSCC+NP++  KE VL +K+AKP TAGTY +VFDFC GRCRH+S SVVHENAY
Sbjct: 74  CNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSASVVHENAY 133

Query: 148 LSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLN 207
            SDFHHCFS+  N+SG+  +    +L GI+V++GR  ESC  VCK  GQSCV ++L +LN
Sbjct: 134 ASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVGRPRESCSLVCKVRGQSCVPSRLSVLN 193

Query: 208 QCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRR 267
           +CEI+QKYM CK GC  S+G DQPAEV D+AP++LNP ACLY ++   L+CDGSH HTRR
Sbjct: 194 KCEILQKYMRCKSGCFPSLGPDQPAEVVDEAPTSLNPGACLYMQMDERLTCDGSHHHTRR 253

Query: 268 LCPCA 272
           LCPCA
Sbjct: 254 LCPCA 258


>gi|7801689|emb|CAB91609.1| putative protein [Arabidopsis thaliana]
          Length = 273

 Score =  326 bits (836), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 158/228 (69%), Positives = 194/228 (85%)

Query: 15  LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE 74
           ++   F   +SAIRKDIGF E   C+TTVQGRYL+SDD G VCDA+S++ ++RCCP KGE
Sbjct: 16  IMGFLFPLRVSAIRKDIGFLEERSCRTTVQGRYLISDDEGNVCDALSLESRTRCCPWKGE 75

Query: 75  KFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRC 134
           +FSC GCN+LSQCCNSYE+CVSCC+NP++TL E+V+ VK+AKP T+GTY SVFDFCAGRC
Sbjct: 76  RFSCHGCNILSQCCNSYEFCVSCCLNPSQTLLEKVVKVKVAKPATSGTYKSVFDFCAGRC 135

Query: 135 RHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSS 194
           RHNSESVVHENAY S+FHHCFS+ SNASGA +TQ+E RL GI VI+G QG+SCD+VCKS 
Sbjct: 136 RHNSESVVHENAYHSEFHHCFSLTSNASGANLTQVETRLLGIDVIVGSQGDSCDAVCKSR 195

Query: 195 GQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNL 242
           GQ CV+NKL LLNQC+++++YM+CK  CLAS GADQPAEV +DAP +L
Sbjct: 196 GQLCVMNKLSLLNQCDVMKRYMNCKGSCLASAGADQPAEVVEDAPRDL 243


>gi|108864223|gb|ABA92588.2| expressed protein [Oryza sativa Japonica Group]
          Length = 251

 Score =  301 bits (771), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 142/246 (57%), Positives = 187/246 (76%), Gaps = 4/246 (1%)

Query: 1   MPKSRLNLLFLFFQLLTLQFLS----IISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYV 56
           M  + L LL L  +++   F +    +++ IRKDIG      C++TVQGR+L+SDDNGYV
Sbjct: 1   MATTALPLLHLLPRIVCAAFFACLPGLVAGIRKDIGLAAPIMCRSTVQGRHLISDDNGYV 60

Query: 57  CDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAK 116
           C ++S+DP SRCCP  G +FSCQGC L  QCC+SYEYCVSCC+NP+RT +  VL +K+A+
Sbjct: 61  CSSLSIDPWSRCCPRTGSRFSCQGCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVAR 120

Query: 117 PTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGI 176
           P T+GTY +VFDFC GRCRH+S SVVHENAY SDFHHCF +  N+SG+  +    RL GI
Sbjct: 121 PVTSGTYRNVFDFCMGRCRHSSASVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGI 180

Query: 177 SVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVAD 236
           ++ IGR+GESC SVC++ GQSCV ++L +LN+CEI+QKYM CK GC +++G DQPA+V D
Sbjct: 181 NISIGRRGESCSSVCRAKGQSCVPSRLSVLNKCEILQKYMRCKSGCFSNLGPDQPAQVVD 240

Query: 237 DAPSNL 242
           +APSNL
Sbjct: 241 EAPSNL 246


>gi|255637592|gb|ACU19121.1| unknown [Glycine max]
          Length = 224

 Score =  295 bits (756), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 138/185 (74%), Positives = 159/185 (85%)

Query: 15  LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE 74
           L+   F S I+AIRKD+GFQ    CKTTVQGRYLLSDDNGYVC+A S++ +SRCCP++GE
Sbjct: 24  LVIFHFSSTITAIRKDVGFQIIHPCKTTVQGRYLLSDDNGYVCNAASVNSRSRCCPQRGE 83

Query: 75  KFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRC 134
           KFSC GCNLLSQCCNSYEYCVSCC+NPA   KEQVL +K+AKPTTA TY SVFD+CAGRC
Sbjct: 84  KFSCHGCNLLSQCCNSYEYCVSCCLNPALISKEQVLKMKVAKPTTARTYSSVFDYCAGRC 143

Query: 135 RHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSS 194
           RH+SESVVHENAY+SDFHHCFS+ SN+SG   T  E RLSGI+V++GRQGESC+SVCKS 
Sbjct: 144 RHSSESVVHENAYISDFHHCFSLSSNSSGKNSTLTEARLSGINVVVGRQGESCNSVCKSR 203

Query: 195 GQSCV 199
           GQSCV
Sbjct: 204 GQSCV 208


>gi|147779400|emb|CAN76795.1| hypothetical protein VITISV_030117 [Vitis vinifera]
          Length = 258

 Score =  275 bits (704), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 145/243 (59%), Positives = 166/243 (68%), Gaps = 38/243 (15%)

Query: 28  RKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQC 87
           RKD    E   C+TTVQGRYL +DDN      + +D           +    GCNL+SQC
Sbjct: 8   RKDPNLLEKRICRTTVQGRYLRTDDNDNPQLGLDVD----------MRVIHIGCNLISQC 57

Query: 88  CNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           CNSYE+CVSCC+NPART KE  L VK+AKP TAG Y SVFDFC GRCRHNSESVVHENAY
Sbjct: 58  CNSYEFCVSCCLNPARTQKELALKVKMAKPVTAGAYASVFDFCTGRCRHNSESVVHENAY 117

Query: 148 LSDFHHCFSMPSNASGAG---------------VTQL-------------EGRLSGISVI 179
           LSDFHHCFS+PSN++G                 V QL             E RL+GI V+
Sbjct: 118 LSDFHHCFSLPSNSAGEAWSLIPNSWNGLFLKIVKQLMASMHAGGVDRNPELRLNGIHVV 177

Query: 180 IGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAP 239
           +GRQGESCDSVCKS+GQSCV +KL +LNQCEI+QKYMSCK  CLAS+GADQPAEV DDAP
Sbjct: 178 VGRQGESCDSVCKSNGQSCVPSKLSVLNQCEIMQKYMSCKGACLASIGADQPAEVVDDAP 237

Query: 240 SNL 242
            +L
Sbjct: 238 RHL 240


>gi|168051463|ref|XP_001778174.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670496|gb|EDQ57064.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 235

 Score =  265 bits (678), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 130/235 (55%), Positives = 162/235 (68%), Gaps = 1/235 (0%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCC 98
           C  TVQGRYLLSD  GYVC+ + ++  + CCP +GE+F+CQGCN+ SQCCNSYEYCVSCC
Sbjct: 1   CNNTVQGRYLLSDSRGYVCNIMEVNSLTGCCPREGEQFACQGCNIGSQCCNSYEYCVSCC 60

Query: 99  VNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMP 158
           +NP RT  E VL  K+A+  TAG++ S+FD+C+GRCRHNS+SVVHENAY+S+ HHCFS  
Sbjct: 61  LNPVRTTLEVVLKTKLARQPTAGSWTSLFDYCSGRCRHNSQSVVHENAYVSEDHHCFSAR 120

Query: 159 SNASGAGVT-QLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMS 217
           S   G  V    E  L+ + V +GR+G SCD+ C +   SC   KL  LN C  +QKY  
Sbjct: 121 STPPGVQVEMDAEDELADVFVYVGREGLSCDATCSAQKLSCREEKLAALNNCAELQKYNK 180

Query: 218 CKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
           C   C+ S G DQPAEV   AP +++P ACLY    S LSC G H  +RRLCPCA
Sbjct: 181 CHGLCIPSTGPDQPAEVVSTAPRHMHPGACLYNTQVSQLSCKGYHLFSRRLCPCA 235


>gi|297728217|ref|NP_001176472.1| Os11g0267100 [Oryza sativa Japonica Group]
 gi|255679983|dbj|BAH95200.1| Os11g0267100 [Oryza sativa Japonica Group]
          Length = 208

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 123/194 (63%), Positives = 157/194 (80%)

Query: 79  QGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNS 138
           +GC L  QCC+SYEYCVSCC+NP+RT +  VL +K+A+P T+GTY +VFDFC GRCRH+S
Sbjct: 15  RGCKLDLQCCDSYEYCVSCCLNPSRTKETDVLKLKVARPVTSGTYRNVFDFCMGRCRHSS 74

Query: 139 ESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSC 198
            SVVHENAY SDFHHCF +  N+SG+  +    RL GI++ IGR+GESC SVC++ GQSC
Sbjct: 75  ASVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGINISIGRRGESCSSVCRAKGQSC 134

Query: 199 VLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSC 258
           V ++L +LN+CEI+QKYM CK GC +++G DQPA+V D+APSNLNP ACLY ++   L+C
Sbjct: 135 VPSRLSVLNKCEILQKYMRCKSGCFSNLGPDQPAQVVDEAPSNLNPGACLYMQMDERLTC 194

Query: 259 DGSHRHTRRLCPCA 272
           DGSH+HTRRLCPCA
Sbjct: 195 DGSHQHTRRLCPCA 208


>gi|302788558|ref|XP_002976048.1| hypothetical protein SELMODRAFT_415982 [Selaginella moellendorffii]
 gi|300156324|gb|EFJ22953.1| hypothetical protein SELMODRAFT_415982 [Selaginella moellendorffii]
          Length = 263

 Score =  252 bits (644), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 118/235 (50%), Positives = 163/235 (69%), Gaps = 1/235 (0%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCC 98
           C  TVQG +LL+D  GYVC   ++D    CCPE+G++FSC GCNL+SQCCN++EYCVSCC
Sbjct: 29  CNNTVQGPHLLADSQGYVCSVRNIDFGRGCCPEEGQQFSCSGCNLVSQCCNTFEYCVSCC 88

Query: 99  VNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMP 158
           ++P  T KE  L+ K A+  TA  + SVF++C GRCRHNS SVVHENAY S++HHCF   
Sbjct: 89  LDPIHTSKEFALSSKSARQLTAANFVSVFEYCTGRCRHNSGSVVHENAYTSEYHHCFFTQ 148

Query: 159 SNASGAGVTQLE-GRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMS 217
           +N S      +E G  +  S+I GR+G+SCD  C+  G+ C  + L L+N+C ++QK M+
Sbjct: 149 TNLSSGRDHGVEAGENTDYSLITGRRGQSCDEACRQRGKKCKASILPLINECSVLQKSMN 208

Query: 218 CKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
           CK  C++  G D P+EV   AP +L+P AC++    ++LSC G+HR+T+RLCPC 
Sbjct: 209 CKGACISGSGPDLPSEVVSSAPRHLHPGACVFNSKNTLLSCQGAHRYTKRLCPCG 263


>gi|302769810|ref|XP_002968324.1| hypothetical protein SELMODRAFT_409585 [Selaginella moellendorffii]
 gi|300163968|gb|EFJ30578.1| hypothetical protein SELMODRAFT_409585 [Selaginella moellendorffii]
          Length = 263

 Score =  252 bits (644), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 118/235 (50%), Positives = 163/235 (69%), Gaps = 1/235 (0%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCC 98
           C  TVQG +LL+D  GYVC   ++D    CCPE+G++FSC GCNL+SQCCN++EYCVSCC
Sbjct: 29  CNNTVQGPHLLADSQGYVCSVRNIDFGRGCCPEEGQQFSCSGCNLVSQCCNTFEYCVSCC 88

Query: 99  VNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMP 158
           ++P  T KE  L+ K A+  TA  + SVF++C GRCRHNS SVVHENAY S++HHCF   
Sbjct: 89  LDPIHTSKEFALSSKSARQLTAANFVSVFEYCTGRCRHNSGSVVHENAYTSEYHHCFFTQ 148

Query: 159 SNASGAGVTQLE-GRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMS 217
           +N S      +E G  +  S+I GR+G+SCD  C+  G+ C  + L L+N+C ++QK M+
Sbjct: 149 TNLSSGRDHGVEAGENTDYSLITGRRGQSCDEACRQRGKKCKASILPLINECSVLQKSMN 208

Query: 218 CKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
           CK  C++  G D P+EV   AP +L+P AC++    ++LSC G+HR+T+RLCPC 
Sbjct: 209 CKGACVSGSGPDLPSEVVSSAPRHLHPGACVFNSKNTLLSCQGAHRYTKRLCPCG 263


>gi|449484814|ref|XP_004156988.1| PREDICTED: uncharacterized LOC101231289 [Cucumis sativus]
          Length = 154

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 117/151 (77%), Positives = 131/151 (86%)

Query: 122 TYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIG 181
           TY SVFDFC GRCRHNSESVVHENAY S FHHCFS+PS +SG   TQLE RLSGI+VIIG
Sbjct: 4   TYASVFDFCVGRCRHNSESVVHENAYHSAFHHCFSLPSYSSGDNSTQLEFRLSGINVIIG 63

Query: 182 RQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSN 241
           RQGESCDSVCKS+GQSCV NK+++LN C+I+QKY+SCK GCLAS+G DQPAEVA DAP  
Sbjct: 64  RQGESCDSVCKSNGQSCVPNKILVLNHCDIMQKYLSCKGGCLASVGTDQPAEVAYDAPKE 123

Query: 242 LNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
           LNP ACLY+  QSM+SCDGSH HT+RLCPCA
Sbjct: 124 LNPGACLYSSAQSMVSCDGSHPHTKRLCPCA 154


>gi|413916128|gb|AFW56060.1| hypothetical protein ZEAMMB73_821816 [Zea mays]
          Length = 186

 Score =  214 bits (544), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 101/162 (62%), Positives = 127/162 (78%)

Query: 20  FLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQ 79
           F S +S IRK I F +   C++TVQGR+L+SDDNGYVC A+S++P S CCP  G++FSCQ
Sbjct: 24  FPSPVSGIRKGIDFIQPIVCRSTVQGRHLISDDNGYVCSALSINPWSHCCPTTGDRFSCQ 83

Query: 80  GCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSE 139
           GC L  QCCNSYEYCVSCC+NP++  KE VL +K+AKP TAGTY +VFDFC GRCRH+S 
Sbjct: 84  GCELDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTAGTYTNVFDFCMGRCRHSSA 143

Query: 140 SVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIG 181
           SVVHENAY SDFHHCFS+  N+SG+  +    +L GI+V++G
Sbjct: 144 SVVHENAYASDFHHCFSVQQNSSGSTESSSVSKLLGINVVVG 185


>gi|62734222|gb|AAX96331.1| hypothetical protein LOC_Os11g16560 [Oryza sativa Japonica Group]
          Length = 216

 Score =  207 bits (527), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 112/246 (45%), Positives = 153/246 (62%), Gaps = 39/246 (15%)

Query: 1   MPKSRLNLLFLFFQLLTLQFLS----IISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYV 56
           M  + L LL L  +++   F +    +++ IRKDIG      C++TVQGR+L+SDDNG  
Sbjct: 1   MATTALPLLHLLPRIVCAAFFACLPGLVAGIRKDIGLAAPIMCRSTVQGRHLISDDNGRT 60

Query: 57  CDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAK 116
                       CP                   ++ YC+        T +  VL +K+A+
Sbjct: 61  P-----------CPSH----------------FAHIYCL--------TKETDVLKLKVAR 85

Query: 117 PTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGI 176
           P T+GTY +VFDFC GRCRH+S SVVHENAY SDFHHCF +  N+SG+  +    RL GI
Sbjct: 86  PVTSGTYRNVFDFCMGRCRHSSASVVHENAYASDFHHCFLLQQNSSGSADSGSGPRLDGI 145

Query: 177 SVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVAD 236
           ++ IGR+GESC SVC++ GQSCV ++L +LN+CEI+QKYM CK GC +++G DQPA+V D
Sbjct: 146 NISIGRRGESCSSVCRAKGQSCVPSRLSVLNKCEILQKYMRCKSGCFSNLGPDQPAQVVD 205

Query: 237 DAPSNL 242
           +APSNL
Sbjct: 206 EAPSNL 211


>gi|384245502|gb|EIE18996.1| hypothetical protein COCSUDRAFT_31544 [Coccomyxa subellipsoidea
           C-169]
          Length = 278

 Score =  167 bits (422), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 87/235 (37%), Positives = 129/235 (54%), Gaps = 4/235 (1%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCC 98
           C  T+QGR+ ++D  G +C   S+ P + CC   G+K+SC  CN+   CC  YE+CVSCC
Sbjct: 42  CNNTIQGRWQVADSQGMLCSRQSLRPDNGCC-SAGDKYSCATCNMHDSCCADYEHCVSCC 100

Query: 99  VNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMP 158
            +P   L +     +       G + SVF++C  +CR +S+S VHENAYLS FHHCFS  
Sbjct: 101 QDPQHGLPDLSTVHRAPDRPETGLWPSVFEYCRNKCRTSSKSTVHENAYLSPFHHCFSS- 159

Query: 159 SNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSC 218
           S                ++V++G  G+SC+++C+S+G+ C    L  LN C +++++ +C
Sbjct: 160 SGKPTTAAPPTPPIPKSVTVVLGSAGQSCNALCESTGRRCAQQHLASLNNCNVLREHYAC 219

Query: 219 KRGCLASMG-ADQPAEVADDAPSNLNPTACLYTRIQ-SMLSCDGSHRHTRRLCPC 271
           + GC    G AD P  V   AP    PT CL    + +  SC+ +     RLCPC
Sbjct: 220 EAGCEPDSGTADAPIYVNASAPKQQRPTICLTMDSKVNQFSCEATSASNHRLCPC 274


>gi|242070783|ref|XP_002450668.1| hypothetical protein SORBIDRAFT_05g009356 [Sorghum bicolor]
 gi|241936511|gb|EES09656.1| hypothetical protein SORBIDRAFT_05g009356 [Sorghum bicolor]
          Length = 111

 Score =  163 bits (413), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 74/111 (66%), Positives = 90/111 (81%)

Query: 28  RKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQC 87
           RKDI F E   C++TVQGR+L+SDDNGYVC A+S++P S CCP  G++FSCQGC L  QC
Sbjct: 1   RKDICFIEPIVCRSTVQGRHLISDDNGYVCSALSINPWSHCCPTTGDRFSCQGCKLDLQC 60

Query: 88  CNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNS 138
           CNSYEYCVSCC+NP++  KE VL +K+AKP T GTY +VFDFC GRCRH+S
Sbjct: 61  CNSYEYCVSCCLNPSKIKKEDVLKLKVAKPVTVGTYTNVFDFCMGRCRHSS 111


>gi|159475585|ref|XP_001695899.1| predicted membrane-anchored protein [Chlamydomonas reinhardtii]
 gi|158275459|gb|EDP01236.1| predicted membrane-anchored protein [Chlamydomonas reinhardtii]
          Length = 269

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 86/278 (30%), Positives = 141/278 (50%), Gaps = 30/278 (10%)

Query: 12  FFQLLTLQFLSIISAI---RKDIGFQETP----FCKTTVQGRYLLSDDNGYVCDAVSMDP 64
            +  L +  ++ +S+I    + +  ++ P     C+ TVQG  +++DD+G VC    +  
Sbjct: 3   LYGALVVALMASLSSIVLGERRLFIRDLPEPKTRCRNTVQGLNMVTDDHGVVCKRSDLSY 62

Query: 65  QSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNP---ARTLKEQVLNVKIAKPTTAG 121
            + CC   G +  C  C++  +CC+ YE CVSCC+ P   A ++ +Q L  +  +   +G
Sbjct: 63  STGCC-TSGNQHDCALCDMRDRCCSEYESCVSCCLAPQHNAVSIAKQAL--RSPRHKDSG 119

Query: 122 TYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFS-----MPSNASGAGVTQLEGRLSGI 176
            +G  F++C G CR +S S  HENAY+S  HHCFS     M S+   AGV      + G+
Sbjct: 120 FWGDPFEYCKGICRTHSRSTAHENAYISSRHHCFSQLGRPMLSDPLPAGV------MDGV 173

Query: 177 SVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGC--LASMGADQPAEV 234
            V+ G++  +CD VC +  + C  + L  L+ C+ +++   C+ GC  +A +G   P+ V
Sbjct: 174 EVVTGQRNANCDDVCAAKQKKCSADHLRWLSSCDRLREQFGCEAGCEVVAGLG---PSYV 230

Query: 235 ADDAPSNLNPTACLYTRIQ-SMLSCDGSHRHTRRLCPC 271
             +AP    P  C     +   LSC         LCPC
Sbjct: 231 DGNAPKPARPAMCFAQPAEGGKLSCSAREEQHLMLCPC 268


>gi|428177708|gb|EKX46586.1| hypothetical protein GUITHDRAFT_107373 [Guillardia theta CCMP2712]
          Length = 233

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 22/230 (9%)

Query: 15  LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG- 73
           L+ +  + + + +    G      C+ TVQG   + DD GY C    +D Q+ CC  K  
Sbjct: 5   LILMAMMGLAAGLEPSSG--SGSVCRNTVQGPKHVVDDQGYFCSVDQVDYQTGCCDTKKV 62

Query: 74  --EKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV----- 126
              ++SC+ C   ++CC+ YE CVSCC+NP+    +++ N  +A      T+        
Sbjct: 63  TEPRWSCRSCEAETKCCDHYENCVSCCLNPS---NDELRNTAMAASRDKKTFADAVKRKD 119

Query: 127 -FDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGE 185
            F+FC   CR +S+S VH NAY+S+   C S PS  S           + ++V +G+QGE
Sbjct: 120 SFEFCRASCRTSSKSTVHGNAYISEKRFCLS-PSILSNT-------LPADVAVTVGKQGE 171

Query: 186 SCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVA 235
           SC  VC S G+ C    +  +N CE++QK   CK  C  + G DQPA V+
Sbjct: 172 SCAEVCSSVGKRCDTRYITSINTCEVLQKNFPCKNTCEKNYGHDQPAYVS 221


>gi|242051320|ref|XP_002463404.1| hypothetical protein SORBIDRAFT_02g043236 [Sorghum bicolor]
 gi|241926781|gb|EER99925.1| hypothetical protein SORBIDRAFT_02g043236 [Sorghum bicolor]
          Length = 82

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 54/83 (65%), Positives = 66/83 (79%), Gaps = 1/83 (1%)

Query: 56  VCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIA 115
           VC A+S++P S CCP   ++FSC+GC L  QCCNSYEYCVSCC+NP++  KE VL +K A
Sbjct: 1   VCSALSINPWSHCCPTL-DRFSCRGCKLDLQCCNSYEYCVSCCLNPSKIKKEDVLKLKEA 59

Query: 116 KPTTAGTYGSVFDFCAGRCRHNS 138
           KP TAGTY +VFDFC GRCRH+S
Sbjct: 60  KPVTAGTYTNVFDFCMGRCRHSS 82


>gi|307110379|gb|EFN58615.1| hypothetical protein CHLNCDRAFT_17323, partial [Chlorella
           variabilis]
          Length = 225

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 73/234 (31%), Positives = 112/234 (47%), Gaps = 17/234 (7%)

Query: 46  RYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTL 105
           R+ ++DD G VC   S++  + CC   GE  SC  C    +CC+ YE+CVSCC+ P    
Sbjct: 1   RWYIADDQGMVCSRDSLNYATGCC-TGGELHSCTSCEAKDKCCSRYEHCVSCCLRPENKP 59

Query: 106 KEQVLNVKIA--KPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCFSMPSNASG 163
            E++ ++     KP T G + + F++C   CR  + S  HENA++ D   CFS       
Sbjct: 60  GEKMQSIFRGRNKPET-GHWSTPFEYCQAVCRTTARSTQHENAFILDRRFCFSKVGFVPP 118

Query: 164 AGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQ-KYMSCKRGC 222
           A     E      S++ G  G+SCD  C +   +C     + +N C  ++ K+M C  G 
Sbjct: 119 APALPKEA-----SLVAGAAGQSCDVACAAKQMTCRAEWFLSINDCNSLRAKFMCC--GS 171

Query: 223 LASMGADQPAEVADDAPSNLNPTAC-----LYTRIQSMLSCDGSHRHTRRLCPC 271
            A+   + P  V   AP +  P  C     L  ++    +C  ++   RRLCPC
Sbjct: 172 CAADQTEYPGYVEGSAPKHQQPAFCAFLPPLAAQLLPSFNCSHANGSVRRLCPC 225


>gi|358345744|ref|XP_003636935.1| hypothetical protein MTR_065s0010 [Medicago truncatula]
 gi|355502870|gb|AES84073.1| hypothetical protein MTR_065s0010 [Medicago truncatula]
          Length = 131

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 51/75 (68%), Positives = 56/75 (74%)

Query: 197 SCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSML 256
           S VL   I L     IQKYMSCK GCLAS+G DQPAEV  DAP  +NP +CLYT  QS+L
Sbjct: 28  SYVLTVSIDLRYVGSIQKYMSCKGGCLASVGTDQPAEVVYDAPKYMNPESCLYTETQSIL 87

Query: 257 SCDGSHRHTRRLCPC 271
           SCDGSH+HTRRLCPC
Sbjct: 88  SCDGSHQHTRRLCPC 102


>gi|328719433|ref|XP_003246762.1| PREDICTED: UPF0454 protein C12orf49 homolog [Acyrthosiphon pisum]
          Length = 193

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 49/122 (40%), Positives = 69/122 (56%), Gaps = 8/122 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSCC 98
           C+ TVQGR  ++DD GY+C    +   + CC    +++ C  CN    CC  YE+CVSCC
Sbjct: 70  CRNTVQGREYIADDKGYICQRDDLLANN-CCGTTEKRYICDTCNA-DGCCVLYEHCVSCC 127

Query: 99  VNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAYLSDFHHC 154
           +NP  R L E VL    A  T    + ++   F+ C  +CR +S+SV HENAYL    HC
Sbjct: 128 LNPNKRKLLETVLGK--ASETFRVLFSAINNHFELCLAKCRTSSQSVQHENAYLKSNKHC 185

Query: 155 FS 156
           ++
Sbjct: 186 YT 187


>gi|195393082|ref|XP_002055183.1| GJ19227 [Drosophila virilis]
 gi|194149693|gb|EDW65384.1| GJ19227 [Drosophila virilis]
          Length = 215

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 52/118 (44%), Positives = 66/118 (55%), Gaps = 9/118 (7%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
           C+ +VQG  LL+D+ G+VC    M     C  E      +SC  CN  S CC  YEYCVS
Sbjct: 76  CRNSVQGSQLLADERGFVCLREQMLANGCCNVELAGISYYSCHSCNSTSHCCGVYEYCVS 135

Query: 97  CCVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAYLSD 150
           CC++P  R+L EQVL    A+ T    Y SV   F+ C  +CR NS SV HEN Y ++
Sbjct: 136 CCLHPNKRSLLEQVLQ---AQDTQKYIYASVVDHFELCLVKCRTNSHSVEHENKYRNE 190


>gi|195130989|ref|XP_002009933.1| GI15640 [Drosophila mojavensis]
 gi|193908383|gb|EDW07250.1| GI15640 [Drosophila mojavensis]
          Length = 216

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 52/118 (44%), Positives = 66/118 (55%), Gaps = 9/118 (7%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
           C+ +VQG+ LL+D+ G+VC    M     C  E      +SC  CN  S CC  YEYCVS
Sbjct: 77  CRNSVQGQQLLADERGFVCLREQMLVNGCCNVELAGISYYSCYSCNTTSHCCGVYEYCVS 136

Query: 97  CCVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAYLSD 150
           CC++P  R+L EQVL    A  T    Y SV   F+ C  +CR NS SV HEN Y ++
Sbjct: 137 CCLHPNKRSLLEQVLQ---AHNTQKYIYASVVDHFELCLVKCRTNSHSVEHENKYRNE 191


>gi|194768977|ref|XP_001966587.1| GF22253 [Drosophila ananassae]
 gi|190617351|gb|EDV32875.1| GF22253 [Drosophila ananassae]
          Length = 217

 Score = 83.6 bits (205), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 51/134 (38%), Positives = 70/134 (52%), Gaps = 5/134 (3%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
           C+ +VQGR LL+D+ G+VC    + P   C  E      +SC+ CN  + CC  YEYCVS
Sbjct: 78  CRNSVQGRQLLADERGFVCRREELLPSGCCNTELPGIAYYSCRSCNTTTHCCAIYEYCVS 137

Query: 97  CCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLS-DFHHC 154
           CC++P +  L EQVL    +           F+ C  +CR NS SV HEN Y      HC
Sbjct: 138 CCLHPEKQPLLEQVLRSANSPKYIFTKVADHFELCLVKCRTNSHSVQHENQYRDPSAKHC 197

Query: 155 FSM-PSNASGAGVT 167
           + +  ++ S   VT
Sbjct: 198 YGLTEAHESQKNVT 211


>gi|357603409|gb|EHJ63752.1| hypothetical protein KGM_18059 [Danaus plexippus]
          Length = 198

 Score = 83.6 bits (205), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 57/186 (30%), Positives = 96/186 (51%), Gaps = 29/186 (15%)

Query: 3   KSRLNLLFLFFQLLTLQFLSII------SAIRKDIGFQETPF---------------CKT 41
           + RL L  +F   LT   +S +      + + +D+  +  PF               C+ 
Sbjct: 11  RRRLVLGIIFASSLTYCIVSFLKEGSNKNMVYQDVSIEHKPFVWRTLQEHNETQDTLCRN 70

Query: 42  TVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCVSCC 98
           ++QG+ LL DD GYVC  + +  ++ CC    +  +++SC+ C   + CC  YEYCVSCC
Sbjct: 71  SIQGKSLLVDDRGYVCQRMDL-AKNGCCNIEAKHTQRYSCKSCKN-NNCCVIYEYCVSCC 128

Query: 99  VNPA-RTLKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFH-HCF 155
           ++P+ R+L E VL+ +   +     +    ++ C  +CR +S SV+HEN+Y    H HCF
Sbjct: 129 LDPSKRSLLELVLSKLSAEEKVLFRSISDDYELCLTKCRTSSHSVLHENSYKDPSHKHCF 188

Query: 156 SMPSNA 161
              +N+
Sbjct: 189 GDRTNS 194


>gi|449529758|ref|XP_004171865.1| PREDICTED: uncharacterized protein LOC101231289 [Cucumis sativus]
          Length = 144

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 38/55 (69%), Positives = 44/55 (80%)

Query: 22  SIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKF 76
           S IS+IR +IG      C++TVQGRYLLSDDNGYVCDA+S+DP SRCC  KGEKF
Sbjct: 58  STISSIRTNIGLAVNSSCRSTVQGRYLLSDDNGYVCDALSVDPLSRCCHGKGEKF 112


>gi|195044419|ref|XP_001991819.1| GH11862 [Drosophila grimshawi]
 gi|193901577|gb|EDW00444.1| GH11862 [Drosophila grimshawi]
          Length = 214

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 52/124 (41%), Positives = 71/124 (57%), Gaps = 10/124 (8%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
           C+ +VQGR LL D++G+VC    M     C  E      +SC+ CN+ S CC  YE+C+S
Sbjct: 77  CRNSVQGRNLLVDEHGFVCKRDQMLVNGCCNVELPGISYYSCRSCNVTSHCCAIYEHCIS 136

Query: 97  CCVNPAR-TLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAYLS-DF 151
           CC++P + +L EQVL    A  T    Y SV   F+ C  +CR NS SV HEN Y + D 
Sbjct: 137 CCLHPNKQSLLEQVLQ---ASDTQKYIYASVVDHFELCLVKCRTNSHSVEHENKYRNEDA 193

Query: 152 HHCF 155
            +C+
Sbjct: 194 KYCY 197


>gi|308805236|ref|XP_003079930.1| unnamed protein product [Ostreococcus tauri]
 gi|116058387|emb|CAL53576.1| unnamed protein product [Ostreococcus tauri]
          Length = 189

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 4/127 (3%)

Query: 38  FCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNLLSQCCNSYEYCVSC 97
            C+ +      ++DD G  C+   +D  S CCP      SC+ C+   +CC  YE CVSC
Sbjct: 41  LCRVSHASVSRVADDRGRSCEWYDVDTVSGCCPTTSATHSCEKCDAAVECCERYETCVSC 100

Query: 98  CVNPAR--TLK-EQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHC 154
           C+ PAR  TL+ E   + +       G +   FDFCA RCR      +HEN Y     HC
Sbjct: 101 CLAPARRATLETEMRTHARGRNQVVTGFFDDPFDFCANRCRTQPSVTLHENEYAYSTKHC 160

Query: 155 F-SMPSN 160
           F   P N
Sbjct: 161 FGDYPKN 167


>gi|405975633|gb|EKC40187.1| UPF0454 protein C12orf49-like protein [Crassostrea gigas]
          Length = 185

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 46/124 (37%), Positives = 73/124 (58%), Gaps = 6/124 (4%)

Query: 38  FCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQG-CNLLSQ--CCNSYEYC 94
            C+ +VQG+ L+ D+ GYVC+  S+D +  C  ++G + + +  C+  S+  CC  YEYC
Sbjct: 50  ICRNSVQGKSLIVDERGYVCNRTSVDARGCCHVKRGSRLTSKHRCDTCSKNACCVVYEYC 109

Query: 95  VSCCVNP-ARTLKEQVL-NVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDF- 151
           V+CC+ P  R L +++L N +       G+    F+ C  +CR +S SV HEN+Y + F 
Sbjct: 110 VACCLKPDKRPLLQKILTNAREGLDIALGSVSDHFELCLAKCRTSSSSVQHENSYRNAFA 169

Query: 152 HHCF 155
            HCF
Sbjct: 170 KHCF 173


>gi|198471721|ref|XP_001355702.2| GA13867 [Drosophila pseudoobscura pseudoobscura]
 gi|198146047|gb|EAL32761.2| GA13867 [Drosophila pseudoobscura pseudoobscura]
          Length = 216

 Score = 80.9 bits (198), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 50/125 (40%), Positives = 67/125 (53%), Gaps = 8/125 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC----PEKGEKFSCQGCNLLSQCCNSYEYC 94
           C+ +VQGR LL+D+ G+VC      P S CC    P  G  +SC+ CN  + CC  YEYC
Sbjct: 77  CRNSVQGRQLLTDERGFVCLREQALP-SGCCNLEMPGIG-YYSCRTCNATTSCCRLYEYC 134

Query: 95  VSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLS-DFH 152
           VSCC++P +  L E+VL           +    F+ C  +CR NS SV HEN Y      
Sbjct: 135 VSCCLHPDKQPLLERVLQSAHTPKYIYASVADHFELCLVKCRTNSHSVEHENKYRDGKAK 194

Query: 153 HCFSM 157
           HC+ +
Sbjct: 195 HCYGI 199


>gi|328869989|gb|EGG18364.1| hypothetical protein DFA_03858 [Dictyostelium fasciculatum]
          Length = 1057

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 42/122 (34%), Positives = 62/122 (50%), Gaps = 4/122 (3%)

Query: 36   TPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP-EKGEKFSCQGCNLLSQCCNSYEYC 94
            + +CK TVQ    + DD G++C    +D  S CC     +++SC GC+   +CC  Y YC
Sbjct: 925  STYCKYTVQSSKYVVDDQGHICLRSELD--SGCCNINNFDRYSCNGCSSKYRCCEEYTYC 982

Query: 95   VSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYL-SDFHH 153
            VSCC+   +   E VL   +   +  G     FD C   CR +S SV+H+  Y   D  +
Sbjct: 983  VSCCLGARQETLEAVLVQMVEGVSPIGEPHDQFDLCQLVCRTSSHSVIHQREYKHPDTKY 1042

Query: 154  CF 155
            C+
Sbjct: 1043 CY 1044


>gi|195164215|ref|XP_002022944.1| GL16447 [Drosophila persimilis]
 gi|194105006|gb|EDW27049.1| GL16447 [Drosophila persimilis]
          Length = 216

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 50/125 (40%), Positives = 66/125 (52%), Gaps = 8/125 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC----PEKGEKFSCQGCNLLSQCCNSYEYC 94
           C+ +VQGR LL+D+ G+VC      P S CC    P  G  +SC+ CN  + CC  YEYC
Sbjct: 77  CRNSVQGRQLLTDERGFVCLREQALP-SGCCNLEMPGIG-YYSCRTCNATTNCCRLYEYC 134

Query: 95  VSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLS-DFH 152
           VSCC++P +  L E VL           +    F+ C  +CR NS SV HEN Y      
Sbjct: 135 VSCCLHPDKQPLLELVLQSAYTPKYIYASVADHFELCLVKCRTNSHSVEHENKYRDGKAK 194

Query: 153 HCFSM 157
           HC+ +
Sbjct: 195 HCYGI 199


>gi|194894481|ref|XP_001978075.1| GG19394 [Drosophila erecta]
 gi|190649724|gb|EDV47002.1| GG19394 [Drosophila erecta]
          Length = 217

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 57/152 (37%), Positives = 79/152 (51%), Gaps = 15/152 (9%)

Query: 15  LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC----P 70
           L T Q L+    + +  G  E   C+ +VQGR  L+D+ G+VC    +     CC    P
Sbjct: 54  LWTQQLLAPEEQLNRTRGDPEA-LCRNSVQGRQWLADERGFVCRREQVLTNG-CCNLELP 111

Query: 71  EKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV--- 126
             G  +SC+ CN  + CC  YEYCVSCC++P +  L E+VL   +A  T    + SV   
Sbjct: 112 GIG-YYSCRTCNTSTHCCAVYEYCVSCCLHPGQQPLLERVL---LAPNTPKYIFASVADH 167

Query: 127 FDFCAGRCRHNSESVVHENAYLSD-FHHCFSM 157
           F+ C  +CR NS SV HEN Y      HC+ +
Sbjct: 168 FELCLVKCRTNSHSVEHENKYRDPAAKHCYGL 199


>gi|242012077|ref|XP_002426767.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212510949|gb|EEB14029.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 203

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/115 (40%), Positives = 67/115 (58%), Gaps = 11/115 (9%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ +VQG+ L+ DD GYVC    +   S CC    E  +KF+C  C   + CC+ YEYC+
Sbjct: 75  CRNSVQGKVLIVDDRGYVCSRSDL-LSSGCCNTGSENTKKFACDTCKD-NGCCSIYEYCI 132

Query: 96  SCCVNPARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
           SCC++P    K+ VL   + K +    + SV   F+ C  +CR NS+SV HEN+Y
Sbjct: 133 SCCLHPD---KKGVLQNVLGKESENVLFASVSDHFELCLAKCRTNSQSVHHENSY 184


>gi|195478878|ref|XP_002100681.1| GE16041 [Drosophila yakuba]
 gi|194188205|gb|EDX01789.1| GE16041 [Drosophila yakuba]
          Length = 217

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 57/152 (37%), Positives = 79/152 (51%), Gaps = 15/152 (9%)

Query: 15  LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC----P 70
           L T Q L+    + +  G  E   C+ +VQGR  L+D+ G+VC    +     CC    P
Sbjct: 54  LWTQQLLAPEEQLNRTRGDPEA-RCRNSVQGRQWLADERGFVCRREEVLTNG-CCNLELP 111

Query: 71  EKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSV--- 126
             G  +SC+ CN  + CC  YEYCVSCC++P +  L E+VL   +A  T    + SV   
Sbjct: 112 GIG-YYSCRTCNTSTHCCGVYEYCVSCCLHPGQQPLLERVL---LAPNTPKYIFASVTDH 167

Query: 127 FDFCAGRCRHNSESVVHENAYLSD-FHHCFSM 157
           F+ C  +CR NS SV HEN Y      HC+ +
Sbjct: 168 FELCLVKCRTNSHSVEHENKYRDPAAKHCYGL 199


>gi|28571353|ref|NP_788909.1| CG15643 [Drosophila melanogaster]
 gi|7293087|gb|AAF48472.1| CG15643 [Drosophila melanogaster]
 gi|17944302|gb|AAL48044.1| RE11751p [Drosophila melanogaster]
 gi|220947868|gb|ACL86477.1| CG15643-PA [synthetic construct]
 gi|220957174|gb|ACL91130.1| CG15643-PA [synthetic construct]
          Length = 212

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 53/149 (35%), Positives = 74/149 (49%), Gaps = 9/149 (6%)

Query: 15  LLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC----P 70
           L T Q L+    + +  G  E   C+ +VQGR  L+D+ G+VC    +     CC    P
Sbjct: 54  LWTQQLLAPEEQLNRTRGDPEA-RCRNSVQGRQWLADERGFVCRREEVLTNG-CCNLELP 111

Query: 71  EKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDF 129
             G  +SC+ CN  + CC  YEYCVSCC++P +  L E+VL           +    F+ 
Sbjct: 112 GIG-YYSCRTCNTSTHCCGVYEYCVSCCLHPGQQPLLERVLQAPNTPKYIFASVTDHFEL 170

Query: 130 CAGRCRHNSESVVHENAYLSD-FHHCFSM 157
           C  +CR NS SV HEN Y      HC+ +
Sbjct: 171 CLVKCRTNSHSVEHENKYRDPAAKHCYGL 199


>gi|347964633|ref|XP_316825.5| AGAP000855-PA [Anopheles gambiae str. PEST]
 gi|333469436|gb|EAA12112.5| AGAP000855-PA [Anopheles gambiae str. PEST]
          Length = 243

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 57/167 (34%), Positives = 81/167 (48%), Gaps = 19/167 (11%)

Query: 16  LTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC------ 69
           L  + L   +A   D    +   C+ ++QGR LL DD GYVC    +     CC      
Sbjct: 69  LGAEVLPAGTANETDDQPAQPASCRNSIQGRSLLVDDRGYVCPRADLLANG-CCSAGSGG 127

Query: 70  ----PEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNP-ARTLKEQVLNVKIAKPTTAGTYG 124
                + G  F C+ C L ++CC  YEYCV+CC+NP  R + E+VL    A       Y 
Sbjct: 128 TTANADAGRLFPCRTC-LPNRCCAVYEYCVACCLNPDKRPILEEVL--AKANGRQVALYA 184

Query: 125 SV---FDFCAGRCRHNSESVVHENAYLS-DFHHCFSMPSNASGAGVT 167
            V   F+ C  +CR NS+SV +EN Y + +  HC+   S A+ A  +
Sbjct: 185 EVTDQFELCLTKCRTNSQSVQNENKYRNPEQKHCYGEMSEAAWASAS 231


>gi|156380036|ref|XP_001631761.1| predicted protein [Nematostella vectensis]
 gi|156218806|gb|EDO39698.1| predicted protein [Nematostella vectensis]
          Length = 135

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 47/127 (37%), Positives = 73/127 (57%), Gaps = 8/127 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE---KFSCQGCNLLSQCCNSYEYCV 95
           C+ +VQG  L+ D+ GY+C    +   S CC   GE   +++C+ C + + CC+ YE CV
Sbjct: 2   CRNSVQGPSLIVDERGYLCSRKDLSA-SGCCHSDGETTHRYNCESCQV-NNCCSIYENCV 59

Query: 96  SCCVNPA-RTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNSESVVHENAYLSD-FH 152
           SCC++P  + L  +VLNV    P     +    F+ C  +CR +S+SV HEN+Y  + + 
Sbjct: 60  SCCLDPKQKELLREVLNVWRTAPNVILKSITDQFELCLTKCRTSSKSVWHENSYKDNKYK 119

Query: 153 HCFSMPS 159
           HCF + S
Sbjct: 120 HCFGLTS 126


>gi|195432368|ref|XP_002064195.1| GK19833 [Drosophila willistoni]
 gi|194160280|gb|EDW75181.1| GK19833 [Drosophila willistoni]
          Length = 215

 Score = 77.4 bits (189), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 45/112 (40%), Positives = 59/112 (52%), Gaps = 3/112 (2%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE--KFSCQGCNLLSQCCNSYEYCVS 96
           C+ +VQG   L+D+ G+VC    + P   C  E  +   +SC  CN  S CC  YEYCVS
Sbjct: 76  CRNSVQGIQWLADELGFVCQREQILPSGCCNSELPDVSYYSCHTCNQTSMCCEIYEYCVS 135

Query: 97  CCVNP-ARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           CC++P  R L E +L    A+          F+ C  +CR NS SV HEN Y
Sbjct: 136 CCLHPDKRPLLETILETVNAQRYIYAQVEDHFELCLVKCRTNSHSVEHENKY 187


>gi|195355329|ref|XP_002044144.1| GM22548 [Drosophila sechellia]
 gi|194129433|gb|EDW51476.1| GM22548 [Drosophila sechellia]
          Length = 197

 Score = 77.0 bits (188), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 48/125 (38%), Positives = 65/125 (52%), Gaps = 8/125 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC----PEKGEKFSCQGCNLLSQCCNSYEYC 94
           C+ +VQGR  L+D+ G+VC    +     CC    P  G  +SC+ CN  + CC  YEYC
Sbjct: 63  CRNSVQGRQWLADERGFVCRREEVLTNG-CCNLELPGIG-YYSCRTCNTSTHCCGVYEYC 120

Query: 95  VSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSD-FH 152
           VSCC++P +  L E+VL           +    F+ C  +CR NS SV HEN Y      
Sbjct: 121 VSCCLHPGQQPLLERVLQAPNTPKYIFASVTDHFELCLVKCRTNSHSVEHENKYRDPAAK 180

Query: 153 HCFSM 157
           HCF +
Sbjct: 181 HCFGL 185


>gi|391333374|ref|XP_003741090.1| PREDICTED: UPF0454 protein C12orf49 homolog [Metaseiulus
           occidentalis]
          Length = 193

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 45/123 (36%), Positives = 67/123 (54%), Gaps = 8/123 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG-EKFSCQGCNLLSQCCNSYEYCVSC 97
           C+ ++QG   ++DD GYVC+   +     C  ++G E++ C  C     CC+SYE+C+SC
Sbjct: 69  CRNSLQGPQWVADDRGYVCNRTLLLSSGCCDADQGLERYPCDSC--ADGCCSSYEFCISC 126

Query: 98  CVNP-ARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAYLSDF-H 152
           C+ P  R   E+VL      P +   Y  V   FD C  +CR +S+SV +EN Y+     
Sbjct: 127 CLRPQQRPTLERVLGRLGRSPKSLKLYALVQTQFDLCLTKCRTSSQSVRNENTYIDPVKK 186

Query: 153 HCF 155
           HCF
Sbjct: 187 HCF 189


>gi|289742927|gb|ADD20211.1| uncharacterized conserved protein [Glossina morsitans morsitans]
          Length = 217

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 45/114 (39%), Positives = 62/114 (54%), Gaps = 4/114 (3%)

Query: 38  FCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEK--FSCQGCNLLSQCCNSYEYCV 95
            C+ ++QG+ LL DD G+VC+   +     C  E  E   ++C+ CN  S CC+ YE+CV
Sbjct: 78  ICRNSLQGKELLVDDRGFVCNREHILQNGCCNVESPETLYYTCETCNNSSHCCSIYEFCV 137

Query: 96  SCCVNP-ARTLKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC++P  R L E  L  V   + T        F+ C  +CR NS SV HEN Y
Sbjct: 138 SCCLHPEKRPLLELALQAVNGRQLTVFAQVLDQFELCLVKCRTNSHSVEHENRY 191


>gi|124486989|ref|NP_001074705.1| UPF0454 protein C12orf49 homolog precursor [Mus musculus]
 gi|81897450|sp|Q8BTG6.1|CL049_MOUSE RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
 gi|26355654|dbj|BAC41192.1| unnamed protein product [Mus musculus]
 gi|74355387|gb|AAI04374.1| RIKEN cDNA 2410131K14 gene [Mus musculus]
 gi|74355856|gb|AAI04373.1| RIKEN cDNA 2410131K14 gene [Mus musculus]
 gi|148687847|gb|EDL19794.1| mCG15258 [Mus musculus]
          Length = 205

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 66/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ +VQG++LL+D+ GYVC+   +     CC       +++ C GC L + CC +YEYCV
Sbjct: 75  CRNSVQGKHLLTDELGYVCERKDLLANG-CCDVSVPSTKQYCCDGC-LANGCCEAYEYCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|351694757|gb|EHA97675.1| hypothetical protein GW7_10171 [Heterocephalus glaber]
          Length = 273

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 66/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC L + CC +YEYCV
Sbjct: 143 CRNSIQGKHLITDELGYVCERKDLLANG-CCDVSIPSAKQYCCDGC-LANGCCGAYEYCV 200

Query: 96  SCCVNPAR-TLKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 201 SCCLQPSKQLLLERFLNRAAVAFQNLFLAVEDHFELCLAKCRTSSQSVQHENTY 254


>gi|157818285|ref|NP_001102536.1| uncharacterized protein LOC498188 [Rattus norvegicus]
 gi|149063483|gb|EDM13806.1| similar to hypothetical protein FLJ21415 (predicted) [Rattus
           norvegicus]
          Length = 205

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 66/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ +VQG++L++D+ GYVC+   +     CC       +++ C GC L + CC +YEYCV
Sbjct: 75  CRNSVQGKHLITDELGYVCERKDLLANG-CCDVSVPSTKQYCCNGC-LANGCCEAYEYCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|194214270|ref|XP_001489615.2| PREDICTED: UPF0454 protein C12orf49 homolog, partial [Equus
           caballus]
          Length = 178

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 68/114 (59%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC  +     ++SC GC L S CC++YE+CV
Sbjct: 48  CRNSIQGKHLITDELGYVCERRDLLVNG-CCDVRVPSTRQYSCDGC-LSSGCCSAYEHCV 105

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 106 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 159


>gi|440791825|gb|ELR13063.1| hypothetical protein ACA1_097480 [Acanthamoeba castellanii str.
           Neff]
          Length = 264

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 46/142 (32%), Positives = 76/142 (53%), Gaps = 12/142 (8%)

Query: 30  DIGFQETPF--CKTTVQGRYLLSDDNGYVC--DAVSMDPQSRCCPEKGE----KFSCQGC 81
           D G  +T    C  + QGR  ++D+ GYVC  D +       CC E       ++SCQ C
Sbjct: 123 DAGLGKTWLRTCDNSKQGRTYVTDERGYVCRWDQMYSGAGYGCCWESDRTNISRYSCQQC 182

Query: 82  NLLSQCCNSYEYCVSCCVNPA---RTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNS 138
           ++ + CC  YE+CVSCC+NP+   R      LN++ ++        S+F+ C   CR +S
Sbjct: 183 DIKTTCCLEYEFCVSCCLNPSQEQRVASWLQLNIEDSREREEMAPLSLFEQCLALCRTSS 242

Query: 139 ESVVHENAYLSDFHHCFS-MPS 159
            S++ ++ ++S  H C++ +PS
Sbjct: 243 ASILRQHIFISPSHFCYAGVPS 264


>gi|148224108|ref|NP_001085845.1| UPF0454 protein C12orf49 homolog precursor [Xenopus laevis]
 gi|82184289|sp|Q6GNT2.1|CL049_XENLA RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
 gi|49115513|gb|AAH73420.1| MGC80896 protein [Xenopus laevis]
          Length = 205

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/123 (35%), Positives = 69/123 (56%), Gaps = 8/123 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ +VQG+ L++DD GY+C+   +     CC       + +SC+ C L + CC+ YE+CV
Sbjct: 75  CRNSVQGKLLVTDDMGYICERKELLANG-CCNINVASTKLYSCETC-LPNGCCSVYEFCV 132

Query: 96  SCCVNPARTLK-EQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSDF-H 152
           SCC+ P + L+ E+ LN   +A           F+ C  +CR +S+SV HEN Y +    
Sbjct: 133 SCCLQPNKQLQLERFLNKAAVAFQNLFQAVEDHFELCLAKCRTSSQSVQHENTYRNPIAK 192

Query: 153 HCF 155
           HC+
Sbjct: 193 HCY 195


>gi|157167582|ref|XP_001655053.1| hypothetical protein AaeL_AAEL010931 [Aedes aegypti]
 gi|157167584|ref|XP_001655054.1| hypothetical protein AaeL_AAEL010931 [Aedes aegypti]
 gi|157167586|ref|XP_001655055.1| hypothetical protein AaeL_AAEL010931 [Aedes aegypti]
 gi|94469020|gb|ABF18359.1| uncharacterized conserved protein [Aedes aegypti]
 gi|108872807|gb|EAT37032.1| AAEL010931-PA [Aedes aegypti]
 gi|108872808|gb|EAT37033.1| AAEL010931-PB [Aedes aegypti]
 gi|108872809|gb|EAT37034.1| AAEL010931-PC [Aedes aegypti]
          Length = 228

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 47/123 (38%), Positives = 63/123 (51%), Gaps = 8/123 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ +VQG+ L+ DD GYVC    +   + CC    E  + F C  C   S CC  YEYCV
Sbjct: 75  CRNSVQGKALIVDDRGYVCTRFEL-LWTGCCNVEVESTKLFHCDTCQS-SNCCAIYEYCV 132

Query: 96  SCCVNP-ARTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNSESVVHENAYLS-DFH 152
           SCC++P  R L E+VL     +      T    ++ C  +CR NS SV +EN Y      
Sbjct: 133 SCCMHPNKRPLLERVLAKATGRQIAVYATVNDQYELCLTKCRTNSHSVQNENKYRDPKLK 192

Query: 153 HCF 155
           HC+
Sbjct: 193 HCY 195


>gi|62858887|ref|NP_001016016.1| uncharacterized protein LOC548770 [Xenopus (Silurana) tropicalis]
 gi|183985985|gb|AAI66293.1| hypothetical protein LOC548770 [Xenopus (Silurana) tropicalis]
 gi|213627063|gb|AAI70669.1| hypothetical protein LOC548770 [Xenopus (Silurana) tropicalis]
 gi|213627065|gb|AAI70671.1| hypothetical protein LOC548770 [Xenopus (Silurana) tropicalis]
          Length = 204

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 64/114 (56%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ +VQG+ L++DD GY+C+   +     CC       + +SC+ C L + CC+ YEYCV
Sbjct: 74  CRNSVQGKLLITDDMGYICERKELLANG-CCNINVASTKLYSCETC-LSNGCCSVYEYCV 131

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 132 SCCLQPNKQLLLERFLNRAAVAFQNLFQAVEDHFELCLAKCRTSSQSVQHENTY 185


>gi|321477285|gb|EFX88244.1| hypothetical protein DAPPUDRAFT_305537 [Daphnia pulex]
          Length = 222

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 45/118 (38%), Positives = 69/118 (58%), Gaps = 15/118 (12%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
           C+ +VQG+ L++DD GYVC   ++ P   C  E    + +SC+ C+  + CC+ YEYC+S
Sbjct: 79  CRNSVQGKALIADDKGYVCSRQNVMPNGCCKAETDGTKHYSCETCH-NNGCCSIYEYCIS 137

Query: 97  CCVNPARTLKEQVLNVKIAKPTTAGTYGSV-------FDFCAGRCRHNSESVVHENAY 147
           CC+ P    K++VL   + K   AGT   +       F+ C  +CR +S+SV HEN+Y
Sbjct: 138 CCLQPD---KKEVLQDVLGK--AAGTLNLLLTSVTDHFELCLTKCRTSSQSVQHENSY 190


>gi|348584144|ref|XP_003477832.1| PREDICTED: UPF0454 protein C12orf49 homolog [Cavia porcellus]
          Length = 205

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 67/114 (58%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC L + CC++YEYCV
Sbjct: 75  CRNSIQGKHLITDELGYVCERKDLLANG-CCDGSVPSTKQYCCDGC-LANGCCSAYEYCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAVAFQNLFLAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|115496644|ref|NP_001069491.1| UPF0454 protein C12orf49 homolog precursor [Bos taurus]
 gi|122133614|sp|Q17QN8.1|CL049_BOVIN RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
 gi|109659176|gb|AAI18255.1| Chromosome 12 open reading frame 49 ortholog [Bos taurus]
 gi|296478528|tpg|DAA20643.1| TPA: hypothetical protein LOC534423 precursor [Bos taurus]
          Length = 205

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 66/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC L S CC++YEYCV
Sbjct: 75  CRNSIQGKHLITDELGYVCERRDLLVNG-CCNVNVPGTKQYCCDGC-LSSGCCSAYEYCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|417397061|gb|JAA45564.1| Putative conserved secreted protein precursor [Desmodus rotundus]
          Length = 205

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 74/132 (56%), Gaps = 9/132 (6%)

Query: 23  IISAIRKDIGFQETPF--CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFS 77
           I+  ++ ++G    P   C+ ++QG++L++D+ GYVC+   +     CC       +++ 
Sbjct: 57  ILWKVQFNLGNSSRPSNQCRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSSKQYC 115

Query: 78  CQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCR 135
           C GC L + CC++YE+CVSCC+ P +  L E+ LN   +A           F+ C  +CR
Sbjct: 116 CDGC-LSNGCCSAYEHCVSCCLQPGKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCR 174

Query: 136 HNSESVVHENAY 147
            +S+SV HEN Y
Sbjct: 175 TSSQSVQHENTY 186


>gi|91087999|ref|XP_973740.1| PREDICTED: similar to CG15643 CG15643-PA [Tribolium castaneum]
 gi|270011897|gb|EFA08345.1| hypothetical protein TcasGA2_TC005988 [Tribolium castaneum]
          Length = 203

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 43/117 (36%), Positives = 67/117 (57%), Gaps = 13/117 (11%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG+ L+ DD G+VC    +   S CC   P+   ++SC  C   + CC  YE+C+
Sbjct: 69  CRNSIQGKVLIVDDRGFVCPRHEI-LHSGCCNDTPQTSPQYSCDTCKG-NNCCAIYEFCI 126

Query: 96  SCCVNPARTLKEQVLNVKIAKPTTAGT--YGSV---FDFCAGRCRHNSESVVHENAY 147
           SCC++P    K+++L   + K T      + SV   F+ C  +CR NS+SV HEN+Y
Sbjct: 127 SCCLHPD---KKEMLENLLGKATEQNNVLFASVTDHFELCLAKCRTNSQSVQHENSY 180


>gi|426247320|ref|XP_004017434.1| PREDICTED: UPF0454 protein C12orf49 homolog [Ovis aries]
          Length = 205

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 65/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC L S CC +YEYCV
Sbjct: 75  CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPGTKQYCCDGC-LSSGCCGAYEYCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|311270685|ref|XP_003132942.1| PREDICTED: UPF0454 protein C12orf49 homolog [Sus scrofa]
          Length = 205

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 67/114 (58%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC L + CC++YEYCV
Sbjct: 75  CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPGTKQYCCDGC-LSNGCCSAYEYCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|440912218|gb|ELR61808.1| hypothetical protein M91_20260, partial [Bos grunniens mutus]
          Length = 201

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 66/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC L S CC++YEYCV
Sbjct: 71  CRNSIQGKHLITDELGYVCERRDLLVNG-CCNVNVPGTKQYCCDGC-LSSGCCSAYEYCV 128

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 129 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 182


>gi|56753599|gb|AAW25002.1| SJCHGC02079 protein [Schistosoma japonicum]
          Length = 244

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 44/126 (34%), Positives = 70/126 (55%), Gaps = 7/126 (5%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
           C+   Q   LL+D+ GY+C+ +++   S CCP +   ++F C  C   + CC+ YE+CVS
Sbjct: 118 CRHIKQSPILLADELGYLCNYLNLLKNS-CCPFQNLTQRFVCHSCKF-NHCCSVYEHCVS 175

Query: 97  CCVNPA-RTLKEQVL-NVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLS-DFHH 153
           CC+NP  + L +QV+ N               F+FC   CR +S +V+HEN Y + D  +
Sbjct: 176 CCLNPTNKPLWDQVMQNANANSRRHLKLAIDAFEFCVAVCRTSSLTVLHENKYRNLDEIY 235

Query: 154 CFSMPS 159
           CF + +
Sbjct: 236 CFGLET 241


>gi|57099849|ref|XP_540555.1| PREDICTED: UPF0454 protein C12orf49 homolog [Canis lupus
           familiaris]
 gi|355735600|gb|AES11718.1| hypothetical protein [Mustela putorius furo]
          Length = 205

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GY+C+   +     CC       +++ C GC L   CC +YEYCV
Sbjct: 75  CRNSIQGKHLITDELGYICERKDLLVNG-CCNVNVPGTKQYRCDGC-LPHGCCGAYEYCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|301770727|ref|XP_002920782.1| PREDICTED: UPF0454 protein C12orf49 homolog [Ailuropoda
           melanoleuca]
 gi|281343735|gb|EFB19319.1| hypothetical protein PANDA_009565 [Ailuropoda melanoleuca]
          Length = 205

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GY+C+   +     CC       +++ C GC L   CC +YEYCV
Sbjct: 75  CRNSIQGKHLITDELGYICERKDLLVNG-CCNVNVPGTKQYRCDGC-LPHGCCGAYEYCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|354466980|ref|XP_003495949.1| PREDICTED: UPF0454 protein C12orf49 homolog [Cricetulus griseus]
 gi|344237025|gb|EGV93128.1| UPF0454 protein C12orf49-like [Cricetulus griseus]
          Length = 205

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 66/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ +VQG++L++D+ GYVC+   +     CC       +++ C GC L + CC +YE+CV
Sbjct: 75  CRNSVQGKHLITDELGYVCERKDLLANG-CCDVTVPSTKQYCCDGC-LANGCCEAYEHCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAMAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|403281581|ref|XP_003932260.1| PREDICTED: UPF0454 protein C12orf49 homolog [Saimiri boliviensis
           boliviensis]
          Length = 205

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 73/132 (55%), Gaps = 9/132 (6%)

Query: 23  IISAIRKDIGFQETPF--CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFS 77
           I+  ++ ++G    P   C+ ++QG++L++D+ GYVC+   +     CC       +++ 
Sbjct: 57  ILWKVQFNLGNSSRPSNQCRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYC 115

Query: 78  CQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCR 135
           C GC   + CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR
Sbjct: 116 CDGC-WPNGCCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCR 174

Query: 136 HNSESVVHENAY 147
            +S+SV HEN Y
Sbjct: 175 TSSQSVQHENTY 186


>gi|296213028|ref|XP_002753098.1| PREDICTED: UPF0454 protein C12orf49-like isoform 1 [Callithrix
           jacchus]
          Length = 205

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 73/132 (55%), Gaps = 9/132 (6%)

Query: 23  IISAIRKDIGFQETPF--CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFS 77
           I+  ++ ++G    P   C+ ++QG++L++D+ GYVC+   +     CC       +++ 
Sbjct: 57  ILWKVQFNLGNSSRPSNQCRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYC 115

Query: 78  CQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCR 135
           C GC   + CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C  +CR
Sbjct: 116 CDGC-WPNGCCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCR 174

Query: 136 HNSESVVHENAY 147
            +S+SV HEN Y
Sbjct: 175 TSSQSVQHENTY 186


>gi|344295368|ref|XP_003419384.1| PREDICTED: UPF0454 protein C12orf49 homolog [Loxodonta africana]
          Length = 174

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 74/132 (56%), Gaps = 9/132 (6%)

Query: 23  IISAIRKDIGFQETPF--CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFS 77
           I   ++ ++G    P   C+ ++QG++L++D+ GYVC+   +     CC       E++ 
Sbjct: 26  IAWKVQFNLGNSSRPSNQCRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTEQYC 84

Query: 78  CQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCR 135
           C GC L + CC++YEYCVSCC+ P++  L E+ L+   +A           F+ C  +CR
Sbjct: 85  CNGC-LSNGCCSAYEYCVSCCLQPSKQLLLERFLSRAAVAFQNLFMAVEDHFELCLAKCR 143

Query: 136 HNSESVVHENAY 147
            +S+SV HEN Y
Sbjct: 144 TSSQSVQHENTY 155


>gi|444723218|gb|ELW63879.1| hypothetical protein TREES_T100018686 [Tupaia chinensis]
          Length = 205

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 66/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC L S CC +YE+CV
Sbjct: 75  CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVHVPTTKQYCCDGC-LPSGCCGAYEHCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|410976770|ref|XP_003994786.1| PREDICTED: UPF0454 protein C12orf49 homolog [Felis catus]
          Length = 171

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 67/114 (58%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GY+C+   +     CC       +++ C GC L + CC++YEYCV
Sbjct: 41  CRNSIQGKHLITDELGYICERKDLLVNG-CCNVNVPGTKQYRCDGC-LPNGCCSAYEYCV 98

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 99  SCCLQPSKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 152


>gi|346472337|gb|AEO36013.1| hypothetical protein [Amblyomma maculatum]
          Length = 206

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 45/125 (36%), Positives = 69/125 (55%), Gaps = 10/125 (8%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE--KGEKFSCQGCNLLSQCCNSYEYCVS 96
           C+ +VQG+ L++DD GYVC    +     C  E     ++ C  C L + CC+ YEYC+S
Sbjct: 72  CRNSVQGKVLVADDRGYVCQRSEVASNGCCIAEATSARQYCCDTC-LPNGCCSIYEYCIS 130

Query: 97  CCVNPAR-TLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAYLS-DF 151
           CC++P + TL +++L    A  T      S+   F+ C  +CR +S SV HEN+Y     
Sbjct: 131 CCMHPQKITLLQKILGK--ASDTFKMLVASITDHFELCLTKCRTSSNSVHHENSYRDPKA 188

Query: 152 HHCFS 156
            HC++
Sbjct: 189 KHCYA 193


>gi|241792024|ref|XP_002400643.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215510808|gb|EEC20261.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 209

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 8/134 (5%)

Query: 29  KDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE--KGEKFSCQGCNLLSQ 86
           KD     T  C+ +VQG+ L++DD GYVC    +     C  E    +++ C  C   + 
Sbjct: 61  KDKNSSATISCRNSVQGKVLVADDRGYVCQRSEVASTGCCIVESPNTKRYCCDTCQ-PNG 119

Query: 87  CCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVH 143
           CC+ YEYC+SCC+ P +    Q +  K A  T      S+   F+ C  +CR +SESV H
Sbjct: 120 CCSIYEYCISCCMQPQKIALLQKIMGK-ASETFKMLVASITDHFELCLTKCRTSSESVHH 178

Query: 144 ENAYLS-DFHHCFS 156
           EN+Y      HC++
Sbjct: 179 ENSYRDPSAKHCYA 192


>gi|412993097|emb|CCO16630.1| predicted protein [Bathycoccus prasinos]
          Length = 152

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 62/128 (48%), Gaps = 12/128 (9%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC-PEKG--EKFSCQGCNL-----LSQCCNS 90
           C  T QG + ++DDNGY CD   + P + CC PEK   +++ C  C+L       +CC +
Sbjct: 8   CVNTKQGLHHITDDNGYTCDWFDVFPPTGCCPPEKAIEKRYGCSACDLAKGGETEKCCGT 67

Query: 91  YEYCVSCCVNPARTLKEQVLN---VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           YE CVSCC N   +  E       +   +P T G +   F  C  +CR      VHE+ Y
Sbjct: 68  YESCVSCCQNAKVSTAETDRTREPIGRMQPQT-GYFSDAFSHCKSKCRTQPTVTVHESTY 126

Query: 148 LSDFHHCF 155
             +   CF
Sbjct: 127 GYEKRFCF 134


>gi|402580999|gb|EJW74948.1| hypothetical protein WUBG_14143 [Wuchereria bancrofti]
          Length = 183

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 69/132 (52%), Gaps = 8/132 (6%)

Query: 35  ETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE---KFSCQGCNLLSQCCNSY 91
           E   C+ T+QGR  ++DD GY+C+   +   S  C E G    +FSC  C+L + CC+ Y
Sbjct: 49  EGRACRNTIQGRTSVTDDRGYICERAFL--MSNGCCEVGTQSIRFSCVWCDLYTGCCSHY 106

Query: 92  EYCVSCCVNPART--LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLS 149
           E CVSCC+ P +   L + +++    +     +    F+ C  +CR +S SV  EN Y S
Sbjct: 107 EQCVSCCLRPEQKNILLKMIVSTSGHRLRQILSATDQFELCMLKCRTSSNSVHSENKYKS 166

Query: 150 D-FHHCFSMPSN 160
           +   +C+    N
Sbjct: 167 EKMKYCYKQEPN 178


>gi|62897527|dbj|BAD96703.1| hypothetical protein FLJ21415 variant [Homo sapiens]
          Length = 205

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC   + CC++YEYCV
Sbjct: 75  CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 132

Query: 96  SCCVNPARTLK-EQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P + L  E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQLPLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|312070660|ref|XP_003138249.1| hypothetical protein LOAG_02664 [Loa loa]
 gi|307766590|gb|EFO25824.1| hypothetical protein LOAG_02664 [Loa loa]
          Length = 224

 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 66/131 (50%), Gaps = 6/131 (4%)

Query: 35  ETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP--EKGEKFSCQGCNLLSQCCNSYE 92
           E   C+ TVQGR  ++DD GY C+   +     CC    +  +FSC  C+L + CC+ YE
Sbjct: 90  EGKTCRNTVQGRTSVTDDRGYTCERAFLMLNG-CCEVGTQSVRFSCAWCDLYTGCCSYYE 148

Query: 93  YCVSCCVNPAR--TLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLSD 150
            CVSCC+ P +   L E + +    +     +    F+ C  +CR +S SV  EN Y S+
Sbjct: 149 QCVSCCLRPEQKNILLEMIESTSGHRLRQILSATDQFELCMLKCRTSSNSVHSENKYKSE 208

Query: 151 -FHHCFSMPSN 160
              +C+    N
Sbjct: 209 KMKYCYKQEPN 219


>gi|13376060|ref|NP_079014.1| UPF0454 protein C12orf49 precursor [Homo sapiens]
 gi|109098869|ref|XP_001112172.1| PREDICTED: UPF0454 protein C12orf49-like isoform 1 [Macaca mulatta]
 gi|297693075|ref|XP_002823850.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Pongo
           abelii]
 gi|402887789|ref|XP_003907263.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Papio
           anubis]
 gi|74733622|sp|Q9H741.1|CL049_HUMAN RecName: Full=UPF0454 protein C12orf49; Flags: Precursor
 gi|10437508|dbj|BAB15058.1| unnamed protein product [Homo sapiens]
 gi|18044293|gb|AAH19843.1| Chromosome 12 open reading frame 49 [Homo sapiens]
 gi|119618494|gb|EAW98088.1| chromosome 12 open reading frame 49, isoform CRA_a [Homo sapiens]
 gi|119618495|gb|EAW98089.1| chromosome 12 open reading frame 49, isoform CRA_a [Homo sapiens]
 gi|380816602|gb|AFE80175.1| hypothetical protein LOC79794 precursor [Macaca mulatta]
 gi|383413575|gb|AFH30001.1| hypothetical protein LOC79794 precursor [Macaca mulatta]
 gi|384949484|gb|AFI38347.1| hypothetical protein LOC79794 precursor [Macaca mulatta]
          Length = 205

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC   + CC++YEYCV
Sbjct: 75  CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|114647164|ref|XP_001154941.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Pan
           troglodytes]
 gi|410209358|gb|JAA01898.1| chromosome 12 open reading frame 49 [Pan troglodytes]
 gi|410255302|gb|JAA15618.1| chromosome 12 open reading frame 49 [Pan troglodytes]
 gi|410289894|gb|JAA23547.1| chromosome 12 open reading frame 49 [Pan troglodytes]
 gi|410335531|gb|JAA36712.1| chromosome 12 open reading frame 49 [Pan troglodytes]
          Length = 205

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC   + CC++YEYCV
Sbjct: 75  CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|109098867|ref|XP_001112213.1| PREDICTED: UPF0454 protein C12orf49-like isoform 2 [Macaca mulatta]
          Length = 272

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC   + CC++YEYCV
Sbjct: 75  CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|395833970|ref|XP_003789990.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 1 [Otolemur
           garnettii]
          Length = 205

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC   + CC++YEYCV
Sbjct: 75  CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQLLLERFLNRAAMAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|307207910|gb|EFN85471.1| UPF0454 protein C12orf49-like protein [Harpegnathos saltator]
          Length = 276

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 57/183 (31%), Positives = 86/183 (46%), Gaps = 26/183 (14%)

Query: 3   KSRLNLLFLFFQLLTLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSM 62
           K+ L  + L  +  +    S+ +A   +      P C+ ++QG+ L+ D+ G VC    +
Sbjct: 29  KAPLWQMELLERAFSENAASVDTANLNESDASTQP-CRNSIQGKALIVDERGIVCARHEV 87

Query: 63  DPQSRCCPE--------------KGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLK-- 106
            P   C  E              + E++SC+ CN    CC  YEYCVSCC++P + +K  
Sbjct: 88  LPNGCCRIEQKQSPKNEETPIMARRERYSCKTCNPQG-CCTIYEYCVSCCLHPGKQIKGR 146

Query: 107 EQVL--NVKIAKPTTAGTYG----SVFDFCAGRCRHNSESVVHENAYLSDF-HHCFSM-P 158
           + V+  +VK  K   A          F  C   CR +S SV HEN Y   F  HC++M P
Sbjct: 147 KDVVSGSVKAQKEEDAVRTRIRNLDRFQICLAACRTSSASVRHENTYKDPFAKHCYAMQP 206

Query: 159 SNA 161
           SN+
Sbjct: 207 SNS 209


>gi|397525490|ref|XP_003832699.1| PREDICTED: UPF0454 protein C12orf49 homolog [Pan paniscus]
          Length = 247

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC   + CC++YEYCV
Sbjct: 117 CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 174

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 175 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 228


>gi|427782911|gb|JAA56907.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 210

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 43/125 (34%), Positives = 69/125 (55%), Gaps = 10/125 (8%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
           C+ +VQG+ L++DD GYVC    +     C  E     ++ C  C   + CC+ YEYC+S
Sbjct: 72  CRNSVQGKVLVADDRGYVCQRSEVASNGCCIAEASSTRRYCCDTCQP-NGCCSIYEYCIS 130

Query: 97  CCVNPAR-TLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAYLS-DF 151
           CC++P + T+ +++L    A  T      S+   F+ C  +CR +S+SV HEN+Y     
Sbjct: 131 CCMHPQKITILQKILGK--ASDTFKMLVASITDHFELCLTKCRTSSDSVHHENSYRDPKA 188

Query: 152 HHCFS 156
            HC++
Sbjct: 189 KHCYA 193


>gi|225711380|gb|ACO11536.1| UPF0454 protein C12orf49 homolog precursor [Caligus rogercresseyi]
          Length = 210

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/128 (35%), Positives = 68/128 (53%), Gaps = 8/128 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCV 95
           C+ T+QG  L++DD G +C   +M  QS CC         ++C  CNL   CC++YE CV
Sbjct: 78  CRHTIQGSLLIADDQGVLCLRKNM-LQSGCCERNVTGSTLYACSHCNLDMGCCSAYEACV 136

Query: 96  SCCVNP-ARTLKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLS-DFH 152
           SCC++P  R     VLN  +        +    ++ C  +CR +S SV HEN Y + ++ 
Sbjct: 137 SCCMSPDKRAFLMSVLNEARSLSNILILSVSDQYELCLAKCRTSSRSVHHENTYKNPEYK 196

Query: 153 HCF-SMPS 159
            C+  +PS
Sbjct: 197 FCYKKIPS 204


>gi|332250748|ref|XP_003274513.1| PREDICTED: UPF0454 protein C12orf49 homolog [Nomascus leucogenys]
          Length = 286

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC   + CC++YEYCV
Sbjct: 156 CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 213

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 214 SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 267


>gi|256086857|ref|XP_002579602.1| hypothetical protein [Schistosoma mansoni]
          Length = 392

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 62/112 (55%), Gaps = 5/112 (4%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
           C+ T Q   L+ D+ GY+C+  ++   S CCP +   ++F C  C   + CC+ YE+CVS
Sbjct: 66  CRRTKQSPILVVDELGYLCNYFNLLTNS-CCPFQNVTQRFFCNSCKS-NHCCSIYEHCVS 123

Query: 97  CCVNPA-RTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           CC+NP  ++L ++V N+              F+FC   CR +S +V HEN Y
Sbjct: 124 CCLNPINKSLWDKVQNLDANYKRHLQLATDAFEFCIAVCRTSSLTVFHENRY 175


>gi|353230151|emb|CCD76322.1| hypothetical protein Smp_168260 [Schistosoma mansoni]
          Length = 375

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 62/112 (55%), Gaps = 5/112 (4%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
           C+ T Q   L+ D+ GY+C+  ++   S CCP +   ++F C  C   + CC+ YE+CVS
Sbjct: 66  CRRTKQSPILVVDELGYLCNYFNLLTNS-CCPFQNVTQRFFCNSCKS-NHCCSIYEHCVS 123

Query: 97  CCVNPA-RTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           CC+NP  ++L ++V N+              F+FC   CR +S +V HEN Y
Sbjct: 124 CCLNPINKSLWDKVQNLDANYKRHLQLATDAFEFCIAVCRTSSLTVFHENRY 175


>gi|56119050|ref|NP_001007842.1| UPF0454 protein C12orf49 homolog precursor [Gallus gallus]
 gi|82081584|sp|Q5ZJE4.1|CL049_CHICK RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
 gi|53133640|emb|CAG32149.1| hypothetical protein RCJMB04_18o22 [Gallus gallus]
          Length = 205

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 64/114 (56%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ +VQG+ L++D+ GY+C+   +     CC       + +SC  C L + CC+ YEYCV
Sbjct: 75  CRNSVQGKLLITDELGYICERKDLLVNG-CCNVNVPSTKLYSCDSC-LPNGCCSVYEYCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P++  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQHLLERFLNRAAMAFQNLFMAVEDRFELCLAKCRTSSQSVQHENTY 186


>gi|355564717|gb|EHH21217.1| hypothetical protein EGK_04231, partial [Macaca mulatta]
 gi|355786567|gb|EHH66750.1| hypothetical protein EGM_03800, partial [Macaca fascicularis]
          Length = 169

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC   + CC++YEYCV
Sbjct: 39  CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 96

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 97  SCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 150


>gi|224072158|ref|XP_002200066.1| PREDICTED: UPF0454 protein C12orf49 homolog [Taeniopygia guttata]
          Length = 205

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 64/114 (56%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ +VQG+ L++D+ GY+C+   +     CC       + +SC  C L + CC+ YE+CV
Sbjct: 75  CRNSVQGKLLITDELGYICERKDLLVNG-CCNVHVPSTKLYSCDSC-LPNGCCSIYEFCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P++  L E+ LN   IA           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPSKQHLLERFLNRAAIAFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 186


>gi|324509188|gb|ADY43866.1| Unknown [Ascaris suum]
          Length = 218

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 53/156 (33%), Positives = 78/156 (50%), Gaps = 20/156 (12%)

Query: 18  LQFLSIISAIRKDIGFQETP---FCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP--EK 72
           +Q+L+  SA R      ET     C+ T QG+  ++DD GY CD   +     CC     
Sbjct: 65  IQWLAD-SAWRSGSDINETTGETACRNTRQGKTTVTDDRGYTCDRSYLMLNG-CCEVIPL 122

Query: 73  GEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAG-------TYGS 125
             +F+C GC+L S CC+ YE+CV CC+ P +  KE +L +     TT+G       +   
Sbjct: 123 SVRFNCNGCDLHSGCCSFYEHCVGCCMRPDQ--KEILLEM---IETTSGHRLRQILSATD 177

Query: 126 VFDFCAGRCRHNSESVVHENAYLSD-FHHCFSMPSN 160
            F+ C  +CR +S SV  EN Y S+   +C+    N
Sbjct: 178 QFELCTLKCRTSSNSVHSENKYKSEALKYCYKQEPN 213


>gi|348533454|ref|XP_003454220.1| PREDICTED: UPF0454 protein C12orf49 homolog [Oreochromis niloticus]
          Length = 200

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 65/116 (56%), Gaps = 11/116 (9%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG+ LL+D+ GYVC+   +     CC        ++ C+ C L + CC+ YEYCV
Sbjct: 70  CRNSIQGKTLLTDELGYVCERKDLLVNG-CCNVNAPSTRQYICKSC-LANGCCSIYEYCV 127

Query: 96  SCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   A       + +V   F+ C  +CR +S+SV HEN Y
Sbjct: 128 SCCLQPDKQLLLERFLNR--AAEGFQNLFTAVEDHFELCLAKCRTSSQSVQHENTY 181


>gi|47221493|emb|CAG08155.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 201

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 65/116 (56%), Gaps = 11/116 (9%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG+ LL+D+ GYVC+   +     CC        ++ C+ C L + CC+ YEYCV
Sbjct: 71  CRNSIQGKTLLTDELGYVCERKDLLVNG-CCNVNAPSSRQYICKSC-LANGCCSIYEYCV 128

Query: 96  SCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   A       + +V   F+ C  +CR +S+SV HEN Y
Sbjct: 129 SCCLQPDKQPLLERFLNR--AAEGFRNLFTAVEDHFELCLAKCRTSSQSVQHENTY 182


>gi|324520392|gb|ADY47626.1| Unknown [Ascaris suum]
          Length = 180

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 49/151 (32%), Positives = 72/151 (47%), Gaps = 10/151 (6%)

Query: 18  LQFLSIISAIRKDIGFQETP---FCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP--EK 72
           +Q+L+  SA R      ET     C+ T QG+  ++DD GY CD   +     CC     
Sbjct: 27  IQWLAD-SAWRSGSDINETTGETACRNTRQGKTTVTDDRGYTCDRSYLMLNG-CCEVIPL 84

Query: 73  GEKFSCQGCNLLSQCCNSYEYCVSCCVNP--ARTLKEQVLNVKIAKPTTAGTYGSVFDFC 130
             +F+C GC+L S CC+ YE+CV CC+ P     L E +      +     +    F+ C
Sbjct: 85  SVRFNCNGCDLHSGCCSFYEHCVGCCMRPDQKEILLEMIETTSGHRLRQILSATDQFELC 144

Query: 131 AGRCRHNSESVVHENAYLSD-FHHCFSMPSN 160
             +CR +S SV  EN Y S+   +C+    N
Sbjct: 145 TLKCRTSSNSVHSENKYKSEALKYCYKQEPN 175


>gi|198423098|ref|XP_002130011.1| PREDICTED: similar to Chromosome 12 open reading frame 49 ortholog
           [Ciona intestinalis]
          Length = 210

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 70/141 (49%), Gaps = 20/141 (14%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG+ L+ DDNGYVC    +     CC        +++C+ C   S CC+ YE+CV
Sbjct: 79  CRNSLQGKQLIVDDNGYVCKRDYLLDNG-CCDVTMHDTGRYACKTCQT-SNCCSVYEHCV 136

Query: 96  SCCVNPA-RTLKEQVLNVKIAKPTT----AGTYGSVFDFCAGRCRHNSESVVHENAYLS- 149
           SCC+ P  + L E+V  V+  K  T      +    FD C  +CR +S SV HEN Y   
Sbjct: 137 SCCLQPKHKNLLEKV--VRGLKTNTMLIMMSSIHDQFDLCLMKCRTSSTSVRHENTYRDP 194

Query: 150 DFHHCFSMPSNASGAGVTQLE 170
              +CF       G G  +L 
Sbjct: 195 SMKYCF-------GVGAPELR 208


>gi|410923361|ref|XP_003975150.1| PREDICTED: UPF0454 protein C12orf49 homolog [Takifugu rubripes]
          Length = 199

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 65/116 (56%), Gaps = 11/116 (9%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG+ LL+D+ GYVC+   +     CC        ++ C+ C L + CC+ YEYCV
Sbjct: 69  CRNSIQGKTLLTDELGYVCERKDLLVNG-CCNVNAPSSRQYICKSC-LANGCCSIYEYCV 126

Query: 96  SCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   A       + +V   F+ C  +CR +S+SV HEN Y
Sbjct: 127 SCCLQPDKQPLLERFLNR--AAEGFRNLFTAVEDHFELCLAKCRTSSQSVQHENTY 180


>gi|326930051|ref|XP_003211166.1| PREDICTED: UPF0454 protein C12orf49 homolog [Meleagris gallopavo]
          Length = 205

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 63/114 (55%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ +VQG+ L++D+ GY+C+   +     CC       + +SC  C L + CC+ YEYCV
Sbjct: 75  CRNSVQGKLLITDELGYICERKDLLVNG-CCNVNVPSTKLYSCDSC-LPNGCCSVYEYCV 132

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   +A           F+ C  +CR +S+SV HEN Y
Sbjct: 133 SCCLQPNKQHLLERFLNRAAMAFQNLFMAVEDRFELCLAKCRTSSQSVQHENTY 186


>gi|327276076|ref|XP_003222797.1| PREDICTED: LOW QUALITY PROTEIN: UPF0454 protein C12orf49 homolog
           [Anolis carolinensis]
          Length = 205

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 47/124 (37%), Positives = 70/124 (56%), Gaps = 13/124 (10%)

Query: 32  GFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC--PEKGEK-FSCQGCNLLSQCC 88
           G Q+   C+ +VQG+ L++D+ GY+C+   +     CC     G K +SC+ C L + CC
Sbjct: 68  GSQQGNQCRNSVQGKLLITDELGYICERKDVLVNG-CCNVDAPGTKLYSCESC-LSNGCC 125

Query: 89  NSYEYCVSCCVNPARTLKEQVLNVKIAKPTTA--GTYGSV---FDFCAGRCRHNSESVVH 143
           + YE CVSCC+ P    K+QVL   + +   A    + SV   F+ C  +CR +S+SV H
Sbjct: 126 SVYEACVSCCLQPN---KQQVLENFLNRAAVAFRNLFLSVEDHFELCLAKCRTSSQSVQH 182

Query: 144 ENAY 147
           EN Y
Sbjct: 183 ENTY 186


>gi|66472238|ref|NP_001018573.1| UPF0454 protein C12orf49 homolog precursor [Danio rerio]
 gi|82192840|sp|Q503V3.1|CL049_DANRE RecName: Full=UPF0454 protein C12orf49 homolog; Flags: Precursor
 gi|63100598|gb|AAH95168.1| Zgc:110063 [Danio rerio]
          Length = 199

 Score = 67.0 bits (162), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 65/116 (56%), Gaps = 11/116 (9%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG+ LL+D+ GYVC+   +     CC        ++ C+ C L + CC+ YEYCV
Sbjct: 69  CRNSIQGKSLLTDELGYVCERKDLLVNG-CCNVNAPSSSQYVCKSC-LSNGCCSIYEYCV 126

Query: 96  SCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   A       + +V   F+ C  +CR +S+SV HEN Y
Sbjct: 127 SCCLQPDKQPLLERFLNR--AAEGFQNLFTAVEDHFELCLAKCRTSSQSVQHENTY 180


>gi|348670076|gb|EGZ09898.1| hypothetical protein PHYSODRAFT_549244 [Phytophthora sojae]
          Length = 367

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 43/133 (32%), Positives = 70/133 (52%), Gaps = 14/133 (10%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDP-QSRCCPE------KGEKFSCQGCNL-LSQCCNS 90
           C+ TVQG  LL+D  GYVC    +D  ++ CC E      + ++F+C  C+     CC+ 
Sbjct: 181 CENTVQGVRLLTDSMGYVCTRSQLDKVKAGCCNETDASLPRIKQFTCDQCDTNPPHCCDV 240

Query: 91  YEYCVSCCVNPARTL-----KEQVLNVKIAKPTTAGTYG-SVFDFCAGRCRHNSESVVHE 144
           +E+CVSCC++P         ++ + +     P  +     +VF +C  RCR +S SV H+
Sbjct: 241 FEHCVSCCMHPLNVRFTGIRRDYLTHADPNHPVYSDPAEITVFQYCQYRCRTSSASVQHQ 300

Query: 145 NAYLSDFHHCFSM 157
           N+Y S    C+ +
Sbjct: 301 NSYRSHRSFCYGI 313


>gi|432872487|ref|XP_004072113.1| PREDICTED: UPF0454 protein C12orf49 homolog [Oryzias latipes]
          Length = 201

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 64/116 (55%), Gaps = 11/116 (9%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG+ LL+D+ GYVC+   +     CC        +  C+ C L + CC+ YEYCV
Sbjct: 71  CRNSIQGKSLLTDELGYVCERKDLLVNG-CCNVNAPNSRQHICKSC-LANGCCSIYEYCV 128

Query: 96  SCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L E+ LN   A       + +V   F+ C  +CR +S+SV HEN Y
Sbjct: 129 SCCLQPDKQLLLERFLNR--AAEGFQNLFTAVEDHFELCLAKCRTSSQSVQHENTY 182


>gi|449685232|ref|XP_004210848.1| PREDICTED: UPF0454 protein C12orf49 homolog [Hydra magnipapillata]
          Length = 191

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 68/129 (52%), Gaps = 10/129 (7%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE---KFSCQGCNLLSQCCNSYEYCV 95
           C+ + QG YL++D+ G VC  +    ++ CC        +++C  C+  ++CC  YE+CV
Sbjct: 61  CRNSKQGPYLIADEKGQVC-MIKNVLKNGCCNFHANSTVRYNCNSCHT-NRCCEVYEHCV 118

Query: 96  SCCVNPA---RTLKEQVL-NVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYL-SD 150
           SCC+ P    R L E++L +  +A+          F+ C  +CR +S SV  EN Y  S 
Sbjct: 119 SCCLQPVPRKRRLIEKILVHAHMARNPLLREVKDTFELCLHKCRTSSASVYQENKYKDSS 178

Query: 151 FHHCFSMPS 159
           F +CF   S
Sbjct: 179 FKYCFGTDS 187


>gi|170033812|ref|XP_001844770.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167874847|gb|EDS38230.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 228

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/123 (34%), Positives = 64/123 (52%), Gaps = 8/123 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ +VQG+ L+ DD G+VC    +   + CC    E  + F C  C   + CC  YE+CV
Sbjct: 75  CRNSVQGKALIVDDRGFVCTRSEL-LWTGCCNVEVESTKLFHCDTCQS-NACCAIYEFCV 132

Query: 96  SCCVNP-ARTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNSESVVHENAYLS-DFH 152
           SCC++P  ++L E+VL     +      T    ++ C  +CR NS SV +EN Y      
Sbjct: 133 SCCMHPEKKSLLEKVLAKASGRQIAVYATVNDQYELCLTKCRTNSHSVQNENKYRDPKSK 192

Query: 153 HCF 155
           HC+
Sbjct: 193 HCY 195


>gi|170590022|ref|XP_001899772.1| hypothetical protein [Brugia malayi]
 gi|158592898|gb|EDP31494.1| conserved hypothetical protein [Brugia malayi]
          Length = 295

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/137 (32%), Positives = 71/137 (51%), Gaps = 21/137 (15%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE---KFSCQGCNLLSQCCNSYEYCV 95
           C+ T+QGR  ++DD GY C+   +   S  C E G    +FSC  C+L + CC+ YE CV
Sbjct: 160 CRNTIQGRTSVTDDRGYTCERAFL--MSNGCCEVGTQSLRFSCVWCDLYTGCCSHYEQCV 217

Query: 96  SCCVNPARTLKEQVLNVKIAKPTTAGTYGSV------FDFCAGRCRHNS-----ESVVHE 144
           SCC+ P    +++ + +K+ + T+      +      F+ C  +CR +S     ESV  E
Sbjct: 218 SCCLKP----EQKNILLKMIESTSGHRLRQILSATDQFELCMLKCRTSSNVRAHESVHSE 273

Query: 145 NAYLSD-FHHCFSMPSN 160
           N Y S+   +C+    N
Sbjct: 274 NKYKSEKMKYCYKQEPN 290


>gi|449265786|gb|EMC76924.1| UPF0454 protein C12orf49 like protein, partial [Columba livia]
          Length = 187

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 64/114 (56%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ +VQG+ L++D+ GY+C+   +     CC       + +SC+ C L + CC+ YE CV
Sbjct: 57  CRNSVQGKLLITDELGYICERKDLLVNG-CCNVNVPNTKLYSCESC-LPNGCCSVYECCV 114

Query: 96  SCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           SCC+ P++  L E+ LN   IA           F+ C  +CR +S+SV HEN Y
Sbjct: 115 SCCLQPSKQHLLERFLNRAAIAFQNLFMAVEDRFELCLAKCRTSSQSVQHENTY 168


>gi|390359679|ref|XP_797277.2| PREDICTED: UPF0454 protein C12orf49 homolog [Strongylocentrotus
           purpuratus]
          Length = 198

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 50/147 (34%), Positives = 74/147 (50%), Gaps = 13/147 (8%)

Query: 17  TLQFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEK- 75
           T Q +S+ +   + + F  T  C+ + QG+  + D+  YVC    +     C PE     
Sbjct: 46  TPQQISLRTRTSEKLKFNLTS-CRNSRQGKTFVVDEKAYVCSWKDLLANGCCNPEAKSSS 104

Query: 76  -FSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKP--TTAGTYGSV---FDF 129
             +C GC+  + CC  YE CVSCC++P    KE+VL   + K   T    + SV   F+ 
Sbjct: 105 LHNCNGCSK-NGCCVMYERCVSCCMHPE---KERVLQDMLGKASHTFRLLFQSVADHFEL 160

Query: 130 CAGRCRHNSESVVHENAYLS-DFHHCF 155
           C  +CR +S+SV HEN Y   D  HC+
Sbjct: 161 CLTKCRTSSQSVQHENTYRDPDAKHCY 187


>gi|383856970|ref|XP_003703979.1| PREDICTED: UPF0454 protein C12orf49 homolog [Megachile rotundata]
          Length = 325

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 51/148 (34%), Positives = 69/148 (46%), Gaps = 32/148 (21%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---------------EKGEKFSCQGCNL 83
           C+ +VQG+ L+ D+ G VC    + P   CC                 K E++SC+ CN 
Sbjct: 112 CRNSVQGKVLIVDERGIVCSRQDILPNG-CCNIDQKDPTKNEDTSTVTKSERYSCKTCNA 170

Query: 84  LSQCCNSYEYCVSCCVNPARTLK--EQVL------NVKIAKPTTAGTYG----SVFDFCA 131
              CC  YEYCVSCC++P + +K  + VL      N K  K   A          F  C 
Sbjct: 171 -QGCCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKAHKDQDAVKKRLRNLDRFQVCL 229

Query: 132 GRCRHNSESVVHENAYLSDFH--HCFSM 157
             CR +S SV HEN Y  D H  HC+++
Sbjct: 230 AACRTSSASVRHENTY-KDPHSKHCYTL 256


>gi|345483911|ref|XP_001602536.2| PREDICTED: UPF0454 protein C12orf49 homolog [Nasonia vitripennis]
          Length = 329

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 48/151 (31%), Positives = 69/151 (45%), Gaps = 30/151 (19%)

Query: 39  CKTTVQGRYLLSDDNGYVC-------------DAVSMDPQSRCCPE-KGEKFSCQGCNLL 84
           C+ +VQG+ L+ D+ GYVC             +A      S   P  K  ++SC+ C+  
Sbjct: 113 CRNSVQGKLLIVDEKGYVCSRHEVLSNGCCSLEASDQRTISSSVPSSKKTRYSCKTCSP- 171

Query: 85  SQCCNSYEYCVSCCVNPARTLKEQVLNV-KIAKPTTAGTYGS-----------VFDFCAG 132
             CC  YEYCVSCC++P +  K +   + +I K       G             F  C  
Sbjct: 172 QGCCTIYEYCVSCCLDPNKVKKRKSFTIDQIRKENLKLRRGEDILKLRLRSLDRFQLCLA 231

Query: 133 RCRHNSESVVHENAYLSD-FHHCF--SMPSN 160
            CR +S SV HEN Y +  + HC+   +PSN
Sbjct: 232 ACRTSSASVQHENTYKNPLYKHCYILQLPSN 262


>gi|301113522|ref|XP_002998531.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262111832|gb|EEY69884.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 363

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 43/133 (32%), Positives = 68/133 (51%), Gaps = 14/133 (10%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDP-QSRCCPE------KGEKFSCQGCNL-LSQCCNS 90
           C+ TVQG  LL+D  GYVC    +D  +  CC E      + ++++C  C+     CC+ 
Sbjct: 181 CENTVQGLKLLTDSMGYVCTRSQLDKVKPGCCNETDASLPRIKQYTCDQCDTNPPHCCDV 240

Query: 91  YEYCVSCCVNPARTLKEQVLN--VKIAKPT----TAGTYGSVFDFCAGRCRHNSESVVHE 144
           +E+CVSCC++P       +    +  A P     +     +VF +C  RCR +S SV H+
Sbjct: 241 FEHCVSCCMHPLNVRSSGIRRDFLTHADPNHPVYSDPAEITVFQYCQYRCRTSSASVQHQ 300

Query: 145 NAYLSDFHHCFSM 157
           N+Y S    C+ +
Sbjct: 301 NSYRSHRSFCYGI 313


>gi|281208176|gb|EFA82354.1| cGMP-specific phosphodiesterase [Polysphondylium pallidum PN500]
          Length = 283

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 67/146 (45%), Gaps = 30/146 (20%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---------------------------PE 71
           C+ TVQG   ++DD G+ C    +D ++ CC                            +
Sbjct: 133 CRNTVQGTKFIADDQGFNCLRSEVD-KNGCCIVNNNNNNSNNNNSTDNNNNNKINQYNSD 191

Query: 72  KGEKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLNVKIAKPTTAGTYGSVFDFC 130
             ++FSC GC+    CCN YE CVSCC+   +  + E+VL   + K  ++    + F+ C
Sbjct: 192 IFKRFSCYGCSNKFHCCNEYELCVSCCMGSDKVPILEEVLGRMLDKGISSNYLRNQFELC 251

Query: 131 AGRCRHNSESVVHENAYLS-DFHHCF 155
              CR +S S++H+  Y+  D   C+
Sbjct: 252 TMTCRTSSRSLIHQREYIEPDLKFCY 277


>gi|443709414|gb|ELU04086.1| hypothetical protein CAPTEDRAFT_93600 [Capitella teleta]
          Length = 211

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 42/116 (36%), Positives = 64/116 (55%), Gaps = 11/116 (9%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ +VQG+ L++DD GYVC    +   S CC       +++ C  C   + CC  YE+CV
Sbjct: 77  CRNSVQGQVLIADDEGYVCKRKDVK-GSGCCSLSAATTKRYVCDSCT-KNGCCEVYEHCV 134

Query: 96  SCCVNPART-LKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
           SCC+ P +  L   +L+   A  +    + SV   F+ C  +CR +S+SV HEN+Y
Sbjct: 135 SCCLLPEKQPLLRNILSR--ASDSFRHLFASVADHFELCLAKCRTSSQSVQHENSY 188


>gi|350405127|ref|XP_003487334.1| PREDICTED: UPF0454 protein C12orf49 homolog [Bombus impatiens]
          Length = 329

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 48/147 (32%), Positives = 68/147 (46%), Gaps = 30/147 (20%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE--------------KGEKFSCQGCNLL 84
           C+ ++QG+ L+ D+ G VC    + P   C  E              K E++SC+ CN  
Sbjct: 113 CRNSIQGKVLIVDERGIVCSRQEILPNGCCNVEQKDSNKNEDTSTIAKKERYSCKTCNA- 171

Query: 85  SQCCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV----------FDFCAG 132
             CC  YEYCVSCC++P + +K  + VL   +     A     V          F  C  
Sbjct: 172 QGCCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKAHRDQDVVKKRLRNLDRFQVCLA 231

Query: 133 RCRHNSESVVHENAYLSDFH--HCFSM 157
            CR +S SV HEN Y  D H  HC+++
Sbjct: 232 ACRTSSASVRHENTY-KDPHSKHCYTL 257


>gi|340712268|ref|XP_003394684.1| PREDICTED: UPF0454 protein C12orf49 homolog [Bombus terrestris]
          Length = 329

 Score = 63.5 bits (153), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 48/147 (32%), Positives = 68/147 (46%), Gaps = 30/147 (20%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE--------------KGEKFSCQGCNLL 84
           C+ ++QG+ L+ D+ G VC    + P   C  E              K E++SC+ CN  
Sbjct: 113 CRNSIQGKVLIVDERGVVCSRQEILPNGCCNMEQKDSNKNEDTSTIAKKERYSCKTCNA- 171

Query: 85  SQCCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV----------FDFCAG 132
             CC  YEYCVSCC++P + +K  + VL   +     A     V          F  C  
Sbjct: 172 QGCCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKAHRDQDVVKKRLRNLDRFQVCLA 231

Query: 133 RCRHNSESVVHENAYLSDFH--HCFSM 157
            CR +S SV HEN Y  D H  HC+++
Sbjct: 232 ACRTSSASVRHENTY-KDPHSKHCYTL 257


>gi|332016576|gb|EGI57457.1| UPF0454 protein C12orf49-like protein [Acromyrmex echinatior]
          Length = 293

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/134 (33%), Positives = 65/134 (48%), Gaps = 19/134 (14%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE-------------KGEKFSCQGCNLLS 85
           C+ ++QGR L+ DD+G VC    +     C  E             + E++SC+ CN  S
Sbjct: 110 CRNSIQGRGLIVDDHGIVCSRHEVQSNGCCAIEQKQSSKNEESFMIRRERYSCKTCN--S 167

Query: 86  Q-CCNSYEYCVSCCVNPARTLK-EQVLNVKIAK-PTTAGTYGSVFDFCAGRCRHNSESVV 142
           Q CC  YEYCVSCC++P + ++  + L+   AK     G     F  C   CR +S SV 
Sbjct: 168 QGCCTIYEYCVSCCLHPDKQIRGRKDLSSGSAKVQKNVGRTAVHFQICLAACRTSSSSVR 227

Query: 143 HENAYLSDF-HHCF 155
            +N Y      HC+
Sbjct: 228 RDNTYKDPLAKHCY 241


>gi|145348106|ref|XP_001418497.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578726|gb|ABO96790.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 198

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 4/116 (3%)

Query: 43  VQGRYLLSDDNGYVCDAVSMDPQSRCCPEK--GEKFSCQGCNLLSQCCNSYEYCVSCCVN 100
           V  R ++ DD G  C    +D  + CC       +++C GC+  ++CC  YE C +CC  
Sbjct: 58  VDARRVV-DDRGRACAWYEVDSVTGCCSTTTTAARYACDGCDESARCCRDYETCAACCQG 116

Query: 101 PARTLKEQVLNVKIAKPTTA-GTYGSVFDFCAGRCRHNSESVVHENAYLSDFHHCF 155
           P   ++  +      +   A G +   F++C  RCR       HEN Y  D  +C+
Sbjct: 117 PTHDVEAAMKRHARGRNQIATGFFEDAFEYCKSRCRTQPSVTFHENTYAYDTAYCY 172


>gi|380014036|ref|XP_003691050.1| PREDICTED: UPF0454 protein C12orf49 homolog [Apis florea]
          Length = 327

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 30/147 (20%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE--------------KGEKFSCQGCNLL 84
           C+ ++QG+ L+ D+ G VC    + P   C  E              + E++SC+ CN  
Sbjct: 112 CRNSIQGKVLIVDERGIVCSRQEILPNGCCSMEQKGPIKNEDTSTIPRRERYSCKTCNA- 170

Query: 85  SQCCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV----------FDFCAG 132
             CC  YEYCVSCC++P + +K  + VL   +     +     V          F  C  
Sbjct: 171 QGCCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKSHRDQDVVKKRLRNLDRFQVCLA 230

Query: 133 RCRHNSESVVHENAYLSDFH--HCFSM 157
            CR +S SV HEN Y  D H  HC++ 
Sbjct: 231 ACRTSSASVRHENTY-KDPHSKHCYTF 256


>gi|110755335|ref|XP_001121865.1| PREDICTED: UPF0454 protein C12orf49 homolog [Apis mellifera]
          Length = 327

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 30/147 (20%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE--------------KGEKFSCQGCNLL 84
           C+ ++QG+ L+ D+ G VC    + P   C  E              + E++SC+ CN  
Sbjct: 112 CRNSIQGKVLIVDERGIVCSRQEILPNGCCSMEQKDSIKNEDTSTTPRRERYSCKTCNA- 170

Query: 85  SQCCNSYEYCVSCCVNPARTLK--EQVLNVKIAKPTTAGTYGSV----------FDFCAG 132
             CC  YEYCVSCC++P + +K  + VL   +     +     V          F  C  
Sbjct: 171 QGCCAIYEYCVSCCLHPGKQIKGRKDVLLGLLRDNHKSHRDQDVVKKRLRNLDRFQVCLA 230

Query: 133 RCRHNSESVVHENAYLSDFH--HCFSM 157
            CR +S SV HEN Y  D H  HC++ 
Sbjct: 231 ACRTSSASVRHENTY-KDPHSKHCYTF 256


>gi|307185715|gb|EFN71631.1| UPF0454 protein C12orf49-like protein [Camponotus floridanus]
          Length = 322

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 64/139 (46%), Gaps = 24/139 (17%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPE--------------KGEKFSCQGCNLL 84
           C+ ++QG+ L+ D++G VC    +     C  E              K +++SC+ CN L
Sbjct: 117 CRNSIQGKGLIVDEHGIVCSRYEVLSNGCCTMEQKQAPKNKETSSIVKRKRYSCETCNSL 176

Query: 85  SQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSV-------FDFCAGRCRHN 137
             CC  YEYCVSCC++P + +  Q+   K+         G +       F  C   CR +
Sbjct: 177 G-CCTIYEYCVSCCLDPDKQIL-QMRGRKLGSARMQKDGGRLRIRNQDRFQICLAVCRTS 234

Query: 138 SESVVHENAYLSDF-HHCF 155
           S SV HEN Y      HC+
Sbjct: 235 SLSVQHENTYKDPLAKHCY 253


>gi|431914235|gb|ELK15493.1| hypothetical protein PAL_GLEAN10010741 [Pteropus alecto]
          Length = 219

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 73/146 (50%), Gaps = 23/146 (15%)

Query: 23  IISAIRKDIGFQETPF--CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK---GEKFS 77
           I+  ++ ++G    P   C+ ++QG++L++D+ GYVC+   +     CC       +++ 
Sbjct: 57  ILWKVQFNLGNSSRPSNQCRNSIQGKHLITDELGYVCERKDLLANG-CCNVNVPGTKQYC 115

Query: 78  CQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCAGRCR 135
           C GC L + CC +YE+CVSCC+ P +  L E+ LN   +A           F+ C  +CR
Sbjct: 116 CDGC-LSNGCCGAYEHCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCLAKCR 174

Query: 136 HNS--------------ESVVHENAY 147
            +S              +SV HEN Y
Sbjct: 175 TSSQVRIFDVSRKLEDGQSVQHENTY 200


>gi|255083216|ref|XP_002504594.1| predicted protein [Micromonas sp. RCC299]
 gi|226519862|gb|ACO65852.1| predicted protein [Micromonas sp. RCC299]
          Length = 366

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 43/157 (27%), Positives = 65/157 (41%), Gaps = 40/157 (25%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNL---LSQCCNSYEYCV 95
           C T V+ R+L++D +G VC  +++D  + CC E     S + C        CC++Y  CV
Sbjct: 29  CATAVENRHLVADSSGGVCPPLALDAATGCCLEPKRVVSKEMCAESCSERGCCDAYAACV 88

Query: 96  SCC------------------------VNPARTLKEQVLNVK------IAKPTTAGTYGS 125
            CC                        ++PA   +  + +          +PT   T+  
Sbjct: 89  VCCHDDIARRIAGSEGHDPIELAGFTPMHPAMWRRWLITHDPNGVRKLEGEPTRGETFAG 148

Query: 126 V-------FDFCAGRCRHNSESVVHENAYLSDFHHCF 155
                   FD+C  RCR NS    +ENA+  D HHCF
Sbjct: 149 ADDHAVAPFDYCKHRCRTNSGVTKYENAFQDDRHHCF 185


>gi|334327393|ref|XP_003340895.1| PREDICTED: UPF0454 protein C12orf49 homolog [Monodelphis domestica]
          Length = 174

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 54/101 (53%), Gaps = 11/101 (10%)

Query: 54  GYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPAR-TLKEQV 109
           GYVC+   +     CC        +F C GC L + CC +YEYCVSCC+ P++  L E+ 
Sbjct: 59  GYVCERKDLLVNG-CCNVNAASARRFWCDGC-LANGCCGAYEYCVSCCLQPSKQLLLERF 116

Query: 110 LNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
           LN   A       + +V   F+ C  +CR +S+SV HEN Y
Sbjct: 117 LNQAAA--AFQNLFMAVEDHFELCLAKCRTSSQSVQHENTY 155


>gi|452823888|gb|EME30895.1| peptidyl-prolyl cis-trans isomerase [Galdieria sulphuraria]
          Length = 360

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 49/98 (50%), Gaps = 3/98 (3%)

Query: 178 VIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASM---GADQPAEV 234
           +I  ++ ++C   C+     C      +LN C  + +Y  C+   +A+M   G+D P  V
Sbjct: 248 IIPAKRKQNCYRACQLKNLVCFKKGFSILNNCPTLSEYFDCRECEIATMESAGSDMPCRV 307

Query: 235 ADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
           +  APS  +   C+ +   S+ SC+ SH  T RLCPC 
Sbjct: 308 SLKAPSTYSSGVCMISPNLSLTSCEASHSQTERLCPCV 345


>gi|291221905|ref|XP_002730960.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 200

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 63/114 (55%), Gaps = 7/114 (6%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCN--LLSQCCNSYEYCVS 96
           C+ +VQG+ L++D+ GY+C+   +     CC    EK     C+  L + CC+ YE+CVS
Sbjct: 69  CRNSVQGKVLIADEKGYLCERSDVLGNG-CCDITAEKTVLHCCDKCLDNGCCSIYEHCVS 127

Query: 97  CCVNPARTLKEQVLNVKIAKPTTAGTYGSV---FDFCAGRCRHNSESVVHENAY 147
           CC+ P +    Q +  + +K      + SV   F+ C  +CR +SESV HEN Y
Sbjct: 128 CCLQPQKQQLLQGVLGRASKAFQI-LFSSVTDHFELCLAKCRTSSESVQHENTY 180


>gi|255071235|ref|XP_002507699.1| glycosyltransferase family 13 protein [Micromonas sp. RCC299]
 gi|226522974|gb|ACO68957.1| glycosyltransferase family 13 protein [Micromonas sp. RCC299]
          Length = 636

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 48/97 (49%), Gaps = 4/97 (4%)

Query: 175 GISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEV 234
           G++ +   QG++CD+ C+ +   C       LN CE++ K+  C+ GC   +G D P  V
Sbjct: 540 GLTPVAAVQGQTCDAACRYNHMKCHAPDFWFLNSCEVLAKFFPCEAGCALVLGDDIPNYV 599

Query: 235 ADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
             +         CL +  Q+  +C  SHR T RLC C
Sbjct: 600 VSNRMDTYQ--KCLVSERQA--TCAASHRATTRLCAC 632


>gi|260782034|ref|XP_002586098.1| hypothetical protein BRAFLDRAFT_252331 [Branchiostoma floridae]
 gi|229271187|gb|EEN42109.1| hypothetical protein BRAFLDRAFT_252331 [Branchiostoma floridae]
          Length = 200

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 56/113 (49%), Gaps = 7/113 (6%)

Query: 40  KTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG---EKFSCQGCNLLSQCCNSYEYCVS 96
           +  + G   +S   G+VCD   +     CC        +F+C  C   + CC  YE+CVS
Sbjct: 70  QNRIHGYDFVSTTAGFVCDREDILVGG-CCDVTSLTTRRFTCASCQD-NGCCQVYEHCVS 127

Query: 97  CCVNPART--LKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAY 147
           CC+ P +   L+  + +V  A  T   +    F+ C  +CR +S+SV HEN+Y
Sbjct: 128 CCLQPEKQPLLRSLLDHVPDAFKTLFASVTDHFELCLAKCRTSSQSVQHENSY 180


>gi|358339009|dbj|GAA47150.1| UPF0454 protein C12orf49 homolog, partial [Clonorchis sinensis]
          Length = 186

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 46/161 (28%), Positives = 72/161 (44%), Gaps = 23/161 (14%)

Query: 20  FLSIISAIRKDIGFQETPFCKTTVQ-GRYLLSDDNGYVCDAVSMDPQSRCCPEKG----- 73
           +L+  +A+  + GF     C +T Q     + DD G  C   ++   S CCP        
Sbjct: 31  WLNNTNALTTNKGF-----CSSTRQSASSYVVDDTGTYCPRTAL-LNSGCCPTPVAPDHA 84

Query: 74  ------EKFSCQGCNLLSQ-CCNSYEYCVSCCVNPART--LKEQVLNV-KIAKPTTAGTY 123
                  +F C  CN   + CC  YE+CVSCC+N       +E +L   ++         
Sbjct: 85  DTAIIVRRFVCDSCNQTRENCCQIYEHCVSCCMNRQHIHLWREALLQASELPAQRHLLLA 144

Query: 124 GSVFDFCAGRCRHNSESVVHENAYLSDFH-HCFSMPSNASG 163
             +F +C  +CR +S+SV+HEN Y    H +CF +   + G
Sbjct: 145 DGLFQYCQAKCRTSSQSVLHENTYRDAEHRYCFGLYLTSPG 185


>gi|320162724|gb|EFW39623.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 319

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 16/121 (13%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKG--EKFSCQGCNLLSQCCNSYEYCVS 96
           C+ TVQGR L++DD G +C   S+ P   C P      ++ C  C   + CC  +E C  
Sbjct: 103 CRNTVQGRVLVTDDRGNMCTRTSVLPNGCCSPAANTTRQYVCDSC-TPNDCCTQFEVCTF 161

Query: 97  CCVNPARTLKEQV-LNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYL-SDFHHC 154
             +      ++ + +NV+             F+ C  +CR +S SVVHEN Y  + + +C
Sbjct: 162 FLIKFLHRARDALFMNVQ-----------DQFEMCLAKCRTSSLSVVHENTYKDTQYKYC 210

Query: 155 F 155
           F
Sbjct: 211 F 211


>gi|326432709|gb|EGD78279.1| hypothetical protein PTSG_09343 [Salpingoeca sp. ATCC 50818]
          Length = 198

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 36/122 (29%), Positives = 53/122 (43%), Gaps = 6/122 (4%)

Query: 28  RKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGE--KFSCQGCNLLS 85
           R+ +       CK T      L DD G VC+     P   CC    +  +  C  C+   
Sbjct: 70  RRHLAAASNDSCKYTTFDGLQLVDDLGRVCNVEDAQPNG-CCFSTSQIVEAVCDSCDA-G 127

Query: 86  QCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPT--TAGTYGSVFDFCAGRCRHNSESVVH 143
            CC  YE+CVSCC+ P      + +   + +P           F++C  +CR +S+SV H
Sbjct: 128 HCCRVYEHCVSCCLAPKHVPLRRFVFHNLTEPQRRVLSLVSDQFEYCEAKCRTSSQSVQH 187

Query: 144 EN 145
           EN
Sbjct: 188 EN 189


>gi|426374279|ref|XP_004054004.1| PREDICTED: UPF0454 protein C12orf49 homolog [Gorilla gorilla
           gorilla]
          Length = 220

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/130 (32%), Positives = 65/130 (50%), Gaps = 24/130 (18%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC---PEKGEKFSCQGCNLLSQCCNSYEYCV 95
           C+ ++QG++L++D+ GYVC+   +     CC       +++ C GC   + CC++YEYCV
Sbjct: 75  CRNSIQGKHLITDELGYVCERKDLLVNG-CCNVNVPSTKQYCCDGC-WPNGCCSAYEYCV 132

Query: 96  SCCVNPART-LKEQVLN---VKIAKPTTAGTY------GSVFDF--------CAGRCRHN 137
           SCC+ P +  L E+ LN   V       A  Y         FDF        C+   +++
Sbjct: 133 SCCLQPNKQLLLERFLNRAAVAFQNLFMALLYPLPDFCSPPFDFMPLLMSLVCSASIQYH 192

Query: 138 SESVVHENAY 147
             SV HEN Y
Sbjct: 193 -WSVQHENTY 201


>gi|432116785|gb|ELK37410.1| hypothetical protein MDA_GLEAN10014215 [Myotis davidii]
          Length = 149

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 45/76 (59%), Gaps = 3/76 (3%)

Query: 74  EKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCA 131
           +++ C GC L + CC++YE+CVSCC+ P +  L E+ LN   +A           F+ C 
Sbjct: 56  KQYCCDGC-LTNGCCSAYEHCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCL 114

Query: 132 GRCRHNSESVVHENAY 147
            +CR +S+SV HEN Y
Sbjct: 115 AKCRTSSQSVQHENTY 130


>gi|402887791|ref|XP_003907264.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 2 [Papio
           anubis]
          Length = 175

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 44/76 (57%), Gaps = 3/76 (3%)

Query: 74  EKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCA 131
           +++ C GC   + CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C 
Sbjct: 82  KQYCCDGC-WPNGCCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCL 140

Query: 132 GRCRHNSESVVHENAY 147
            +CR +S+SV HEN Y
Sbjct: 141 AKCRTSSQSVQHENTY 156


>gi|332840508|ref|XP_003314001.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 2 [Pan
           troglodytes]
          Length = 175

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 44/76 (57%), Gaps = 3/76 (3%)

Query: 74  EKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCA 131
           +++ C GC   + CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C 
Sbjct: 82  KQYCCDGC-WPNGCCSAYEYCVSCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCL 140

Query: 132 GRCRHNSESVVHENAY 147
            +CR +S+SV HEN Y
Sbjct: 141 AKCRTSSQSVQHENTY 156


>gi|330794772|ref|XP_003285451.1| hypothetical protein DICPUDRAFT_149321 [Dictyostelium purpureum]
 gi|325084626|gb|EGC38050.1| hypothetical protein DICPUDRAFT_149321 [Dictyostelium purpureum]
          Length = 322

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 47/84 (55%), Gaps = 2/84 (2%)

Query: 74  EKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTY-GSVFDFCAG 132
           E++SC+ C   S CC++YEYC+SCC+   +    Q + +  A+   A  Y  + FD C  
Sbjct: 232 ERYSCKRCLEGSDCCDTYEYCISCCMRYDKVPVVQEMLIGFAERGVAKEYLKNQFDLCTM 291

Query: 133 RCRHNSESVVHENAYLS-DFHHCF 155
            CR +S S++++  Y +  F  C+
Sbjct: 292 ACRTSSSSLLNQREYQNPKFKFCY 315



 Score = 38.1 bits (87), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 15/34 (44%), Positives = 23/34 (67%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEK 72
           C++TVQG   ++DD GYVC   S D  ++CC ++
Sbjct: 90  CQSTVQGNLYIADDQGYVCSRSSWDSITKCCKKQ 123


>gi|395833972|ref|XP_003789991.1| PREDICTED: UPF0454 protein C12orf49 homolog isoform 2 [Otolemur
           garnettii]
          Length = 175

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 44/76 (57%), Gaps = 3/76 (3%)

Query: 74  EKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCA 131
           +++ C GC   + CC++YEYCVSCC+ P +  L E+ LN   +A           F+ C 
Sbjct: 82  KQYCCDGC-WPNGCCSAYEYCVSCCLQPNKQLLLERFLNRAAMAFQNLFMAVEDHFELCL 140

Query: 132 GRCRHNSESVVHENAY 147
            +CR +S+SV HEN Y
Sbjct: 141 AKCRTSSQSVQHENTY 156


>gi|290976460|ref|XP_002670958.1| predicted protein [Naegleria gruberi]
 gi|284084522|gb|EFC38214.1| predicted protein [Naegleria gruberi]
          Length = 350

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 55/131 (41%), Gaps = 15/131 (11%)

Query: 144 ENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKL 203
           E +  +DF H  S+P     A        L  I  ++G  GE+CD++C  S   C  N  
Sbjct: 229 ETSAPNDFVHD-SLPCETLPASAIHSLQELQTIKAVLGALGENCDTICAKSESVCAPNYF 287

Query: 204 ILLNQCEIIQKYMS---CKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDG 260
             +N+C+ ++K+ S   CK+             V D      +   C+    +S   C  
Sbjct: 288 PYINRCDEMRKHYSSCECKK-----------EGVIDSRAPYFDGKYCIIGNRRSKFRCGN 336

Query: 261 SHRHTRRLCPC 271
           +H   RRLCPC
Sbjct: 337 AHPSERRLCPC 347


>gi|221042418|dbj|BAH12886.1| unnamed protein product [Homo sapiens]
          Length = 175

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 43/76 (56%), Gaps = 3/76 (3%)

Query: 74  EKFSCQGCNLLSQCCNSYEYCVSCCVNPART-LKEQVLN-VKIAKPTTAGTYGSVFDFCA 131
           +++ C GC   + CC++YEYC SCC+ P +  L E+ LN   +A           F+ C 
Sbjct: 82  KQYCCDGC-WPNGCCSAYEYCASCCLQPNKQLLLERFLNRAAVAFQNLFMAVEDHFELCL 140

Query: 132 GRCRHNSESVVHENAY 147
            +CR +S+SV HEN Y
Sbjct: 141 AKCRTSSQSVQHENTY 156


>gi|299117113|emb|CBN73884.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 83

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 38/78 (48%), Gaps = 1/78 (1%)

Query: 195 GQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQS 254
           G  C  +   LLN C +++ +  C   C  S G +QP  V  DAP +  P  CL +    
Sbjct: 3   GMICKESSFQLLNSCGLLRDHFPCT-SCTESNGFEQPCYVELDAPEDNLPGQCLISSNPR 61

Query: 255 MLSCDGSHRHTRRLCPCA 272
             +C  SH  TRRLCPC 
Sbjct: 62  HSTCSASHEVTRRLCPCT 79


>gi|291000376|ref|XP_002682755.1| N-acetylglucosaminylaminotransferase [Naegleria gruberi]
 gi|284096383|gb|EFC50011.1| N-acetylglucosaminylaminotransferase [Naegleria gruberi]
          Length = 617

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 59/124 (47%), Gaps = 12/124 (9%)

Query: 153 HCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSS-GQSCVLNKLILLNQCEI 211
           HC +  S         L  + S I+  I +QG+SC S CK      C       +N C+ 
Sbjct: 469 HCPAPISKFHKDSNWSLPKKDSIITFAISKQGQSCSSYCKQGFNMECYRFSFEEMNSCKF 528

Query: 212 IQKYMSCK-RGCLASMGADQPAEVADDAPSNLNPTACLYT---RIQSMLSCDGSHRHTRR 267
           ++K++  +   C+ S G DQPA        +++   CL+    R + + +C+  H +T+R
Sbjct: 529 LRKFVGQECTECIPSSGPDQPA-------IDVDSKQCLFNGGGRGKFISTCEAYHVNTKR 581

Query: 268 LCPC 271
           +CPC
Sbjct: 582 ICPC 585


>gi|66817203|ref|XP_642480.1| hypothetical protein DDB_G0277747 [Dictyostelium discoideum AX4]
 gi|60470568|gb|EAL68547.1| hypothetical protein DDB_G0277747 [Dictyostelium discoideum AX4]
          Length = 312

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 69/172 (40%), Gaps = 55/172 (31%)

Query: 39  CKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC----------------------------- 69
           C++T+QG   ++DD GYVC     D  + CC                             
Sbjct: 133 CESTIQGSTYIADDQGYVCLRSQWDSVTGCCFGGNKTSTTTNNNKHFRDDDDDDNDNDEG 192

Query: 70  -------------------PEKG-EKFSCQGC--NLLSQCCNSYEYCVSCCVNPART--L 105
                               E G ++F+C  C    +S CC+SYE+C+SCC+   +   L
Sbjct: 193 NQKTTNNKNRGDNENQIENEEIGVQRFTCSRCLGGEVSDCCDSYEFCISCCMRQDKVSIL 252

Query: 106 KEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVHENAYLS-DFHHCFS 156
            + ++N    K        + FD C+  CR +S S+V++  Y   +F  C+S
Sbjct: 253 HDTLINF-AEKNLDKKYLKNQFDLCSMACRTSSASLVNQREYQDPEFKFCYS 303


>gi|294955121|ref|XP_002788417.1| alpha-1,3-mannosyl-glycoprotein beta-1,
           2-n-acetylglucosaminyltransferase, putative [Perkinsus
           marinus ATCC 50983]
 gi|239903817|gb|EER20213.1| alpha-1,3-mannosyl-glycoprotein beta-1,
           2-n-acetylglucosaminyltransferase, putative [Perkinsus
           marinus ATCC 50983]
          Length = 1171

 Score = 46.6 bits (109), Expect = 0.013,   Method: Composition-based stats.
 Identities = 28/89 (31%), Positives = 38/89 (42%), Gaps = 3/89 (3%)

Query: 183 QGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNL 242
           +G SC   C  +   C  + L   N C I+  +  C+ GC   +G + PA V D     +
Sbjct: 513 RGVSCTHTCIHNNARCYDHLLEFANDCHIMLSHFPCEEGCGHQLGLELPAYVMDSTLHTV 572

Query: 243 NPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
               CL T   S  SC   H  T R+C C
Sbjct: 573 G--QCLVTDHGSP-SCATYHISTARMCTC 598


>gi|340369555|ref|XP_003383313.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase B-like
           [Amphimedon queenslandica]
          Length = 662

 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 26/97 (26%), Positives = 44/97 (45%), Gaps = 1/97 (1%)

Query: 176 ISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCK-RGCLASMGADQPAEV 234
           +  +I + G++C   C  +   C  +   L+N  E +  ++ C     L    AD    +
Sbjct: 553 LRTVISKPGQNCRDSCLENKMVCESSFFSLINTHEAVFNHLGCNVTHTLEEKYADYAPAI 612

Query: 235 ADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
           A  A   + P  C+   I+ +L C G  ++ RRLCPC
Sbjct: 613 AVSADKPIEPGHCVIIDIRRLLGCPGYSKNFRRLCPC 649


>gi|290976364|ref|XP_002670910.1| predicted protein [Naegleria gruberi]
 gi|284084474|gb|EFC38166.1| predicted protein [Naegleria gruberi]
          Length = 466

 Score = 43.9 bits (102), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 57/116 (49%), Gaps = 18/116 (15%)

Query: 158 PSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQK-YM 216
           P+N  G  +T+L      +  I+  +GE+CD+ C  +  +C       +N+C+ ++K Y 
Sbjct: 364 PTNTHG--ITELHN----LKAILCEKGENCDTCCSRNNATCDGLFFPYINRCDEMRKHYP 417

Query: 217 SCKRGCLASMGADQPAEVADDAPS-NLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
            CK  C          E+AD++ S N +   C+    +S   C  S+R  RRLCPC
Sbjct: 418 DCK--C--------AYELADESRSPNFDGELCVIGNRRSRFRCGNSYRTERRLCPC 463


>gi|290987267|ref|XP_002676344.1| predicted protein [Naegleria gruberi]
 gi|284089946|gb|EFC43600.1| predicted protein [Naegleria gruberi]
          Length = 481

 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 53/123 (43%), Gaps = 9/123 (7%)

Query: 152 HHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEI 211
           H  F  P       +T  E  L+ +  ++  +GE CD+ CK   ++C ++    +N C  
Sbjct: 365 HDVFEFPLKPPTLKITLSE--LNKVESVLCEEGEDCDTCCKKRAKTCDVSFFPYINHCRE 422

Query: 212 IQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDG--SHRHTRRLC 269
           IQK+   K  C   +  +  + +A    +N +   C      S  SC    S  + RRLC
Sbjct: 423 IQKH--SKERCNCKIDDEFFSSIAPSMDTNKD---CYVGYRPSRFSCSNKKSKSNHRRLC 477

Query: 270 PCA 272
           PC 
Sbjct: 478 PCK 480


>gi|426221180|ref|XP_004004788.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Ovis aries]
          Length = 739

 Score = 42.0 bits (97), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 51/104 (49%), Gaps = 17/104 (16%)

Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
           LS + V     G+SC  VC+ +   C  +    LN+ + + KY ++C           Q 
Sbjct: 633 LSALQVKFAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYEVNC-----------QS 681

Query: 232 AEVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 271
           +E+A D  APS  +P    C++     + SC G+H H RR+CPC
Sbjct: 682 SELAKDIVAPS-FDPKNKHCVFQGDLLLFSCAGAHAHHRRICPC 724


>gi|440795738|gb|ELR16854.1| GNT-I family protein [Acanthamoeba castellanii str. Neff]
          Length = 566

 Score = 40.4 bits (93), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 44/165 (26%), Positives = 62/165 (37%), Gaps = 21/165 (12%)

Query: 117 PTTAGTYGSV------FDFCAGRCRHNSESVVH---ENAYLSDFHHCFSMPSNASGAGVT 167
           P     YGS+      +D    R  HN   V+        L+D  HC  +P +       
Sbjct: 401 PYVRENYGSISLSLALWD-TPSRGTHNHSLVLRYKKNTLLLADKRHCAYLPESLRILPHE 459

Query: 168 QLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRG-CLASM 226
           QL  RL   +V     GE C + C   G  C       L  C+ +     C RG C  ++
Sbjct: 460 QL--RLVTSAV-----GEDCHAPCAREGLKCHQGDFDFLRHCDALASRFPCHRGRCQLAV 512

Query: 227 GADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
           G + P  +    P       CL +  +   SC   H+  +RLCPC
Sbjct: 513 GPEMPIFLP-HGPHQNATGRCLVS--EDFASCAAKHKGGQRLCPC 554


>gi|443721485|gb|ELU10776.1| hypothetical protein CAPTEDRAFT_200210 [Capitella teleta]
          Length = 484

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 55/120 (45%), Gaps = 7/120 (5%)

Query: 155 FSMPSNASGAGVTQLEGRLSGISVII-GRQGESCDSVCKSSGQSCVLNKLILLNQ-CEII 212
           F+ P  A  + +  + GRL    +I+ G  G  C  VC    + C +  L LLN+ C ++
Sbjct: 368 FATPHGAQ-SPLRGVIGRLQLDYIIVTGDDGLDCSQVCAGVYKECDVQGLFLLNKGCAVL 426

Query: 213 QKY-MSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
           +   +SCK      +   QP     D P+ L+   C  T      SC G H  ++R+CPC
Sbjct: 427 ELLALSCKVCSEVKVEGSQP--YTGDMPAMLSDGNCC-TGWPHHSSCFGKHVESKRICPC 483


>gi|298710994|emb|CBJ32301.1| hypothetical protein Esi_0327_0028 [Ectocarpus siliculosus]
          Length = 550

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 36/114 (31%), Gaps = 49/114 (42%)

Query: 91  YEYCVSCCVNPARTLKEQVLNVKIA---KPTTAGTYGS---------------------- 125
           YE+CVSCC NP R  + + +    A    P      GS                      
Sbjct: 391 YEFCVSCCQNPQRGEEREAIQASAALSGHPAYRDLLGSGNNDAANGREPPPPRQLRASAA 450

Query: 126 ------------------------VFDFCAGRCRHNSESVVHENAYLSDFHHCF 155
                                    F  CA RCR  S SV HEN+Y S   HCF
Sbjct: 451 AAEAGGVEKNGQQVEAEERDARELAFAHCAFRCRTYSGSVAHENSYRSPLKHCF 504


>gi|440902995|gb|ELR53712.1| Alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Bos grunniens
           mutus]
          Length = 739

 Score = 38.5 bits (88), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 17/104 (16%)

Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
           LS + V     G+SC  VC+ +   C  +    LN+ + + KY ++C           Q 
Sbjct: 633 LSALQVKFAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYDVNC-----------QS 681

Query: 232 AEVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 271
           +E+A D  APS  +P    C++     + SC G+H   RR+CPC
Sbjct: 682 SELAKDIVAPS-FDPKNKHCVFQGDLLLFSCAGAHARHRRICPC 724


>gi|300797805|ref|NP_001178271.1| alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Bos taurus]
 gi|296490527|tpg|DAA32640.1| TPA: mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [Bos taurus]
          Length = 739

 Score = 38.5 bits (88), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 17/104 (16%)

Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
           LS + V     G+SC  VC+ +   C  +    LN+ + + KY ++C           Q 
Sbjct: 633 LSALQVKFAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYDVNC-----------QS 681

Query: 232 AEVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 271
           +E+A D  APS  +P    C++     + SC G+H   RR+CPC
Sbjct: 682 SELAKDIVAPS-FDPKNKHCVFQGDLLLFSCAGAHARHRRICPC 724


>gi|296204935|ref|XP_002749539.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Callithrix
           jacchus]
          Length = 741

 Score = 38.5 bits (88), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 9/100 (9%)

Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY ++C+   LA     + 
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYEVTCQSSELA-----KD 689

Query: 232 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
             V    P N     C++     + SC G+H   RR+CPC
Sbjct: 690 ILVPSFDPKN---KHCVFQGDLLLFSCAGAHPRHRRVCPC 726


>gi|403280116|ref|XP_003931580.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Saimiri
           boliviensis boliviensis]
          Length = 741

 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 9/100 (9%)

Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY ++C+   LA     + 
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYEVTCQSSELA-----KD 689

Query: 232 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
             V    P N     C++     + SC G+H   RR+CPC
Sbjct: 690 ILVPSFDPKN---KHCVFQGDLLLFSCAGAHPRHRRVCPC 726


>gi|303280994|ref|XP_003059789.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458444|gb|EEH55741.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 274

 Score = 38.1 bits (87), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 125 SVFDFCAGRCRHNSESVVHENAYLSDFHHCF-SMPSNASGAGVTQLEGR 172
           S F++CA RCR NS    +EN +    HHCF S+ + + GA  T   GR
Sbjct: 204 SSFEYCAHRCRTNSAVTKYENEFQDAKHHCFGSVYAESVGADDTLSPGR 252


>gi|62702324|gb|AAX93248.1| unknown [Homo sapiens]
          Length = 231

 Score = 37.7 bits (86), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 9/100 (9%)

Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY ++C+   LA     + 
Sbjct: 125 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYKVTCQSSELA-----KD 179

Query: 232 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
             V    P N +   C++     + SC G+H   +R+CPC
Sbjct: 180 ILVPSFDPKNKH---CVFQGDLLLFSCAGAHPRHQRVCPC 216


>gi|348520977|ref|XP_003448003.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase B-like
           [Oreochromis niloticus]
          Length = 805

 Score = 37.7 bits (86), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 44/105 (41%), Gaps = 19/105 (18%)

Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 232
           LS + +++ ++G+SC   C+S+G  C       +N  E ++             G D   
Sbjct: 699 LSSLRLLVSQEGQSCVEACQSAGFICEPAHFRFINNKEALR-------------GLDVQC 745

Query: 233 EVADDAPSNLNPT------ACLYTRIQSMLSCDGSHRHTRRLCPC 271
           EV D   +++ P        C   R   + SC G     RRLCPC
Sbjct: 746 EVVDSEINHILPAFSVVRQECGLQREPLLFSCAGHSPKYRRLCPC 790


>gi|301770689|ref|XP_002920763.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A-like
           [Ailuropoda melanoleuca]
 gi|281337646|gb|EFB13230.1| hypothetical protein PANDA_009537 [Ailuropoda melanoleuca]
          Length = 741

 Score = 37.4 bits (85), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 9/100 (9%)

Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
           LS + V +   G+SC  VC+ +   C  +    LN+ + + KY ++C+   LA     + 
Sbjct: 635 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKDKDLLKYEVTCQSSELA-----KD 689

Query: 232 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
             V    P N +   C++     + SC G+H   RR+CPC
Sbjct: 690 ILVPSFDPKNRH---CVFQGDLLLFSCAGAHPRHRRVCPC 726


>gi|291391470|ref|XP_002712459.1| PREDICTED: N-acetylglucosaminyltransferase V [Oryctolagus
           cuniculus]
          Length = 741

 Score = 37.4 bits (85), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 9/100 (9%)

Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
           LS + V +   G+SC  VC+ +   C  +    LN+ + + KY +SC+   LA     + 
Sbjct: 635 LSALQVKLAEPGQSCKQVCQENQLICEPSFFQHLNKNKDVLKYEVSCQSSELA-----KD 689

Query: 232 AEVADDAPSNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
             V    P N     C++     + SC G+H   +R+CPC
Sbjct: 690 ILVPSFDPKN---KHCVFQGDLLLFSCAGAHPRHQRICPC 726


>gi|426337254|ref|XP_004032629.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Gorilla
           gorilla gorilla]
          Length = 741

 Score = 37.4 bits (85), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 232
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY   K  C       Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684

Query: 233 EVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 271
           E+A D   PS  +P    C++     + SC G+H   +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQADLLLFSCAGAHPRYQRVCPC 726


>gi|428167931|gb|EKX36882.1| hypothetical protein GUITHDRAFT_116905 [Guillardia theta CCMP2712]
          Length = 652

 Score = 37.4 bits (85), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 37/91 (40%), Gaps = 6/91 (6%)

Query: 183 QGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASM-GADQPAEVADDAPSN 241
           QG SC + C+    +C      +LN CE +       + C     G D PA   +D    
Sbjct: 290 QGVSCFTACQEQNLACQRIWFDVLNNCEALAVAFPSHKICSTDFYGRDLPAYRPEDLTVL 349

Query: 242 LNPTACLYTRIQSMLSCDGSHRHTRRLCPCA 272
           +N     Y+      SC G H  T+RLC C 
Sbjct: 350 VNKKPRTYSS-----SCGGKHEKTKRLCGCG 375


>gi|196010315|ref|XP_002115022.1| predicted protein [Trichoplax adhaerens]
 gi|190582405|gb|EDV22478.1| predicted protein [Trichoplax adhaerens]
          Length = 1490

 Score = 37.4 bits (85), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 3/64 (4%)

Query: 77  SCQGCNLLSQCCNSYEYCVSCCV--NPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRC 134
           +C  CN  ++CC+ YE+CVSCC     A  LKE + N          +  + F+FC  +C
Sbjct: 107 ACHTCNS-NRCCDQYEFCVSCCQYRKEALDLKEIIRNSPDGIQQLLTSAKTRFEFCLIQC 165

Query: 135 RHNS 138
           R +S
Sbjct: 166 RTSS 169


>gi|428175838|gb|EKX44726.1| hypothetical protein GUITHDRAFT_109505 [Guillardia theta CCMP2712]
          Length = 551

 Score = 37.4 bits (85), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 15/92 (16%)

Query: 186 SCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQP------AEVADDAP 239
           +CDSVC+S+G+ C  +  + L++C+I +K    +  C  +   D P       E+     
Sbjct: 124 NCDSVCQSAGKVCARDWFVFLDRCDIAKKVFP-EGTCSFAFYPDMPFYDKTTKEIFVSKD 182

Query: 240 SNLNPTACLYTRIQSMLSCDGSHRHTRRLCPC 271
           + L+P  C    +          R   RLCPC
Sbjct: 183 TKLDPPTCDAAPLGG--------RSAERLCPC 206


>gi|344268130|ref|XP_003405916.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Loxodonta
           africana]
          Length = 741

 Score = 37.0 bits (84), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 17/104 (16%)

Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKY-MSCKRGCLASMGADQP 231
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY ++C           Q 
Sbjct: 635 LSALQVKLAEPGKSCKQVCQESQLICEPSFFQHLNKDKDMLKYEVTC-----------QS 683

Query: 232 AEVADD--APSNLNPTA--CLYTRIQSMLSCDGSHRHTRRLCPC 271
           +E+A D   PS  +P +  C++     + SC G+H   +R+CPC
Sbjct: 684 SELAKDILVPS-FDPKSKHCVFQGDLLLFSCAGAHPRHQRVCPC 726


>gi|297266872|ref|XP_002799448.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A-like [Macaca
           mulatta]
          Length = 547

 Score = 37.0 bits (84), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 232
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY   K  C       Q +
Sbjct: 441 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 490

Query: 233 EVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 271
           E+A D   PS  +P    C++     + SC G+H   +R+CPC
Sbjct: 491 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 532


>gi|297668509|ref|XP_002812479.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Pongo abelii]
          Length = 741

 Score = 37.0 bits (84), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 232
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY   K  C       Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684

Query: 233 EVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 271
           E+A D   PS  +P    C++     + SC G+H   +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 726


>gi|291233073|ref|XP_002736478.1| PREDICTED: N-acetylglucosaminyltransferase V-like [Saccoglossus
           kowalevskii]
          Length = 561

 Score = 37.0 bits (84), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 34/129 (26%), Positives = 55/129 (42%), Gaps = 19/129 (14%)

Query: 145 NAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKLI 204
           NAY+ +   C + P              LS + V I    ESC +VC  +G  C  +   
Sbjct: 435 NAYIENQDFCSANPKQWPP---------LSALKVKISNPKESCKAVCMKNGLICEPSFFK 485

Query: 205 LLNQCEIIQK--YMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSH 262
            LN  E+I++  Y+ CK   ++      P+ + +   SN+    C   + + + SC G  
Sbjct: 486 SLNTQEMIEREGYLECK-SVISVEDIYAPSFIKE---SNI----CHLEKQKLLFSCAGGS 537

Query: 263 RHTRRLCPC 271
               R+CPC
Sbjct: 538 DTRNRVCPC 546


>gi|4505167|ref|NP_002401.1| alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Homo sapiens]
 gi|114580983|ref|XP_515802.2| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A isoform 2 [Pan
           troglodytes]
 gi|397504546|ref|XP_003822849.1| PREDICTED: alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A [Pan paniscus]
 gi|1169980|sp|Q09328.1|MGT5A_HUMAN RecName: Full=Alpha-1,6-mannosylglycoprotein
           6-beta-N-acetylglucosaminyltransferase A; AltName:
           Full=Alpha-mannoside
           beta-1,6-N-acetylglucosaminyltransferase; AltName:
           Full=GlcNAc-T V; Short=GNT-V; AltName: Full=Mannoside
           acetylglucosaminyltransferase 5; AltName:
           Full=N-acetylglucosaminyl-transferase V
 gi|4545222|gb|AAD22449.1|AF113921_1 alpha-1,3(6)-mannosylglycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [Homo sapiens]
 gi|870752|dbj|BAA04570.1| N-acetylglucosaminyltransferase V [Homo sapiens]
 gi|119632062|gb|EAX11657.1| mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase, isoform CRA_a
           [Homo sapiens]
 gi|119632063|gb|EAX11658.1| mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase, isoform CRA_a
           [Homo sapiens]
 gi|162317656|gb|AAI56314.1| Mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [synthetic
           construct]
 gi|162319448|gb|AAI57049.1| Mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [synthetic
           construct]
 gi|410216290|gb|JAA05364.1| mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [Pan
           troglodytes]
 gi|410265502|gb|JAA20717.1| mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [Pan
           troglodytes]
 gi|410308382|gb|JAA32791.1| mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [Pan
           troglodytes]
 gi|410332945|gb|JAA35419.1| mannosyl (alpha-1,6-)-glycoprotein
           beta-1,6-N-acetyl-glucosaminyltransferase [Pan
           troglodytes]
          Length = 741

 Score = 37.0 bits (84), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 173 LSGISVIIGRQGESCDSVCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPA 232
           LS + V +   G+SC  VC+ S   C  +    LN+ + + KY   K  C       Q +
Sbjct: 635 LSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKY---KVTC-------QSS 684

Query: 233 EVADD--APSNLNPT--ACLYTRIQSMLSCDGSHRHTRRLCPC 271
           E+A D   PS  +P    C++     + SC G+H   +R+CPC
Sbjct: 685 ELAKDILVPS-FDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPC 726


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.323    0.135    0.429 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,066,340,777
Number of Sequences: 23463169
Number of extensions: 157805885
Number of successful extensions: 407431
Number of sequences better than 100.0: 229
Number of HSP's better than 100.0 without gapping: 113
Number of HSP's successfully gapped in prelim test: 116
Number of HSP's that attempted gapping in prelim test: 406977
Number of HSP's gapped (non-prelim): 263
length of query: 272
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 132
effective length of database: 9,074,351,707
effective search space: 1197814425324
effective search space used: 1197814425324
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.5 bits)
S2: 75 (33.5 bits)