BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 023081
         (287 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|118489433|gb|ABK96519.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 291

 Score =  468 bits (1204), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 226/280 (80%), Positives = 244/280 (87%)

Query: 8   PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
           PYHV+HKLPPGDSPYVRAKHVQLVEKDP AAI LFWKAINAGDRVDSALKDMAVV+KQQD
Sbjct: 12  PYHVLHKLPPGDSPYVRAKHVQLVEKDPVAAIALFWKAINAGDRVDSALKDMAVVMKQQD 71

Query: 68  RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
           R +EA+EAIK+FR  CSKQAQESLDNVLIDLYKKCG+++EQIELLKQKLRMI+ GEAFNG
Sbjct: 72  RAEEAIEAIKAFRDRCSKQAQESLDNVLIDLYKKCGKVEEQIELLKQKLRMIHQGEAFNG 131

Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
           K TKTARSHG+KFQVTVKQETSRILGNLGWAYMQKGNY +AEVVYRKAQ IDPDANKACN
Sbjct: 132 KATKTARSHGRKFQVTVKQETSRILGNLGWAYMQKGNYLAAEVVYRKAQSIDPDANKACN 191

Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
           L  CLIKQ RY EA+ VL+D+  GKL GS E K+ NR +ELL ELE  QS    L    S
Sbjct: 192 LGLCLIKQTRYAEAQVVLDDIFRGKLLGSDEPKSRNRAQELLCELETHQSSSMFLEPSGS 251

Query: 248 SLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
           SLED F+EGLD L +QWTPYRSRRLPIFEEIS FRDQLAC
Sbjct: 252 SLEDVFVEGLDQLTSQWTPYRSRRLPIFEEISSFRDQLAC 291


>gi|224072650|ref|XP_002303823.1| predicted protein [Populus trichocarpa]
 gi|222841255|gb|EEE78802.1| predicted protein [Populus trichocarpa]
          Length = 291

 Score =  468 bits (1203), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 225/280 (80%), Positives = 243/280 (86%)

Query: 8   PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
           PYHV+HKLPPGDSPYVRAKHVQLVEKDP AAI LFWKAINAGDRVDSALKDMAVV+KQQD
Sbjct: 12  PYHVLHKLPPGDSPYVRAKHVQLVEKDPGAAIALFWKAINAGDRVDSALKDMAVVMKQQD 71

Query: 68  RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
           R +EA+EAIK+FR  CSKQAQESLDNVLIDLYKKCG ++EQIELLKQKLRMI+ GEAFNG
Sbjct: 72  RAEEAIEAIKAFRDRCSKQAQESLDNVLIDLYKKCGNIEEQIELLKQKLRMIHQGEAFNG 131

Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
           K TKTARSHG+KFQVTVKQETSRILGNLGWAYMQKGNY +AEVVYRKAQ IDPDANKACN
Sbjct: 132 KATKTARSHGRKFQVTVKQETSRILGNLGWAYMQKGNYLAAEVVYRKAQSIDPDANKACN 191

Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
           L  CLIKQ RY EA++VL+D+  GKL GS E K+ NR +ELL ELE  QS         S
Sbjct: 192 LGLCLIKQTRYAEAQAVLDDIFRGKLLGSDEPKSRNRAQELLCELETSQSSSMFSEPSGS 251

Query: 248 SLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
           SLED F+EGLD L +QWTPYRSRRLPIFEEIS FRDQLAC
Sbjct: 252 SLEDVFVEGLDQLTSQWTPYRSRRLPIFEEISSFRDQLAC 291


>gi|255554172|ref|XP_002518126.1| conserved hypothetical protein [Ricinus communis]
 gi|223542722|gb|EEF44259.1| conserved hypothetical protein [Ricinus communis]
          Length = 296

 Score =  464 bits (1195), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 226/282 (80%), Positives = 250/282 (88%), Gaps = 5/282 (1%)

Query: 8   PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
           P+HV+HKLPPGDSPYVRAKHVQLV KDPE AIVLFWKAINAGDRVDSALKDMA+V+KQQD
Sbjct: 18  PFHVIHKLPPGDSPYVRAKHVQLVHKDPEGAIVLFWKAINAGDRVDSALKDMAIVMKQQD 77

Query: 68  RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
           R +EA+EAIKSFR  CSKQAQESLDNVLIDLYKKCG+++EQIELLKQKLRMIY GEAFNG
Sbjct: 78  RAEEAIEAIKSFRDRCSKQAQESLDNVLIDLYKKCGKIEEQIELLKQKLRMIYEGEAFNG 137

Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
           K TKTARSHGKKFQVT+KQETSRILGNLGWAYMQ+G+Y +AEVVYRKAQ IDPDANKACN
Sbjct: 138 KRTKTARSHGKKFQVTIKQETSRILGNLGWAYMQQGSYLAAEVVYRKAQSIDPDANKACN 197

Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
           L  CLIKQ RYTEA SVL DVL  KLSGS + K+ +RVKELL ELE  QS   S+ T  S
Sbjct: 198 LCLCLIKQTRYTEAHSVLNDVLQSKLSGSEDPKSRSRVKELLHELETCQS---SVVTSPS 254

Query: 248 --SLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
             SLEDAF+EGL+ LM+QW+P+RS+RLPIFEEI+P RDQLAC
Sbjct: 255 GLSLEDAFVEGLEQLMSQWSPHRSKRLPIFEEITPCRDQLAC 296


>gi|225463848|ref|XP_002265262.1| PREDICTED: uncharacterized protein LOC100244941 [Vitis vinifera]
 gi|296088770|emb|CBI38220.3| unnamed protein product [Vitis vinifera]
          Length = 292

 Score =  459 bits (1181), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 219/283 (77%), Positives = 246/283 (86%), Gaps = 4/283 (1%)

Query: 5   RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
           + +PYHV+HKLPPGD+PYVRAKH+QLVEKDPEAAI LFW+AIN GDRVDSALKDMAVV+K
Sbjct: 14  KQEPYHVIHKLPPGDTPYVRAKHLQLVEKDPEAAIALFWEAINVGDRVDSALKDMAVVMK 73

Query: 65  QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
           QQ+RV+EA+EAIKSFR  CSKQAQESLDNVL+DLYKKCG+L+EQIELLKQKLRMIY GEA
Sbjct: 74  QQNRVEEAIEAIKSFRDRCSKQAQESLDNVLMDLYKKCGKLEEQIELLKQKLRMIYQGEA 133

Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
           FN KPTKTARSHG+KFQVT++QETSRILGNLGWAYMQ+ NY +AEVVYR+AQ IDPDANK
Sbjct: 134 FNRKPTKTARSHGRKFQVTIQQETSRILGNLGWAYMQQTNYATAEVVYREAQTIDPDANK 193

Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
           ACNL  CLIKQARY EARSVLEDVL GK  GS + K+ NR  ELL+E+EPWQS     +T
Sbjct: 194 ACNLGLCLIKQARYDEARSVLEDVLHGKFCGSNDPKSRNRAHELLEEVEPWQSEAVFPST 253

Query: 245 KKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
            + SLED F E    LM Q +P R+RRLPIFEEISPFRDQLAC
Sbjct: 254 SEVSLEDCFFE----LMKQSSPSRTRRLPIFEEISPFRDQLAC 292


>gi|147809817|emb|CAN73753.1| hypothetical protein VITISV_002585 [Vitis vinifera]
          Length = 292

 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 219/283 (77%), Positives = 245/283 (86%), Gaps = 4/283 (1%)

Query: 5   RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
           + +PYHV+HKLPPGD+PYVRAKH QLVEKDPEAAI LFW+AIN GDRVDSALKDMAVV+K
Sbjct: 14  KQEPYHVIHKLPPGDTPYVRAKHXQLVEKDPEAAIALFWEAINVGDRVDSALKDMAVVMK 73

Query: 65  QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
           QQ+RV+EA+EAIKSFR  CSKQAQESLDNVL+DLYKKCG+L+EQIELLKQKLRMIY GEA
Sbjct: 74  QQNRVEEAIEAIKSFRDRCSKQAQESLDNVLMDLYKKCGKLEEQIELLKQKLRMIYQGEA 133

Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
           FN KPTKTARSHG+KFQVT++QETSRILGNLGWAYMQ+ NY +AEVVYR+AQ IDPDANK
Sbjct: 134 FNRKPTKTARSHGRKFQVTIQQETSRILGNLGWAYMQQTNYATAEVVYREAQTIDPDANK 193

Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
           ACNL  CLIKQARY EARSVLEDVL GK  GS + K+ NR  ELL+E+EPWQS     +T
Sbjct: 194 ACNLGLCLIKQARYDEARSVLEDVLHGKFCGSNDPKSRNRAHELLEEVEPWQSEAVFPST 253

Query: 245 KKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
            + SLED F E    LM Q +P R+RRLPIFEEISPFRDQLAC
Sbjct: 254 SEVSLEDCFFE----LMKQSSPSRTRRLPIFEEISPFRDQLAC 292


>gi|224057581|ref|XP_002299278.1| predicted protein [Populus trichocarpa]
 gi|222846536|gb|EEE84083.1| predicted protein [Populus trichocarpa]
          Length = 292

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 219/282 (77%), Positives = 243/282 (86%), Gaps = 2/282 (0%)

Query: 8   PYH-VVHKLPP-GDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQ 65
           PYH V+HKLPP GDSPYVRAKH QLVEKDPEAAI LFWKAINAGDRVDSALKDMAVV+KQ
Sbjct: 11  PYHHVIHKLPPPGDSPYVRAKHAQLVEKDPEAAIALFWKAINAGDRVDSALKDMAVVMKQ 70

Query: 66  QDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAF 125
           QDR +EA+EA+K+FR  CSKQAQESLDNVLIDLYKKCG+++EQI+LLKQKL+MI+ GEAF
Sbjct: 71  QDRAEEAIEAVKAFRDRCSKQAQESLDNVLIDLYKKCGKIEEQIDLLKQKLKMIHQGEAF 130

Query: 126 NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           N K TKTARSHG+KFQVTVKQETSRILGNLGWAYMQ+GNY +AE VY KAQ  DPDANKA
Sbjct: 131 NRKATKTARSHGRKFQVTVKQETSRILGNLGWAYMQRGNYMAAEAVYHKAQSFDPDANKA 190

Query: 186 CNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTK 245
           CNL  CLIKQ RY EA+SV++D+L GKL GS E K+ NR +ELL+ELE  QS        
Sbjct: 191 CNLCLCLIKQTRYAEAKSVIDDILQGKLLGSDEPKSRNRAQELLRELETCQSSSMFSEPS 250

Query: 246 KSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
             SLEDAF+EGLD LM+QWTPYRSRRLPIFEEISPF DQLAC
Sbjct: 251 GLSLEDAFVEGLDQLMSQWTPYRSRRLPIFEEISPFMDQLAC 292


>gi|225464225|ref|XP_002265748.1| PREDICTED: uncharacterized protein LOC100252818 [Vitis vinifera]
 gi|297744303|emb|CBI37273.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  444 bits (1142), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 211/283 (74%), Positives = 245/283 (86%), Gaps = 5/283 (1%)

Query: 8   PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
           P+ V+HKLP GD+PYVRAKH QLVEKDPE AIVLFWKAINAGDRVDSALKDMAVV+KQ D
Sbjct: 17  PFCVLHKLPSGDTPYVRAKHAQLVEKDPEVAIVLFWKAINAGDRVDSALKDMAVVMKQLD 76

Query: 68  RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
           R +EA+EAIKSFR LCSKQAQESLDNVLIDLYKKCGR+DEQI+LLKQKLR+IY GEAFNG
Sbjct: 77  RTEEAIEAIKSFRSLCSKQAQESLDNVLIDLYKKCGRIDEQIDLLKQKLRLIYQGEAFNG 136

Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
           KPTKTARSHGKKFQV++KQETSRILGNLGWAYMQK NY +AEVVY+KAQ+IDPDANKACN
Sbjct: 137 KPTKTARSHGKKFQVSIKQETSRILGNLGWAYMQKSNYMAAEVVYKKAQMIDPDANKACN 196

Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEP-WQSIPPSLTTK- 245
           L+ CLIKQAR +EARS+L +VL G + GS + K  NR +EL+ E+EP W  +PPS   K 
Sbjct: 197 LALCLIKQARNSEARSILNEVLQGNIPGSEDCKAQNRAQELMVEVEPRW--LPPSTKMKI 254

Query: 246 -KSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
            +  LED  ++GL+ L+N+W P R++RLPIFEEIS +R+QLAC
Sbjct: 255 ARFDLEDDLIDGLEKLLNEWAPSRTKRLPIFEEISSYRNQLAC 297


>gi|147814975|emb|CAN74743.1| hypothetical protein VITISV_025585 [Vitis vinifera]
          Length = 297

 Score =  444 bits (1142), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 211/283 (74%), Positives = 245/283 (86%), Gaps = 5/283 (1%)

Query: 8   PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
           P+ V+HKLP GD+PYVRAKH QLVEKDPE AIVLFWKAINAGDRVDSALKDMAVV+KQ D
Sbjct: 17  PFCVLHKLPSGDTPYVRAKHAQLVEKDPEVAIVLFWKAINAGDRVDSALKDMAVVMKQLD 76

Query: 68  RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
           R +EA+EAIKSFR LCSKQAQESLDNVLIDLYKKCGR+DEQI+LLKQKLR+IY GEAFNG
Sbjct: 77  RTEEAIEAIKSFRSLCSKQAQESLDNVLIDLYKKCGRIDEQIDLLKQKLRLIYQGEAFNG 136

Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
           KPTKTARSHGKKFQV++KQETSRILGNLGWAYMQK NY +AEVVY+KAQ+IDPDANKACN
Sbjct: 137 KPTKTARSHGKKFQVSIKQETSRILGNLGWAYMQKSNYMAAEVVYKKAQMIDPDANKACN 196

Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEP-WQSIPPSLTTK- 245
           L+ CLIKQAR +EARS+L +VL G + GS + K  NR +EL+ E+EP W  +PPS   K 
Sbjct: 197 LALCLIKQARNSEARSILNEVLQGNIPGSEDCKAQNRAQELMLEVEPRW--LPPSTKMKI 254

Query: 246 -KSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
            +  LED  ++GL+ L+N+W P R++RLPIFEEIS +R+QLAC
Sbjct: 255 ARFDLEDDLIDGLEKLLNEWAPSRTKRLPIFEEISSYRNQLAC 297


>gi|356567884|ref|XP_003552145.1| PREDICTED: uncharacterized protein LOC100805188 [Glycine max]
          Length = 296

 Score =  430 bits (1105), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 206/283 (72%), Positives = 239/283 (84%), Gaps = 2/283 (0%)

Query: 5   RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
           +D  YHV+HK+P GDSPYVRAKH QLVEKDPEAAIVLFWKAIN+GD+VDSALKDMAVV+K
Sbjct: 16  KDDLYHVLHKVPYGDSPYVRAKHAQLVEKDPEAAIVLFWKAINSGDKVDSALKDMAVVMK 75

Query: 65  QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
           Q DR DEA+EAI+SFR LCSKQ+QESLDNVLIDLYKKCG++DEQIE+LK+KL++IY GEA
Sbjct: 76  QLDRSDEAIEAIRSFRSLCSKQSQESLDNVLIDLYKKCGKIDEQIEMLKRKLKLIYQGEA 135

Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
           FNGK TKTARSHGKKFQV++KQETSR+LGNLGWAYMQK NY  AEVVYRKAQ+IDPD NK
Sbjct: 136 FNGKLTKTARSHGKKFQVSIKQETSRLLGNLGWAYMQKMNYMMAEVVYRKAQIIDPDCNK 195

Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
           ACNL  CLI+QARY EA+ VLEDVL G L GS ++K   R ++L  EL     +PP   +
Sbjct: 196 ACNLGLCLIRQARYEEAQLVLEDVLKGNLPGSDDSKARKRAQDLRTELRSM--LPPPHFS 253

Query: 245 KKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
               L+D F++GL+ LMN+W P RS+RLPIFEEIS FRDQLAC
Sbjct: 254 DLLGLDDEFIKGLEQLMNEWGPIRSKRLPIFEEISSFRDQLAC 296


>gi|449432444|ref|XP_004134009.1| PREDICTED: uncharacterized protein LOC101216846 [Cucumis sativus]
 gi|449487532|ref|XP_004157673.1| PREDICTED: uncharacterized protein LOC101223392 [Cucumis sativus]
          Length = 298

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 212/291 (72%), Positives = 242/291 (83%), Gaps = 8/291 (2%)

Query: 3   QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVV 62
           Q  + P+HVVHKLP GDSPYVRAKHVQLV+KDPEAAIVLFWKAINAGDRVDSALKDMAVV
Sbjct: 10  QNLETPFHVVHKLPAGDSPYVRAKHVQLVQKDPEAAIVLFWKAINAGDRVDSALKDMAVV 69

Query: 63  LKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHG 122
           +KQQDR +EA+EAI SFR  CSKQAQESLDNVLIDLYKKCGR++EQI+LLKQKLRMI  G
Sbjct: 70  MKQQDRAEEAIEAINSFRDRCSKQAQESLDNVLIDLYKKCGRVEEQIDLLKQKLRMINQG 129

Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
           EAFNGK TKTARSHGKKFQVT++QETSRILGNLGWAYMQ+ N+ +AEVVY+KAQ+IDPDA
Sbjct: 130 EAFNGKATKTARSHGKKFQVTIRQETSRILGNLGWAYMQQENHKAAEVVYQKAQIIDPDA 189

Query: 183 NKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSL 242
           NKACNLS CL+KQARY+EAR+VLE VL  K+ GS + K+  R +EL++ELE  +S    L
Sbjct: 190 NKACNLSLCLMKQARYSEARAVLEQVLHDKVGGSNDQKSRKRAEELMKELEEAESANKLL 249

Query: 243 TTKKSS-----LEDAFLEGLDDLMNQWTPYR-SRRLPIFEEISPFRDQLAC 287
               SS      +D F+  L  + NQ +P R SRRLPIFEEIS FRDQLAC
Sbjct: 250 MMGLSSGGSEDYDDGFINQL--VTNQRSPLRSSRRLPIFEEISQFRDQLAC 298


>gi|225464227|ref|XP_002265934.1| PREDICTED: uncharacterized protein LOC100254562 [Vitis vinifera]
 gi|297744304|emb|CBI37274.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  427 bits (1099), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 201/279 (72%), Positives = 239/279 (85%), Gaps = 1/279 (0%)

Query: 8   PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
           P+ V+HK+P GD+PYVRAKH QLVEKDPE AIVLFWKAINAGDR+DSALKDMAVV+KQ +
Sbjct: 17  PFCVIHKVPSGDTPYVRAKHAQLVEKDPETAIVLFWKAINAGDRIDSALKDMAVVMKQLN 76

Query: 68  RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
           R +EA+EAIKSFR LCSKQAQE LDNVLIDLYKKCGR+D+QI+LLKQKLR+IY GEAFNG
Sbjct: 77  RTEEAIEAIKSFRGLCSKQAQEPLDNVLIDLYKKCGRMDQQIDLLKQKLRLIYQGEAFNG 136

Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
           KPTKTARSHGKKFQV++KQETSRILGNLGWAYMQK  + +AEVVY+KAQ+IDPDANKACN
Sbjct: 137 KPTKTARSHGKKFQVSIKQETSRILGNLGWAYMQKSKFIAAEVVYKKAQMIDPDANKACN 196

Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEP-WQSIPPSLTTKK 246
           L+ CLIKQAR  EA  VL +VL GKL GS + +T NR +EL+ ELEP W   P ++   +
Sbjct: 197 LARCLIKQARNVEAHLVLNEVLQGKLLGSKDCQTQNRAQELMLELEPKWLPPPETVKITR 256

Query: 247 SSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQL 285
             LED F++ L+ L+N+W+P R++RLPIFEEIS +R+QL
Sbjct: 257 FDLEDDFIDELEKLLNEWSPLRTKRLPIFEEISSYRNQL 295


>gi|147783587|emb|CAN61444.1| hypothetical protein VITISV_044239 [Vitis vinifera]
          Length = 297

 Score =  426 bits (1095), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 202/279 (72%), Positives = 237/279 (84%), Gaps = 1/279 (0%)

Query: 8   PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
           P+ V+HK+P GD+PYVRAKH QLVEKDPE AIVLFWKAINAGDRVDSALKDMAVV+KQ +
Sbjct: 17  PFCVIHKVPSGDTPYVRAKHAQLVEKDPETAIVLFWKAINAGDRVDSALKDMAVVMKQLN 76

Query: 68  RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
           R +EA+EAIKSFR LCSKQAQE LDNVLIDLYKKCGR+D+QI+LLKQKLR+IY GEAFNG
Sbjct: 77  RTEEAIEAIKSFRGLCSKQAQEPLDNVLIDLYKKCGRMDQQIDLLKQKLRLIYQGEAFNG 136

Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
           KPTKTARSHGKKFQV++KQETSRILGNLGWAYMQK  + +AEVVY+KAQ+IDPDANKACN
Sbjct: 137 KPTKTARSHGKKFQVSIKQETSRILGNLGWAYMQKSKFMAAEVVYKKAQMIDPDANKACN 196

Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEP-WQSIPPSLTTKK 246
           L+ CLIKQAR  EA  VL +VL GKL GS +  T NR +EL+ ELEP W   P ++    
Sbjct: 197 LARCLIKQARNVEAHLVLNEVLQGKLLGSEDCXTQNRAQELMLELEPKWLPPPETVKITG 256

Query: 247 SSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQL 285
             LED F++ L+ L+N+W+P R++RLPIFEEIS +R+QL
Sbjct: 257 FDLEDDFIDELEKLLNEWSPLRTKRLPIFEEISSYRNQL 295


>gi|115464761|ref|NP_001055980.1| Os05g0506000 [Oryza sativa Japonica Group]
 gi|52353366|gb|AAU43934.1| unknown protein [Oryza sativa Japonica Group]
 gi|113579531|dbj|BAF17894.1| Os05g0506000 [Oryza sativa Japonica Group]
 gi|215692779|dbj|BAG88193.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215704367|dbj|BAG93801.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 299

 Score =  423 bits (1088), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 196/279 (70%), Positives = 239/279 (85%), Gaps = 1/279 (0%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HVVHK+P GDSPYVRAKH+QLV+KDPE AIV FWKAIN+ D+VDSALKDMAVV+KQQDR
Sbjct: 22  FHVVHKVPAGDSPYVRAKHLQLVDKDPETAIVWFWKAINSRDKVDSALKDMAVVMKQQDR 81

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
             EA+EAI+SFRHLCS+QAQESLDN+LIDLYKKCG++DEQI+LLKQKL+MIY GEAFNGK
Sbjct: 82  AKEAIEAIRSFRHLCSRQAQESLDNLLIDLYKKCGKVDEQIDLLKQKLKMIYLGEAFNGK 141

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TKTARSHGKKFQV+++QETSRILGNLGWAYMQ+ NY++AE+VYRKAQ I+PDAN+ACNL
Sbjct: 142 ATKTARSHGKKFQVSIQQETSRILGNLGWAYMQQSNYSAAELVYRKAQSIEPDANRACNL 201

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
             CLIKQ+R+ EAR VL DV+L ++SGS + K + R K+LL ELEP   +  S      S
Sbjct: 202 GLCLIKQSRHDEARQVLHDVVLRRISGSEDDKVVARAKQLLHELEPVTHV-TSPNNAGLS 260

Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
           + +  +E LD ++N+WTP+RSRRLP+FEEI+  RDQ+AC
Sbjct: 261 VSEEIMERLDLVLNEWTPFRSRRLPVFEEIATLRDQIAC 299


>gi|388491310|gb|AFK33721.1| unknown [Lotus japonicus]
          Length = 301

 Score =  423 bits (1087), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 200/283 (70%), Positives = 236/283 (83%), Gaps = 2/283 (0%)

Query: 5   RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
           +D  YHV+HK+P GD+PYVRAKH QLVEKDPE AI LFW+AIN GD+VDSALKDMAVV+K
Sbjct: 21  KDDSYHVLHKVPYGDTPYVRAKHAQLVEKDPEGAIALFWRAINTGDKVDSALKDMAVVMK 80

Query: 65  QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
           Q DR +EA+EAI SFR LCSKQ+QESLDNVLIDLYKKCG++DEQIELLK+KL++IY GEA
Sbjct: 81  QLDRSEEAIEAICSFRGLCSKQSQESLDNVLIDLYKKCGKIDEQIELLKRKLKLIYEGEA 140

Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
           FNGK TKTARSHGKKFQV++KQETSR+LGNLGWAYMQK NY  AE VYRKAQ+IDPD NK
Sbjct: 141 FNGKLTKTARSHGKKFQVSIKQETSRLLGNLGWAYMQKMNYVMAEAVYRKAQMIDPDCNK 200

Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
           ACNL  CLIKQARY EA++++EDVL G   GS ++K+  R ++LL EL     +PP   +
Sbjct: 201 ACNLGLCLIKQARYEEAQTIIEDVLRGNYPGSDDSKSKKRAQDLLTELRSM--LPPPHFS 258

Query: 245 KKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
               L+D F++GL+ L+N+W P RSRRLPIFEEIS FRDQLAC
Sbjct: 259 DLLGLDDEFIKGLEQLINEWGPIRSRRLPIFEEISSFRDQLAC 301


>gi|3859116|gb|AAC72543.1| unknown [Oryza sativa Japonica Group]
          Length = 321

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 195/279 (69%), Positives = 239/279 (85%), Gaps = 1/279 (0%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HVVHK+P G+SPYVRAKH+QLV+KDPE AIV FWKAIN+ D+VDSALKDMAVV+KQQDR
Sbjct: 44  FHVVHKVPAGNSPYVRAKHLQLVDKDPETAIVWFWKAINSRDKVDSALKDMAVVMKQQDR 103

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
             EA+EAI+SFRHLCS+QAQESLDN+LIDLYKKCG++DEQI+LLKQKL+MIY GEAFNGK
Sbjct: 104 AKEAIEAIRSFRHLCSRQAQESLDNLLIDLYKKCGKVDEQIDLLKQKLKMIYLGEAFNGK 163

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TKTARSHGKKFQV+++QETSRILGNLGWAYMQ+ NY++AE+VYRKAQ I+PDAN+ACNL
Sbjct: 164 ATKTARSHGKKFQVSIQQETSRILGNLGWAYMQQSNYSAAELVYRKAQSIEPDANRACNL 223

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
             CLIKQ+R+ EAR VL DV+L ++SGS + K + R K+LL ELEP   +  S      S
Sbjct: 224 GLCLIKQSRHDEARQVLHDVVLRRISGSEDDKVVARAKQLLHELEPVTHV-TSPNNAGLS 282

Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
           + +  +E LD ++N+WTP+RSRRLP+FEEI+  RDQ+AC
Sbjct: 283 VSEEIMERLDLVLNEWTPFRSRRLPVFEEIATLRDQIAC 321


>gi|224065324|ref|XP_002301774.1| predicted protein [Populus trichocarpa]
 gi|222843500|gb|EEE81047.1| predicted protein [Populus trichocarpa]
          Length = 309

 Score =  417 bits (1073), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 200/282 (70%), Positives = 239/282 (84%), Gaps = 4/282 (1%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HV+HK+P GD PYV+AKH QLV+KDPEAAIV FWKAINAGD+VDSALKDMAVV+KQ DR
Sbjct: 29  FHVIHKVPAGDGPYVKAKHAQLVQKDPEAAIVWFWKAINAGDKVDSALKDMAVVMKQIDR 88

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            +EA+EA+KSFR  CSKQAQESLDNVLIDLYKKCG+++EQIEL+K+KLR+IY GEAFNGK
Sbjct: 89  TEEAIEAVKSFRGRCSKQAQESLDNVLIDLYKKCGKVEEQIELIKRKLRLIYQGEAFNGK 148

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
           PTKTARSHGKKFQV+VKQETSR+LGNLGWAYMQK N+ +AEVVY+KAQ+IDPDANKACNL
Sbjct: 149 PTKTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNFIAAEVVYQKAQMIDPDANKACNL 208

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS- 247
           + CLIKQARY EARSVL DV  G+L GS + K+ +R +ELL E++  ++ P  LT     
Sbjct: 209 ALCLIKQARYDEARSVLLDVSQGRLPGSGDVKSRSRAEELLMEVKSRET-PDELTDMLGF 267

Query: 248 --SLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
               +D F++GL+ LM++W P RS+RLPIFEEIS FRD L C
Sbjct: 268 NLDEDDDFVKGLEKLMSEWAPSRSKRLPIFEEISSFRDPLTC 309


>gi|357505367|ref|XP_003622972.1| Male sterility 5 family protein [Medicago truncatula]
 gi|355497987|gb|AES79190.1| Male sterility 5 family protein [Medicago truncatula]
          Length = 300

 Score =  417 bits (1071), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 197/279 (70%), Positives = 235/279 (84%), Gaps = 2/279 (0%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           YHV++K+P GDSPYVRAKH QLVEKDPEAAIVLFWKAINAGD+VDSALKDMAVV+KQ DR
Sbjct: 24  YHVLYKVPYGDSPYVRAKHAQLVEKDPEAAIVLFWKAINAGDKVDSALKDMAVVMKQLDR 83

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            +EA+EAI SFR LCSKQ+QESLDNVLIDLYKKCG++DE+I+LLKQKL++IY GEAFNGK
Sbjct: 84  SEEAIEAISSFRGLCSKQSQESLDNVLIDLYKKCGKIDEEIDLLKQKLKLIYQGEAFNGK 143

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TKTARSHGKKFQV++KQETSR+LGNLGWAYMQK NY  AE VYRKAQ+IDPD NKACNL
Sbjct: 144 LTKTARSHGKKFQVSIKQETSRLLGNLGWAYMQKMNYVMAEAVYRKAQMIDPDCNKACNL 203

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
             CLI+QARY EA+ +++D+L G+L GS + K+  R ++LL+EL      P  L      
Sbjct: 204 GLCLIRQARYEEAQLIIDDILKGELPGSDDIKSKKRAQDLLEELRSLLPTPCPLDI--LV 261

Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
           ++D F++G++ LMN+W P RS+RLPIFEEIS  RDQLAC
Sbjct: 262 MDDEFIKGIEQLMNEWGPVRSKRLPIFEEISSCRDQLAC 300


>gi|242088403|ref|XP_002440034.1| hypothetical protein SORBIDRAFT_09g024870 [Sorghum bicolor]
 gi|241945319|gb|EES18464.1| hypothetical protein SORBIDRAFT_09g024870 [Sorghum bicolor]
          Length = 298

 Score =  414 bits (1064), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 193/279 (69%), Positives = 234/279 (83%), Gaps = 1/279 (0%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HVVHK+P GDSPYV AKH+QLVEK P+ AIV FWKAIN+GDRVDSALKDMAVV+KQQDR
Sbjct: 21  FHVVHKVPAGDSPYVVAKHLQLVEKQPDMAIVWFWKAINSGDRVDSALKDMAVVMKQQDR 80

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            +EA+EAI+SFRHLCSKQAQESLDN+LIDLYKKCG+++EQIELLKQKL+MIY GEAFNGK
Sbjct: 81  SEEAIEAIRSFRHLCSKQAQESLDNLLIDLYKKCGKVEEQIELLKQKLKMIYLGEAFNGK 140

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TK ARSHGKKFQV+++QETSRILGNLGWAYMQ+ N+ +AE+VYRKAQ I+PDAN+ACNL
Sbjct: 141 ATKKARSHGKKFQVSIQQETSRILGNLGWAYMQQNNFEAAELVYRKAQTIEPDANRACNL 200

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
             CLIKQ R+ EAR  LEDV L ++ GS + K + R ++LL+EL P + +        S 
Sbjct: 201 GLCLIKQGRHEEARQALEDVRLRRIYGSEDEKVVARAEQLLRELNPLKCVSSPFEVGLSV 260

Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
            E+  +  LD +MN+WTP+RSRRLP+FEEI+ FRDQ+AC
Sbjct: 261 HEE-IMGKLDLMMNEWTPFRSRRLPVFEEIATFRDQMAC 298


>gi|356567886|ref|XP_003552146.1| PREDICTED: uncharacterized protein LOC100805731 [Glycine max]
          Length = 296

 Score =  414 bits (1064), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 194/283 (68%), Positives = 236/283 (83%), Gaps = 2/283 (0%)

Query: 5   RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
           +D  YHV+HK+P GDSPYV+AKH QLV+KDPEAAIVLFWKAINAGD+VDSALKDMAVV+K
Sbjct: 16  KDDLYHVIHKVPYGDSPYVKAKHAQLVDKDPEAAIVLFWKAINAGDKVDSALKDMAVVMK 75

Query: 65  QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
           Q DR +EA+EAIKSFR LCSK +QESLDNVL+DLYKKCG+++EQIELLK+KLR+IY GEA
Sbjct: 76  QLDRSEEAIEAIKSFRGLCSKHSQESLDNVLLDLYKKCGKIEEQIELLKRKLRLIYQGEA 135

Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
           FNG+ T+TARSHGKKFQV++KQET+R+LGNLGWAYMQK NY  AEVV++KAQ++D DANK
Sbjct: 136 FNGRTTRTARSHGKKFQVSIKQETARLLGNLGWAYMQKENYMMAEVVFKKAQMVDADANK 195

Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
           ACNL  CL++Q RY EA  +LE+VL GK  GS E K+  R +ELL+EL    ++P     
Sbjct: 196 ACNLGLCLMRQCRYEEAYYILEEVLQGKHQGSDEIKSRKRAEELLEELNA--NLPQPEFM 253

Query: 245 KKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
               L+D F++G+DD+++ W   RSRRLPIFEEIS FRDQLAC
Sbjct: 254 ADLDLDDDFVKGIDDMLDAWNTNRSRRLPIFEEISSFRDQLAC 296


>gi|357133078|ref|XP_003568155.1| PREDICTED: uncharacterized protein LOC100821103 [Brachypodium
           distachyon]
          Length = 293

 Score =  413 bits (1062), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 194/279 (69%), Positives = 234/279 (83%), Gaps = 4/279 (1%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HVVHK+P GDSPYVRAKH+QLVEKD EAAIV FWKAIN+GDRVDSALKDMAVV+KQQDR
Sbjct: 19  FHVVHKVPAGDSPYVRAKHLQLVEKDAEAAIVWFWKAINSGDRVDSALKDMAVVMKQQDR 78

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            +EA+EAI+SFRHLCS+ AQESLDN+LIDLYKKCG+++EQIELLKQKL+MI+ GEAFNGK
Sbjct: 79  AEEAIEAIRSFRHLCSRHAQESLDNLLIDLYKKCGKVEEQIELLKQKLKMIHLGEAFNGK 138

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TKTARSHGKKFQV+++QE SRILGNLGWAYMQ+ NY +AE+VYRKAQ I PDAN+ACNL
Sbjct: 139 ITKTARSHGKKFQVSIQQEMSRILGNLGWAYMQQSNYEAAELVYRKAQTIVPDANRACNL 198

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
             CLIKQ R  EA  +L+DVLLG++SG  + + + R ++LL EL P  S P  +     S
Sbjct: 199 GLCLIKQGRNDEATRILQDVLLGRISGLDDNRAVARAEQLLSELGPMTS-PFDVDL---S 254

Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
             +  +E LD +MN+WTP+RSRRLP+FEE+  FRDQ+AC
Sbjct: 255 FTEEIMERLDLVMNEWTPFRSRRLPVFEELDAFRDQMAC 293


>gi|255560193|ref|XP_002521114.1| conserved hypothetical protein [Ricinus communis]
 gi|223539683|gb|EEF41265.1| conserved hypothetical protein [Ricinus communis]
          Length = 291

 Score =  413 bits (1061), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 193/283 (68%), Positives = 236/283 (83%), Gaps = 6/283 (2%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HV++K+P GD PYV+AKH QLV+KDPEAAIV FWKAINAGDRVDSALKDMAVV+KQ DR
Sbjct: 11  FHVIYKVPSGDGPYVKAKHAQLVQKDPEAAIVWFWKAINAGDRVDSALKDMAVVMKQVDR 70

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            +EA+EAIKSFR  CS+ AQESLDNVLIDLYKKCG+++EQI+LLK+KLR+IY GEAFNGK
Sbjct: 71  TEEAIEAIKSFRGRCSRNAQESLDNVLIDLYKKCGKVEEQIDLLKRKLRLIYQGEAFNGK 130

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
           PTKTARSHGKKFQV+V+QETSR+LGNLGWAYMQK N+ +AEVVY+KAQ+IDPDANKA NL
Sbjct: 131 PTKTARSHGKKFQVSVEQETSRLLGNLGWAYMQKSNFMAAEVVYKKAQMIDPDANKAYNL 190

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
             CLI+QARY EAR +L++VL G+  GS + K+  R +ELL E+E    +PP   T +  
Sbjct: 191 GFCLIRQARYDEARQILQNVLEGRFPGSNDCKSRKRAQELLMEME--SKLPPPELTNRIG 248

Query: 249 L----EDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
           +    +D F++G++ +MN+W P R +RLPIFEEIS  RDQLAC
Sbjct: 249 INVDGDDDFVKGIEQMMNKWAPSRPKRLPIFEEISSLRDQLAC 291


>gi|195640432|gb|ACG39684.1| pollenless3 [Zea mays]
          Length = 298

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 191/279 (68%), Positives = 230/279 (82%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HVVHK+P GDSPYV AKH+QLVEK P+ AIV FWKAIN+GDRVDSALKDMA+V+KQQDR
Sbjct: 20  FHVVHKVPAGDSPYVVAKHLQLVEKQPDVAIVWFWKAINSGDRVDSALKDMAMVMKQQDR 79

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            +EA+EAI+SFRHLCSKQAQESLDN+LIDLYKKCG+++EQIELLKQKL+ IY GEAFNGK
Sbjct: 80  SEEAIEAIRSFRHLCSKQAQESLDNLLIDLYKKCGKVEEQIELLKQKLKSIYLGEAFNGK 139

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TK ARSHGKKFQV+++QETSRILGNLGWAYMQ+ N+ +AE+VYRKAQ I+PDAN+ACNL
Sbjct: 140 ATKKARSHGKKFQVSIQQETSRILGNLGWAYMQQNNFEAAELVYRKAQAIEPDANRACNL 199

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
             CLIKQ R+ EAR  LEDV L ++ GS + K + R ++LL+EL P Q +        S 
Sbjct: 200 GLCLIKQGRHEEARQALEDVRLRRIYGSEDAKVVARAEQLLRELNPLQCVSSPFQVGLSV 259

Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
            E    E    +MN+WTP+RSRRLP+FEEI+ FRDQ+AC
Sbjct: 260 HEGIMGEPDLVVMNEWTPFRSRRLPVFEEIATFRDQMAC 298


>gi|356524529|ref|XP_003530881.1| PREDICTED: uncharacterized protein LOC100788542 [Glycine max]
          Length = 297

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 193/284 (67%), Positives = 237/284 (83%), Gaps = 3/284 (1%)

Query: 5   RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
           +D  YHV+HK+P GDSPYV+AKH QLV+KDPEAAIVLFWKAIN GD+VDSALKDMAVV+K
Sbjct: 16  KDDLYHVIHKVPYGDSPYVKAKHAQLVDKDPEAAIVLFWKAINCGDKVDSALKDMAVVMK 75

Query: 65  QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
           Q DR +EA+EAIKSFR LCSK +QESLDNVL+DLYKKCG+++EQIELLK+KLR+IY GEA
Sbjct: 76  QLDRSEEAIEAIKSFRGLCSKHSQESLDNVLLDLYKKCGKIEEQIELLKRKLRLIYQGEA 135

Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
           FNG+ T+TARSHGKKFQV++KQET+R+LGNLGWAYMQK NY  AEVV++KAQ++D DANK
Sbjct: 136 FNGRTTRTARSHGKKFQVSIKQETARLLGNLGWAYMQKENYMMAEVVFKKAQMVDADANK 195

Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
           ACNL  CL++Q+RY EA  +LE+VL+G + GS E K+  R +ELL+EL    ++P     
Sbjct: 196 ACNLGLCLMRQSRYKEAYYILEEVLMGIIPGSDEIKSRKRAEELLEELNA--NLPQPEFM 253

Query: 245 KKSSLEDAFLEGLDDLMNQW-TPYRSRRLPIFEEISPFRDQLAC 287
               L+D F++G+DDL++ W T  R RRLPIFEEIS FRDQLAC
Sbjct: 254 DALGLDDDFVKGIDDLLDAWNTNNRPRRLPIFEEISSFRDQLAC 297


>gi|225464230|ref|XP_002266290.1| PREDICTED: uncharacterized protein LOC100263213 [Vitis vinifera]
          Length = 293

 Score =  407 bits (1045), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 196/284 (69%), Positives = 236/284 (83%), Gaps = 6/284 (2%)

Query: 8   PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
           P+ V+HKLP GD+PY RAK V +VEKDPEAAIVLFW AINA DRV+SALKDMAVV+K+ D
Sbjct: 12  PFCVIHKLPSGDAPYARAKRVLMVEKDPEAAIVLFWNAINARDRVESALKDMAVVMKRLD 71

Query: 68  RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
           R +EA+EAIKSFR+LC KQAQESLDN+LIDLYKKCGR +E I++LKQKLR+IY GEAFNG
Sbjct: 72  RSEEAIEAIKSFRNLCPKQAQESLDNILIDLYKKCGRTNEHIDMLKQKLRLIYQGEAFNG 131

Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
           KPT+TARS+ KKFQ+T++QETSRILGNLGW YMQK NY +AEVVY+KAQ+IDPDANKACN
Sbjct: 132 KPTRTARSNRKKFQITIEQETSRILGNLGWVYMQKSNYMAAEVVYKKAQMIDPDANKACN 191

Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEP-WQSIPPSLTTKK 246
           L+ CLI Q RYTEA SVL++VL GKL  S + K  NR +EL+ E+EP W  +PPS T K 
Sbjct: 192 LALCLINQGRYTEAHSVLKEVLQGKLPNSEDCKAQNRAQELMLEVEPKW--LPPSETIKL 249

Query: 247 SS--LEDAFLEGLDDLMNQWTPYRSRRLPIF-EEISPFRDQLAC 287
               LED F +G + ++N W P+R++RLPIF EEIS +R+QLAC
Sbjct: 250 EGFDLEDDFSDGFEKVLNIWAPFRTKRLPIFVEEISSYRNQLAC 293


>gi|226533206|ref|NP_001141923.1| uncharacterized protein LOC100274072 [Zea mays]
 gi|194706464|gb|ACF87316.1| unknown [Zea mays]
 gi|195640848|gb|ACG39892.1| pollenless3 [Zea mays]
 gi|413945947|gb|AFW78596.1| pollenless3 [Zea mays]
          Length = 298

 Score =  406 bits (1044), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 193/281 (68%), Positives = 232/281 (82%), Gaps = 4/281 (1%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HVVHK+P GDSPYV AKH+QLVEK P+ AIV FWKAIN+GDRVDSALKDMA+V+KQQDR
Sbjct: 20  FHVVHKVPAGDSPYVVAKHLQLVEKQPDVAIVWFWKAINSGDRVDSALKDMAMVMKQQDR 79

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            +EA+EAI+SFRHLCSKQAQESLDN+LIDLYKKCG+++EQIELLKQKL+ IY GEAFNGK
Sbjct: 80  SEEAIEAIRSFRHLCSKQAQESLDNLLIDLYKKCGKVEEQIELLKQKLKSIYLGEAFNGK 139

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TK ARSHGKKFQV+++QETSRILGNLGWAYMQ+ N+ +AE+VYRKAQ I+PDAN+ACNL
Sbjct: 140 ATKKARSHGKKFQVSIQQETSRILGNLGWAYMQQNNFEAAELVYRKAQAIEPDANRACNL 199

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
             CLIKQ R+ EAR  LEDV L ++ GS + K + R ++LL+EL P Q +        S 
Sbjct: 200 GLCLIKQGRHDEARQALEDVRLRRIYGSEDGKVVARAEQLLRELNPLQCVSSPFQVGLSV 259

Query: 249 LEDAFLEGLDDL--MNQWTPYRSRRLPIFEEISPFRDQLAC 287
            E   + G  DL  MN+WTP+RSRRLP+FEEI+ FRDQ+AC
Sbjct: 260 HEG--IMGKPDLVVMNEWTPFRSRRLPVFEEIATFRDQMAC 298


>gi|297744306|emb|CBI37276.3| unnamed protein product [Vitis vinifera]
          Length = 296

 Score =  406 bits (1043), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 196/284 (69%), Positives = 236/284 (83%), Gaps = 6/284 (2%)

Query: 8   PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
           P+ V+HKLP GD+PY RAK V +VEKDPEAAIVLFW AINA DRV+SALKDMAVV+K+ D
Sbjct: 15  PFCVIHKLPSGDAPYARAKRVLMVEKDPEAAIVLFWNAINARDRVESALKDMAVVMKRLD 74

Query: 68  RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
           R +EA+EAIKSFR+LC KQAQESLDN+LIDLYKKCGR +E I++LKQKLR+IY GEAFNG
Sbjct: 75  RSEEAIEAIKSFRNLCPKQAQESLDNILIDLYKKCGRTNEHIDMLKQKLRLIYQGEAFNG 134

Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
           KPT+TARS+ KKFQ+T++QETSRILGNLGW YMQK NY +AEVVY+KAQ+IDPDANKACN
Sbjct: 135 KPTRTARSNRKKFQITIEQETSRILGNLGWVYMQKSNYMAAEVVYKKAQMIDPDANKACN 194

Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEP-WQSIPPSLTTKK 246
           L+ CLI Q RYTEA SVL++VL GKL  S + K  NR +EL+ E+EP W  +PPS T K 
Sbjct: 195 LALCLINQGRYTEAHSVLKEVLQGKLPNSEDCKAQNRAQELMLEVEPKW--LPPSETIKL 252

Query: 247 SS--LEDAFLEGLDDLMNQWTPYRSRRLPIF-EEISPFRDQLAC 287
               LED F +G + ++N W P+R++RLPIF EEIS +R+QLAC
Sbjct: 253 EGFDLEDDFSDGFEKVLNIWAPFRTKRLPIFVEEISSYRNQLAC 296


>gi|30678993|ref|NP_171969.2| male sterility MS5 family protein [Arabidopsis thaliana]
 gi|22655080|gb|AAM98131.1| unknown protein [Arabidopsis thaliana]
 gi|30984518|gb|AAP42722.1| At1g04770 [Arabidopsis thaliana]
 gi|332189620|gb|AEE27741.1| male sterility MS5 family protein [Arabidopsis thaliana]
          Length = 303

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 198/284 (69%), Positives = 235/284 (82%), Gaps = 5/284 (1%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           Y+VVHKLP GDSPYVRAKHVQLVEKD EAAI LFW AI A DRVDSALKDMA+++KQQ+R
Sbjct: 20  YNVVHKLPHGDSPYVRAKHVQLVEKDAEAAIELFWIAIKARDRVDSALKDMALLMKQQNR 79

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            +EA++AI+SFR LCS+QAQESLDNVLIDLYKKCGR++EQ+ELLKQKL MIY GEAFNGK
Sbjct: 80  AEEAIDAIQSFRDLCSRQAQESLDNVLIDLYKKCGRIEEQVELLKQKLWMIYQGEAFNGK 139

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
           PTKTARSHGKKFQVTV++ETSRILGNLGWAYMQ  +YT+AE VYRKAQLI+PDANKACNL
Sbjct: 140 PTKTARSHGKKFQVTVEKETSRILGNLGWAYMQLMDYTAAEAVYRKAQLIEPDANKACNL 199

Query: 189 SHCLIKQARYTEARSVL-EDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
             CLIKQ ++ EARS+L  DVL+    GS + + + RV+ELL EL+P +    +  + + 
Sbjct: 200 CTCLIKQGKHDEARSILFRDVLMENKEGSGDPRLMARVQELLSELKPQEEEAAASVSVEC 259

Query: 248 SL---EDAFLEGLDDLMNQW-TPYRSRRLPIFEEISPFRDQLAC 287
            +   E A +EGLD+ + +W  PYR+RRLPIFEEI P RDQLAC
Sbjct: 260 EVGIDEIAVVEGLDEFVKEWRRPYRTRRLPIFEEILPLRDQLAC 303


>gi|7211988|gb|AAF40459.1|AC004809_17 Strong similarity to the beta-9 tubulin gene (178 gene product)
           gb|AF060248 [Arabidopsis thaliana]
          Length = 364

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 198/284 (69%), Positives = 235/284 (82%), Gaps = 5/284 (1%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           Y+VVHKLP GDSPYVRAKHVQLVEKD EAAI LFW AI A DRVDSALKDMA+++KQQ+R
Sbjct: 81  YNVVHKLPHGDSPYVRAKHVQLVEKDAEAAIELFWIAIKARDRVDSALKDMALLMKQQNR 140

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            +EA++AI+SFR LCS+QAQESLDNVLIDLYKKCGR++EQ+ELLKQKL MIY GEAFNGK
Sbjct: 141 AEEAIDAIQSFRDLCSRQAQESLDNVLIDLYKKCGRIEEQVELLKQKLWMIYQGEAFNGK 200

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
           PTKTARSHGKKFQVTV++ETSRILGNLGWAYMQ  +YT+AE VYRKAQLI+PDANKACNL
Sbjct: 201 PTKTARSHGKKFQVTVEKETSRILGNLGWAYMQLMDYTAAEAVYRKAQLIEPDANKACNL 260

Query: 189 SHCLIKQARYTEARSVL-EDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
             CLIKQ ++ EARS+L  DVL+    GS + + + RV+ELL EL+P +    +  + + 
Sbjct: 261 CTCLIKQGKHDEARSILFRDVLMENKEGSGDPRLMARVQELLSELKPQEEEAAASVSVEC 320

Query: 248 SL---EDAFLEGLDDLMNQW-TPYRSRRLPIFEEISPFRDQLAC 287
            +   E A +EGLD+ + +W  PYR+RRLPIFEEI P RDQLAC
Sbjct: 321 EVGIDEIAVVEGLDEFVKEWRRPYRTRRLPIFEEILPLRDQLAC 364


>gi|357505377|ref|XP_003622977.1| hypothetical protein MTR_7g059010 [Medicago truncatula]
 gi|355497992|gb|AES79195.1| hypothetical protein MTR_7g059010 [Medicago truncatula]
          Length = 387

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 194/279 (69%), Positives = 232/279 (83%), Gaps = 2/279 (0%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           YHV+HK+P GD+PYV+AKH QLV+KDPE AIV FWKAINAGD+VDSALKDMAVV+KQ DR
Sbjct: 19  YHVIHKVPYGDTPYVKAKHAQLVDKDPEVAIVYFWKAINAGDKVDSALKDMAVVMKQLDR 78

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            +EA+EAIKSFR LC+K +QESLDNVL+DLYKKCGR++EQIELLK+KLR+IY GEAFNG+
Sbjct: 79  AEEAIEAIKSFRGLCNKHSQESLDNVLLDLYKKCGRVEEQIELLKRKLRLIYQGEAFNGR 138

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TKTARSHGKKFQV++KQET+R+LGNLGWAYMQK NY  AEVV++KAQ+ID DANKA NL
Sbjct: 139 TTKTARSHGKKFQVSIKQETARLLGNLGWAYMQKTNYMMAEVVFKKAQMIDADANKALNL 198

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
           + CL++Q+RY EA  +LE VL GKL GS E K+ NR +ELL EL    ++P         
Sbjct: 199 ALCLMRQSRYEEAYLILEQVLQGKLPGSDEIKSRNRAEELLVELSA--NLPQPKFMDDLG 256

Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
           L+D  L+G+D L+N W+P RSRRLPIFEEIS FRDQLA 
Sbjct: 257 LDDDLLKGIDGLLNVWSPTRSRRLPIFEEISSFRDQLAA 295


>gi|110742853|dbj|BAE99325.1| hypothetical protein [Arabidopsis thaliana]
          Length = 303

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 198/284 (69%), Positives = 235/284 (82%), Gaps = 5/284 (1%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           Y+VVHKLP GDSPYVRAKHVQLVEKD EAAI LFW AI A DRVDSALKDMA+++KQQ+R
Sbjct: 20  YNVVHKLPHGDSPYVRAKHVQLVEKDAEAAIELFWIAIKARDRVDSALKDMALLMKQQNR 79

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            +EA++AI+SFR LCS+QAQESLDNVLIDLYKKCGR++EQ+ELLKQKL MIY GEAFNGK
Sbjct: 80  AEEAIDAIQSFRDLCSRQAQESLDNVLIDLYKKCGRIEEQVELLKQKLWMIYQGEAFNGK 139

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
           PTKTARSHGKKFQVTV++ETSRILGNLGWAYMQ  +YT+AE VYRKAQLI+PDANKACNL
Sbjct: 140 PTKTARSHGKKFQVTVEKETSRILGNLGWAYMQLMDYTAAEAVYRKAQLIEPDANKACNL 199

Query: 189 SHCLIKQARYTEARSVL-EDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
             CLIKQ ++ EARS+L  DVL+    GS + + + RV+ELL EL+P +    +  + + 
Sbjct: 200 CTCLIKQGKHDEARSILFRDVLMENKEGSGDPRLMARVQELLSELKPQEEEAAASVSVEC 259

Query: 248 SL---EDAFLEGLDDLMNQW-TPYRSRRLPIFEEISPFRDQLAC 287
            +   E A +EGLD+ + +W  PYR+RRLPIFEEI P RDQLAC
Sbjct: 260 EVGIDEIAVVEGLDEFVKEWRRPYRTRRLPIFEEILPLRDQLAC 303


>gi|297848718|ref|XP_002892240.1| male sterility MS5 family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338082|gb|EFH68499.1| male sterility MS5 family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 305

 Score =  400 bits (1027), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 197/286 (68%), Positives = 233/286 (81%), Gaps = 7/286 (2%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           Y+VVHKLP GDSPYVRAKHVQLVEKD EAAI LFW+AI A DRVDSALKDMA+++KQQ+R
Sbjct: 20  YNVVHKLPHGDSPYVRAKHVQLVEKDAEAAIELFWRAIEARDRVDSALKDMALLMKQQNR 79

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            +EA++AI+SFR LCS+QAQESLDNVLIDLYKKCGR++EQ+ELLKQKL MIY GEAFNGK
Sbjct: 80  AEEAIDAIQSFRDLCSRQAQESLDNVLIDLYKKCGRIEEQVELLKQKLWMIYQGEAFNGK 139

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
           PTKTARSHGKKFQVTV++ETSRILGNLGWAYMQ  +YT+AE VYRKAQLI+PDANKACNL
Sbjct: 140 PTKTARSHGKKFQVTVEKETSRILGNLGWAYMQLMDYTAAEAVYRKAQLIEPDANKACNL 199

Query: 189 SHCLIKQARYTEARSVL-EDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
             CLIKQ +  EAR+++  DVL     GS + + + RV+ELL EL+P +    +L +   
Sbjct: 200 CTCLIKQGKQDEARTIIFRDVLTENKEGSGDPRLMARVQELLSELKPQEEEAAALASVSV 259

Query: 248 SL-----EDAFLEGLDDLMNQW-TPYRSRRLPIFEEISPFRDQLAC 287
                  E A +EGLD+ + +W  PYR+RRLPIFEEI P RDQLAC
Sbjct: 260 ECEVGMDEIAVVEGLDEFVKEWRRPYRTRRLPIFEEILPLRDQLAC 305


>gi|449468732|ref|XP_004152075.1| PREDICTED: uncharacterized protein LOC101204990 [Cucumis sativus]
          Length = 293

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 188/286 (65%), Positives = 234/286 (81%), Gaps = 1/286 (0%)

Query: 3   QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVV 62
           + +D+ +HV+HK+PPGD+PYVRAK+ QL++KDPE+AI LFW+AIN GDRV+SALKDMAVV
Sbjct: 8   ERQDELFHVIHKVPPGDTPYVRAKYAQLIKKDPESAIALFWEAINKGDRVESALKDMAVV 67

Query: 63  LKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHG 122
           +KQ DR +EA+  +++FR LCSK +Q SLDNVLIDL+KKCGR++EQIELLK+KLRMIY G
Sbjct: 68  MKQIDRAEEAIHILQTFRFLCSKHSQNSLDNVLIDLFKKCGRIEEQIELLKRKLRMIYQG 127

Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
           EAFNGKPT+TARSHGKKFQV+VKQETSR+LGNLGWAYMQK NY  AE VY+KAQ+IDPDA
Sbjct: 128 EAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDA 187

Query: 183 NKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSL 242
           NKACNL  CL+KQ R +EA  VLE V   ++ GS+E K   R  +LL E+   QS+P S+
Sbjct: 188 NKACNLGLCLMKQGRLSEAIFVLEQVQQAQIPGSSEIKAQKRSADLLTEIRSRQSLPDSI 247

Query: 243 TTKKSSLEDAFLEGLDDLMNQWTPY-RSRRLPIFEEISPFRDQLAC 287
                S++  FL GL+ L+N+  P+ RS+RLP+FEEIS FRDQLAC
Sbjct: 248 DLLGLSVDVDFLNGLELLVNKKGPFSRSKRLPVFEEISSFRDQLAC 293


>gi|125552911|gb|EAY98620.1| hypothetical protein OsI_20545 [Oryza sativa Indica Group]
          Length = 284

 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 184/279 (65%), Positives = 226/279 (81%), Gaps = 16/279 (5%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HVVHKL               V+KDPE AIV FWKAIN+ D+VDSALKDMAVV+KQQDR
Sbjct: 22  FHVVHKL---------------VDKDPETAIVWFWKAINSRDKVDSALKDMAVVMKQQDR 66

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            +EA+EAI+SFRHLCS+QAQESLDN+LIDLYKKCG++DEQI+LLKQKL+MIY GEAFNGK
Sbjct: 67  AEEAIEAIRSFRHLCSRQAQESLDNLLIDLYKKCGKVDEQIDLLKQKLKMIYLGEAFNGK 126

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TKTARSHGKKFQV+++QETSRILGNLGWAYMQ+ NY++AE+VYRKAQ I+PDAN+ACNL
Sbjct: 127 ATKTARSHGKKFQVSIQQETSRILGNLGWAYMQQSNYSAAELVYRKAQSIEPDANRACNL 186

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
             CLIKQ+R+ EAR VL DV+L ++SGS + K + R K+LL ELEP   +  S      S
Sbjct: 187 GLCLIKQSRHDEARQVLHDVVLRRISGSEDEKVVARAKQLLHELEPVTHV-TSPNNAGLS 245

Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
           + +  +E LD ++N+WTP+RSRRLP+FEEI+  RDQ+AC
Sbjct: 246 VSEEIMERLDLVLNEWTPFRSRRLPVFEEIATLRDQIAC 284


>gi|222632158|gb|EEE64290.1| hypothetical protein OsJ_19127 [Oryza sativa Japonica Group]
          Length = 284

 Score =  385 bits (990), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 184/279 (65%), Positives = 225/279 (80%), Gaps = 16/279 (5%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HVVHKL               V+KDPE AIV FWKAIN+ D+VDSALKDMAVV+KQQDR
Sbjct: 22  FHVVHKL---------------VDKDPETAIVWFWKAINSRDKVDSALKDMAVVMKQQDR 66

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
             EA+EAI+SFRHLCS+QAQESLDN+LIDLYKKCG++DEQI+LLKQKL+MIY GEAFNGK
Sbjct: 67  AKEAIEAIRSFRHLCSRQAQESLDNLLIDLYKKCGKVDEQIDLLKQKLKMIYLGEAFNGK 126

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TKTARSHGKKFQV+++QETSRILGNLGWAYMQ+ NY++AE+VYRKAQ I+PDAN+ACNL
Sbjct: 127 ATKTARSHGKKFQVSIQQETSRILGNLGWAYMQQSNYSAAELVYRKAQSIEPDANRACNL 186

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
             CLIKQ+R+ EAR VL DV+L ++SGS + K + R K+LL ELEP   +  S      S
Sbjct: 187 GLCLIKQSRHDEARQVLHDVVLRRISGSEDDKVVARAKQLLHELEPVTHV-TSPNNAGLS 245

Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
           + +  +E LD ++N+WTP+RSRRLP+FEEI+  RDQ+AC
Sbjct: 246 VSEEIMERLDLVLNEWTPFRSRRLPVFEEIATLRDQIAC 284


>gi|297795599|ref|XP_002865684.1| male sterility MS5 family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311519|gb|EFH41943.1| male sterility MS5 family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 306

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 179/285 (62%), Positives = 222/285 (77%), Gaps = 5/285 (1%)

Query: 6   DQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQ 65
           D+ +HV+HK+P GD+PYVRAKH QL+EK+PE AIV FWKAIN GDRVDSALKDMAVV+KQ
Sbjct: 24  DEMFHVIHKVPCGDTPYVRAKHAQLIEKNPEMAIVWFWKAINTGDRVDSALKDMAVVMKQ 83

Query: 66  QDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAF 125
            DR +EA+EAIKSFR  CSK +Q+SLDNVLIDLYKKCGR++EQ+ELLK+KLR IY GEAF
Sbjct: 84  LDRSEEAIEAIKSFRPRCSKNSQDSLDNVLIDLYKKCGRMEEQVELLKRKLRQIYQGEAF 143

Query: 126 NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           NGKPTKTARSHGKKFQVTV+QE SR+LGNLGWAYMQ+  Y SAE VYRKAQ+++PDANK+
Sbjct: 144 NGKPTKTARSHGKKFQVTVQQEISRLLGNLGWAYMQQAKYLSAEAVYRKAQMVEPDANKS 203

Query: 186 CNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRV---KELLQELEPWQSIPPSL 242
           CNL+ CLIKQ R+ E R VL+DVL  ++ G+ + +T  R       L+   P        
Sbjct: 204 CNLAMCLIKQGRFEEGRLVLDDVLESRVLGADDCRTRQRAEELLSELESSLPRMGDAEME 263

Query: 243 TTKKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
               + L+D F+ GL+ + +  T ++S+RLPIFE+IS FR+ L C
Sbjct: 264 DVLGNILDDDFVLGLEQMTS--TSFKSKRLPIFEQISSFRNTLVC 306


>gi|9758884|dbj|BAB09438.1| unnamed protein product [Arabidopsis thaliana]
          Length = 326

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 179/285 (62%), Positives = 223/285 (78%), Gaps = 5/285 (1%)

Query: 6   DQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQ 65
           D+ +HV+HK+P GD+PYVRAKH QL+EK+PE AIV FWKAIN GDRVDSALKDMAVV+KQ
Sbjct: 44  DELFHVIHKVPCGDTPYVRAKHAQLIEKNPEMAIVWFWKAINTGDRVDSALKDMAVVMKQ 103

Query: 66  QDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAF 125
            DR +EA+EAIKSFR  CSK +Q+SLDNVLIDLYKKCGR++EQ+ELLK+KLR IY GEAF
Sbjct: 104 LDRSEEAIEAIKSFRPRCSKNSQDSLDNVLIDLYKKCGRMEEQVELLKRKLRQIYQGEAF 163

Query: 126 NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           NGKPTKTARSHGKKFQVTV+QE SR+LGNLGWAYMQ+  Y SAE VYRKAQ+++PDANK+
Sbjct: 164 NGKPTKTARSHGKKFQVTVQQEISRLLGNLGWAYMQQAKYLSAEAVYRKAQMVEPDANKS 223

Query: 186 CNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRV---KELLQELEPWQSIPPSL 242
           CNL+ CLIKQ R+ E R VL+DVL  ++ G+ + +T  R       L+   P        
Sbjct: 224 CNLAMCLIKQGRFEEGRLVLDDVLEYRVLGADDCRTRQRAEELLSELESSLPRMRDAEME 283

Query: 243 TTKKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
               + L+D F+ GL+++ +  T ++S+RLPIFE+IS FR+ L C
Sbjct: 284 DVLGNILDDDFVLGLEEMTS--TSFKSKRLPIFEQISSFRNTLVC 326


>gi|30695554|ref|NP_199696.2| tetratricopeptide repeat domain-containing protein [Arabidopsis
           thaliana]
 gi|26451131|dbj|BAC42669.1| unknown protein [Arabidopsis thaliana]
 gi|28950875|gb|AAO63361.1| At5g48850 [Arabidopsis thaliana]
 gi|332008350|gb|AED95733.1| tetratricopeptide repeat domain-containing protein [Arabidopsis
           thaliana]
          Length = 306

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 179/285 (62%), Positives = 223/285 (78%), Gaps = 5/285 (1%)

Query: 6   DQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQ 65
           D+ +HV+HK+P GD+PYVRAKH QL+EK+PE AIV FWKAIN GDRVDSALKDMAVV+KQ
Sbjct: 24  DELFHVIHKVPCGDTPYVRAKHAQLIEKNPEMAIVWFWKAINTGDRVDSALKDMAVVMKQ 83

Query: 66  QDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAF 125
            DR +EA+EAIKSFR  CSK +Q+SLDNVLIDLYKKCGR++EQ+ELLK+KLR IY GEAF
Sbjct: 84  LDRSEEAIEAIKSFRPRCSKNSQDSLDNVLIDLYKKCGRMEEQVELLKRKLRQIYQGEAF 143

Query: 126 NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           NGKPTKTARSHGKKFQVTV+QE SR+LGNLGWAYMQ+  Y SAE VYRKAQ+++PDANK+
Sbjct: 144 NGKPTKTARSHGKKFQVTVQQEISRLLGNLGWAYMQQAKYLSAEAVYRKAQMVEPDANKS 203

Query: 186 CNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRV---KELLQELEPWQSIPPSL 242
           CNL+ CLIKQ R+ E R VL+DVL  ++ G+ + +T  R       L+   P        
Sbjct: 204 CNLAMCLIKQGRFEEGRLVLDDVLEYRVLGADDCRTRQRAEELLSELESSLPRMRDAEME 263

Query: 243 TTKKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
               + L+D F+ GL+++ +  T ++S+RLPIFE+IS FR+ L C
Sbjct: 264 DVLGNILDDDFVLGLEEMTS--TSFKSKRLPIFEQISSFRNTLVC 306


>gi|212722260|ref|NP_001132763.1| uncharacterized protein LOC100194250 [Zea mays]
 gi|194695332|gb|ACF81750.1| unknown [Zea mays]
 gi|413957078|gb|AFW89727.1| hypothetical protein ZEAMMB73_680631 [Zea mays]
          Length = 304

 Score =  333 bits (854), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 168/286 (58%), Positives = 214/286 (74%), Gaps = 8/286 (2%)

Query: 10  HVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRV 69
           HV HK+P GD PY RAKH QLVEKD +A+I  FWKAI  GD+VDSALKDMAVV+KQ+  +
Sbjct: 19  HVAHKIPAGDGPYARAKHFQLVEKDLDASIAWFWKAIGTGDKVDSALKDMAVVMKQRGYL 78

Query: 70  DEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKP 129
            EA+ A++S RHLC KQ+QESLDN+L+DLYK  GR  E+IELLKQKLR IY GEAF+GK 
Sbjct: 79  AEAIGAVRSLRHLCPKQSQESLDNILLDLYKASGRTKEEIELLKQKLRKIYLGEAFHGKT 138

Query: 130 TKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLS 189
           TK ARSHG+K  V++KQETSR+LGNL WAYMQ+ N+ +AEVVYRKAQ+IDPDANKACNL+
Sbjct: 139 TKRARSHGRKIHVSIKQETSRVLGNLAWAYMQQRNFMAAEVVYRKAQMIDPDANKACNLA 198

Query: 190 HCLIKQARYTEARSVLEDVLLGKLSG--STETKTINRVKELLQEL--EPWQSIPPSLTTK 245
            CLI+QAR  +A+ VL DVL G+       + K + +V+ELL  +  + WQ        +
Sbjct: 199 LCLIEQARLADAQLVLTDVLAGRYQARDQQDGKIVRKVEELLARIMAQTWQGGGGGRRRR 258

Query: 246 KSSLEDAFLEG-LDDLMNQWTPYR--SRRLPIFEEISP-FRDQLAC 287
             S +D ++E  +  L++   PYR  SRRLP+FEEISP +++Q+AC
Sbjct: 259 DESDDDDWVENQMLALLDVAVPYRKTSRRLPVFEEISPIYKEQVAC 304


>gi|242036819|ref|XP_002465804.1| hypothetical protein SORBIDRAFT_01g046120 [Sorghum bicolor]
 gi|241919658|gb|EER92802.1| hypothetical protein SORBIDRAFT_01g046120 [Sorghum bicolor]
          Length = 312

 Score =  326 bits (836), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 168/297 (56%), Positives = 213/297 (71%), Gaps = 23/297 (7%)

Query: 10  HVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRV 69
           HV HK+P GD PY RAKH QLVEKD +A+I  FWKAI+ GD+VDSALKDMAVV+KQ+  +
Sbjct: 20  HVAHKIPAGDGPYARAKHFQLVEKDLDASIAWFWKAISTGDKVDSALKDMAVVMKQRGYL 79

Query: 70  DEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKP 129
            EA++AI+S RHLC KQ+QESLDN+L+DLYK  GR  E+IELLKQKLR IY GEAF+GK 
Sbjct: 80  TEAIDAIRSLRHLCPKQSQESLDNILLDLYKASGRTKEEIELLKQKLRKIYLGEAFHGKT 139

Query: 130 TKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLS 189
           TK ARSHG+K  V+V+QETSRILGNL WAYMQ+ N+ +AEVVYRKAQ+IDPDANKACNL+
Sbjct: 140 TKRARSHGRKIHVSVRQETSRILGNLAWAYMQQRNFMAAEVVYRKAQMIDPDANKACNLA 199

Query: 190 HCLIKQARYTEARSVLEDVLLGKLSG---STETKTINRVKELLQEL--EPWQSIPPSLTT 244
            CLI+Q R+ +A  VL DVL G+        + K + +V+ELL  +  + W     +   
Sbjct: 200 LCLIEQTRFADAELVLADVLAGRYLARDQQQDGKIVRKVEELLARIVAQTWPGGGANSGR 259

Query: 245 KKSS-----------LEDAFLEGLDDLMNQWTPYR--SRRLPIFEEISP-FRDQLAC 287
           ++             +E+  L  LD  +    PYR  S+RLP+FEEISP +R+Q+AC
Sbjct: 260 RRDESGSESDDDDDWVENQMLALLDVAV----PYRKSSKRLPVFEEISPVYREQVAC 312


>gi|115450941|ref|NP_001049071.1| Os03g0165900 [Oryza sativa Japonica Group]
 gi|108706358|gb|ABF94153.1| TPR Domain containing protein, expressed [Oryza sativa Japonica
           Group]
 gi|113547542|dbj|BAF10985.1| Os03g0165900 [Oryza sativa Japonica Group]
 gi|125585045|gb|EAZ25709.1| hypothetical protein OsJ_09542 [Oryza sativa Japonica Group]
 gi|215717030|dbj|BAG95393.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 315

 Score =  323 bits (829), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 168/295 (56%), Positives = 212/295 (71%), Gaps = 18/295 (6%)

Query: 10  HVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRV 69
           HV +K+P GD PY RAKH QLVEKD +A+I  FWKAIN+GD+VDSALKDMAVV+KQ+   
Sbjct: 19  HVAYKIPAGDGPYARAKHYQLVEKDLDASIAWFWKAINSGDKVDSALKDMAVVMKQRGYH 78

Query: 70  DEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKP 129
            +A++AIKS RHLC KQ+Q+SLDN+L+DLYK  GR  E+IELLKQKLR IY GEAF+GK 
Sbjct: 79  ADAIDAIKSLRHLCPKQSQDSLDNILLDLYKASGRTKEEIELLKQKLRKIYLGEAFHGKT 138

Query: 130 TKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLS 189
           TK ARSHG+K  V+VKQETSR+LGNL WAYMQ+GN+ +AEVVYRKAQ++DPDANKACNL+
Sbjct: 139 TKRARSHGRKIHVSVKQETSRVLGNLAWAYMQQGNFMAAEVVYRKAQMVDPDANKACNLA 198

Query: 190 HCLIKQARYTEARSVLEDVLLGKLSG--------STETKTINRVKELLQEL------EPW 235
            CLI+Q R  +A +VL  VL G+            T  K + +V+EL+  +         
Sbjct: 199 LCLIEQRRPADAEAVLAGVLAGRYHARDDDHGAPHTGGKILAKVEELMARITGEVDGRSN 258

Query: 236 QSIPPSLTTKKSSLEDAFLEGLDDLMNQW-TPYR--SRRLPIFEEISPF-RDQLA 286
            S   S       +ED  +E LD ++ QW  PYR  +RRLP+FEEI+P  R+Q+A
Sbjct: 259 SSGGSSSDDDDRDVEDEMVELLDVVVRQWAAPYRRSNRRLPVFEEITPVCREQMA 313


>gi|125542541|gb|EAY88680.1| hypothetical protein OsI_10155 [Oryza sativa Indica Group]
          Length = 315

 Score =  323 bits (828), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 168/295 (56%), Positives = 212/295 (71%), Gaps = 18/295 (6%)

Query: 10  HVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRV 69
           HV +K+P GD PY RAKH QLVEKD +A+I  FWKAIN+GD+VDSALKDMAVV+KQ+   
Sbjct: 19  HVAYKIPAGDGPYARAKHYQLVEKDLDASIAWFWKAINSGDKVDSALKDMAVVMKQRGYH 78

Query: 70  DEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKP 129
            +A++AIKS RHLC KQ+Q+SLDN+L+DLYK  GR  E+IELLKQKLR IY GEAF+GK 
Sbjct: 79  ADAIDAIKSLRHLCPKQSQDSLDNILLDLYKASGRTKEEIELLKQKLRKIYLGEAFHGKT 138

Query: 130 TKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLS 189
           TK ARSHG+K  V+VKQETSR+LGNL WAYMQ+GN+ +AEVVYRKAQ++DPDANKACNL+
Sbjct: 139 TKRARSHGRKIHVSVKQETSRVLGNLAWAYMQQGNFMAAEVVYRKAQMVDPDANKACNLA 198

Query: 190 HCLIKQARYTEARSVLEDVLLGKLSG--------STETKTINRVKELLQEL------EPW 235
            CLI+Q R  +A +VL  VL G+            T  K + +V+EL+  +         
Sbjct: 199 LCLIEQRRPADAEAVLAGVLAGRYHARDDDHGAPHTGGKILAKVEELMARITGEVDGRSN 258

Query: 236 QSIPPSLTTKKSSLEDAFLEGLDDLMNQW-TPYR--SRRLPIFEEISPF-RDQLA 286
            S   S       +ED  +E LD ++ QW  PYR  +RRLP+FEEI+P  R+Q+A
Sbjct: 259 SSGGSSSDDDDRDVEDEMVELLDVVVRQWAAPYRRSNRRLPVFEEITPVCREQMA 313


>gi|357113902|ref|XP_003558740.1| PREDICTED: uncharacterized protein LOC100831382 [Brachypodium
           distachyon]
          Length = 305

 Score =  321 bits (823), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 165/295 (55%), Positives = 213/295 (72%), Gaps = 15/295 (5%)

Query: 6   DQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQ 65
           D   HV HK+P GD PY RAKH QLVEKD +A+IV FWKAI  GD+VDSALKDMAVV+KQ
Sbjct: 13  DAAAHVAHKIPSGDGPYARAKHYQLVEKDLDASIVWFWKAIETGDKVDSALKDMAVVMKQ 72

Query: 66  QDRVDEAVEAIKSFRHLC--SKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
           +  + +A++AIKS RHLC  S+Q+QESLDN+L+DLYK  GR  E+I+LLKQKLR I+HGE
Sbjct: 73  RGYLKDAIDAIKSLRHLCNPSRQSQESLDNILLDLYKASGRTREEIDLLKQKLRRIFHGE 132

Query: 124 AF-NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
           AF  GK TK ARSHG+K  V+VKQETSR+LGNL WAYMQ+ N+ +AE VYRKAQ++DPDA
Sbjct: 133 AFPRGKSTKRARSHGRKIHVSVKQETSRVLGNLAWAYMQERNFMAAEAVYRKAQMVDPDA 192

Query: 183 NKACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKELLQELEPWQSI 238
           NKACNL+ CL++Q R  +A  VL+ VL G    ++ G    K + + +EL++ +   +  
Sbjct: 193 NKACNLALCLVEQRRMGDAEKVLDGVLSGVYVEQIGGG--EKAVRKAEELMERIRAGKGG 250

Query: 239 PPSLTTKKSSLE-DAFLEGLDDLMNQWT-PYR--SRRLPIFEEISPF--RDQLAC 287
                 ++  +E D   E LD ++ +W  PYR   RRLP+FEEI+PF  R Q+AC
Sbjct: 251 DGVEEDQEDGVEADEMAELLDVVVKEWARPYRRSDRRLPVFEEITPFCGRGQIAC 305


>gi|226531990|ref|NP_001151272.1| pollenless3 [Zea mays]
 gi|195645436|gb|ACG42186.1| pollenless3 [Zea mays]
 gi|414865004|tpg|DAA43561.1| TPA: pollenless3 [Zea mays]
          Length = 312

 Score =  320 bits (821), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 164/288 (56%), Positives = 206/288 (71%), Gaps = 18/288 (6%)

Query: 10  HVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRV 69
           HV HK+PPGD PY RAKH QLVEKD +A+I  FWKAI+AGD++DSALKDMAVV+KQ+  +
Sbjct: 22  HVAHKIPPGDGPYARAKHFQLVEKDLDASIAWFWKAISAGDKLDSALKDMAVVMKQRGYL 81

Query: 70  DEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKP 129
            EA++AI+S RHLC KQ+QE LDN+L+DLYK  GR  E+IELLKQKLR IY GEAF+G+ 
Sbjct: 82  AEAIDAIRSLRHLCPKQSQEPLDNILLDLYKASGRTKEEIELLKQKLRKIYLGEAFHGRT 141

Query: 130 TKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLS 189
           TK ARSHG+K  V+VKQETSR+LGNL WAYMQ+ N+ +AEVVYRKAQ+IDPDANKACNL+
Sbjct: 142 TKRARSHGRKIHVSVKQETSRVLGNLAWAYMQQRNFMAAEVVYRKAQMIDPDANKACNLA 201

Query: 190 HCLIKQARYTEARSVLEDVLLGKL---SGSTETKTINRVKELLQELEPWQSIP------- 239
            CLI+QAR  +A  VL DVL G+        + K + +V+ELL  +    + P       
Sbjct: 202 LCLIEQARLADAELVLADVLAGRYLQARDQQDGKIVRKVEELLARIMAHTTWPGGGGGDG 261

Query: 240 ------PSLTTKKSSLEDAFLEGLDDLMNQWTPYR--SRRLPIFEEIS 279
                  S +  +S  +D     +  L++   PYR  SRRLP+FEEIS
Sbjct: 262 RRRPADKSGSGSESDDDDWVENQMLALLDVAVPYRKKSRRLPVFEEIS 309


>gi|168001142|ref|XP_001753274.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695560|gb|EDQ81903.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 296

 Score =  312 bits (799), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 147/233 (63%), Positives = 186/233 (79%), Gaps = 3/233 (1%)

Query: 4   TRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVL 63
            R   +HVVHK+P GD+PYV+AKHVQLV+KDP+ AI LFW AINAGDRVDSALKDMA+V+
Sbjct: 25  ARADSFHVVHKVPVGDTPYVKAKHVQLVDKDPDRAIALFWAAINAGDRVDSALKDMAIVM 84

Query: 64  KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
           KQQ+R  EA+EAIKS R  CS QAQESLDNVL+DLYK+CGRLD+QI+LLK KL +I+ G 
Sbjct: 85  KQQNRPQEAIEAIKSLRSRCSDQAQESLDNVLLDLYKRCGRLDDQIDLLKHKLHLIHQGM 144

Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
           AFNGK TKTARS GKKFQV+++QE +R+LGNLGWA+MQ+ N+ +AE VYRKA  I+PD N
Sbjct: 145 AFNGKRTKTARSQGKKFQVSIEQEATRLLGNLGWAHMQQSNFVAAEAVYRKALSIEPDNN 204

Query: 184 KACNLSHCLIKQARYTEARSVLEDVLL---GKLSGSTETKTINRVKELLQELE 233
           K CNL  CL+KQ R  EA+++L+ V      + +  +  K+ +R +E+LQ+LE
Sbjct: 205 KVCNLGICLMKQGRLEEAKAMLQSVTRCNDNRWASDSHLKSYDRAQEMLQDLE 257


>gi|168001098|ref|XP_001753252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695538|gb|EDQ81881.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 298

 Score =  312 bits (799), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 148/233 (63%), Positives = 185/233 (79%), Gaps = 3/233 (1%)

Query: 4   TRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVL 63
            R   +HVVHK+P GD+PYV+AKHVQLV+KDP+ AI LFW AINAGDRVDSALKDMA+V+
Sbjct: 25  ARADSFHVVHKVPVGDTPYVKAKHVQLVDKDPDRAIALFWAAINAGDRVDSALKDMAIVM 84

Query: 64  KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
           KQQ+R  EA+EAIKS R  CS QAQESLDNVL+DLYK+CGRLD+QI+LLK KL +I+ G 
Sbjct: 85  KQQNRPQEAIEAIKSLRSRCSDQAQESLDNVLLDLYKRCGRLDDQIDLLKHKLHLIHQGM 144

Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
           AFNGK TKTARS GKKFQV+++QE +R+LGNLGWA MQ+ N+ +AE VYRKA  I+PD N
Sbjct: 145 AFNGKRTKTARSQGKKFQVSIEQEATRLLGNLGWACMQQSNFVAAEAVYRKALSIEPDNN 204

Query: 184 KACNLSHCLIKQARYTEARSVLEDVLL---GKLSGSTETKTINRVKELLQELE 233
           K CNL  CL+KQ R  EA+++L+ V      + +  +  K+ +R +E+LQELE
Sbjct: 205 KVCNLGICLMKQGRLEEAKAMLQSVTRCNDNRWASDSHLKSYDRAQEMLQELE 257


>gi|242045418|ref|XP_002460580.1| hypothetical protein SORBIDRAFT_02g031200 [Sorghum bicolor]
 gi|241923957|gb|EER97101.1| hypothetical protein SORBIDRAFT_02g031200 [Sorghum bicolor]
          Length = 516

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 150/237 (63%), Positives = 182/237 (76%), Gaps = 4/237 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           M +T    +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 37  MLRTHSDSFHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWSAINAGDRVDSALKDMA 96

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL++I+
Sbjct: 97  IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQLIH 156

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE  YR+A LI P
Sbjct: 157 QGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALLIGP 216

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
           D NK CNL  CL+KQ R  EA+ VL+ V    + G     +  K   R +E+L++LE
Sbjct: 217 DNNKMCNLGICLMKQGRVLEAKDVLKQVRPAAVDGLRGADSHLKAYERAQEMLRDLE 273


>gi|115480407|ref|NP_001063797.1| Os09g0538500 [Oryza sativa Japonica Group]
 gi|50726663|dbj|BAD34381.1| putative pollenless3 [Oryza sativa Japonica Group]
 gi|113632030|dbj|BAF25711.1| Os09g0538500 [Oryza sativa Japonica Group]
 gi|125606466|gb|EAZ45502.1| hypothetical protein OsJ_30159 [Oryza sativa Japonica Group]
 gi|215740583|dbj|BAG97239.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215768313|dbj|BAH00542.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215768379|dbj|BAH00608.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 515

 Score =  311 bits (797), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 150/237 (63%), Positives = 182/237 (76%), Gaps = 4/237 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           M +T    +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 46  MLRTNSDSFHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWAAINAGDRVDSALKDMA 105

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL++I+
Sbjct: 106 IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQLIH 165

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE  YR+A LI P
Sbjct: 166 QGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALLIGP 225

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
           D NK CNL  CL+KQ R  EA+ VL+ V    + G     +  K   R +E+L++LE
Sbjct: 226 DNNKMCNLGICLMKQGRVLEAKDVLKQVRPAGVDGLRGADSHLKAYERAQEMLRDLE 282


>gi|125564523|gb|EAZ09903.1| hypothetical protein OsI_32198 [Oryza sativa Indica Group]
          Length = 513

 Score =  311 bits (797), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 150/237 (63%), Positives = 182/237 (76%), Gaps = 4/237 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           M +T    +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 46  MLRTNSDSFHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWAAINAGDRVDSALKDMA 105

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL++I+
Sbjct: 106 IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQLIH 165

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE  YR+A LI P
Sbjct: 166 QGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALLIGP 225

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
           D NK CNL  CL+KQ R  EA+ VL+ V    + G     +  K   R +E+L++LE
Sbjct: 226 DNNKMCNLGICLMKQGRVLEAKDVLKQVRPAGVDGLRGADSHLKAYERAQEMLRDLE 282


>gi|357159691|ref|XP_003578528.1| PREDICTED: uncharacterized protein LOC100833530 [Brachypodium
           distachyon]
          Length = 508

 Score =  311 bits (796), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 164/297 (55%), Positives = 203/297 (68%), Gaps = 18/297 (6%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           M +T    +HV  K+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 40  MLRTNSDAFHVAFKVPVGDTPYVRAKRVQLVDKDPEKAIALFWAAINAGDRVDSALKDMA 99

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EA+KS R  CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL++I+
Sbjct: 100 IVMKQQNRAEEAIEAVKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQLIH 159

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE  YR+A LI P
Sbjct: 160 QGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALLIGP 219

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
           D NK CNL  CL+KQ R  EA+ VL+ V    + G     +  K   R +E+L++LE   
Sbjct: 220 DNNKMCNLGICLMKQGRVLEAKDVLKQVRPAGVDGLRGADSHLKAYERAQEMLRDLEIKL 279

Query: 237 SIPPSLTTKKSSLEDAFLEGLDDLM-----NQWTPYRSRRLPIFEEISP--FRDQLA 286
              P+  T    L+ ++L   D LM     + W P      P  E + P   RDQ A
Sbjct: 280 VGRPAWATAGDQLDKSWL--FDALMLGSSSSIWQPQ-----PCVEHLLPPLPRDQFA 329


>gi|226497904|ref|NP_001141607.1| hypothetical protein [Zea mays]
 gi|194705256|gb|ACF86712.1| unknown [Zea mays]
 gi|414888334|tpg|DAA64348.1| TPA: hypothetical protein ZEAMMB73_633017 [Zea mays]
          Length = 485

 Score =  311 bits (796), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 156/253 (61%), Positives = 188/253 (74%), Gaps = 5/253 (1%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA+V+KQQ+R
Sbjct: 39  FHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWSAINAGDRVDSALKDMAIVMKQQNR 98

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL++I  G AFNGK
Sbjct: 99  AEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQLINQGHAFNGK 158

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE  YR+A LI PD NK CNL
Sbjct: 159 RTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALLIGPDNNKMCNL 218

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEP-WQSIPPSLT 243
             CL+KQAR  EA+ VL+ V    + G     +  K   R +E+L++LE      PP+  
Sbjct: 219 GICLMKQARVLEAKDVLKQVRPAAVDGLRGADSHLKAYERAQEMLRDLETKLIGRPPADQ 278

Query: 244 TKKSSLEDAFLEG 256
              + L DA L G
Sbjct: 279 LDTNWLFDALLLG 291


>gi|302818051|ref|XP_002990700.1| hypothetical protein SELMODRAFT_451580 [Selaginella moellendorffii]
 gi|300141622|gb|EFJ08332.1| hypothetical protein SELMODRAFT_451580 [Selaginella moellendorffii]
          Length = 538

 Score =  311 bits (796), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 147/237 (62%), Positives = 186/237 (78%), Gaps = 4/237 (1%)

Query: 4   TRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVL 63
           +R   +HV+HK+P GDSPYVRAKHVQLV+K+P+ AI LFW AINAGDRVDSALKDMA+V+
Sbjct: 36  SRADSFHVIHKVPLGDSPYVRAKHVQLVDKEPDKAIALFWAAINAGDRVDSALKDMAIVM 95

Query: 64  KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
           KQQ+R +EA+EAIKS R  CS QAQESLDNVL+DLYK+CGRLD+QI LLK KL++IY G 
Sbjct: 96  KQQNRPEEAIEAIKSLRGRCSDQAQESLDNVLLDLYKRCGRLDDQIALLKHKLQLIYAGL 155

Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
           AFNGK TKTARS GKKFQV+++QE +R+LGNLGWAYMQ+ ++ +AE VYRKA  ++PD N
Sbjct: 156 AFNGKRTKTARSQGKKFQVSIEQEATRLLGNLGWAYMQQSDFIAAEAVYRKALSMEPDNN 215

Query: 184 KACNLSHCLIKQARYTEARSVLEDVLL----GKLSGSTETKTINRVKELLQELEPWQ 236
           K CNL  CL+KQ R  EA+ +L+ V+      +    +  K+  R +E+L E+E  Q
Sbjct: 216 KMCNLGICLMKQGRVDEAKVLLKSVMPASSDARWGADSHLKSYERAQEMLAEMEQGQ 272


>gi|302771231|ref|XP_002969034.1| hypothetical protein SELMODRAFT_451579 [Selaginella moellendorffii]
 gi|300163539|gb|EFJ30150.1| hypothetical protein SELMODRAFT_451579 [Selaginella moellendorffii]
          Length = 533

 Score =  310 bits (795), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 147/237 (62%), Positives = 186/237 (78%), Gaps = 4/237 (1%)

Query: 4   TRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVL 63
           +R   +HV+HK+P GDSPYVRAKHVQLV+K+P+ AI LFW AINAGDRVDSALKDMA+V+
Sbjct: 36  SRADSFHVIHKVPLGDSPYVRAKHVQLVDKEPDKAIALFWAAINAGDRVDSALKDMAIVM 95

Query: 64  KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
           KQQ+R +EA+EAIKS R  CS QAQESLDNVL+DLYK+CGRLD+QI LLK KL++IY G 
Sbjct: 96  KQQNRPEEAIEAIKSLRGRCSDQAQESLDNVLLDLYKRCGRLDDQIALLKHKLQLIYAGL 155

Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
           AFNGK TKTARS GKKFQV+++QE +R+LGNLGWAYMQ+ ++ +AE VYRKA  ++PD N
Sbjct: 156 AFNGKRTKTARSQGKKFQVSIEQEATRLLGNLGWAYMQQSDFIAAEAVYRKALSMEPDNN 215

Query: 184 KACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKELLQELEPWQ 236
           K CNL  CL+KQ R  EA+ +L+ V+      +    +  K+  R +E+L E+E  Q
Sbjct: 216 KMCNLGICLMKQGRVDEAKVLLKSVMPASSDTRWGADSHLKSYERAQEMLAEMEQGQ 272


>gi|168005546|ref|XP_001755471.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693178|gb|EDQ79531.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 299

 Score =  310 bits (795), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 147/233 (63%), Positives = 186/233 (79%), Gaps = 4/233 (1%)

Query: 4   TRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVL 63
           +R  P+HVVHK+P GDSPYV+AKHVQLV+KDP+ AI LFW AIN+GDRVDSALKDMA+V+
Sbjct: 26  SRGDPFHVVHKVPVGDSPYVKAKHVQLVDKDPDRAIALFWAAINSGDRVDSALKDMAIVM 85

Query: 64  KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
           +QQ+R  EA+EAIKS R  CS QAQESLDNVL+DLYK+CGRLD+QI+LL+ KL +I+ G 
Sbjct: 86  RQQNRPQEAIEAIKSLRSRCSDQAQESLDNVLLDLYKRCGRLDDQIDLLRHKLHLIHQGL 145

Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
           AFNGK TKTARS GKKFQV+VKQE +R+LGNLGWAYMQ+ N+ +AE VYRKA  I+ D N
Sbjct: 146 AFNGKRTKTARSQGKKFQVSVKQEATRLLGNLGWAYMQQLNFVAAEAVYRKALSIETDIN 205

Query: 184 KACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKELLQEL 232
           K CNL  CL+KQ R  EA++ L+ V +     + +  ++ K+  R ++LL+EL
Sbjct: 206 KVCNLGICLMKQGRLEEAKAALQRVTVAYSGIRWASDSQLKSYERAQDLLKEL 258


>gi|414590076|tpg|DAA40647.1| TPA: hypothetical protein ZEAMMB73_529703 [Zea mays]
          Length = 547

 Score =  310 bits (793), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 150/237 (63%), Positives = 181/237 (76%), Gaps = 4/237 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           M +T    +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 75  MLRTHSDSFHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWSAINAGDRVDSALKDMA 134

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL+ I+
Sbjct: 135 IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQRIH 194

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE  YR+A LI P
Sbjct: 195 QGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALLIGP 254

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
           D NK CNL  CL+KQ R  EA+ VL+ V    + G     +  K   R +E+L++LE
Sbjct: 255 DNNKMCNLGICLMKQGRVLEAKDVLKQVRPAGVDGLRGADSHLKAYERAQEMLRDLE 311


>gi|293333705|ref|NP_001170740.1| uncharacterized protein LOC100384832 [Zea mays]
 gi|238007284|gb|ACR34677.1| unknown [Zea mays]
          Length = 509

 Score =  310 bits (793), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 150/237 (63%), Positives = 181/237 (76%), Gaps = 4/237 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           M +T    +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 37  MLRTHSDSFHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWSAINAGDRVDSALKDMA 96

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL+ I+
Sbjct: 97  IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQRIH 156

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE  YR+A LI P
Sbjct: 157 QGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALLIGP 216

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
           D NK CNL  CL+KQ R  EA+ VL+ V    + G     +  K   R +E+L++LE
Sbjct: 217 DNNKMCNLGICLMKQGRVLEAKDVLKQVRPAGVDGLRGADSHLKAYERAQEMLRDLE 273


>gi|326515846|dbj|BAK07169.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 316

 Score =  309 bits (792), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 160/292 (54%), Positives = 200/292 (68%), Gaps = 14/292 (4%)

Query: 10  HVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRV 69
           HV  K+P G  PY RAKH QLVEKD +A+I  FWKAI  GD+VDSALKDMAVV+KQ+  +
Sbjct: 25  HVALKIPSGGGPYARAKHYQLVEKDLDASIAWFWKAIETGDKVDSALKDMAVVMKQRGYL 84

Query: 70  DEAVEAIKSFRHLC-SKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAF-NG 127
           D+AV+AI+S RHLC  KQ+QESLDN+L+DLYK  GR  E+IELLK KLR IYHG+AF  G
Sbjct: 85  DDAVDAIRSLRHLCPGKQSQESLDNILLDLYKASGRTKEEIELLKHKLRRIYHGQAFPAG 144

Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
           K TK ARSHG+K  V+V+QETSR+LGNL WAYMQ+ N+ +AE VYRKAQ+++PDANKACN
Sbjct: 145 KSTKRARSHGRKIHVSVQQETSRVLGNLAWAYMQQRNFMAAEAVYRKAQMVEPDANKACN 204

Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTE-----TKTINRVKELLQELEPWQSIPPSL 242
           L+ CL++Q R  +A  VL  V+ G      E      K + + +ELL+ +          
Sbjct: 205 LAVCLVEQGRLGDAEGVLAYVVAGAFRDGREREHGGGKVVRKAEELLERIRAEMGGGGGE 264

Query: 243 TTKKSSLE---DAFLEGLDDLMNQWT-PYRS--RRLPIFEEISPF-RDQLAC 287
                      D   E LD +  QW  PYR   RRLP+FEEI+PF R+Q+AC
Sbjct: 265 KEAGEEDGAEADEMSELLDAVARQWAPPYRKSHRRLPVFEEITPFGREQMAC 316


>gi|224071329|ref|XP_002303407.1| predicted protein [Populus trichocarpa]
 gi|222840839|gb|EEE78386.1| predicted protein [Populus trichocarpa]
          Length = 238

 Score =  309 bits (791), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 150/236 (63%), Positives = 179/236 (75%), Gaps = 4/236 (1%)

Query: 2   SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
           S+TR + +H +HK+P GDSPYVRAK+VQLV+KDPE AI LFW AINAGDRVDSALKDMA+
Sbjct: 1   SRTRSESFHALHKVPVGDSPYVRAKNVQLVDKDPEKAIPLFWAAINAGDRVDSALKDMAI 60

Query: 62  VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYH 121
           V+KQQ+R DEA+EAIKS RH CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL MI  
Sbjct: 61  VMKQQNRADEAIEAIKSLRHRCSDQAQESLDNILLDLYKRCGRLDDQIALLKHKLYMIQQ 120

Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
           G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY  AE  YR+A  I PD
Sbjct: 121 GLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYIEAEDAYRRALAIAPD 180

Query: 182 ANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
            NK CNL  CL+KQ R +EA+  L  V      G     +  K   R +++L +LE
Sbjct: 181 NNKMCNLGICLMKQGRISEAKETLRRVKPAVADGPRGVDSHLKAYERAQQMLNDLE 236


>gi|326532454|dbj|BAK05156.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 539

 Score =  309 bits (791), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 149/237 (62%), Positives = 182/237 (76%), Gaps = 4/237 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           M +T    +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 45  MLRTHSDSFHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWGAINAGDRVDSALKDMA 104

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL++I+
Sbjct: 105 IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQLIH 164

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE  YR+A +I P
Sbjct: 165 QGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALVIGP 224

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
           D NK CNL  CL+KQ R  EA+ VL+ V    + G     +  K   R +E+L++LE
Sbjct: 225 DNNKMCNLGICLMKQGRVLEAKDVLKQVRPAGVDGLRGADSHLKAYERAQEMLRDLE 281


>gi|302779988|ref|XP_002971769.1| hypothetical protein SELMODRAFT_441590 [Selaginella moellendorffii]
 gi|300160901|gb|EFJ27518.1| hypothetical protein SELMODRAFT_441590 [Selaginella moellendorffii]
          Length = 439

 Score =  308 bits (790), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 146/233 (62%), Positives = 185/233 (79%), Gaps = 1/233 (0%)

Query: 7   QPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQ 66
           + +HV+HK+P GD+PYV+AKHVQLV+KDP+ AI LFW AINAGDRVDSALKDMA+V+KQQ
Sbjct: 15  ESFHVLHKIPVGDTPYVKAKHVQLVDKDPDRAIALFWAAINAGDRVDSALKDMAIVMKQQ 74

Query: 67  DRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFN 126
           +R +EA+EAIKS R  C+ QAQE+LDNVL+DLYK+CGRLD+QI LLK+KL +I+ G AFN
Sbjct: 75  NRPEEAIEAIKSLRDRCTDQAQEALDNVLLDLYKRCGRLDDQIALLKRKLHLIHEGLAFN 134

Query: 127 GKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC 186
           GK TKTARS G+KFQV+++QE SR+LGNLGWAYMQ+ NY +AE VYRKA  I+PD+NK C
Sbjct: 135 GKRTKTARSQGRKFQVSIEQEASRLLGNLGWAYMQQSNYIAAEAVYRKALSIEPDSNKVC 194

Query: 187 NLSHCLIKQARYTEARSVLEDVLLGKLSGS-TETKTINRVKELLQELEPWQSI 238
           NL  C  KQ +  +AR  LE V     + S ++ KT  R +E+L EL   +S+
Sbjct: 195 NLGICFQKQGKLHDARVTLESVAPPAWNASPSQRKTYERAQEVLVELREMKSV 247


>gi|302760791|ref|XP_002963818.1| hypothetical protein SELMODRAFT_405259 [Selaginella moellendorffii]
 gi|300169086|gb|EFJ35689.1| hypothetical protein SELMODRAFT_405259 [Selaginella moellendorffii]
          Length = 439

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 146/233 (62%), Positives = 185/233 (79%), Gaps = 1/233 (0%)

Query: 7   QPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQ 66
           + +HV+HK+P GD+PYV+AKHVQLV+KDP+ AI LFW AINAGDRVDSALKDMA+V+KQQ
Sbjct: 15  ESFHVLHKIPVGDTPYVKAKHVQLVDKDPDRAIALFWAAINAGDRVDSALKDMAIVMKQQ 74

Query: 67  DRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFN 126
           +R +EA+EAIKS R  C+ QAQE+LDNVL+DLYK+CGRLD+QI LLK+KL +I+ G AFN
Sbjct: 75  NRPEEAIEAIKSLRDRCTDQAQEALDNVLLDLYKRCGRLDDQIALLKRKLHLIHEGLAFN 134

Query: 127 GKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC 186
           GK TKTARS G+KFQV+++QE SR+LGNLGWAYMQ+ NY +AE VYRKA  I+PD+NK C
Sbjct: 135 GKRTKTARSQGRKFQVSIEQEASRLLGNLGWAYMQQSNYIAAEAVYRKALSIEPDSNKVC 194

Query: 187 NLSHCLIKQARYTEARSVLEDVLLGKLSGS-TETKTINRVKELLQELEPWQSI 238
           NL  C  KQ +  +AR  LE V     + S ++ KT  R +E+L EL   +S+
Sbjct: 195 NLGICFQKQGKLHDARVTLESVAPPAWNASPSQRKTYERAQEVLVELREMKSV 247


>gi|255577530|ref|XP_002529643.1| conserved hypothetical protein [Ricinus communis]
 gi|223530869|gb|EEF32730.1| conserved hypothetical protein [Ricinus communis]
          Length = 482

 Score =  307 bits (786), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 152/258 (58%), Positives = 188/258 (72%), Gaps = 4/258 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           +S+TR + +H +HK+P GD+PYVRAK+VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 28  VSRTRSESFHAIHKVPVGDTPYVRAKNVQLVDKDPEKAIPLFWAAINAGDRVDSALKDMA 87

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL +I 
Sbjct: 88  IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLYLIQ 147

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY  AE  YR+A  I P
Sbjct: 148 QGLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYVEAEDAYRRALSIAP 207

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
           D NK CNL  CL+KQ R +EA+  L  V      G     +  K   R +++L++LE   
Sbjct: 208 DNNKMCNLGICLMKQGRISEAKDTLRRVKPAVADGPRGVDSHLKAYERAQQMLKDLESEM 267

Query: 237 SIPPSLTTKKSSLEDAFL 254
                   ++S L DAFL
Sbjct: 268 MNKGGDRVEQSRLFDAFL 285


>gi|449434138|ref|XP_004134853.1| PREDICTED: uncharacterized protein LOC101220568 [Cucumis sativus]
          Length = 500

 Score =  305 bits (781), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 149/237 (62%), Positives = 180/237 (75%), Gaps = 4/237 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           +S+ R  PYHV HK+P GD+PYVRAK+VQLV KDP+ AI LFW AINAGDRVDSALKDMA
Sbjct: 28  VSRLRPDPYHVTHKVPVGDTPYVRAKNVQLVAKDPDKAIPLFWAAINAGDRVDSALKDMA 87

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R+ CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL +I 
Sbjct: 88  IVMKQQNRAEEAIEAIKSLRNRCSDQAQESLDNILLDLYKRCGRLDDQIALLKHKLFLIQ 147

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY  AE  YRKA  I P
Sbjct: 148 QGLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYVEAEDAYRKALTIAP 207

Query: 181 DANKACNLSHCLIKQARYTEA----RSVLEDVLLGKLSGSTETKTINRVKELLQELE 233
           D NK CNL  CL+KQ R +EA    R V   V+ G     +  K   R +++L++LE
Sbjct: 208 DNNKMCNLGICLMKQGRISEAKENLRRVKPAVIDGPRGTDSHLKAYERAQQMLKDLE 264


>gi|224071331|ref|XP_002303408.1| predicted protein [Populus trichocarpa]
 gi|222840840|gb|EEE78387.1| predicted protein [Populus trichocarpa]
          Length = 233

 Score =  305 bits (780), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 148/233 (63%), Positives = 176/233 (75%), Gaps = 4/233 (1%)

Query: 5   RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
           R + +H +HK+P GDSPYVRAK+VQLV+KDPE AI LFW AINAGDRVDSALKDMA+V+K
Sbjct: 1   RSESFHALHKVPVGDSPYVRAKNVQLVDKDPEKAIPLFWAAINAGDRVDSALKDMAIVMK 60

Query: 65  QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
           QQ+R DEA+EAIKS RH CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL MI  G A
Sbjct: 61  QQNRADEAIEAIKSLRHRCSDQAQESLDNILLDLYKRCGRLDDQIALLKHKLYMIQQGLA 120

Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
           FNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY  AE  YR+A  I PD NK
Sbjct: 121 FNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYIEAEDAYRRALAIAPDNNK 180

Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
            CNL  CL+KQ R +EA+  L  V      G     +  K   R +++L +LE
Sbjct: 181 MCNLGICLMKQGRISEAKETLRRVKPAVADGPRGVDSHLKAYERAQQMLNDLE 233


>gi|449491318|ref|XP_004158859.1| PREDICTED: uncharacterized LOC101220568 [Cucumis sativus]
          Length = 409

 Score =  304 bits (779), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 149/237 (62%), Positives = 180/237 (75%), Gaps = 4/237 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           +S+ R  PYHV HK+P GD+PYVRAK+VQLV KDP+ AI LFW AINAGDRVDSALKDMA
Sbjct: 28  VSRLRPDPYHVTHKVPVGDTPYVRAKNVQLVAKDPDKAIPLFWAAINAGDRVDSALKDMA 87

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R+ CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL +I 
Sbjct: 88  IVMKQQNRAEEAIEAIKSLRNRCSDQAQESLDNILLDLYKRCGRLDDQIALLKHKLFLIQ 147

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY  AE  YRKA  I P
Sbjct: 148 QGLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYVEAEDAYRKALTIAP 207

Query: 181 DANKACNLSHCLIKQARYTEA----RSVLEDVLLGKLSGSTETKTINRVKELLQELE 233
           D NK CNL  CL+KQ R +EA    R V   V+ G     +  K   R +++L++LE
Sbjct: 208 DNNKMCNLGICLMKQGRISEAKENLRRVKPAVIDGPRGTDSHLKAYERAQQMLKDLE 264


>gi|356559262|ref|XP_003547919.1| PREDICTED: uncharacterized protein LOC100788369 [Glycine max]
          Length = 501

 Score =  304 bits (778), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 147/235 (62%), Positives = 178/235 (75%), Gaps = 4/235 (1%)

Query: 3   QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVV 62
           +TR + +HV HK+P GD+PYVRAK+VQLV KDPE AI LFW AINAGDRVDSALKDMA+V
Sbjct: 31  RTRSETFHVAHKVPIGDTPYVRAKNVQLVNKDPERAIPLFWAAINAGDRVDSALKDMAIV 90

Query: 63  LKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHG 122
           +KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL +I  G
Sbjct: 91  MKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQIALLKHKLYLIQQG 150

Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
            AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY  AE  YR+A LI PD 
Sbjct: 151 LAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYIEAEEAYRRALLIAPDN 210

Query: 183 NKACNLSHCLIKQARYTEARSVL----EDVLLGKLSGSTETKTINRVKELLQELE 233
           NK CNL  CL+KQ R  EA+  L      V+ G     +  K   R +++L++LE
Sbjct: 211 NKMCNLGICLMKQGRIGEAKETLYRVKPAVMDGPRGSDSHLKAYERAQQMLKDLE 265


>gi|147797595|emb|CAN75795.1| hypothetical protein VITISV_024890 [Vitis vinifera]
          Length = 471

 Score =  304 bits (778), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 151/258 (58%), Positives = 185/258 (71%), Gaps = 4/258 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           +S+TR + +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 24  VSRTRSESFHVTHKVPVGDTPYVRAKKVQLVDKDPEKAIPLFWAAINAGDRVDSALKDMA 83

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQ+SLDN+L+DLYK+CGRLD+QI LL+ KL +I 
Sbjct: 84  IVMKQQNRAEEAIEAIKSLRSRCSDQAQDSLDNILLDLYKRCGRLDDQIALLRHKLFLIQ 143

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS GKKFQV+V QE +R+LGNLGWA MQ+ NY  AE  YR+A  + P
Sbjct: 144 QGMAFNGKRTKTARSQGKKFQVSVGQEATRLLGNLGWALMQRNNYIEAEDAYRRALSMTP 203

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
           D NK CNL  CL+KQ R  EA+  L  V      G     +  K   R +++L +LE   
Sbjct: 204 DNNKMCNLGICLMKQGRILEAKETLRRVKPAVADGLRGVDSHLKAFERARQMLLDLESEM 263

Query: 237 SIPPSLTTKKSSLEDAFL 254
               S   ++S L DAFL
Sbjct: 264 LXKGSDPVEQSRLFDAFL 281


>gi|225431084|ref|XP_002265165.1| PREDICTED: uncharacterized protein LOC100257355 [Vitis vinifera]
          Length = 474

 Score =  302 bits (774), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 152/258 (58%), Positives = 184/258 (71%), Gaps = 4/258 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           +S+TR   +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 27  VSRTRSDSFHVTHKVPVGDTPYVRAKKVQLVDKDPEKAIPLFWSAINAGDRVDSALKDMA 86

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LL+ KL +I 
Sbjct: 87  IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQIALLRHKLFLIQ 146

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS GKKFQV+V QE +R+LGNLGWA MQ+ NY  AE  YR+A  + P
Sbjct: 147 QGMAFNGKRTKTARSQGKKFQVSVGQEATRLLGNLGWALMQQNNYIEAEDAYRRALSMAP 206

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
           D NK CNL  CL+KQ R  EA+  L  V      G     +  K   R +++L +LE   
Sbjct: 207 DNNKMCNLGICLMKQGRILEAKETLRRVKPAVADGPRGVDSHLKAFERARQMLLDLESEM 266

Query: 237 SIPPSLTTKKSSLEDAFL 254
               S   ++S L DAFL
Sbjct: 267 LNKGSDPVEQSRLFDAFL 284


>gi|359476811|ref|XP_002265051.2| PREDICTED: uncharacterized protein LOC100245548 [Vitis vinifera]
          Length = 1169

 Score =  302 bits (774), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 151/258 (58%), Positives = 185/258 (71%), Gaps = 4/258 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           +S+TR + +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 99  VSRTRSESFHVTHKVPVGDTPYVRAKKVQLVDKDPEKAIPLFWAAINAGDRVDSALKDMA 158

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQ+SLDN+L+DLYK+CGRLD+QI LL+ KL +I 
Sbjct: 159 IVMKQQNRAEEAIEAIKSLRSRCSDQAQDSLDNILLDLYKRCGRLDDQIALLRHKLFLIQ 218

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS GKKFQV+V QE +R+LGNLGWA MQ+ NY  AE  YR+A  + P
Sbjct: 219 QGMAFNGKRTKTARSQGKKFQVSVGQEATRLLGNLGWALMQRNNYIEAEDAYRRALSMTP 278

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
           D NK CNL  CL+KQ R  EA+  L  V      G     +  K   R +++L +LE   
Sbjct: 279 DNNKMCNLGICLMKQGRILEAKETLRRVKPAVADGLRGVDSHLKAFERARQMLLDLESEM 338

Query: 237 SIPPSLTTKKSSLEDAFL 254
               S   ++S L DAFL
Sbjct: 339 LNKGSDPVEQSRLFDAFL 356


>gi|297734982|emb|CBI17344.3| unnamed protein product [Vitis vinifera]
          Length = 395

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 151/258 (58%), Positives = 185/258 (71%), Gaps = 4/258 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           +S+TR + +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 27  VSRTRSESFHVTHKVPVGDTPYVRAKKVQLVDKDPEKAIPLFWAAINAGDRVDSALKDMA 86

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQ+SLDN+L+DLYK+CGRLD+QI LL+ KL +I 
Sbjct: 87  IVMKQQNRAEEAIEAIKSLRSRCSDQAQDSLDNILLDLYKRCGRLDDQIALLRHKLFLIQ 146

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS GKKFQV+V QE +R+LGNLGWA MQ+ NY  AE  YR+A  + P
Sbjct: 147 QGMAFNGKRTKTARSQGKKFQVSVGQEATRLLGNLGWALMQRNNYIEAEDAYRRALSMTP 206

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
           D NK CNL  CL+KQ R  EA+  L  V      G     +  K   R +++L +LE   
Sbjct: 207 DNNKMCNLGICLMKQGRILEAKETLRRVKPAVADGLRGVDSHLKAFERARQMLLDLESEM 266

Query: 237 SIPPSLTTKKSSLEDAFL 254
               S   ++S L DAFL
Sbjct: 267 LNKGSDPVEQSRLFDAFL 284


>gi|356502717|ref|XP_003520163.1| PREDICTED: uncharacterized protein LOC100803414 [Glycine max]
          Length = 500

 Score =  302 bits (773), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 151/256 (58%), Positives = 186/256 (72%), Gaps = 4/256 (1%)

Query: 3   QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVV 62
           +TR + +HV HK+P GD+PYVRAK+VQLV+KDPE AI LFW AINAGDRVDSALKDMA+V
Sbjct: 31  RTRSESFHVAHKVPVGDTPYVRAKNVQLVDKDPERAIPLFWAAINAGDRVDSALKDMAIV 90

Query: 63  LKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHG 122
           +KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL +I  G
Sbjct: 91  MKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQIGLLKHKLYLIQQG 150

Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
            AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY  AE  YR+A  I PD 
Sbjct: 151 LAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYIEAEDAYRRALSIAPDN 210

Query: 183 NKACNLSHCLIKQARYTEARSVL----EDVLLGKLSGSTETKTINRVKELLQELEPWQSI 238
           NK CNL  CL+KQ R  EA+  L      V+ G     +  K   R +++L++LE     
Sbjct: 211 NKMCNLGICLMKQGRIGEAKETLYRVKPAVMDGPRGSDSHLKAYERAQQMLKDLESEMMN 270

Query: 239 PPSLTTKKSSLEDAFL 254
                 ++S L +AFL
Sbjct: 271 KGVDRIEQSRLFEAFL 286


>gi|297734984|emb|CBI17346.3| unnamed protein product [Vitis vinifera]
          Length = 449

 Score =  302 bits (773), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 152/258 (58%), Positives = 184/258 (71%), Gaps = 4/258 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           +S+TR   +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 24  VSRTRSDSFHVTHKVPVGDTPYVRAKKVQLVDKDPEKAIPLFWSAINAGDRVDSALKDMA 83

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LL+ KL +I 
Sbjct: 84  IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQIALLRHKLFLIQ 143

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS GKKFQV+V QE +R+LGNLGWA MQ+ NY  AE  YR+A  + P
Sbjct: 144 QGMAFNGKRTKTARSQGKKFQVSVGQEATRLLGNLGWALMQQNNYIEAEDAYRRALSMAP 203

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
           D NK CNL  CL+KQ R  EA+  L  V      G     +  K   R +++L +LE   
Sbjct: 204 DNNKMCNLGICLMKQGRILEAKETLRRVKPAVADGPRGVDSHLKAFERARQMLLDLESEM 263

Query: 237 SIPPSLTTKKSSLEDAFL 254
               S   ++S L DAFL
Sbjct: 264 LNKGSDPVEQSRLFDAFL 281


>gi|224133684|ref|XP_002327655.1| predicted protein [Populus trichocarpa]
 gi|222836740|gb|EEE75133.1| predicted protein [Populus trichocarpa]
          Length = 454

 Score =  302 bits (773), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 150/258 (58%), Positives = 185/258 (71%), Gaps = 4/258 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           +S+TR + +  +HK+P GDSPYVRAK+VQLV+KDPE A+ LFW AINAGDRVDSALKDMA
Sbjct: 28  VSRTRSESFQAIHKVPVGDSPYVRAKNVQLVDKDPEKAVPLFWAAINAGDRVDSALKDMA 87

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS RH CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL +I 
Sbjct: 88  IVMKQQNRAEEAIEAIKSLRHRCSDQAQESLDNILLDLYKRCGRLDDQIALLKHKLFLIQ 147

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY  AE  YR+A    P
Sbjct: 148 QGLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYIEAEDAYRRALATAP 207

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
           D NK CNL  CL+KQ R  EA+  L  V      G     +  K   R +++L++LE   
Sbjct: 208 DNNKMCNLGICLMKQGRIGEAKETLRRVKPAVADGPRGVDSHLKAYERAQQMLKDLESEM 267

Query: 237 SIPPSLTTKKSSLEDAFL 254
                   ++  L DAFL
Sbjct: 268 MSKGGDRVEQRRLFDAFL 285


>gi|357517969|ref|XP_003629273.1| hypothetical protein MTR_8g075260 [Medicago truncatula]
 gi|355523295|gb|AET03749.1| hypothetical protein MTR_8g075260 [Medicago truncatula]
          Length = 479

 Score =  300 bits (767), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 143/235 (60%), Positives = 177/235 (75%), Gaps = 4/235 (1%)

Query: 3   QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVV 62
           +TR + +H+ HK+P GD+PYVRAK+VQLV+KDPE AI LFW AINAGDRVDSALKDMA+V
Sbjct: 30  RTRSESFHITHKVPIGDTPYVRAKNVQLVDKDPERAIPLFWAAINAGDRVDSALKDMAIV 89

Query: 63  LKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHG 122
           +KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LL+ KL +I  G
Sbjct: 90  MKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQIALLRHKLYLIQQG 149

Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
            AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY  AE  YR+A  + PD 
Sbjct: 150 LAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYIEAEEAYRRALCLAPDN 209

Query: 183 NKACNLSHCLIKQARYTEARSVLE----DVLLGKLSGSTETKTINRVKELLQELE 233
           NK CNL  CL+KQ R  EA+  L      V  G     +  K   R +++L++LE
Sbjct: 210 NKMCNLGICLMKQGRIAEAKETLHRVKPAVTDGPRGSDSHLKAYERAQQMLKDLE 264


>gi|15230439|ref|NP_190696.1| tetratricopeptide repeat domain-containing protein [Arabidopsis
           thaliana]
 gi|6562280|emb|CAB62650.1| MS5-like protein [Arabidopsis thaliana]
 gi|30102656|gb|AAP21246.1| At3g51280 [Arabidopsis thaliana]
 gi|110735963|dbj|BAE99956.1| MS5 like protein [Arabidopsis thaliana]
 gi|332645252|gb|AEE78773.1| tetratricopeptide repeat domain-containing protein [Arabidopsis
           thaliana]
          Length = 430

 Score =  300 bits (767), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 146/236 (61%), Positives = 179/236 (75%), Gaps = 4/236 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           +S+T+ + +H +HK+P GDSPYVRAK+VQLVEKDPE AI LFWKAINAGDRVDSALKDMA
Sbjct: 22  ISRTQSESFHAIHKVPVGDSPYVRAKNVQLVEKDPERAIPLFWKAINAGDRVDSALKDMA 81

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL +I 
Sbjct: 82  IVMKQQNRAEEAIEAIKSLRVRCSDQAQESLDNILLDLYKRCGRLDDQIGLLKHKLFLIQ 141

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ N+  AE  YR+A  I P
Sbjct: 142 KGLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQRDNFVEAEDAYRRALSIAP 201

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQEL 232
           D NK CNL  CL+KQ R  EA+  L  V    + G     +  K   R +++L +L
Sbjct: 202 DNNKMCNLGICLMKQGRIDEAKETLRRVKPAVVDGPRGVDSHLKAYERAQQMLNDL 257


>gi|297816392|ref|XP_002876079.1| hypothetical protein ARALYDRAFT_485479 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321917|gb|EFH52338.1| hypothetical protein ARALYDRAFT_485479 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 432

 Score =  297 bits (760), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 146/236 (61%), Positives = 178/236 (75%), Gaps = 4/236 (1%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           +S+T+ + +H +HK+P GDSPYVRAK+VQLVEKDPE AI LFWKAINAGDRVDSALKDMA
Sbjct: 22  ISRTQSESFHAIHKVPVGDSPYVRAKNVQLVEKDPERAIPLFWKAINAGDRVDSALKDMA 81

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK CGRLD+QI LLK KL +I 
Sbjct: 82  IVMKQQNRAEEAIEAIKSLRVRCSDQAQESLDNILLDLYKWCGRLDDQIGLLKHKLFLIQ 141

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ N+  AE  YR+A  I P
Sbjct: 142 KGLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQRDNFVEAEDAYRRALSIAP 201

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQEL 232
           D NK CNL  CL+KQ R  EA+  L  V    + G     +  K   R +++L +L
Sbjct: 202 DNNKMCNLGICLMKQGRIDEAKETLRRVKPAVVDGPRGVDSHLKAYERAQQMLNDL 257


>gi|194698136|gb|ACF83152.1| unknown [Zea mays]
 gi|414888333|tpg|DAA64347.1| TPA: hypothetical protein ZEAMMB73_633017 [Zea mays]
          Length = 516

 Score =  295 bits (755), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 158/294 (53%), Positives = 191/294 (64%), Gaps = 38/294 (12%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA+V+KQQ+R
Sbjct: 39  FHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWSAINAGDRVDSALKDMAIVMKQQNR 98

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYK---------------------------- 100
            +EA+EAIKS R  CS QAQESLDN+L+DLYK                            
Sbjct: 99  AEEAIEAIKSLRSRCSDQAQESLDNILLDLYKVRARTSSTLISLHVRSIHASGSETVLNL 158

Query: 101 ---KCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGW 157
              +CGRLD+QI LLK KL++I  G AFNGK TKTARS G+KFQVT++QE +R+LGNLGW
Sbjct: 159 NAQRCGRLDDQISLLKHKLQLINQGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGW 218

Query: 158 AYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG-- 215
           A MQK NYT AE  YR+A LI PD NK CNL  CL+KQAR  EA+ VL+ V    + G  
Sbjct: 219 ALMQKENYTEAEGAYRRALLIGPDNNKMCNLGICLMKQARVLEAKDVLKQVRPAAVDGLR 278

Query: 216 --STETKTINRVKELLQELEP-WQSIPPSLTTKKSSLEDAFLEGLDDLMNQWTP 266
              +  K   R +E+L++LE      PP+     + L DA L G    +  W P
Sbjct: 279 GADSHLKAYERAQEMLRDLETKLIGRPPADQLDTNWLFDALLLGSSSSI--WQP 330


>gi|118487530|gb|ABK95592.1| unknown [Populus trichocarpa]
          Length = 690

 Score =  288 bits (738), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 142/230 (61%), Positives = 177/230 (76%), Gaps = 7/230 (3%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HVVHK+P GDSPYVRAKHVQL+EKDP  A+ LFW AINAGDRVDSALKDMAVV+KQ DR
Sbjct: 53  FHVVHKVPAGDSPYVRAKHVQLIEKDPSKAVSLFWAAINAGDRVDSALKDMAVVMKQLDR 112

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            DEA+EAIKSFRHLC   +QES+DNVL++LYK+ GR++E+IE+L+ KL++I  G AF+GK
Sbjct: 113 ADEAIEAIKSFRHLCPCDSQESIDNVLVELYKRSGRIEEEIEMLQCKLKLIEEGIAFSGK 172

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TKTARSHG+K Q+TV+QE SRILGNL WAY+Q  +Y  AE  YRKA  ++PD NK CNL
Sbjct: 173 KTKTARSHGRKIQITVEQERSRILGNLAWAYLQHHDYGLAEQHYRKALSLEPDQNKQCNL 232

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGS-----TETKTINRVKELLQELE 233
           + CL+   R  EA+S+L+ V    LSGS     +  K+  R  ++L E E
Sbjct: 233 AICLMHMNRIPEAKSLLQTV--KALSGSKPMDDSYAKSFERACQILAEFE 280


>gi|224122150|ref|XP_002330553.1| predicted protein [Populus trichocarpa]
 gi|222872111|gb|EEF09242.1| predicted protein [Populus trichocarpa]
          Length = 232

 Score =  288 bits (738), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 142/230 (61%), Positives = 177/230 (76%), Gaps = 7/230 (3%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HVVHK+P GDSPYVRAKHVQL+EKDP  A+ LFW AINAGDRVDSALKDMAVV+KQ DR
Sbjct: 5   FHVVHKVPAGDSPYVRAKHVQLIEKDPSKAVSLFWAAINAGDRVDSALKDMAVVMKQLDR 64

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            DEA+EAIKSFRHLC   +QES+DNVL++LYK+ GR++E+IE+L+ KL++I  G AF+GK
Sbjct: 65  ADEAIEAIKSFRHLCPCDSQESIDNVLVELYKRSGRIEEEIEMLQCKLKLIEEGIAFSGK 124

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TKTARSHG+K Q+TV+QE SRILGNL WAY+Q  +Y  AE  YRKA  ++PD NK CNL
Sbjct: 125 KTKTARSHGRKIQITVEQERSRILGNLAWAYLQHHDYGLAEQHYRKALSLEPDQNKQCNL 184

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGS-----TETKTINRVKELLQELE 233
           + CL+   R  EA+S+L+ V    LSGS     +  K+  R  ++L E E
Sbjct: 185 AICLMHMNRIPEAKSLLQTV--KALSGSKPMDDSYAKSFERACQILAEFE 232


>gi|297740866|emb|CBI31048.3| unnamed protein product [Vitis vinifera]
          Length = 673

 Score =  287 bits (735), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 137/241 (56%), Positives = 181/241 (75%), Gaps = 3/241 (1%)

Query: 4   TRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVL 63
            +D  +HVVHK+P GDSPYVRAK VQL++KDP  AI LFW AIN+GDRVDSALKDMAVV+
Sbjct: 41  NKDDLFHVVHKVPAGDSPYVRAKQVQLIDKDPNRAISLFWAAINSGDRVDSALKDMAVVM 100

Query: 64  KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
           KQ +R DEA+EAIKSFRHLC +++QESLDNVL++LYK+ GRLDEQIE+L+ KL+ I  G 
Sbjct: 101 KQLNRSDEAIEAIKSFRHLCPQESQESLDNVLVELYKRSGRLDEQIEMLQYKLKNIDEGS 160

Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
           AF GK TK ARS GKK Q++++QE SR+LGNL WAY+Q+GNY +A  +Y++A  +DPD N
Sbjct: 161 AFGGKRTKIARSQGKKIQISIEQEKSRLLGNLAWAYLQQGNYKTAGELYKQALALDPDRN 220

Query: 184 KACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTE---TKTINRVKELLQELEPWQSIPP 240
           K CNL+ CL+   +  EA+++L  + +   +G  +    K+  R  ++L ELE    I P
Sbjct: 221 KECNLAICLMYMNKIKEAKAMLYAIQVSSQNGRMDDSYVKSFERASQVLTELEANSVIDP 280

Query: 241 S 241
           +
Sbjct: 281 N 281


>gi|359483722|ref|XP_002266141.2| PREDICTED: uncharacterized protein LOC100242406 [Vitis vinifera]
          Length = 941

 Score =  287 bits (734), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 137/241 (56%), Positives = 181/241 (75%), Gaps = 3/241 (1%)

Query: 4   TRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVL 63
            +D  +HVVHK+P GDSPYVRAK VQL++KDP  AI LFW AIN+GDRVDSALKDMAVV+
Sbjct: 41  NKDDLFHVVHKVPAGDSPYVRAKQVQLIDKDPNRAISLFWAAINSGDRVDSALKDMAVVM 100

Query: 64  KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
           KQ +R DEA+EAIKSFRHLC +++QESLDNVL++LYK+ GRLDEQIE+L+ KL+ I  G 
Sbjct: 101 KQLNRSDEAIEAIKSFRHLCPQESQESLDNVLVELYKRSGRLDEQIEMLQYKLKNIDEGS 160

Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
           AF GK TK ARS GKK Q++++QE SR+LGNL WAY+Q+GNY +A  +Y++A  +DPD N
Sbjct: 161 AFGGKRTKIARSQGKKIQISIEQEKSRLLGNLAWAYLQQGNYKTAGELYKQALALDPDRN 220

Query: 184 KACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTE---TKTINRVKELLQELEPWQSIPP 240
           K CNL+ CL+   +  EA+++L  + +   +G  +    K+  R  ++L ELE    I P
Sbjct: 221 KECNLAICLMYMNKIKEAKAMLYAIQVSSQNGRMDDSYVKSFERASQVLTELEANSVIDP 280

Query: 241 S 241
           +
Sbjct: 281 N 281


>gi|224061607|ref|XP_002300564.1| predicted protein [Populus trichocarpa]
 gi|222847822|gb|EEE85369.1| predicted protein [Populus trichocarpa]
          Length = 281

 Score =  283 bits (724), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 138/230 (60%), Positives = 175/230 (76%), Gaps = 7/230 (3%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HV+HK+P GDSPYVRAKHVQL+EKDP  A+ LFW AIN+GDRVDSALKDMAVV+KQ DR
Sbjct: 53  FHVIHKVPAGDSPYVRAKHVQLIEKDPSKAVSLFWAAINSGDRVDSALKDMAVVMKQLDR 112

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            DEA+EAIKSFRHLC   +QES+DNVL++LYK+ GR++E+IE+L++KL+ I  G AF+GK
Sbjct: 113 ADEAIEAIKSFRHLCPYDSQESIDNVLVELYKRSGRIEEEIEMLQRKLKNIEEGIAFSGK 172

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TKTARS G+K Q+TV+QE SRILGNL WAY+Q  +Y  AE  YRK   ++PD NK CNL
Sbjct: 173 KTKTARSQGRKIQITVEQERSRILGNLAWAYLQHHDYGLAEQYYRKGLSMEPDQNKQCNL 232

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGS-----TETKTINRVKELLQELE 233
           + CL+   R  EA+S+L+ V     SGS     +  K+  R  ++L ELE
Sbjct: 233 AICLMHMNRIPEAKSLLQTVKAS--SGSKPMDDSYAKSFERACQILTELE 280


>gi|357448553|ref|XP_003594552.1| hypothetical protein MTR_2g030510 [Medicago truncatula]
 gi|355483600|gb|AES64803.1| hypothetical protein MTR_2g030510 [Medicago truncatula]
          Length = 617

 Score =  277 bits (709), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 138/257 (53%), Positives = 184/257 (71%), Gaps = 6/257 (2%)

Query: 2   SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
           S ++D  +HV+HK+P GDSPYV+AK VQLV+KDP  AI LFW AINAGDRV+SALKDMA+
Sbjct: 44  SVSKDDLFHVIHKVPSGDSPYVKAKQVQLVDKDPGKAISLFWAAINAGDRVESALKDMAL 103

Query: 62  VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYH 121
           V+KQ +R DEA+EAIKSFRHLC   +QESLDN+L++LYK+ GR+DE+I +L QKL+ I  
Sbjct: 104 VMKQLNRSDEAIEAIKSFRHLCPSDSQESLDNILVELYKRSGRVDEEIGMLHQKLKQIED 163

Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
           G  + G+ TK ARS GKK Q++ +QE SRILGNL WAY+QKG+Y +AE  YRKA   + D
Sbjct: 164 GMTYVGRTTKHARSQGKKIQISAEQEISRILGNLAWAYLQKGDYKTAEEHYRKALSFEVD 223

Query: 182 ANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTE---TKTINRVKELLQELEPWQSI 238
            NK CNL+ CL++  + TEAR +L+ V     +   +    K+  R  ++LQE+E   S 
Sbjct: 224 RNKQCNLAICLMQMNKVTEARFLLQAVTAATKNRKMDDSFVKSYERATQMLQEME---ST 280

Query: 239 PPSLTTKKSSLEDAFLE 255
            PS+ + K   ++ F E
Sbjct: 281 APSVDSVKDKGDNKFNE 297


>gi|147797600|emb|CAN75800.1| hypothetical protein VITISV_024895 [Vitis vinifera]
          Length = 402

 Score =  273 bits (697), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 144/258 (55%), Positives = 175/258 (67%), Gaps = 13/258 (5%)

Query: 1   MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
           +S+TR   +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINA DRVDSALKDMA
Sbjct: 24  VSRTRSDSFHVTHKVPVGDTPYVRAKKVQLVDKDPEKAIPLFWAAINAXDRVDSALKDMA 83

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           +V+KQQ+R +EA+EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LL+ KL +I 
Sbjct: 84  IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQIALLRHKLFLIQ 143

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G AFNGK TKTARS GKKFQV+V QE +R+L N         NY  AE  YR+A  + P
Sbjct: 144 QGMAFNGKRTKTARSQGKKFQVSVGQEATRLLQN---------NYIEAEDAYRRALSMAP 194

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
           D NK CNL  CL+KQ R  EA+  L  V      G     +  K   R +++L +LE   
Sbjct: 195 DNNKMCNLGICLMKQGRILEAKETLRRVKPAVADGPRGVDSHLKAFERARQMLLDLESEM 254

Query: 237 SIPPSLTTKKSSLEDAFL 254
               S   ++S L DAFL
Sbjct: 255 LNXGSDPVEQSRLFDAFL 272


>gi|449435896|ref|XP_004135730.1| PREDICTED: uncharacterized protein LOC101215262 [Cucumis sativus]
          Length = 683

 Score =  269 bits (688), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 132/228 (57%), Positives = 168/228 (73%), Gaps = 3/228 (1%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HV+HK+P GDSPYV+AK VQL+EKDP  A+ LFW AINAGDRVDSALKDMAVV+KQ DR
Sbjct: 52  FHVIHKVPAGDSPYVKAKQVQLIEKDPSRAVSLFWAAINAGDRVDSALKDMAVVMKQLDR 111

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            DEA+EAIKSFRHLC   +QES+DNVLI+LYK+ GR++E+I++L++KL+ I  G  F GK
Sbjct: 112 SDEAIEAIKSFRHLCPYDSQESIDNVLIELYKRSGRIEEEIDMLQRKLKQIEDGTIFGGK 171

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TK ARS GKK Q+T++QE SR+LGNL WA++Q  N   AE  YRKA  ++ D NK CNL
Sbjct: 172 RTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLNNIYVAEDYYRKALSLEADNNKKCNL 231

Query: 189 SHCLIKQARYTEARSVLEDVLL---GKLSGSTETKTINRVKELLQELE 233
           + C I   R TEA+S+L+ V     GK +  +  K+  R   +L E E
Sbjct: 232 AICQILTNRLTEAKSLLQSVRASSGGKPTEESYAKSFERAFHMLTEKE 279


>gi|449488574|ref|XP_004158093.1| PREDICTED: uncharacterized protein LOC101226286 [Cucumis sativus]
          Length = 297

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 131/226 (57%), Positives = 166/226 (73%), Gaps = 3/226 (1%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +HV+HK+P GDSPYV+AK VQL+EKDP  A+ LFW AINAGDRVDSALKDMAVV+KQ DR
Sbjct: 52  FHVIHKVPAGDSPYVKAKQVQLIEKDPSRAVSLFWAAINAGDRVDSALKDMAVVMKQLDR 111

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            DEA+EAIKSFRHLC   +QES+DNVLI+LYK+ GR++E+I++L++KL+ I  G  F GK
Sbjct: 112 SDEAIEAIKSFRHLCPYDSQESIDNVLIELYKRSGRIEEEIDMLQRKLKQIEDGTIFGGK 171

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TK ARS GKK Q+T++QE SR+LGNL WA++Q  N   AE  YRKA  ++ D NK CNL
Sbjct: 172 RTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLNNIYVAEDYYRKALSLEADNNKKCNL 231

Query: 189 SHCLIKQARYTEARSVLEDVLL---GKLSGSTETKTINRVKELLQE 231
           + C I   R TEA+S+L+ V     GK    +  K+  R   +L E
Sbjct: 232 AICQILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERAFHMLTE 277


>gi|147809582|emb|CAN66650.1| hypothetical protein VITISV_043263 [Vitis vinifera]
          Length = 217

 Score =  266 bits (680), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 142/261 (54%), Positives = 175/261 (67%), Gaps = 47/261 (18%)

Query: 30  LVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE 89
           +VEKDPEAAIVLFW AINA DRV+SALKDMAVV+K+ DR +EA+EAIK+  HL +  +  
Sbjct: 1   MVEKDPEAAIVLFWNAINARDRVESALKDMAVVMKRLDRSEEAIEAIKN--HLITSSSTY 58

Query: 90  SLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETS 149
                                                   TKTARS+ KKFQ+T++QETS
Sbjct: 59  ----------------------------------------TKTARSNRKKFQITIEQETS 78

Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
           RILGNLGW YMQK NY +AEVVY+KAQ+IDPDANKACNL+ CLI Q RYTEA SVL++VL
Sbjct: 79  RILGNLGWVYMQKSNYMAAEVVYKKAQMIDPDANKACNLALCLINQGRYTEAHSVLKEVL 138

Query: 210 LGKLSGSTETKTINRVKELLQELEP-WQSIPPSLTTKKSS--LEDAFLEGLDDLMNQWTP 266
            GKL  S + K  NR +EL+ E+EP W  +PPS   K     LED F +G + ++N W P
Sbjct: 139 QGKLPNSEDCKAQNRAQELMLEVEPKW--LPPSEXIKLEGFDLEDDFSDGFEKVLNIWAP 196

Query: 267 YRSRRLPIFEEISPFRDQLAC 287
           +R++RLPIFEEIS +R+QLAC
Sbjct: 197 FRTKRLPIFEEISSYRNQLAC 217


>gi|356532429|ref|XP_003534775.1| PREDICTED: uncharacterized protein LOC100792980 [Glycine max]
          Length = 657

 Score =  261 bits (666), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 125/228 (54%), Positives = 166/228 (72%), Gaps = 3/228 (1%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +H++HK+P GDSPYV+AK VQLV+KDP  A+ LFW AINAGDRV+SALKDMA+V+KQ +R
Sbjct: 107 FHIIHKVPAGDSPYVKAKQVQLVDKDPGRAVSLFWAAINAGDRVESALKDMALVMKQLNR 166

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            DEA+EAI+SFRHLC   +Q+SLDN+L++LYK+ GR+DE+I +L  KL+ I  G  F G+
Sbjct: 167 SDEAIEAIRSFRHLCPSDSQDSLDNILVELYKRSGRVDEEIAMLCHKLKQIEDGLTFVGR 226

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TK ARS GKK Q+T +QE SRILGNL WAY+QKG+Y +AE  YRKA   + D NK CNL
Sbjct: 227 TTKQARSQGKKIQITAEQEISRILGNLAWAYLQKGDYKTAEEHYRKALSFEVDRNKQCNL 286

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTE---TKTINRVKELLQELE 233
           + CLI   +  EA+ +L+ V     +   +    K+  R  ++L E+E
Sbjct: 287 AICLIHMNKIKEAKFLLQAVRTATKNRKMDDSFAKSFERASQMLIEIE 334


>gi|102139799|gb|ABF69984.1| male sterility protein-related [Musa acuminata]
          Length = 778

 Score =  259 bits (661), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 125/222 (56%), Positives = 163/222 (73%), Gaps = 2/222 (0%)

Query: 19  DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS 78
           DSPY+RAKH Q+V+KD + A+ LFW AIN GDRVDSALKDMA+ +KQ +R +EA+EAIKS
Sbjct: 53  DSPYLRAKHAQMVDKDLQKAVPLFWAAINCGDRVDSALKDMALAMKQVNRAEEAIEAIKS 112

Query: 79  FRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGK 138
           FRHLCS + QES+DNVL++LYKKCGR+D+QIELL  KL+MI  G AF G+ TK  RS GK
Sbjct: 113 FRHLCSPKTQESIDNVLLELYKKCGRIDDQIELLNFKLKMIDEGLAFGGRRTKLTRSKGK 172

Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARY 198
           KF V++  E SR+LGNL WAYMQ  NY +AE +Y KA  I+ D NK CNL+ CL+K  R 
Sbjct: 173 KFHVSLDHEKSRLLGNLAWAYMQSENYETAETLYWKALAIEQDYNKQCNLAICLMKTGRL 232

Query: 199 TEARSVLEDV--LLGKLSGSTETKTINRVKELLQELEPWQSI 238
            EARS+L+ V      +      K+  +  ++L+E+EP +++
Sbjct: 233 EEARSILQVVKRASSNICDQFFVKSFKQASQMLKEIEPQENL 274


>gi|356558069|ref|XP_003547331.1| PREDICTED: uncharacterized protein LOC100794684 [Glycine max]
          Length = 595

 Score =  258 bits (658), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 124/228 (54%), Positives = 165/228 (72%), Gaps = 3/228 (1%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           +H++HK+P GDSPYV+AK VQLV+KDP  AI LFW AINA DRV+SALKDMA+V+KQ +R
Sbjct: 44  FHIIHKVPAGDSPYVKAKQVQLVDKDPGRAISLFWAAINARDRVESALKDMALVMKQLNR 103

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            DEA+EAI+SFRHLC   +++SLDN+L++LYK+ GR+DE+I +L  KL+ I  G  F G+
Sbjct: 104 SDEAIEAIRSFRHLCPSDSRDSLDNILVELYKRSGRIDEEIAMLHHKLKQIEDGLTFVGR 163

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
            TK ARS GKK Q+T +QE SRILGNL WAY+QKG+Y +AE  YRKA   + D NK CNL
Sbjct: 164 TTKQARSQGKKIQITAEQEISRILGNLAWAYLQKGDYKAAEEHYRKALSFEVDRNKQCNL 223

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTE---TKTINRVKELLQELE 233
           + CLI   +  EA+ +L+ V     +   +    K+  R  ++L E+E
Sbjct: 224 AICLIHMNKIKEAKFLLQAVRTATKNRKMDDSFAKSFERASQMLIEIE 271


>gi|297799972|ref|XP_002867870.1| male sterility MS5 [Arabidopsis lyrata subsp. lyrata]
 gi|297313706|gb|EFH44129.1| male sterility MS5 [Arabidopsis lyrata subsp. lyrata]
          Length = 426

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 123/236 (52%), Positives = 169/236 (71%), Gaps = 5/236 (2%)

Query: 2   SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
           S+ RD P+H+VHK+P GDSPYVRAKH QL++KDP  AI LFW AINAGDRVDSALKDM V
Sbjct: 44  SEKRD-PFHIVHKVPSGDSPYVRAKHAQLIDKDPNRAISLFWTAINAGDRVDSALKDMVV 102

Query: 62  VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYH 121
           V+KQ DR DE +EAI+SFR+LCS ++Q+S+DN+L++LYKK GR++E+ ELL+ KL+ +  
Sbjct: 103 VMKQLDRSDEGIEAIRSFRYLCSFESQDSIDNLLLELYKKSGRIEEEAELLEHKLKTLEQ 162

Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
           G  F G+  +  R  GK   +T++QE +R+LGNLGW ++Q  NY  AE  YR+A  ++ D
Sbjct: 163 GMGFGGRVIRAKRVQGKHVTMTIEQEKARVLGNLGWVHLQLHNYGIAEQHYRRALGLERD 222

Query: 182 ANKACNLSHCLIKQARYTEARSVLEDV----LLGKLSGSTETKTINRVKELLQELE 233
            NK CNL+ CL++  R  EA+S+L+DV       +       K+ +R  E+L E+E
Sbjct: 223 KNKQCNLAICLMRMGRIPEAKSLLDDVRDSPTESECGDEPFAKSYDRAVEMLAEIE 278


>gi|4028970|gb|AAC97106.1| pollenless3 [Arabidopsis thaliana]
          Length = 434

 Score =  254 bits (648), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 123/236 (52%), Positives = 168/236 (71%), Gaps = 4/236 (1%)

Query: 2   SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
           S  R  P+H+VHK+P GDSPYVRAKH QL++KDP  AI LFW AINAGDRVDSALKDMAV
Sbjct: 44  SSERRDPFHIVHKVPSGDSPYVRAKHAQLIDKDPNRAISLFWTAINAGDRVDSALKDMAV 103

Query: 62  VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYH 121
           V+KQ  R DE +EAIKSFR+LCS ++Q+S+DN+L++LYKK GR++E+  LL+ KL+ +  
Sbjct: 104 VMKQLGRSDEGIEAIKSFRYLCSFESQDSIDNLLLELYKKSGRIEEEAVLLEHKLQTLEQ 163

Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
           G  F G+ ++  R  GK   +T++QE +RILGNLGW ++Q  NY  AE  YR+A  ++ D
Sbjct: 164 GMGFGGRVSRAKRVQGKHVIMTIEQEKARILGNLGWVHLQLHNYGIAEQHYRRALGLERD 223

Query: 182 ANKACNLSHCLIKQARYTEARSVLEDV----LLGKLSGSTETKTINRVKELLQELE 233
            NK CNL+ CL++ +R  EA+S+L+DV       +       K+ +R  E+L E+E
Sbjct: 224 KNKLCNLAICLMRMSRIPEAKSLLDDVRDSPAESECGDEPFAKSYDRAVEMLAEIE 279


>gi|3859112|gb|AAC72541.1| male sterility MS5 [Arabidopsis thaliana]
          Length = 434

 Score =  254 bits (648), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 123/236 (52%), Positives = 168/236 (71%), Gaps = 4/236 (1%)

Query: 2   SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
           S  R  P+H+VHK+P GDSPYVRAKH QL++KDP  AI LFW AINAGDRVDSALKDMAV
Sbjct: 44  SSERRDPFHIVHKVPSGDSPYVRAKHAQLIDKDPNRAISLFWTAINAGDRVDSALKDMAV 103

Query: 62  VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYH 121
           V+KQ  R DE +EAIKSFR+LCS ++Q+S+DN+L++LYKK GR++E+  LL+ KL+ +  
Sbjct: 104 VMKQLGRSDEGIEAIKSFRYLCSFESQDSIDNLLLELYKKSGRIEEEAVLLEHKLQTLEQ 163

Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
           G  F G+ ++  R  GK   +T++QE +RILGNLGW ++Q  NY  AE  YR+A  ++ D
Sbjct: 164 GMGFGGRVSRAKRVQGKHVIMTIEQEKARILGNLGWVHLQLHNYGIAEQHYRRALGLERD 223

Query: 182 ANKACNLSHCLIKQARYTEARSVLEDV----LLGKLSGSTETKTINRVKELLQELE 233
            NK CNL+ CL++ +R  EA+S+L+DV       +       K+ +R  E+L E+E
Sbjct: 224 KNKLCNLAICLMRMSRIPEAKSLLDDVRDSPAESECGDEPFAKSYDRAVEMLAEIE 279


>gi|449532994|ref|XP_004173462.1| PREDICTED: uncharacterized protein LOC101227090, partial [Cucumis
           sativus]
          Length = 188

 Score =  252 bits (644), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 123/188 (65%), Positives = 148/188 (78%), Gaps = 1/188 (0%)

Query: 101 KCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYM 160
           KCGR++EQIELLK+KLRMIY GEAFNGKPT+TARSHGKKFQV+VKQETSR+LGNLGWAYM
Sbjct: 1   KCGRIEEQIELLKRKLRMIYQGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYM 60

Query: 161 QKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETK 220
           QK NY  AE VY+KAQ+IDPDANKACNL  CL+KQ R +EA  VLE V   ++ GS+E K
Sbjct: 61  QKPNYMMAEAVYKKAQMIDPDANKACNLGLCLMKQGRLSEAIFVLEQVQQAQIPGSSEIK 120

Query: 221 TINRVKELLQELEPWQSIPPSLTTKKSSLEDAFLEGLDDLMNQWTPY-RSRRLPIFEEIS 279
              R  +LL E+   QS+P S+     S++  FL GL+ L+N+  P+ RS+RLP+FEEIS
Sbjct: 121 AQKRSADLLTEIRSRQSLPDSIDLLGLSVDVDFLNGLELLVNKKGPFSRSKRLPVFEEIS 180

Query: 280 PFRDQLAC 287
            FRDQLAC
Sbjct: 181 SFRDQLAC 188


>gi|15233432|ref|NP_193822.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|5262780|emb|CAB45885.1| putative protein [Arabidopsis thaliana]
 gi|7268886|emb|CAB79090.1| putative protein [Arabidopsis thaliana]
 gi|332658973|gb|AEE84373.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 450

 Score =  243 bits (621), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 123/252 (48%), Positives = 168/252 (66%), Gaps = 20/252 (7%)

Query: 2   SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
           S  R  P+H+VHK+P GDSPYVRAKH QL++KDP  AI LFW AINAGDRVDSALKDMAV
Sbjct: 44  SSERRDPFHIVHKVPSGDSPYVRAKHAQLIDKDPNRAISLFWTAINAGDRVDSALKDMAV 103

Query: 62  VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYH 121
           V+KQ  R DE +EAIKSFR+LCS ++Q+S+DN+L++LYKK GR++E+  LL+ KL+ +  
Sbjct: 104 VMKQLGRSDEGIEAIKSFRYLCSFESQDSIDNLLLELYKKSGRIEEEAVLLEHKLQTLEQ 163

Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYR-------- 173
           G  F G+ ++  R  GK   +T++QE +RILGNLGW ++Q  NY  AE  YR        
Sbjct: 164 GMGFGGRVSRAKRVQGKHVIMTIEQEKARILGNLGWVHLQLHNYGIAEQHYRFGFVTKIP 223

Query: 174 --------KAQLIDPDANKACNLSHCLIKQARYTEARSVLEDV----LLGKLSGSTETKT 221
                   +A  ++ D NK CNL+ CL++ +R  EA+S+L+DV       +       K+
Sbjct: 224 NIDYCLVMRALGLERDKNKLCNLAICLMRMSRIPEAKSLLDDVRDSPAESECGDEPFAKS 283

Query: 222 INRVKELLQELE 233
            +R  E+L E+E
Sbjct: 284 YDRAVEMLAEIE 295


>gi|326504242|dbj|BAJ90953.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 639

 Score =  241 bits (616), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 119/232 (51%), Positives = 161/232 (69%), Gaps = 2/232 (0%)

Query: 16  PPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA 75
           P  DSPY RAK  QL+EKDP  A+ LFW AIN+G+R++SALKDMA VLKQ DR +EA+EA
Sbjct: 70  PAADSPYARAKQAQLIEKDPNKAVPLFWAAINSGERIESALKDMATVLKQADRAEEAIEA 129

Query: 76  IKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARS 135
           I+SFR  C  +AQESLDN+L+DLYKKCGR  EQIE+L  KLRMI    A     TK ++S
Sbjct: 130 IRSFRDRCPNEAQESLDNILLDLYKKCGRTKEQIEMLTVKLRMIDEDLASGRWKTKLSKS 189

Query: 136 HGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQ 195
           HG+ F ++++ E +R+LGNL WAYMQ  NY  AE++YR+A  I+ D NK CNL+ C++K 
Sbjct: 190 HGRVFYLSLRDEKARLLGNLAWAYMQSENYEEAEMLYRQALAIETDYNKECNLAICMMKM 249

Query: 196 ARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
            +  EA+ +L+ +     +     ++  R  E+L+ELE  Q++P  +T  KS
Sbjct: 250 GKVAEAKYLLQAIPY-NCNDENHVRSFARATEVLRELES-QALPSPITQMKS 299


>gi|297794971|ref|XP_002865370.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311205|gb|EFH41629.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 470

 Score =  236 bits (603), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 116/223 (52%), Positives = 155/223 (69%), Gaps = 3/223 (1%)

Query: 14  KLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV 73
           ++P GDSPYVRAKH QLV KDP+ AI LFW A+NAGDRVDSALK+M VVLK  DR DE +
Sbjct: 49  RVPTGDSPYVRAKHAQLVSKDPDRAISLFWAAVNAGDRVDSALKNMVVVLKHLDRSDEGI 108

Query: 74  EAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTA 133
           EAIKSFR+LC  ++Q+S+DN+L++LYKK GR+ E+ ELL+ KLR + H   + G+     
Sbjct: 109 EAIKSFRYLCPFESQDSIDNLLLELYKKSGRITEEAELLEHKLRTLEHDTHYGGRMKIAK 168

Query: 134 RSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLI 193
           RSHG++  +T++QE ++ILGNL W ++Q  NY  AE  YR A  ++PD NK CNL+ CL+
Sbjct: 169 RSHGEQNNMTIEQEKAQILGNLAWVHLQLHNYGIAEQYYRNALSLEPDNNKLCNLAICLM 228

Query: 194 KQARYTEARSVLEDV---LLGKLSGSTETKTINRVKELLQELE 233
           +  R  EA+ +LEDV   L  +  G     +  R  E+L E E
Sbjct: 229 RMDRIPEAKPLLEDVRQSLGNQWKGEPFRNSFERATEMLGERE 271


>gi|357144581|ref|XP_003573343.1| PREDICTED: uncharacterized protein LOC100825322 [Brachypodium
           distachyon]
          Length = 660

 Score =  234 bits (598), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 118/236 (50%), Positives = 163/236 (69%), Gaps = 4/236 (1%)

Query: 14  KLPPG--DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDE 71
           + PP   DSPYVRAK  QL+EKDP  A+ LFW AIN+G+R++SALKDMA VLKQ +R +E
Sbjct: 88  RAPPAVVDSPYVRAKQAQLIEKDPNKAVPLFWAAINSGERIESALKDMATVLKQANRAEE 147

Query: 72  AVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTK 131
           A+EAI+SFR  C  +AQ+SLDNVL+DLYKKCGR  EQIE+L  KLRM+    A     TK
Sbjct: 148 AIEAIRSFRDRCPNEAQDSLDNVLLDLYKKCGRTKEQIEMLTIKLRMVDEDLASGRWKTK 207

Query: 132 TARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHC 191
            ++SHG+   ++++ E +R+LGNL WA+MQ  NY  AE++YR+A  I+ D NK CNL+ C
Sbjct: 208 LSKSHGRVVYLSLRDEKARLLGNLAWAHMQSENYEEAEMLYRQALAIEADYNKECNLAVC 267

Query: 192 LIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
           L+K  +  EA+ +L+ +     S     ++  R  E+++ELE  Q++P  +T  KS
Sbjct: 268 LMKTGKVAEAKYLLQAIPYNS-SDEKHVRSFARATEMIKELES-QALPSPITQMKS 321


>gi|242080455|ref|XP_002444996.1| hypothetical protein SORBIDRAFT_07g002510 [Sorghum bicolor]
 gi|241941346|gb|EES14491.1| hypothetical protein SORBIDRAFT_07g002510 [Sorghum bicolor]
          Length = 712

 Score =  234 bits (597), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 123/246 (50%), Positives = 166/246 (67%), Gaps = 6/246 (2%)

Query: 22  YVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRH 81
           YVRAK  Q+VEKDP  A+ LFW AIN+GDR +SALKDMA VLKQ +R +EA+EAI+SFR 
Sbjct: 117 YVRAKQAQIVEKDPNKAVPLFWAAINSGDRTESALKDMANVLKQANRAEEAIEAIRSFRD 176

Query: 82  LCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQ 141
            C  +AQESLDN+L+DLYKKCGR DEQIE+L  KLR++    A     TK ++SHG+   
Sbjct: 177 RCPYEAQESLDNILLDLYKKCGRTDEQIEMLTIKLRIVDEELASGRWKTKMSKSHGRVVY 236

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEA 201
           ++++ E +R+LGNL WAYMQ  NY  AE++YR+A  I+ D NK CNL+ CL+K  +  EA
Sbjct: 237 LSLRDEKARLLGNLAWAYMQSENYEGAEMLYRQALAIEADYNKECNLAICLMKTGKVAEA 296

Query: 202 RSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAFL----EGL 257
           + +++ +        +  K+++R  E+L+ELE  QS+P  +T  KS     FL    E L
Sbjct: 297 KYLIQAIPY-NCDDESHVKSLSRATEMLRELE-LQSLPSPITQAKSKESQIFLADDVEML 354

Query: 258 DDLMNQ 263
            DL  Q
Sbjct: 355 VDLQPQ 360


>gi|125560036|gb|EAZ05484.1| hypothetical protein OsI_27700 [Oryza sativa Indica Group]
          Length = 813

 Score =  234 bits (596), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 112/215 (52%), Positives = 154/215 (71%), Gaps = 1/215 (0%)

Query: 19  DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS 78
           DSPYVRAK  Q++EKDP  A+ LFW AIN+GDR++SALKDMA VLKQ +R +EA+EAI+S
Sbjct: 85  DSPYVRAKQAQVIEKDPNKAVPLFWAAINSGDRIESALKDMATVLKQANRAEEAIEAIRS 144

Query: 79  FRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGK 138
           FR  C  +AQESLDN+L+DLYKKCGR  EQIE+L  KLR++    A     TK ++SHG+
Sbjct: 145 FRDRCPNEAQESLDNILLDLYKKCGRTKEQIEMLTLKLRIVDEELASGRWKTKLSKSHGR 204

Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARY 198
              ++++ E +R+LGNL WA+MQ  NY  AE++YR+A  I+ D NK CNL+ CLIK  + 
Sbjct: 205 VVYLSLRDEKARLLGNLAWAHMQSENYDEAEMLYRQALAIEADYNKECNLAICLIKTGKV 264

Query: 199 TEARSVLEDVLLGKLSGSTETKTINRVKELLQELE 233
            EA+ +L+ +     S  +  +++ R +E+L ELE
Sbjct: 265 AEAKYLLQSI-PDNCSDESHVRSLARAREMLMELE 298


>gi|29467532|dbj|BAC66721.1| putative pollenless3 [Oryza sativa Japonica Group]
          Length = 815

 Score =  233 bits (595), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 112/215 (52%), Positives = 154/215 (71%), Gaps = 1/215 (0%)

Query: 19  DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS 78
           DSPYVRAK  Q++EKDP  A+ LFW AIN+GDR++SALKDMA VLKQ +R +EA+EAI+S
Sbjct: 85  DSPYVRAKQAQVIEKDPNKAVPLFWAAINSGDRIESALKDMATVLKQANRAEEAIEAIRS 144

Query: 79  FRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGK 138
           FR  C  +AQESLDN+L+DLYKKCGR  EQIE+L  KLR++    A     TK ++SHG+
Sbjct: 145 FRDRCPNEAQESLDNILLDLYKKCGRTKEQIEMLTLKLRIVDEELASGRWKTKLSKSHGR 204

Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARY 198
              ++++ E +R+LGNL WA+MQ  NY  AE++YR+A  I+ D NK CNL+ CLIK  + 
Sbjct: 205 VVYLSLRDEKARLLGNLAWAHMQSENYDEAEMLYRQALAIEADYNKECNLAICLIKTGKV 264

Query: 199 TEARSVLEDVLLGKLSGSTETKTINRVKELLQELE 233
            EA+ +L+ +     S  +  +++ R +E+L ELE
Sbjct: 265 AEAKYLLQSI-PDNCSDESHVRSLARAREMLMELE 298


>gi|297607922|ref|NP_001060921.2| Os08g0130300 [Oryza sativa Japonica Group]
 gi|255678123|dbj|BAF22835.2| Os08g0130300 [Oryza sativa Japonica Group]
          Length = 769

 Score =  233 bits (595), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 112/215 (52%), Positives = 154/215 (71%), Gaps = 1/215 (0%)

Query: 19  DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS 78
           DSPYVRAK  Q++EKDP  A+ LFW AIN+GDR++SALKDMA VLKQ +R +EA+EAI+S
Sbjct: 85  DSPYVRAKQAQVIEKDPNKAVPLFWAAINSGDRIESALKDMATVLKQANRAEEAIEAIRS 144

Query: 79  FRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGK 138
           FR  C  +AQESLDN+L+DLYKKCGR  EQIE+L  KLR++    A     TK ++SHG+
Sbjct: 145 FRDRCPNEAQESLDNILLDLYKKCGRTKEQIEMLTLKLRIVDEELASGRWKTKLSKSHGR 204

Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARY 198
              ++++ E +R+LGNL WA+MQ  NY  AE++YR+A  I+ D NK CNL+ CLIK  + 
Sbjct: 205 VVYLSLRDEKARLLGNLAWAHMQSENYDEAEMLYRQALAIEADYNKECNLAICLIKTGKV 264

Query: 199 TEARSVLEDVLLGKLSGSTETKTINRVKELLQELE 233
            EA+ +L+ +     S  +  +++ R +E+L ELE
Sbjct: 265 AEAKYLLQSI-PDNCSDESHVRSLARAREMLMELE 298


>gi|222639859|gb|EEE67991.1| hypothetical protein OsJ_25931 [Oryza sativa Japonica Group]
          Length = 1338

 Score =  233 bits (593), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 112/215 (52%), Positives = 154/215 (71%), Gaps = 1/215 (0%)

Query: 19  DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS 78
           DSPYVRAK  Q++EKDP  A+ LFW AIN+GDR++SALKDMA VLKQ +R +EA+EAI+S
Sbjct: 610 DSPYVRAKQAQVIEKDPNKAVPLFWAAINSGDRIESALKDMATVLKQANRAEEAIEAIRS 669

Query: 79  FRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGK 138
           FR  C  +AQESLDN+L+DLYKKCGR  EQIE+L  KLR++    A     TK ++SHG+
Sbjct: 670 FRDRCPNEAQESLDNILLDLYKKCGRTKEQIEMLTLKLRIVDEELASGRWKTKLSKSHGR 729

Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARY 198
              ++++ E +R+LGNL WA+MQ  NY  AE++YR+A  I+ D NK CNL+ CLIK  + 
Sbjct: 730 VVYLSLRDEKARLLGNLAWAHMQSENYDEAEMLYRQALAIEADYNKECNLAICLIKTGKV 789

Query: 199 TEARSVLEDVLLGKLSGSTETKTINRVKELLQELE 233
            EA+ +L+ +     S  +  +++ R +E+L ELE
Sbjct: 790 AEAKYLLQSI-PDNCSDESHVRSLARAREMLMELE 823


>gi|326522530|dbj|BAK07727.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 641

 Score =  231 bits (590), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 115/229 (50%), Positives = 157/229 (68%), Gaps = 2/229 (0%)

Query: 19  DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS 78
           DSPY+RAK  QL+EKDP  A+ LFW AIN+G+R++SALKDMA VLKQ +R +EA+EAI+S
Sbjct: 72  DSPYLRAKQAQLIEKDPNKAVPLFWAAINSGERIESALKDMATVLKQANRAEEAIEAIRS 131

Query: 79  FRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGK 138
           FR  C  +AQESLDN+L+DLYKKCGR  EQIE+L  KLRM+    A      K ++SHG+
Sbjct: 132 FRDRCPNEAQESLDNILLDLYKKCGRTKEQIEMLTVKLRMVDEDLASGRWKAKLSKSHGR 191

Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARY 198
              + ++ E +R+LGNL WAYMQ  NY  AE++YR+A  ++ D NK CNL  CL+K  + 
Sbjct: 192 VVYLYLRDEKARLLGNLAWAYMQSENYEEAEMLYRQALAMEADYNKECNLGICLMKMGKV 251

Query: 199 TEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
            EA+ +L+ +     +     K+  R  E+L+ELE  Q++P  +T  KS
Sbjct: 252 AEAKYLLQPIPY-NCNDENHVKSFARATEMLRELES-QALPSPVTQMKS 298


>gi|449488572|ref|XP_004158092.1| PREDICTED: uncharacterized protein LOC101226064 [Cucumis sativus]
          Length = 719

 Score =  231 bits (590), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 118/207 (57%), Positives = 148/207 (71%), Gaps = 3/207 (1%)

Query: 30  LVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE 89
           L++KDP  A+ LFW AINAGDRVDSALKDMAVV+KQ DR DEA+EAIKSFRHLC   +QE
Sbjct: 9   LIDKDPNRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSDEAIEAIKSFRHLCPYDSQE 68

Query: 90  SLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETS 149
           S+DNVLI+LYK+ GR++E+I++L+ KL+ I  G  F GK TK ARS GKK Q+TV+QE S
Sbjct: 69  SIDNVLIELYKRSGRIEEEIDMLQCKLKQIEDGTVFGGKRTKAARSQGKKVQITVEQEKS 128

Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
           R+LGNL WA++Q  N   AE  YRKA  ++ D NK CNL+ CLI   R TEA+S+L+ V 
Sbjct: 129 RVLGNLAWAFLQLDNIYIAEEYYRKALSLESDNNKKCNLAICLILTNRLTEAKSLLQSVR 188

Query: 210 L---GKLSGSTETKTINRVKELLQELE 233
               GK    +  K+  R   +L E E
Sbjct: 189 ASSGGKPMEESYAKSFERASHMLAEKE 215


>gi|413941821|gb|AFW74470.1| hypothetical protein ZEAMMB73_887441 [Zea mays]
          Length = 690

 Score =  229 bits (584), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 120/246 (48%), Positives = 166/246 (67%), Gaps = 6/246 (2%)

Query: 22  YVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRH 81
           YVRAK  Q+VEKDP  A+ LFW AIN+GDR++SALKDMA VLKQ +R +EA+EAI+SFR 
Sbjct: 96  YVRAKQAQIVEKDPNKAVPLFWAAINSGDRIESALKDMANVLKQANRSEEAIEAIRSFRD 155

Query: 82  LCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQ 141
            C  +AQESLDN+L+DLYKKCGR DEQIE+L  KLR++    A     TK ++SHG+   
Sbjct: 156 RCPYEAQESLDNILLDLYKKCGRTDEQIEMLTLKLRIVDEELASGRWKTKLSKSHGRVVY 215

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEA 201
           ++++ E +R+LGNL WAYMQ  NY  AE++YR+A  I+ D NK CNL+ CL+K  +  EA
Sbjct: 216 LSLRDEKARLLGNLAWAYMQSENYEEAEMLYRQALAIEADYNKECNLAICLMKTGKLAEA 275

Query: 202 RSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF----LEGL 257
           + ++  +     +  +  K+++R  E+L+E +  QS+P  +T  KS     F    +E L
Sbjct: 276 KYLIHAIPY-NCNDESHVKSLSRATEMLREFD-LQSLPSPITQAKSKEPRNFVADDVEML 333

Query: 258 DDLMNQ 263
            DL  Q
Sbjct: 334 VDLQPQ 339


>gi|15241471|ref|NP_199246.1| tetratricopeptide repeat domain-containing protein [Arabidopsis
           thaliana]
 gi|10176888|dbj|BAB10118.1| male sterility MS5; pollenless3 [Arabidopsis thaliana]
 gi|67633856|gb|AAY78852.1| male sterility MS5 family protein [Arabidopsis thaliana]
 gi|332007714|gb|AED95097.1| tetratricopeptide repeat domain-containing protein [Arabidopsis
           thaliana]
          Length = 469

 Score =  229 bits (583), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 117/223 (52%), Positives = 151/223 (67%), Gaps = 3/223 (1%)

Query: 14  KLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV 73
           ++  GDSPYVRAKH QLV KDP  AI LFW AINAGDRVDSALKDM VVLKQ +R DE +
Sbjct: 49  RVRTGDSPYVRAKHAQLVSKDPNRAISLFWAAINAGDRVDSALKDMVVVLKQLNRFDEGI 108

Query: 74  EAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTA 133
           EAIKSFR+LC  ++Q+S+DN+L++LY K GR+ E  ELL+ KLR +   + + G+     
Sbjct: 109 EAIKSFRYLCPFESQDSIDNLLLELYMKSGRITEVAELLEHKLRTLEQDKHYGGRIKIAK 168

Query: 134 RSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLI 193
           RSH ++   T++QE +RILGNL W ++Q  NY  AE  YR A  ++PD NK CNL+ CLI
Sbjct: 169 RSHEEQNNKTIEQEKARILGNLAWVHLQLHNYGIAEQYYRNALSLEPDNNKLCNLAICLI 228

Query: 194 KQARYTEARSVLEDV---LLGKLSGSTETKTINRVKELLQELE 233
           +  R  EA+S+LEDV   L  +       K+  R  E+L E E
Sbjct: 229 RMERTHEAKSLLEDVKQSLGNQWKNEPFCKSFERATEMLAERE 271


>gi|116794097|gb|ABK27005.1| unknown [Picea sitchensis]
          Length = 248

 Score =  220 bits (561), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 108/182 (59%), Positives = 140/182 (76%), Gaps = 8/182 (4%)

Query: 59  MAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM 118
           MAVV+ QQ+R  EA+EAIKSFRHLCSKQAQE LDNVLIDL KKCGR DEQI LLK KLR+
Sbjct: 1   MAVVMNQQNRPLEAIEAIKSFRHLCSKQAQEPLDNVLIDLLKKCGRFDEQISLLKHKLRL 60

Query: 119 IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI 178
           I  G AFNGK TKTARSHG+KFQV++K+ET+R+LGNLG A+MQ+ +Y  AE+VYRKAQ++
Sbjct: 61  IQEGAAFNGKSTKTARSHGRKFQVSIKKETTRLLGNLGLAFMQQHDYCYAEIVYRKAQVL 120

Query: 179 DPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGST--------ETKTINRVKELLQ 230
           +PD NK CNLS CL++Q +  EA  +L+ +L    +           ++ +++RV+ LL+
Sbjct: 121 EPDDNKVCNLSVCLMRQGKVEEAMGLLQGLLNNNKNSMNYKHNNSRGKSNSLDRVEALLK 180

Query: 231 EL 232
           E+
Sbjct: 181 EI 182


>gi|168001280|ref|XP_001753343.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695629|gb|EDQ81972.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 608

 Score =  205 bits (521), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 106/233 (45%), Positives = 139/233 (59%), Gaps = 56/233 (24%)

Query: 5   RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
           R+  +H+VH +P G+SPYV+AKHVQLV+KDP+ AI LFW+AIN+GDRVDSALKDMA+V+K
Sbjct: 91  RNDAFHIVHNVPMGNSPYVKAKHVQLVDKDPDRAIALFWEAINSGDRVDSALKDMAIVMK 150

Query: 65  QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
           Q+ R  EA+E IKS R  CS +AQESLDNVL+DLYK                        
Sbjct: 151 QKSRPHEAIEVIKSLRSRCSDRAQESLDNVLLDLYK------------------------ 186

Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
                                       GNLGWAYMQ+  + +AE VYRKA LI+ D NK
Sbjct: 187 ----------------------------GNLGWAYMQQSKFLAAEAVYRKALLIESDVNK 218

Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKELLQELE 233
            CNL  CL+KQ R  EA+S+L  V+L     + +  +  K+  R +E++++LE
Sbjct: 219 VCNLGICLMKQGRLEEAKSILRSVILPCNDRRWTSDSHLKSFERAQEMMEKLE 271


>gi|326515788|dbj|BAK07140.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 423

 Score =  200 bits (509), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 98/165 (59%), Positives = 121/165 (73%), Gaps = 4/165 (2%)

Query: 73  VEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKT 132
            EAIKS R  CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL++I+ G AFNGK TKT
Sbjct: 1   TEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQLIHQGHAFNGKRTKT 60

Query: 133 ARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCL 192
           ARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE  YR+A +I PD NK CNL  CL
Sbjct: 61  ARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALVIGPDNNKMCNLGICL 120

Query: 193 IKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
           +KQ R  EA+ VL+ V    + G     +  K   R +E+L++LE
Sbjct: 121 MKQGRVLEAKDVLKQVRPAGVDGLRGADSHLKAYERAQEMLRDLE 165


>gi|168005732|ref|XP_001755564.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693271|gb|EDQ79624.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 141

 Score =  197 bits (501), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 93/144 (64%), Positives = 114/144 (79%), Gaps = 3/144 (2%)

Query: 30  LVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE 89
           LV+KDP+ AI LFW A+N GDRVDSALKDMA+V+KQQ+R  EA+EAIKS R+ CS +AQE
Sbjct: 1   LVDKDPDRAISLFWTAVNVGDRVDSALKDMAIVMKQQNRPQEAIEAIKSLRNRCSDEAQE 60

Query: 90  SLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETS 149
           SLDNVL+ LYK+CGRLD+QI++L  KL +++ G     K +KTARS GKKFQV + QE S
Sbjct: 61  SLDNVLLGLYKRCGRLDDQIDVLTHKLHLVHQGIV---KRSKTARSQGKKFQVLISQEAS 117

Query: 150 RILGNLGWAYMQKGNYTSAEVVYR 173
            ILGNLGWAYMQ+ N+  AE  YR
Sbjct: 118 SILGNLGWAYMQQSNFVGAEAAYR 141


>gi|224171113|ref|XP_002339460.1| predicted protein [Populus trichocarpa]
 gi|222875155|gb|EEF12286.1| predicted protein [Populus trichocarpa]
          Length = 130

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 76/131 (58%), Positives = 97/131 (74%), Gaps = 4/131 (3%)

Query: 160 MQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTET 219
           MQK N+ +AEVVY+KAQ+IDPDANKACNL+ CLIKQARY EARSVL DV  G+L GS + 
Sbjct: 1   MQKPNFIAAEVVYQKAQMIDPDANKACNLALCLIKQARYDEARSVLLDVSQGRLPGSGDV 60

Query: 220 KTINRVKELLQELEPWQSIPPSLTTKKS---SLEDAFLEGLDDLMNQWTPYRSRRLPIFE 276
           K+ +R +ELL E++  ++ P  LT         +D F++GL+ LM++W P RS+RLPIFE
Sbjct: 61  KSRSRAEELLMEVKSRET-PDELTDMLGFNLDEDDDFVKGLEKLMSEWAPSRSKRLPIFE 119

Query: 277 EISPFRDQLAC 287
           EIS FRD L C
Sbjct: 120 EISSFRDPLTC 130


>gi|3859114|gb|AAC72542.1| MS5-like protein [Arabidopsis thaliana]
          Length = 307

 Score =  123 bits (309), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 64/125 (51%), Positives = 80/125 (64%), Gaps = 4/125 (3%)

Query: 112 LKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVV 171
           LK KL +I  G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ N+  AE  
Sbjct: 1   LKHKLFLIQKGLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQRDNFVEAEDA 60

Query: 172 YRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKE 227
           YR+A  I PD NK CNL  CL+KQ R  EA+  L  V    + G     +  K   R ++
Sbjct: 61  YRRALSIAPDNNKMCNLGICLMKQGRIDEAKETLRRVKPAVVDGPRGVDSHLKAYERAQQ 120

Query: 228 LLQEL 232
           +L +L
Sbjct: 121 MLNDL 125


>gi|148657770|ref|YP_001277975.1| cold-shock protein, DNA-binding [Roseiflexus sp. RS-1]
 gi|148569880|gb|ABQ92025.1| Cold-shock protein, DNA-binding [Roseiflexus sp. RS-1]
          Length = 1555

 Score =  120 bits (300), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 75/204 (36%), Positives = 113/204 (55%), Gaps = 28/204 (13%)

Query: 9   YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
           Y  V  +P G +PY RAK VQLVEKD E A  L   AI  GD V+SA+KD+A +L Q  R
Sbjct: 477 YARVSGIPKGSNPYARAKRVQLVEKDLERAAQLLRDAIRQGDNVESAVKDLAALLVQLGR 536

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
            DEA++ ++  R   S   Q+S+DN+LI+ Y+  G+ D+ I LL ++L+           
Sbjct: 537 PDEAIQVLEKNRSRISN--QQSVDNMLINFYQNAGQHDKAISLLHKQLQ----------- 583

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-- 186
                       Q   + + ++IL  +   Y++K +Y  AE  +++ +   PD NK+   
Sbjct: 584 ------------QANTETKKAQILWQIAIGYLRKEDYVRAEQTFQELRRAQPD-NKSIQR 630

Query: 187 NLSHCLIKQARYTEARSVLEDVLL 210
           N++ CL KQ RY EAR +L+ +LL
Sbjct: 631 NIALCLFKQERYEEARKILDGILL 654


>gi|222625343|gb|EEE59475.1| hypothetical protein OsJ_11684 [Oryza sativa Japonica Group]
          Length = 241

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 63/162 (38%), Positives = 84/162 (51%), Gaps = 41/162 (25%)

Query: 14  KLPPG--DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDE 71
           + PP   DSPYVRAK  Q++EKDP  A+ LFW AIN+GDR++SALKDMA           
Sbjct: 105 RAPPSSVDSPYVRAKQAQVIEKDPNKAVPLFWAAINSGDRIESALKDMAT---------- 154

Query: 72  AVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTK 131
                                        KC R  EQIE+L  KL  +    A     TK
Sbjct: 155 -----------------------------KCDRTKEQIEMLTLKLIFVDEELASGRWKTK 185

Query: 132 TARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYR 173
            ++SHG+   ++++ E + +LGNL WA+MQ  NY  AE++YR
Sbjct: 186 LSKSHGRVVYLSLRDEKAWLLGNLAWAHMQSENYDGAEMLYR 227


>gi|255564367|ref|XP_002523180.1| hypothetical protein RCOM_1336590 [Ricinus communis]
 gi|223537587|gb|EEF39211.1| hypothetical protein RCOM_1336590 [Ricinus communis]
          Length = 618

 Score =  107 bits (267), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 49/64 (76%), Positives = 54/64 (84%)

Query: 2   SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
           S  R   +HVVHK+P GDSPYVRAKHVQL++KDP  AI LFW AINAGDRVDSALKDMAV
Sbjct: 40  SNCRRDLFHVVHKVPSGDSPYVRAKHVQLIDKDPSKAISLFWAAINAGDRVDSALKDMAV 99

Query: 62  VLKQ 65
           V+KQ
Sbjct: 100 VMKQ 103



 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/101 (43%), Positives = 58/101 (57%), Gaps = 11/101 (10%)

Query: 138 KKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQAR 197
           K   V +KQ    ILGNL WAY+Q  +Y  AE  YRKA  ++PD NK CNL+ CL+   +
Sbjct: 95  KDMAVVMKQ----ILGNLAWAYLQHHDYGLAEQHYRKALSVEPDYNKQCNLALCLMHMNK 150

Query: 198 YTEARSVLEDVLLGKLSGSTE-----TKTINRVKELLQELE 233
             EA+S+L+ V      GSTE      K+  R  E+L +LE
Sbjct: 151 IPEAKSLLQAV--SDSCGSTEMDESYAKSFERAVEMLNDLE 189


>gi|238481350|ref|NP_001154733.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332005697|gb|AED93080.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 237

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 52/72 (72%), Positives = 61/72 (84%)

Query: 81  HLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKF 140
           H+   QAQESL+NVLIDLYKK GR +EQ+ELLK +L MIY  EAFNGKP K ARSHG+KF
Sbjct: 66  HISYGQAQESLENVLIDLYKKGGRTEEQVELLKLQLWMIYQEEAFNGKPAKIARSHGRKF 125

Query: 141 QVTVKQETSRIL 152
           QVTV++ETSR+L
Sbjct: 126 QVTVEKETSRML 137


>gi|328953610|ref|YP_004370944.1| cold-shock protein DNA-binding protein [Desulfobacca acetoxidans
           DSM 11109]
 gi|328453934|gb|AEB09763.1| Cold-shock protein DNA-binding protein [Desulfobacca acetoxidans
           DSM 11109]
          Length = 1568

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 67/208 (32%), Positives = 109/208 (52%), Gaps = 26/208 (12%)

Query: 3   QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVV 62
           Q + + Y  V  +P G +PY RAK  QL+EKD + A+ L  +AI  GD  +SA+KD+A V
Sbjct: 475 QNKWREYTRVTGVPKGSTPYARAKRAQLIEKDLDKAVQLLREAIKRGDNTESAVKDLASV 534

Query: 63  LKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHG 122
           L QQ + +EA++ ++S + + S    +S+DN+LI  Y+  G+ D  I LL++KL+     
Sbjct: 535 LVQQGKPEEAIKTLESHKQVSSDP--KSIDNLLITSYQNAGQYDHAITLLQKKLK----- 587

Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD- 181
                             Q   + + +  L  +    ++KG+Y  +E  +RK     PD 
Sbjct: 588 ------------------QADNEIKRAPYLFQIALMNLKKGDYGQSEQFFRKILTKQPDN 629

Query: 182 ANKACNLSHCLIKQARYTEARSVLEDVL 209
                N++ CL KQ  Y EA+ +L+ +L
Sbjct: 630 VTVQRNIAICLSKQGHYDEAKKLLQKIL 657


>gi|242095292|ref|XP_002438136.1| hypothetical protein SORBIDRAFT_10g008600 [Sorghum bicolor]
 gi|241916359|gb|EER89503.1| hypothetical protein SORBIDRAFT_10g008600 [Sorghum bicolor]
          Length = 203

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 78/210 (37%), Positives = 103/210 (49%), Gaps = 49/210 (23%)

Query: 59  MAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM 118
           MAVV+KQQD ++EA+EAI+SFR+LCSKQ+QESL N+LIDL+K    L  QIELLK+ L +
Sbjct: 1   MAVVMKQQDPLEEAIEAIRSFRYLCSKQSQESLGNLLIDLHK----LRVQIELLKKNLEI 56

Query: 119 IYHG---------EAFNGKPTKTARSHG------------------KKFQVTVKQETSRI 151
            + G         E    KPT      G                  + F  T    + +I
Sbjct: 57  QWEGDEESTDPWQEVSGLKPTGDVSHSGTICASWFLLMLITYLPIDQVFSCTQLVSSLKI 116

Query: 152 LGNLGWAYMQKGNY-----------TSAEVVYRKAQLIDPDANKACNLSHCLIKQ---AR 197
           L      Y   G             +  E+VYR AQ I+ DAN+ACNL  CLI Q    R
Sbjct: 117 LP----VYHVTGQSRQGEHAVQQLPSCTEMVYRNAQTIESDANRACNLGLCLINQRVGKR 172

Query: 198 YTEARSVLEDVLLGKLSGSTETKTINRVKE 227
             E R  L  +LL ++  S   K + R ++
Sbjct: 173 RHEKRFQLRRILLYEIRRSMHEKVVARAEQ 202


>gi|50582704|gb|AAT78774.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108709730|gb|ABF97525.1| hypothetical protein LOC_Os03g40840 [Oryza sativa Japonica Group]
          Length = 274

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 51/143 (35%), Positives = 71/143 (49%), Gaps = 39/143 (27%)

Query: 30  LVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE 89
           ++EKDP  A+ LFW AIN+GDR++SALKDMA                             
Sbjct: 166 VIEKDPNKAVPLFWAAINSGDRIESALKDMAT---------------------------- 197

Query: 90  SLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETS 149
                      KC R  EQIE+L  KL  +    A     TK ++SHG+   ++++ E +
Sbjct: 198 -----------KCDRTKEQIEMLTLKLIFVDEELASGRWKTKLSKSHGRVVYLSLRDEKA 246

Query: 150 RILGNLGWAYMQKGNYTSAEVVY 172
            +LGNL WA+MQ  NY  AE++Y
Sbjct: 247 WLLGNLAWAHMQSENYDGAEMLY 269


>gi|334187848|ref|NP_001154732.2| uncharacterized protein [Arabidopsis thaliana]
 gi|332005696|gb|AED93079.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 201

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/72 (62%), Positives = 54/72 (75%), Gaps = 7/72 (9%)

Query: 81  HLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKF 140
           H+   QAQESL+N       K GR +EQ+ELLK +L MIY  EAFNGKP K ARSHG+KF
Sbjct: 66  HISYGQAQESLEN-------KGGRTEEQVELLKLQLWMIYQEEAFNGKPAKIARSHGRKF 118

Query: 141 QVTVKQETSRIL 152
           QVTV++ETSR+L
Sbjct: 119 QVTVEKETSRML 130


>gi|297808269|ref|XP_002872018.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317855|gb|EFH48277.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 84

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 40/62 (64%), Positives = 50/62 (80%), Gaps = 3/62 (4%)

Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD---ANK 184
           KPT TARSHGKKF VTV+++T+RILGN GWAYMQ  +YT+A+ VY KAQ+I+P+   AN 
Sbjct: 10  KPTNTARSHGKKFMVTVEKKTTRILGNFGWAYMQLRDYTAAKDVYWKAQVIEPEGMLANI 69

Query: 185 AC 186
            C
Sbjct: 70  KC 71


>gi|40538969|gb|AAR87226.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 160

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 36/56 (64%), Positives = 44/56 (78%), Gaps = 2/56 (3%)

Query: 14  KLPPG--DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
           + PP   DSPYVRAK  Q++EKDP  A+ LFW AIN+GDR++SALKDMA VLKQ +
Sbjct: 105 RAPPSSVDSPYVRAKQAQVIEKDPNKAVPLFWAAINSGDRIESALKDMATVLKQAN 160


>gi|297837461|ref|XP_002886612.1| hypothetical protein ARALYDRAFT_893498 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297332453|gb|EFH62871.1| hypothetical protein ARALYDRAFT_893498 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 123

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 42/84 (50%), Positives = 49/84 (58%), Gaps = 27/84 (32%)

Query: 119 IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRIL-------------------------- 152
           +Y GEAFNGKPT T RSHG+KFQVTV++ETSRIL                          
Sbjct: 17  LYQGEAFNGKPTNTTRSHGRKFQVTVEKETSRILMLIKKCGMKIVNSAKPVEQMKLLINT 76

Query: 153 -GNLGWAYMQKGNYTSAEVVYRKA 175
            GN GWAYMQ  +YT+A+ VY KA
Sbjct: 77  KGNFGWAYMQLKDYTTAKDVYLKA 100


>gi|218193290|gb|EEC75717.1| hypothetical protein OsI_12552 [Oryza sativa Indica Group]
          Length = 132

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 33/48 (68%), Positives = 40/48 (83%)

Query: 19  DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQ 66
           DSPYVRAK  Q++EKDP  A+ LFW AIN+GDR++SALKDMA VL +Q
Sbjct: 76  DSPYVRAKQAQVIEKDPNKAVPLFWAAINSGDRIESALKDMATVLNKQ 123


>gi|116792779|gb|ABK26494.1| unknown [Picea sitchensis]
          Length = 151

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 52/79 (65%), Gaps = 6/79 (7%)

Query: 160 MQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGST-- 217
           MQ  +Y SAE+VYRKAQ+++PD +KACNLS CLIKQ +  EA  +L+ VL          
Sbjct: 1   MQHDDYCSAEIVYRKAQVLEPDDHKACNLSVCLIKQGKVEEAMGLLQGVLNSNDDDHNNN 60

Query: 218 ----ETKTINRVKELLQEL 232
               ++K+++R + LL+E+
Sbjct: 61  NSRGKSKSLDRAEALLKEI 79


>gi|222616416|gb|EEE52548.1| hypothetical protein OsJ_34783 [Oryza sativa Japonica Group]
          Length = 102

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 34/60 (56%), Positives = 45/60 (75%), Gaps = 1/60 (1%)

Query: 44  KAINAGDRVDSALKDMAVVLKQQDRVDEAVEAI-KSFRHLCSKQAQESLDNVLIDLYKKC 102
           KAIN+ D+VDSALKDM VV+KQQD  +E +EAI +      S++ QESL+N+LID+Y  C
Sbjct: 22  KAINSRDKVDSALKDMVVVMKQQDHAEEEIEAISRQIIQARSRKTQESLNNLLIDMYMAC 81


>gi|303290781|ref|XP_003064677.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226453703|gb|EEH51011.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 316

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 42/105 (40%), Positives = 56/105 (53%), Gaps = 3/105 (2%)

Query: 22  YVRAKHVQLVEKDPEAAIVLFWKAI-NAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFR 80
           Y  AKH QL  KD   A+ L    I   G+   SALKD+  +LKQ  R D+AV AI+ +R
Sbjct: 145 YAEAKHAQLTRKDLPTALTLMVAEIERRGECAQSALKDLVCILKQMGRHDDAVRAIERYR 204

Query: 81  HL--CSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
                 ++ QESLDN+L+DLYK    L     +    +R  + GE
Sbjct: 205 GAWPMDERLQESLDNMLLDLYKHSRDLGAFSSITLVPVRPRWRGE 249


>gi|153869950|ref|ZP_01999452.1| Cold-shock protein, DNA-binding [Beggiatoa sp. PS]
 gi|152073584|gb|EDN70548.1| Cold-shock protein, DNA-binding [Beggiatoa sp. PS]
          Length = 501

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 57/92 (61%), Gaps = 3/92 (3%)

Query: 7   QPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQ 66
           Q Y+    LP G +P+  A+  +L EKD   A+ L+ +AI   D  +SA+K++A VL Q 
Sbjct: 412 QVYNNPPPLPSGSNPFACAERAKL-EKDLSKAVKLYRRAIKQKDHFESAVKNLASVLAQL 470

Query: 67  DRVDEAVEAIKSFRHLCSKQAQESLDNVLIDL 98
           DRV++A+E ++ +R       Q+S+DN+L++ 
Sbjct: 471 DRVEDAIEVLQQYRDEIDN--QKSIDNMLVNF 500


>gi|171702827|dbj|BAG16369.1| male sterility 5 family protein [Brassica rapa var. perviridis]
          Length = 77

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 26/37 (70%), Positives = 30/37 (81%), Gaps = 1/37 (2%)

Query: 2  SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAA 38
          S+ RD P+H+VHK+P GDSPYVRAKH QLV KDP  A
Sbjct: 42 SEKRD-PFHIVHKVPSGDSPYVRAKHAQLVSKDPNRA 77


>gi|171702812|dbj|BAG16362.1| male sterility 5 family protein [Brassica oleracea var. italica]
          Length = 75

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 24/37 (64%), Positives = 28/37 (75%)

Query: 2  SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAA 38
          S  +  P+H+VHK+P GDSPYVRAKH QLV KDP  A
Sbjct: 39 SSEKCDPFHIVHKVPSGDSPYVRAKHAQLVSKDPNRA 75


>gi|196232461|ref|ZP_03131314.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
 gi|196223533|gb|EDY18050.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
          Length = 760

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 47/175 (26%), Positives = 80/175 (45%), Gaps = 11/175 (6%)

Query: 44  KAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCG 103
           +A+    R  +A  ++ +V +   RVDEA+EA +  R L  + A     + L +L ++ G
Sbjct: 65  RALAIEPRNGAAYSNLGLVYRSLGRVDEAMEAYR--RALQLQPALPEPYHNLANLLRQTG 122

Query: 104 RLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQ-VTVKQETSRILGNLGWAYMQK 162
           RL+E +  L+Q +R+  HG   +        + G+K + +   QE  R+  N   AY   
Sbjct: 123 RLNEAVGWLQQAIRLRPHGVELHKNLGDVLSAAGRKDEAIAAYQEAIRLNPNFAEAYNNL 182

Query: 163 GNYTSAE-------VVYRKAQLIDPD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
           GN    E        V+ +AQ + PD A    NL+  L    ++  A +  +  L
Sbjct: 183 GNILRGERRLSEAITVFGEAQRLLPDSAEIHNNLAAALADDGQFAHADAAYQRAL 237


>gi|238478814|ref|NP_001154416.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332194374|gb|AEE32495.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 221

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 38/69 (55%), Gaps = 18/69 (26%)

Query: 118 MIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQL 177
           MIY  EAFNGKP K ARSHG+ F+   +++       LG+  +             + Q+
Sbjct: 1   MIYQEEAFNGKPAKIARSHGRNFRSRSRRKP------LGYCRVS------------ETQV 42

Query: 178 IDPDANKAC 186
           I+PDANKAC
Sbjct: 43  IEPDANKAC 51


>gi|334183195|ref|NP_001185185.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332194375|gb|AEE32496.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 211

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 38/69 (55%), Gaps = 18/69 (26%)

Query: 118 MIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQL 177
           MIY  EAFNGKP K ARSHG+ F+   +++       LG+  +             + Q+
Sbjct: 1   MIYQEEAFNGKPAKIARSHGRNFRSRSRRKP------LGYCRVS------------ETQV 42

Query: 178 IDPDANKAC 186
           I+PDANKAC
Sbjct: 43  IEPDANKAC 51


>gi|168049632|ref|XP_001777266.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162671368|gb|EDQ57921.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 460

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 4/65 (6%)

Query: 173 RKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKEL 228
           RKA LI+ D NK CN   CL+KQ R  EA +VL+ V +     + +  +  K+  R ++L
Sbjct: 16  RKALLIESDDNKVCNFGVCLMKQGRLEEAEAVLQRVTVACSGIRWASDSHLKSYERAQDL 75

Query: 229 LQELE 233
           L+ELE
Sbjct: 76  LKELE 80


>gi|357142312|ref|XP_003572530.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC-like [Brachypodium
           distachyon]
          Length = 1001

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 57/228 (25%), Positives = 93/228 (40%), Gaps = 38/228 (16%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P+ AI+ + +A+ A      A  ++A +  +Q ++D AV        +C  Q  E+ +N 
Sbjct: 297 PQDAIMSYQRALQARPDYAMAYGNLATIYYEQGQLDMAVRCYNQ-AIVCDPQFIEAYNN- 354

Query: 95  LIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNG--------KPTKTARSHGKKFQVTVK 145
           + +  K  GR++E I   +  L +   H +A               TA S  K   ++V 
Sbjct: 355 MGNALKDSGRVEEAINCYRSCLALQANHPQALTNLGNIYMEWNLVTTAASFYKA-AISVT 413

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVL 205
              S  L NL   Y Q+GNY  A   Y +   IDP A                       
Sbjct: 414 SGLSSPLNNLALIYKQQGNYADAITCYTEVLRIDPTA----------------------- 450

Query: 206 EDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
            D L+ +  G+T  K I RV E +Q+     +I P++    ++L  A+
Sbjct: 451 ADALVNR--GNT-FKEIGRVNEAIQDYVQAATIRPTMAEAHANLASAY 495


>gi|15222846|ref|NP_175416.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332194373|gb|AEE32494.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 230

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 38/69 (55%), Gaps = 18/69 (26%)

Query: 118 MIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQL 177
           MIY  EAFNGKP K ARSHG+ F+   +++       LG+  +             + Q+
Sbjct: 1   MIYQEEAFNGKPAKIARSHGRNFRSRSRRKP------LGYCRVS------------ETQV 42

Query: 178 IDPDANKAC 186
           I+PDANKAC
Sbjct: 43  IEPDANKAC 51


>gi|168056376|ref|XP_001780196.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668346|gb|EDQ54955.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 214

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 38/65 (58%), Gaps = 4/65 (6%)

Query: 173 RKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKEL 228
           RKA  I+ D NK CN S CL+KQ R  EA +VL+ V +     +    +  K+  R ++L
Sbjct: 26  RKAWSIEFDVNKVCNFSVCLMKQGRLEEAEAVLQRVTVACTGIRWPSDSHLKSYERAQDL 85

Query: 229 LQELE 233
           L+ELE
Sbjct: 86  LKELE 90


>gi|198282345|ref|YP_002218666.1| hypothetical protein Lferr_0200 [Acidithiobacillus ferrooxidans
           ATCC 53993]
 gi|198246866|gb|ACH82459.1| TPR repeat-containing protein [Acidithiobacillus ferrooxidans ATCC
           53993]
          Length = 505

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 54/184 (29%), Positives = 87/184 (47%), Gaps = 16/184 (8%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSK-QAQESLDNVLI 96
           AI  F KA+        AL  + V L +QDR+DEA   ++  + L  K    E+L N+ +
Sbjct: 274 AIGTFRKALTIKPDYLEALMRLGVALGRQDRMDEAETVLR--KALAIKPDYPEALMNLGV 331

Query: 97  DLYKKCGRLDEQIELLKQKLR--------MIYHGEAFN--GKPTKTARSHGKKFQVTVKQ 146
            L  + GR +E   LL+Q L         ++  G A +  G+    A +  ++  + ++ 
Sbjct: 332 TLGHE-GRWEETETLLRQALTIKPDYPEALMNLGAALSRQGRCLDEAENILRR-TLAIQP 389

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEARSVL 205
           +    L NLG    ++G +  AE + R+A  I P+ A+   NL   L KQ R  EA ++L
Sbjct: 390 DHPDALVNLGATLNKQGRWNEAETILRQAVAIKPEHADALINLGAVLSKQGRLDEAETIL 449

Query: 206 EDVL 209
              L
Sbjct: 450 RQAL 453



 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 50/182 (27%), Positives = 82/182 (45%), Gaps = 24/182 (13%)

Query: 40  VLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQ--ESLDNVLID 97
           + F KA++       AL ++  +L  Q RVD   EAI +FR   + +    E+L  + + 
Sbjct: 242 IAFRKALSIKPAYPEALMNLGAILSDQGRVD---EAIGTFRKALTIKPDYLEALMRLGVA 298

Query: 98  LYKKCGRLDEQIELLKQKLRM------------IYHGEAFNGKPTKTARSHGKKFQVTVK 145
           L ++  R+DE   +L++ L +            +  G     + T+T         +T+K
Sbjct: 299 LGRQ-DRMDEAETVLRKALAIKPDYPEALMNLGVTLGHEGRWEETETLLRQA----LTIK 353

Query: 146 QETSRILGNLGWAYMQKGN-YTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARS 203
            +    L NLG A  ++G     AE + R+   I PD   A  NL   L KQ R+ EA +
Sbjct: 354 PDYPEALMNLGAALSRQGRCLDEAENILRRTLAIQPDHPDALVNLGATLNKQGRWNEAET 413

Query: 204 VL 205
           +L
Sbjct: 414 IL 415


>gi|254445366|ref|ZP_05058842.1| tetratricopeptide repeat domain protein [Verrucomicrobiae bacterium
           DG1235]
 gi|198259674|gb|EDY83982.1| tetratricopeptide repeat domain protein [Verrucomicrobiae bacterium
           DG1235]
          Length = 463

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 45/165 (27%), Positives = 73/165 (44%), Gaps = 17/165 (10%)

Query: 61  VVLKQ---QDRVDEAVE-AIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQK- 115
           VV+KQ   Q ++D   E A+ +        A  + D +  + Y + G L   I+   Q  
Sbjct: 63  VVVKQVLDQLQLDNNKETALATILRANGPDATAAYDFIAANFYAEKGDLKNAIKYYDQAA 122

Query: 116 ------LRMIYHGEAF---NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYT 166
                 LR + +G       G+     +   K   +  K  T+  +G LG  Y+    Y 
Sbjct: 123 KKFPSFLRAVRNGAIMKVKEGRYEAARKDFTKAINLGAKDTTT--MGLLGLCYVNTEKYF 180

Query: 167 SAEVVYRKAQLIDPDANK-ACNLSHCLIKQARYTEARSVLEDVLL 210
           SAE  YR+A ++DPD       L+  L++Q +Y E  +VLE +L+
Sbjct: 181 SAETAYREAIVLDPDVKDWQVGLAKSLLQQRKYEEGIAVLEQILV 225


>gi|47848089|dbj|BAD21873.1| putative O-linked N-acetyl glucosamine transferase [Oryza sativa
           Japonica Group]
 gi|47848095|dbj|BAD21878.1| putative O-linked N-acetyl glucosamine transferase [Oryza sativa
           Japonica Group]
          Length = 953

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 51/224 (22%), Positives = 91/224 (40%), Gaps = 36/224 (16%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI+ + +A+ A      A  ++A +  +Q ++D A+        +C  Q  E+ +N + +
Sbjct: 301 AIISYQRAVQARPDYAMAYGNLATIYYEQGQLDIAIRCYNQ-AIICDPQFVEAYNN-MGN 358

Query: 98  LYKKCGRLDEQIELLKQKLRMIYH--------GEAFNGKPTKTARSHGKKFQVTVKQETS 149
             K  GR++E I   +  L +  +        G  +      +A +   K  ++V    S
Sbjct: 359 ALKDAGRVEEAINCYRSCLALQANHPQALTNLGNIYMEWNLISAAASFYKAAISVTSGLS 418

Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
             L NL   Y Q+GNY  A   Y +   +DP A                        D L
Sbjct: 419 SPLNNLAVIYKQQGNYADAITCYTEVLRVDPTA-----------------------ADAL 455

Query: 210 LGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
           + +  G+T  K I RV E +Q+     +I P++    ++L  A+
Sbjct: 456 VNR--GNT-FKEIGRVNEAIQDYIQAATIRPTMAEAHANLASAY 496


>gi|218190772|gb|EEC73199.1| hypothetical protein OsI_07263 [Oryza sativa Indica Group]
          Length = 1004

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 52/224 (23%), Positives = 91/224 (40%), Gaps = 36/224 (16%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI+ + +A+ A      A  ++A +  +Q ++D A+        +C  Q  E+ +N + +
Sbjct: 301 AIISYQRAVQARPDYAMAYGNLATIYYEQGQLDIAIRCYNQ-AIICDPQFVEAYNN-MGN 358

Query: 98  LYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTK-------TARSHGKKFQVTVKQETS 149
             K  GR++E I   +  L +   H +A              +A +   K  ++V    S
Sbjct: 359 ALKDAGRVEEAINCYRSCLALQANHPQALTNLGNIYMEWNLISAAASFYKAAISVTSGLS 418

Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
             L NL   Y Q+GNY  A   Y +   +DP A                        D L
Sbjct: 419 SPLNNLAVIYKQQGNYADAITCYTEVLRVDPTA-----------------------ADAL 455

Query: 210 LGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
           + +  G+T  K I RV E +Q+     +I P++    ++L  A+
Sbjct: 456 VNR--GNT-FKEIGRVNEAIQDYIQAATIRPTMAEAHANLASAY 496


>gi|224140549|ref|XP_002323645.1| predicted protein [Populus trichocarpa]
 gi|222868275|gb|EEF05406.1| predicted protein [Populus trichocarpa]
          Length = 682

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 61/229 (26%), Positives = 101/229 (44%), Gaps = 32/229 (13%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDN- 93
           P+ A+ LF+K +N G + D     M  VL    R+  A+E      +L +    E LDN 
Sbjct: 291 PKEALDLFFKMLNEGLKPDC--YAMVGVLCSCARLG-ALELGDWASNLIN--GNEFLDNS 345

Query: 94  ----VLIDLYKKCGRLDEQIEL---LKQKLRMIYH----GEAFNGKPTKTARSHGKKFQV 142
                LID+Y KCGR+D   E+   +++K R++++    G A +G         G+  + 
Sbjct: 346 VLGTALIDMYAKCGRMDRAWEVFRGMRKKDRVVWNAAISGLAMSGHVKDALGLFGQMEKS 405

Query: 143 TVKQETSRILGNL------GWAYMQKGNYTSAEVVY-------RKAQLIDPDANKAC-NL 188
            +K + +  +G L      G     +  + S E V+           ++D      C + 
Sbjct: 406 GIKPDRNTFVGLLCACTHAGLVEEGRRYFNSMECVFTLTPEIEHYGCMVDLLGRAGCLDE 465

Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQS 237
           +H LIK     EA +++   LLG      +T+ +  V + L  LEPW S
Sbjct: 466 AHQLIKSMP-MEANAIVWGALLGGCRLHRDTQLVEVVLKKLIALEPWHS 513


>gi|222622881|gb|EEE57013.1| hypothetical protein OsJ_06783 [Oryza sativa Japonica Group]
          Length = 1004

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 52/224 (23%), Positives = 90/224 (40%), Gaps = 36/224 (16%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI+ + +A+ A      A  ++A +  +Q ++D A+        +C  Q  E+ +N + +
Sbjct: 301 AIISYQRAVQARPDYAMAYGNLATIYYEQGQLDIAIRCYNQ-AIICDPQFVEAYNN-MGN 358

Query: 98  LYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTK-------TARSHGKKFQVTVKQETS 149
             K  GR+ E I   +  L +   H +A              +A +   K  ++V    S
Sbjct: 359 ALKDAGRVKEAINCYRSCLALQANHPQALTNLGNIYMEWNLISAAASFYKAAISVTSGLS 418

Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
             L NL   Y Q+GNY  A   Y +   +DP A                        D L
Sbjct: 419 SPLNNLAVIYKQQGNYADAITCYTEVLRVDPTA-----------------------ADAL 455

Query: 210 LGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
           + +  G+T  K I RV E +Q+     +I P++    ++L  A+
Sbjct: 456 VNR--GNT-FKEIGRVNEAIQDYIQAATIRPTMAEAHANLASAY 496


>gi|242065108|ref|XP_002453843.1| hypothetical protein SORBIDRAFT_04g019560 [Sorghum bicolor]
 gi|241933674|gb|EES06819.1| hypothetical protein SORBIDRAFT_04g019560 [Sorghum bicolor]
          Length = 1011

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 52/227 (22%), Positives = 92/227 (40%), Gaps = 36/227 (15%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P+ AI+ + +A+ A      A  ++A +  +Q ++D A+        +   Q  E+ +N 
Sbjct: 307 PQDAIMCYQRALQARPDYAMAYGNLATIYYEQGQLDMAIRCYNQ-AIVYDPQFVEAYNN- 364

Query: 95  LIDLYKKCGRLDEQIELLKQKLRMIYH--------GEAFNGKPTKTARSHGKKFQVTVKQ 146
           + +  K  GR++E I   +  L +  +        G  +      +A +   K  ++V  
Sbjct: 365 MGNALKDAGRVEEAINCYRSCLALQANHPQALTNLGNIYMEWSMISAAASFYKAAISVTS 424

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
             S  L NL   Y Q+GNY  A   Y +   IDP A                        
Sbjct: 425 GLSSPLNNLAVIYKQQGNYADAITCYTEVLRIDPTA-----------------------A 461

Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
           D L+ +  G+T  K I RV E +Q+     +I P++    ++L  A+
Sbjct: 462 DALVNR--GNT-FKEIGRVNEAIQDYVQAATIRPNMAEAHANLASAY 505


>gi|420251802|ref|ZP_14754961.1| Tfp pilus assembly protein PilF [Burkholderia sp. BT03]
 gi|398057091|gb|EJL49069.1| Tfp pilus assembly protein PilF [Burkholderia sp. BT03]
          Length = 617

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 44/163 (26%), Positives = 75/163 (46%), Gaps = 15/163 (9%)

Query: 55  ALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQ 114
           A  ++ VVL+Q  R  EA +A K       + A   ++  L  L +  GR+DE  E+  +
Sbjct: 158 AYNNLGVVLRQLGRPGEAEQAHKKAIATDPEHAGAHIN--LARLLESLGRVDEA-EVAYR 214

Query: 115 KLRMIY--HGEAFNGKPTKTARSHGK--------KFQVTVKQETSRILGNLGWAYMQKGN 164
           +   +Y  H E  N       ++ G+        +  + +K +  +   NLG     +  
Sbjct: 215 RGLCLYPEHAETHNNL-GAMLQAQGRMPDAETCYRRSLAIKPDFPKACFNLGIVLQTQKR 273

Query: 165 YTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLE 206
           Y  AE++YR+A  + PD  +A  NL+H L+   ++TE   V E
Sbjct: 274 YREAELLYRRALQVKPDMLEARLNLAHLLLATGQFTEGWQVFE 316


>gi|168017832|ref|XP_001761451.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687457|gb|EDQ73840.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 538

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 38/65 (58%), Gaps = 4/65 (6%)

Query: 173 RKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKEL 228
           ++A  I+ D NK CN   CL+KQ R  EA +VL+ V +     + +  +  K+  R ++L
Sbjct: 87  KRALSIESDVNKVCNFGVCLMKQGRLEEAEAVLQRVTVAYSGIRWASDSHLKSYERAQDL 146

Query: 229 LQELE 233
           L+ELE
Sbjct: 147 LKELE 151


>gi|425437156|ref|ZP_18817582.1| Similar to tr|Q7NLR0|Q7NLR0 [Microcystis aeruginosa PCC 9432]
 gi|389677907|emb|CCH93194.1| Similar to tr|Q7NLR0|Q7NLR0 [Microcystis aeruginosa PCC 9432]
          Length = 776

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 50/167 (29%), Positives = 73/167 (43%), Gaps = 35/167 (20%)

Query: 59  MAVVLKQQDRVDEA----VEAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGRLDEQIEL 111
           MA + + Q R  EA     EA+  F+            SL+N+   LY+  GR  E   L
Sbjct: 1   MAFLYQYQGRYSEAEPLYKEALFIFKQQLGNNHPSTAASLNNLAF-LYRIQGRYSEAEPL 59

Query: 112 LKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVV 171
           LKQ L +         +  +   +H          +T++ L NL   Y+ +G Y+ AE +
Sbjct: 60  LKQSLAI---------RKQQLGDNH---------PDTAQSLNNLAELYLSQGRYSEAEPL 101

Query: 172 Y------RKAQLID--PD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
           Y      RK QL D  PD A    NL+   + Q RY+EA  +L+  L
Sbjct: 102 YKQALTIRKQQLGDNHPDVAQSLNNLAELYLSQGRYSEAEPLLKQSL 148



 Score = 37.4 bits (85), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 45/171 (26%), Positives = 69/171 (40%), Gaps = 35/171 (20%)

Query: 55  ALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQ-------AQESLDNVLIDLYKKCGRLDE 107
           +L ++A +   Q R  EA    K    +  +Q         +SL+N L +LY   GR  E
Sbjct: 81  SLNNLAELYLSQGRYSEAEPLYKQALTIRKQQLGDNHPDVAQSLNN-LAELYLSQGRYSE 139

Query: 108 QIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTS 167
              LLKQ L +         +  +   +H          +T++ L NL   Y  +G Y+ 
Sbjct: 140 AEPLLKQSLAI---------RKQQLGDNH---------PDTAQSLNNLAGLYQSQGRYSE 181

Query: 168 AEVVYRKAQLI--------DPD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
           AE +Y++A  I         PD A    NL+     Q +Y EA  +    L
Sbjct: 182 AEPLYKEALFIFKQQLGNNHPDVAQTLNNLATLYWNQGKYPEADELFSQGL 232


>gi|413936917|gb|AFW71468.1| hypothetical protein ZEAMMB73_864118 [Zea mays]
          Length = 517

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 52/227 (22%), Positives = 92/227 (40%), Gaps = 36/227 (15%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P+ AI+ + +A+ A      A  ++A +  +Q ++D A+        +   Q  E+ +N 
Sbjct: 2   PQDAIMCYQRALQARPDYAMAYGNLATIYYEQGQLDMAIRCYNQ-AIVYDPQFIEAYNN- 59

Query: 95  LIDLYKKCGRLDEQIELLKQKLRMIYH--------GEAFNGKPTKTARSHGKKFQVTVKQ 146
           + +  K  GR++E I   +  L +  +        G  +      +A +   K  ++V  
Sbjct: 60  MGNALKDAGRVEEAINCYRSCLALQANHPQALTNLGNIYMEWNMISAATSFYKAAISVTS 119

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
             S  L NL   Y Q+GNY  A   Y +   IDP A                        
Sbjct: 120 GLSSPLNNLAVIYKQQGNYADAITCYTEVLRIDPTA-----------------------A 156

Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
           D L+ +  G+T  K I RV E +Q+     +I P++    ++L  A+
Sbjct: 157 DALVNR--GNT-FKEIGRVNEAIQDYVQAATIRPNMPEAHANLASAY 200


>gi|434398683|ref|YP_007132687.1| Glycosyltransferase AER61, uncharacterized [Stanieria cyanosphaera
           PCC 7437]
 gi|428269780|gb|AFZ35721.1| Glycosyltransferase AER61, uncharacterized [Stanieria cyanosphaera
           PCC 7437]
          Length = 900

 Score = 46.6 bits (109), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 43/171 (25%), Positives = 78/171 (45%), Gaps = 22/171 (12%)

Query: 34  DPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVE-----------AIKSFRHL 82
           D + AI  + + I       +   ++ V+  Q+ ++ EA+E            IK++ +L
Sbjct: 120 DWQGAIAAYRRTIALKPGYAAVYFNLGVLYDQRTQLTEAIENYYQAIKLQPNYIKAYSNL 179

Query: 83  CSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQV 142
            S  A++   N  I++Y++  +LD     L   L  ++    FN +P +   S    F+ 
Sbjct: 180 GSIFAKQKKLNAAIEIYQQGLKLDPTWGTLHNNLGQVF---WFNEQPDRALAS----FET 232

Query: 143 TVKQETSRILG--NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHC 191
            V  E +  L   NLG  + Q+GN+ +A   Y+K  +I+ + N     SHC
Sbjct: 233 AVIVEPNMALAHHNLGKLWQQQGNFNNAIAHYQK--VIELEPNNIWAYSHC 281


>gi|224123642|ref|XP_002319130.1| predicted protein [Populus trichocarpa]
 gi|222857506|gb|EEE95053.1| predicted protein [Populus trichocarpa]
          Length = 923

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 51/227 (22%), Positives = 88/227 (38%), Gaps = 36/227 (15%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P+ AIV + +A+ A  +   A  ++A    ++ ++D A+   K     C ++  E+ +N 
Sbjct: 222 PQEAIVCYQQAVQARPKYAMAFGNLASTYYERGQLDLAILHYKQ-AIACDQRFLEAYNN- 279

Query: 95  LIDLYKKCGRLDEQIELLKQKLRM--------IYHGEAFNGKPTKTARSHGKKFQVTVKQ 146
           L +  K  GR+DE I+   Q L +           G  +       A +   K  + V  
Sbjct: 280 LGNALKDVGRVDEAIQCYNQCLSLQPNHPQALTNLGNIYMEWNMSAAAASCYKATLAVTT 339

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
             S    NL   Y Q+GNY+ A   Y +   I+P A                        
Sbjct: 340 GLSAPFSNLAVIYKQQGNYSDAISCYNEVLRIEPLAAD---------------------- 377

Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
               G ++     K I RV E +Q+     +I P++    ++L  A+
Sbjct: 378 ----GLVNRGNTYKEIGRVSEAIQDYINAITIRPNMAEAHANLASAY 420


>gi|430747186|ref|YP_007206315.1| O-linked N-acetylglucosamine transferase, SPINDLY family
           [Singulisphaera acidiphila DSM 18658]
 gi|430018906|gb|AGA30620.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Singulisphaera acidiphila DSM 18658]
          Length = 827

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 80/179 (44%), Gaps = 15/179 (8%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E A+V F  AI        A  +M+VVLKQ  R+DEA+ + +    L    A     N L
Sbjct: 191 EEAVVRFQAAIRLRPEFAGAYSNMSVVLKQLGRLDEAITSGREAVRLDPGFA--GAHNNL 248

Query: 96  IDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGKKFQ---------VTVK 145
             + +K G  DE      + LR+     EA+    +  +R    +F+         +T+ 
Sbjct: 249 GVILEKEGGWDEATTCFHEALRLDPRFVEAYYNLGSVLSRL--GRFEDAESICRQAITLD 306

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEARS 203
            +++    NL +A  ++G  T AE  YR+A  + P+  +   NL+  L K  +  EA +
Sbjct: 307 PDSAEAHHNLAFALSERGQLTEAETNYRRAIQLKPEFVDPYVNLTSVLGKFGKLDEAEA 365



 Score = 41.2 bits (95), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 46/176 (26%), Positives = 82/176 (46%), Gaps = 13/176 (7%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AIV   + +     +   L D+ ++L+   +++EAV   ++   L  + A  +  N+ + 
Sbjct: 159 AIVALEQVLRIKPELPETLNDLGLLLEMTGQLEEAVVRFQAAIRLRPEFAG-AYSNMSVV 217

Query: 98  LYKKCGRLDEQIELLKQKLRM--IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI---- 151
           L K+ GRLDE I   ++ +R+   + G A N       +  G     T   E  R+    
Sbjct: 218 L-KQLGRLDEAITSGREAVRLDPGFAG-AHNNLGVILEKEGGWDEATTCFHEALRLDPRF 275

Query: 152 ---LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARS 203
                NLG    + G +  AE + R+A  +DPD+ +A  NL+  L ++ + TEA +
Sbjct: 276 VEAYYNLGSVLSRLGRFEDAESICRQAITLDPDSAEAHHNLAFALSERGQLTEAET 331


>gi|409993902|ref|ZP_11277028.1| hypothetical protein APPUASWS_22358 [Arthrospira platensis str.
           Paraca]
 gi|409935250|gb|EKN76788.1| hypothetical protein APPUASWS_22358 [Arthrospira platensis str.
           Paraca]
          Length = 917

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 43/174 (24%), Positives = 82/174 (47%), Gaps = 21/174 (12%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AI  + KAI    ++  A  DMA+ L ++  ++ A++ ++  R  C    +  + N L
Sbjct: 123 EGAIGCYQKAIALNPKLGEAYLDMALRLNERGDINTAIKVLQEGRINCPNFKE--IFNTL 180

Query: 96  IDLYKKCGRLDEQIELLKQKLR------MIYH--GEAF--NGKPTKTARSHGKKFQVTVK 145
             L  +  ++DE I + ++ L       ++Y+  G AF   GK ++   ++ K   +++K
Sbjct: 181 GYLQLQQNQIDEAIAIFQEALSIDPTEPLVYNNLGWAFARQGKLSEAIAAYHKA--ISLK 238

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-------DANKACNLSHCL 192
            + +    NLG  +  K N+  A   ++KA  I+P       D   +C +  CL
Sbjct: 239 PDLAIAYSNLGKLWQHKNNHRQAITYFQKAIAIEPDNIMFYSDCGNSCLIIGCL 292


>gi|293334567|ref|NP_001167956.1| uncharacterized protein LOC100381672 [Zea mays]
 gi|223945123|gb|ACN26645.1| unknown [Zea mays]
          Length = 1009

 Score = 46.2 bits (108), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 52/227 (22%), Positives = 92/227 (40%), Gaps = 36/227 (15%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P+ AI+ + +A+ A      A  ++A +  +Q ++D A+        +   Q  E+ +N 
Sbjct: 305 PQDAIMCYQRALQARPDYAMAYGNLATIYYEQGQLDMAIRCYNQ-AIVYDPQFIEAYNN- 362

Query: 95  LIDLYKKCGRLDEQIELLKQKLRMIYH--------GEAFNGKPTKTARSHGKKFQVTVKQ 146
           + +  K  GR++E I   +  L +  +        G  +      +A +   K  ++V  
Sbjct: 363 MGNALKDAGRVEEAINCYRSCLALQANHPQALTNLGNIYMEWNMISAATSFYKAAISVTS 422

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
             S  L NL   Y Q+GNY  A   Y +   IDP A                        
Sbjct: 423 GLSSPLNNLAVIYKQQGNYADAITCYTEVLRIDPTA-----------------------A 459

Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
           D L+ +  G+T  K I RV E +Q+     +I P++    ++L  A+
Sbjct: 460 DALVNR--GNT-FKEIGRVNEAIQDYVQAATIRPNMPEAHANLASAY 503


>gi|413936918|gb|AFW71469.1| hypothetical protein ZEAMMB73_864118 [Zea mays]
          Length = 706

 Score = 46.2 bits (108), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 52/227 (22%), Positives = 92/227 (40%), Gaps = 36/227 (15%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P+ AI+ + +A+ A      A  ++A +  +Q ++D A+        +   Q  E+ +N 
Sbjct: 2   PQDAIMCYQRALQARPDYAMAYGNLATIYYEQGQLDMAIRCYNQ-AIVYDPQFIEAYNN- 59

Query: 95  LIDLYKKCGRLDEQIELLKQKLRMIYH--------GEAFNGKPTKTARSHGKKFQVTVKQ 146
           + +  K  GR++E I   +  L +  +        G  +      +A +   K  ++V  
Sbjct: 60  MGNALKDAGRVEEAINCYRSCLALQANHPQALTNLGNIYMEWNMISAATSFYKAAISVTS 119

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
             S  L NL   Y Q+GNY  A   Y +   IDP A                        
Sbjct: 120 GLSSPLNNLAVIYKQQGNYADAITCYTEVLRIDPTA-----------------------A 156

Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
           D L+ +  G+T  K I RV E +Q+     +I P++    ++L  A+
Sbjct: 157 DALVNR--GNT-FKEIGRVNEAIQDYVQAATIRPNMPEAHANLASAY 200


>gi|291569288|dbj|BAI91560.1| TPR domain protein [Arthrospira platensis NIES-39]
          Length = 917

 Score = 46.2 bits (108), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 43/174 (24%), Positives = 82/174 (47%), Gaps = 21/174 (12%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AI  + KAI    ++  A  DMA+ L ++  ++ A++ ++  R  C    +  + N L
Sbjct: 123 EGAIGCYQKAIALNPKLGEAYLDMALRLNERGDINTAIKVLQQGRINCPNFKE--IFNTL 180

Query: 96  IDLYKKCGRLDEQIELLKQKLR------MIYH--GEAF--NGKPTKTARSHGKKFQVTVK 145
             L  +  ++DE I + ++ L       ++Y+  G AF   GK ++   ++ K   +++K
Sbjct: 181 GYLQLQQNQIDEAIAIFQEALSIDPTEPLVYNNLGWAFARQGKLSEAIAAYHKA--ISLK 238

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-------DANKACNLSHCL 192
            + +    NLG  +  K N+  A   ++KA  I+P       D   +C +  CL
Sbjct: 239 PDLAIAYSNLGKLWQHKNNHRQAITYFQKAIAIEPDNIMFYSDCGNSCLIIGCL 292


>gi|294952719|ref|XP_002787430.1| hypothetical protein Pmar_PMAR028692 [Perkinsus marinus ATCC
          50983]
 gi|239902402|gb|EER19226.1| hypothetical protein Pmar_PMAR028692 [Perkinsus marinus ATCC
          50983]
          Length = 293

 Score = 45.8 bits (107), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 40/71 (56%), Gaps = 1/71 (1%)

Query: 15 LPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVE 74
          +P  DS Y +AK ++  + D + A   F +AI  GDRVDS+LKD+A +L     V  AV 
Sbjct: 25 MPSYDSWYGQAKRLEF-DGDLKGAFSAFMEAIARGDRVDSSLKDVAGLLNMVGHVHAAVR 83

Query: 75 AIKSFRHLCSK 85
           ++  R L S 
Sbjct: 84 FLEDHRGLASN 94


>gi|375011677|ref|YP_004988665.1| cytochrome c biogenesis factor [Owenweeksia hongkongensis DSM
           17368]
 gi|359347601|gb|AEV32020.1| cytochrome c biogenesis factor [Owenweeksia hongkongensis DSM
           17368]
          Length = 657

 Score = 45.4 bits (106), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 50/109 (45%), Gaps = 29/109 (26%)

Query: 103 GRLDEQIELLKQKLR-MIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQ 161
           G+LDE + L K+ L+    H EA N                           NLG+ Y+Q
Sbjct: 556 GQLDEAVTLFKKLLKEQPQHAEALN---------------------------NLGYTYLQ 588

Query: 162 KGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
            GN T A + + KA  +DPD  +A  N +  LI Q  + EA+  L++VL
Sbjct: 589 MGNITLAGLNFDKALAVDPDYLQAWLNKASLLIMQENFAEAKKALKEVL 637


>gi|344198703|ref|YP_004783029.1| hypothetical protein Acife_0495 [Acidithiobacillus ferrivorans SS3]
 gi|343774147|gb|AEM46703.1| Tetratricopeptide TPR_2 repeat-containing protein
           [Acidithiobacillus ferrivorans SS3]
          Length = 545

 Score = 45.1 bits (105), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 57/220 (25%), Positives = 95/220 (43%), Gaps = 24/220 (10%)

Query: 2   SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
           S+  D    V+  +P           + + +K    A + F KA++       AL ++ V
Sbjct: 244 SEAEDAIRKVLSLMPDNAESIFVLGAISVGKKCLSEAEMAFRKALSIKPAYPEALMNLGV 303

Query: 62  VLKQQDRVDEAVEAIKSFRH--LCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM- 118
           +L  Q R+D   EAI +FR   +     +E+L  + + L  + GR+DE    L++ L + 
Sbjct: 304 MLSDQGRMD---EAIGTFRKALIIKPGYREALMRLGVALGYQ-GRMDEAESTLRKALAIK 359

Query: 119 -----------IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN-YT 166
                      +  G     + T+T         + +K +    L NLG A  ++G    
Sbjct: 360 SDYPEAIMNLGVILGHEGRWEETETLLRQA----LAIKPDYPEALMNLGAALSKQGRCLD 415

Query: 167 SAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEARSVL 205
            AE + R+A  I PD A+   NL   L KQ R+ EA ++L
Sbjct: 416 EAETILRQALAIQPDHADALVNLGATLNKQGRWDEAETIL 455



 Score = 40.4 bits (93), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 48/211 (22%), Positives = 80/211 (37%), Gaps = 42/211 (19%)

Query: 41  LFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYK 100
           + + A+N  +   +AL +++V+L  Q+R  EA +AI+    L    A+       I + K
Sbjct: 215 VLYDALNINENNVAALHNLSVLLLYQNRYSEAEDAIRKVLSLMPDNAESIFVLGAISVGK 274

Query: 101 KC--------------------------------GRLDEQIELLKQKL--------RMIY 120
           KC                                GR+DE I   ++ L         ++ 
Sbjct: 275 KCLSEAEMAFRKALSIKPAYPEALMNLGVMLSDQGRMDEAIGTFRKALIIKPGYREALMR 334

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            G A   +          +  + +K +    + NLG     +G +   E + R+A  I P
Sbjct: 335 LGVALGYQGRMDEAESTLRKALAIKSDYPEAIMNLGVILGHEGRWEETETLLRQALAIKP 394

Query: 181 DANKAC-NLSHCLIKQAR-YTEARSVLEDVL 209
           D  +A  NL   L KQ R   EA ++L   L
Sbjct: 395 DYPEALMNLGAALSKQGRCLDEAETILRQAL 425


>gi|224104413|ref|XP_002313429.1| predicted protein [Populus trichocarpa]
 gi|222849837|gb|EEE87384.1| predicted protein [Populus trichocarpa]
          Length = 88

 Score = 45.1 bits (105), Expect = 0.037,   Method: Composition-based stats.
 Identities = 22/40 (55%), Positives = 29/40 (72%)

Query: 111 LLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSR 150
           +L+ KL+ I  G AF+GK TKTARS G+K Q+TV+ E  R
Sbjct: 1   MLQCKLKNIEKGIAFSGKKTKTARSQGRKIQITVEHERPR 40


>gi|294055795|ref|YP_003549453.1| hypothetical protein [Coraliomargarita akajimensis DSM 45221]
 gi|293615128|gb|ADE55283.1| TPR repeat-containing protein [Coraliomargarita akajimensis DSM
           45221]
          Length = 453

 Score = 45.1 bits (105), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 35/62 (56%), Gaps = 1/62 (1%)

Query: 149 SRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK-ACNLSHCLIKQARYTEARSVLED 207
            R  G LG+AY+  G Y +A+  YR+A L+ PD       L+ CL++  RY  A ++ + 
Sbjct: 160 GRSYGLLGYAYLTGGRYYAADAAYRQAILMQPDVKDWKIGLARCLMETERYDVAIALFDT 219

Query: 208 VL 209
           +L
Sbjct: 220 LL 221


>gi|289548588|ref|YP_003473576.1| hypothetical protein Thal_0816 [Thermocrinis albus DSM 14484]
 gi|289182205|gb|ADC89449.1| TPR repeat-containing protein [Thermocrinis albus DSM 14484]
          Length = 132

 Score = 45.1 bits (105), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 38/66 (57%), Gaps = 2/66 (3%)

Query: 138 KKFQVTVKQETSRILG--NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQ 195
           K+++  +K++    L   NLG  Y +KG+YT AE  YR+A  +  D +   NL++ L KQ
Sbjct: 41  KEYKKALKKDPKNFLALFNLGNVYAKKGDYTVAEKFYREALAVKEDPDVLNNLAYVLYKQ 100

Query: 196 ARYTEA 201
            R  EA
Sbjct: 101 GRREEA 106


>gi|376005011|ref|ZP_09782581.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375326605|emb|CCE18334.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 530

 Score = 44.7 bits (104), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 40/153 (26%), Positives = 66/153 (43%), Gaps = 14/153 (9%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI A  +      ++   L QQ ++DEA+ A +    L    A  S DN L  
Sbjct: 21  AIASYNQAIAASPKSAWYYHNLGEALSQQGKIDEAIAAYRQATELNPNSAW-SYDN-LGT 78

Query: 98  LYKKCGRLDEQIELLKQKLRM------IYHGEAF----NGKPTKTARSHGKKFQVTVKQE 147
           L  + G L E +   +Q + +       YH  A      G P +      K  ++T   +
Sbjct: 79  LLNQQGNLPEAVRCFRQAIELDPYFWQFYHNLALVLIKQGHPEEAVSLLQKAIELTA--D 136

Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            + +  +LG AY Q+  Y+ A   YR+   ++P
Sbjct: 137 DAELYHSLGKAYHQQQQYSEAVTAYRQGLELNP 169


>gi|302037002|ref|YP_003797324.1| hypothetical protein NIDE1664 [Candidatus Nitrospira defluvii]
 gi|300605066|emb|CBK41399.1| exported protein of unknown function, contains TPR repeats
           [Candidatus Nitrospira defluvii]
          Length = 531

 Score = 44.7 bits (104), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 15/82 (18%)

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI--------DPD-ANKACNLSHCLIKQA 196
           QE +R L NL   YM +  +T AE +YR++ +I        +P  A    NL+H  + Q 
Sbjct: 243 QEVARTLNNLANVYMYQARFTDAESLYRRSLVITEKVLGKENPKVAATLTNLAHMSLDQG 302

Query: 197 RYTEARSV------LEDVLLGK 212
           R+TEA S+      +E+ + GK
Sbjct: 303 RFTEAESLNQRALAIEEKVFGK 324


>gi|423065536|ref|ZP_17054326.1| TPR containing protein [Arthrospira platensis C1]
 gi|406712979|gb|EKD08154.1| TPR containing protein [Arthrospira platensis C1]
          Length = 530

 Score = 44.7 bits (104), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 40/153 (26%), Positives = 66/153 (43%), Gaps = 14/153 (9%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI A  +      ++   L QQ ++DEA+ A +    L    A  S DN L  
Sbjct: 21  AIASYNQAIAASPKSAWYYHNLGEALSQQGKIDEAIAAYRQATELNPNSAW-SYDN-LGT 78

Query: 98  LYKKCGRLDEQIELLKQKLRM------IYHGEAF----NGKPTKTARSHGKKFQVTVKQE 147
           L  + G L E +   +Q + +       YH  A      G P +      K  ++T   +
Sbjct: 79  LLNQQGNLPEAVRCFRQAIELDPYFWQFYHNLALVLIKQGHPEEAVSLLQKAIELTA--D 136

Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            + +  +LG AY Q+  Y+ A   YR+   ++P
Sbjct: 137 DAELYHSLGKAYHQQQQYSEAVTAYRQGLELNP 169


>gi|209527528|ref|ZP_03276030.1| TPR repeat-containing protein [Arthrospira maxima CS-328]
 gi|209492016|gb|EDZ92369.1| TPR repeat-containing protein [Arthrospira maxima CS-328]
          Length = 530

 Score = 44.7 bits (104), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 40/153 (26%), Positives = 66/153 (43%), Gaps = 14/153 (9%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI A  +      ++   L QQ ++DEA+ A +    L    A  S DN L  
Sbjct: 21  AIASYNQAIAASPKSAWYYHNLGEALSQQGKIDEAIAAYRQATELNPNSAW-SYDN-LGT 78

Query: 98  LYKKCGRLDEQIELLKQKLRM------IYHGEAF----NGKPTKTARSHGKKFQVTVKQE 147
           L  + G L E +   +Q + +       YH  A      G P +      K  ++T   +
Sbjct: 79  LLNQQGNLPEAVRCFRQAIELDPYFWQFYHNLALVLIKQGHPEEAVSLLQKAIELTA--D 136

Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            + +  +LG AY Q+  Y+ A   YR+   ++P
Sbjct: 137 DAELYHSLGKAYHQQQQYSEAVTAYRQGLELNP 169


>gi|257059198|ref|YP_003137086.1| peptidase S1 and S6 chymotrypsin/Hap [Cyanothece sp. PCC 8802]
 gi|256589364|gb|ACV00251.1| peptidase S1 and S6 chymotrypsin/Hap [Cyanothece sp. PCC 8802]
          Length = 810

 Score = 44.7 bits (104), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 50/177 (28%), Positives = 77/177 (43%), Gaps = 19/177 (10%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSK--QAQESLDNVL 95
           AI  + KAI        A  ++   L  Q +++EA+ A +    L     QA  +L N L
Sbjct: 82  AIAAYQKAIQLNPNDADAYNNLGNALYYQGKLEEAIAAYQKAIQLNPNFAQAYNNLGNAL 141

Query: 96  IDLYKKCGRLDEQIELLKQKLRM------IYH--GEAFN--GKPTKTARSHGKKFQVTVK 145
            D     G+L+E I   ++ +++       Y+  G A +  GK  +   ++ K  Q+   
Sbjct: 142 SD----QGKLEEAIAAYQKAIQLNPNFTQAYYNLGIALSDQGKLEEAIAAYQKAIQLNPN 197

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEA 201
              +    NLG A   +G    A   Y+KA  +DP DAN   NL   L KQ +  EA
Sbjct: 198 YADAYY--NLGVALFDQGKLDEAIAAYQKAIQLDPNDANAYNNLGAALYKQGKLEEA 252



 Score = 43.1 bits (100), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 78/175 (44%), Gaps = 11/175 (6%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AI  + KAI        A  ++ V L  Q ++DEA+ A +    L    A  + +N+ 
Sbjct: 182 EEAIAAYQKAIQLNPNYADAYYNLGVALFDQGKLDEAIAAYQKAIQLDPNDA-NAYNNLG 240

Query: 96  IDLYKKCGRLDEQIELLKQKLRMIYH-GEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
             LYK+ G+L+E I   ++ +++  +  EA+N      +    +   +   Q+  ++  N
Sbjct: 241 AALYKQ-GKLEEAIAAYQKAIQLNPNLAEAYNNLGVALSDQGKRDEAIAAYQKAIQLNPN 299

Query: 155 LGWAYMQKGNYTS-------AEVVYRKAQLIDPDANKACN-LSHCLIKQARYTEA 201
              AY   GN  S       A   Y+KA  ++P+   A N L + L  Q +  EA
Sbjct: 300 FALAYNGLGNALSDQGKRDEAIAAYQKAIQLNPNFALAYNGLGNALSDQGKRDEA 354



 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 44/177 (24%), Positives = 79/177 (44%), Gaps = 15/177 (8%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AI  + KAI        A  ++ + L  Q +++EA+ A +    L    A ++  N+ 
Sbjct: 148 EEAIAAYQKAIQLNPNFTQAYYNLGIALSDQGKLEEAIAAYQKAIQLNPNYA-DAYYNLG 206

Query: 96  IDLYKKCGRLDEQIELLKQKLRMIYH-GEAFN---------GKPTKTARSHGKKFQVTVK 145
           + L+ + G+LDE I   ++ +++  +   A+N         GK  +   ++ K  Q+   
Sbjct: 207 VALFDQ-GKLDEAIAAYQKAIQLDPNDANAYNNLGAALYKQGKLEEAIAAYQKAIQLN-- 263

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIKQARYTEA 201
              +    NLG A   +G    A   Y+KA  ++P+   A N L + L  Q +  EA
Sbjct: 264 PNLAEAYNNLGVALSDQGKRDEAIAAYQKAIQLNPNFALAYNGLGNALSDQGKRDEA 320



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 48/179 (26%), Positives = 75/179 (41%), Gaps = 19/179 (10%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSK--QAQESLDN 93
           E AI  + KAI     +  A  ++ V L  Q + DEA+ A +    L      A   L N
Sbjct: 250 EEAIAAYQKAIQLNPNLAEAYNNLGVALSDQGKRDEAIAAYQKAIQLNPNFALAYNGLGN 309

Query: 94  VLIDLYKKCGRLDEQIELLKQKLR------MIYH--GEAFN--GKPTKTARSHGKKFQVT 143
            L D     G+ DE I   ++ ++      + Y+  G A +  GK  +   ++ K  Q+ 
Sbjct: 310 ALSD----QGKRDEAIAAYQKAIQLNPNFALAYNGLGNALSDQGKRDEAIAAYQKAIQLD 365

Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEA 201
                +    NLG A   +G    A   Y+KA  ++P+   A  NL + L  Q +  EA
Sbjct: 366 PNDANA--YNNLGLALRNQGKRDEAITAYQKAIQLNPNFALAYNNLGNALYSQGKREEA 422


>gi|67920262|ref|ZP_00513782.1| TPR repeat:Sel1-like repeat:Sel1-like repeat [Crocosphaera watsonii
           WH 8501]
 gi|67857746|gb|EAM52985.1| TPR repeat:Sel1-like repeat:Sel1-like repeat [Crocosphaera watsonii
           WH 8501]
          Length = 353

 Score = 44.7 bits (104), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 49/177 (27%), Positives = 80/177 (45%), Gaps = 15/177 (8%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E A   + KAI        A  +M   L++Q +++EA+ A K    L    A  + +N+ 
Sbjct: 77  EEATAAYKKAIELDPNYSFAYNNMGNALRKQGKLEEAIAAYKKAIELDPNDAF-AYNNMG 135

Query: 96  IDLYKKCGRLDEQIELLKQKLRM------IYH--GEAFN--GKPTKTARSHGKKFQVTVK 145
           + L    G+L+E I   K+ + +       Y+  G A N  GK  +   ++ K   + + 
Sbjct: 136 LAL-DDQGKLEEAIAAYKKAIELDPNYATAYYNMGNALNRQGKLEEAIAAYKKA--IELD 192

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEA 201
              S    N+G A  ++G Y  A   Y+KA  I+P+ A    N+   L KQ +Y EA
Sbjct: 193 PNYSFAYNNMGVALRKQGKYDEAIAAYKKAIEINPNYAFAYNNMGVALRKQGKYDEA 249



 Score = 43.9 bits (102), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 48/178 (26%), Positives = 75/178 (42%), Gaps = 17/178 (9%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE--SLDN 93
           E AI  + KAI        A  +M + L  Q +++EA+ A K    L    A    ++ N
Sbjct: 111 EEAIAAYKKAIELDPNDAFAYNNMGLALDDQGKLEEAIAAYKKAIELDPNYATAYYNMGN 170

Query: 94  VLIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGK--------KFQVTV 144
            L     + G+L+E I   K+ + +   +  A+N       R  GK        K  + +
Sbjct: 171 AL----NRQGKLEEAIAAYKKAIELDPNYSFAYNNMGV-ALRKQGKYDEAIAAYKKAIEI 225

Query: 145 KQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEA 201
               +    N+G A  ++G Y  A   Y+KA  I+P DA    N+   L  Q +Y EA
Sbjct: 226 NPNYAFAYNNMGVALRKQGKYDEAIAAYKKAIEINPNDAFGYNNMGLALDDQGKYDEA 283



 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 39/146 (26%), Positives = 60/146 (41%), Gaps = 28/146 (19%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AI  + KAI        A  +M V L++Q + DEA+ A K    +    A  + +N+ 
Sbjct: 179 EEAIAAYKKAIELDPNYSFAYNNMGVALRKQGKYDEAIAAYKKAIEINPNYAF-AYNNMG 237

Query: 96  IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNL 155
           + L +K G+ DE I   K+ +  I   +AF                            N+
Sbjct: 238 VAL-RKQGKYDEAIAAYKKAIE-INPNDAFG-------------------------YNNM 270

Query: 156 GWAYMQKGNYTSAEVVYRKAQLIDPD 181
           G A   +G Y  A   ++KA  IDP+
Sbjct: 271 GLALDDQGKYDEAIAAHKKALEIDPN 296


>gi|411119538|ref|ZP_11391918.1| TPR repeat-containing protein [Oscillatoriales cyanobacterium JSC-12]
 gi|410711401|gb|EKQ68908.1| TPR repeat-containing protein [Oscillatoriales cyanobacterium JSC-12]
          Length = 2384

 Score = 44.3 bits (103), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 41/158 (25%), Positives = 68/158 (43%), Gaps = 10/158 (6%)

Query: 38   AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
            AI  +   + A D       ++AV L +Q ++DEA+   +  + L  K       N   +
Sbjct: 927  AIAYYQNTLKANDNHLDTYNNLAVALHEQGKLDEAMPYYQ--KALALKPDNPDAHNNYAN 984

Query: 98   LYKKCGRLDEQIELLKQKL--RMIYH------GEAFNGKPTKTARSHGKKFQVTVKQETS 149
            L ++  RLDE I   +Q +  R  Y       G A+  K    + +   +  +  K    
Sbjct: 985  LLRERSRLDEAIYHYQQAIAARPDYPDAYNNLGLAYYAKGNFASAAEAYRQAIERKPHFP 1044

Query: 150  RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
            + L +LG A  + GN+  A   Y++A  + PD  KA N
Sbjct: 1045 QALNHLGNALKELGNFAEAARYYQQAIALKPDYAKAYN 1082



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 44/175 (25%), Positives = 81/175 (46%), Gaps = 11/175 (6%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AI  + +A+N       A  ++ +  K Q ++DEA+   +    L    A E+  N+ 
Sbjct: 122 EEAIAAYQQALNLNQAYAEAHNNLGIAYKDQGKLDEALACYREAIRLKPNYA-EAHHNMG 180

Query: 96  IDLYKKCGRLDEQIELLKQKLRMIYHG-EAFNG-KPTKTARSHGKK----FQ--VTVKQE 147
           I L ++  +LD+ I   +Q + +  H  +A+     T   + +G++    +Q  VT+K  
Sbjct: 181 IVL-RQQNKLDDAIHYFRQAIALKPHYIDAYTSLGSTLQQQGNGEEAIACYQQVVTLKPN 239

Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEA 201
            +    NLG A   +G    A   +++A  + P+    C NL + L++  R  EA
Sbjct: 240 YAEGFNNLGLALQHQGKLEEAIATFQQALALQPNFPGVCNNLGNLLLEVNRVDEA 294



 Score = 37.7 bits (86), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 42/169 (24%), Positives = 76/169 (44%), Gaps = 17/169 (10%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + KA+        AL ++  VLK Q +++ AV  ++    L    A+  + N L +
Sbjct: 328 AITHYQKALELRPNFVEALSNLGAVLKDQHKLEAAVSYLEQAVSLGPSYAE--IHNNLGN 385

Query: 98  LYKKCGRLDEQIEL------LKQKLRMIYHGEA----FNGKPTKTARSHGKKFQVTVKQE 147
            Y++  R+DE I        LK ++  ++        + G+  + A  H +K  + ++ +
Sbjct: 386 AYQEQKRVDEAIACYRTAVALKPEMAEVHSNLGNMLQYIGE-FEEAFEHFRK-AIEIQPD 443

Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD---ANKACNLSHCLI 193
            + +  NLG A+   G    A   Y KA  + PD   A+    L+H L+
Sbjct: 444 FAGVYNNLGIAHRNAGQVQEAFAAYSKALELKPDFVEAHWNTALNHLLL 492


>gi|444912555|ref|ZP_21232716.1| hypothetical protein D187_04652 [Cystobacter fuscus DSM 2262]
 gi|444716773|gb|ELW57614.1| hypothetical protein D187_04652 [Cystobacter fuscus DSM 2262]
          Length = 588

 Score = 43.9 bits (102), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 47/176 (26%), Positives = 79/176 (44%), Gaps = 26/176 (14%)

Query: 23  VRAKHVQLVEK--DPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFR 80
           +R +  + VE+  +PE A+ L  +A+    R    L+  A  LK+Q R  EA+ A++   
Sbjct: 360 LRIQQARAVERGGEPERAVALLREALGR-KRTPELLEAFASTLKRQGRPGEALAALRE-- 416

Query: 81  HLCSKQAQES------LDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTAR 134
                QA E       L  VL+D+    G L     +L+ +     H  A N      A+
Sbjct: 417 --AISQAPEDAAPRYVLATVLLDMGDVPGALYSMRGVLRLE---PDHAAAMNFIGYLLAQ 471

Query: 135 SHGKKFQ---------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
            HG+ F          + ++ +T   L +LGW + Q+G+Y SA     +A  ++P+
Sbjct: 472 -HGRDFAEAERLVRRALALRPDTGSFLDSLGWIHYQRGDYPSAVRTLARAAELEPE 526


>gi|441504257|ref|ZP_20986254.1| Flp pilus assembly protein TadD [Photobacterium sp. AK15]
 gi|441428430|gb|ELR65895.1| Flp pilus assembly protein TadD [Photobacterium sp. AK15]
          Length = 365

 Score = 43.9 bits (102), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 35/67 (52%), Gaps = 1/67 (1%)

Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEAR 202
           V  ++ + L NLG++Y   GN   AEV+ R+A  + PD  +A  NL    IK  RY +A 
Sbjct: 219 VDGKSQKALTNLGYSYYLSGNLRQAEVINRRASTLYPDNQRAWSNLGLTYIKAKRYDDAN 278

Query: 203 SVLEDVL 209
                ++
Sbjct: 279 DAFSRIM 285


>gi|82702040|ref|YP_411606.1| hypothetical protein Nmul_A0911 [Nitrosospira multiformis ATCC
           25196]
 gi|82410105|gb|ABB74214.1| Tetratricopeptide TPR_4 [Nitrosospira multiformis ATCC 25196]
          Length = 875

 Score = 43.9 bits (102), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 78/167 (46%), Gaps = 11/167 (6%)

Query: 55  ALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQ 114
           AL  + ++ +   R+D  +E IK  R L  K       N L +  ++ G+L++ I   + 
Sbjct: 79  ALHFLGLLARDAGRIDIGIELIK--RALRFKPNYVEAHNNLGNTLRQQGKLNDAIASYRT 136

Query: 115 KLRMIYHGEAFNGKPTKTARSHGK--------KFQVTVKQETSRILGNLGWAYMQKGNYT 166
            +++        G      R  G+        +  + ++ + + +  N+G  + ++G+  
Sbjct: 137 AVKLEPRFAEAYGNLGNALREQGRLDDAMINYRKALGIQPQLAEMHCNIGIVHREQGDLE 196

Query: 167 SAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEARSVLEDVLLGK 212
           +A   +RKA L+ PD+ +A  NL + L++Q ++ EA S     +L K
Sbjct: 197 NAVSSFRKALLLKPDSAEAFNNLGNVLVEQGKFEEAVSSFGKAILYK 243


>gi|394988448|ref|ZP_10381283.1| hypothetical protein SCD_00848 [Sulfuricella denitrificans skB26]
 gi|393791827|dbj|GAB70922.1| hypothetical protein SCD_00848 [Sulfuricella denitrificans skB26]
          Length = 735

 Score = 43.9 bits (102), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 44/161 (27%), Positives = 74/161 (45%), Gaps = 11/161 (6%)

Query: 58  DMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLR 117
           ++ VVL +Q R+ EA   ++    +    A ++ +N+ I L K+  RL E     +Q L+
Sbjct: 89  NLGVVLVEQGRLSEAEANLRRALEIKPNHA-DAHNNLGIALMKQ-ERLSEAEPCFQQALK 146

Query: 118 MI--YHGEAFNGKPTKTARSHGKKFQVT------VKQETSRILGNLGWAYMQKGNYTSAE 169
           +   YH    N     T +    + +++      +K +    L NLG    Q+G    +E
Sbjct: 147 IKPDYHEALNNLGSLLTEQGLLAEAEISCVEALKIKPDYVPALSNLGVCLQQQGRLAESE 206

Query: 170 VVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
             +R+A  I PD+ +A C L   L  Q R+ EA + L   L
Sbjct: 207 ACFRRALEIKPDSAEALCGLGITLNDQGRFPEAEANLRRAL 247



 Score = 38.5 bits (88), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 42/89 (47%), Gaps = 1/89 (1%)

Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
           G A +G+   +      +  + + ++ +    NLG A      ++ AE  YR+A  I P+
Sbjct: 23  GTALSGQGKSSEALDALQRAIALSRDDAEAHSNLGNALKNLRRFSEAETSYRQAIKIRPN 82

Query: 182 -ANKACNLSHCLIKQARYTEARSVLEDVL 209
            A   CNL   L++Q R +EA + L   L
Sbjct: 83  FAEAQCNLGVVLVEQGRLSEAEANLRRAL 111


>gi|398346185|ref|ZP_10530888.1| hypothetical protein Lbro5_02905 [Leptospira broomii str. 5399]
          Length = 689

 Score = 43.9 bits (102), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 7/86 (8%)

Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEARSVLE 206
           +S  LGN+ +    +G+Y SAE  +R+A  + P+  KA  NLS  L K  +  EA   LE
Sbjct: 279 SSAYLGNIAY---NRGDYGSAEHYFREAATLSPNDAKALYNLSVVLKKNGKMEEALKYLE 335

Query: 207 DVLLGKLSGSTETKTINRVKELLQEL 232
              + KL+G+++ +    + E  ++L
Sbjct: 336 ---MAKLAGASDPEIFRSIAESFEQL 358


>gi|390568922|ref|ZP_10249213.1| TPR repeat-containing protein [Burkholderia terrae BS001]
 gi|389939128|gb|EIN00966.1| TPR repeat-containing protein [Burkholderia terrae BS001]
          Length = 702

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 44/163 (26%), Positives = 72/163 (44%), Gaps = 15/163 (9%)

Query: 55  ALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQ 114
           A  ++ VVL+Q  R  EA +A K       + A   ++  L  L +  GR+DE  E+  +
Sbjct: 243 AYNNLGVVLRQLGRPGEAKQAHKKAIATDPEHAGAHIN--LARLLESLGRVDE-AEVAYR 299

Query: 115 KLRMIY--HGEAFNGKPTKTARSHGK--------KFQVTVKQETSRILGNLGWAYMQKGN 164
           +   +Y  H E  N       ++ G+        +  + +K +      NLG     +  
Sbjct: 300 RGLCLYPDHAETHNNL-GAMLQAQGRMPDAETCYRRSLAIKPDFPNACFNLGIVLQAQKR 358

Query: 165 YTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLE 206
           Y  AE +YR+A    PD  +A  NL+H L+   ++TE   V E
Sbjct: 359 YHEAERLYRRALQAKPDMLEARLNLAHLLLATGQFTEGWQVFE 401


>gi|115477811|ref|NP_001062501.1| Os08g0559300 [Oryza sativa Japonica Group]
 gi|75325353|sp|Q6YZI0.1|SPY_ORYSJ RecName: Full=Probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY
 gi|45736106|dbj|BAD13137.1| putative gibberellin action negative regulator SPY [Oryza sativa
           Japonica Group]
 gi|113624470|dbj|BAF24415.1| Os08g0559300 [Oryza sativa Japonica Group]
 gi|215686860|dbj|BAG89710.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 927

 Score = 43.5 bits (101), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 66/151 (43%), Gaps = 30/151 (19%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AIV +  A++   R   A  ++ V+ K +D +D+AVE  +    +    +Q SL+N+ 
Sbjct: 303 EMAIVFYELALHFNPRCAEACNNLGVIYKDRDNLDKAVECYQMALSIKPNFSQ-SLNNLG 361

Query: 96  IDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
           + +Y   G++D    ++++ +     + EA+N                           N
Sbjct: 362 V-VYTVQGKMDAASSMIQKAIFANSTYAEAYN---------------------------N 393

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           LG  Y   G+ TSA   Y K   IDPD+  A
Sbjct: 394 LGVLYRDAGSITSAVQAYEKCLQIDPDSRNA 424


>gi|218201606|gb|EEC84033.1| hypothetical protein OsI_30264 [Oryza sativa Indica Group]
          Length = 947

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 66/151 (43%), Gaps = 30/151 (19%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AIV +  A++   R   A  ++ V+ K +D +D+AVE  +    +    +Q SL+N+ 
Sbjct: 323 EMAIVFYELALHFNPRCAEACNNLGVIYKDRDNLDKAVECYQMALSIKPNFSQ-SLNNLG 381

Query: 96  IDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
           + +Y   G++D    ++++ +     + EA+N                           N
Sbjct: 382 V-VYTVQGKMDAASSMIQKAIFANSTYAEAYN---------------------------N 413

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           LG  Y   G+ TSA   Y K   IDPD+  A
Sbjct: 414 LGVLYRDAGSITSAVQAYEKCLQIDPDSRNA 444


>gi|118391306|ref|XP_001028412.1| TPR Domain containing protein [Tetrahymena thermophila]
 gi|89281566|gb|EAR80749.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
          Length = 318

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTE 200
           V +  + S   GNLG+++MQK  Y  A   ++KA  +DP D+    NL +  +K+  Y +
Sbjct: 13  VQLDPKDSWAFGNLGYSFMQKEMYDDAITFFQKAVQLDPRDSWAFGNLGYSFMKKKMYDD 72

Query: 201 ARSVLE 206
           A + L+
Sbjct: 73  AITFLQ 78



 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 46/198 (23%), Positives = 80/198 (40%), Gaps = 18/198 (9%)

Query: 14  KLPPGDSPYVRAKHVQLVEKDP-EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEA 72
           +L P DS   R      ++K+  + AI  F KA+    R   A  ++     ++   D+A
Sbjct: 116 QLDPKDSCAFRCMGYSFMKKEMYDDAITFFQKAVQLDPRDSWAFGNLGYSFMKKKMYDDA 175

Query: 73  VEAIK-----------SFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYH 121
           +   +           +F +L     ++ + +  I  ++K  +LD +      KL     
Sbjct: 176 ITFFQKAVQLDPKDSWAFGNLGYSFMKKKMYDDAITFFQKAVQLDPKDSWAFGKL----- 230

Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
           G +F  K          +  V +  + S   G LG+++MQK  Y  A    +KA  +DPD
Sbjct: 231 GYSFMQKQMYDDAITFFQKAVQLDPKDSWAFGKLGYSFMQKEMYDDAITFSQKAVQLDPD 290

Query: 182 ANKA-CNLSHCLIKQARY 198
             +   NL     K+ RY
Sbjct: 291 VKENLLNLGIAFQKKGRY 308


>gi|222641017|gb|EEE69149.1| hypothetical protein OsJ_28275 [Oryza sativa Japonica Group]
          Length = 947

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 66/151 (43%), Gaps = 30/151 (19%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AIV +  A++   R   A  ++ V+ K +D +D+AVE  +    +    +Q SL+N+ 
Sbjct: 323 EMAIVFYELALHFNPRCAEACNNLGVIYKDRDNLDKAVECYQMALSIKPNFSQ-SLNNLG 381

Query: 96  IDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
           + +Y   G++D    ++++ +     + EA+N                           N
Sbjct: 382 V-VYTVQGKMDAASSMIQKAIFANSTYAEAYN---------------------------N 413

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           LG  Y   G+ TSA   Y K   IDPD+  A
Sbjct: 414 LGVLYRDAGSITSAVQAYEKCLQIDPDSRNA 444


>gi|223934537|ref|ZP_03626458.1| Tetratricopeptide TPR_2 repeat protein [bacterium Ellin514]
 gi|223897000|gb|EEF63440.1| Tetratricopeptide TPR_2 repeat protein [bacterium Ellin514]
          Length = 718

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 46/181 (25%), Positives = 80/181 (44%), Gaps = 15/181 (8%)

Query: 32  EKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESL 91
           +  P+ A+  + K+I      D A  ++  VL ++ + +EA+    S   L   +A+  +
Sbjct: 484 QGKPDEALANYQKSIEINPNYDEAQNNIGYVLAKKGKPEEAIPYYFSSLRLNPNRAE--V 541

Query: 92  DNVLIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTA---------RSHGKKFQ 141
            N L + Y   G+LDE I   +  L++   + EA+NG     A         R      +
Sbjct: 542 HNNLGNAYADLGKLDEAIREYQAALKINTNYAEAYNGIGISLAKKGDLAGATRWLSDAIR 601

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTE 200
           +  K  +S    NLG  Y  +G +  A + Y+     +P DA    NL++ L +Q +  E
Sbjct: 602 LNPKNVSSH--SNLGNVYAMQGKFDLAAIEYKLVLKQNPDDALTHNNLANLLSEQGKLDE 659

Query: 201 A 201
           A
Sbjct: 660 A 660


>gi|254409909|ref|ZP_05023689.1| Tetratricopeptide repeat family [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196182945|gb|EDX77929.1| Tetratricopeptide repeat family [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 956

 Score = 43.1 bits (100), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 47/169 (27%), Positives = 67/169 (39%), Gaps = 33/169 (19%)

Query: 56  LKDMAVVLKQQDRVDEAVE-AIKSFRHLCSKQAQESLD-----NVLIDLYKKCGRLDEQI 109
           LK     L  Q + +EA+  A +  R   S   ++ LD     N L  LY+  GR  E  
Sbjct: 60  LKQQVEQLYNQGKYNEAIPLAERMLRLYQSVYGEDHLDITYSLNYLGILYRNQGRYTEAE 119

Query: 110 ELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAE 169
            L +Q L M                   K+         +  L NL + Y  +G YT AE
Sbjct: 120 PLYRQALEM------------------KKRLLGEEHPHVATSLNNLAYLYESQGRYTEAE 161

Query: 170 VVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSVLEDVL 209
            +YR+A     +L+  +    A    NL++    Q RYTEA  +L   L
Sbjct: 162 PLYRQALEMYKRLLGEEHPLVATSLNNLAYLYESQGRYTEAEPLLRQAL 210



 Score = 38.5 bits (88), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 73/174 (41%), Gaps = 35/174 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEAV----EAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
           V ++L ++A + + Q R  EA     +A++ ++ L  ++      SL+N L  LY+  GR
Sbjct: 140 VATSLNNLAYLYESQGRYTEAEPLYRQALEMYKRLLGEEHPLVATSLNN-LAYLYESQGR 198

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             E   LL+Q L M               R  G+             L NL  +Y  +G 
Sbjct: 199 YTEAEPLLRQALEM-------------RKRLLGQDHPAVATS-----LNNLAGSYQSQGR 240

Query: 165 YTSAEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSVLEDVL 209
           YT AE + R+A     +L+  +    A    NL+     Q RY+EA  +    L
Sbjct: 241 YTEAEPLLRQALEMSKRLLGEEHPHVALSLNNLAALYDNQGRYSEAEPLYRQAL 294


>gi|297180994|gb|ADI17196.1| FOG: tpr repeat-protein [uncultured Rhodobacterales bacterium
           HF0070_10D05]
          Length = 734

 Score = 43.1 bits (100), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 44/84 (52%), Gaps = 2/84 (2%)

Query: 33  KDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLD 92
           K+ + A+V + KAI+      +A  +M   LK Q ++DEA+EA    + LC K    +  
Sbjct: 346 KNYDDAVVFYNKAISIKPDYATAYNNMGNALKDQGKLDEAIEAFN--KALCIKPDYATAY 403

Query: 93  NVLIDLYKKCGRLDEQIELLKQKL 116
           N + +  K  G+LDE IE   + L
Sbjct: 404 NNMGNALKDQGKLDEAIEAFNKAL 427


>gi|255562045|ref|XP_002522031.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
           communis]
 gi|223538835|gb|EEF40435.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
           communis]
          Length = 979

 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 55/247 (22%), Positives = 92/247 (37%), Gaps = 38/247 (15%)

Query: 16  PPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA 75
           P     Y+   +V      P+ AIV + +A+        A  ++A    ++ ++D A+  
Sbjct: 256 PTFPDAYLNLGNVYRALGMPQEAIVCYQRAVQTRPNYAVAFGNLASTYYERGQLDLAIHH 315

Query: 76  IKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNG------- 127
            K     C  +  E+ +N L +  K  GR++E I+   Q L +   H +A          
Sbjct: 316 YKQ-AIACDGRFLEAYNN-LGNALKDVGRVEEAIQCYNQCLALQPSHPQALTNLGNIYME 373

Query: 128 -KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC 186
                TA S+ K   + V    S    NL   Y Q+GNY  A   Y +   IDP A    
Sbjct: 374 WNMASTAASYYKA-TLAVTTGLSAPFNNLAVIYKQQGNYADAISCYNEVLRIDPLAAD-- 430

Query: 187 NLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKK 246
                                   G ++     K I RV + +Q+     +I P++    
Sbjct: 431 ------------------------GLVNRGNTYKEIGRVSDAIQDYIRAITIRPTMAEAH 466

Query: 247 SSLEDAF 253
           ++L  A+
Sbjct: 467 ANLASAY 473


>gi|67902940|ref|XP_681726.1| hypothetical protein AN8457.2 [Aspergillus nidulans FGSC A4]
 gi|40747923|gb|EAA67079.1| hypothetical protein AN8457.2 [Aspergillus nidulans FGSC A4]
 gi|259484403|tpe|CBF80593.1| TPA: Pfs, NB-ARC and TPR domain protein (JCVI) [Aspergillus nidulans
            FGSC A4]
          Length = 1131

 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 61/243 (25%), Positives = 93/243 (38%), Gaps = 46/243 (18%)

Query: 54   SALKDMAVVLKQQDRVDEAVE----AIKSFRHLCSKQAQESLDNV--LIDLYKKCGRLDE 107
            + + ++  +   Q ++ EA E    A+  +          +LD V  L +LY   GRL E
Sbjct: 819  NTVNNLGNLYSDQGKLREAEEMYQQALAGYEKALGPAHTSTLDTVNNLGNLYSDQGRLKE 878

Query: 108  QIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTS 167
              E+ +Q         A  GK       H           T   +GNLG  Y  +G    
Sbjct: 879  AEEMYQQ---------ALAGKEKALGLDH---------TSTLDTVGNLGLLYRDQGKLRE 920

Query: 168  AEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSVLEDVLLGKLS--GS 216
            AE ++++A     + + PD     N   NL +    Q R  EA  + +  L GK    G 
Sbjct: 921  AEEMFQRALTGKEKALGPDHTSTLNTVNNLGNLYSDQGRLKEAEEMYQRALAGKEKALGL 980

Query: 217  TETKTINRVKEL---------LQELEPWQSIPPSLTTKKSSLEDAFLEGLDDLMNQWTPY 267
              T T+N V  L         L+E E  +    +LT K+ +L       LD + N    Y
Sbjct: 981  DHTSTLNTVNNLGLLYRDQGKLKEAE--EMFQRALTGKEKALGPDHTSTLDTVGNLGLLY 1038

Query: 268  RSR 270
            R +
Sbjct: 1039 RDQ 1041


>gi|427707117|ref|YP_007049494.1| hypothetical protein Nos7107_1704 [Nostoc sp. PCC 7107]
 gi|427359622|gb|AFY42344.1| Tetratricopeptide TPR_1 repeat-containing protein [Nostoc sp. PCC
           7107]
          Length = 552

 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 61/247 (24%), Positives = 105/247 (42%), Gaps = 42/247 (17%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSK--QAQESLDNVL 95
           A+  +  AI    ++ +A   +  VL+ Q ++DEA+   K    L  K   A   + NVL
Sbjct: 81  ALAAYQTAIKINPKLANAYNGLGNVLRDQGKLDEALATYKIAIKLDGKLANAYNGMGNVL 140

Query: 96  IDLYKKCGRLDEQIELLKQKLRM------IYHGEA----FNGKPTKTARSHGKKFQVTVK 145
                + G+L+E I   ++ +++       Y+G      + GK  +   S+ K  Q   K
Sbjct: 141 ----SEQGKLNESIAAYQKSIQLDPKNALPYNGMGNVLIYQGKLDEAIASYRKAIQFDPK 196

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQ-------AR 197
              +    NLG A   +     A   Y+KA  IDP    A  +L   L +Q       A+
Sbjct: 197 YAVT--YHNLGLALYNQKKLDEALAAYKKAIQIDPKYTSAYVSLGLALSEQGKLDEAMAK 254

Query: 198 YTEARSVLEDVLLGKLSGSTETKTI------------NRVKELLQELEPWQSIPPSLTTK 245
           Y +A S+ ED    K +  T   T+             ++KE ++E +   SI  +  T 
Sbjct: 255 YRQALSLPED----KSATPTTVHTLAHNNLGFALQRQGKLKEAIEEYKQAISIDSNFVTA 310

Query: 246 KSSLEDA 252
           +++L++A
Sbjct: 311 QTNLKEA 317


>gi|254410083|ref|ZP_05023863.1| Tetratricopeptide repeat family [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196183119|gb|EDX78103.1| Tetratricopeptide repeat family [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 955

 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 47/169 (27%), Positives = 67/169 (39%), Gaps = 33/169 (19%)

Query: 56  LKDMAVVLKQQDRVDEAVE-AIKSFRHLCSKQAQESLD-----NVLIDLYKKCGRLDEQI 109
           LK     L  Q + +EA+  A +  R   S   ++ LD     N L  LY+  GR  E  
Sbjct: 59  LKQQVEQLYNQGKYNEAIPLAERMLRLYQSVYGEDHLDITYSLNYLGILYRNQGRYTEAE 118

Query: 110 ELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAE 169
            L +Q L M                   K+         +  L NL + Y  +G YT AE
Sbjct: 119 PLYRQALEM------------------KKRLLGEEHPHVATSLNNLAYLYESQGRYTEAE 160

Query: 170 VVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSVLEDVL 209
            +YR+A     +L+  +    A    NL++    Q RYTEA  +L   L
Sbjct: 161 PLYRQALEMYKRLLGEEHPLVATSLNNLAYLYESQGRYTEAEPLLRQAL 209



 Score = 38.1 bits (87), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 73/174 (41%), Gaps = 35/174 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEAV----EAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
           V ++L ++A + + Q R  EA     +A++ ++ L  ++      SL+N L  LY+  GR
Sbjct: 139 VATSLNNLAYLYESQGRYTEAEPLYRQALEMYKRLLGEEHPLVATSLNN-LAYLYESQGR 197

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             E   LL+Q L M               R  G+             L NL  +Y  +G 
Sbjct: 198 YTEAEPLLRQALEM-------------RKRLLGQDHPAVATS-----LNNLAGSYQSQGR 239

Query: 165 YTSAEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSVLEDVL 209
           YT AE + R+A     +L+  +    A    NL+     Q RY+EA  +    L
Sbjct: 240 YTEAEPLLRQALEMSKRLLGEEHPHVALSLNNLAALYDNQGRYSEAEPLYRQAL 293


>gi|423065545|ref|ZP_17054335.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
 gi|406712988|gb|EKD08163.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
          Length = 917

 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 37/156 (23%), Positives = 74/156 (47%), Gaps = 14/156 (8%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AI  + KAI    ++  A  DM + L ++  +D A++ ++     C     + + N L
Sbjct: 124 EGAIGCYEKAIALNPKLGQAYLDMGLRLNERGDIDTAIKVLQQGGINCPNF--KEIFNTL 181

Query: 96  IDLYKKCGRLDEQIELLKQKLR------MIYH----GEAFNGKPTKTARSHGKKFQVTVK 145
             L  +  ++DE I + ++ L       ++Y+       + GK ++   ++ K   +++K
Sbjct: 182 GYLQLQQNQIDEAIAIFQEALNIDPTEPLVYNNLGWAWTYQGKLSEAIAAYQKA--ISLK 239

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
            + +    NLG  + QK N+  A   ++KA  I+PD
Sbjct: 240 PDLAIAYSNLGKLWQQKNNHRQAISYFQKAIAIEPD 275


>gi|397688051|ref|YP_006525370.1| hypothetical protein PSJM300_14750 [Pseudomonas stutzeri DSM 10701]
 gi|395809607|gb|AFN79012.1| TPR domain-containing protein [Pseudomonas stutzeri DSM 10701]
          Length = 575

 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 30/59 (50%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
           IL +LGWA+ + GN   AE   RKA    PD   A +L   L    +  EAR+V  D L
Sbjct: 495 ILDSLGWAHYRLGNLQEAEAQLRKAHARFPDHEVAAHLGEVLWVSGQRKEARTVWADAL 553


>gi|113477766|ref|YP_723827.1| hypothetical protein Tery_4362 [Trichodesmium erythraeum IMS101]
 gi|110168814|gb|ABG53354.1| Tetratricopeptide TPR_2 [Trichodesmium erythraeum IMS101]
          Length = 725

 Score = 42.7 bits (99), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 84/188 (44%), Gaps = 48/188 (25%)

Query: 52  VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIEL 111
           +DS  + +A+  + +DR  E+                ++L N+ I +Y+  G  +  I  
Sbjct: 147 IDSFQQQLAITQETKDRQSES----------------KALGNLGI-VYQSLGHFERSINY 189

Query: 112 LKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVV 171
            +Q+L +           T+         ++  +Q  S+ LGNLG  Y  +G Y+ AE +
Sbjct: 190 FQQQLAI-----------TR---------EIKDRQSESKALGNLGICYENQGQYSKAEPL 229

Query: 172 YRKA-----QLID---PD-ANKACNLSHCLIKQARYTEARSV-LEDVLLGK-LSGSTETK 220
           Y +A     QL+    PD A+   NL+     Q RY+EA  + LE + + K L G+T  +
Sbjct: 230 YLEALKMNKQLLGSTHPDVASNLNNLAGLYYSQGRYSEAEPLYLEALKMNKQLLGATHPE 289

Query: 221 TINRVKEL 228
             + +  L
Sbjct: 290 IASNLNNL 297


>gi|209528249|ref|ZP_03276714.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
 gi|209491321|gb|EDZ91711.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
          Length = 915

 Score = 42.7 bits (99), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 37/156 (23%), Positives = 74/156 (47%), Gaps = 14/156 (8%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AI  + KAI    ++  A  DM + L ++  +D A++ ++     C     + + N L
Sbjct: 122 EGAIGCYEKAIALNPKLGQAYLDMGLRLNERGDIDTAIKVLQQGGINCPNF--KEIFNTL 179

Query: 96  IDLYKKCGRLDEQIELLKQKLR------MIYH----GEAFNGKPTKTARSHGKKFQVTVK 145
             L  +  ++DE I + ++ L       ++Y+       + GK ++   ++ K   +++K
Sbjct: 180 GYLQLQQNQIDEAIAIFQEALNIDPTEPLVYNNLGWAWTYQGKLSEAIAAYQKA--ISLK 237

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
            + +    NLG  + QK N+  A   ++KA  I+PD
Sbjct: 238 PDLAIAYSNLGKLWQQKNNHRQAISYFQKAIAIEPD 273


>gi|406870824|gb|EKD21860.1| hypothetical protein ACD_87C00299G0002, partial [uncultured
           bacterium]
          Length = 637

 Score = 42.7 bits (99), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 44/171 (25%), Positives = 68/171 (39%), Gaps = 23/171 (13%)

Query: 26  KHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSK 85
           KH Q VE         + KAIN G +     K++A    Q  +  EA   I  +  L SK
Sbjct: 463 KHAQSVEN--------YQKAINLGAKDPQLHKNLAQAYSQMGKTKEA---IPMYEKLASK 511

Query: 86  QAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM----------IYHGEAFNGKPTKTARS 135
           +    + NVL D Y K    D+ I + K+ + M            +     G   +    
Sbjct: 512 KPTAEVLNVLADAYMKEKLYDKAIMIYKKLIEMNPKKAAYFASAAYAYGLKGDSDRQIEY 571

Query: 136 HGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC 186
           +  +  +    E + +  +LG AY +KG Y  A   Y  A  ++PDA +  
Sbjct: 572 Y--RLSLKFDPEDADVYASLGAAYEKKGLYAEALKAYTDAYELNPDATQVA 620


>gi|157108260|ref|XP_001650149.1| kinesin light chain 1 and [Aedes aegypti]
 gi|108868572|gb|EAT32797.1| AAEL014967-PA [Aedes aegypti]
          Length = 472

 Score = 42.7 bits (99), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 43/175 (24%), Positives = 73/175 (41%), Gaps = 35/175 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEAVEAIKSFRHL-------CSKQAQESLDNVLIDLYKKCGR 104
           V + L  + +V + Q+ + EA++ +     +       C      +L+N+ + LY K G 
Sbjct: 234 VATMLSILTMVYRDQNNLPEAIKHMNEALGIWVRCLGECHPSVAAALNNLAV-LYGKNGN 292

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             E   L K+         A   +     R H          + ++ L NL      +G 
Sbjct: 293 YKEAESLCKR---------ALANRENVLGRYH---------PDVAKQLNNLALLCQNQGK 334

Query: 165 YTSAEVVYRKA--------QLIDPDANKA-CNLSHCLIKQARYTEARSVLEDVLL 210
           +   E+  R+A         +IDP+A K  CNL+ C +K  +Y EA  +L DVL+
Sbjct: 335 HGEVELYIRRALEIFESQLGVIDPNAIKTKCNLAACCLKLRKYKEADQLLRDVLV 389


>gi|376005002|ref|ZP_09782572.1| hypothetical protein ARTHRO_370013 [Arthrospira sp. PCC 8005]
 gi|375326596|emb|CCE18325.1| hypothetical protein ARTHRO_370013 [Arthrospira sp. PCC 8005]
          Length = 915

 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 38/156 (24%), Positives = 74/156 (47%), Gaps = 14/156 (8%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AI  + KAI    ++  A  DM + L ++  +D A++ ++     C     + + N L
Sbjct: 122 EGAIGCYEKAIALNPKLGQAYLDMGLRLNERGDIDTAIKVLQQGGINCPNF--QEIFNTL 179

Query: 96  IDLYKKCGRLDEQIELLKQKLR------MIYH----GEAFNGKPTKTARSHGKKFQVTVK 145
             L  +  ++DE I + ++ L       ++Y+      A  GK ++   ++ K   +++K
Sbjct: 180 GYLQLQQNQIDEAIAIFQESLNIDPTEPLVYNNLGWALACQGKLSEAIAAYEKA--ISLK 237

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
            + +    NLG  + QK N+  A   ++KA  I+PD
Sbjct: 238 PDLAIAYSNLGKLWQQKNNHRQAISYFQKAIAIEPD 273


>gi|297833104|ref|XP_002884434.1| hypothetical protein ARALYDRAFT_896453 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330274|gb|EFH60693.1| hypothetical protein ARALYDRAFT_896453 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 977

 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 58/251 (23%), Positives = 93/251 (37%), Gaps = 46/251 (18%)

Query: 16  PPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA 75
           P     Y+   +V      P  AI+ +  A+        A  ++A +  +Q ++D A   
Sbjct: 254 PAFPDAYLNLGNVYKALGRPTEAIMCYQHALQMRPNCAMAFGNIASIYYEQGQLDLA--- 310

Query: 76  IKSFRHLCSK-----QAQESLDNVLIDLYKKCGRLDEQIELLKQKLR--------MIYHG 122
           I+ ++   S+     +A  +L N L D+    GR+DE I    Q L         M   G
Sbjct: 311 IRHYKQAISRDPRFLEAYNNLGNALKDI----GRVDEAIRCYNQCLALQPNHPQAMANLG 366

Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
             +         S   K  + V    S    NL   Y Q+GNY+ A   Y +   IDP A
Sbjct: 367 NIYMEWNMMGPASSLFKATLAVTTGLSAPFNNLAIIYKQQGNYSDAISCYNEVLRIDPLA 426

Query: 183 NKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSL 242
                                   D L+ +  G+T  K I RV E +Q+     +  P++
Sbjct: 427 -----------------------ADALVNR--GNT-YKEIGRVTEAIQDYMHAINFRPTM 460

Query: 243 TTKKSSLEDAF 253
               ++L  A+
Sbjct: 461 AEAHANLASAY 471


>gi|452003920|gb|EMD96377.1| hypothetical protein COCHEDRAFT_1201158 [Cochliobolus heterostrophus
            C5]
          Length = 1255

 Score = 42.4 bits (98), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 52/207 (25%), Positives = 86/207 (41%), Gaps = 42/207 (20%)

Query: 43   WKAINAGDRVD-SALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQAQESL--DNVL 95
            +KA+  G+ +    + ++ ++ K++ R  +A    +EA++ F  +     + +L   N L
Sbjct: 854  YKALEPGNLLTFKTVANLGLLYKERSRFADAERMYIEALQGFEQVLGSNHKTTLRTANNL 913

Query: 96   IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNL 155
              LYK  GRL +   + KQ L+             +   + G K   T++      + +L
Sbjct: 914  GILYKNQGRLVDAERIHKQVLK-------------RKEDALGSKHLSTLQ-----TIDSL 955

Query: 156  GWAYMQKGNYTSAEVVYRKA------------QLIDPDANKACNLSHCLIKQARYTEARS 203
            G  YM + N T AE +Y KA             L         NL HC   Q R  +A  
Sbjct: 956  GVVYMNQSNLTKAEELYEKALNGKEEVLGSKHHLTLQTVGNLGNLYHC---QGRLVDAER 1012

Query: 204  VLEDVLLG--KLSGSTETKTINRVKEL 228
            +LE VL G   + G     T+N +  L
Sbjct: 1013 MLERVLQGYKDVFGHKHPSTLNTINNL 1039


>gi|449449593|ref|XP_004142549.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC-like [Cucumis
           sativus]
          Length = 975

 Score = 42.4 bits (98), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 52/227 (22%), Positives = 88/227 (38%), Gaps = 36/227 (15%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P+ AIV + +AI        A  ++A    +Q ++D A+   K     C  +  E+ +N 
Sbjct: 281 PQEAIVCYQRAIQMRPNYAIAYGNLASTYYEQSQLDMAILHYKQ-AITCDPRFLEAYNN- 338

Query: 95  LIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPT-----KTARSHGKKFQVTVKQET 148
           L +  K+ GR++E I+   Q L +   H +A               +    ++ T++  T
Sbjct: 339 LGNALKEFGRVEEAIQCYNQCLALQPSHPQALTNLGNIYMEWNMVPAAASYYKATLRVTT 398

Query: 149 --SRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
             S    NL   Y Q+GNY  A   Y +   IDP A                        
Sbjct: 399 GLSAPFNNLAIIYKQQGNYADAISCYNEVLRIDPLAAD---------------------- 436

Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
               G ++     K I RV E +Q+     +I P++    ++L  A+
Sbjct: 437 ----GLVNRGNTYKEIGRVSEAIQDYIRAINIRPTMAEAHANLASAY 479


>gi|406981058|gb|EKE02578.1| hypothetical protein ACD_20C00357G0001 [uncultured bacterium]
          Length = 536

 Score = 42.4 bits (98), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 45/163 (27%), Positives = 68/163 (41%), Gaps = 37/163 (22%)

Query: 54  SALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQA--QESLDNVLIDLYKKCGRLDEQIEL 111
           +A K +  + ++Q   D A+E  +   HL    A    +L N+L   Y K G+ DE I  
Sbjct: 278 TAYKSLCSIYEEQSDYDSAIEMYRKLIHLNPNDAVFYSNLANLL---YLK-GKFDEAISA 333

Query: 112 LKQKLRMIYHGEAFNGKPTK-----TARSHGKKFQ-------VTVKQETSRILGN----- 154
            +  + +          P K      A++ G  FQ         +    +  L N     
Sbjct: 334 YQTAVTL---------NPNKRWTSVIAQTLGYIFQESKENVDAAISSYQNATLLNPEDID 384

Query: 155 ----LGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCL 192
               LG A+  KG+Y +A +VYR A  IDP +A   CNL + L
Sbjct: 385 TYISLGSAFYDKGDYNNALIVYRTALEIDPNNARIHCNLGYLL 427


>gi|186681165|ref|YP_001864361.1| hypothetical protein Npun_R0666 [Nostoc punctiforme PCC 73102]
 gi|186463617|gb|ACC79418.1| TPR repeat-containing protein [Nostoc punctiforme PCC 73102]
          Length = 532

 Score = 42.4 bits (98), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 53/231 (22%), Positives = 101/231 (43%), Gaps = 22/231 (9%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + KA+       +A  ++   L  Q ++ EA+ A +    +    A     N+ + 
Sbjct: 287 AIAAYQKALQIDPNYVNAHCNLGKALHTQGKLSEAMAAYQRALRVDPNYASAHC-NLGVT 345

Query: 98  LYKKCGRLDEQIELLKQKLRM--------IYHGEAFN--GKPTKTARSHGKKFQVTVKQE 147
           LY + G+L E I   ++ L++           G+A +  GK ++   ++ +  +V     
Sbjct: 346 LYHQ-GKLSEAIAAYQKALQIDPNYVNAHCNLGKALHTQGKLSEAMAAYQRALRVDPNYA 404

Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVLE 206
           ++    NLG     +G  + A   Y++A  +DP DA+  CNL   L  Q + +EA +  +
Sbjct: 405 SAHC--NLGVTLYHQGKLSEAIAAYQRALRLDPNDADTHCNLGIALHDQGKLSEAIAAYQ 462

Query: 207 DVLL-------GKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLE 250
             LL          +     K   +++E + ELE    + P+ T  +++LE
Sbjct: 463 RALLIDPNDADAHCNLGIALKGQGKLEEAIAELEIAVRLNPNNTVIRNNLE 513



 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 47/191 (24%), Positives = 82/191 (42%), Gaps = 21/191 (10%)

Query: 33  KDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLD 92
           K  E A+V +  A++       A   + VVL  Q ++ EA+ A +      + Q   +  
Sbjct: 180 KKLEEAVVKYKAALSLDPNSVYAHNALGVVLHTQGKLSEAIAAYQK-----ALQIDPNYV 234

Query: 93  NVLIDLYKKC---GRLDEQIELLKQKLRM--------IYHGEAFN--GKPTKTARSHGKK 139
           N   +L K     G+L E +   ++ LR+           G A +  GK ++   ++ K 
Sbjct: 235 NAHCNLGKALHTQGKLSEAMAAYQRALRLDPNDADTHCNLGIALHDQGKLSEAIAAYQKA 294

Query: 140 FQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARY 198
            Q+      +    NLG A   +G  + A   Y++A  +DP+ A+  CNL   L  Q + 
Sbjct: 295 LQIDPNYVNAHC--NLGKALHTQGKLSEAMAAYQRALRVDPNYASAHCNLGVTLYHQGKL 352

Query: 199 TEARSVLEDVL 209
           +EA +  +  L
Sbjct: 353 SEAIAAYQKAL 363


>gi|225430656|ref|XP_002270163.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC [Vitis vinifera]
 gi|296085167|emb|CBI28662.3| unnamed protein product [Vitis vinifera]
          Length = 986

 Score = 42.4 bits (98), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 53/227 (23%), Positives = 83/227 (36%), Gaps = 36/227 (15%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P+ AIV + +A+        A  +MA    +Q ++D A+   K     C     E+ +N 
Sbjct: 282 PQEAIVCYQRALQTRPEYAMAYGNMAGTYYEQGQMDMAIVHYKQAIE-CDSGFLEAYNN- 339

Query: 95  LIDLYKKCGRLDEQIELLKQKLRM--------IYHGEAFNGKPTKTARSHGKKFQVTVKQ 146
           L +  K  GR+DE I+   Q L +           G  +       A +   K  + V  
Sbjct: 340 LGNALKDVGRIDEAIQCYHQCLALQPNHPQALTNLGNIYMEWNMVAAAATYYKATLAVTT 399

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
             S    NL   Y Q+GNY  A   Y +   IDP A                        
Sbjct: 400 GLSAPFSNLAIIYKQQGNYADAISCYNEVLRIDPLAAD---------------------- 437

Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
               G ++     K I RV E +Q+     +I P++    ++L  A+
Sbjct: 438 ----GLVNRGNTFKEIGRVSEAIQDYIHAITIRPTMAEAHANLASAY 480


>gi|148264105|ref|YP_001230811.1| TPR repeat-containing protein [Geobacter uraniireducens Rf4]
 gi|146397605|gb|ABQ26238.1| Tetratricopeptide TPR_2 repeat protein [Geobacter uraniireducens
           Rf4]
          Length = 573

 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 88/212 (41%), Gaps = 20/212 (9%)

Query: 14  KLPPGDSPYVRA-KHVQLVEKD---PEAAIVLFWKAI-NAGDRVDSALKDMAVVLKQQDR 68
           K+PP  + Y  A  H+  + K+   PE  I L  +AI N  +R++  L ++A + +  D+
Sbjct: 343 KIPPSSAHYFDALGHLGFLYKENGEPEKGIALLKEAITNQPNRIELYL-NLAGLYESMDQ 401

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLK----------QKLRM 118
             E +  +           + S    +  LY K G  DE I  +K          Q L  
Sbjct: 402 FAEGLRVLTDVEGNFPNDPRLSFRMGV--LYDKMGNKDESIARMKKVIALAPNDAQALNY 459

Query: 119 IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEV-VYRKAQL 177
           + +  A  G     A  +  K  V ++ +   IL +LGWAY +   Y  A   + R  QL
Sbjct: 460 LGYTYAELGVNLDEALQYLNK-AVLLRPDDGFILDSLGWAYYKMKRYDQAVFHLERAVQL 518

Query: 178 IDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
           +D DA    +L+        Y +A +    VL
Sbjct: 519 VDEDATIIGHLADAYFANREYRKALTRYRRVL 550


>gi|153868963|ref|ZP_01998678.1| TPR repeat containing protein [Beggiatoa sp. PS]
 gi|152074473|gb|EDN71324.1| TPR repeat containing protein [Beggiatoa sp. PS]
          Length = 838

 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 51/89 (57%), Gaps = 12/89 (13%)

Query: 26  KHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQ---QDRVDEAVEAIKSFRHL 82
           K+ QL EK        + +AI+  DR +SA+KD+A +L+Q   Q+   EA++ ++ +R  
Sbjct: 482 KNFQLAEK-------YYREAISHNDRKESAIKDLASLLQQKRTQEDTKEAIKILEKYRPQ 534

Query: 83  CSKQAQESLDNVLIDLYKKCGRLDEQIEL 111
              QA  S+DN+L+  Y+   R  + IEL
Sbjct: 535 MYDQA--SVDNMLVVYYRLDERYHDVIEL 561


>gi|323452442|gb|EGB08316.1| putative O-linked GlcNAc transferase [Aureococcus anophagefferens]
          Length = 1151

 Score = 42.0 bits (97), Expect = 0.28,   Method: Composition-based stats.
 Identities = 43/161 (26%), Positives = 68/161 (42%), Gaps = 37/161 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV----EAIKSFRHLCSKQAQESLDN 93
           AI  +  A+    R  +A  ++ +VLK++  VD+A+    EAI         Q  ++  N
Sbjct: 385 AIQCYATAVGLAPRFAAAHSNLGLVLKERGTVDDALAHYREAIA-----VDPQFADAYSN 439

Query: 94  VLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILG 153
            + + YK  GRLD+ I           +GEA   +P                 + +    
Sbjct: 440 -MGNAYKDLGRLDDAIRC---------YGEALKLRP-----------------DFADACS 472

Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLI 193
           NL  AY   G +  A   YR+A  + PD + A  NL+H L+
Sbjct: 473 NLAAAYKDGGRHAEAVACYRRALELKPDFSDAFSNLAHSLV 513


>gi|196232460|ref|ZP_03131313.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
 gi|196223532|gb|EDY18049.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
          Length = 792

 Score = 42.0 bits (97), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 40/174 (22%), Positives = 77/174 (44%), Gaps = 17/174 (9%)

Query: 44  KAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESL-DNVLIDLYKKC 102
           +A+        A  ++  VL +  +++EA   + S+R     Q   ++ +N L ++ ++ 
Sbjct: 96  RAVEVAPGFADAHYNLGAVLAELGQMEEA---LASYRRALEIQPTHAVAENNLGNILREL 152

Query: 103 GRLDEQIELLKQKLRM-IYHGEAFN---------GKPTKTARSHGKKFQVTVKQETSRIL 152
            RLDE I   ++ +++   + +A N         GK  +   ++G+  ++  K + + + 
Sbjct: 153 RRLDEAIAAYRRAIQLQPAYADAHNNLGVALSEQGKSDEAIAAYGRALEL--KPDGNAVH 210

Query: 153 GNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEARSVL 205
            NLG A    G Y  A V YR++    P     C  L   L+   R+ EA  V 
Sbjct: 211 ANLGNALRASGRYAEAVVAYRRSLQSSPARLDICQGLGEALVLLGRFDEAGEVF 264


>gi|118357157|ref|XP_001011828.1| TPR Domain containing protein [Tetrahymena thermophila]
 gi|89293595|gb|EAR91583.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
          Length = 494

 Score = 42.0 bits (97), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 77/192 (40%), Gaps = 44/192 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEA------------------------- 72
           AI  F  A+  G  + S L ++ ++  Q  R++EA                         
Sbjct: 193 AINCFLNALEIGLELYSPLINLGIIYSQMGRLEEAQSCYLKILKTHPQDWNALLGLAKLF 252

Query: 73  -----VEAIKSFRHLCSKQAQESLDNV--LIDLYKKCGRLDEQIELLKQKLRM----IYH 121
                +E  K F   CS       DN   ++  Y + G ++E I   +  L+     ++H
Sbjct: 253 TKRGMIEEAKFFLQKCSLIYNLDQDNCDDIVYCYCQLGMIEEAIIWYENALKFMPDSVFH 312

Query: 122 ----GEAF--NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
               G+    NG   K+     K  ++   Q  S IL NLG+AY  +G+Y+ A   Y+++
Sbjct: 313 LIIIGQLHLRNGNIEKSKIFFEKILKIRPNQ--SYILNNLGFAYYLEGDYSKAISYYQQS 370

Query: 176 QLIDPDANKACN 187
           Q I+P+     N
Sbjct: 371 QEINPNVYDTFN 382


>gi|388841092|gb|AFK79142.1| hypothetical protein PCC8801_1300 [uncultured bacterium F39-01]
          Length = 645

 Score = 42.0 bits (97), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 45/165 (27%), Positives = 72/165 (43%), Gaps = 11/165 (6%)

Query: 54  SALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLK 113
           +A  ++   L +  R ++++EA+++   L    A  + +N+   LYK  GR  E IE   
Sbjct: 252 TAYSNLGYALDRLGRSNDSIEALRNAVRLKGDDA-VAYNNLGASLYK-AGRYQEAIEAFG 309

Query: 114 QKLRMIYHG-EAFN--GKPTKTARSHGKKFQ-----VTVKQETSRILGNLGWAYMQKGNY 165
             +R+  +  EA N  G        + +  Q     V VK ++     NLG AY   G Y
Sbjct: 310 NAVRLNPNDVEALNNLGAAYYVTAQYDRALQNFQQAVRVKADSPDAQYNLGNAYYMTGKY 369

Query: 166 TSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
             A   YR+A  +  D  +A  NL   LI      EA + L + +
Sbjct: 370 REATAAYRQAIQLKADYVEARTNLGSLLIALGENQEAITELNESI 414


>gi|294140783|ref|YP_003556761.1| hypothetical protein SVI_2012 [Shewanella violacea DSS12]
 gi|293327252|dbj|BAJ01983.1| TPR domain protein [Shewanella violacea DSS12]
          Length = 343

 Score = 42.0 bits (97), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 22/73 (30%), Positives = 43/73 (58%), Gaps = 3/73 (4%)

Query: 140 FQVTVKQE--TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQA 196
           +++ +++E  ++ I+ NLG+++   GNYTSAE   R+A   DP+ ++   NL    +++ 
Sbjct: 216 YKLALQKEPHSALIITNLGYSHYLTGNYTSAENYLRQAIREDPNFDRGWTNLGLVYVRKG 275

Query: 197 RYTEARSVLEDVL 209
            YT A +  E  +
Sbjct: 276 LYTRALATFEHTM 288


>gi|168056412|ref|XP_001780214.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668364|gb|EDQ54973.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 511

 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 33/60 (55%), Gaps = 4/60 (6%)

Query: 178 IDPDANKACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKELLQELE 233
           I+ D  K CN   CL+KQ R  EA +VL+ V +     +    +  K+  R ++LL+ELE
Sbjct: 73  IESDVKKICNFGVCLMKQGRLEEAEAVLQRVTVACSGIRWPSDSHLKSYERAQDLLKELE 132


>gi|339486353|ref|YP_004700881.1| putative glycosyl transferase [Pseudomonas putida S16]
 gi|338837196|gb|AEJ12001.1| putative glycosyl transferase [Pseudomonas putida S16]
          Length = 1147

 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 28/104 (26%), Positives = 49/104 (47%), Gaps = 6/104 (5%)

Query: 148 TSRILGNLGWAYMQKGNYTSA-EVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
           ++ +   LG +YM+ G+++ A +     AQL +P    A N +HCL K  R  EA  VL 
Sbjct: 238 SAEVFNRLGGSYMEDGHFSIALQYFNAAAQLPNPPVWTAFNRAHCLAKLHRLDEAIQVLS 297

Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLE 250
           D +    +   +   + R+ E     + W+ +   L T+   L+
Sbjct: 298 DGIASNPNHRPQQDELERLAE-----QKWRDLHAKLMTRVDMLD 336


>gi|224144666|ref|XP_002325368.1| predicted protein [Populus trichocarpa]
 gi|222862243|gb|EEE99749.1| predicted protein [Populus trichocarpa]
          Length = 873

 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 48/227 (21%), Positives = 86/227 (37%), Gaps = 36/227 (15%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P+ AI+ + +A+        A  ++A    ++ +V+ A+   K     C ++  E+ +N 
Sbjct: 169 PQEAIMCYQRAVQTRPNYAMAFGNLASTCYERGQVELAILHYKQ-AIACDQRFLEAYNN- 226

Query: 95  LIDLYKKCGRLDEQIELLKQKLR--------MIYHGEAFNGKPTKTARSHGKKFQVTVKQ 146
           L +  K  GR+DE ++   Q L         +   G  +       A +   K  + V  
Sbjct: 227 LGNALKDVGRVDEALQCYNQCLSIQPNHPQALTNLGNIYMEWNMSAAAASCYKATLAVTT 286

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
             S    NL   Y Q+GNY+ A   Y +   I+P A                        
Sbjct: 287 GLSAPFNNLAVIYKQQGNYSDAISCYNEVLRIEPMAAD---------------------- 324

Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
               G ++     K I RV E +Q+     +I P++    ++L  A+
Sbjct: 325 ----GLVNRGNTYKEIGRVSEAIQDYIHAINIRPTMAEAHANLASAY 367



 Score = 37.4 bits (85), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 63/149 (42%), Gaps = 31/149 (20%)

Query: 45  AINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGR 104
           A+  G  + +   ++AV+ KQQ    +A+        +    A + L N   + YK+ GR
Sbjct: 283 AVTTG--LSAPFNNLAVIYKQQGNYSDAISCYNEVLRI-EPMAADGLVN-RGNTYKEIGR 338

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
           + E I+          +  A N +PT  A +H                 NL  AY   G+
Sbjct: 339 VSEAIQ---------DYIHAINIRPT-MAEAHA----------------NLASAYKDSGH 372

Query: 165 YTSAEVVYRKAQLIDPDANKA-CNLSHCL 192
             +A   YR+A L+ PD  +A CNL H L
Sbjct: 373 VEAAIKSYRQALLLRPDFPEATCNLLHTL 401


>gi|254445427|ref|ZP_05058903.1| tetratricopeptide repeat domain protein [Verrucomicrobiae bacterium
           DG1235]
 gi|198259735|gb|EDY84043.1| tetratricopeptide repeat domain protein [Verrucomicrobiae bacterium
           DG1235]
          Length = 568

 Score = 41.6 bits (96), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 61/243 (25%), Positives = 97/243 (39%), Gaps = 35/243 (14%)

Query: 12  VHKLPPGDS-PYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVD 70
           + ++ P DS P++   +     K    A+  + KAI  G    S    +   L +  +V+
Sbjct: 266 MEEIDPTDSNPHMLRGYAYYQVKRYGQAVPHYRKAIALGGDTASLYYYLTTCLSESGQVE 325

Query: 71  EAVEAIKSFRHLCSKQAQESLDN---------VLIDLYKKCGRLDEQIELLKQKLRMIYH 121
           EA            + A+E L+          VL D+  K G  DE +ELL+Q L +   
Sbjct: 326 EA-----------ERMAREGLERAPQTVDLMVVLSDIVLKRGLRDEGVELLQQALALAPA 374

Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILG--------NLGWAYMQKGNYTSAEVVYR 173
             + N    K     GKK     + E  R L          LG  Y++KG++ +A  V  
Sbjct: 375 HISANRTLAKLYWEEGKKEAALDQLEMLRTLQPTDFATRMTLGNYYLEKGDFEAAIAVVE 434

Query: 174 KAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLS------GSTETKTINRVKE 227
           +A+ IDP+     NL    + +    EAR    +  +   S       S E   +N+VK 
Sbjct: 435 EARRIDPENTDIRNLLALSLMRQGNEEAREGAFEKAVAVYSRAIEADASLEEAYLNKVKV 494

Query: 228 LLQ 230
           L+ 
Sbjct: 495 LMH 497


>gi|381206067|ref|ZP_09913138.1| hypothetical protein SclubJA_10624 [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 524

 Score = 41.6 bits (96), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 36/55 (65%), Gaps = 1/55 (1%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDV 208
           LG++Y ++ ++  AE +YR++  ++P+  K+   L   LIK+ RY +ARS LE V
Sbjct: 413 LGFSYEKQRDFNLAESLYRESLQLNPENPKSGLRLGTVLIKKGRYEDARSFLESV 467


>gi|163781658|ref|ZP_02176658.1| hypothetical protein HG1285_02208 [Hydrogenivirga sp. 128-5-R1-1]
 gi|159882878|gb|EDP76382.1| hypothetical protein HG1285_02208 [Hydrogenivirga sp. 128-5-R1-1]
          Length = 159

 Score = 41.6 bits (96), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 36/68 (52%), Gaps = 1/68 (1%)

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEARSV 204
           Q T++    LGWAY  K +Y  A    +KA  IDPD  N   ++   LI+Q R+ EA   
Sbjct: 38  QPTAKAYTFLGWAYSMKKDYDKAIEYCKKAIEIDPDFGNPYNDIGAYLIEQGRFEEAIPW 97

Query: 205 LEDVLLGK 212
           L+  ++ K
Sbjct: 98  LKQAIVAK 105


>gi|326912317|ref|XP_003202500.1| PREDICTED: transmembrane and TPR repeat-containing protein 1-like
           [Meleagris gallopavo]
          Length = 794

 Score = 41.6 bits (96), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 77/188 (40%), Gaps = 22/188 (11%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLC--SKQAQESLDN 93
           E A++L   +I  G     A   +A +L +Q+R+ EA E  K+    C  S     +   
Sbjct: 477 EEAVILLRDSIKYGSDFADAYSSLASLLAEQERLKEAEEVYKAGIENCPESSDLHNNYGV 536

Query: 94  VLIDLYKKCGRLDEQIELLKQKLRM--IYHGEAFN--------GKPTKTARSHGKKFQVT 143
            L+D     G  +  +   +Q + +   +H    N        G+  +    + +  +V+
Sbjct: 537 FLVD----TGAPERAVSHYRQAIHLSPTHHVAMVNLGRLHRSLGQNKEAEAWYKRALKVS 592

Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN--LSHCLIKQARYTEA 201
            K E   IL  LG  Y   G Y  A  VYR+A L+ P +NK     L+  L    R  EA
Sbjct: 593 RKAE---ILSPLGALYYNTGRYEEALQVYREAALLQP-SNKETRLALAQVLAMMGRTKEA 648

Query: 202 RSVLEDVL 209
             +   +L
Sbjct: 649 EKMTNHIL 656


>gi|224088480|ref|XP_002308458.1| predicted protein [Populus trichocarpa]
 gi|222854434|gb|EEE91981.1| predicted protein [Populus trichocarpa]
          Length = 934

 Score = 41.6 bits (96), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 37/151 (24%), Positives = 64/151 (42%), Gaps = 30/151 (19%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AIV +  A +       A  ++ V+ K +D +D+AVE  ++   +    +Q SL+N+ 
Sbjct: 317 EMAIVFYELAFHFNPHCAEACNNLGVIYKDRDNLDKAVECYQATLSIKPNFSQ-SLNNLG 375

Query: 96  IDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
           + +Y   G++D    ++++ +     + EA+N                           N
Sbjct: 376 V-VYTVQGKMDAAASMIEKAIMANPTYAEAYN---------------------------N 407

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           LG  Y   GN T A   Y +   IDPD+  A
Sbjct: 408 LGVLYRDVGNITMAISAYEQCLEIDPDSRNA 438


>gi|152997603|ref|YP_001342438.1| TPR repeat-containing protein [Marinomonas sp. MWYL1]
 gi|150838527|gb|ABR72503.1| Tetratricopeptide TPR_2 repeat protein [Marinomonas sp. MWYL1]
          Length = 603

 Score = 41.6 bits (96), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 30/59 (50%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
           I+ +LGW +  KG +  +    +KA  I PDA  A +    L KQ  Y EA +V E  L
Sbjct: 515 IIDSLGWGFFLKGEFEQSSYYLKKAWSILPDAEIAAHYGESLWKQRHYKEAIAVWEAAL 573


>gi|242081821|ref|XP_002445679.1| hypothetical protein SORBIDRAFT_07g024110 [Sorghum bicolor]
 gi|241942029|gb|EES15174.1| hypothetical protein SORBIDRAFT_07g024110 [Sorghum bicolor]
          Length = 910

 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 38/150 (25%), Positives = 64/150 (42%), Gaps = 28/150 (18%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AIV +  A++   R   A  ++ V+ K +D +D+AVE  +    +    AQ SL+N+ 
Sbjct: 303 EMAIVFYELALHFNPRCAEACNNLGVIYKDRDNLDKAVECYQMALSIKPLFAQ-SLNNLG 361

Query: 96  IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNL 155
           + +Y   G++D    ++++         A +  PT                  +    NL
Sbjct: 362 V-VYTVQGKMDSAASMIEK---------AIHANPT-----------------YAEAYNNL 394

Query: 156 GWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           G  Y   G+ T A   Y +   IDPD+  A
Sbjct: 395 GVLYRDAGSITLAIHAYERCLQIDPDSRNA 424


>gi|91204003|emb|CAJ71656.1| hypothetical protein kustc0911 [Candidatus Kuenenia
           stuttgartiensis]
          Length = 470

 Score = 41.2 bits (95), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 34/60 (56%), Gaps = 1/60 (1%)

Query: 153 GNLGWAYMQKGNYTSAEVVYRKAQLIDPDA-NKACNLSHCLIKQARYTEARSVLEDVLLG 211
           G LG+AY    N+ SAE  +R A L+DP   +    L+  L KQ RY EA S+ + ++ G
Sbjct: 180 GLLGFAYSSVENHLSAESSFRMAVLLDPATIDWKMGLARSLFKQERYAEAVSLCDSLISG 239


>gi|406835251|ref|ZP_11094845.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 751

 Score = 41.2 bits (95), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 30/54 (55%), Gaps = 1/54 (1%)

Query: 153 GNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVL 205
           G LG  Y ++ N+T AE  YR+A  +DP  N A   L   L+ Q R+ EAR  L
Sbjct: 481 GTLGICYARQENFTRAEDCYRRALTLDPHQNFARIALGRTLLAQDRFDEAREEL 534


>gi|414870070|tpg|DAA48627.1| TPA: hypothetical protein ZEAMMB73_519907, partial [Zea mays]
 gi|414870071|tpg|DAA48628.1| TPA: hypothetical protein ZEAMMB73_519907, partial [Zea mays]
 gi|414870072|tpg|DAA48629.1| TPA: hypothetical protein ZEAMMB73_519907, partial [Zea mays]
          Length = 786

 Score = 41.2 bits (95), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 38/150 (25%), Positives = 64/150 (42%), Gaps = 28/150 (18%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AIV +  A++   R   A  ++ V+ K +D +D+AVE  +    +    AQ SL+N+ 
Sbjct: 303 EMAIVFYELALHFNPRCAEACNNLGVIYKDRDNLDKAVECYQMALSIKPLFAQ-SLNNLG 361

Query: 96  IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNL 155
           + +Y   G++D    ++++         A +  PT                  +    NL
Sbjct: 362 V-VYTVQGKMDSAASMIEK---------AIHANPTY-----------------AEAYNNL 394

Query: 156 GWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           G  Y   G+ T A   Y +   IDPD+  A
Sbjct: 395 GVLYRDAGSITLAIHAYERCLQIDPDSRNA 424


>gi|354603236|ref|ZP_09021235.1| hypothetical protein HMPREF9450_00150 [Alistipes indistinctus YIT
           12060]
 gi|353349113|gb|EHB93379.1| hypothetical protein HMPREF9450_00150 [Alistipes indistinctus YIT
           12060]
          Length = 268

 Score = 41.2 bits (95), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 33/60 (55%)

Query: 156 GWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG 215
           G A  ++ NY  AE+ YRKAQ   P      NL+  L KQ RY E  ++L+ +   +++G
Sbjct: 38  GNALYEQENYPEAEIRYRKAQEQSPTYEGLFNLADALYKQKRYDEVLNILKQISRDEVAG 97


>gi|118385554|ref|XP_001025906.1| TPR Domain containing protein [Tetrahymena thermophila]
 gi|89307673|gb|EAS05661.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
          Length = 1032

 Score = 41.2 bits (95), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 50/185 (27%), Positives = 85/185 (45%), Gaps = 23/185 (12%)

Query: 16   PPGDSPY----VRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDE 71
            P  DS Y    +  K   L++K    AI  + K +    + DS   ++    K +  VDE
Sbjct: 845  PENDSCYYNLGIACKSKGLLDK----AIQSYQKCLEIHPKNDSCYYNLGKAYKSKGLVDE 900

Query: 72   AVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM------IYH--GE 123
            A+++ +S   +  K   ++  N L + YK  G LDE I+  +  L++       Y+  G+
Sbjct: 901  AIKSYQSSIEINPKV--DAYYNSLGNAYKVKGLLDEAIKSYQNCLKINPNYNSCYYKLGQ 958

Query: 124  AFNGKP--TKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
            A+  K    +  +S+ K  ++  K ++     NLG AY  KG    A   Y+K   ++P+
Sbjct: 959  AYKSKGLLDEAIKSYQKYLEINPKNDSCYY--NLGLAYKSKGLLDEAIKSYQKCLSLNPN 1016

Query: 182  ANKAC 186
             NK C
Sbjct: 1017 -NKNC 1020


>gi|333899123|ref|YP_004472996.1| hypothetical protein Psefu_0924 [Pseudomonas fulva 12-X]
 gi|333114388|gb|AEF20902.1| Tetratricopeptide TPR_2 repeat-containing protein [Pseudomonas
           fulva 12-X]
          Length = 597

 Score = 41.2 bits (95), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 31/63 (49%)

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
           E   IL +LGWA  + GN T AE   R+A    PD   A +L   L  Q +  EAR +  
Sbjct: 513 EDPAILDSLGWANYRLGNLTDAEKWLRQAFERFPDHEVAAHLGEALWAQGKQREARKIWR 572

Query: 207 DVL 209
           D L
Sbjct: 573 DAL 575


>gi|222423063|dbj|BAH19513.1| AT3G04240 [Arabidopsis thaliana]
          Length = 750

 Score = 41.2 bits (95), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 57/251 (22%), Positives = 93/251 (37%), Gaps = 46/251 (18%)

Query: 16  PPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA 75
           P     Y+   +V      P  AI+ +  A+        A  ++A +  +Q ++D A   
Sbjct: 254 PAFPDAYLNLGNVYKALGRPTEAIMCYQHALQMRPNSAMAFGNIASIYYEQGQLDLA--- 310

Query: 76  IKSFRHLCSK-----QAQESLDNVLIDLYKKCGRLDEQIELLKQKLR--------MIYHG 122
           I+ ++   S+     +A  +L N L D+    GR+DE +    Q L         M   G
Sbjct: 311 IRHYKQALSRDPRFLEAYNNLGNALKDI----GRVDEAVRCYNQCLALQPNHPQAMANLG 366

Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
             +         S   K  + V    S    NL   Y Q+GNY+ A   Y +   IDP A
Sbjct: 367 NIYMEWNMMGPASSLFKATLAVTTGLSAPFNNLAIIYKQQGNYSDAISCYNEVLRIDPLA 426

Query: 183 NKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSL 242
                                   D L+ +  G+T  K I RV E +Q+     +  P++
Sbjct: 427 -----------------------ADALVNR--GNT-YKEIGRVTEAIQDYMHAINFRPTM 460

Query: 243 TTKKSSLEDAF 253
               ++L  A+
Sbjct: 461 AEAHANLASAY 471


>gi|218439610|ref|YP_002377939.1| hypothetical protein PCC7424_2659 [Cyanothece sp. PCC 7424]
 gi|218172338|gb|ACK71071.1| Tetratricopeptide TPR_2 repeat protein [Cyanothece sp. PCC 7424]
          Length = 565

 Score = 41.2 bits (95), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 47/90 (52%), Gaps = 5/90 (5%)

Query: 126 NGKPTKTARSHGKKFQV----TVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
           N + T+  +++ + F +    TV+ + +  L  LG   ++KGN + A + Y+ A  IDP 
Sbjct: 85  NQQFTEALKAYNQAFLINPDLTVQSDLASDLTRLGAVLLKKGNVSEAIIAYQIAVAIDPL 144

Query: 182 ANKAC-NLSHCLIKQARYTEARSVLEDVLL 210
           A +A  NL   L +Q R+ EA      V+L
Sbjct: 145 AIEANHNLGKALAQQQRWQEAGQAFNQVIL 174


>gi|408421469|ref|YP_006762883.1| hypothetical protein TOL2_C40240 [Desulfobacula toluolica Tol2]
 gi|405108682|emb|CCK82179.1| tetratricopeptide repeat protein [Desulfobacula toluolica Tol2]
          Length = 1572

 Score = 41.2 bits (95), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 31/61 (50%), Gaps = 1/61 (1%)

Query: 145 KQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEARS 203
           K +  RI  NLG      G YT AE  YRKA    P  +K C NL   L +Q ++ E R+
Sbjct: 578 KDQLYRIYNNLGSLMNSSGEYTDAETWYRKAMKTCPTKSKPCTNLIKVLHRQNKHNEIRT 637

Query: 204 V 204
           V
Sbjct: 638 V 638


>gi|159901659|ref|YP_001547905.1| hypothetical protein Haur_5149 [Herpetosiphon aurantiacus DSM 785]
 gi|159894698|gb|ABX07777.1| Tetratricopeptide TPR_2 repeat protein [Herpetosiphon aurantiacus
           DSM 785]
          Length = 1105

 Score = 41.2 bits (95), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 51/187 (27%), Positives = 81/187 (43%), Gaps = 43/187 (22%)

Query: 44  KAINAGDRVDSAL--KDMAVVLKQQDRVDEA----------VEAIKSFRHLCSKQAQESL 91
           +A+   D  D+A+   ++A VL+ Q R  EA           EA+    H  + ++  +L
Sbjct: 694 EAVLGLDHPDTAVSVNNLASVLESQGRYSEARGLYERALEVTEAVLGREHPDTARSVNNL 753

Query: 92  DNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI 151
            +VL     + GR  E   L +Q L +    EA  G      R H          +T+R 
Sbjct: 754 ASVL----ARQGRYSEAQPLYEQALAV---NEAVLG------REH---------PDTARS 791

Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLI--------DPD-ANKACNLSHCLIKQARYTEAR 202
           + NL      +G Y+ A+ +Y +A  +         PD A    NL+  L+ Q RY+EA+
Sbjct: 792 VNNLASVLESQGRYSEAQPLYEQALAVREAVLGENHPDTAMSMNNLAMVLLNQGRYSEAQ 851

Query: 203 SVLEDVL 209
            +LE  L
Sbjct: 852 GLLERTL 858


>gi|327399475|ref|YP_004340344.1| hypothetical protein Hipma_1328 [Hippea maritima DSM 10411]
 gi|327182104|gb|AEA34285.1| Tetratricopeptide TPR_1 repeat-containing protein [Hippea maritima
           DSM 10411]
          Length = 260

 Score = 41.2 bits (95), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 48/193 (24%), Positives = 80/193 (41%), Gaps = 23/193 (11%)

Query: 30  LVEK-DPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQ 88
           LVE+ + E AIV F  A+      + A  ++     Q D  D+A          C+K  +
Sbjct: 15  LVEQGECEKAIVYFLNAVKENPSFEDAYIEIGYCYAQLDNFDDA-------EDYCNKAIE 67

Query: 89  ESLDNV-----LIDLYKKCGRLDEQIELLKQKL-RM------IYH--GEAFNGKPTKTAR 134
            + +N+     L  +Y K G  +++IE L + + R+      IY   G A+         
Sbjct: 68  INPNNLEAYNTLAMIYHKFGFFEDEIEALNEIIIRLDEPDASIYLNIGNAYYELGENDRA 127

Query: 135 SHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLI 193
                  + ++ + +    N+G AYM K  Y  A   Y++A  IDP+ ++   NL     
Sbjct: 128 IEFYDMAIGMEPDFAEAYANMGNAYMAKDEYIKATEAYKQALQIDPNMSDVYLNLGIVYG 187

Query: 194 KQARYTEARSVLE 206
           +   Y EA    E
Sbjct: 188 ELGSYDEAVKYFE 200


>gi|82617834|gb|ABB84829.1| VIT-vWFA-RpoN multidomain protein [uncultured delta proteobacterium
           DeepAnt-1F12]
          Length = 1156

 Score = 41.2 bits (95), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 28/49 (57%)

Query: 161 QKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
           Q  ++   E +  + QL DP    A +L  CL+ QARY E  +++EDV+
Sbjct: 851 QGNDFEYVEEILERVQLFDPPGVGARDLRECLLIQARYIELGAIVEDVI 899


>gi|392409193|ref|YP_006445800.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Desulfomonile tiedjei DSM 6799]
 gi|390622329|gb|AFM23536.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Desulfomonile tiedjei DSM 6799]
          Length = 583

 Score = 41.2 bits (95), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 1/72 (1%)

Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQAR 197
           K  +T + + + +L NLG A  Q G    A +V +KA  IDP+ + A  NL H   K+ R
Sbjct: 63  KHALTYEPKNASVLSNLGNALNQIGLVDEAVMVLKKAIEIDPNHDLAHDNLGHIYSKRER 122

Query: 198 YTEARSVLEDVL 209
           Y +A+ +LE  L
Sbjct: 123 YDQAKELLEMAL 134


>gi|302559445|ref|ZP_07311787.1| LOW QUALITY PROTEIN: regulatory protein AfsR [Streptomyces
           griseoflavus Tu4000]
 gi|302477063|gb|EFL40156.1| LOW QUALITY PROTEIN: regulatory protein AfsR [Streptomyces
           griseoflavus Tu4000]
          Length = 649

 Score = 41.2 bits (95), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 58/137 (42%), Gaps = 26/137 (18%)

Query: 48  AGDRVDSA--LKDMAVVLKQQDRVDEAVE----AIKSFRHLCSKQAQESLDNVLIDLYKK 101
           AGDR   A  L D A  L+   R DEA+E    A++    L   +A+ S+ + L D Y  
Sbjct: 416 AGDREGEAQSLSDTAAALRSSGRPDEAIELLHEALELNEELGDGEARSSVISNLGDAYLH 475

Query: 102 CGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQ 161
            GRLD+ +E  ++ L +               R  G  +   +       L NLG AY +
Sbjct: 476 AGRLDKSVEYTRRGLAL--------------DRVAGSVWGEGIA------LSNLGDAYQR 515

Query: 162 KGNYTSAEVVYRKAQLI 178
            G Y  A     +A ++
Sbjct: 516 MGRYDEALGCLERALVV 532


>gi|113474770|ref|YP_720831.1| hypothetical protein Tery_0977 [Trichodesmium erythraeum IMS101]
 gi|110165818|gb|ABG50358.1| TPR repeat [Trichodesmium erythraeum IMS101]
          Length = 1215

 Score = 41.2 bits (95), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 46/166 (27%), Positives = 73/166 (43%), Gaps = 35/166 (21%)

Query: 52  VDSALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
           V ++L ++A + ++Q R  EA    ++A+   + L   +      SL+N L +LY+  GR
Sbjct: 126 VATSLNNLAFLYQRQGRYTEAEPLYIQALDMIKKLLGAEHPLVATSLNN-LAELYRVQGR 184

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             E   L +Q L+M                   KK       + +  L +L   Y  +G 
Sbjct: 185 YTEAEPLYQQALKM------------------RKKLLGAKHPDVATSLNSLALLYKDQGR 226

Query: 165 YTSAEVVYRKA-----QLID---PD-ANKACNLSHCLIKQARYTEA 201
           YT AE +Y +A     +L+    PD A+   NL+     Q RYTEA
Sbjct: 227 YTEAEPLYIQALEMRKKLLGAEHPDVASSLNNLAELYRSQGRYTEA 272


>gi|118362609|ref|XP_001014531.1| TPR Domain containing protein [Tetrahymena thermophila]
 gi|89296298|gb|EAR94286.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
          Length = 1163

 Score = 41.2 bits (95), Expect = 0.58,   Method: Composition-based stats.
 Identities = 41/177 (23%), Positives = 79/177 (44%), Gaps = 19/177 (10%)

Query: 38   AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS-----------FRHLCSKQ 86
            AI+++ K++N   ++D    ++ V  + + R+DEA++  +            F +L +  
Sbjct: 896  AILIYEKSLNLNIKIDVCCLNLGVCYEIKGRIDEAIKKYQQSIEINPANDVCFLNLGNAY 955

Query: 87   AQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQ 146
              + + +  I  Y+KC +L+ + E     L  +Y      G+  K  + + K   +  K+
Sbjct: 956  LNKGMFDEAIQAYQKCLQLNPKKEACYLNLGNVYQ---IKGELDKAIKCYQKCIILNPKK 1012

Query: 147  ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC--NLSHCLIKQARYTEA 201
            +   +  NLG AY  KGN   +   Y+K   ++P  N  C  NL +    +    EA
Sbjct: 1013 DICYL--NLGNAYQNKGNLEESIKNYQKCLNLNPK-NDTCLENLGNAFKNKGMIEEA 1066


>gi|383772223|ref|YP_005451289.1| hypothetical protein S23_39820 [Bradyrhizobium sp. S23321]
 gi|381360347|dbj|BAL77177.1| hypothetical protein S23_39820 [Bradyrhizobium sp. S23321]
          Length = 695

 Score = 41.2 bits (95), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 69/166 (41%), Gaps = 15/166 (9%)

Query: 58  DMAVVLKQQDRVDEAVEAIKSF--RHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQK 115
           ++AVVL +Q R+ EA EA ++   R    +    +L N+L+D      RLDE +   +  
Sbjct: 148 NLAVVLAEQGRLTEAEEAYRAVIAREPAYRGVWLNLGNLLMDQT----RLDEAVTAFRCA 203

Query: 116 LRMIYHGEAFNGKPTKTARSHG--------KKFQVTVKQETSRILGNLGWAYMQKGNYTS 167
           +          G         G         +  V ++ + +  L  LG    + G+   
Sbjct: 204 IEADPDDPGLPGNLGAALYRQGLLDDAIVQYRRAVALQPDNAPALRLLGLVLHEAGHLPE 263

Query: 168 AEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVLEDVLLGK 212
           A  +YR++  +DP D   A NL  CL +     EA +  E  LL K
Sbjct: 264 AAEMYRQSFALDPADHLIASNLGACLSELGTLAEAIAACEQALLLK 309


>gi|418066009|ref|ZP_12703377.1| Tetratricopeptide TPR_2 repeat protein [Geobacter metallireducens
           RCH3]
 gi|373561242|gb|EHP87481.1| Tetratricopeptide TPR_2 repeat protein [Geobacter metallireducens
           RCH3]
          Length = 643

 Score = 41.2 bits (95), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 49/192 (25%), Positives = 83/192 (43%), Gaps = 31/192 (16%)

Query: 18  GDSPYVRAKHVQ--LVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA 75
           G++P +  K  +  L  K+P+ A+  +  A+        A +++A + ++ ++ +EA + 
Sbjct: 400 GNNPVLHLKLARALLGAKNPKEAMASYQDALKIAPDNLEARRELAALYRKGNQNEEAEKQ 459

Query: 76  IKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARS 135
            K    +    A+    N+L  LY K  + DE I LLK  + +          P   A S
Sbjct: 460 YKEILRIKKDDAEAR--NILTALYVKSKKYDELITLLKDGVELA---------PNDPA-S 507

Query: 136 HGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIK 194
           H K                LG  Y  K +Y +AE  Y+KA  +  D  K+ N L    +K
Sbjct: 508 HYK----------------LGLIYEFKKDYDAAEQEYKKAVELKGDHAKSLNALGRIYLK 551

Query: 195 QARYTEARSVLE 206
             + +EA+  LE
Sbjct: 552 TGKISEAKGALE 563


>gi|327401541|ref|YP_004342380.1| hypothetical protein Arcve_1665 [Archaeoglobus veneficus SNP6]
 gi|327317049|gb|AEA47665.1| Tetratricopeptide TPR_1 repeat-containing protein [Archaeoglobus
           veneficus SNP6]
          Length = 222

 Score = 41.2 bits (95), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 43/211 (20%), Positives = 80/211 (37%), Gaps = 20/211 (9%)

Query: 33  KDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLD 92
           ++PE  +  F K +         L ++ + L +  R+DEA+  I     L    A    +
Sbjct: 16  RNPEEEVEFFSKCLEMEPENIYVLNNLGIALYELGRIDEAISYIDRALELNPDYADAWYN 75

Query: 93  NVLIDLYKKCGRLDEQIELLKQ-------------KLRMIYHGEAFNGKPTKTARSHGKK 139
             ++      G+ DE I   ++              + + Y+     GK  +  R     
Sbjct: 76  RGIV--LSDAGKYDEAIACFEKAIALNPDDAAAWNNMGLAYYESGNMGKAIECYRKC--- 130

Query: 140 FQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYT 199
             V++ +E +    N+G AY + G +  AE  ++KA  +D   +   NL     K  +Y 
Sbjct: 131 --VSIDEEHAAAWYNMGLAYYESGRFNKAEESFKKALELDESVDTLNNLGIVYGKLRQYE 188

Query: 200 EARSVLEDVLLGKLSGSTETKTINRVKELLQ 230
            A      +L      S  T  +  +K L++
Sbjct: 189 MAMECFNRILEIDPENSAATYNLLILKRLME 219


>gi|403363486|gb|EJY81487.1| TPR repeat-containing protein [Oxytricha trifallax]
          Length = 746

 Score = 41.2 bits (95), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 43/163 (26%), Positives = 68/163 (41%), Gaps = 38/163 (23%)

Query: 56  LKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDL---YKKCGRLDEQIELL 112
             +M VV  +Q   D+A E   +   LC++   ++ + +LI+L   ++K   +D  IEL 
Sbjct: 601 FNEMGVVYYRQKNYDQARECFANALSLCNESNSKTYETILINLAHCHRKLKDMDSAIELY 660

Query: 113 KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVY 172
           ++ L +       N K   T  S                   LG+AY  KG Y  A   Y
Sbjct: 661 EKCLTL-------NPKSPSTYTS-------------------LGYAYHLKGEYRQALTCY 694

Query: 173 RKAQLI---DPDANKAC-----NLSHCLIKQARYTEARSVLED 207
            KA  +   DP   +       +++ C I+QA Y  A + L D
Sbjct: 695 HKASFLKNEDPLTEELVQRALQDINECTIEQA-YLGAGNELGD 736


>gi|406912813|gb|EKD52347.1| hypothetical protein ACD_62C00048G0002 [uncultured bacterium]
          Length = 390

 Score = 40.8 bits (94), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 79/206 (38%), Gaps = 28/206 (13%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AI    KA     ++  A  ++ +V  + D VDEA++    +  L      E     L
Sbjct: 91  EKAIFHLKKAGQLSPQLSDAHVNLGLVYLKMDNVDEAIKTY--YDALMIDPVNEKAYFGL 148

Query: 96  IDLYKKCGRLDEQIELLKQKLRMI-YHGEAFNGKPTKTARSHGKKFQVTVKQETSRI--- 151
              Y++ GR +E I+   + + +     +A+N      A SH   ++  +      I   
Sbjct: 149 GVAYQRMGRYEEAIDAYSEAIMISPAFSDAYNN--LGIALSHVGLYEEAITSYLIAIKIR 206

Query: 152 -------LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARS 203
                    NLG  Y   G    A   Y  A L+DP+  KA CNL H   ++  Y EA  
Sbjct: 207 PDLSEAAYNNLGITYQSVGQDDKAVEAYTNAILLDPEYTKAYCNLGHVRQQKGNYDEAMW 266

Query: 204 VLEDVL------------LGKLSGST 217
             ++ L            LG   GST
Sbjct: 267 AYQEALKIDPAFADAHYGLGNAYGST 292


>gi|357162940|ref|XP_003579571.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC-like [Brachypodium
           distachyon]
          Length = 983

 Score = 40.8 bits (94), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 49/226 (21%), Positives = 88/226 (38%), Gaps = 36/226 (15%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AI  + +A+ A      A  ++A +  +Q ++D ++    S   LC  +  E+ +N +
Sbjct: 280 EEAIACYQRALQARPDYAMAYGNLATIYYEQRQLDMSIHCY-SQAILCDPRFVEAHNN-M 337

Query: 96  IDLYKKCGRLDEQIELLKQKL--------RMIYHGEAFNGKPTKTARSHGKKFQVTVKQE 147
            +  K  GR++E I   +  L         +   G  +      +A +   K  + V   
Sbjct: 338 GNALKDAGRVEEAINCFQSCLILQANHPQALTNLGNIYMEWNMASAAASFYKAAIAVTSG 397

Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLED 207
            +    NL   Y Q+G+Y  A   Y +   IDP A                        D
Sbjct: 398 LTSPFNNLAVIYKQQGSYADAIACYTEVLRIDPTA-----------------------AD 434

Query: 208 VLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
            L+ +  G+T  K   RV E +Q+     +I P++    ++L  A+
Sbjct: 435 ALVNR--GNT-FKEFGRVAEAIQDYMQAATIMPTMAEAHANLASAY 477


>gi|239787366|emb|CAX83842.1| TPR repeat protein [uncultured bacterium]
          Length = 1036

 Score = 40.8 bits (94), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 47/187 (25%), Positives = 85/187 (45%), Gaps = 26/187 (13%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           EA++ L +KA+    R     +++   L++  R+DEAV A K       + AQ+S  N+L
Sbjct: 66  EASLRLLYKALEIDPRQPGMWQNLGETLRRLKRLDEAVGAFK-------RAAQDS--NIL 116

Query: 96  ID-------LYKKCGRLDEQIE----LLKQKLRMIYHGEAFNGKPTKTARSHGKKFQV-- 142
           +          K+ G+LDE +     LL ++   +    AF+       +    +  +  
Sbjct: 117 LAAVSGLALALKEAGQLDEALSAMETLLAKEPGNVAAWAAFSDVLFDAGQDDAAQLALER 176

Query: 143 TVKQETSRILGNLGWAY--MQKGNYTSAEVVYRKA--QLIDPDANKACNLSHCLIKQARY 198
            ++ + +    +L  A+   Q+G Y +AE +YR    Q  D  A+    L+  LI Q R 
Sbjct: 177 AIRLKPTSAGHHLRMAFHQSQQGQYQTAEAIYRALLRQGGDSLASVHTGLAQALISQGRL 236

Query: 199 TEARSVL 205
            EA+ ++
Sbjct: 237 EEAQPLI 243


>gi|15229778|ref|NP_187761.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY [Arabidopsis
           thaliana]
 gi|75332921|sp|Q96301.1|SPY_ARATH RecName: Full=Probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY
 gi|12322895|gb|AAG51433.1|AC008153_6 spindly (gibberellin signal transduction protein); 75377-80082
           [Arabidopsis thaliana]
 gi|1589778|gb|AAC49446.1| SPINDLY [Arabidopsis thaliana]
 gi|62319977|dbj|BAD94086.1| spindly [Arabidopsis thaliana]
 gi|332641539|gb|AEE75060.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY [Arabidopsis
           thaliana]
          Length = 914

 Score = 40.8 bits (94), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 62/151 (41%), Gaps = 30/151 (19%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           + AIV +  A +       A  ++ V+ K +D +D+AVE  +    +    AQ SL+N+ 
Sbjct: 312 DMAIVFYELAFHFNPHCAEACNNLGVLYKDRDNLDKAVECYQMALSIKPNFAQ-SLNNLG 370

Query: 96  IDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
           + +Y   G++D    ++ K  L    + EAFN                           N
Sbjct: 371 V-VYTVQGKMDAAASMIEKAILANPTYAEAFN---------------------------N 402

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           LG  Y   GN T A   Y +   IDPD+  A
Sbjct: 403 LGVLYRDAGNITMAIDAYEECLKIDPDSRNA 433


>gi|83311235|ref|YP_421499.1| TPR repeat-containing protein [Magnetospirillum magneticum AMB-1]
 gi|82946076|dbj|BAE50940.1| TPR repeat [Magnetospirillum magneticum AMB-1]
          Length = 548

 Score = 40.8 bits (94), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 35/153 (22%), Positives = 69/153 (45%), Gaps = 11/153 (7%)

Query: 58  DMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLR 117
           ++ + L+   R+DEA   ++  R +    +  +  NVL D+ +  GRLDE +  L++ ++
Sbjct: 114 NLGLALRGLGRLDEAAAVLERCRDM--DPSVPAPWNVLGDIRRAQGRLDEAVTCLERAIQ 171

Query: 118 M-IYHGEAFNGKPTKTARSHGKKFQVTV-------KQETSRILGNLGWAYMQKGNYTSAE 169
           +   H  A+N            +  ++V       + +   +  +LG +  +      A 
Sbjct: 172 LDPGHSMAYNNLGVALQAQDRTEEAISVLERGLALQPDDPELHYSLGCSLQEATRTADAT 231

Query: 170 VVYRKAQLIDPDANKA-CNLSHCLIKQARYTEA 201
             YR+A  + PD   A  NLSH L++   ++E 
Sbjct: 232 ASYRRAIALRPDYGAAHWNLSHSLLRGGDFSEG 264


>gi|297829706|ref|XP_002882735.1| hypothetical protein ARALYDRAFT_478494 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328575|gb|EFH58994.1| hypothetical protein ARALYDRAFT_478494 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 897

 Score = 40.8 bits (94), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 62/151 (41%), Gaps = 30/151 (19%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           + AIV +  A +       A  ++ V+ K +D +D+AVE  +    +    AQ SL+N+ 
Sbjct: 295 DMAIVFYELAFHFNPHCAEACNNLGVLYKDRDNLDKAVECYQMALSIKPNFAQ-SLNNLG 353

Query: 96  IDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
           + +Y   G++D    ++ K  L    + EAFN                           N
Sbjct: 354 V-VYTVQGKMDAAASMIEKAILANPTYAEAFN---------------------------N 385

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           LG  Y   GN T A   Y +   IDPD+  A
Sbjct: 386 LGVLYRDAGNITMAIDAYEECLKIDPDSRNA 416


>gi|401408423|ref|XP_003883660.1| hypothetical protein NCLIV_034150 [Neospora caninum Liverpool]
 gi|325118077|emb|CBZ53628.1| hypothetical protein NCLIV_034150 [Neospora caninum Liverpool]
          Length = 987

 Score = 40.8 bits (94), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 38/169 (22%), Positives = 67/169 (39%), Gaps = 17/169 (10%)

Query: 34  DPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDN 93
           DP  A+ ++ +A+    R   A  +M  V K   ++++A+ A       C+   Q SL N
Sbjct: 307 DPHTALQMYREAVRINPRYVEAYNNMGAVCKNLGKLEDAI-AFYEKALACNPNYQLSLSN 365

Query: 94  VLIDLYKKCGRLDEQIELLKQKLRMI--------YHGEAFNGKPTKTARSH-------GK 138
           + + L    G   +  E  K+ + +         Y+ +A+       A SH         
Sbjct: 366 MAVAL-TDLGTQQKTFEGAKKAISLYKKALIYNPYYSDAYYNLGVAYADSHKFDKALVNY 424

Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
           +  V      +    N+G  +  + N   A V Y KA  I+PD ++  N
Sbjct: 425 QLAVAFNPRCAEAYNNMGVIHKDRENTDQAIVCYNKALEINPDFSQTLN 473


>gi|254468929|ref|ZP_05082335.1| hypothetical protein KB13_1154 [beta proteobacterium KB13]
 gi|207087739|gb|EDZ65022.1| hypothetical protein KB13_1154 [beta proteobacterium KB13]
          Length = 564

 Score = 40.8 bits (94), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 40/84 (47%)

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEA 201
           ++++ +   IL ++GW + + GNY  A     KA  I  D   A +L   L KQ +  EA
Sbjct: 479 LSIEPQNHYILDSMGWVHFRLGNYDIAYEFITKAYAIQEDPEIAAHLGEVLWKQGKENEA 538

Query: 202 RSVLEDVLLGKLSGSTETKTINRV 225
            SV +  L      S   +T NR+
Sbjct: 539 MSVWQGSLEKFPENSVLIETKNRL 562


>gi|325954652|ref|YP_004238312.1| hypothetical protein [Weeksella virosa DSM 16922]
 gi|323437270|gb|ADX67734.1| Tetratricopeptide TPR_2 repeat-containing protein [Weeksella virosa
           DSM 16922]
          Length = 286

 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 32/52 (61%), Gaps = 4/52 (7%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEA 201
           ILGN  +   +KGNY  AE  Y+KA   +P++ KA  NL + L +Q RY E+
Sbjct: 31  ILGNEAF---EKGNYEQAEFRYKKAATEEPNSVKANYNLGNTLFEQKRYKES 79


>gi|15229253|ref|NP_187074.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC [Arabidopsis
           thaliana]
 gi|75336082|sp|Q9M8Y0.1|SEC_ARATH RecName: Full=Probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC; AltName:
           Full=Protein SECRET AGENT
 gi|6721161|gb|AAF26789.1|AC016829_13 putative O-linked GlcNAc transferase [Arabidopsis thaliana]
 gi|18139887|gb|AAL60196.1|AF441079_1 O-linked N-acetyl glucosamine transferase [Arabidopsis thaliana]
 gi|20259324|gb|AAM13988.1| putative O-linked GlcNAc transferase [Arabidopsis thaliana]
 gi|21436429|gb|AAM51415.1| putative O-linked GlcNAc transferase [Arabidopsis thaliana]
 gi|110742062|dbj|BAE98963.1| O-linked GlcNAc transferase like protein [Arabidopsis thaliana]
 gi|332640535|gb|AEE74056.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC [Arabidopsis
           thaliana]
          Length = 977

 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 57/251 (22%), Positives = 93/251 (37%), Gaps = 46/251 (18%)

Query: 16  PPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA 75
           P     Y+   +V      P  AI+ +  A+        A  ++A +  +Q ++D A   
Sbjct: 254 PAFPDAYLNLGNVYKALGRPTEAIMCYQHALQMRPNSAMAFGNIASIYYEQGQLDLA--- 310

Query: 76  IKSFRHLCSK-----QAQESLDNVLIDLYKKCGRLDEQIELLKQKLR--------MIYHG 122
           I+ ++   S+     +A  +L N L D+    GR+DE +    Q L         M   G
Sbjct: 311 IRHYKQALSRDPRFLEAYNNLGNALKDI----GRVDEAVRCYNQCLALQPNHPQAMANLG 366

Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
             +         S   K  + V    S    NL   Y Q+GNY+ A   Y +   IDP A
Sbjct: 367 NIYMEWNMMGPASSLFKATLAVTTGLSAPFNNLAIIYKQQGNYSDAISCYNEVLRIDPLA 426

Query: 183 NKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSL 242
                                   D L+ +  G+T  K I RV E +Q+     +  P++
Sbjct: 427 -----------------------ADALVNR--GNT-YKEIGRVTEAIQDYMHAINFRPTM 460

Query: 243 TTKKSSLEDAF 253
               ++L  A+
Sbjct: 461 AEAHANLASAY 471


>gi|225620550|ref|YP_002721807.1| TPR domain-containing protein [Brachyspira hyodysenteriae WA1]
 gi|225215369|gb|ACN84103.1| TPR domain-containing protein [Brachyspira hyodysenteriae WA1]
          Length = 767

 Score = 40.8 bits (94), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 69/173 (39%), Gaps = 38/173 (21%)

Query: 32  EKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESL 91
           +KD   AI  F KAI    + D A  ++A+   +  + +EA+E    F H       +SL
Sbjct: 121 KKDYTNAIENFNKAIEINPKYDKAFNNLALFYYKNKKYNEAIEF---FEH------SKSL 171

Query: 92  DNVLIDLYKKCG-------RLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHG------- 137
           D  +   Y   G         D+ IE   + L+       +NGK  KTA + G       
Sbjct: 172 DERVFKAYDMLGMSYYNINNYDKAIECFTKFLQ-------YNGKSCKTANTLGAVYSFLK 224

Query: 138 ------KKFQVT--VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
                 K F +   +  + +    NL   Y  +  +  A + + KA+ +D +A
Sbjct: 225 DYDNAIKYFNIAIDINPKYANAYNNLALVYFNRKMFDKAALYFDKAKKLDVNA 277


>gi|340376830|ref|XP_003386934.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like
           [Amphimedon queenslandica]
          Length = 1029

 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 68/153 (44%), Gaps = 10/153 (6%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AI  + +AI        A  ++A  LK+Q +V EA E       + S  A +SL+N L
Sbjct: 254 ELAIDTYKRAIELQPHFPDAYCNLANALKEQGKVAEAEECYNIALKMNSSHA-DSLNN-L 311

Query: 96  IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVK-QETSRILGN 154
            ++ ++ G +DE I+L K+ L ++    A +       +  GK     +  +E  RI  N
Sbjct: 312 ANIKREQGHIDEAIKLYKRALEIMPEFAAAHSNLASILQMQGKLQDALLHYKEAIRIHPN 371

Query: 155 LGWAYMQKGN-------YTSAEVVYRKAQLIDP 180
              AY   GN       +  A   Y +A  I+P
Sbjct: 372 FADAYSNMGNTLKEMQDFQGALQCYSRAIQINP 404


>gi|39998281|ref|NP_954232.1| hypothetical protein GSU3191 [Geobacter sulfurreducens PCA]
 gi|409913632|ref|YP_006892097.1| hypothetical protein KN400_3128 [Geobacter sulfurreducens KN400]
 gi|39985227|gb|AAR36582.1| TPR domain protein [Geobacter sulfurreducens PCA]
 gi|298507217|gb|ADI85940.1| TPR domain protein [Geobacter sulfurreducens KN400]
          Length = 638

 Score = 40.8 bits (94), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 42/154 (27%), Positives = 69/154 (44%), Gaps = 31/154 (20%)

Query: 55  ALKDMAVVLKQQDRVDEAVEAIKSFRHLCS-KQAQESLDNVLIDLYKKCGRLDEQIELLK 113
           A +++A V ++ +++D+A    K +R +   K+      N+L  +Y K  + DE + LL+
Sbjct: 437 AHRELAAVYRKLNQMDDAS---KQYREVLRIKKDDAEARNILTAIYVKEKKYDELVPLLQ 493

Query: 114 QKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYR 173
           + + +              A SH K                LG  +  + +Y SAEV YR
Sbjct: 494 EGVEL----------APNDAMSHYK----------------LGLIHEFRKDYDSAEVSYR 527

Query: 174 KAQLIDPDANKACN-LSHCLIKQARYTEARSVLE 206
           KA  +  D  KA N L    +K  + TEA+  LE
Sbjct: 528 KATELKDDHAKALNALGRIYLKTGKLTEAKEALE 561


>gi|302878311|ref|YP_003846875.1| hypothetical protein Galf_1081 [Gallionella capsiferriformans ES-2]
 gi|302581100|gb|ADL55111.1| Tetratricopeptide TPR_1 repeat-containing protein [Gallionella
           capsiferriformans ES-2]
          Length = 961

 Score = 40.8 bits (94), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 31/58 (53%), Gaps = 1/58 (1%)

Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDVLL 210
           NLG    + G  T AE  +R+A    PD  +A CNL + L+  AR +EA S    VLL
Sbjct: 413 NLGIVQQELGRLTDAEASFRQALQFSPDLLEAHCNLGNVLLGAARLSEAESCYRHVLL 470



 Score = 37.4 bits (85), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 44/171 (25%), Positives = 73/171 (42%), Gaps = 33/171 (19%)

Query: 34  DPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDN 93
           D E A   + +A+       +AL ++A +L+   R+DEA    ++      K + +S D 
Sbjct: 253 DREEAEQCYRRALQIKPGYGAALSNLANLLQMLGRLDEAAACCRTIL----KSSPDSADV 308

Query: 94  V--LIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI 151
           +  L ++ K+ G+L E     +  LR       FN                    ++ +I
Sbjct: 309 LFNLANILKRLGQLAEAEASYRVALR-------FN-------------------PDSVQI 342

Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEA 201
            GNLG    + G +  AE  +R+A  I+PD  +A CNL     +  R  EA
Sbjct: 343 HGNLGITLKELGRFEEAESSFRQALRINPDYAQAHCNLGVMFKELDRLDEA 393


>gi|312282561|dbj|BAJ34146.1| unnamed protein product [Thellungiella halophila]
          Length = 762

 Score = 40.8 bits (94), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 62/151 (41%), Gaps = 30/151 (19%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           + AIV +  A +       A  ++ V+ K +D +D+AVE  +    +    AQ SL+N+ 
Sbjct: 174 DMAIVFYELAFHFNPHCAEACNNLGVLYKDRDNLDKAVECYQMALSIKPNFAQ-SLNNLG 232

Query: 96  IDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
           + +Y   G++D    ++ K  L    + EAFN                           N
Sbjct: 233 V-VYTVQGKMDAAASMIEKAILANPTYAEAFN---------------------------N 264

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           LG  Y   GN T A   Y +   IDPD+  A
Sbjct: 265 LGVLYRDAGNITMAIDAYEECLKIDPDSRNA 295


>gi|400601016|gb|EJP68684.1| tetratricopeptide-like protein [Beauveria bassiana ARSEF 2860]
          Length = 626

 Score = 40.8 bits (94), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 28/51 (54%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVL 205
           L   +   GNYT A     K  LI+ +A+      HCLIKQ+R+ EA +VL
Sbjct: 46  LAQVHFATGNYTRAHAFLAKQDLINRNASCRYLAGHCLIKQSRFEEALAVL 96


>gi|373469271|ref|ZP_09560471.1| tetratricopeptide repeat protein, partial [Lachnospiraceae
           bacterium oral taxon 082 str. F0431]
 gi|371764715|gb|EHO53097.1| tetratricopeptide repeat protein, partial [Lachnospiraceae
           bacterium oral taxon 082 str. F0431]
          Length = 174

 Score = 40.8 bits (94), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 41/79 (51%), Gaps = 6/79 (7%)

Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDP---DANKACNLSHCLIKQARYTEARSVLEDVL- 209
           +LG  YM + +Y SA V   KA  +DP   DA K   L+    K  R  +AR+ LE VL 
Sbjct: 20  DLGSKYMAELDYESAIVALNKAIKLDPKNVDAYKM--LAEVYEKSGRLEDARATLEKVLD 77

Query: 210 LGKLSGSTETKTINRVKEL 228
           L  LS   E +  NR+K L
Sbjct: 78  LDDLSSDNEDEINNRIKNL 96


>gi|431926271|ref|YP_007239305.1| Flp pilus assembly protein TadD [Pseudomonas stutzeri RCH2]
 gi|431824558|gb|AGA85675.1| Flp pilus assembly protein TadD [Pseudomonas stutzeri RCH2]
          Length = 575

 Score = 40.4 bits (93), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 28/54 (51%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSV 204
           IL +LGW   + GN T AE + RKA    PD   A +L   L      +EAR+V
Sbjct: 495 ILDSLGWVNFRLGNLTEAEALLRKALQRFPDHEVAAHLGEVLWTMGERSEARAV 548


>gi|159028254|emb|CAO88064.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
          Length = 939

 Score = 40.4 bits (93), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 72/174 (41%), Gaps = 35/174 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQAQE---SLDNVLIDLYKKCGR 104
           V ++L ++A++ + Q R  EA    +EA+   + L      +   SL+N L +LY+  GR
Sbjct: 709 VATSLNNLALLYESQGRYTEAEPLYLEALDLTKQLLGDNHPDVATSLNN-LAELYRSQGR 767

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             E   L  Q L +       N     T+                  L NL   Y  +G 
Sbjct: 768 YTEAEPLYLQALDLYKQLLGDNHPYVATS------------------LNNLAALYYSQGR 809

Query: 165 YTSAEVVY------RKAQLID--PD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
           YT AE +Y      RK  L D  PD A+   NL+     Q RYTEA  +  + L
Sbjct: 810 YTEAEPLYLQALDLRKRLLGDNHPDVASSLNNLAELYYSQGRYTEAEPLYLEAL 863


>gi|330502031|ref|YP_004378900.1| hypothetical protein [Pseudomonas mendocina NK-01]
 gi|328916317|gb|AEB57148.1| TPR repeat-containing protein [Pseudomonas mendocina NK-01]
          Length = 569

 Score = 40.4 bits (93), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 30/59 (50%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
           IL +LGW   + GN   AE + R+A    PD   A +L   L  Q +  EARSV  D L
Sbjct: 489 ILDSLGWVNYRLGNLDEAERLLRQALEKFPDHEVAAHLGEVLWAQGKQREARSVWRDAL 547


>gi|407716992|ref|YP_006838272.1| hypothetical protein Q91_1735 [Cycloclasticus sp. P1]
 gi|407257328|gb|AFT67769.1| TPR domain protein [Cycloclasticus sp. P1]
          Length = 572

 Score = 40.4 bits (93), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 45/89 (50%), Gaps = 1/89 (1%)

Query: 122 GEAFNGKPTKTARSHGKKFQ-VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
           G A   K  +   + G   Q +++K +   IL ++GW   ++G+Y SA V+ ++A    P
Sbjct: 467 GYALASKTFRFKEARGYLTQALSLKPDDPAILDSMGWLDYREGHYESALVLLKRAYSQTP 526

Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVL 209
           +   A +L   L    R  EAR++ E+ L
Sbjct: 527 EGEIAAHLGQTLWMLGRQKEARTIWEEAL 555


>gi|326498109|dbj|BAJ94917.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 986

 Score = 40.4 bits (93), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 51/225 (22%), Positives = 88/225 (39%), Gaps = 38/225 (16%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           A+  + +A+ A      A  ++A +  +Q ++D A+        +C  +  E+ +N + +
Sbjct: 285 AVACYQRALQARPDYAMAYGNLATIYYEQRQLDMAIHCYNQ-AIICDSRFVEAYNN-MGN 342

Query: 98  LYKKCGRLDEQIELLKQKLRM-IYHGEAFNG--------KPTKTARSHGKKFQVTVKQET 148
             K  GR++E I   +  L +   H +A               TA S  K   + V    
Sbjct: 343 ALKDAGRVEEAINCFQSCLVLQANHPQALTNLGNIYMEWNMISTAASFYKA-AIAVTSGL 401

Query: 149 SRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDV 208
           S  L NL   Y Q+G+Y  A   Y +   IDP A                        D 
Sbjct: 402 SSPLNNLAVIYKQQGSYADAIACYTEVLRIDPTA-----------------------ADA 438

Query: 209 LLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
           L+ +  G+T  K   RV E +Q+     +I P++    ++L  A+
Sbjct: 439 LVNR--GNT-FKEFGRVAEAIQDYIQAVTIRPNMAEAHANLASAY 480


>gi|146184699|ref|XP_001029949.2| SLEI family protein [Tetrahymena thermophila]
 gi|146143045|gb|EAR82286.2| SLEI family protein [Tetrahymena thermophila SB210]
          Length = 2342

 Score = 40.4 bits (93), Expect = 0.86,   Method: Composition-based stats.
 Identities = 37/183 (20%), Positives = 76/183 (41%), Gaps = 16/183 (8%)

Query: 11  VVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVD 70
           +V   P  +  +     + L +K  E AI  + K +    +   AL ++ +  + Q   D
Sbjct: 443 IVQLSPNSEYDFFSLGELYLTKKIYEEAIKCYKKTLEINPQYIKALNNLGLAYEYQQMFD 502

Query: 71  EAVEAIK-----------SFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMI 119
           +A+E  K           ++ +     A + + +  I+ YKK   ++ Q       +  +
Sbjct: 503 QAIECYKKAIEIDPNYHLAYYNCGISYASKKMVDEAIECYKKVLEINPQYLNASTNMGYL 562

Query: 120 YHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLID 179
           Y  +    K  +  +S      + V + + +IL NLG+AY +   +  A  +Y++   ID
Sbjct: 563 YSQQKMYDKAIECYQS-----ALQVNENSLKILNNLGYAYYKSNMHDQAIEIYKRVIQID 617

Query: 180 PDA 182
           P +
Sbjct: 618 PKS 620


>gi|359799757|ref|ZP_09302310.1| tetratricopeptide repeat family protein 11 [Achromobacter
           arsenitoxydans SY8]
 gi|359362183|gb|EHK63927.1| tetratricopeptide repeat family protein 11 [Achromobacter
           arsenitoxydans SY8]
          Length = 628

 Score = 40.4 bits (93), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 40/160 (25%), Positives = 73/160 (45%), Gaps = 13/160 (8%)

Query: 60  AVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM- 118
           A +L+  DRV +AV  +++         +   +  L  LY++  RL +   +L+Q + + 
Sbjct: 452 AQILRDADRVGQAVSVLEAADQALPDTVEIKYE--LAMLYERQNRLADLERMLRQVIALD 509

Query: 119 IYHGEAFNGKPTKTARSHGKKFQ-----VTVKQETSR----ILGNLGWAYMQKGNYTSAE 169
             H  A+N      A  H ++       +T   E S     IL ++GW   + G+ T+A 
Sbjct: 510 PDHAHAYNALGYTLA-DHNQRLPEALDLITQALELSPNDPFILDSMGWVKFRMGDTTAAA 568

Query: 170 VVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
              R+A  + P+A+ A +L+  L  Q +  +A  +L   L
Sbjct: 569 DYLRRAYSVRPEADIAAHLAEVLWSQGKRDQATDLLRAAL 608


>gi|186476952|ref|YP_001858422.1| TPR repeat-containing protein [Burkholderia phymatum STM815]
 gi|184193411|gb|ACC71376.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia phymatum
           STM815]
          Length = 556

 Score = 40.4 bits (93), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 37/131 (28%), Positives = 63/131 (48%), Gaps = 17/131 (12%)

Query: 87  AQESLDNVLIDLYKKCGRLDEQIELLKQKLRM-------IYH-GEAF--NGKPTKTARSH 136
           A+ +L  +L+D      RLDE + + ++ L +       +Y+ G A   + +P + A + 
Sbjct: 141 ARNNLGTLLMD----ESRLDESVAMFRETLEIDGTHIHALYNLGMALLRSVRPAEAATAF 196

Query: 137 GKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQ 195
            +   + V+ +    L NLG A    G+Y  AE VYR+   + PD A+   NL+  L+ Q
Sbjct: 197 RQA--LAVQPDHRDALHNLGTALKLNGHYDEAEAVYRQTLALAPDFADVQWNLAVLLLTQ 254

Query: 196 ARYTEARSVLE 206
            R  E   + E
Sbjct: 255 GRLAEGWPLAE 265


>gi|157129230|ref|XP_001661650.1| kinesin light chain 1 and [Aedes aegypti]
 gi|108872282|gb|EAT36507.1| AAEL011410-PA [Aedes aegypti]
          Length = 472

 Score = 40.4 bits (93), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 43/175 (24%), Positives = 72/175 (41%), Gaps = 35/175 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEAVEAIKSFRHL-------CSKQAQESLDNVLIDLYKKCGR 104
           V + L  + +V + Q+ + EA++ +     +       C      +L+N+ + LY K G 
Sbjct: 234 VATMLSILTMVYRDQNNLPEAIKHMNEALGIWVRCLGECHPSVAAALNNLAV-LYGKNGN 292

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             E   L K+         A   +     R H          + ++ L NL      +G 
Sbjct: 293 YKEAESLCKR---------ALANRENVLGRYH---------PDVAKQLNNLALLCQNQGK 334

Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVLL 210
           +   E+  R+A  I        DP+A K  CNL+ C +K  +Y EA  +L DVL+
Sbjct: 335 HGEVELYIRRALEIFESQLGANDPNAIKTKCNLAACCLKLRKYKEADQLLRDVLV 389


>gi|429859839|gb|ELA34601.1| anaphase-promoting complex subunit cut9 [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 622

 Score = 40.4 bits (93), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 29/53 (54%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLED 207
           L   +   GNYT A+    K  L+  + +     +HCL+KQ+R+ EA SVL D
Sbjct: 41  LAQVHFATGNYTRAQTFLAKQDLVSRNVSCRYLAAHCLVKQSRFEEALSVLGD 93


>gi|388259165|ref|ZP_10136339.1| hypothetical protein O59_003560 [Cellvibrio sp. BR]
 gi|387937123|gb|EIK43680.1| hypothetical protein O59_003560 [Cellvibrio sp. BR]
          Length = 579

 Score = 40.4 bits (93), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 46/194 (23%), Positives = 77/194 (39%), Gaps = 46/194 (23%)

Query: 35  PEAAIVLFWKAI--NAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLD 92
           P +  +LF +A+  N   R+ +  +D+  +LKQ     + V A+ S  ++ + + Q    
Sbjct: 427 PSSTRLLFARAMLNNERQRLPATERDLRQILKQDS---DNVAALNSLGYILADRTQ---- 479

Query: 93  NVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRIL 152
                      R  E  ELL++ LR+         KP   A                 I+
Sbjct: 480 -----------RYSEAYELLEKALRL---------KPDDAA-----------------II 502

Query: 153 GNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGK 212
            ++GW Y + G Y  A    R+A    P+A    +L   L       EAR V  +    +
Sbjct: 503 DSMGWLYYRTGKYPEALSHLRQAYEASPNAEIGAHLGEVLWVIGETEEARRVWREAQQLE 562

Query: 213 LSGSTETKTINRVK 226
              +T  +T+NR+K
Sbjct: 563 PDNATIRETLNRLK 576


>gi|196232945|ref|ZP_03131794.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
 gi|196222923|gb|EDY17444.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
          Length = 808

 Score = 40.4 bits (93), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 63/151 (41%), Gaps = 10/151 (6%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AIV   +AI        A  ++ V  K++ R DEAV A +    L    A+    N L +
Sbjct: 235 AIVAQRRAIELRPDFAEAYNNLGVACKERGRTDEAVAAFRRSIELQPDFAETH--NNLGN 292

Query: 98  LYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVT--------VKQETS 149
             ++CGR DE I   ++ L +     A       T RS G   + T        ++   S
Sbjct: 293 TLQECGRADEAIAAYRRALALQPEYAAAANNLASTLRSVGLLDEATAAIRRALELQPALS 352

Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
            I  NLG     +G+   A   YR+A  ++P
Sbjct: 353 DIRNNLGNILKDQGDVEGAIGAYREALQLEP 383


>gi|320590030|gb|EFX02475.1| cell division protein [Grosmannia clavigera kw1407]
          Length = 936

 Score = 40.4 bits (93), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 31/51 (60%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVL 205
           L   +   GNYT A+ +  K+ LI  +A+     +HCLIKQ+R+ +A +VL
Sbjct: 41  LAQVHFAMGNYTRAQALLSKSNLIARNASCRYLAAHCLIKQSRFDDALAVL 91


>gi|157812772|gb|ABV81131.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase [Thulinius stephaniae]
          Length = 289

 Score = 40.4 bits (93), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 38/160 (23%), Positives = 63/160 (39%), Gaps = 37/160 (23%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV----EAIKSFRHLCSKQAQESLDN 93
           A+ L++KA+       +A  ++A VL+QQ ++ E++    EAI+             +  
Sbjct: 73  AVRLYYKALEVLPEFAAAHSNLASVLQQQGKLQESIAHYKEAIR-------------ISP 119

Query: 94  VLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILG 153
              D Y   G   +++                 G      R + +   +T+    +    
Sbjct: 120 TFADAYSNMGNTLKEM-----------------GDXQGAIRCYSRA--ITINPAFADAHS 160

Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCL 192
           NL   +   GN T A   YR A  + PD   A CNL+HCL
Sbjct: 161 NLASIHKDTGNITEAVASYRTALKLKPDFPDAFCNLAHCL 200


>gi|423014093|ref|ZP_17004814.1| tetratricopeptide repeat family protein 11 [Achromobacter
           xylosoxidans AXX-A]
 gi|338783024|gb|EGP47393.1| tetratricopeptide repeat family protein 11 [Achromobacter
           xylosoxidans AXX-A]
          Length = 628

 Score = 40.4 bits (93), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 42/176 (23%), Positives = 78/176 (44%), Gaps = 13/176 (7%)

Query: 60  AVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM- 118
           A +L+  DRV +AV  +++         +   +  L  LY++  RL +   +L+Q + + 
Sbjct: 452 AQILRDADRVGQAVSVLEAADQALPDTVEIKYE--LAMLYERQDRLADLERMLRQVIALD 509

Query: 119 IYHGEAFNGKPTKTARSHGKKFQ-----VTVKQETSR----ILGNLGWAYMQKGNYTSAE 169
             H  A+N      A  H ++       +T   E S     IL ++GW   + G+  +A 
Sbjct: 510 PEHAHAYNALGYTLA-DHNQRLPEALDLITQALELSPNDPFILDSMGWVKFRMGDQVAAA 568

Query: 170 VVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRV 225
              R+A  + P+A+ A +L+  L  Q +  +A  +L   LL      T  + + R+
Sbjct: 569 DYLRRAYSVRPEADIAAHLAEVLWTQGKRDQAVELLRAALLKDPKNKTVLEVVKRL 624


>gi|404498065|ref|YP_006722171.1| TPR domain-containing protein [Geobacter metallireducens GS-15]
 gi|78195662|gb|ABB33429.1| TPR domain protein [Geobacter metallireducens GS-15]
          Length = 643

 Score = 40.4 bits (93), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 46/178 (25%), Positives = 76/178 (42%), Gaps = 29/178 (16%)

Query: 30  LVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE 89
           L  K+P+ A+  +  A+        A +++A + ++ ++ +EA +  K    +    A+ 
Sbjct: 414 LGAKNPKEAMASYQDALKIAPDNLEARRELAALYRKGNQNEEAEKQYKEILRIKKDDAEA 473

Query: 90  SLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETS 149
              N+L  LY K  + DE I LLK  + +          P   A SH K           
Sbjct: 474 R--NILTALYVKSKKYDELITLLKDGVELA---------PNDPA-SHYK----------- 510

Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIKQARYTEARSVLE 206
                LG  Y  K +Y +AE  Y+KA  +  D  K+ N L    +K  + +EA+  LE
Sbjct: 511 -----LGLIYEFKKDYDAAEQEYKKAVELKGDHAKSLNALGRIYLKTGKISEAKGALE 563


>gi|300864445|ref|ZP_07109316.1| tetratricopeptide TPR_2 [Oscillatoria sp. PCC 6506]
 gi|300337589|emb|CBN54464.1| tetratricopeptide TPR_2 [Oscillatoria sp. PCC 6506]
          Length = 1129

 Score = 40.4 bits (93), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 35/126 (27%), Positives = 59/126 (46%), Gaps = 11/126 (8%)

Query: 95  LIDLYKKCGRLDEQIELLKQKLRMIYH--GEAFNGKPTKTARSHGKK------FQ--VTV 144
           L  +++K G+  E I   ++   +  H  G  F+     T  + GK+      +Q  + V
Sbjct: 425 LGKVFQKQGKSAEAIAYFQKTSDINPHVVGADFHFNLANTLLTEGKRDEAIQSYQRAIAV 484

Query: 145 KQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARS 203
           K + +    N+G A MQ+GN   A   YRKA  I P       N+++ L+ Q +Y EA S
Sbjct: 485 KPDWAEAYANIGSARMQQGNLEEAIQYYRKAIAIKPQLEALHFNIANALLHQGKYEEAIS 544

Query: 204 VLEDVL 209
             ++ +
Sbjct: 545 NYQEAI 550


>gi|383770004|ref|YP_005449067.1| hypothetical protein S23_17390 [Bradyrhizobium sp. S23321]
 gi|381358125|dbj|BAL74955.1| hypothetical protein S23_17390 [Bradyrhizobium sp. S23321]
          Length = 1062

 Score = 40.4 bits (93), Expect = 1.0,   Method: Composition-based stats.
 Identities = 52/191 (27%), Positives = 80/191 (41%), Gaps = 34/191 (17%)

Query: 36  EAAIVLFWKAINAGD-RVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQES--LD 92
           E A+ L   A  A D R+   L  +  + +  +   EA   ++  R L + QA  +  LD
Sbjct: 134 ERALALGTSAFGANDGRLAGTLSALGGLARDNENFKEAAGYLE--RALSALQAANAPPLD 191

Query: 93  -----NVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQE 147
                + L D+Y   GR D+   LLKQ + ++              RS GK  + T   +
Sbjct: 192 VASAMDDLGDVYGLLGRFDDGERLLKQGIELL-------------DRSFGKNAETTPNYD 238

Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLI------DPDANKAC---NLSHCLIKQARY 198
             ++L +LG  Y+  G    AE   R+A  I      D   N A    NL+  L  +AR+
Sbjct: 239 --KMLNDLGNLYLDAGRLPDAEAAMRRALAITRARNGDAHPNVAATMGNLATVLKHEARH 296

Query: 199 TEARSVLEDVL 209
            EA  + +  L
Sbjct: 297 AEAEKLYQQTL 307


>gi|398342815|ref|ZP_10527518.1| hypothetical protein LinasL1_07018 [Leptospira inadai serovar Lyme
           str. 10]
          Length = 689

 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 47/86 (54%), Gaps = 7/86 (8%)

Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEARSVLE 206
           ++  LGN+ +    +G+Y SAE  +R+A  + P+  KA  NLS  L K  +  EA   LE
Sbjct: 279 SNAYLGNITY---NRGDYGSAEHYFREAATLSPNDAKALYNLSVVLKKNGKMEEALKYLE 335

Query: 207 DVLLGKLSGSTETKTINRVKELLQEL 232
              + KL+G+++ +    + +  ++L
Sbjct: 336 ---MAKLAGASDPEIFRSIADSFEQL 358


>gi|373469728|ref|ZP_09560896.1| tetratricopeptide repeat protein [Lachnospiraceae bacterium oral
           taxon 082 str. F0431]
 gi|371763886|gb|EHO52335.1| tetratricopeptide repeat protein [Lachnospiraceae bacterium oral
           taxon 082 str. F0431]
          Length = 250

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 41/79 (51%), Gaps = 6/79 (7%)

Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDP---DANKACNLSHCLIKQARYTEARSVLEDVL- 209
           +LG  YM + +Y SA V   KA  IDP   DA K   L+    K  R  +AR+ LE VL 
Sbjct: 34  DLGSKYMAELDYESAIVALNKAIKIDPKNVDAYKM--LAEVYEKSGRLEDARATLEKVLD 91

Query: 210 LGKLSGSTETKTINRVKEL 228
           +  LS   E +  NR+K L
Sbjct: 92  IDDLSSDNEDEINNRIKNL 110


>gi|218246150|ref|YP_002371521.1| hypothetical protein PCC8801_1300 [Cyanothece sp. PCC 8801]
 gi|218166628|gb|ACK65365.1| TPR repeat-containing protein [Cyanothece sp. PCC 8801]
          Length = 878

 Score = 40.0 bits (92), Expect = 1.1,   Method: Composition-based stats.
 Identities = 46/179 (25%), Positives = 76/179 (42%), Gaps = 19/179 (10%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE--SLDN 93
           E AI  + KAI        A  ++ + L  Q +++EA+ A +    L    A    +L N
Sbjct: 148 EEAIAAYQKAIQLNPNFTQAYYNLGIALSDQGKLEEAIAAYQKAIQLNPNYADAYYNLGN 207

Query: 94  VLIDLYKKCGRLDEQIELLKQKLRMIYH-GEAFN---------GKPTKTARSHGKKFQVT 143
            L D     G+LDE I   ++ +++  +   A+N         GK  +   ++ K  Q+ 
Sbjct: 208 ALFDQ----GKLDEAIAAYQKAIQLDPNDANAYNNLGAALYKQGKLEEAIAAYQKAIQLN 263

Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEA 201
                +    NLG A   +G    A   Y+KA  ++P+  +A  NL   L  Q +  EA
Sbjct: 264 --PNLAEAYNNLGVALSDQGKRDEAIAAYQKAIQLNPNLAEAYNNLGVALSDQGKRDEA 320



 Score = 39.3 bits (90), Expect = 1.8,   Method: Composition-based stats.
 Identities = 48/177 (27%), Positives = 79/177 (44%), Gaps = 15/177 (8%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           + AI  + KAI        A  ++   L  Q +++EA+ A +    L    A ++  N+ 
Sbjct: 80  DEAIAAYQKAIQLNPNDADAYNNLGNALSDQGKLEEAIAAYQKAIQLNPNYA-DAYYNLG 138

Query: 96  IDLYKKCGRLDEQIELLKQKLRM------IYH--GEAFN--GKPTKTARSHGKKFQVTVK 145
           I L  + G+L+E I   ++ +++       Y+  G A +  GK  +   ++ K  Q+   
Sbjct: 139 IALSDQ-GKLEEAIAAYQKAIQLNPNFTQAYYNLGIALSDQGKLEEAIAAYQKAIQLNPN 197

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEA 201
              +    NLG A   +G    A   Y+KA  +DP DAN   NL   L KQ +  EA
Sbjct: 198 YADAYY--NLGNALFDQGKLDEAIAAYQKAIQLDPNDANAYNNLGAALYKQGKLEEA 252


>gi|405362768|ref|ZP_11025821.1| hypothetical protein A176_1960 [Chondromyces apiculatus DSM 436]
 gi|397090228|gb|EJJ21102.1| hypothetical protein A176_1960 [Myxococcus sp. (contaminant ex DSM
           436)]
          Length = 624

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 46/176 (26%), Positives = 81/176 (46%), Gaps = 26/176 (14%)

Query: 23  VRAKHVQLVEKD--PEAAIVLFWKAINAGDRVDSALKD-MAVVLKQQDRVDEAV----EA 75
           +RA+  + +E+   P  A  L  +A++AG    +AL D +A +L +Q R DEA+    +A
Sbjct: 396 LRAQFARALERGGTPGRAEALLREALDAGP--SAALYDALAAMLHRQGRGDEALRLLGDA 453

Query: 76  IKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY-HGEAFN-------- 126
           +  F         E L  VL   +++ G +   +  ++  L +   H  A N        
Sbjct: 454 VARFPR------DEDLLYVLGAAHERQGDVPGALARMRAVLAVSPDHAAALNFLGYLLAQ 507

Query: 127 -GKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
            G+    A    ++  + ++ ET   L +LGW Y ++G+YT A     +A  + PD
Sbjct: 508 AGQNLDEAERRVRR-ALELRPETGAYLDSLGWVYFRRGDYTRAVDALERASSLSPD 562


>gi|395212900|ref|ZP_10400010.1| hypothetical protein O71_04708 [Pontibacter sp. BAB1700]
 gi|394456977|gb|EJF11191.1| hypothetical protein O71_04708 [Pontibacter sp. BAB1700]
          Length = 437

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 61/143 (42%), Gaps = 13/143 (9%)

Query: 96  IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILG-- 153
           +  YKKC  L+ + E   Q++  IY  E      T T R     FQ  + ++    +   
Sbjct: 154 VKYYKKCIELNVENEAAMQEI--IYCMEI-----TGTMREELPFFQAFIDKDPYSYVAWF 206

Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEARSVLEDVLLGK 212
           NLG  Y + G+Y  A   Y  A +I PD   A  N+++  +    YT+A      +L   
Sbjct: 207 NLGNVYNKLGSYEKAIAAYDYATIIKPDFITAYNNMANAYVFIGEYTKAIEAFNGML--- 263

Query: 213 LSGSTETKTINRVKELLQELEPW 235
             GS   +    + E  ++++ W
Sbjct: 264 EHGSPSAEVFCNIGECYEKMQQW 286


>gi|346974193|gb|EGY17645.1| anaphase-promoting complex subunit cut9 [Verticillium dahliae
           VdLs.17]
          Length = 618

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 28/53 (52%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLED 207
           L   +   GNYT A+    K  LI  + +     +HCLIKQ+RY EA  +L D
Sbjct: 41  LAQVHFATGNYTRAQTFLSKQDLITRNTSCRYLAAHCLIKQSRYDEALGLLGD 93


>gi|440680764|ref|YP_007155559.1| Tetratricopeptide TPR_2 repeat-containing protein [Anabaena
           cylindrica PCC 7122]
 gi|428677883|gb|AFZ56649.1| Tetratricopeptide TPR_2 repeat-containing protein [Anabaena
           cylindrica PCC 7122]
          Length = 737

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 52/207 (25%), Positives = 83/207 (40%), Gaps = 53/207 (25%)

Query: 18  GDSPYVRAKHV---QLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVE 74
            +S ++RA  +   QL E  P+ A +L                ++A++ K + R +EA  
Sbjct: 331 AESLFLRALKIHEEQLGENHPDTATIL---------------NNLALLYKNEGRYEEAES 375

Query: 75  AIKSFRHLCSKQAQES-------LDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
                  +C +Q  E+       L+N+ I +Y+  GR +E   L  + L++    E   G
Sbjct: 376 LYLRALKICEEQLGENHPHTATILNNLAI-VYQNEGRYEEAESLFLRDLKIC--EEQLGG 432

Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVY------RKAQLIDPD 181
               TA S                L NL   Y  KG Y  AE +Y      R+ QL +  
Sbjct: 433 NHPDTANS----------------LNNLAELYRNKGRYEEAESLYLRALKIREEQLGENH 476

Query: 182 ANKACNLSHCLI---KQARYTEARSVL 205
            + A  L++  I    + RY EA S+ 
Sbjct: 477 PHTATILNNLAIVYQNEGRYEEAESLF 503


>gi|118366677|ref|XP_001016554.1| TPR Domain containing protein [Tetrahymena thermophila]
 gi|89298321|gb|EAR96309.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
          Length = 564

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 32/132 (24%), Positives = 65/132 (49%), Gaps = 8/132 (6%)

Query: 77  KSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAF-NGKPTKTARS 135
           ++FR L  +  ++ + +  I L++K  +LD++      KL     G +F   K    A +
Sbjct: 42  QAFRQLGYQFLKKQMYDDAITLFQKAIQLDDKDSWAFGKL-----GYSFLKKKMYDDAIT 96

Query: 136 HGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIK 194
             +K  + +  + S   G LG+++++K  Y  A   ++KA  +DP D++   NL +   K
Sbjct: 97  FFQK-AIQLNDKDSWAFGKLGYSFLKKKMYDDAFTFFQKAAQLDPQDSSAFANLGYLFYK 155

Query: 195 QARYTEARSVLE 206
           +  Y +A +  +
Sbjct: 156 KEMYDDAITFFQ 167


>gi|398802709|ref|ZP_10561912.1| Tfp pilus assembly protein PilF [Polaromonas sp. CF318]
 gi|398098947|gb|EJL89220.1| Tfp pilus assembly protein PilF [Polaromonas sp. CF318]
          Length = 794

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 51/186 (27%), Positives = 80/186 (43%), Gaps = 15/186 (8%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P+ A   F +A+       +A  +  + LK   R DEA  A +    L    A E+L+N 
Sbjct: 94  PDEAEAAFRRALALDPEDAAAHNNFGIFLKDNRRPDEAEAAYRRALALRPDYA-EALNN- 151

Query: 95  LIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNG---------KPTKTARSHGKKFQVTV 144
           L +L +K GRLDE     ++ L++   + EA N          +  +   S G+    T 
Sbjct: 152 LGNLLRKSGRLDEAEAAYRRALQLRPDYAEAHNNLGILLKSSWRMREAGASQGRATPSTP 211

Query: 145 KQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIK-QARYTEARS 203
               +R L +LG  + +      AE  YR+A    P+  +A N    L+K + R  EA +
Sbjct: 212 GH--ARALNDLGNRHQKNRRSIEAEAAYREALEAWPEYAEAHNNLGVLLKSEGRSPEAEA 269

Query: 204 VLEDVL 209
           V    L
Sbjct: 270 VYRRAL 275


>gi|75318847|sp|O82422.1|SPY_HORVU RecName: Full=Probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY; AltName:
           Full=HvSPY
 gi|3617837|gb|AAC36055.1| gibberellin action negative regulator SPY [Hordeum vulgare subsp.
           vulgare]
 gi|326515544|dbj|BAK07018.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 944

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 38/151 (25%), Positives = 64/151 (42%), Gaps = 30/151 (19%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AIV +  A++   R   A  ++ V+ K +D +D+AVE  +    +    +Q SL+N+ 
Sbjct: 303 EMAIVFYELALHFNPRCAEACNNLGVIYKDRDNLDKAVECYQMALSIKPNFSQ-SLNNLG 361

Query: 96  IDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
           + +Y   G++D    ++ K  L    + EA+N                           N
Sbjct: 362 V-VYTVQGKMDAAASMIEKAILANPTYAEAYN---------------------------N 393

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           LG  Y   G+ T +   Y +   IDPD+  A
Sbjct: 394 LGVLYRDAGSITLSVQAYERCLQIDPDSRNA 424


>gi|218439885|ref|YP_002378214.1| hypothetical protein PCC7424_2942 [Cyanothece sp. PCC 7424]
 gi|218172613|gb|ACK71346.1| Tetratricopeptide TPR_2 repeat protein [Cyanothece sp. PCC 7424]
          Length = 632

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 57/244 (23%), Positives = 95/244 (38%), Gaps = 27/244 (11%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AI  + KAI        A   + + L  Q +++EA+ A      +    A E   N+ 
Sbjct: 145 EEAIAAYNKAIEINPNYAFAYIGLGIALYNQGKLEEAIAAYNKAIEINPNYA-EVYSNLG 203

Query: 96  IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI---- 151
             LY + G+L+E I      +  I   +AF       A S+  K +  +    + I    
Sbjct: 204 FALYNQ-GKLEEAIAAYNTAIE-INPNDAFAYNNLGIALSNQGKLEEAIAAYNTAIEINP 261

Query: 152 -----LGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVL 205
                  NLG A   +G    A   Y  A  I+P DA     L   L  Q +  EA +  
Sbjct: 262 NDAFAYNNLGVALYNQGKLEEAIAAYNTAIEINPNDAFAYIGLGIALHDQGKLEEAIAAY 321

Query: 206 EDVL-----------LGKLSGSTETKTI---NRVKELLQELEPWQSIPPSLTTKKSSLED 251
              L           +  L+ +T    +    +++E + E E    I P+ TT +++L++
Sbjct: 322 NKTLSLADKKADRASVHTLAHTTLGYALQQQGKLEEAIAEYEKALKIDPNNTTAQNNLKE 381

Query: 252 AFLE 255
           A ++
Sbjct: 382 ALIQ 385


>gi|224140075|ref|XP_002323412.1| predicted protein [Populus trichocarpa]
 gi|222868042|gb|EEF05173.1| predicted protein [Populus trichocarpa]
          Length = 917

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 64/154 (41%), Gaps = 32/154 (20%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK--SFRHLCSK-QAQESLD 92
           E AIV +  A N       A  ++ V+ K +D +D+AVE  +  S   L  K    +SL+
Sbjct: 317 EMAIVFYELAFNFNPHCAEACNNLGVIYKDRDNLDKAVECYQANSDTSLSIKPNFSQSLN 376

Query: 93  NVLIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI 151
           N+ + +Y   G++D    ++++ +     + EA+N                         
Sbjct: 377 NLGV-VYTVQGKMDAAASMIEKAIMANPTYAEAYN------------------------- 410

Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
             NLG  Y   GN + A   Y +   IDPD+  A
Sbjct: 411 --NLGVLYRDAGNISMAISAYEQCLEIDPDSRNA 442


>gi|253997194|ref|YP_003049258.1| hypothetical protein Mmol_1827 [Methylotenera mobilis JLW8]
 gi|253983873|gb|ACT48731.1| Tetratricopeptide TPR_2 repeat protein [Methylotenera mobilis JLW8]
          Length = 591

 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 24/85 (28%), Positives = 39/85 (45%)

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEA 201
           +T+      +L +LGWA+ +KGN   A     +A  I+PD   A +L   L ++  + +A
Sbjct: 506 LTLAPNDHYMLDSLGWAHYRKGNLDKAIQYLEQAFRINPDPEIAAHLGEVLWQKGEFEKA 565

Query: 202 RSVLEDVLLGKLSGSTETKTINRVK 226
           + V  D L           T N+ K
Sbjct: 566 KKVWSDALTADPDNEILLTTANKFK 590


>gi|339495311|ref|YP_004715604.1| TPR domain-containing protein [Pseudomonas stutzeri ATCC 17588 =
           LMG 11199]
 gi|338802683|gb|AEJ06515.1| TPR domain-containing protein [Pseudomonas stutzeri ATCC 17588 =
           LMG 11199]
          Length = 575

 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 29/54 (53%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSV 204
           IL +LGW   + GN + AE + RKA    PD   A +L   L  +   +EAR+V
Sbjct: 495 ILDSLGWVNFRLGNLSEAEALLRKALQRFPDHEVAAHLGEVLWTKGERSEARAV 548


>gi|386021936|ref|YP_005939961.1| hypothetical protein PSTAA_3350 [Pseudomonas stutzeri DSM 4166]
 gi|327481909|gb|AEA85219.1| TPR domain-containing protein [Pseudomonas stutzeri DSM 4166]
          Length = 575

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 29/54 (53%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSV 204
           IL +LGW   + GN + AE + RKA    PD   A +L   L  +   +EAR+V
Sbjct: 495 ILDSLGWVNFRLGNLSEAEALLRKALQRFPDHEVAAHLGEVLWTKGERSEARAV 548


>gi|452747685|ref|ZP_21947478.1| TPR domain-containing protein [Pseudomonas stutzeri NF13]
 gi|452008429|gb|EME00669.1| TPR domain-containing protein [Pseudomonas stutzeri NF13]
          Length = 575

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 28/54 (51%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSV 204
           IL +LGW   + GN T AE + RKA     D   A +L   L  Q   +EAR+V
Sbjct: 495 ILDSLGWVNFRLGNLTEAEALLRKALQRFADHEVAAHLGEVLWAQGERSEARAV 548


>gi|307151943|ref|YP_003887327.1| NB-ARC domain-containing protein [Cyanothece sp. PCC 7822]
 gi|306982171|gb|ADN14052.1| NB-ARC domain protein [Cyanothece sp. PCC 7822]
          Length = 1023

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 48/174 (27%), Positives = 70/174 (40%), Gaps = 35/174 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
           V ++L ++A +   QDR  EA     EA++  + +          SL+N+ + LY   GR
Sbjct: 841 VTTSLNNLAELYCSQDRYTEAEPLYTEALEMTKRMFDGNHPSVAGSLNNLAV-LYYNQGR 899

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             E   L  + L M      F+G     A S                L NL   Y  +G 
Sbjct: 900 YTEAEPLYNEALEMT--KRMFDGDHPSVATS----------------LNNLALLYKSQGR 941

Query: 165 YTSAEVVYRKAQLI--------DPD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
           YT AE +Y +A  +         PD A    NL+     Q RY EA+S+  + L
Sbjct: 942 YTEAEPLYTEALEMRRRIFTGDHPDVALSLNNLAELYYSQDRYDEAKSLYTEAL 995


>gi|307731322|ref|YP_003908546.1| hypothetical protein [Burkholderia sp. CCGE1003]
 gi|307585857|gb|ADN59255.1| Tetratricopeptide TPR_1 repeat-containing protein [Burkholderia sp.
           CCGE1003]
          Length = 602

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 41/167 (24%), Positives = 65/167 (38%), Gaps = 18/167 (10%)

Query: 55  ALKDMAVVLKQQDRVDEAVE----AIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIE 110
           A  +  V L+ Q  ++EAVE    AI S   L    A  +L   L     K GR  E + 
Sbjct: 124 ARNNFGVALQAQGALNEAVEQYRLAIASNPALVD--AHLNLGTAL----GKLGRFAEALA 177

Query: 111 LLKQKLRM--------IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQK 162
             +  L++           G A N +    A     +  + ++ + +    NLG A  ++
Sbjct: 178 CYRNALQLDPASAEAHFNAGNAHNARGEHEAAVASFERALALRPDYAEAHVNLGSAIGKR 237

Query: 163 GNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
           G+Y  AE  YR+A  + P+      +   L  Q R  E      + L
Sbjct: 238 GDYAGAEAHYRRAVELKPNPTNLVCMGGALGAQGRLAEEEKFYREAL 284


>gi|425434676|ref|ZP_18815141.1| TPR repeat-containing protein [Microcystis aeruginosa PCC 9432]
 gi|389675804|emb|CCH95098.1| TPR repeat-containing protein [Microcystis aeruginosa PCC 9432]
          Length = 310

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 62/247 (25%), Positives = 96/247 (38%), Gaps = 50/247 (20%)

Query: 3   QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV- 61
           + RD    ++ K P     ++    + L +KD E A+  F     A  R+D  +   A+ 
Sbjct: 34  EARDAFEEILEKDPGSKQAHLGIGSICLKQKDYEGALTHF----GAARRIDPMMAKAALA 89

Query: 62  ---VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM 118
              V  +Q+ +D A+EA K              D V ID     G L     LLKQ    
Sbjct: 90  VGNVYYKQNELDSAIEAFK--------------DAVNIDPTTPAGYLGMGRVLLKQ---- 131

Query: 119 IYHGEAFNGKPTKTARSHGKKFQVTVKQET-SRILGNLGWAYMQKGNYTSAEVVYRKAQL 177
                    K    A+   +K  V   Q T  R+L  +   Y ++GN  +A      A  
Sbjct: 132 ---------KKYPQAKEQVQKALVFNPQLTPGRLL--MSQIYQEQGNTKAAIDEIESALR 180

Query: 178 IDPDANKAC-NLSHCLIKQARYTEARSVLEDVL-----------LGKLSGSTETKTINRV 225
           ++P A  A  +L +  +KQ +Y  AR   ED             +G +    E+  +N  
Sbjct: 181 MNPTAWSAYQSLGNIYLKQKQYNLARKNFEDAQQLNPKIPVVAKMGYIEALIESNALNEA 240

Query: 226 KELLQEL 232
            E+L+EL
Sbjct: 241 SEILREL 247


>gi|358392689|gb|EHK42093.1| hypothetical protein TRIATDRAFT_126880 [Trichoderma atroviride IMI
           206040]
          Length = 619

 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 28/53 (52%), Gaps = 4/53 (7%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVL 205
           L   +   GNYT A+    K  LI    N AC     HCLIKQ+R+ EA +VL
Sbjct: 41  LAQVHFATGNYTRAQAFLSKQDLIGR--NPACRYLAGHCLIKQSRFEEALAVL 91


>gi|119486282|ref|ZP_01620341.1| hypothetical protein L8106_16549 [Lyngbya sp. PCC 8106]
 gi|119456495|gb|EAW37625.1| hypothetical protein L8106_16549 [Lyngbya sp. PCC 8106]
          Length = 855

 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 53/190 (27%), Positives = 78/190 (41%), Gaps = 41/190 (21%)

Query: 52  VDSALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
           V S+L ++A +   Q R  EA     +A+  ++ L          SL+N L  LYK  GR
Sbjct: 644 VASSLNNLAGLCHSQGRNREAEPLYQQALAMYQRLFDGDHPDVASSLNN-LAYLYKSQGR 702

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             E   L +Q L M      F G     A S                L NL + Y  +G 
Sbjct: 703 NSEAEPLYQQALEMT--QRLFEGDHPDVATS----------------LNNLAYLYKSQGR 744

Query: 165 YTSAEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSVLEDVL------ 209
            + AE +Y++A     +L + D    A    NL++    Q RY+EA  + +  L      
Sbjct: 745 NSEAEPLYQQALEMRQRLFEGDHPDVATSLNNLAYLYESQGRYSEAEPLFQQALFMLVKT 804

Query: 210 LGKLSGSTET 219
           LGK   +T+T
Sbjct: 805 LGKDHPNTQT 814



 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 50/179 (27%), Positives = 73/179 (40%), Gaps = 37/179 (20%)

Query: 49  GDRVD--SALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSK---QAQESLDNVLIDLY 99
           GD  D  ++L ++A + K Q R  EA     +A+  ++ L          SL+N L  LY
Sbjct: 513 GDHPDVATSLNNLAYLYKSQGRNSEAEPLLQQALAMYQRLFEGDHPNIATSLNN-LAYLY 571

Query: 100 KKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAY 159
           +  GR  E   L +Q L M      F G     A S                L NLG  Y
Sbjct: 572 ESQGRYSEAEALFQQALEM--RQRLFEGDHPSVATS----------------LNNLGLLY 613

Query: 160 MQKGNYTSAEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSVLEDVL 209
             +G Y+ AE +Y++A     +L + D    A+   NL+     Q R  EA  + +  L
Sbjct: 614 NSQGRYSQAEPLYQQALAMRQRLFEGDHPNVASSLNNLAGLCHSQGRNREAEPLYQQAL 672


>gi|298241367|ref|ZP_06965174.1| transcriptional regulator, XRE family [Ktedonobacter racemifer DSM
           44963]
 gi|297554421|gb|EFH88285.1| transcriptional regulator, XRE family [Ktedonobacter racemifer DSM
           44963]
          Length = 850

 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 42/86 (48%), Gaps = 11/86 (12%)

Query: 154 NLGWAYMQKGNYTSAEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSV 204
           NLG+ Y  +G YT AE  Y++A     Q+  P+    AN   NL+   IKQ RY E   +
Sbjct: 614 NLGFLYYCQGRYTQAEHYYQRALDLSRQISGPEHPDTANVLNNLALTYIKQERYAECERI 673

Query: 205 LEDVL--LGKLSGSTETKTINRVKEL 228
               L    +  G+  + TIN ++ +
Sbjct: 674 YGQALAIYHRTLGAEHSHTINALQNV 699


>gi|345860131|ref|ZP_08812457.1| tetratricopeptide repeat family protein [Desulfosporosinus sp. OT]
 gi|344326772|gb|EGW38224.1| tetratricopeptide repeat family protein [Desulfosporosinus sp. OT]
          Length = 234

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 35/67 (52%), Gaps = 3/67 (4%)

Query: 136 HGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIK 194
           + +K Q       +R+  NLG+ Y  KGN   A   + +  +IDP D +   NLS  LIK
Sbjct: 77  YKQKVQTEPNNTEARV--NLGYTYFLKGNDGQAIQEFNQVLVIDPKDFDAYYNLSLVLIK 134

Query: 195 QARYTEA 201
           + RY EA
Sbjct: 135 EKRYNEA 141


>gi|46123429|ref|XP_386268.1| hypothetical protein FG06092.1 [Gibberella zeae PH-1]
          Length = 616

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVLED 207
           L   +   GNYT A+       LI    N +C     HCLIKQ+R+ EA SVL D
Sbjct: 41  LAQVHFATGNYTRAQAFLSSQNLISR--NPSCRYLAGHCLIKQSRFDEALSVLGD 93


>gi|408396447|gb|EKJ75605.1| hypothetical protein FPSE_04248 [Fusarium pseudograminearum CS3096]
          Length = 616

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVLED 207
           L   +   GNYT A+       LI    N +C     HCLIKQ+R+ EA SVL D
Sbjct: 41  LAQVHFATGNYTRAQAFLSSQNLISR--NPSCRYLAGHCLIKQSRFDEALSVLGD 93


>gi|383810579|ref|ZP_09966073.1| tetratricopeptide repeat / tetratricopeptide repeat multi-domain
           protein [Prevotella sp. oral taxon 306 str. F0472]
 gi|383356947|gb|EID34437.1| tetratricopeptide repeat / tetratricopeptide repeat multi-domain
           protein [Prevotella sp. oral taxon 306 str. F0472]
          Length = 230

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 34/68 (50%), Gaps = 5/68 (7%)

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC---NLSHCLIKQARY 198
           VT+  +TS  L   G       NY  AEV YRKA  ID + N A    NL+ CL  Q + 
Sbjct: 17  VTLNAQTSNQLIRKGNKNFSSKNYAQAEVFYRKA--IDKEPNNAIGNYNLARCLQAQHKN 74

Query: 199 TEARSVLE 206
            EA+ + E
Sbjct: 75  AEAKKIYE 82


>gi|260804643|ref|XP_002597197.1| hypothetical protein BRAFLDRAFT_66323 [Branchiostoma floridae]
 gi|229282460|gb|EEN53209.1| hypothetical protein BRAFLDRAFT_66323 [Branchiostoma floridae]
          Length = 2044

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 39/92 (42%), Gaps = 14/92 (15%)

Query: 146  QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLID-----PDANKA-------CNLSHCLI 193
            Q   R L NLG AYM     ++A   + +A  +D     PD N         CNL H L 
Sbjct: 1176 QNVVRALNNLGVAYMALREDSAAAAFFTEA--LDTLRNAPDENAGRVIVACVCNLGHVLY 1233

Query: 194  KQARYTEARSVLEDVLLGKLSGSTETKTINRV 225
                Y  AR   E++   K  G TET  ++ V
Sbjct: 1234 NLGDYQAARGHYEEIANKKRRGETETADMDAV 1265


>gi|356535232|ref|XP_003536152.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC-like [Glycine max]
          Length = 988

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 39/156 (25%), Positives = 62/156 (39%), Gaps = 10/156 (6%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P+ AI  +  A+        A  ++A +  +Q ++D A+   K     C  +  E+ +N 
Sbjct: 284 PQEAIACYQHALQTRPNYGMAYGNLASIYYEQGQLDMAILHYKQ-AVACDPRFLEAYNN- 341

Query: 95  LIDLYKKCGRLDEQIELLKQKLRM--------IYHGEAFNGKPTKTARSHGKKFQVTVKQ 146
           L +  K  GR++E I+   Q L +           G  +       A +   K  + V  
Sbjct: 342 LGNALKDVGRVEEAIQCYNQCLTLQPNHPQALTNLGNIYMEWNMVAAAAQYYKATLNVTT 401

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
             S    NL   Y Q+GNY  A   Y +   IDP A
Sbjct: 402 GLSAPYNNLAIIYKQQGNYVDAISCYNEVLRIDPLA 437


>gi|260592518|ref|ZP_05857976.1| BatC protein [Prevotella veroralis F0319]
 gi|260535564|gb|EEX18181.1| BatC protein [Prevotella veroralis F0319]
          Length = 230

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 34/68 (50%), Gaps = 5/68 (7%)

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC---NLSHCLIKQARY 198
           VT+  +TS  L   G       NY  AEV YRKA  ID + N A    NL+ CL  Q + 
Sbjct: 17  VTLNAQTSNQLIRKGNKNFSSKNYAQAEVFYRKA--IDKEPNNAIGNYNLARCLQAQHKN 74

Query: 199 TEARSVLE 206
            EA+ + E
Sbjct: 75  AEAKKIYE 82


>gi|262193489|ref|YP_003264698.1| hypothetical protein [Haliangium ochraceum DSM 14365]
 gi|262076836|gb|ACY12805.1| TPR repeat-containing protein [Haliangium ochraceum DSM 14365]
          Length = 963

 Score = 39.7 bits (91), Expect = 1.6,   Method: Composition-based stats.
 Identities = 43/169 (25%), Positives = 78/169 (46%), Gaps = 15/169 (8%)

Query: 22  YVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRH 81
           YV A+ V L ++D + A+  F   I AG         + ++ + QD +DEA+  + + + 
Sbjct: 750 YVSAE-VALRQRDVDTALEHFRALIRAGGDSFDVRGRLGMIARLQDDLDEAIRQLCAAKR 808

Query: 82  LCSKQAQESLDNVLIDLYKKCGRLDEQIE------LLKQ----KLRMIYHGEAFNGKPTK 131
           L  +++    +  L D+YK+ GR+DE +       +L+Q     LR +    A   +  K
Sbjct: 809 LDPERSYPYSE--LYDVYKEAGRMDEALAELETYVMLEQMQYGPLRELVGAYAERARWDK 866

Query: 132 TARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
             R +G +  V +    + +   LG AY++ G+   A   +  A L  P
Sbjct: 867 V-RVYG-EMAVHINPSDAELFLMLGQAYLETGDPAQALFTFDSALLAKP 913


>gi|196233636|ref|ZP_03132477.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
 gi|196222306|gb|EDY16835.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
          Length = 752

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 45/184 (24%), Positives = 77/184 (41%), Gaps = 18/184 (9%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHL--CSKQAQESLD 92
           P+ A+  F +A+        AL ++ + L    R+DEA+   +    +     +   +L 
Sbjct: 192 PQEAMTAFRRALEINPDYADALNNLGITLADAGRLDEAIACYRRALQINPAGAETNTNLG 251

Query: 93  NVLIDLYKKCGRLDEQ-------IELLKQKLRMIYH--GEAFNGKPTKTARSHGKKFQVT 143
           N L +L     RLDE        IEL K  L   Y+  G A   +      S      + 
Sbjct: 252 NALFELQ----RLDEAAAAFRAVIEL-KPDLAQAYNNLGNALREQGALNEASAEFLHALA 306

Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEAR 202
           ++  ++    NLG A   +G   +A   YR+A  + P D+    N  + L+ Q R  + R
Sbjct: 307 IEPNSADFHNNLGNALKDRGEIDAALDAYRRAMELAPDDSGPWTNFVYTLLFQPRVDD-R 365

Query: 203 SVLE 206
           +++E
Sbjct: 366 ALIE 369



 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 50/206 (24%), Positives = 86/206 (41%), Gaps = 16/206 (7%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCS--KQAQESLD 92
           PE A+ L  +A+NA  +  +A  ++   L +  R++EA +A      L     QA  +L 
Sbjct: 56  PEIAVELIQQALNAAPQHVAAHFNLGNALSELGRMEEAADAFGRATELQPDYAQAHHNLG 115

Query: 93  NVLIDLYKKCGRLDEQIELLKQKLRM------IYH--GEAFNGKPTKTARSHGKKFQVTV 144
           + L     K GR DE I   ++ + +       Y+  G A   +  +       +  + +
Sbjct: 116 SAL----AKRGRFDEAIAAFQRAIELKPDYASAYNNLGLALKAQARRDEALAAFQQAIAL 171

Query: 145 KQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEARS 203
           + + +    NLG  + +      A   +R+A  I+PD A+   NL   L    R  EA +
Sbjct: 172 QPDHAEAHFNLGNIFREWARPQEAMTAFRRALEINPDYADALNNLGITLADAGRLDEAIA 231

Query: 204 VLEDVLLGKLSGS-TETKTINRVKEL 228
                L    +G+ T T   N + EL
Sbjct: 232 CYRRALQINPAGAETNTNLGNALFEL 257


>gi|418065486|ref|ZP_12702859.1| Tetratricopeptide TPR_2 repeat protein [Geobacter metallireducens
           RCH3]
 gi|373562226|gb|EHP88443.1| Tetratricopeptide TPR_2 repeat protein [Geobacter metallireducens
           RCH3]
          Length = 592

 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 51/220 (23%), Positives = 91/220 (41%), Gaps = 20/220 (9%)

Query: 6   DQPYHVVHKLPPGDSPYVRA-KHVQLVEKD---PEAAIVLFWKAINAG-DRVDSALKDMA 60
           D+      K+PPG + YV A  H+  + +D   PE  I +   AI A  D++D  L  +A
Sbjct: 354 DKAIEEFSKIPPGTANYVEALGHIAFMYRDQEKPEKGIQILTDAITANPDKLDLYLY-LA 412

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLK------- 113
            + +  D+  E +  +K      ++  +  L   +  +  K G  +E I  +K       
Sbjct: 413 GLYESMDKFSEGLAVLKGVEGKFAEDPR--LHFRMGTILDKMGNKEESIARMKRVIAITP 470

Query: 114 ---QKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSA-E 169
              Q L  + +  A  G     A  + KK  V ++     IL +LGW Y +   Y  A  
Sbjct: 471 DDAQALNYLGYTYAEMGIKLDEALQYLKK-AVALRPNDGFILDSLGWVYFKMKRYDEAVP 529

Query: 170 VVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
           ++ R  ++++ D     +L+        Y  A  + + +L
Sbjct: 530 LLERSLKVVEDDLTVMEHLADAYAANHEYRNALKLYKKIL 569


>gi|404496093|ref|YP_006720199.1| TPR domain-containing protein [Geobacter metallireducens GS-15]
 gi|78193702|gb|ABB31469.1| TPR domain protein [Geobacter metallireducens GS-15]
          Length = 573

 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 51/220 (23%), Positives = 91/220 (41%), Gaps = 20/220 (9%)

Query: 6   DQPYHVVHKLPPGDSPYVRA-KHVQLVEKD---PEAAIVLFWKAINAG-DRVDSALKDMA 60
           D+      K+PPG + YV A  H+  + +D   PE  I +   AI A  D++D  L  +A
Sbjct: 335 DKAIEEFSKIPPGTANYVEALGHIAFMYRDQEKPEKGIQILTDAITANPDKLDLYLY-LA 393

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLK------- 113
            + +  D+  E +  +K      ++  +  L   +  +  K G  +E I  +K       
Sbjct: 394 GLYESMDKFSEGLAVLKGVEGKFAEDPR--LHFRMGTILDKMGNKEESIARMKRVIAITP 451

Query: 114 ---QKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSA-E 169
              Q L  + +  A  G     A  + KK  V ++     IL +LGW Y +   Y  A  
Sbjct: 452 DDAQALNYLGYTYAEMGIKLDEALQYLKK-AVALRPNDGFILDSLGWVYFKMKRYDEAVP 510

Query: 170 VVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
           ++ R  ++++ D     +L+        Y  A  + + +L
Sbjct: 511 LLERSLKVVEDDLTVMEHLADAYAANHEYRNALKLYKKIL 550


>gi|67923685|ref|ZP_00517153.1| TPR repeat:TPR repeat [Crocosphaera watsonii WH 8501]
 gi|67854485|gb|EAM49776.1| TPR repeat:TPR repeat [Crocosphaera watsonii WH 8501]
          Length = 530

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 50/194 (25%), Positives = 80/194 (41%), Gaps = 17/194 (8%)

Query: 27  HVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQ 86
           H Q  E   EAAI  + KA+      D A  +  + L    R D+A+ +      L   +
Sbjct: 252 HTQQAEY--EAAIASYDKALQLTPDYDLAWNNRGIALANVGRFDKAIASYDKALQLTPDK 309

Query: 87  AQESLDNVLIDLYKKCGRLDEQIELLKQKLRM------IYHGEAFN----GKPTKTARSH 136
             E+  N  I L+ + GR DE I    + L++       ++   +     G+  +   S+
Sbjct: 310 -DEAWCNRGIALFNR-GRSDEAIASFDKALQLKPDDHQAWNNRGYALRQLGRSDEAIASY 367

Query: 137 GKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQ 195
            K  Q+  K +  +   N G+A  Q G +  A   Y KA  + PD  +A  N    L K 
Sbjct: 368 DKALQL--KPDDHQAWNNRGYALRQLGRFDEAIASYYKALQLKPDYYEAWHNRGIALRKL 425

Query: 196 ARYTEARSVLEDVL 209
            R+ EA +  +  L
Sbjct: 426 GRFDEAIASYDKAL 439


>gi|387126389|ref|YP_006294994.1| hypothetical protein Q7A_501 [Methylophaga sp. JAM1]
 gi|386273451|gb|AFI83349.1| hypothetical protein Q7A_501 [Methylophaga sp. JAM1]
          Length = 591

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 65/156 (41%), Gaps = 19/156 (12%)

Query: 60  AVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMI 119
           A  L++ D  +E+VE +     L +      L      + +  GRL    E+L+Q LR I
Sbjct: 417 AAFLREHDMPEESVEVLTE--ALTNHPGNLDLLYSRAMVAESIGRL----EMLEQDLRTI 470

Query: 120 Y-----HGEAFNG-KPTKTARS--HGKKFQVTVKQETSR-----ILGNLGWAYMQKGNYT 166
                 + +A N    T T R+  H +  Q+  K    R      L +LGW Y + GN  
Sbjct: 471 LKIDPENAQAMNALGYTLTDRTDRHEEALQLINKALELRPNDPYYLDSLGWVYYRMGNLE 530

Query: 167 SAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEAR 202
            AE   R+A  I  D     +L   L  Q +  EAR
Sbjct: 531 KAEYYLREAVEIQADVEFVAHLGEVLWMQNKKNEAR 566


>gi|118578955|ref|YP_900205.1| hypothetical protein Ppro_0516 [Pelobacter propionicus DSM 2379]
 gi|118501665|gb|ABK98147.1| Tetratricopeptide TPR_2 repeat protein [Pelobacter propionicus DSM
           2379]
          Length = 673

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 57/208 (27%), Positives = 85/208 (40%), Gaps = 42/208 (20%)

Query: 7   QPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRV-------DSALKDM 59
           + Y    KL P DSP ++ K  +++ K  E ++     AI+A D V         A +++
Sbjct: 396 EQYSEFLKLKP-DSPDIQLKLARILAKKKETSL-----AIDAYDAVLKSAPDNPEANREI 449

Query: 60  AVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMI 119
           A + K +   D AV   +    L    A     + L+ LY K  + DE  ELLK  + + 
Sbjct: 450 AALYKAKGMNDRAVAHYRKALELRKDDADTR--SALVSLYVKNRQYDEITELLKGAVELF 507

Query: 120 YHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLID 179
                      + A +H K                LG  +  K  Y SA   Y+KA  + 
Sbjct: 508 ----------PEDANNHYK----------------LGLIHEFKKEYGSAIACYQKAAELR 541

Query: 180 PDANKACN-LSHCLIKQARYTEARSVLE 206
           PD  +A N L    +K  R +EAR  LE
Sbjct: 542 PDHARALNALGRMYMKTDRISEAREALE 569


>gi|116200818|ref|XP_001226221.1| hypothetical protein CHGG_10954 [Chaetomium globosum CBS 148.51]
 gi|88175668|gb|EAQ83136.1| hypothetical protein CHGG_10954 [Chaetomium globosum CBS 148.51]
          Length = 1002

 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 38/72 (52%), Gaps = 9/72 (12%)

Query: 149 SRILGNLGWAYMQKGNYTSAEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYT 199
           +R++ NLG  Y ++G Y  AE + ++A     ++  P+     +   NL H  I Q RYT
Sbjct: 737 ARVIHNLGNLYAEQGRYKEAEALLKRALERNEEVWGPEREWTLSTVSNLGHVYIYQQRYT 796

Query: 200 EARSVLEDVLLG 211
           EA ++ +  L G
Sbjct: 797 EAEALYDRALEG 808


>gi|351730791|ref|ZP_08948482.1| tetratricopeptide tpr_1 repeat-containing protein [Acidovorax
           radicis N35]
          Length = 809

 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 46/191 (24%), Positives = 83/191 (43%), Gaps = 25/191 (13%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P  A+  + +AI    R+ +A  +    L+Q DR++EA++        C +QA  +LD+ 
Sbjct: 137 PLDAVASYDQAIALQPRLAAAFNNKGNALRQLDRLEEALK--------CYEQA-FALDSD 187

Query: 95  LID-------LYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHG-----KKFQ 141
            +D       L+   GR DE ++   + + +   H EA NGK    A+          F+
Sbjct: 188 DVDACQNMGMLHADAGRQDEALKCFDRVIALRPQHAEAHNGKGAILAQREQWTQAIAHFE 247

Query: 142 VTVKQETSRILG--NLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARY 198
             ++     +    NLG A       ++A   ++KA  + P D      L+  L++  RY
Sbjct: 248 AAIQGNEKLVQAHKNLGLAQRSLFQLSNAVTAFQKAAQLSPQDVEILSFLALSLLEAGRY 307

Query: 199 TEARSVLEDVL 209
            EA +  +  +
Sbjct: 308 AEALAAYDQAI 318


>gi|302416639|ref|XP_003006151.1| anaphase-promoting complex subunit cut9 [Verticillium albo-atrum
           VaMs.102]
 gi|261355567|gb|EEY17995.1| anaphase-promoting complex subunit cut9 [Verticillium albo-atrum
           VaMs.102]
          Length = 620

 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 28/53 (52%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLED 207
           L   +   GNYT A+    K  L+  + +     +HCLIKQ+RY EA  +L D
Sbjct: 43  LAQVHFATGNYTRAQTFLSKQDLVTRNTSCRYLAAHCLIKQSRYDEALGLLGD 95


>gi|389874415|ref|YP_006373771.1| hypothetical protein TMO_c0179 [Tistrella mobilis KA081020-065]
 gi|388531595|gb|AFK56789.1| TPR repeat-containing protein [Tistrella mobilis KA081020-065]
          Length = 579

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 48/170 (28%), Positives = 73/170 (42%), Gaps = 18/170 (10%)

Query: 50  DRVDSALKDMAVVL----KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID--LYKKCG 103
           +RV +  KD A+ L    ++Q+R D+A  A  S          E  D        Y++ G
Sbjct: 367 ERVAAGQKDAAISLAELYRRQERWDDAARAYSSALAALVDPGPEHWDLFYARGVAYERAG 426

Query: 104 RLDE-QIELLK-------QKLRMIYHGEAF--NGKPTKTARSHGKKFQVTVKQETSRILG 153
           R  E + + LK       Q L + Y G ++   G     AR+  +K  VT++     I+ 
Sbjct: 427 RWPEAESDFLKALQLSPDQPLVLNYLGYSWVEKGMNLDRARAMLEK-AVTLRPRDGFIVD 485

Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEAR 202
           +LGWA  Q G+Y  A     +A L+ P D     +L     +  R  EAR
Sbjct: 486 SLGWALFQMGDYEGAVARLEQAVLLQPNDPTINDHLGDAYWRVGREREAR 535


>gi|113474879|ref|YP_720940.1| hypothetical protein Tery_1102 [Trichodesmium erythraeum IMS101]
 gi|110165927|gb|ABG50467.1| Tetratricopeptide TPR_2 [Trichodesmium erythraeum IMS101]
          Length = 1507

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 49/170 (28%), Positives = 67/170 (39%), Gaps = 35/170 (20%)

Query: 56  LKDMAVVLKQQDRVDEA-------VEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQ 108
           L ++A + K Q R  EA       +E  K        Q    L+N L  LY+  GR  E 
Sbjct: 460 LNNLAGLYKSQGRYSEAEPLYKQAIEIHKVALPANHPQRASGLNN-LAGLYRAQGRYSEA 518

Query: 109 IELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSA 168
             LLKQ +  IY          K A      F  T        L NL   Y  +G Y+ A
Sbjct: 519 EPLLKQAIE-IY----------KVALPANHPFLATN-------LNNLAELYRAQGRYSEA 560

Query: 169 EVVYRKAQLID--------PD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
           E +Y++A  ID        P+ A    NL+     Q RY+EA  + +  +
Sbjct: 561 EPLYKQAIEIDNIALPANHPELATNLNNLAELYRAQGRYSEAEPLYKQAI 610


>gi|390438134|ref|ZP_10226629.1| NB-ARC domain protein [Microcystis sp. T1-4]
 gi|389838454|emb|CCI30753.1| NB-ARC domain protein [Microcystis sp. T1-4]
          Length = 896

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 49/174 (28%), Positives = 72/174 (41%), Gaps = 35/174 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
           V S+L  +AV+   Q R +EA    ++A++  + L  +       SL+N L  LYK  G+
Sbjct: 549 VASSLNGLAVLYHLQGRYEEAEPLYLQALELRKQLLGENHPDVATSLNN-LAGLYKSQGK 607

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
            +E   L  Q L +                   K+       + +  L NL   Y  +G 
Sbjct: 608 YEETEPLYLQALEL------------------RKRLLGENHTDFATSLNNLAELYRSQGR 649

Query: 165 YTSAEVVY------RKAQLIDPD---ANKACNLSHCLIKQARYTEARSVLEDVL 209
           YT AE +Y      RK  L D     A    NL+     Q +YTEA S+L + L
Sbjct: 650 YTEAEPLYLEALDLRKRLLGDNHPKVAQSLNNLAGLYYSQGKYTEAESLLLEAL 703


>gi|158341205|ref|YP_001522431.1| TPR repeat-containing serine/threonin protein kinase [Acaryochloris
           marina MBIC11017]
 gi|158311446|gb|ABW33057.1| serine/threonine protein kinase with TPR repeats [Acaryochloris
           marina MBIC11017]
          Length = 578

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 38/88 (43%), Gaps = 13/88 (14%)

Query: 149 SRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDV 208
           +R+L NLGWA  + G+Y  AE   ++A   DP+   A    HCLI Q    +    L   
Sbjct: 493 ARLLKNLGWAEFKLGDYRQAEGRLQQAIRSDPEMAAA----HCLIAQVYQEQGNDTL--- 545

Query: 209 LLGKLSGSTETKTINRVKELLQELEPWQ 236
                      K   R+   L E+E W+
Sbjct: 546 ------AQQSWKACLRLDSFLPEVEEWK 567


>gi|358382374|gb|EHK20046.1| hypothetical protein TRIVIDRAFT_47920 [Trichoderma virens Gv29-8]
          Length = 620

 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 28/53 (52%), Gaps = 4/53 (7%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVL 205
           L   +   GNYT A+    K  LI    N AC     HCLIKQ+R+ EA +VL
Sbjct: 41  LAQVHFATGNYTRAQSFLSKQDLISR--NPACRYLAGHCLIKQSRFDEALAVL 91


>gi|146306092|ref|YP_001186557.1| hypothetical protein Pmen_1058 [Pseudomonas mendocina ymp]
 gi|145574293|gb|ABP83825.1| Tetratricopeptide TPR_2 repeat protein [Pseudomonas mendocina ymp]
          Length = 575

 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 30/60 (50%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLL 210
           IL +LGW   + GN   AE + R+A    PD   A +L   L  Q +  EAR V  D L+
Sbjct: 495 ILDSLGWVNYRLGNLEEAERLLRQALEKFPDHEVAAHLGEVLWAQGKQREARRVWRDALV 554


>gi|255600764|ref|XP_002537530.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
           communis]
 gi|223516024|gb|EEF24853.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
           communis]
          Length = 450

 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTE 200
           V++  E+   L  LG   +   N    E  YR+A  +DPD  +A  NLS+ L++QAR+ E
Sbjct: 100 VSLAPESPTPLSQLGAVLVCMHNEPEGEDCYRRALALDPDHARAQFNLSYLLLRQARFDE 159

Query: 201 ARSVLE 206
              +L+
Sbjct: 160 GWRMLD 165


>gi|90411197|ref|ZP_01219210.1| hypothetical protein P3TCK_06512 [Photobacterium profundum 3TCK]
 gi|90328043|gb|EAS44364.1| hypothetical protein P3TCK_06512 [Photobacterium profundum 3TCK]
          Length = 361

 Score = 39.3 bits (90), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 36/67 (53%), Gaps = 1/67 (1%)

Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI-DPDANKACNLSHCLIKQARYTEAR 202
           +  ++ + L NLG++Y   G+   AEV+ R+A  I   D     NL    I+  RY++A 
Sbjct: 218 INHKSRQALTNLGYSYYLSGDLKQAEVINRRATTIYQTDKRAWSNLGLVYIRSKRYSDAL 277

Query: 203 SVLEDVL 209
             L++V+
Sbjct: 278 DALQNVM 284


>gi|428212874|ref|YP_007086018.1| capsular polysaccharide biosynthesis protein [Oscillatoria
           acuminata PCC 6304]
 gi|428001255|gb|AFY82098.1| capsular polysaccharide biosynthesis protein [Oscillatoria
           acuminata PCC 6304]
          Length = 791

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 45/186 (24%), Positives = 81/186 (43%), Gaps = 17/186 (9%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCS--KQAQESLDN 93
           E A   F KAI     + +    +  +L+QQ+ ++ A++  KS   L     +A  +L  
Sbjct: 195 EEAFSCFQKAIMLQPNLAAGHFKLGYLLQQQNELESAIDCFKSTIELQPDWNEAHNNLGL 254

Query: 94  VLIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGK--------KFQVTV 144
           VL    +K  R +E I   K+ + +     EA+    T T +  GK        +  + +
Sbjct: 255 VL----RKINREEEAISSFKKAIEINPNFAEAYRNLGT-TLQQQGKLEAAAACLRDCIKI 309

Query: 145 KQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARS 203
           +   +   GNLG+   Q+G    A+   R A  ++PD   A  NL + L ++    E+ S
Sbjct: 310 QPNFALAHGNLGYVLEQQGKLDEAKASLRHAIALEPDLAMAYGNLGNILHREGELEESIS 369

Query: 204 VLEDVL 209
             ++ +
Sbjct: 370 CFQNAI 375


>gi|323489123|ref|ZP_08094357.1| Tetratricopeptide repeat protein [Planococcus donghaensis MPA1U2]
 gi|323397246|gb|EGA90058.1| Tetratricopeptide repeat protein [Planococcus donghaensis MPA1U2]
          Length = 1382

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 72/155 (46%), Gaps = 22/155 (14%)

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSV 204
           ++T  +   LG   M++  +T AE  YRK+  +DP D     N++H L+KQ ++ E    
Sbjct: 720 EKTDVLHSYLGHIAMEQEQFTEAETEYRKSLELDPTDLFTVTNIAHTLLKQEKHVELNKF 779

Query: 205 LEDVLLGKLSGSTETKTINRVKELL-QELEPWQSIPPSLTTKKSSLEDAFLEGLDDLMNQ 263
              V+  K +G  +T   NR   +L QE E   +   ++   +S +E   L  L ++  Q
Sbjct: 780 ---VMQYKETG--DTFYFNRTAAMLWQESEDDLAKTLAIDLLESGMERDTL-NLREIAEQ 833

Query: 264 WTP------YRSRRLPIFEEISPFRD-----QLAC 287
           +        +RSR   +FE    FR+     QL C
Sbjct: 834 YAEFGEAPQFRSR---MFERFKRFRETAPDSQLLC 865


>gi|363728118|ref|XP_416374.3| PREDICTED: transmembrane and TPR repeat-containing protein 1
           [Gallus gallus]
          Length = 797

 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 76/188 (40%), Gaps = 22/188 (11%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLC--SKQAQESLDN 93
           E A++L   +I  G     A   +A +L +Q+R+ EA E  K+    C  S     +   
Sbjct: 480 EEAVILLRDSIKYGPDFADAYSSLASLLAEQERLKEAEEVYKAGIENCPESSDLHNNYGV 539

Query: 94  VLIDLYKKCGRLDEQIELLKQKLRM--IYHGEAFN--------GKPTKTARSHGKKFQVT 143
            L+D     G  +  +   +Q + +   +H    N        G+  +    + +  +V+
Sbjct: 540 FLVD----TGAPERAVSHYRQAIHLSPTHHVAMVNLGRLHRSLGQNKEAEAWYKRALKVS 595

Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN--LSHCLIKQARYTEA 201
            K E   IL  LG  Y   G Y  A  VYR+A  + P +NK     L+  L    R  EA
Sbjct: 596 RKAE---ILSPLGALYYNTGRYEEALQVYREAASLQP-SNKETRLALAQVLAMMGRTKEA 651

Query: 202 RSVLEDVL 209
             +   VL
Sbjct: 652 EKMTNHVL 659


>gi|386851106|ref|YP_006269119.1| cold-shock protein, DNA-binding protein [Actinoplanes sp. SE50/110]
 gi|359838610|gb|AEV87051.1| cold-shock protein, DNA-binding protein [Actinoplanes sp. SE50/110]
          Length = 1462

 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 51/96 (53%), Gaps = 3/96 (3%)

Query: 22  YVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAI-KSFR 80
           Y +AK+++   KD E A + +  AI      +SA+KD+A + ++ D  + A++ I + F 
Sbjct: 369 YDQAKYLEHRAKDLERAKMAYRDAIRKNINAESAIKDLAWLTRRVDGPEAALQVIEEEFA 428

Query: 81  HLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKL 116
                 A  SLD +LID Y    R ++ + LL+  L
Sbjct: 429 DRLPPSA--SLDQILIDFYMGSQRYEDALRLLEPML 462


>gi|301094298|ref|XP_002896255.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase,
           putative [Phytophthora infestans T30-4]
 gi|262109650|gb|EEY67702.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase,
           putative [Phytophthora infestans T30-4]
          Length = 977

 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 91/230 (39%), Gaps = 22/230 (9%)

Query: 42  FWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCS--KQAQESLDNVLIDLY 99
           + +AI        A  ++A +LK   ++D AVE  +    L      A  +L N L    
Sbjct: 195 YAQAIRVKPSFAIAWSNLAGLLKDDGQLDAAVEHYREAIRLAPDFADAYSNLGNAL---- 250

Query: 100 KKCGRLDEQIELLKQKLRM-----IYHGE---AFNGKPTKTARSHGKKFQVTVKQETSRI 151
           K+ GR+DE I+  K  L++     I HG     +          H  +  + ++      
Sbjct: 251 KESGRVDEAIQAYKSALQIRPNFAIAHGNLASCYYDAGQMELAIHTFRHAIQLEPNFPDA 310

Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHC-----LIKQAR--YTEARS 203
             NLG A  + G+   A   YR A  + PD   A  NL +      L+K+A   YT A  
Sbjct: 311 YNNLGNALRECGHLEQAVTCYRTALQLKPDHPHAYNNLGNALKDKGLVKEALHCYTTAAR 370

Query: 204 VLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
           +L        +  +  K   ++ + L   +   +I P+     S++ + F
Sbjct: 371 LLPQFAAAHSNIGSVLKEQGKLDQALAHYQQAITIDPNFADAYSNMGNVF 420


>gi|367036871|ref|XP_003648816.1| hypothetical protein THITE_2106692 [Thielavia terrestris NRRL 8126]
 gi|346996077|gb|AEO62480.1| hypothetical protein THITE_2106692 [Thielavia terrestris NRRL 8126]
          Length = 1035

 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 51/128 (39%), Gaps = 27/128 (21%)

Query: 93  NVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRIL 152
           N L ++Y+K GRL E   + ++         A  G+     R H    Q          +
Sbjct: 539 NNLGNVYRKRGRLTEAESMFQR---------ALEGQENALGRDHPSTLQT---------V 580

Query: 153 GNLGWAYMQKGNYTSAEVVYRKA------QLIDPDA---NKACNLSHCLIKQARYTEARS 203
            NLG  Y  +G  T AE +++ A       L   D    +  C+L     KQ R TEA S
Sbjct: 581 HNLGIFYANQGRLTEAESMFQWALEGQEKALAQDDVSTLDTVCSLGSVYWKQGRLTEAES 640

Query: 204 VLEDVLLG 211
           + +  L G
Sbjct: 641 MYQRALEG 648


>gi|410914070|ref|XP_003970511.1| PREDICTED: LOW QUALITY PROTEIN: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like
           [Takifugu rubripes]
          Length = 1036

 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 46/198 (23%), Positives = 76/198 (38%), Gaps = 45/198 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI        A  ++A  LK++  V EA E   +   LC   A +SL+N L +
Sbjct: 268 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 325

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E I+L ++ L                          ++++ EA    PT  
Sbjct: 326 IKREQGNIEEAIQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 385

Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
               + G   +               + +    +    NL   +   GN   A   YR A
Sbjct: 386 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 445

Query: 176 QLIDPDANKA-CNLSHCL 192
             + PD   A CNL+HCL
Sbjct: 446 LKLKPDFPDAYCNLAHCL 463


>gi|351700623|gb|EHB03542.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit [Heterocephalus glaber]
          Length = 1046

 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 67/162 (41%), Gaps = 37/162 (22%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV----EAIKSFRHLCSKQAQESL 91
           E A+ L+ KA+       +A  ++A VL+QQ ++ E +    EAI+             +
Sbjct: 344 EEAVRLYRKALEGFPDFAAAHSNLASVLQQQGKLQETLMHYKEAIR-------------I 390

Query: 92  DNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI 151
                D Y   G + ++++ ++  L+   +  A    PT  A +H               
Sbjct: 391 SPTFADAYSNMGNILKEMQDVQGALQC--YTRAIQINPT-FADAH--------------- 432

Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCL 192
             NL   +M  GN   A   YR A  + PD   A CNL+HCL
Sbjct: 433 -SNLASIHMDSGNIPEAIASYRTALKLKPDFPDAYCNLAHCL 473


>gi|254445786|ref|ZP_05059262.1| tetratricopeptide repeat domain protein [Verrucomicrobiae bacterium
           DG1235]
 gi|198260094|gb|EDY84402.1| tetratricopeptide repeat domain protein [Verrucomicrobiae bacterium
           DG1235]
          Length = 453

 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 29/56 (51%), Gaps = 11/56 (19%)

Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL------SHCLIKQARYTEA 201
            G LG+ +  +GNY SAEV Y +A L +PD     NL      +   +K  RY EA
Sbjct: 161 FGMLGYCHYAEGNYISAEVAYNRAVLAEPD-----NLDWLEGKAQTYLKAERYVEA 211


>gi|209886536|ref|YP_002290393.1| hypothetical protein OCAR_7425 [Oligotropha carboxidovorans OM5]
 gi|337739942|ref|YP_004631670.1| TPR repeat-containing protein [Oligotropha carboxidovorans OM5]
 gi|386028960|ref|YP_005949735.1| TPR repeat protein [Oligotropha carboxidovorans OM4]
 gi|209874732|gb|ACI94528.1| TPR repeat protein [Oligotropha carboxidovorans OM5]
 gi|336094028|gb|AEI01854.1| TPR repeat protein [Oligotropha carboxidovorans OM4]
 gi|336097606|gb|AEI05429.1| TPR repeat protein [Oligotropha carboxidovorans OM5]
          Length = 267

 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 41/81 (50%), Gaps = 3/81 (3%)

Query: 127 GKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKA 185
           GKP +  R +    ++  + +   +L NLG +Y+       AE   R+AQ + P D+   
Sbjct: 151 GKPDEARRYYANALKI--RPDEPSVLSNLGMSYVLTKELPRAEATLRRAQQLAPNDSRIR 208

Query: 186 CNLSHCLIKQARYTEARSVLE 206
            NL   L  Q R+TEA ++++
Sbjct: 209 QNLGLALGLQGRFTEAEAIVQ 229


>gi|72383071|ref|YP_292426.1| TPR repeat-containing protein [Prochlorococcus marinus str. NATL2A]
 gi|72002921|gb|AAZ58723.1| TPR repeat [Prochlorococcus marinus str. NATL2A]
          Length = 909

 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 1/69 (1%)

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTE 200
           + +K + + +  NLG      GN   AE  YRKA  I PD A+   NL + L + + +T+
Sbjct: 401 IQIKPDYAEVYSNLGNVLKDLGNLQDAEFSYRKAIQIKPDYADAYSNLGNILKELSNFTD 460

Query: 201 ARSVLEDVL 209
           A +  +D L
Sbjct: 461 AINQFKDAL 469


>gi|47222947|emb|CAF99103.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1037

 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 46/198 (23%), Positives = 76/198 (38%), Gaps = 45/198 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI        A  ++A  LK++  V EA E   +   LC   A +SL+N L +
Sbjct: 269 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 326

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E I+L ++ L                          ++++ EA    PT  
Sbjct: 327 IKREQGNIEEAIQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 386

Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
               + G   +               + +    +    NL   +   GN   A   YR A
Sbjct: 387 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 446

Query: 176 QLIDPDANKA-CNLSHCL 192
             + PD   A CNL+HCL
Sbjct: 447 LKLKPDFPDAYCNLAHCL 464


>gi|347734615|ref|ZP_08867638.1| TPR repeat-containing protein [Azospirillum amazonense Y2]
 gi|346922358|gb|EGY02771.1| TPR repeat-containing protein [Azospirillum amazonense Y2]
          Length = 579

 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 34/63 (53%), Gaps = 1/63 (1%)

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTE 200
           V +K + S + GNLG A  + G   +AE  YR+A  + PD A+   NL + L    R+ E
Sbjct: 95  VALKPDDSAMHGNLGIALAKLGRLDAAEGAYRRAIALRPDNADAHSNLGNVLRHHGRWDE 154

Query: 201 ARS 203
           A +
Sbjct: 155 AEA 157


>gi|340519385|gb|EGR49624.1| cell division control protein 16 [Trichoderma reesei QM6a]
          Length = 616

 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 28/53 (52%), Gaps = 4/53 (7%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVL 205
           L   +   GNYT A+    K  LI    N AC     HCLIKQ+R+ EA +VL
Sbjct: 41  LAQVHFATGNYTRAQSFLSKQDLISR--NPACRYLAGHCLIKQSRFEEALAVL 91


>gi|220933709|ref|YP_002512608.1| Tetratricopeptide TPR_2 repeat-containing protein [Thioalkalivibrio
           sulfidophilus HL-EbGr7]
 gi|219995019|gb|ACL71621.1| Tetratricopeptide TPR_2 repeat protein [Thioalkalivibrio
           sulfidophilus HL-EbGr7]
          Length = 574

 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 31/63 (49%)

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
           E + IL +LGW   + G    AE   R+A     D   A NL+  L  + +  EAR VLE
Sbjct: 494 EDAAILDSLGWVLYRMGRLDEAETYLRQAYDTMYDPEIASNLAMLLWDRGQRDEARRVLE 553

Query: 207 DVL 209
           D L
Sbjct: 554 DAL 556


>gi|428215261|ref|YP_007088405.1| Tfp pilus assembly protein PilF [Oscillatoria acuminata PCC 6304]
 gi|428003642|gb|AFY84485.1| Tfp pilus assembly protein PilF [Oscillatoria acuminata PCC 6304]
          Length = 371

 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 95/217 (43%), Gaps = 14/217 (6%)

Query: 28  VQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQA 87
           V L ++D + A+  F +  +   R   A + +   L QQ R+DEAV A++    L  +Q 
Sbjct: 159 VLLRQEDYDGALQAFQQVTSRDTRYWQAYESIGTALLQQGRIDEAVTALQQAAALAPRQG 218

Query: 88  QESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQE 147
             S+  +L   +   G  +E +E  KQ  ++            K  +   +  +  V  +
Sbjct: 219 --SIQMILGVAFLTQGNTNEALETFKQAAQLEPRNAQLQLAIGKLLQDQEQMIEALVTYQ 276

Query: 148 TSRIL--------GNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARY 198
            +  L          +   ++Q+ +Y  A V YR+   + PD   A  NL+  L  + R 
Sbjct: 277 RAASLAPDLIEPQAAIAAIHLQQEDYLQAIVAYRRLTELAPDNGDAYYNLALALRGRERI 336

Query: 199 TEARSVLEDV-LLGKLSGSTETKTINRVKELLQELEP 234
           +EA   L+    L +  G  ET+ + +V+ LL EL+P
Sbjct: 337 SEAIEQLQKAQALYQEQG--ETERLEKVEALLSELKP 371


>gi|427399267|ref|ZP_18890505.1| hypothetical protein HMPREF9710_00101 [Massilia timonae CCUG 45783]
 gi|425721758|gb|EKU84666.1| hypothetical protein HMPREF9710_00101 [Massilia timonae CCUG 45783]
          Length = 722

 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDVLLGKL 213
           LG  + +KG+Y  AE  +R +  ++PDA  A  NLS  L +Q R  EA  VL++ L  + 
Sbjct: 489 LGELHARKGDYIKAEQAFRDSIRLEPDAVYAYANLSAVLQRQGREKEALQVLQEGLQVRP 548

Query: 214 SG 215
           SG
Sbjct: 549 SG 550


>gi|62319217|dbj|BAD94413.1| spindly [Arabidopsis thaliana]
 gi|62319734|dbj|BAD95289.1| spindly [Arabidopsis thaliana]
          Length = 535

 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 62/151 (41%), Gaps = 30/151 (19%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           + AIV +  A +       A  ++ V+ K +D +D+AVE  +    +    AQ SL+N+ 
Sbjct: 312 DMAIVFYELAFHFNPHCAEACNNLGVLYKDRDNLDKAVECYQMALSIKPNFAQ-SLNNLG 370

Query: 96  IDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
           + +Y   G++D    ++ K  L    + EAFN                           N
Sbjct: 371 V-VYTVQGKMDAAASMIEKAILANPTYAEAFN---------------------------N 402

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           LG  Y   GN T A   Y +   IDPD+  A
Sbjct: 403 LGVLYRDAGNITMAIDAYEECLKIDPDSRNA 433


>gi|329895776|ref|ZP_08271152.1| TPR repeat-containing protein [gamma proteobacterium IMCC3088]
 gi|328922138|gb|EGG29495.1| TPR repeat-containing protein [gamma proteobacterium IMCC3088]
          Length = 807

 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 67/150 (44%), Gaps = 28/150 (18%)

Query: 98  LYKKCGRLDEQIELLKQKLRM------IYHGEAFNGKPTKTARSHGKKFQ--VTVKQETS 149
           L +K GRL E +++ ++ + +      +Y+  A N    K        F+  +++K E S
Sbjct: 155 LVQKSGRLSESLDIHRKAIDLAPEDADVYYNLAVNLADLKRLDDAVSMFKHALSLKPEFS 214

Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN----LSHCLIK----------- 194
           + L NLG    Q G    AE+ + KA  I P+  K  N      H L K           
Sbjct: 215 QALTNLGVVLEQLGEVEDAEICFIKAIRISPNEAKIYNNLGIAQHSLGKLLASESNYRIA 274

Query: 195 ---QARYTEARSVLEDVL--LGKLSGSTET 219
              + ++ EA + L +VL  LGKL+ + E+
Sbjct: 275 LTLEPQFAEANTNLGNVLRRLGKLTEAAES 304


>gi|385203261|ref|ZP_10030131.1| Flp pilus assembly protein TadD [Burkholderia sp. Ch1-1]
 gi|385183152|gb|EIF32426.1| Flp pilus assembly protein TadD [Burkholderia sp. Ch1-1]
          Length = 596

 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIKQARYTEARSVLEDVL 209
           NLG   ++ G +  A+ V+R+   ++P+  +ACN L+  L+K  RY EA S +   L
Sbjct: 174 NLGMVLLKTGRHQEAQSVFRRVLELEPEHCEACNGLAVALMKDYRYEEAESFVRRAL 230


>gi|302911406|ref|XP_003050485.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256731422|gb|EEU44772.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 616

 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 38/87 (43%), Gaps = 6/87 (6%)

Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
           +A N    ++A   G K       +T      L   +   GNYT A+       LI    
Sbjct: 11  DALNKAQYESAIFIGDKLLALTNDDTDAFW--LAQVHFATGNYTRAQAFLSSQDLISR-- 66

Query: 183 NKACNL--SHCLIKQARYTEARSVLED 207
           N +C     HCLIKQ+R+ EA +VL D
Sbjct: 67  NPSCRYLAGHCLIKQSRFDEALAVLGD 93


>gi|408418139|ref|YP_006759553.1| tetratricopeptide repeat protein, presursor [Desulfobacula
           toluolica Tol2]
 gi|405105352|emb|CCK78849.1| tetratricopeptide repeat protein, presursor [Desulfobacula
           toluolica Tol2]
          Length = 762

 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 79/183 (43%), Gaps = 10/183 (5%)

Query: 12  VHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDE 71
           + K P    P +   +    +K PE A + F KAI+      +A   +A +L  Q R  E
Sbjct: 228 ISKKPGTAEPLILLANFYAGQKKPEKAQISFLKAIDTDPENLNAYMLLARLLNSQKRTAE 287

Query: 72  AVEAIKSFRHLCSKQAQESLDNVLIDLY---KKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
           A + I+  + L  +    S+ N   D +   K   + +  ++ + +K       +   GK
Sbjct: 288 AEKFIQ--KALTIEPDNFSVQNAYADFHFSNKNILKAETLVDEILEKRPNYLPAKMLKGK 345

Query: 129 PTKTARSHGKK---FQVTVKQETSRILGN--LGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
              T + H K    FQ  VK+E    + N  LG ++ +K ++  A++   KA   +P+  
Sbjct: 346 ILATRKDHDKAIEIFQDLVKEEPESAMFNFLLGSSFFEKKDFKQAKIFLSKALEKNPNLF 405

Query: 184 KAC 186
           KA 
Sbjct: 406 KAL 408


>gi|428173589|gb|EKX42490.1| hypothetical protein GUITHDRAFT_88045 [Guillardia theta CCMP2712]
          Length = 315

 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 48/98 (48%), Gaps = 8/98 (8%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVLEDVL---- 209
           LG  Y+++  +  AE  +R+A  I+P ++   C L   L+  + Y +A +VL   +    
Sbjct: 163 LGTIYLKQEKFQLAEYHFRRALEINPRNSVLHCYLGMALLSSSCYDDAIAVLNRAIKMDP 222

Query: 210 ---LGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
              L KL  +     +NR +E L+EL   Q I P  +T
Sbjct: 223 NNPLAKLRKAIALSQLNRNEEALEELISLQQIAPREST 260


>gi|322700030|gb|EFY91787.1| putative anaphase control protein cut9 [Metarhizium acridum CQMa
           102]
          Length = 613

 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 27/53 (50%), Gaps = 4/53 (7%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVL 205
           L   +   GNYT A+    K  LI    N +C     HCLIKQ+RY EA  VL
Sbjct: 41  LAQVHFATGNYTRAQAFLSKQDLISR--NPSCRYLAGHCLIKQSRYEEALVVL 91


>gi|421504252|ref|ZP_15951195.1| hypothetical protein A471_13280 [Pseudomonas mendocina DLHK]
 gi|400345022|gb|EJO93389.1| hypothetical protein A471_13280 [Pseudomonas mendocina DLHK]
          Length = 575

 Score = 38.9 bits (89), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 23/59 (38%), Positives = 29/59 (49%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
           IL +LGW   + GN   AE + R+A    PD   A +L   L  Q +  EAR V  D L
Sbjct: 495 ILDSLGWVNYRLGNLEEAERLLRQALEKFPDHEVAAHLGEVLWAQGKQREARRVWRDAL 553


>gi|421486676|ref|ZP_15934212.1| tetratricopeptide repeat family protein 11 [Achromobacter
           piechaudii HLE]
 gi|400194981|gb|EJO27981.1| tetratricopeptide repeat family protein 11 [Achromobacter
           piechaudii HLE]
          Length = 628

 Score = 38.9 bits (89), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 41/176 (23%), Positives = 76/176 (43%), Gaps = 13/176 (7%)

Query: 60  AVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM- 118
           A +L+  DR+ +AV  +++         +   +  L  LY++  RL +   +L+Q + + 
Sbjct: 452 AQILRDADRIQQAVSTLEAADQALPDTVEIKYE--LAMLYERQNRLADLERMLRQVIALD 509

Query: 119 IYHGEAFNGKPTKTARSHGKKFQ-----VTVKQETSR----ILGNLGWAYMQKGNYTSAE 169
             H  A+N      A  H ++       +T   E S     IL ++GW   + G   SA 
Sbjct: 510 PDHAHAYNALGYTLA-DHNQRLPEALDLITQALELSPNDPFILDSMGWVKFRMGESESAA 568

Query: 170 VVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRV 225
              R+A  + P+A+ A +L+  L  Q +  +A  +L   L+      T    + R+
Sbjct: 569 EYLRRAYSVRPEADIAAHLAEVLWSQGKRDQATELLRAALMKDPKNKTVQDVVKRL 624


>gi|322712900|gb|EFZ04473.1| anaphase control protein cut9 [Metarhizium anisopliae ARSEF 23]
          Length = 613

 Score = 38.9 bits (89), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 27/53 (50%), Gaps = 4/53 (7%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVL 205
           L   +   GNYT A+    K  LI    N +C     HCLIKQ+RY EA  VL
Sbjct: 41  LAQVHFATGNYTRAQAFLSKQDLISR--NPSCRYLAGHCLIKQSRYEEALVVL 91


>gi|357441257|ref|XP_003590906.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
 gi|355479954|gb|AES61157.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
          Length = 744

 Score = 38.9 bits (89), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 39/156 (25%), Positives = 62/156 (39%), Gaps = 10/156 (6%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P+ AI  +  A+        A  ++A +  +Q ++D A+   K     C  +  E+ +N 
Sbjct: 40  PQEAIACYQHALQTRPNYGMAYGNLASIHYEQGQLDMAILHYKQ-AIACDPRFLEAYNN- 97

Query: 95  LIDLYKKCGRLDEQIELLKQKLRM--------IYHGEAFNGKPTKTARSHGKKFQVTVKQ 146
           L +  K  GR++E I+   Q L +           G  +       A +   K  + V  
Sbjct: 98  LGNALKDVGRVEEAIQCYNQCLSLQPNHPQALTNLGNIYMEWNMVAAAASYYKATLNVTT 157

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
             S    NL   Y Q+GNY  A   Y +   IDP A
Sbjct: 158 GLSAPYNNLAIIYKQQGNYADAISCYNEVLRIDPLA 193


>gi|344257396|gb|EGW13500.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit [Cricetulus griseus]
          Length = 1444

 Score = 38.9 bits (89), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 43/162 (26%), Positives = 64/162 (39%), Gaps = 37/162 (22%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E A+ L+ KA+       +A  ++A VL+QQ ++ EA+   K    +  K A        
Sbjct: 288 EEAVRLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPKFA-------- 339

Query: 96  IDLYKKCGRLDEQIELLKQKL----RMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI 151
            D Y   G   ++++ ++  L    R I    AF       A +H               
Sbjct: 340 -DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAF-------ADAHS-------------- 377

Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCL 192
             NL   +   GN   A   YR A  + PD   A CNL+HCL
Sbjct: 378 --NLASIHKDSGNIPEAIASYRTALKLKPDFPDAYCNLAHCL 417


>gi|186681150|ref|YP_001864346.1| hypothetical protein Npun_R0649 [Nostoc punctiforme PCC 73102]
 gi|186463602|gb|ACC79403.1| TPR repeat-containing protein [Nostoc punctiforme PCC 73102]
          Length = 411

 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 34/147 (23%), Positives = 64/147 (43%), Gaps = 12/147 (8%)

Query: 74  EAIKSFRHLCSKQAQESLDNV-LIDLYKKCGRLDEQIELLKQKLRMIYHGEAF------- 125
           EA   +R L  + ++ S   + L D+    G++ E I   +Q L++      +       
Sbjct: 45  EAANIWRSLIERDSKNSYAYIKLADILSSQGKIAETIAAYRQGLQLTPDAAIYLKLGNFL 104

Query: 126 --NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
              G+  +   +  +  ++  K +T+ I+  L    +  GN   A V YR+A  I+PD +
Sbjct: 105 VEKGRTAEAIAAFRQAVKLDAKSDTASII--LAMNLIAMGNPEEAVVAYRQAIKIEPDDD 162

Query: 184 KACNLSHCLIKQARYTEARSVLEDVLL 210
              NL+  L K  +  EA +   + L+
Sbjct: 163 NYNNLADTLFKIGKREEAIAAYREALI 189


>gi|421617611|ref|ZP_16058598.1| TPR domain-containing protein [Pseudomonas stutzeri KOS6]
 gi|409780391|gb|EKN60022.1| TPR domain-containing protein [Pseudomonas stutzeri KOS6]
          Length = 575

 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 28/54 (51%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSV 204
           IL +LGW   + GN + AE + RKA    PD   A +L   L      +EAR+V
Sbjct: 495 ILDSLGWVNFRLGNLSEAEALLRKALQRFPDHEVAAHLGEVLWTLGERSEARAV 548


>gi|424914183|ref|ZP_18337547.1| Prophage tail length tape measure protein [Rhizobium leguminosarum
           bv. trifolii WSM597]
 gi|392850359|gb|EJB02880.1| Prophage tail length tape measure protein [Rhizobium leguminosarum
           bv. trifolii WSM597]
          Length = 1528

 Score = 38.5 bits (88), Expect = 3.1,   Method: Composition-based stats.
 Identities = 33/85 (38%), Positives = 45/85 (52%), Gaps = 7/85 (8%)

Query: 138 KKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQA 196
           K F VT+ Q+T++    L  A M     ++AE + R+  LID   A  A NLS     Q 
Sbjct: 570 KNFGVTIGQDTAKAGETL--AQMFADPASAAEALSRQYGLIDAKTAEYATNLS----GQN 623

Query: 197 RYTEARSVLEDVLLGKLSGSTETKT 221
           R TEA+SVL D L  +L+ + E  T
Sbjct: 624 RLTEAQSVLLDALPNRLAKAAEATT 648


>gi|334121332|ref|ZP_08495403.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
           vaginatus FGP-2]
 gi|333455148|gb|EGK83806.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
           vaginatus FGP-2]
          Length = 1075

 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 51/185 (27%), Positives = 75/185 (40%), Gaps = 19/185 (10%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQ--AQESLDNVL 95
           AI  F KA+    +   A   + + L    R  EA+ A +    +  K   A + L N L
Sbjct: 269 AIAAFEKALEIDPKEHIAWHGLGITLNALGRNSEAIAAFEKALEIDPKAHIAWKGLGNAL 328

Query: 96  IDLYKKCGRLDEQIELLKQKLRM---IYH-----GEAFN--GKPTKTARSHGKKFQVTVK 145
             L    GR  E I   K+ L +    +H     G   N  G+ ++   +  K  ++  K
Sbjct: 329 NAL----GRNSEAIAAFKKALEIDPKFHHAWNGLGAPLNDLGRYSEAIAAFEKALEIDPK 384

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIKQARYTEARSV 204
              +     LG      G Y+ A  VY KA  IDP  + A N L + L +  RY+EA + 
Sbjct: 385 FHFA--WHGLGNVLNALGRYSEAIAVYEKALEIDPKFHFAWNGLGNALGELGRYSEAIAA 442

Query: 205 LEDVL 209
            E  L
Sbjct: 443 YEKAL 447


>gi|297735486|emb|CBI17926.3| unnamed protein product [Vitis vinifera]
          Length = 602

 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 41/91 (45%), Gaps = 18/91 (19%)

Query: 90  SLDNVLIDLYKKCGRLDEQIELLKQKLR-------MIYHGEAFNGKPTKTARSHGKKFQV 142
           SLD+ LID+Y KCG +D+ +++ ++  R        I  G A +G+  K      K  + 
Sbjct: 360 SLDSSLIDMYSKCGDIDKAVQIFEESTRRDLFTWTSIVCGLAMHGRGEKALHYFSKMKEA 419

Query: 143 TVKQETSRILGNL-----------GWAYMQK 162
            V+ +   ++G L           GW Y Q 
Sbjct: 420 QVQPDDVTMVGVLSACAHAGLLDQGWWYFQS 450


>gi|357441255|ref|XP_003590905.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
 gi|355479953|gb|AES61156.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
          Length = 986

 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 39/156 (25%), Positives = 62/156 (39%), Gaps = 10/156 (6%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P+ AI  +  A+        A  ++A +  +Q ++D A+   K     C  +  E+ +N 
Sbjct: 282 PQEAIACYQHALQTRPNYGMAYGNLASIHYEQGQLDMAILHYKQ-AIACDPRFLEAYNN- 339

Query: 95  LIDLYKKCGRLDEQIELLKQKLRM--------IYHGEAFNGKPTKTARSHGKKFQVTVKQ 146
           L +  K  GR++E I+   Q L +           G  +       A +   K  + V  
Sbjct: 340 LGNALKDVGRVEEAIQCYNQCLSLQPNHPQALTNLGNIYMEWNMVAAAASYYKATLNVTT 399

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
             S    NL   Y Q+GNY  A   Y +   IDP A
Sbjct: 400 GLSAPYNNLAIIYKQQGNYADAISCYNEVLRIDPLA 435


>gi|399519468|ref|ZP_10760263.1| TPR repeat-containing protein [Pseudomonas pseudoalcaligenes CECT
           5344]
 gi|399112564|emb|CCH36821.1| TPR repeat-containing protein [Pseudomonas pseudoalcaligenes CECT
           5344]
          Length = 575

 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 30/59 (50%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
           IL +LGW   ++GN   AE + R+A    PD   A +L   L  Q +  +AR V  D L
Sbjct: 495 ILDSLGWVNYRQGNLDEAERLLRQALEKFPDHEVAAHLGEVLWAQGKQRDARRVWRDAL 553


>gi|425465802|ref|ZP_18845109.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
 gi|389831882|emb|CCI24962.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
          Length = 830

 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 56/214 (26%), Positives = 92/214 (42%), Gaps = 47/214 (21%)

Query: 52  VDSALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
           V ++L ++A++ + Q R  EA    ++A+  ++ L          SL+N L +LY+  GR
Sbjct: 609 VANSLNNLALLYQSQGRYTEAEPLYLQALDLYKRLLGDNHPLVAASLNN-LAELYRSQGR 667

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             E   L         + EA + +      +H          + +  L NL   Y  +G 
Sbjct: 668 YTEAEPL---------YLEALDLRKQLLGDNH---------PDVATSLNNLALLYKSQGR 709

Query: 165 YTSAEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSV------LEDVL 209
           YT AE +Y +A     +L+  +    A    NL++    Q RYTEA  +      L   L
Sbjct: 710 YTEAEPLYLEALDLYKRLLGDNHPHVATSLNNLAYLYESQGRYTEAEPLYLEALDLRKRL 769

Query: 210 LGKLSGSTETKTINRVKELLQELEP-----WQSI 238
           LG    +T+T   N ++ L Q+ +P     WQ I
Sbjct: 770 LGDNHPNTKT-CRNNLQRLRQQQKPTSLTWWQWI 802


>gi|147771783|emb|CAN60259.1| hypothetical protein VITISV_007741 [Vitis vinifera]
          Length = 602

 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 41/91 (45%), Gaps = 18/91 (19%)

Query: 90  SLDNVLIDLYKKCGRLDEQIELLKQKLR-------MIYHGEAFNGKPTKTARSHGKKFQV 142
           SLD+ LID+Y KCG +D+ +++ ++  R        I  G A +G+  K      K  + 
Sbjct: 360 SLDSSLIDMYSKCGDIDKAVQIFEESTRRDLFTWTSIVCGLAMHGRGEKALHYFSKMKEA 419

Query: 143 TVKQETSRILGNL-----------GWAYMQK 162
            V+ +   ++G L           GW Y Q 
Sbjct: 420 QVQPDDVTMVGVLSACAHAGLLDQGWWYFQS 450


>gi|346325254|gb|EGX94851.1| anaphase-promoting complex subunit Cut9 [Cordyceps militaris CM01]
          Length = 626

 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 28/53 (52%), Gaps = 4/53 (7%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVL 205
           L   +   GNYT A     K  LI+   N +C     HCLIKQ+R+ EA +VL
Sbjct: 46  LAQVHFATGNYTRAHAFLAKQDLINR--NPSCRYLAGHCLIKQSRFEEALAVL 96


>gi|229593982|ref|XP_001025913.2| TPR Domain containing protein [Tetrahymena thermophila]
 gi|225567182|gb|EAS05668.2| TPR Domain containing protein [Tetrahymena thermophila SB210]
          Length = 804

 Score = 38.5 bits (88), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 36/156 (23%), Positives = 69/156 (44%), Gaps = 16/156 (10%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK-----------SFRHLCS 84
           + AI  + K +    + DS   ++ +  K +  +DEA++A +            F +L  
Sbjct: 418 DEAIKSYKKCLEINPKEDSCYYNLGIAYKDKGMLDEAIKAYQKCLEINPKKEICFYNLGI 477

Query: 85  KQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTV 144
               + L +  I  Y+KC  ++ + +     L + Y  +   G   +  +S+ K  ++  
Sbjct: 478 AYKAKGLIDEAIQSYQKCLEINPEKDTCLHNLGIAYKAK---GLLDEAIKSYQKCIEINP 534

Query: 145 KQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
           K++   +  NLG AYM+KG    A   Y+K   I+P
Sbjct: 535 KKDIYYM--NLGLAYMEKGMLNEAIKQYQKCIEINP 568


>gi|322421231|ref|YP_004200454.1| hypothetical protein GM18_3751 [Geobacter sp. M18]
 gi|320127618|gb|ADW15178.1| Tetratricopeptide repeat [Geobacter sp. M18]
          Length = 636

 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 47/200 (23%), Positives = 83/200 (41%), Gaps = 30/200 (15%)

Query: 9   YHVVHKLPPGDSPY-VRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
           Y  + K+ P D+   ++     +  K+ ++AI  +  AI        A +++A +L++++
Sbjct: 394 YQELAKMNPSDAATQLKLARALVNTKELDSAITAYQGAIKLDAESPEAHRELANLLRKRN 453

Query: 68  RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
            +DEA    +    L  K+  +     L  +Y K    D    LLK+ + +         
Sbjct: 454 MMDEAANQYQEVLRL--KKDDQDARTALTAIYVKNKNYDSLANLLKEGVELA-------- 503

Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
            P+  A +H K                LG  Y  + NY +A   Y++A  + PD  KA N
Sbjct: 504 -PS-DANAHYK----------------LGLVYEFQKNYPAATAEYKEAVNLKPDHAKALN 545

Query: 188 -LSHCLIKQARYTEARSVLE 206
            +    +K  R  EA+  LE
Sbjct: 546 AMGRVQMKDGRIAEAKESLE 565


>gi|432879797|ref|XP_004073552.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like
           [Oryzias latipes]
          Length = 1016

 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI        A  ++A  LK++  V EA E   +   LC   A +SL+N L +
Sbjct: 246 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 303

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E ++L ++ L                          ++++ EA    PT  
Sbjct: 304 IKREQGNIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 363

Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
               + G   +               + +    +    NL   +   GN   A   YR A
Sbjct: 364 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 423

Query: 176 QLIDPDANKA-CNLSHCL 192
             + PD   A CNL+HCL
Sbjct: 424 LKLKPDFPDAYCNLAHCL 441


>gi|66347871|ref|NP_001018116.1| O-linked N-acetylglucosamine transferase isoform 3 [Danio rerio]
 gi|56967376|gb|AAW31872.1| O-GlcNAc transferase variant 3 [Danio rerio]
          Length = 1046

 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI        A  ++A  LK++  V EA E   +   LC   A +SL+N L +
Sbjct: 278 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 335

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E ++L ++ L                          ++++ EA    PT  
Sbjct: 336 IKREQGNIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 395

Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
               + G   +               + +    +    NL   +   GN   A   YR A
Sbjct: 396 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 455

Query: 176 QLIDPDANKA-CNLSHCL 192
             + PD   A CNL+HCL
Sbjct: 456 LKLKPDFPDAYCNLAHCL 473


>gi|351707941|gb|EHB10860.1| Kinesin light chain 4 [Heterocephalus glaber]
          Length = 619

 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 37/72 (51%), Gaps = 9/72 (12%)

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI-----DPD----ANKACNLSHCLIKQAR 197
           + ++ L NL      +G Y + E  Y++A  I      PD    A    NL+ C +KQ +
Sbjct: 335 DVAKQLNNLALLCQNQGKYEAVERYYQRALAIYEGQLGPDNPNVARTKNNLASCYLKQGK 394

Query: 198 YTEARSVLEDVL 209
           YTEA ++ +D+L
Sbjct: 395 YTEAETLYKDIL 406


>gi|288819196|ref|YP_003433544.1| hypothetical protein HTH_1899 [Hydrogenobacter thermophilus TK-6]
 gi|384129941|ref|YP_005512554.1| Sporulation domain-containing protein [Hydrogenobacter thermophilus
           TK-6]
 gi|288788596|dbj|BAI70343.1| hypothetical protein HTH_1899 [Hydrogenobacter thermophilus TK-6]
 gi|308752778|gb|ADO46261.1| Sporulation domain protein [Hydrogenobacter thermophilus TK-6]
          Length = 359

 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 1/61 (1%)

Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDV 208
           ++   LG AY +   +  AE  ++KA  IDP   +A  NL     K   YT+A+++LED 
Sbjct: 61  KVWNALGLAYTEAKEFQKAESSFQKALEIDPAYTEAKMNLGILYYKAKDYTKAKNILEDA 120

Query: 209 L 209
           L
Sbjct: 121 L 121


>gi|358400335|gb|EHK49666.1| hypothetical protein TRIATDRAFT_280830 [Trichoderma atroviride IMI
           206040]
          Length = 1080

 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 52/127 (40%), Gaps = 25/127 (19%)

Query: 154 NLGWAYMQKGNYTSAEVVYRKA-----QLIDPDANKAC----NLSHCLIKQARYTEARSV 204
           NLG  Y ++  +  AE +Y +A     +++ P+         NL H    QARY EA ++
Sbjct: 860 NLGLLYYEQDRFREAEAMYERALQGQEKVLGPEHPATLITVGNLRHVYASQARYEEAEAM 919

Query: 205 LEDVLLG--KLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDA---FLEGLDD 259
            E  L G  K+ G     T+N V  L            SL T ++  E+A   +   L  
Sbjct: 920 YERALQGFEKVLGPENPATLNTVGHL-----------GSLYTSQARFEEAEAMYKRTLKG 968

Query: 260 LMNQWTP 266
               W P
Sbjct: 969 FEKAWGP 975


>gi|291613082|ref|YP_003523239.1| hypothetical protein Slit_0612 [Sideroxydans lithotrophicus ES-1]
 gi|291583194|gb|ADE10852.1| TPR repeat-containing protein [Sideroxydans lithotrophicus ES-1]
          Length = 834

 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 43/162 (26%), Positives = 67/162 (41%), Gaps = 15/162 (9%)

Query: 59  MAVVLKQQDRVDEAVEAIKSFRHLCSK--QAQESLDNVLIDLYKKCGRLDEQIELLKQKL 116
           + VVL+   R +EA++ ++    L  K  QA  +L N L  L    GRLDE     ++ L
Sbjct: 188 LGVVLQHLGRNEEALQPMQRAVELAPKDAQAHSNLGNTLSYL----GRLDEAETSFRRAL 243

Query: 117 RMIYHGEAFNGKPTKTARSHGK--------KFQVTVKQETSRILGNLGWAYMQKGNYTSA 168
           ++       +     T    G+        +  + +K   +    NLG     +G    A
Sbjct: 244 KINKDFAEAHLNLGATLHDLGRFGEAEVSYRCAIQLKPGLAEAHYNLGNTLKSQGKLEEA 303

Query: 169 EVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
              YRKA  I P     + NL   L  Q +  EA ++L +VL
Sbjct: 304 VASYRKALQIAPGLVGASSNLGAALQAQGKLAEAETILRNVL 345


>gi|440490240|gb|ELQ69816.1| anaphase-promoting complex subunit cut9 [Magnaporthe oryzae P131]
          Length = 664

 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 4/55 (7%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVLED 207
           L   +   G+YT A+ +  K+ L+    N +C    +HCLIKQ +Y EA +VL D
Sbjct: 41  LAQVHFASGSYTRAQALLSKSDLVSR--NPSCRYLSAHCLIKQGQYDEALNVLGD 93


>gi|342885471|gb|EGU85469.1| hypothetical protein FOXB_03953 [Fusarium oxysporum Fo5176]
          Length = 616

 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVLED 207
           L   +   GNYT A+       LI    N +C     HCLIKQ+R+ EA S+L D
Sbjct: 41  LAQVHFATGNYTRAQAFLSSQDLIAR--NPSCRYLAGHCLIKQSRFDEALSILGD 93


>gi|389627366|ref|XP_003711336.1| anaphase-promoting complex subunit cut9 [Magnaporthe oryzae 70-15]
 gi|351643668|gb|EHA51529.1| anaphase-promoting complex subunit cut9 [Magnaporthe oryzae 70-15]
 gi|440469019|gb|ELQ38146.1| anaphase-promoting complex subunit cut9 [Magnaporthe oryzae Y34]
          Length = 664

 Score = 38.5 bits (88), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 4/55 (7%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVLED 207
           L   +   G+YT A+ +  K+ L+    N +C    +HCLIKQ +Y EA +VL D
Sbjct: 41  LAQVHFASGSYTRAQALLSKSDLVS--RNPSCRYLSAHCLIKQGQYDEALNVLGD 93


>gi|432328845|ref|YP_007246989.1| hypothetical protein AciM339_0950 [Aciduliprofundum sp. MAR08-339]
 gi|432135554|gb|AGB04823.1| hypothetical protein AciM339_0950 [Aciduliprofundum sp. MAR08-339]
          Length = 377

 Score = 38.5 bits (88), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 43/95 (45%), Gaps = 11/95 (11%)

Query: 121 HGEAFNGKPTKTARSHGKKFQVTVK--------QETSRILGNLGWAYMQKGNYTSAEVVY 172
           +   FNG     A  H K+F  + K        +E+     NLG  Y +KG+   A  VY
Sbjct: 261 YSSVFNG--MGVALMHMKRFDESEKYLREALRIRESDEYRTNLGNLYYRKGDINKAMEVY 318

Query: 173 RKAQLIDPDANKAC-NLSHCLIKQARYTEARSVLE 206
           +     +PD   A  NL  CL+++ +  EAR  LE
Sbjct: 319 KSVLKRNPDCYLALNNLGRCLMRKGKIDEARRYLE 353


>gi|190340092|gb|AAI63923.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase) [Danio rerio]
          Length = 1046

 Score = 38.5 bits (88), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI        A  ++A  LK++  V EA E   +   LC   A +SL+N L +
Sbjct: 278 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 335

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E ++L ++ L                          ++++ EA    PT  
Sbjct: 336 IKREQGNIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 395

Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
               + G   +               + +    +    NL   +   GN   A   YR A
Sbjct: 396 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 455

Query: 176 QLIDPDANKA-CNLSHCL 192
             + PD   A CNL+HCL
Sbjct: 456 LKLKPDFPDAYCNLAHCL 473


>gi|62821820|ref|NP_001017359.1| O-linked N-acetylglucosamine transferase isoform 1 [Danio rerio]
 gi|56967372|gb|AAW31870.1| O-GlcNAc transferase variant 1 [Danio rerio]
          Length = 1062

 Score = 38.5 bits (88), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI        A  ++A  LK++  V EA E   +   LC   A +SL+N L +
Sbjct: 278 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 335

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E ++L ++ L                          ++++ EA    PT  
Sbjct: 336 IKREQGNIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 395

Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
               + G   +               + +    +    NL   +   GN   A   YR A
Sbjct: 396 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 455

Query: 176 QLIDPDANKA-CNLSHCL 192
             + PD   A CNL+HCL
Sbjct: 456 LKLKPDFPDAYCNLAHCL 473


>gi|209525653|ref|ZP_03274190.1| sulfotransferase [Arthrospira maxima CS-328]
 gi|209493822|gb|EDZ94140.1| sulfotransferase [Arthrospira maxima CS-328]
          Length = 729

 Score = 38.5 bits (88), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 43/183 (23%), Positives = 74/183 (40%), Gaps = 17/183 (9%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK-----------SFRHLCS 84
           E AI  + KAI        +   +AV+L +  R+DEA++  +           S  HL  
Sbjct: 162 EQAITAYQKAIELKPDFSWSYHHLAVILTRLGRIDEAIDNYRQEIELNPDFYWSHFHLGD 221

Query: 85  KQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKF---- 140
             +Q++     I  Y+K   ++ Q     Q+L  I   +   G+       + +      
Sbjct: 222 LLSQQNQPEEAIAAYRKAIAINPQQPEAHQRLTEILSRDQKTGEDALLGGRYEEAIAIYR 281

Query: 141 -QVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARY 198
             V  K + S     LG A +    ++ A  V+ +A  ++PD   +  NL +CL KQ   
Sbjct: 282 DMVAAKPDYSWGYYGLGLALLNLRQWSEAIEVFNQAINLNPDCFWSYNNLGYCLFKQGEI 341

Query: 199 TEA 201
            +A
Sbjct: 342 PQA 344


>gi|449482042|ref|XP_004175980.1| PREDICTED: transmembrane and TPR repeat-containing protein 1
           [Taeniopygia guttata]
          Length = 837

 Score = 38.1 bits (87), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 46/184 (25%), Positives = 75/184 (40%), Gaps = 14/184 (7%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLC--SKQAQESLDN 93
           E A++L   +I  G     A   +A +L +Q+R+ EA E  K+    C  S     +   
Sbjct: 520 EEAVILLRDSIKYGPEFADAYSSLASLLAEQERLKEAEEVYKAGIENCPESSDLHNNYGV 579

Query: 94  VLIDLYKKCGRLDEQIELLKQKLRM--IYHGEAFNGKPTKTARSHGKKFQVTVKQ--ETS 149
            L+D     G  +  +   +Q + +   +H    N      +    K+ +V  K+  + S
Sbjct: 580 FLVD----TGSPERAMSHYRQAILLSPTHHVAMVNLGRLHRSLGQNKEAEVWYKRALKVS 635

Query: 150 R---ILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVL 205
           R   IL  LG  Y   G Y  A  VYR+A  + P + +    L+  L    R  EA  + 
Sbjct: 636 RKAEILSPLGALYYNTGRYEEALQVYREAAALQPSNKDIRLALAQVLAMMGRTKEAEKMT 695

Query: 206 EDVL 209
             +L
Sbjct: 696 NHIL 699


>gi|390949283|ref|YP_006413042.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiocystis violascens DSM 198]
 gi|390425852|gb|AFL72917.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiocystis violascens DSM 198]
          Length = 972

 Score = 38.1 bits (87), Expect = 3.9,   Method: Composition-based stats.
 Identities = 54/219 (24%), Positives = 91/219 (41%), Gaps = 23/219 (10%)

Query: 56  LKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQK 115
           L D   +L++  R +EA ++ +  R L    A       L +L ++ G LDE +E     
Sbjct: 328 LSDQGNILRELGRFEEARDSYR--RALAIDPANALAHTNLGNLLRELGHLDEALEHHAAA 385

Query: 116 LRMIY-HGEAFNGK--------PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYT 166
           LR+   + E +             + AR+H  +  +++    ++  GNLG  + +     
Sbjct: 386 LRIAPDYAEGYCNAGLVLQDLGRLEEARAHYSQ-ALSINPNLAQAHGNLGNYWQELKRCH 444

Query: 167 SAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEAR-------SVLEDVLLGKLSGSTE 218
            A   YR+A  I+P  A    N+   L++Q  + EAR       S+  D +   L+  T 
Sbjct: 445 EALECYRRALAIEPRFAEAHNNMGLVLLEQGNFDEARERFEQALSIRPDYVDAYLNLGTC 504

Query: 219 TKTINRVKELLQELEPWQSIPPSLTTKKSSL---EDAFL 254
              + R  + L   +    I P L T K  L    +AFL
Sbjct: 505 HGRVGRYDKALDCFDRALRISPDLATLKPGLVKVHNAFL 543


>gi|22298814|ref|NP_682061.1| hypothetical protein tlr1271 [Thermosynechococcus elongatus BP-1]
 gi|22294995|dbj|BAC08823.1| tlr1271 [Thermosynechococcus elongatus BP-1]
          Length = 624

 Score = 38.1 bits (87), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 36/135 (26%), Positives = 60/135 (44%), Gaps = 15/135 (11%)

Query: 86  QAQESLDNVLIDL---------YKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSH 136
           +A  +L N  +DL         Y+K   L+  +      L +  H +   GK T+    +
Sbjct: 111 EAHSNLGNAYLDLGDTEGAIACYQKALSLNPDLPTTLYNLGLCLHAQ---GKLTEATACY 167

Query: 137 GKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQ 195
            +   +   Q  + +  NLG AY++  NYT+A V ++ A   +P+   A  NL + L  Q
Sbjct: 168 EQSLYLEPGQ--ADVHNNLGSAYLELKNYTAATVHFQAALGANPELLVAHYNLGYALHLQ 225

Query: 196 ARYTEARSVLEDVLL 210
                AR   ++VLL
Sbjct: 226 GNLAAARDRYQEVLL 240


>gi|425461303|ref|ZP_18840783.1| Tetratricopeptide TPR_2 repeat protein [Microcystis aeruginosa PCC
           9808]
 gi|389825870|emb|CCI24060.1| Tetratricopeptide TPR_2 repeat protein [Microcystis aeruginosa PCC
           9808]
          Length = 310

 Score = 38.1 bits (87), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 61/247 (24%), Positives = 96/247 (38%), Gaps = 50/247 (20%)

Query: 3   QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV- 61
           + RD    ++ K P     ++    + L +KD E A+  F     A  R+D  +   A+ 
Sbjct: 34  EARDAFEEILEKDPGSKQAHLGIGSICLKQKDYEGALTHF----GAARRIDPMMAKAALA 89

Query: 62  ---VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM 118
              V  +Q+ +D A+EA K              D V ID     G L     LLKQ    
Sbjct: 90  VGNVYYKQNELDSAIEAFK--------------DAVNIDPTTPAGYLGMGRVLLKQ---- 131

Query: 119 IYHGEAFNGKPTKTARSHGKKFQVTVKQET-SRILGNLGWAYMQKGNYTSAEVVYRKAQL 177
                    K    A+   +K  V   Q T  R+L  +   Y ++GN  +A      A  
Sbjct: 132 ---------KKYPQAKEQVQKALVFNPQLTPGRLL--MSQIYQEQGNTKAAIDEIESALR 180

Query: 178 IDPDANKAC-NLSHCLIKQARYTEARSVLEDVL-----------LGKLSGSTETKTINRV 225
           ++P A  A  +L +  +KQ +Y  AR   ED             +G +    E+  +N  
Sbjct: 181 MNPTAWSAYQSLGNIYLKQKQYNLARKNFEDAQQLNPKIPVVAKMGYIEALIESNALNEA 240

Query: 226 KELLQEL 232
            E+L++L
Sbjct: 241 SEILRDL 247


>gi|154318674|ref|XP_001558655.1| hypothetical protein BC1G_02726 [Botryotinia fuckeliana B05.10]
          Length = 586

 Score = 38.1 bits (87), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 27/51 (52%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVL 205
           L   +   GNYT A+    K  LI  + +      HCLIKQ+R+ EA S+L
Sbjct: 41  LAQVHFSTGNYTRAQSFLTKQDLIARNPSCRYLAGHCLIKQSRFEEALSIL 91


>gi|66347879|ref|NP_001018117.1| O-linked N-acetylglucosamine transferase isoform 4 [Danio rerio]
 gi|56967378|gb|AAW31873.1| O-GlcNAc transferase variant 4 [Danio rerio]
          Length = 1036

 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI        A  ++A  LK++  V EA E   +   LC   A +SL+N L +
Sbjct: 268 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 325

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E ++L ++ L                          ++++ EA    PT  
Sbjct: 326 IKREQGNIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 385

Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
               + G   +               + +    +    NL   +   GN   A   YR A
Sbjct: 386 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 445

Query: 176 QLIDPDANKA-CNLSHCL 192
             + PD   A CNL+HCL
Sbjct: 446 LKLKPDFPDAYCNLAHCL 463


>gi|418295012|ref|ZP_12906887.1| hypothetical protein PstZobell_16958 [Pseudomonas stutzeri ATCC
           14405 = CCUG 16156]
 gi|379066370|gb|EHY79113.1| hypothetical protein PstZobell_16958 [Pseudomonas stutzeri ATCC
           14405 = CCUG 16156]
          Length = 575

 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 35/75 (46%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLL 210
           IL +LGW   + G    AEV+ RKA    PD   A +L   L  +   +EAR+V    L 
Sbjct: 495 ILDSLGWINFRLGKLAEAEVLLRKALQRFPDHEVAAHLGEVLWTKGERSEARAVWSKALK 554

Query: 211 GKLSGSTETKTINRV 225
            +       +TI R+
Sbjct: 555 LQPDSPILLETIKRL 569


>gi|63095209|gb|AAY32335.1| SPY [Phyllostachys praecox]
          Length = 781

 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 39/152 (25%), Positives = 63/152 (41%), Gaps = 32/152 (21%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQA-QESLDNV 94
           E AIV +  A++   R   A   + V+ K +D +D+AVE       L  K +  +SL+N+
Sbjct: 159 EMAIVFYELALHFNPRCAEACNSLGVIYKDRDNLDKAVECY--LLALSIKPSFSQSLNNL 216

Query: 95  LIDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILG 153
            + +Y   G++D    ++ K  +    + EA+N                           
Sbjct: 217 GV-VYTVQGKMDAAASMIEKAIIANPTYAEAYN--------------------------- 248

Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           NLG  Y   G+ T A   Y +   IDPD+  A
Sbjct: 249 NLGVLYRDAGSITLAVQAYERCLQIDPDSRNA 280


>gi|148253060|ref|YP_001237645.1| hypothetical protein BBta_1521 [Bradyrhizobium sp. BTAi1]
 gi|146405233|gb|ABQ33739.1| hypothetical protein BBta_1521 [Bradyrhizobium sp. BTAi1]
          Length = 120

 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 33/65 (50%)

Query: 136 HGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQ 195
           H     + +  ET++IL NLG++YM +G  T+A   + +A   +PD     N  + L   
Sbjct: 50  HAYGQAIKLGGETTQILNNLGYSYMLRGKLTAARTKFMEAYRREPDNPTVLNNINLLNSS 109

Query: 196 ARYTE 200
            R+ E
Sbjct: 110 YRFIE 114


>gi|91202613|emb|CAJ72252.1| Hypothetical Protein kustd1507 [Candidatus Kuenenia
           stuttgartiensis]
          Length = 700

 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 35/69 (50%), Gaps = 1/69 (1%)

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTE 200
           + +     ++  NLG AY++KGN  +A   YRKA  I P   +A  NL     +  R+ E
Sbjct: 536 IQLNMHKPQVFNNLGSAYLKKGNPDAAIAQYRKALYIYPGYAEAHSNLGFIYTETNRFEE 595

Query: 201 ARSVLEDVL 209
           A S L+  L
Sbjct: 596 ALSELKKAL 604


>gi|146283511|ref|YP_001173664.1| hypothetical protein PST_3184 [Pseudomonas stutzeri A1501]
 gi|145571716|gb|ABP80822.1| TPR domain protein [Pseudomonas stutzeri A1501]
          Length = 594

 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 35/75 (46%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLL 210
           IL +LGW   + G    AEV+ RKA    PD   A +L   L  +   +EAR+V    L 
Sbjct: 514 ILDSLGWINFRLGKLAEAEVLLRKALQRFPDHEVAAHLGEVLWTKGERSEARAVWSKALK 573

Query: 211 GKLSGSTETKTINRV 225
            +       +TI R+
Sbjct: 574 LQPDSPILLETIKRL 588


>gi|347830633|emb|CCD46330.1| similar to cell division cycle protein [Botryotinia fuckeliana]
          Length = 612

 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 27/51 (52%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVL 205
           L   +   GNYT A+    K  LI  + +      HCLIKQ+R+ EA S+L
Sbjct: 41  LAQVHFSTGNYTRAQSFLTKQDLIARNPSCRYLAGHCLIKQSRFEEALSIL 91


>gi|262066446|ref|ZP_06026058.1| tetratricopeptide repeat family protein [Fusobacterium
           periodonticum ATCC 33693]
 gi|291379873|gb|EFE87391.1| tetratricopeptide repeat family protein [Fusobacterium
           periodonticum ATCC 33693]
          Length = 349

 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 61/142 (42%), Gaps = 13/142 (9%)

Query: 67  DRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFN 126
           ++V+E +  +    +L S Q +         L K  GR DE I L       +Y    + 
Sbjct: 148 EKVNEKIFIVSELGYLYSVQGKNEEALKYFYLAKDLGRNDEWIYL------HLY----YT 197

Query: 127 GKPTKTARSHGKKF--QVTVKQETSRILGNLGWAYM-QKGNYTSAEVVYRKAQLIDPDAN 183
            K +K      K F  Q  ++ + + +L  LG  YM +  NY +AE VY+K   +  D  
Sbjct: 198 IKASKGEEEALKYFEEQAKIEDKNTVLLTALGNIYMLEPANYDAAEKVYQKVFALSGDGQ 257

Query: 184 KACNLSHCLIKQARYTEARSVL 205
           +  N    L+   +Y EA  VL
Sbjct: 258 QLYNRGRALVGLKKYKEAVEVL 279


>gi|117923677|ref|YP_864294.1| hypothetical protein Mmc1_0362 [Magnetococcus marinus MC-1]
 gi|117607433|gb|ABK42888.1| Tetratricopeptide TPR_2 repeat protein [Magnetococcus marinus MC-1]
          Length = 804

 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 71/158 (44%), Gaps = 11/158 (6%)

Query: 61  VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           ++LKQ  R DEA + ++    L S       +  L  LY + G+L+E  ++L++ + +  
Sbjct: 405 LLLKQPGREDEAEQILREAMALDSNNIPPRTE--LGRLYMQQGKLEEAEQVLREVIGLDS 462

Query: 121 HGEAFNGKPTKTARSHGKKFQV-TVKQETSRILGN-------LGWAYMQKGNYTSAEVVY 172
           +      +  +     GK  +   V QE   +  N       LG  YMQ+G    AE V 
Sbjct: 463 NDIHVRTELGRLYMQQGKLEEAEQVLQEAMALDSNNIPPRTELGRLYMQQGKLKEAEQVL 522

Query: 173 RKAQLIDP-DANKACNLSHCLIKQARYTEARSVLEDVL 209
           ++A  +D  +      L    ++Q +  EAR VLE VL
Sbjct: 523 QEAMALDSNNIPPRTELGRLYMQQGKREEARWVLEQVL 560


>gi|386812651|ref|ZP_10099876.1| peptidase [planctomycete KSU-1]
 gi|386404921|dbj|GAB62757.1| peptidase [planctomycete KSU-1]
          Length = 619

 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 50/192 (26%), Positives = 75/192 (39%), Gaps = 11/192 (5%)

Query: 30  LVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE 89
           L  +D E A+  F KA     R   A   +   + +  R  EA+EA K  R +       
Sbjct: 246 LAAEDYENALRCFEKAGKENPRYSLAYFYIGYCMDKLGRYSEAIEAYK--RAIRINATFL 303

Query: 90  SLDNVLIDLYKKCGRLDEQIELLKQKLRM------IYH--GEAFNGKPTKTARSHGKKFQ 141
            +   L + Y + G   E I + KQ +R+      I++  GE +          H  K  
Sbjct: 304 EVHYNLGEDYDRLGCYGEAIGIYKQIIRIQPKNARIHYKLGEDYRILEHYPEAIHAFKKA 363

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTE 200
           +  K +      +LG      G Y+ A   Y++   I+P   KA   L     KQ  YTE
Sbjct: 364 IDSKPDFVEAYSSLGLVCFNLGYYSEAIEAYQQVITINPHDTKAHIMLGSAYSKQGCYTE 423

Query: 201 ARSVLEDVLLGK 212
           A  V + V+  K
Sbjct: 424 AIDVFKKVIYSK 435


>gi|423067050|ref|ZP_17055840.1| putative sulfotransferase [Arthrospira platensis C1]
 gi|406711336|gb|EKD06537.1| putative sulfotransferase [Arthrospira platensis C1]
          Length = 729

 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 43/183 (23%), Positives = 74/183 (40%), Gaps = 17/183 (9%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK-----------SFRHLCS 84
           E AI  + KAI        +   +AV+L +  R+DEA++  +           S  HL  
Sbjct: 162 EQAITAYQKAIELKPDFSWSYHHLAVILTRLGRIDEAIDNYRQEIELNPDFYWSHFHLGD 221

Query: 85  KQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKF---- 140
             +Q++     I  Y+K   ++ Q     Q+L  I   +   G+       + +      
Sbjct: 222 LLSQQNQPEEAIAAYRKAIAINPQQPEAHQRLTEILSRDQKTGEDALLGGRYEEAIAIYR 281

Query: 141 -QVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARY 198
             V  K + S     LG A +    ++ A  V+ +A  ++PD   +  NL +CL KQ   
Sbjct: 282 DMVAAKPDYSWGYYGLGLALLNLRQWSEAIEVFNQAINLNPDCFWSYNNLGYCLFKQGEI 341

Query: 199 TEA 201
            +A
Sbjct: 342 PQA 344


>gi|357148919|ref|XP_003574938.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY-like
           [Brachypodium distachyon]
          Length = 959

 Score = 38.1 bits (87), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 38/151 (25%), Positives = 63/151 (41%), Gaps = 30/151 (19%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AIV +  A++   R   A  ++ V+ K +D +D AVE  +    +    +Q SL+N+ 
Sbjct: 308 EMAIVSYELALHFNPRCAEACNNLGVIYKDRDNLDRAVECYQMALTIKPNFSQ-SLNNLG 366

Query: 96  IDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
           + +Y   G++D    ++ K  L    + EA+N                           N
Sbjct: 367 V-VYTVQGKMDAAASMIEKAILANPTYAEAYN---------------------------N 398

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           LG  Y   G+ T +   Y +   IDPD+  A
Sbjct: 399 LGVLYRDAGSITLSVQAYERCLQIDPDSRNA 429


>gi|291566679|dbj|BAI88951.1| TPR domain protein [Arthrospira platensis NIES-39]
          Length = 729

 Score = 38.1 bits (87), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 44/187 (23%), Positives = 77/187 (41%), Gaps = 17/187 (9%)

Query: 32  EKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK-----------SFR 80
           +K  E AI  + KAI        +   +AV+L +  RV+EA+ A +           S  
Sbjct: 158 QKQDEQAITAYRKAIELKPDFCWSYHHLAVILTRLGRVEEAIAAYRQEITLNPDFYWSHF 217

Query: 81  HLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKF 140
           HL    +Q++     I+ Y++   ++ Q     Q+L  I       G+       + +  
Sbjct: 218 HLGDLLSQQNQPEEAIEAYRQAIAINPQQPEAHQRLTEILSRHQKTGEDALLTGRYEEAI 277

Query: 141 -----QVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIK 194
                 V  + + S     LG A + +  +  A  V+ +A  I+PD   + N L +CL K
Sbjct: 278 AIYRDMVAARPDYSWGYYGLGLALLNQRQWREAIDVFNQAISINPDCFWSYNHLGYCLFK 337

Query: 195 QARYTEA 201
           Q + + A
Sbjct: 338 QGKISPA 344


>gi|301117834|ref|XP_002906645.1| Mitochondrial Protein Translocase (MPT) Family [Phytophthora
            infestans T30-4]
 gi|262107994|gb|EEY66046.1| Mitochondrial Protein Translocase (MPT) Family [Phytophthora
            infestans T30-4]
          Length = 1104

 Score = 38.1 bits (87), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 30/63 (47%)

Query: 21   PYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFR 80
            P++ A  V +   DP  AI  F +A+    R  SA  D+A VL  Q R DEA E  +   
Sbjct: 1005 PFLNAGRVYVGTNDPMRAIAHFKQALEVDPRCSSAHLDIAQVLFAQGRTDEAFEHFEVAA 1064

Query: 81   HLC 83
              C
Sbjct: 1065 SCC 1067


>gi|66347873|ref|NP_001018115.1| O-linked N-acetylglucosamine transferase isoform 2 [Danio rerio]
 gi|56967374|gb|AAW31871.1| O-GlcNAc transferase variant 2 [Danio rerio]
          Length = 1052

 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI        A  ++A  LK++  V EA E   +   LC   A +SL+N L +
Sbjct: 268 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 325

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E ++L ++ L                          ++++ EA    PT  
Sbjct: 326 IKREQGNIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 385

Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
               + G   +               + +    +    NL   +   GN   A   YR A
Sbjct: 386 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 445

Query: 176 QLIDPDANKA-CNLSHCL 192
             + PD   A CNL+HCL
Sbjct: 446 LKLKPDFPDAYCNLAHCL 463


>gi|328874522|gb|EGG22887.1| hypothetical protein DFA_05017 [Dictyostelium fasciculatum]
          Length = 1297

 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 41/84 (48%), Gaps = 9/84 (10%)

Query: 198 YTEARSVLEDVLLGKLSGS-TETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAFLEG 256
           Y ++ S +E +L   L    ++T TIN + E+L   + W+         K +  D FL G
Sbjct: 537 YMKSISQVEKMLKDSLEADKSQTNTINSITEILDNWDIWRK-------NKENAPD-FLSG 588

Query: 257 LDDLMNQWTPYRSRRLPIFEEISP 280
           L D++ Q  PY     P+FE+  P
Sbjct: 589 LIDIVVQEVPYTEESKPMFEQFIP 612


>gi|291301344|ref|YP_003512622.1| SARP family transcriptional regulator [Stackebrandtia nassauensis
           DSM 44728]
 gi|290570564|gb|ADD43529.1| transcriptional regulator, SARP family [Stackebrandtia nassauensis
           DSM 44728]
          Length = 935

 Score = 38.1 bits (87), Expect = 4.6,   Method: Composition-based stats.
 Identities = 46/175 (26%), Positives = 77/175 (44%), Gaps = 25/175 (14%)

Query: 55  ALKDMAVVLKQQDRVDEAV----EAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIE 110
           AL+D+    +      EA     EA+K  R     +    + N L D++++ G  D  I 
Sbjct: 714 ALRDLGQAQRAMKLRPEATTTLTEALKVCRDNGFHERVPGILNFLADVHRQEGEYDTAIG 773

Query: 111 LLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEV 170
           LL++ L                   H    ++ ++ +   +L +LG+ Y Q G +  A  
Sbjct: 774 LLREGL--------------AACDEHVADERIAIEIKNG-LLTSLGFNYEQTGQFDHAAE 818

Query: 171 VYRKAQLI---DPDANKAC--NLSHCLIKQARYTEARSVLEDVL-LGKLSGSTET 219
            +R+A  I   DP A+     NL+H L +Q   T+A  +LE+ L L   SG++ T
Sbjct: 819 AHREAIAISHGDPPAHAIALGNLAHVLRRQGEATQAIPLLEEALQLLNDSGNSRT 873


>gi|300676810|gb|ADK26686.1| kinesin light chain 4 [Zonotrichia albicollis]
          Length = 622

 Score = 38.1 bits (87), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 50/183 (27%), Positives = 76/183 (41%), Gaps = 33/183 (18%)

Query: 52  VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIEL 111
           V + L  +A+V + Q++  EA   +     L S Q   +L+N+ + LY K G+  E   L
Sbjct: 250 VATMLNILALVYRDQNKYKEAAHLLND--KLLSLQVAATLNNLAV-LYGKRGKYKEAEPL 306

Query: 112 LKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVV 171
            K+ L +               +  GK      KQ     L NL      +G Y   E  
Sbjct: 307 CKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGKYDEVEYY 348

Query: 172 YRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVLLG---KLSGSTET 219
           Y +A  I        DP+  K   NL+ C +KQ +Y +A  + +++L     K  GS E 
Sbjct: 349 YCRALEIYESCLGPDDPNVAKTKNNLASCYLKQGKYKDAEVLYKEILTRAHVKEFGSDEH 408

Query: 220 KTI 222
           K I
Sbjct: 409 KPI 411


>gi|409993097|ref|ZP_11276252.1| sulfotransferase [Arthrospira platensis str. Paraca]
 gi|409936022|gb|EKN77531.1| sulfotransferase [Arthrospira platensis str. Paraca]
          Length = 729

 Score = 38.1 bits (87), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 44/187 (23%), Positives = 77/187 (41%), Gaps = 17/187 (9%)

Query: 32  EKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK-----------SFR 80
           +K  E AI  + KAI        +   +AV+L +  RV+EA+ A +           S  
Sbjct: 158 QKQDEQAITAYRKAIELKPDFCWSYHHLAVILTRLGRVEEAIAAYRQEITLNPDFYWSHF 217

Query: 81  HLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKF 140
           HL    +Q++     I+ Y++   ++ Q     Q+L  I       G+       + +  
Sbjct: 218 HLGDLLSQQNQPEEAIEAYRQAIAINPQQPEAHQRLTEILSRHQKTGEDALLTGRYEEAI 277

Query: 141 -----QVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIK 194
                 V  + + S     LG A + +  +  A  V+ +A  I+PD   + N L +CL K
Sbjct: 278 AIYRDMVAARPDYSWGYYGLGLALLNQRQWREAIDVFNQAISINPDCFWSYNHLGYCLFK 337

Query: 195 QARYTEA 201
           Q + + A
Sbjct: 338 QGKISPA 344


>gi|153004759|ref|YP_001379084.1| hypothetical protein Anae109_1897 [Anaeromyxobacter sp. Fw109-5]
 gi|152028332|gb|ABS26100.1| Tetratricopeptide TPR_2 repeat protein [Anaeromyxobacter sp.
           Fw109-5]
          Length = 268

 Score = 38.1 bits (87), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 47/205 (22%), Positives = 85/205 (41%), Gaps = 21/205 (10%)

Query: 23  VRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHL 82
           VR  +  L   DPE A V F  A+        AL  + +V +++  +D A    +    L
Sbjct: 47  VRRGYAYLEAADPERATVAFSHALEFDPDFPEALNGLGIVARREGDLDTARRRFERAVRL 106

Query: 83  CS--KQAQESLDNVLIDLYKKCG---------RLDEQIELLKQKLR--MIYHGEAFNGKP 129
                +   +L   L+   ++ G         R+D  +   +Q L   ++ HG      P
Sbjct: 107 APDFAEGHANLGETLLAAGREGGGTDALRAALRIDPDLADARQNLARALLRHG---LDSP 163

Query: 130 TKTAR--SHGKKFQVTVKQETSRILGNLGWAYMQ--KGNYTSAEVVYRKAQLIDPDANKA 185
            + AR     +++   ++    R   +   A+M   +G +  AE  YR+A  ++P +++A
Sbjct: 164 DRGARWAEARREYLHLLEAAPDRAAAHHDLAFMDYVEGRFERAEAGYRRAAELEPGSHEA 223

Query: 186 CN-LSHCLIKQARYTEARSVLEDVL 209
            + L   L++ AR  EA    E  L
Sbjct: 224 LHGLCVSLVRLARCDEAVRACERCL 248


>gi|376295888|ref|YP_005167118.1| hypothetical protein DND132_1101 [Desulfovibrio desulfuricans
           ND132]
 gi|323458449|gb|EGB14314.1| Tetratricopeptide TPR_1 repeat-containing protein [Desulfovibrio
           desulfuricans ND132]
          Length = 208

 Score = 37.7 bits (86), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 1/58 (1%)

Query: 153 GNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
           GN+G+ YMQ+G+   A    +KA  +DPD  +A   +S  LI    + EA  +L+ +L
Sbjct: 97  GNIGFVYMQQGDNDKAHKSLKKALKLDPDFAQAQATMSSVLIAMNDFEEADKLLKTLL 154


>gi|348516826|ref|XP_003445938.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           3 [Oreochromis niloticus]
          Length = 1064

 Score = 37.7 bits (86), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI        A  ++A  LK++  V EA E   +   LC   A +SL+N L +
Sbjct: 278 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 335

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E ++L ++ L                          ++++ EA    PT  
Sbjct: 336 IKREQGYIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 395

Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
               + G   +               + +    +    NL   +   GN   A   YR A
Sbjct: 396 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 455

Query: 176 QLIDPDANKA-CNLSHCL 192
             + PD   A CNL+HCL
Sbjct: 456 LKLKPDFPDAYCNLAHCL 473


>gi|428182889|gb|EKX51748.1| hypothetical protein GUITHDRAFT_161407 [Guillardia theta CCMP2712]
          Length = 458

 Score = 37.7 bits (86), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 36/66 (54%), Gaps = 2/66 (3%)

Query: 138 KKFQVTVKQE-TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQ 195
           K F + +K+E T+ +L NL   Y + G++T A  +Y++A  +D  DA    NL   L K 
Sbjct: 262 KYFSMALKEEVTTGVLNNLAIVYQEMGSHTEAVQMYKRALKLDQDDATAHYNLGTALEKA 321

Query: 196 ARYTEA 201
             Y EA
Sbjct: 322 GSYKEA 327


>gi|348516822|ref|XP_003445936.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Oreochromis niloticus]
          Length = 1048

 Score = 37.7 bits (86), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI        A  ++A  LK++  V EA E   +   LC   A +SL+N L +
Sbjct: 278 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 335

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E ++L ++ L                          ++++ EA    PT  
Sbjct: 336 IKREQGYIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 395

Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
               + G   +               + +    +    NL   +   GN   A   YR A
Sbjct: 396 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 455

Query: 176 QLIDPDANKA-CNLSHCL 192
             + PD   A CNL+HCL
Sbjct: 456 LKLKPDFPDAYCNLAHCL 473


>gi|262304971|gb|ACY45078.1| acetylglucosaminyl-transferase [Carcinoscorpius rotundicauda]
          Length = 290

 Score = 37.7 bits (86), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 49/199 (24%), Positives = 79/199 (39%), Gaps = 47/199 (23%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           A+  + +AI        A  ++A  LK++ +V EA E   +  HLC   A +SL+N L +
Sbjct: 5   AVDTYRRAIELQPNFPDAYCNLANALKEKGQVSEAEECYNTALHLCPTHA-DSLNN-LAN 62

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E   L  + L                          ++++ EA    PT  
Sbjct: 63  IKREQGYIEEATRLYLKALEVFPEFAAAHSNLASVLQQQGKLNEALMHYKEAIRISPTFA 122

Query: 131 -------KTARSHGKKFQVTVKQETSRILGNLGWA---------YMQKGNYTSAEVVYRK 174
                   T +  G   Q  ++  T  I  N G+A         +   GN   A   YR 
Sbjct: 123 DAYSNMGNTLKEMG-DIQGALQCYTRAIQINPGFADAHSNLASIHKDSGNIPEAIASYRT 181

Query: 175 AQLIDPDANKA-CNLSHCL 192
           A  + P+   A CNL+HCL
Sbjct: 182 ALKLKPEFPDAYCNLAHCL 200


>gi|348516824|ref|XP_003445937.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Oreochromis niloticus]
          Length = 1038

 Score = 37.7 bits (86), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI        A  ++A  LK++  V EA E   +   LC   A +SL+N L +
Sbjct: 268 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 325

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E ++L ++ L                          ++++ EA    PT  
Sbjct: 326 IKREQGYIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 385

Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
               + G   +               + +    +    NL   +   GN   A   YR A
Sbjct: 386 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 445

Query: 176 QLIDPDANKA-CNLSHCL 192
             + PD   A CNL+HCL
Sbjct: 446 LKLKPDFPDAYCNLAHCL 463


>gi|293607618|ref|ZP_06689952.1| tetratricopeptide TPR_2 [Achromobacter piechaudii ATCC 43553]
 gi|292814051|gb|EFF73198.1| tetratricopeptide TPR_2 [Achromobacter piechaudii ATCC 43553]
          Length = 628

 Score = 37.7 bits (86), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 76/176 (43%), Gaps = 13/176 (7%)

Query: 60  AVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM- 118
           A +L+  DRV +AV  +++         +   +  L  LY++  RL +   +L+Q + + 
Sbjct: 452 AQILRDADRVPQAVSTLEAADQALPDTVEIKYE--LAMLYERQNRLADLERMLRQVIALD 509

Query: 119 IYHGEAFNGKPTKTARSHGKKFQ-----VTVKQETSR----ILGNLGWAYMQKGNYTSAE 169
             H  A+N      A  H ++       +T   E S     IL ++GW   + G   SA 
Sbjct: 510 PDHAHAYNALGYTLA-DHNQRLPEALDLITQALELSPNDPFILDSMGWVKYRMGESESAA 568

Query: 170 VVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRV 225
              R+A  + P+A+ A +L+  L  Q +  +A  +L   L+      T    I R+
Sbjct: 569 QYLRRAYSVRPEADIAAHLAEVLWSQGKRDQAVELLRVALMKDPKNKTVQDVIKRL 624


>gi|415884106|ref|ZP_11546135.1| acetyl-CoA carboxylase carboxyltransferase subunit alpha [Bacillus
           methanolicus MGA3]
 gi|387591901|gb|EIJ84218.1| acetyl-CoA carboxylase carboxyltransferase subunit alpha [Bacillus
           methanolicus MGA3]
          Length = 325

 Score = 37.7 bits (86), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 25/99 (25%), Positives = 45/99 (45%), Gaps = 7/99 (7%)

Query: 25  AKHVQLVEKD------PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS 78
             H+ ++E        PE A  L WK  +   R    +K  A  LK+   +DE +  +K 
Sbjct: 210 GNHIHMLENSTYSVISPEGAAALLWKDASQAKRAAETMKITAPDLKELGIIDEIIPEVKG 269

Query: 79  FRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLR 117
             H   K+  E +D +LI   K+  ++ ++ EL+  + +
Sbjct: 270 GAHKDVKKQSEYIDQILIKSLKELVKMSDE-ELVDHRYK 307


>gi|406934936|gb|EKD69050.1| TPR Domain containing protein [uncultured bacterium]
          Length = 488

 Score = 37.7 bits (86), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 41/188 (21%), Positives = 83/188 (44%), Gaps = 21/188 (11%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA-IKSFRHLCSKQAQESLDNV 94
           +AAI ++ + +  G        ++ V+ ++++ + +A+E  + S++     +  E ++N+
Sbjct: 148 DAAIFMYSRVVKTGKGSAQLYNNLGVLYEKKNELKKALECYLNSYKM--DGKCTEVINNI 205

Query: 95  ------------LIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQV 142
                        ID Y K   ++ + +     L +IY      GK   +  +  K   V
Sbjct: 206 GLVYAKLNNYNKAIDSYLKTLDIEPENKKALNNLGLIY---GLTGKFYDSISTWKKLLNV 262

Query: 143 TVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEA 201
                ++    NLG AY+  GN T+A   +  A  I+P + +A  N     +++ARY EA
Sbjct: 263 DSGDYSTHF--NLGVAYLNVGNKTNARYYFESALKINPRSAEALMNSGSLYVQEARYDEA 320

Query: 202 RSVLEDVL 209
               + V+
Sbjct: 321 VGEFKKVI 328


>gi|157812758|gb|ABV81124.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase [Limulus polyphemus]
          Length = 290

 Score = 37.7 bits (86), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 49/199 (24%), Positives = 79/199 (39%), Gaps = 47/199 (23%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           A+  + +AI        A  ++A  LK++ +V EA E   +  HLC   A +SL+N L +
Sbjct: 5   AVDTYRRAIELQPNFPDAYCNLANALKEKGQVSEAEECYNTALHLCPTHA-DSLNN-LAN 62

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E   L  + L                          ++++ EA    PT  
Sbjct: 63  IKREQGYIEEATRLYLKALEVFPEFAAAHSNLASVLQQQGKLNEALMHYKEAIRISPTFA 122

Query: 131 -------KTARSHGKKFQVTVKQETSRILGNLGWA---------YMQKGNYTSAEVVYRK 174
                   T +  G   Q  ++  T  I  N G+A         +   GN   A   YR 
Sbjct: 123 DAYSNMGNTLKEMG-DIQGALQCYTRAIQINPGFADAHSNLASIHKDSGNIPEAIASYRT 181

Query: 175 AQLIDPDANKA-CNLSHCL 192
           A  + P+   A CNL+HCL
Sbjct: 182 ALKLKPEFPDAYCNLAHCL 200


>gi|124008635|ref|ZP_01693326.1| tetratricopeptide repeat domain protein [Microscilla marina ATCC
           23134]
 gi|123985879|gb|EAY25743.1| tetratricopeptide repeat domain protein [Microscilla marina ATCC
           23134]
          Length = 309

 Score = 37.7 bits (86), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 2/47 (4%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSH-CLIKQ 195
           +L N+GW  + KGN   A+   +KA  + P DA+   NL H C+IKQ
Sbjct: 205 VLMNIGWINLVKGNLKEADTYIKKAHQLAPYDAHTLMNLGHICMIKQ 251


>gi|428319715|ref|YP_007117597.1| glycosyl transferase family 2 [Oscillatoria nigro-viridis PCC 7112]
 gi|428243395|gb|AFZ09181.1| glycosyl transferase family 2 [Oscillatoria nigro-viridis PCC 7112]
          Length = 1533

 Score = 37.7 bits (86), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 45/177 (25%), Positives = 78/177 (44%), Gaps = 17/177 (9%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA-IKSFR---HLCSKQAQESLDN 93
           A+  + KAI        A ++ A +L Q  + +EA EA  ++F      C+ +  E+L  
Sbjct: 361 AVSAYEKAIALKPDFAGAFRNFAKLLTQLGKSEEAAEAWYRAFAIDPKSCTAEEHENLAK 420

Query: 94  VLIDLYKKCGRLDEQIELLKQKLRM------IYH--GEAFNGKPTKTARSHGKKFQVTVK 145
            LI+     G++D+ IE  ++ + +       YH  GE   G+    A        +   
Sbjct: 421 TLIEQ----GKVDKGIECYRRAVELNPNAGAAYHELGEILKGQEQWEAAVDAYTNAIRNN 476

Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEA 201
              S    NL  + ++   +  A   YRKA  ++PD + +  NL+  L+K  R+ EA
Sbjct: 477 PNLSWSHNNLAESLVKLERWEEAVNAYRKAIELNPDFSWSHNNLADVLLKLERWEEA 533


>gi|297738270|emb|CBI27471.3| unnamed protein product [Vitis vinifera]
          Length = 574

 Score = 37.7 bits (86), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 51/101 (50%), Gaps = 9/101 (8%)

Query: 18  GDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK 77
           GD    R    Q+ ++D EA + LF +   AG ++D    ++A+V   Q   D  V    
Sbjct: 167 GDIKSARLLFDQVDQRDTEA-LNLFHRMQTAGIKLD----NVALVSTLQACADLGVLDQG 221

Query: 78  SFRHLCSKQAQESLD----NVLIDLYKKCGRLDEQIELLKQ 114
            + H   K+ +  +D     VLID+Y KCG L+E IE+ ++
Sbjct: 222 KWIHAYIKKHEIEIDPILGCVLIDMYAKCGDLEEAIEVFRK 262


>gi|427415284|ref|ZP_18905469.1| putative transcriptional regulator [Leptolyngbya sp. PCC 7375]
 gi|425756049|gb|EKU96908.1| putative transcriptional regulator [Leptolyngbya sp. PCC 7375]
          Length = 725

 Score = 37.7 bits (86), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 9/72 (12%)

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI------DPDANKACNLSHCLI---KQAR 197
           E +  L NLG  YM +G Y+ AE + ++A ++      D   N A NL++  +   KQ R
Sbjct: 543 EVALSLNNLGMLYMHQGRYSEAEPLLKRALVMSKRFLGDEHPNVATNLNNLALFYSKQER 602

Query: 198 YTEARSVLEDVL 209
           Y+EA  + ++ L
Sbjct: 603 YSEAEPLYQESL 614



 Score = 37.0 bits (84), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 36/73 (49%), Gaps = 9/73 (12%)

Query: 147 ETSRILGNLGWAYMQKGNYTSAE------VVYRKAQLIDPD---ANKACNLSHCLIKQAR 197
           + +R L NLGW Y ++G Y  AE      +V RK  L D     A    NL+    KQ R
Sbjct: 627 DVARSLNNLGWLYGKQGRYREAEPLLKEAIVLRKQLLGDEHPSVARSLTNLAMIYDKQER 686

Query: 198 YTEARSVLEDVLL 210
           Y EA  + ++ ++
Sbjct: 687 YDEAEILYQEAVV 699


>gi|442319830|ref|YP_007359851.1| hypothetical protein MYSTI_02851 [Myxococcus stipitatus DSM 14675]
 gi|441487472|gb|AGC44167.1| hypothetical protein MYSTI_02851 [Myxococcus stipitatus DSM 14675]
          Length = 353

 Score = 37.7 bits (86), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 36/67 (53%), Gaps = 1/67 (1%)

Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEAR 202
             QE  +   NLG  Y ++G Y  A   +R+A  ++PD  +   NL++ L+K  +Y EA+
Sbjct: 94  FNQEHLQAYQNLGSLYNEEGAYGKAHDNFRRALKVNPDNIDTRNNLAYTLMKMGKYEEAK 153

Query: 203 SVLEDVL 209
             L  +L
Sbjct: 154 KELRTIL 160


>gi|348516828|ref|XP_003445939.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           4 [Oreochromis niloticus]
          Length = 1054

 Score = 37.7 bits (86), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI        A  ++A  LK++  V EA E   +   LC   A +SL+N L +
Sbjct: 268 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 325

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E ++L ++ L                          ++++ EA    PT  
Sbjct: 326 IKREQGYIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 385

Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
               + G   +               + +    +    NL   +   GN   A   YR A
Sbjct: 386 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 445

Query: 176 QLIDPDANKA-CNLSHCL 192
             + PD   A CNL+HCL
Sbjct: 446 LKLKPDFPDAYCNLAHCL 463


>gi|91793507|ref|YP_563158.1| TPR repeat-containing protein [Shewanella denitrificans OS217]
 gi|91715509|gb|ABE55435.1| TPR repeat [Shewanella denitrificans OS217]
          Length = 448

 Score = 37.7 bits (86), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 47/97 (48%), Gaps = 11/97 (11%)

Query: 120 YHGEAF----NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
           YHG       +  P    ++  K  Q+ V     +I   LG+ Y+Q  ++ SA V Y++A
Sbjct: 124 YHGLGLVQLQSDNPIAATKNFAKALQLGVN--DPQIYSFLGYGYIQSEHFHSAVVAYQQA 181

Query: 176 QLIDPD---ANKACNLSHCLIKQARYTEARSVLEDVL 209
           +L +PD    N+A   ++ +     +  A S+LE +L
Sbjct: 182 KLFNPDDAQLNQALLYAYNMA--GHHDAALSLLEQML 216


>gi|417411996|gb|JAA52415.1| Putative kinesin light chain, partial [Desmodus rotundus]
          Length = 624

 Score = 37.7 bits (86), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 9/72 (12%)

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI-----DPD----ANKACNLSHCLIKQAR 197
           + ++ L NL      +G Y + E  YR+A  I      PD    A    NL+ C +KQ +
Sbjct: 340 DVAKQLNNLALLCQNQGKYEAVECYYRRALAIYEGQLGPDNPNVARTKNNLASCYLKQGK 399

Query: 198 YTEARSVLEDVL 209
           Y EA ++ +++L
Sbjct: 400 YAEAETLYKEIL 411


>gi|315637376|ref|ZP_07892591.1| conserved hypothetical protein [Arcobacter butzleri JV22]
 gi|315478350|gb|EFU69068.1| conserved hypothetical protein [Arcobacter butzleri JV22]
          Length = 503

 Score = 37.7 bits (86), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 40/175 (22%), Positives = 84/175 (48%), Gaps = 15/175 (8%)

Query: 41  LFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYK 100
           L+ +AI   D+   A  ++AV+L ++  +DEA +  +    +  K +  + +N+ + L K
Sbjct: 292 LYKEAIKIDDKYSKAYNNLAVLLSEKGEMDEAEKLFREAIKIDDKNSN-TYNNLAL-LLK 349

Query: 101 KCGRLDEQIELLKQKLRM------IYHGEAF----NGKPTKTARSHGKKFQVTVKQETSR 150
             G +DE  +L ++ +++       Y+  A      G+  +  + +  K  + +  + S+
Sbjct: 350 NKGEIDEAEKLYREAIKIDDKNSNTYNNLALLLANKGEIDEAEKLY--KEAIKIDDKNSK 407

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKA-QLIDPDANKACNLSHCLIKQARYTEARSV 204
              NL +   +KG    AE +YR+A ++ D D++   NL+  L  +    EA  +
Sbjct: 408 AYNNLAFLLSEKGEIGEAEKLYREAIKIDDKDSDIYYNLALLLKNKGDIDEAEKL 462



 Score = 37.7 bits (86), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 55/208 (26%), Positives = 93/208 (44%), Gaps = 33/208 (15%)

Query: 19  DSPYVRAKH---VQLVEK-DPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV- 73
           D  Y +A +   V L EK + + A  LF +AI   D+  +   ++A++LK +  +DEA  
Sbjct: 300 DDKYSKAYNNLAVLLSEKGEMDEAEKLFREAIKIDDKNSNTYNNLALLLKNKGEIDEAEK 359

Query: 74  ---EAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM------IYHGEA 124
              EAIK        +   + +N+ + L  K G +DE  +L K+ +++       Y+  A
Sbjct: 360 LYREAIK-----IDDKNSNTYNNLALLLANK-GEIDEAEKLYKEAIKIDDKNSKAYNNLA 413

Query: 125 F-------NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA-Q 176
           F        G+  K  R       + +  + S I  NL      KG+   AE +YR+A +
Sbjct: 414 FLLSEKGEIGEAEKLYRE-----AIKIDDKDSDIYYNLALLLKNKGDIDEAEKLYREAIK 468

Query: 177 LIDPDANKACNLSHCLIKQARYTEARSV 204
           + D D +   NL+  L  +    EA  +
Sbjct: 469 IDDKDLDVYNNLAVLLANKGDIDEAEKL 496


>gi|91202600|emb|CAJ72239.1| hypothetical protein kustd1494 [Candidatus Kuenenia
           stuttgartiensis]
          Length = 847

 Score = 37.7 bits (86), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 42/158 (26%), Positives = 68/158 (43%), Gaps = 10/158 (6%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           E AI  + KA+ A      A  ++  V + +++ + AVEA +S   L   QA     N L
Sbjct: 648 EDAIAAYKKAVAADPLNSDAYYNLGNVYESKNQFELAVEAYQS--ALAIDQALAYAHNNL 705

Query: 96  IDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRIL-- 152
             LY K G LD+ IE  +Q ++    +  A N      A+       ++  QE   +L  
Sbjct: 706 GALYDKKGILDKAIEEYRQAIKYDPLYPYAHNNLGASLAKKGDMDSALSEFQEAVHLLPD 765

Query: 153 -----GNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
                 NLG+ +++ GN   A   + +   I P   +A
Sbjct: 766 NPDFRFNLGYVFLRMGNNALALQAFEETIRIKPSHTEA 803


>gi|54302280|ref|YP_132273.1| hypothetical protein PBPRB0600 [Photobacterium profundum SS9]
 gi|46915702|emb|CAG22473.1| hypothetical protein PBPRB0600 [Photobacterium profundum SS9]
          Length = 361

 Score = 37.7 bits (86), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 36/67 (53%), Gaps = 1/67 (1%)

Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI-DPDANKACNLSHCLIKQARYTEAR 202
           +  ++ + L NLG++Y   G+   AE++ R+A  I   D     NL    I+  RY++A 
Sbjct: 218 INHKSRQALTNLGYSYYLSGDLKQAEMINRRATTIYQTDKRAWSNLGLVYIRSKRYSDAL 277

Query: 203 SVLEDVL 209
             L++V+
Sbjct: 278 DALQNVM 284


>gi|308500566|ref|XP_003112468.1| hypothetical protein CRE_30752 [Caenorhabditis remanei]
 gi|308267036|gb|EFP10989.1| hypothetical protein CRE_30752 [Caenorhabditis remanei]
          Length = 526

 Score = 37.7 bits (86), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 48/174 (27%), Positives = 74/174 (42%), Gaps = 35/174 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
           V + L  +A+V + Q++  EA   +     +  K   E       +L+N+ I + ++ GR
Sbjct: 243 VATMLNVLAMVYRDQEKYKEATNFLTRALQIREKHYGENHPYVAATLNNLAIIIGRR-GR 301

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
           LDE   L K+ L++    EA  GK +                + ++ L NLG      G 
Sbjct: 302 LDEAQTLCKRSLKI---REAVYGKDSP---------------DVAKQLNNLGLLCQNLGK 343

Query: 165 YTSAEVVYRKAQLI---DPDANKA------CNLSHCLIKQARYTEARSVLEDVL 209
           Y  AE  Y+KA  I     DAN +       +LS   +KQ  Y EA    + VL
Sbjct: 344 YEEAEDYYKKALKIYEMKLDANHSETVKTRNHLSSLYMKQGNYKEAEGFYKQVL 397


>gi|310794305|gb|EFQ29766.1| tetratricopeptide [Glomerella graminicola M1.001]
          Length = 615

 Score = 37.7 bits (86), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 28/51 (54%)

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVL 205
           L   +   GNYT A+    K  L+  + +     +HCLIKQ+R+ EA +VL
Sbjct: 41  LAQVHFATGNYTRAQTFLAKQDLVARNVSCRYLAAHCLIKQSRFDEAVAVL 91


>gi|189426262|ref|YP_001953439.1| hypothetical protein Glov_3213 [Geobacter lovleyi SZ]
 gi|189422521|gb|ACD96919.1| Tetratricopeptide TPR_2 repeat protein [Geobacter lovleyi SZ]
          Length = 639

 Score = 37.7 bits (86), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 48/186 (25%), Positives = 79/186 (42%), Gaps = 36/186 (19%)

Query: 22  YVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRH 81
           +V+ K++ L E+  +A + L      A D  + A +++A V + +   D+AVE       
Sbjct: 419 FVKNKNLNLAEESYKAVLKL------APDNPE-ANRELAAVYRAKGATDKAVEHYTKALE 471

Query: 82  LCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQ 141
           L  +  +    N L+ +Y K  + DE  ELL++ + +              A +H K   
Sbjct: 472 LQEEDNESR--NALVAIYVKDKKYDELAELLQEAVEL----------APDDANNHYK--- 516

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIKQARYTE 200
                        LG  Y  K  Y +A   Y+KA  + PD  +A + L    +K  R +E
Sbjct: 517 -------------LGLIYDFKKEYDNAIASYKKAAELKPDHARALHALGRVYMKTGRLSE 563

Query: 201 ARSVLE 206
           AR  LE
Sbjct: 564 AREALE 569


>gi|119486472|ref|ZP_01620530.1| kinesin light chain-like protein [Lyngbya sp. PCC 8106]
 gi|119456374|gb|EAW37505.1| kinesin light chain-like protein [Lyngbya sp. PCC 8106]
          Length = 1127

 Score = 37.7 bits (86), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 46/187 (24%), Positives = 76/187 (40%), Gaps = 43/187 (22%)

Query: 56  LKDMAVVLKQQDRVDEAVEAIKSFRHLCSK-------QAQESLDNVLIDLYKKCGRLDEQ 108
           L+  AV L QQ + +EA+  +K    +  +          +SL+N+ I LY+  GR  E 
Sbjct: 111 LEQQAVQLYQQGKYNEAIPILKRVLEIIERLLGENHPDVAQSLNNLAI-LYRDQGRYSEA 169

Query: 109 IELLKQKLRMIYH----------GEAFN---------GKPTKTARSHGKKFQVTVKQ--- 146
             L ++ L  IY            ++ N         G+ ++    H +   +  K    
Sbjct: 170 EPLFQRSL-TIYEKALGENHPDVAQSLNNLAQLYYSQGRYSEAEPLHQRSLAIREKALGE 228

Query: 147 ---ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI--------DPD-ANKACNLSHCLIK 194
              + +  L NL   Y  +G Y+ AE +YR++  I         PD A    NL+   + 
Sbjct: 229 NHPDVATSLNNLAQLYDSQGRYSEAEPLYRRSLAILEKALGENHPDVATSLNNLATLYVS 288

Query: 195 QARYTEA 201
           Q RY +A
Sbjct: 289 QGRYDKA 295


>gi|189426401|ref|YP_001953578.1| hypothetical protein Glov_3352 [Geobacter lovleyi SZ]
 gi|189422660|gb|ACD97058.1| TPR repeat-containing protein [Geobacter lovleyi SZ]
          Length = 404

 Score = 37.7 bits (86), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 63/151 (41%), Gaps = 10/151 (6%)

Query: 96  IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN- 154
           ID YK   ++D+  + +   L  +Y     NG+ ++      K  +  +KQ     L   
Sbjct: 81  IDAYKMSLKVDKTQDTVYTTLAGVYID---NGQKSEAE----KVLKDGIKQNNQNTLAYY 133

Query: 155 -LGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVLEDVLLGK 212
            LG    Q G+YTSAE  +R+   ++P D N    L   L  Q +Y EA   L+     K
Sbjct: 134 MLGQLQAQNGDYTSAEANFRQVIKLEPKDGNGYYALGMALNGQEKYDEAVEALQKATDLK 193

Query: 213 LSGSTETKTINRVKELLQELEPWQSIPPSLT 243
              S     + R    L E +  Q I  +LT
Sbjct: 194 ADFSPALLELGRAYAGLGEKDKAQEIVDTLT 224


>gi|171912400|ref|ZP_02927870.1| Redoxin domain protein [Verrucomicrobium spinosum DSM 4136]
          Length = 776

 Score = 37.7 bits (86), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 33/66 (50%), Gaps = 4/66 (6%)

Query: 120 YHGEAFNGKPTKTA---RSHGK-KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
           +HGEAF+  P + A      GK KF VT K E ++     G   +    Y  AE  +R+A
Sbjct: 25  FHGEAFDDGPRQAAVLMEGCGKVKFPVTTKNELAQKFFTQGIGQIHGFWYYEAERSFRQA 84

Query: 176 QLIDPD 181
             +DPD
Sbjct: 85  ADLDPD 90


>gi|451982583|ref|ZP_21930892.1| exported hypothetical protein, contains TPR repeats [Nitrospina
           gracilis 3/211]
 gi|451760229|emb|CCQ92187.1| exported hypothetical protein, contains TPR repeats [Nitrospina
           gracilis 3/211]
          Length = 395

 Score = 37.4 bits (85), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 43/82 (52%), Gaps = 2/82 (2%)

Query: 131 KTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLS 189
           K   SH K   + + +  ++ L +LGWA  Q G+  +AE   R++  ++  D     NL 
Sbjct: 61  KAGASHLKD-TIRLNKNHTQALRDLGWALYQMGDLAAAEKWLRQSYRLNAKDPKTVANLG 119

Query: 190 HCLIKQARYTEARSVLEDVLLG 211
             LI + RY+ A +VL++ + G
Sbjct: 120 AVLIARERYSAAVAVLDNSVPG 141


>gi|350551992|ref|ZP_08921201.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiorhodospira
           sibirica ATCC 700588]
 gi|349795635|gb|EGZ49431.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiorhodospira
           sibirica ATCC 700588]
          Length = 581

 Score = 37.4 bits (85), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 20/59 (33%), Positives = 28/59 (47%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
           I  +LGW   + G    AE+  R+A  +  DA  A NL+  L  +  Y EA  +L   L
Sbjct: 503 ITDSLGWVLFRLGRLDEAEIYLRRAHAMLEDAEIASNLAALLWARGHYDEAWEILRTAL 561


>gi|302344515|ref|YP_003809044.1| hypothetical protein Deba_3097 [Desulfarculus baarsii DSM 2075]
 gi|301641128|gb|ADK86450.1| Tetratricopeptide TPR_2 repeat protein [Desulfarculus baarsii DSM
           2075]
          Length = 803

 Score = 37.4 bits (85), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 41/84 (48%), Gaps = 5/84 (5%)

Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVLEDV 208
           R+L  LGW    +G + SA     +A+ + P DA+   +L   L K  R  EA+  L D 
Sbjct: 720 RVLDTLGWVLSLRGAHASALPELEEARKLAPTDASVYYHLGSTLAKMGRANEAKKALRDG 779

Query: 209 LLGKLSGSTETKTINRVKELLQEL 232
           L  K     +  T   +++LLQEL
Sbjct: 780 LAQK----GDQTTRQEIQKLLQEL 799


>gi|300313205|ref|YP_003777297.1| TPR repeat containing protein [Herbaspirillum seropedicae SmR1]
 gi|300075990|gb|ADJ65389.1| TPR repeat containing protein [Herbaspirillum seropedicae SmR1]
          Length = 600

 Score = 37.4 bits (85), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 29/62 (46%)

Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
           E + I  +LGW   + G    AE   R+A  + PDA    +L   L  + +  EAR +  
Sbjct: 518 EDAFIADSLGWVLFRMGRLDEAETQMRRAYGLRPDAEIGVHLGEVLWTRGKQDEARKIWR 577

Query: 207 DV 208
           DV
Sbjct: 578 DV 579


>gi|254492569|ref|ZP_05105741.1| hypothetical protein MDMS009_2911 [Methylophaga thiooxidans DMS010]
 gi|224462461|gb|EEF78738.1| hypothetical protein MDMS009_2911 [Methylophaga thiooxydans DMS010]
          Length = 532

 Score = 37.4 bits (85), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 32/140 (22%), Positives = 59/140 (42%), Gaps = 29/140 (20%)

Query: 42  FWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKK 101
           + KA+        A+ ++ VVL++Q R++EAVEA    + L  +Q  ++  N+   L K 
Sbjct: 136 YQKAVELQPGFYEAMANLGVVLQEQGRLEEAVEAYN--KALAVQQDAQTFFNLGTAL-KN 192

Query: 102 CGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQ 161
            G+L + I+   Q L                           +  + + +  N+G     
Sbjct: 193 QGKLGDAIDAYNQAL--------------------------VINPDYAEVHSNIGEVLRD 226

Query: 162 KGNYTSAEVVYRKAQLIDPD 181
           +G Y  +   Y++A  +DPD
Sbjct: 227 QGRYDESVKAYKQALTLDPD 246


>gi|153006532|ref|YP_001380857.1| hypothetical protein Anae109_3694 [Anaeromyxobacter sp. Fw109-5]
 gi|152030105|gb|ABS27873.1| Tetratricopeptide TPR_2 repeat protein [Anaeromyxobacter sp.
           Fw109-5]
          Length = 638

 Score = 37.4 bits (85), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 31/126 (24%), Positives = 55/126 (43%), Gaps = 11/126 (8%)

Query: 68  RVDEAVEAIKSFRH-LCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY-HGEAF 125
           R  +A EA+ + R  L S    ++L   L   Y + G+LD  +  ++  L ++  H EA 
Sbjct: 449 RAGKAAEAVSALRSALASDPRDQALLYALGATYHRAGQLDAAVAQMQALLALVPDHAEAL 508

Query: 126 NGKPTKTARSHGK--------KFQVTVKQETSRILGNLGWAYMQKGNYT-SAEVVYRKAQ 176
           N      A    +        +  V ++  +  +  +LGW   ++G Y  +AE + +   
Sbjct: 509 NFMGYALAERGTRLDEAERLVRRAVELRPRSGHVRDSLGWVLFRRGEYARAAEALEQADA 568

Query: 177 LIDPDA 182
           L  PDA
Sbjct: 569 LAGPDA 574


>gi|189347525|ref|YP_001944054.1| hypothetical protein Clim_2046 [Chlorobium limicola DSM 245]
 gi|189341672|gb|ACD91075.1| Tetratricopeptide TPR_2 repeat protein [Chlorobium limicola DSM
           245]
          Length = 208

 Score = 37.4 bits (85), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 28/120 (23%), Positives = 52/120 (43%), Gaps = 11/120 (9%)

Query: 83  CSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQV 142
           C+  + + +DN+   ++K  G  D    L          G AF         S   +  +
Sbjct: 30  CASPSSKEIDNLQQQVWKNSGDADAWFRL----------GNAFARNEQYRKASEAYREAL 79

Query: 143 TVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN-KACNLSHCLIKQARYTEA 201
           ++  E  ++L  LG A   +GNY  A V + K Q + PD + +  ++ + L++   Y +A
Sbjct: 80  SIDPEKEQLLAALGAASFNQGNYREALVYFTKYQALAPDDSLRNYDIGNVLLQMREYDKA 139


>gi|332711060|ref|ZP_08430995.1| hypothetical protein LYNGBM3L_60180 [Moorea producens 3L]
 gi|332350186|gb|EGJ29791.1| hypothetical protein LYNGBM3L_60180 [Moorea producens 3L]
          Length = 419

 Score = 37.4 bits (85), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 35/81 (43%), Gaps = 20/81 (24%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQA-------------- 196
           +L NLGWA +++  Y  AE   R A  +D    +    +HCL+ Q               
Sbjct: 318 LLKNLGWARLKQERYAEAETYLRDAIELDKTFTRTPAAAHCLLAQVIENKPEKDPDNALA 377

Query: 197 ------RYTEARSVLEDVLLG 211
                 RYT+ R+  ED+ +G
Sbjct: 378 EWKICLRYTDIRNPDEDIWVG 398


>gi|344337618|ref|ZP_08768552.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiocapsa marina
           5811]
 gi|343802571|gb|EGV20511.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiocapsa marina
           5811]
          Length = 607

 Score = 37.4 bits (85), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 77/200 (38%), Gaps = 33/200 (16%)

Query: 18  GDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK 77
           GD    R    QL ++ PE A++L+                   VL+   R DEA+    
Sbjct: 398 GDVQRAREILQQLRDRGPENAVLLYLVETE--------------VLESVGRADEAMSVFD 443

Query: 78  SFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY-----HGEAFNG----- 127
           S   +         +N+L        +LD +I L +Q LR I      H +A N      
Sbjct: 444 SALQVFPDD-----ENLLYARALSAVKLD-RIALAEQDLRRIIEIDPEHADALNALGYTL 497

Query: 128 --KPTKTARSHGK-KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
             +  + A + G  +    +K E   IL ++GW + + G+Y +A    R+A     D   
Sbjct: 498 ADRTDRYAEAKGYIEKAYAIKPEEPAILDSMGWIHYRLGDYETAVDYLRRALARMSDGEI 557

Query: 185 ACNLSHCLIKQARYTEARSV 204
           A +L   L    R  EA  V
Sbjct: 558 AAHLGEVLWAMNRREEALEV 577


>gi|321465641|gb|EFX76641.1| hypothetical protein DAPPUDRAFT_54951 [Daphnia pulex]
          Length = 825

 Score = 37.4 bits (85), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 34/134 (25%), Positives = 63/134 (47%), Gaps = 21/134 (15%)

Query: 44  KAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCG 103
           KA++  + +     ++ ++ ++Q R +EA+++ +S  H   + A   L+  L+      G
Sbjct: 502 KALSYRNNMADVHYNLGILYQEQKRYEEAIQSYRSAVHYRPRMAMAHLNMGLV--LALMG 559

Query: 104 RLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI--LGNLGWAYMQ 161
             DE IE+ ++  ++       +G   K  R+H          ET++I  L NLG  +  
Sbjct: 560 MKDEAIEVYRRCSQL-------DGSGLKDPRTH----------ETTKISALFNLGRLHAD 602

Query: 162 KGNYTSAEVVYRKA 175
            G YT A  VY +A
Sbjct: 603 DGQYTKAIDVYNEA 616


>gi|332234364|ref|XP_003266379.1| PREDICTED: LOW QUALITY PROTEIN: kinesin light chain 4 [Nomascus
           leucogenys]
          Length = 606

 Score = 37.4 bits (85), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 65/150 (43%), Gaps = 31/150 (20%)

Query: 69  VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
           V+E      SFRH C   A  +L+N+ + LY K G+  E   L ++ L +          
Sbjct: 275 VEEGGIRSPSFRH-CPVAA--TLNNLAV-LYGKRGKYKEAEPLCQRALEIR--------- 321

Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI-----DPD-- 181
                    +K   T   + ++ L NL      +G Y + E  Y++A  I      PD  
Sbjct: 322 ---------EKVLGTNHPDVAKQLNNLALLCQNQGKYEAVERYYQRALAIYEGQLGPDNP 372

Query: 182 --ANKACNLSHCLIKQARYTEARSVLEDVL 209
             A    NL+ C +KQ +Y EA ++ +++L
Sbjct: 373 NVARTKNNLASCYLKQGKYAEAETLYKEIL 402


>gi|322420030|ref|YP_004199253.1| hypothetical protein GM18_2524 [Geobacter sp. M18]
 gi|320126417|gb|ADW13977.1| Tetratricopeptide TPR_1 repeat-containing protein [Geobacter sp.
           M18]
          Length = 572

 Score = 37.4 bits (85), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 39/155 (25%), Positives = 68/155 (43%), Gaps = 27/155 (17%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P  A+  + KA++     + AL +M +  + Q  + +A+++ K    +    A  ++   
Sbjct: 199 PREALTYYKKAVDLKPDFEQALIEMGISQETQGLIPDAIDSYKDLLEINPNNA--NVIQH 256

Query: 95  LIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHG-------------KKFQ 141
           L  LY +  RLDE + LL+QK           GK  +T+R  G             K FQ
Sbjct: 257 LAQLYIQQKRLDEALALLQQK----------GGKSLETSRKIGLLFLELERYDEAIKTFQ 306

Query: 142 --VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRK 174
             + V+ +  ++   L  AY +K +   A V +RK
Sbjct: 307 DILKVEPDAQQVRFYLASAYEEKEDVDQAIVEFRK 341


>gi|411118571|ref|ZP_11390952.1| methyltransferase, FkbM family [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410712295|gb|EKQ69801.1| methyltransferase, FkbM family [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 928

 Score = 37.4 bits (85), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 51/196 (26%), Positives = 80/196 (40%), Gaps = 32/196 (16%)

Query: 18  GDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK 77
           G+ P     + Q+VE DPE     FW               +A+V  QQ R  EA+   +
Sbjct: 17  GNLPDAETAYRQVVEDDPEHIEAWFW---------------LALVTDQQGRPMEAIAHYQ 61

Query: 78  SFRHL--CSKQAQESLDNVLIDLYKKCGRLDEQI-------ELLKQKLRMIYHGEA--FN 126
               L   S +A  +L +V +    K  R DE I       EL+ Q  +  Y+     + 
Sbjct: 62  KVLQLQPNSAEAHGNLGSVWL----KLRRFDEAIAHHRKSVELMPQNAKAHYNLAIALYE 117

Query: 127 GKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC 186
                 A ++ ++  V +  E +    NLG A  ++G    A   Y+KA  ++P+   A 
Sbjct: 118 NNQVDEAITYYQQ-AVALMPEYANAHHNLGMALYRQGKADEAITHYQKAIALEPNHASAR 176

Query: 187 N-LSHCLIKQARYTEA 201
           N L   L +Q +  EA
Sbjct: 177 NSLGVALYQQGKIDEA 192


>gi|307154916|ref|YP_003890300.1| hypothetical protein Cyan7822_5141 [Cyanothece sp. PCC 7822]
 gi|306985144|gb|ADN17025.1| TPR repeat-containing protein [Cyanothece sp. PCC 7822]
          Length = 965

 Score = 37.4 bits (85), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 46/173 (26%), Positives = 65/173 (37%), Gaps = 33/173 (19%)

Query: 52  VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV------LIDLYKKCGRL 105
           V ++L ++A +   Q R +EA    K    L  +   ++  NV      L  LY   GR 
Sbjct: 698 VATSLNNLAALYDSQGRYEEAEPLYKQALELSKRLLGDNHPNVATSLNNLAALYDSQGRY 757

Query: 106 DEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNY 165
           +E   L KQ L +I                  K+       + +  L NL   Y  +G Y
Sbjct: 758 EEAEPLYKQALELI------------------KRLLGDNHPDVATSLNNLAALYDSQGRY 799

Query: 166 TSAEVVY------RKAQLIDPDANKAC---NLSHCLIKQARYTEARSVLEDVL 209
             AE +Y      RK  L D   N A    NL+     Q RY EA  + +  L
Sbjct: 800 EEAEPLYKQALELRKRLLGDNHPNVASSLNNLAGLYDSQGRYEEAEPLYKQAL 852


>gi|113474349|ref|YP_720410.1| hypothetical protein Tery_0481 [Trichodesmium erythraeum IMS101]
 gi|110165397|gb|ABG49937.1| TPR repeat [Trichodesmium erythraeum IMS101]
          Length = 3145

 Score = 37.4 bits (85), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 39/171 (22%), Positives = 79/171 (46%), Gaps = 11/171 (6%)

Query: 36   EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
            + AI L  + IN       A  ++A ++K++ R++EA+   +    L    +    +   
Sbjct: 1044 DIAIQLLNQVINLYPGFTEAYSNLAKLMKKEGRLEEAIAHYQKAISLEPNNSSNYSNLGF 1103

Query: 96   IDLYKKCGRLDEQIELLKQKLRM--------IYHGEAFNGKPTKTARSHGKKFQVTVKQE 147
            I L K  G+++  I   ++ + +           G A+  K   +  S   +  + ++ +
Sbjct: 1104 IFLEK--GQIESAIINSEKSIEINPNNSQGNFNLGFAWAEKGDLSKASTYYQKAINLQPD 1161

Query: 148  TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQAR 197
             ++   NLG  + +KGN + A   Y++A  I+P+  +A CNL   L+KQ +
Sbjct: 1162 YAQAHNNLGLIFQEKGNLSKASNYYQQALEINPNYAEAWCNLGVILLKQGQ 1212


>gi|381158787|ref|ZP_09868020.1| hypothetical protein Thi970DRAFT_02489 [Thiorhodovibrio sp. 970]
 gi|380880145|gb|EIC22236.1| hypothetical protein Thi970DRAFT_02489 [Thiorhodovibrio sp. 970]
          Length = 338

 Score = 37.4 bits (85), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 64/256 (25%), Positives = 97/256 (37%), Gaps = 49/256 (19%)

Query: 16  PPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA 75
           P    PY+R+  +   + + E A   F   +   +R       +  V ++ D+ ++AV+A
Sbjct: 83  PTHVQPYLRSGRIYFQQGELEKAREAFLNVLRVNERSAIGHAALGFVYQRSDQANQAVDA 142

Query: 76  IKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARS 135
              FR L        L   L  + +K GR  + +   K  LR+         +PT     
Sbjct: 143 W--FRALSFNPRMVGLRKRLALILQKLGRQSDAMAQAKAALRI---------RPTDP--- 188

Query: 136 HGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA---CNLSHCL 192
                      E   I G L   ++    +  A   YRKA  +DPDA K      L+   
Sbjct: 189 -----------EAHAINGRL---HLLAKEFDPALRAYRKAIDLDPDAKKLGIRLGLAEAC 234

Query: 193 IKQARYTEARSVLEDV--------LLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
           I      EA +VL  V        LL KL G   T+     KE ++E         ++ T
Sbjct: 235 ICAGHLDEAETVLSAVPESDQFSALLHKLWGDIYTEK-GLHKEAMEEYR------GAVLT 287

Query: 245 KKSSLEDAFLEGLDDL 260
               +E   +EGLDDL
Sbjct: 288 SGQEVE---IEGLDDL 300


>gi|148262768|ref|YP_001229474.1| TPR repeat-containing protein [Geobacter uraniireducens Rf4]
 gi|146396268|gb|ABQ24901.1| Tetratricopeptide TPR_2 repeat protein [Geobacter uraniireducens
           Rf4]
          Length = 632

 Score = 37.4 bits (85), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 46/178 (25%), Positives = 78/178 (43%), Gaps = 22/178 (12%)

Query: 19  DSPYVRAK--HVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAI 76
           D+P +  K   V +  KD  AAI  + +          A +D+A + +++++ +EA +  
Sbjct: 404 DNPLIHFKLAKVYVNSKDYPAAISEYLETTKLDPDNIEAHRDLAALFRKKNQNEEAEKEY 463

Query: 77  KSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQ-------------KLRMIYHGE 123
           +S   L  K+        L  +Y K    DE I LLK+             KL +IY  +
Sbjct: 464 RSI--LRMKKDDVEARTALTSIYVKNKNYDELINLLKEGVELNPKDPNSHYKLGLIYEFK 521

Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
               K    A S  K+  V +K + ++ L  +G AYM+ G  + A+     A+  DP+
Sbjct: 522 ----KDYDAAISQYKE-SVALKSDHAKALNAMGRAYMKSGRISEAKEALETAKKADPE 574


>gi|406953657|gb|EKD82827.1| TPR Domain containing protein, partial [uncultured bacterium]
          Length = 1487

 Score = 37.4 bits (85), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 57/114 (50%), Gaps = 21/114 (18%)

Query: 86  QAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAF-NGKPT------KTARSHGK 138
           QA + LDN  ++LY +CG+L ++++L+++ LR     EA    KP       + A  +G 
Sbjct: 185 QAVKELDNSNVELYYRCGKLYQRLDLMEEALREF---EAIARMKPDWLEIRYELATLYGD 241

Query: 139 K--FQVTVKQ---------ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
           K  +++ +K+          ++ ++ NL  +Y   G+   A V Y+ A    PD
Sbjct: 242 KGEYELQIKEYEAIAAYDTTSAEVMKNLAASYAANGDLNKAIVSYKHALESTPD 295


>gi|170691501|ref|ZP_02882666.1| TPR repeat-containing protein [Burkholderia graminis C4D1M]
 gi|170143706|gb|EDT11869.1| TPR repeat-containing protein [Burkholderia graminis C4D1M]
          Length = 613

 Score = 37.4 bits (85), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 22/76 (28%), Positives = 36/76 (47%)

Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLL 210
           I+ ++GW   + G+ T A  V RKA  I P+A    +L   L K     +AR+   +   
Sbjct: 534 IMDSVGWVKYRMGDTTDAIKVLRKAYDIQPNAEIGAHLGEVLWKSGNQDQARAAFREARK 593

Query: 211 GKLSGSTETKTINRVK 226
            +    T  KT+ R++
Sbjct: 594 LEPDNETLVKTLQRLQ 609


>gi|427706474|ref|YP_007048851.1| NB-ARC domain-containing protein [Nostoc sp. PCC 7107]
 gi|427358979|gb|AFY41701.1| NB-ARC domain protein [Nostoc sp. PCC 7107]
          Length = 980

 Score = 37.4 bits (85), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 47/166 (28%), Positives = 74/166 (44%), Gaps = 35/166 (21%)

Query: 52  VDSALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
           V ++L ++A + + QDR  EA    ++A+   R L  ++     ++L+N+ + LY+  GR
Sbjct: 626 VANSLNNLAELYRSQDRYSEAEALHIQALALRRKLLGEEHPDVAQTLNNLAV-LYRFQGR 684

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             E   L         H +A   +       H           TS  L NL   Y  +G 
Sbjct: 685 YSEAEAL---------HIQALALRRKLLGEEHPDVL-------TS--LNNLALLYYGQGK 726

Query: 165 YTSAEVVY------RKAQLID--PD-ANKACNLSHCLIKQARYTEA 201
           Y+ AE +Y      R+  L D  PD AN   NL++    Q+RY+EA
Sbjct: 727 YSEAEPLYIQALKLRRKLLGDEHPDVANSLHNLAYLYRSQSRYSEA 772


>gi|39996647|ref|NP_952598.1| hypothetical protein GSU1547 [Geobacter sulfurreducens PCA]
 gi|409912070|ref|YP_006890535.1| hypothetical protein KN400_1571 [Geobacter sulfurreducens KN400]
 gi|39983528|gb|AAR34921.1| TPR domain protein [Geobacter sulfurreducens PCA]
 gi|298505660|gb|ADI84383.1| TPR domain protein [Geobacter sulfurreducens KN400]
          Length = 473

 Score = 37.4 bits (85), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTE 200
           + ++ ++     NLG     +     AE  YR A  +DPD ++A  NLS+ L++Q R+ E
Sbjct: 105 IELEPDSPAAWSNLGVLQACRKQEREAEESYRTALRLDPDYHRASFNLSYLLLRQGRFDE 164

Query: 201 ARSVLE 206
             S+LE
Sbjct: 165 GWSLLE 170


>gi|294138897|ref|YP_003554875.1| cellulose synthase operon C protein [Shewanella violacea DSS12]
 gi|293325366|dbj|BAJ00097.1| cellulose synthase operon C protein, putative [Shewanella violacea
           DSS12]
          Length = 1195

 Score = 37.4 bits (85), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 21/83 (25%), Positives = 40/83 (48%), Gaps = 15/83 (18%)

Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCL------ 192
           ++ +T + +   +LG +G  Y++ GN T A   ++KAQ +D D + A      +      
Sbjct: 273 RYALTTRPDDPELLGAIGTIYLRGGNQTQALAYFKKAQELDKDPDAASKWRSLIKVSSYW 332

Query: 193 ---------IKQARYTEARSVLE 206
                    +K+ RYT+A+ + E
Sbjct: 333 AYLKQGDTALKENRYTDAKHLYE 355


>gi|298492037|ref|YP_003722214.1| hypothetical protein Aazo_3466 ['Nostoc azollae' 0708]
 gi|298233955|gb|ADI65091.1| TPR repeat-containing protein ['Nostoc azollae' 0708]
          Length = 374

 Score = 37.4 bits (85), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 75/172 (43%), Gaps = 29/172 (16%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           EAAI  + +A+ A  ++ +A  ++ V+L +++R  EA+ A +   +L    A     N+ 
Sbjct: 148 EAAIAAYRQALAANPKMAAAQYNLGVILYEEERCQEAIAAYQEAINLDRNNANAYF-NLA 206

Query: 96  IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNL 155
           I L ++ G+L++ I   +Q L++                            E +    NL
Sbjct: 207 IAL-QQEGQLEQAIATYRQILKL--------------------------NPENTVAYNNL 239

Query: 156 GWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVLE 206
           G   + +G  + A  +Y+KA   +P +A    NL   L  Q    EA + L+
Sbjct: 240 GSLMVIQGQPSEAIAIYQKAIGQNPKNALAYYNLGVTLYNQGNLKEANAALK 291


>gi|193215104|ref|YP_001996303.1| tetratricopeptide domain-containing protein [Chloroherpeton
           thalassium ATCC 35110]
 gi|193088581|gb|ACF13856.1| Tetratricopeptide TPR_2 repeat protein [Chloroherpeton thalassium
           ATCC 35110]
          Length = 638

 Score = 37.4 bits (85), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 82/189 (43%), Gaps = 45/189 (23%)

Query: 29  QLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV----EAIKSFRHLCS 84
           +L E++P+A ++           V  +L ++A+V+ QQ R +EA     +A+K       
Sbjct: 271 KLAEENPDAYLL----------DVAESLYNLAIVMYQQGRYEEAEPLHRQALKIREEKLG 320

Query: 85  K---QAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQ 141
                   SL+N+ I +  K GR +E   L ++ L++         +  K    H     
Sbjct: 321 DDHPDVASSLNNLAI-VMSKQGRYEEAEPLYQRALKI---------REEKLGDDH----- 365

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI--------DPDANKAC-NLSHCL 192
                + ++ L NL     Q+G Y  AE ++R+A  I         PD  K+  NL+  +
Sbjct: 366 ----PDVAKSLNNLAIVMHQQGRYEEAEPLHRQALKIREEKLGDDHPDVAKSLYNLASVM 421

Query: 193 IKQARYTEA 201
            +Q RY EA
Sbjct: 422 YEQGRYEEA 430



 Score = 37.0 bits (84), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 70/165 (42%), Gaps = 33/165 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYK------KCGRL 105
           V S+L ++A+V+ QQ R +EA    +    +  ++  +   +V   LY       K GR 
Sbjct: 452 VASSLNNLAIVMHQQGRYEEAESLYQRAIKILEEKLGDGHPDVATSLYNLANVMSKQGRY 511

Query: 106 DEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNY 165
           +E   L +Q L+ I+  +  +  P                 + +  L NL     ++G Y
Sbjct: 512 EEAEPLYRQALK-IWEEKLGDDHP-----------------DVASSLNNLAIVMSKQGRY 553

Query: 166 TSAEVVYRKAQLIDPD---------ANKACNLSHCLIKQARYTEA 201
             AE ++R+A  I  +         A+   NL+  + KQ RY EA
Sbjct: 554 EEAEPLHRQALKIREEKLGDDHPYVAHSLNNLALMMSKQGRYEEA 598


>gi|410664414|ref|YP_006916785.1| hypothetical protein M5M_09350 [Simiduia agarivorans SA1 = DSM
           21679]
 gi|409026771|gb|AFU99055.1| hypothetical protein M5M_09350 [Simiduia agarivorans SA1 = DSM
           21679]
          Length = 577

 Score = 37.4 bits (85), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 38/145 (26%), Positives = 60/145 (41%), Gaps = 27/145 (18%)

Query: 60  AVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMI 119
           A+V +Q DR+D A + ++         A  +L+ +   L  +  R DE  +L+ Q     
Sbjct: 435 ALVNEQLDRLDLAEQDLRKVIRYEPNNAT-ALNALGYTLADRTDRYDEAYQLISQ----- 488

Query: 120 YHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLID 179
               A + KP   A                 I+ +LGW + + GNY  A +  R+A  + 
Sbjct: 489 ----AHSLKPGDAA-----------------IIDSLGWVHYRLGNYEEALLRLREALQLF 527

Query: 180 PDANKACNLSHCLIKQARYTEARSV 204
           PD   A +L   L       EAR+V
Sbjct: 528 PDPEIAAHLGEVLWVTGNQAEARAV 552


>gi|302391517|ref|YP_003827337.1| hypothetical protein [Acetohalobium arabaticum DSM 5501]
 gi|302203594|gb|ADL12272.1| Tetratricopeptide TPR_2 repeat protein [Acetohalobium arabaticum
           DSM 5501]
          Length = 251

 Score = 37.4 bits (85), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 29/123 (23%), Positives = 53/123 (43%), Gaps = 28/123 (22%)

Query: 58  DMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLR 117
           ++  V  +QD++D+A+E  K  + + + Q  +   NVL   Y+K  ++ E I+  K+   
Sbjct: 106 NLGRVYLKQDKIDQAIEITK--QAIETNQKIDDAYNVLGRAYRKAEKIKEAIDSFKK--- 160

Query: 118 MIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQL 177
                                   V +  E   +L NLG+ Y+Q   Y  A  V ++A  
Sbjct: 161 -----------------------AVELNDENYYVLNNLGYTYIQTNQYKEAISVLKQAVA 197

Query: 178 IDP 180
           ++P
Sbjct: 198 LNP 200


>gi|443242367|ref|YP_007375592.1| TPR-domain containing protein [Nonlabens dokdonensis DSW-6]
 gi|442799766|gb|AGC75571.1| TPR-domain containing protein [Nonlabens dokdonensis DSW-6]
          Length = 1000

 Score = 37.4 bits (85), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 48/202 (23%), Positives = 83/202 (41%), Gaps = 24/202 (11%)

Query: 59  MAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKL-- 116
           +  +  + D V+ A E         S +  E L     DL  K G+ DE I+L K++L  
Sbjct: 172 LGYIAYENDEVETATELFNQVGE--SPEDDEKLSYYKADLNYKSGKFDEAIKLAKEQLPK 229

Query: 117 ----------RMIYHGEA-FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNY 165
                     R+I  G++ FN +    A  + ++++ T  +  +     LG+AY ++GN+
Sbjct: 230 SNRREKSQLNRII--GQSLFNQEKYDEALPYLQEYEGTRGKWNNNDYYQLGYAYYKQGNF 287

Query: 166 TSAEVVYRKAQLIDPDANKACN----LSHCLIKQARYTEARSVLEDVLLGKLSGS-TETK 220
             A   + K  +ID     A N    L    IK  R  +A +  +      L    +E  
Sbjct: 288 EKAIETFNK--IIDGANATAQNAYYHLGQSYIKLDRNEDALNAFKKASEMDLDAQISEDA 345

Query: 221 TINRVKELLQELEPWQSIPPSL 242
           + N  K   +   P+ S+P  +
Sbjct: 346 SYNYAKLSYENGNPYDSVPAVI 367


>gi|119487755|ref|ZP_01621264.1| TPR repeat protein [Lyngbya sp. PCC 8106]
 gi|119455588|gb|EAW36725.1| TPR repeat protein [Lyngbya sp. PCC 8106]
          Length = 998

 Score = 37.4 bits (85), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 21/73 (28%), Positives = 40/73 (54%), Gaps = 3/73 (4%)

Query: 140 FQVTVKQETSRILG--NLGWAYMQKGNYTSAEVVYRKAQLIDPDA-NKACNLSHCLIKQA 196
           +Q  ++ + S +    NLG    Q+G+ T AE +YRKA  ++P+      NL + L++Q 
Sbjct: 343 YQTAIELDNSLVYAYQNLGNLIKQQGSLTEAEAIYRKAIEVNPNHWGSYLNLGNILLEQT 402

Query: 197 RYTEARSVLEDVL 209
            + +A ++ +  L
Sbjct: 403 HFEDAIAIYQTAL 415


>gi|3023156|gb|AAD13354.1| decahistidyl-kinesin light chain [Expression vector pPK121]
          Length = 524

 Score = 37.4 bits (85), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 35/174 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
           V + L  +A+V + Q++  EA   +     +  K   E       +L+N+ + LY K G+
Sbjct: 227 VATMLNILALVYRDQNKYKEAANLLNDALSIRGKTLGENHPAVAATLNNLAV-LYGKRGK 285

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             +   L K+ L +               +  GK      KQ     L NL      +G 
Sbjct: 286 YKDAEPLCKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGK 327

Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
           Y   E  Y++A  I        DP+  K   NL+ C +KQ RYTEA  + + VL
Sbjct: 328 YDEVEKYYQRALDIYESKLGPDDPNVAKTKNNLAGCYLKQGRYTEAEILYKQVL 381


>gi|195589902|ref|XP_002084688.1| GD12703 [Drosophila simulans]
 gi|194196697|gb|EDX10273.1| GD12703 [Drosophila simulans]
          Length = 634

 Score = 37.4 bits (85), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 35/174 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
           V + L  +A+V + Q++  EA   +     +  K   E       +L+N+ + LY K G+
Sbjct: 227 VATMLNILALVYRDQNKYKEAANLLNDALSIRGKTLGENHPAVAATLNNLAV-LYGKRGK 285

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             +   L K+ L +               +  GK      KQ     L NL      +G 
Sbjct: 286 YKDAEPLCKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGK 327

Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
           Y   E  Y++A  I        DP+  K   NL+ C +KQ RYTEA  + + VL
Sbjct: 328 YDEVEKYYQRALDIYESKLGPDDPNVAKTKNNLAGCYLKQGRYTEAEILYKQVL 381


>gi|297539283|ref|YP_003675052.1| hypothetical protein M301_2106 [Methylotenera versatilis 301]
 gi|297258630|gb|ADI30475.1| Tetratricopeptide TPR_2 repeat protein [Methylotenera versatilis
           301]
          Length = 579

 Score = 37.4 bits (85), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 19/68 (27%), Positives = 34/68 (50%)

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEA 201
           +T+      +L +LGWAY +KG    A     +A  ++ D   A +L   L ++ ++ EA
Sbjct: 494 LTISPNDHYMLDSLGWAYYRKGKLDKAITYLEQAYQVNQDPEIAAHLGEVLWQKGKHDEA 553

Query: 202 RSVLEDVL 209
           + +  D L
Sbjct: 554 KKIWTDAL 561


>gi|78067598|ref|YP_370367.1| hypothetical protein Bcep18194_A6129 [Burkholderia sp. 383]
 gi|77968343|gb|ABB09723.1| TPR repeat protein [Burkholderia sp. 383]
          Length = 607

 Score = 37.4 bits (85), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 22/85 (25%), Positives = 39/85 (45%)

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEA 201
           +T+    + I+ +LGW   + G+   A  V R+A  + P+A    +L   L K     +A
Sbjct: 519 MTLAPNDAYIMDSLGWVKYRMGDTAGATKVLRRAYELQPNAEIGAHLGEVLWKSGAQDDA 578

Query: 202 RSVLEDVLLGKLSGSTETKTINRVK 226
           RS   D    +    T  +T+ R++
Sbjct: 579 RSAWRDAQKLEPDNDTLVQTLKRLQ 603


>gi|339499059|ref|YP_004697094.1| hypothetical protein Spica_0422 [Spirochaeta caldaria DSM 7334]
 gi|338833408|gb|AEJ18586.1| Tetratricopeptide TPR_1 repeat-containing protein [Spirochaeta
           caldaria DSM 7334]
          Length = 875

 Score = 37.4 bits (85), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 45/176 (25%), Positives = 76/176 (43%), Gaps = 15/176 (8%)

Query: 58  DMAVVLKQQDRVD--EAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCG-RLDEQIELLKQ 114
           D A++L   D VD  E + A ++   L      E       +  KKC  RLD  I+    
Sbjct: 616 DNALLLLNDDAVDDPEGLLAHQAGTILYRADRLEEAKAYFAEALKKCPERLDYLIDQANC 675

Query: 115 KLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRK 174
            + +  +GEA     T  A+++ +       +E   +L  +G+   +KG +  AE  YR 
Sbjct: 676 LINLGLYGEA----DTLLAKAYDR-------EENEAVLDLIGYIAYKKGEFPRAEAAYRL 724

Query: 175 AQLIDPDANKACN-LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELL 229
           A   DPD     + L+   I  AR+T A   ++ +       S E +++  ++E L
Sbjct: 725 ALERDPDNTSFLSALAWIYITMARWTSAEDCIQKLEHLVAPESVEYESVQELRERL 780


>gi|255035013|ref|YP_003085634.1| hypothetical protein Dfer_1220 [Dyadobacter fermentans DSM 18053]
 gi|254947769|gb|ACT92469.1| Tetratricopeptide TPR_2 repeat protein [Dyadobacter fermentans DSM
           18053]
          Length = 399

 Score = 37.0 bits (84), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 2/72 (2%)

Query: 35  PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
           P+ A++ + KA+      D AL  +A V  QQ     A+ A+K    L  K   + L+N+
Sbjct: 250 PDTALIFYQKALAIQPDYDEALNALATVAMQQRAYPAAIPALKKL--LRKKPQSKELNNM 307

Query: 95  LIDLYKKCGRLD 106
           L   Y+K G LD
Sbjct: 308 LGYAYEKSGELD 319


>gi|299132292|ref|ZP_07025487.1| Tetratricopeptide TPR_2 repeat protein [Afipia sp. 1NLS2]
 gi|298592429|gb|EFI52629.1| Tetratricopeptide TPR_2 repeat protein [Afipia sp. 1NLS2]
          Length = 269

 Score = 37.0 bits (84), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 40/81 (49%), Gaps = 3/81 (3%)

Query: 127 GKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKA 185
           GKP +  R +    ++  + +   +L NLG +Y+       AE   R+AQ + P D+   
Sbjct: 152 GKPEEARRYYASALKI--RPDDPSVLSNLGLSYVLTKELPQAEATLRRAQKLAPNDSRIR 209

Query: 186 CNLSHCLIKQARYTEARSVLE 206
            NL   L  Q R+ EA ++++
Sbjct: 210 QNLGLALGLQGRFAEAEAIVQ 230


>gi|325983627|ref|YP_004296029.1| hypothetical protein NAL212_3105 [Nitrosomonas sp. AL212]
 gi|325533146|gb|ADZ27867.1| hypothetical protein NAL212_3105 [Nitrosomonas sp. AL212]
          Length = 578

 Score = 37.0 bits (84), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 40/91 (43%), Gaps = 9/91 (9%)

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEA 201
           V +  E   I+ +LGW Y + GN T        A  + PDA  A +L   L  Q    +A
Sbjct: 488 VELAPEDPYIMDSLGWVYYRMGNLTEGLNYLNLAFSVRPDAEIAAHLGEVLWMQGAKNDA 547

Query: 202 ----RSVLE-----DVLLGKLSGSTETKTIN 223
               RS LE     +VLL  L   T+ K I+
Sbjct: 548 LNIWRSALEKEPGNEVLLETLQRLTKKKVID 578


>gi|325283109|ref|YP_004255650.1| hypothetical protein Deipr_0877 [Deinococcus proteolyticus MRP]
 gi|324314918|gb|ADY26033.1| hypothetical protein Deipr_0877 [Deinococcus proteolyticus MRP]
          Length = 271

 Score = 37.0 bits (84), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 41/160 (25%), Positives = 64/160 (40%), Gaps = 20/160 (12%)

Query: 69  VDEAVEAIKSFRHLCSKQAQESLD--------NVLIDLYKKCGRLDEQIELLKQKLRMIY 120
           V   VEA+K      + +AQ++L         + L  L  + GRLDE    L+  L  IY
Sbjct: 83  VQSGVEAVKPLDTKIA-EAQQALSRSDTLANRSSLAHLLTRAGRLDEAQSTLQPLLTGIY 141

Query: 121 HGEAF-----------NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAE 169
             +              G+P    +  G+    T     ++ L  L  A  Q+GN + AE
Sbjct: 142 RDDPVVLLTSAALDLAQGQPALAEQKLGQVELQTSAATRTQALTLLAEAQAQQGNLSGAE 201

Query: 170 VVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
             Y++A             +  L+KQ R  +A+ VL  +L
Sbjct: 202 ARYQEAMHAATTEEPRARYAEFLLKQGRTEQAQEVLNRLL 241


>gi|442632013|ref|NP_001261781.1| kinesin light chain, isoform C [Drosophila melanogaster]
 gi|440215713|gb|AGB94474.1| kinesin light chain, isoform C [Drosophila melanogaster]
          Length = 507

 Score = 37.0 bits (84), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 35/174 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
           V + L  +A+V + Q++  EA   +     +  K   E       +L+N+ + LY K G+
Sbjct: 227 VATMLNILALVYRDQNKYKEAANLLNDALSIRGKTLGENHPAVAATLNNLAV-LYGKRGK 285

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             +   L K+ L +               +  GK      KQ     L NL      +G 
Sbjct: 286 YKDAEPLCKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGK 327

Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
           Y   E  Y++A  I        DP+  K   NL+ C +KQ RYTEA  + + VL
Sbjct: 328 YDEVEKYYQRALDIYESKLGPDDPNVAKTKNNLAGCYLKQGRYTEAEILYKQVL 381


>gi|292620687|ref|XP_001921543.2| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Danio rerio]
          Length = 1102

 Score = 37.0 bits (84), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 65/162 (40%), Gaps = 37/162 (22%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV----EAIKSFRHLCSKQAQESL 91
           E A+ L+ KA+       +A  ++A VL+QQ ++ EA+    EAI+             +
Sbjct: 400 EEAVQLYRKALEVFPDFAAAHSNLASVLQQQGKLQEALMHYEEAIR-------------I 446

Query: 92  DNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI 151
                D Y   G   ++++ ++  LR   +  A    P   A +H               
Sbjct: 447 SPTFADAYSNMGNTLKEMQDIQGALRC--YTRAIQINPA-FADAH--------------- 488

Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCL 192
             NL   +   GN   A   YR A  + PD   A CNL+HCL
Sbjct: 489 -SNLASIHKDSGNIPEAIASYRTALKLKPDFPDAYCNLAHCL 529


>gi|113477254|ref|YP_723315.1| hypothetical protein Tery_3796 [Trichodesmium erythraeum IMS101]
 gi|110168302|gb|ABG52842.1| Tetratricopeptide TPR_2 [Trichodesmium erythraeum IMS101]
          Length = 385

 Score = 37.0 bits (84), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 29/111 (26%), Positives = 50/111 (45%), Gaps = 3/111 (2%)

Query: 99  YKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQ--VTVKQETSRILGNLG 156
           Y++  R+ +Q   L  K   IY G A+     +  ++    +Q  + +    ++    LG
Sbjct: 78  YQEAIRIYQQAVRLDPKNATIYSGIAYLEAVQENFQAAALFYQQAIAIDPHNAKFQYGLG 137

Query: 157 WAYMQKGNYTSAEVVYRKAQLID-PDANKACNLSHCLIKQARYTEARSVLE 206
           +A  +  NY +A   YR+A L+D  + N    L   L +  +Y EAR   E
Sbjct: 138 YAMAKLENYQAAVEAYRRASLLDRNNVNTHLGLGAALFRLEKYGEARVAYE 188


>gi|304392377|ref|ZP_07374318.1| tetratricopeptide TPR_4 containing protein [Ahrensia sp. R2A130]
 gi|303295481|gb|EFL89840.1| tetratricopeptide TPR_4 containing protein [Ahrensia sp. R2A130]
          Length = 260

 Score = 37.0 bits (84), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 30/114 (26%), Positives = 54/114 (47%), Gaps = 9/114 (7%)

Query: 127 GKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC 186
           G P +    + K  Q+  K     +L NLG +Y+ +G+  ++E  Y K  +  P A+   
Sbjct: 148 GDPKQARLQYRKALQL--KPGEPSVLSNLGMSYLLEGDLRASET-YLKQAIGRPGADSRV 204

Query: 187 --NLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSI 238
             NL+  +  Q R+ EA    E V  G+L  +     +  ++++LQE   W+ +
Sbjct: 205 RQNLALVIGLQGRFPEA----EKVAAGELPPAEAQANVAFLRQMLQEQNAWKEL 254


>gi|168061951|ref|XP_001782948.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162665566|gb|EDQ52246.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 899

 Score = 37.0 bits (84), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 35/151 (23%), Positives = 64/151 (42%), Gaps = 30/151 (19%)

Query: 36  EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
           + A+V++  A++       A  ++ V+ K +D +D+AVE  +    +    +Q SL+N+ 
Sbjct: 292 DMAVVMYELALHFNPHCAEACNNLGVIYKDRDNLDKAVECYQMALQIKPNFSQ-SLNNLG 350

Query: 96  IDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
           + +Y   G++D    ++ K  L    + EA+N                           N
Sbjct: 351 V-VYTVQGKMDSAASMIEKAILANPSYAEAYN---------------------------N 382

Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
           LG  +   GN   A   Y +  +IDPD+  A
Sbjct: 383 LGVLHRDAGNIPLAIDAYERCLVIDPDSRNA 413


>gi|414075408|ref|YP_006994726.1| TPR repeat domain-containing protein [Anabaena sp. 90]
 gi|413968824|gb|AFW92913.1| TPR repeat domain-containing protein [Anabaena sp. 90]
          Length = 755

 Score = 37.0 bits (84), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 54/211 (25%), Positives = 87/211 (41%), Gaps = 22/211 (10%)

Query: 14  KLPPGDS-PYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEA 72
           +L P D+  YV   +V   +  P+ A+  + KAI    +   A   +   L  Q+++ EA
Sbjct: 92  QLNPNDAEAYVGIGNVLNAQGKPDEAVAAYRKAIEFDPKYAKAYNSLGNALYDQEKLKEA 151

Query: 73  VEAIK---SFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMI-YHGEAFNGK 128
           V A +    F H  +  A  +L NVL +       LDE +   ++ + +   +  A+N  
Sbjct: 152 VAAYRKAIEFDHKYAA-AYYNLGNVLYE----QKELDEAVAAYRKAIELNPKYATAYNN- 205

Query: 129 PTKTARSHGKKFQ--VTVKQETSRI-------LGNLGWAYMQKGNYTSAEVVYRKAQLID 179
               A S  KK    V   QE  ++         NLG A   +     A   Y+KA  +D
Sbjct: 206 -LGNALSDQKKLDEAVAAYQEAIKLNPKDATAYNNLGIALSDQKKLDEAVAAYQKAIELD 264

Query: 180 PD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
           P  A    NL + L  Q +  EA +  +  +
Sbjct: 265 PKYATAYYNLGNALSDQKKLDEAVAAYQKAI 295


>gi|333891999|ref|YP_004465874.1| hypothetical protein ambt_02590 [Alteromonas sp. SN2]
 gi|332992017|gb|AEF02072.1| tetratricopeptide TPR_2 [Alteromonas sp. SN2]
          Length = 928

 Score = 37.0 bits (84), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 37/73 (50%), Gaps = 1/73 (1%)

Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTE 200
           +++   T +IL   GW   ++GNYT A    RKA   D  D +   +L   L+KQ ++ +
Sbjct: 841 LSLSPNTPQILDTYGWILARQGNYTEALPNLRKAYSFDAVDPSIMYHLGFVLLKQGKHQQ 900

Query: 201 ARSVLEDVLLGKL 213
           A+  L + L   L
Sbjct: 901 AKVELANALASDL 913


>gi|398942265|ref|ZP_10670203.1| Flp pilus assembly protein TadD [Pseudomonas sp. GM41(2012)]
 gi|398160817|gb|EJM49072.1| Flp pilus assembly protein TadD [Pseudomonas sp. GM41(2012)]
          Length = 574

 Score = 37.0 bits (84), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 19/61 (31%), Positives = 29/61 (47%)

Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARS 203
           +  E   +L +LGW   + GN+  AE + R+A    PD   A +L   L    +  EAR 
Sbjct: 487 INPEDPAVLDSLGWVNFRMGNFDEAERLLRQALERFPDQEVAAHLGEVLWANGKQREARQ 546

Query: 204 V 204
           +
Sbjct: 547 I 547


>gi|326918804|ref|XP_003205676.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like,
           partial [Meleagris gallopavo]
          Length = 1038

 Score = 37.0 bits (84), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 75/198 (37%), Gaps = 45/198 (22%)

Query: 38  AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
           AI  + +AI        A  ++A  LK++  V EA E   +   LC   A +SL+N L +
Sbjct: 271 AIDTYRRAIELQPHFPDAYCNLANALKEKGSVAEAEECYNTALRLCPTHA-DSLNN-LAN 328

Query: 98  LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
           + ++ G ++E + L ++ L                          ++++ EA    PT  
Sbjct: 329 IKREQGNIEEAVRLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 388

Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
               + G   +               + +    +    NL   +   GN   A   YR A
Sbjct: 389 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 448

Query: 176 QLIDPDANKA-CNLSHCL 192
             + PD   A CNL+HCL
Sbjct: 449 LKLKPDFPDAYCNLAHCL 466


>gi|241998476|ref|XP_002433881.1| O-linked N-acetylglucosamine transferase, OGT, putative [Ixodes
           scapularis]
 gi|215495640|gb|EEC05281.1| O-linked N-acetylglucosamine transferase, OGT, putative [Ixodes
           scapularis]
          Length = 832

 Score = 37.0 bits (84), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 39/161 (24%), Positives = 73/161 (45%), Gaps = 25/161 (15%)

Query: 58  DMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLR 117
           ++ ++L++Q+R++EA+++ K       + A   L+  L+      GR DE +++ +    
Sbjct: 524 NLGILLQEQNRLEEALQSYKLAIQFRPRLAMAHLNMGLV--LGIMGRKDEAVDVYR---- 577

Query: 118 MIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI--LGNLGWAYMQKGNYTSAEVVYRKA 175
              H    +    K  ++H          E+++I  L NLG  Y  +G Y  A  VY++A
Sbjct: 578 ---HCAQLDSAGLKDPKTH----------ESTKISALFNLGRLYADEGKYKEAIRVYQEA 624

Query: 176 QLIDPDANKACNLSHCL----IKQARYTEARSVLEDVLLGK 212
               PD  +  +L + +     K   YTEA    ++ L  K
Sbjct: 625 VAKMPDHYQPQSLYNMMGEAYFKLGEYTEAERWYKEALRAK 665


>gi|195327213|ref|XP_002030316.1| GM24634 [Drosophila sechellia]
 gi|194119259|gb|EDW41302.1| GM24634 [Drosophila sechellia]
          Length = 498

 Score = 37.0 bits (84), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 35/174 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
           V + L  +A+V + Q++  EA   +     +  K   E       +L+N+ + LY K G+
Sbjct: 217 VATMLNILALVYRDQNKYKEAANLLNDALSIRGKTLGENHPAVAATLNNLAV-LYGKRGK 275

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             +   L K+ L +               +  GK      KQ     L NL      +G 
Sbjct: 276 YKDAEPLCKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGK 317

Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
           Y   E  Y++A  I        DP+  K   NL+ C +KQ RYTEA  + + VL
Sbjct: 318 YDEVEKYYQRALDIYESKLGPDDPNVAKTKNNLAGCYLKQGRYTEAEILYKQVL 371


>gi|17647555|ref|NP_524049.1| kinesin light chain, isoform A [Drosophila melanogaster]
 gi|442632011|ref|NP_001261780.1| kinesin light chain, isoform B [Drosophila melanogaster]
 gi|1170679|sp|P46824.1|KLC_DROME RecName: Full=Kinesin light chain; Short=KLC
 gi|157782|gb|AAA02481.1| kinesin light chain [Drosophila melanogaster]
 gi|157814|gb|AAA28669.1| kinesin light chain [Drosophila melanogaster]
 gi|16768986|gb|AAL28712.1| LD13018p [Drosophila melanogaster]
 gi|23093578|gb|AAF49890.2| kinesin light chain, isoform A [Drosophila melanogaster]
 gi|220943240|gb|ACL84163.1| Klc-PA [synthetic construct]
 gi|220953428|gb|ACL89257.1| Klc-PA [synthetic construct]
 gi|440215712|gb|AGB94473.1| kinesin light chain, isoform B [Drosophila melanogaster]
          Length = 508

 Score = 37.0 bits (84), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 35/174 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
           V + L  +A+V + Q++  EA   +     +  K   E       +L+N+ + LY K G+
Sbjct: 227 VATMLNILALVYRDQNKYKEAANLLNDALSIRGKTLGENHPAVAATLNNLAV-LYGKRGK 285

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             +   L K+ L +               +  GK      KQ     L NL      +G 
Sbjct: 286 YKDAEPLCKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGK 327

Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
           Y   E  Y++A  I        DP+  K   NL+ C +KQ RYTEA  + + VL
Sbjct: 328 YDEVEKYYQRALDIYESKLGPDDPNVAKTKNNLAGCYLKQGRYTEAEILYKQVL 381


>gi|348687602|gb|EGZ27416.1| hypothetical protein PHYSODRAFT_553979 [Phytophthora sojae]
          Length = 986

 Score = 37.0 bits (84), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 54/241 (22%), Positives = 97/241 (40%), Gaps = 22/241 (9%)

Query: 31  VEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCS--KQAQ 88
           V+   E A + + +AI A      A  ++A +LK+  +++ A++  +    L      A 
Sbjct: 184 VQGRVEDAKLCYEQAIRAKPSFAIAWSNLAGLLKEDGQLEAAIDHYREAIRLAPDFADAY 243

Query: 89  ESLDNVLIDLYKKCGRLDEQIELLKQKLRM-----IYHGE---AFNGKPTKTARSHGKKF 140
            +L N L    K+ GR++E I+  K  L++     I HG     +          H  + 
Sbjct: 244 SNLGNAL----KEAGRVEEAIQAYKSALQIRPNFAIAHGNLASCYYDAGQMELAIHTFRH 299

Query: 141 QVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHC-----LIK 194
            + ++        NLG A  + G    A   YR A  + PD   A  NL +      L+K
Sbjct: 300 AIQLEPNFPDAYNNLGNALRECGQLEQAVTCYRTALQLKPDHPHAYNNLGNALKDKGLVK 359

Query: 195 QAR--YTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDA 252
           +A   YT A  +L        +  +  K   ++ + L   +   +I P+     S++ + 
Sbjct: 360 EALHCYTTAARLLPQFAAAHSNIGSVLKEQGKLDQALAHYQQAITIDPNFADAYSNMGNV 419

Query: 253 F 253
           F
Sbjct: 420 F 420


>gi|194752059|ref|XP_001958340.1| GF10870 [Drosophila ananassae]
 gi|190625622|gb|EDV41146.1| GF10870 [Drosophila ananassae]
          Length = 508

 Score = 37.0 bits (84), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 35/174 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
           V + L  +A+V + Q++  EA   +     +  K   E       +L+N+ + LY K G+
Sbjct: 227 VATMLNILALVYRDQNKYKEAANLLNDALSIRGKTLGENHPAVAATLNNLAV-LYGKRGK 285

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             +   L K+ L +               +  GK      KQ     L NL      +G 
Sbjct: 286 YKDAEPLCKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGK 327

Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
           Y   E  Y++A  I        DP+  K   NL+ C +KQ RYTEA  + + VL
Sbjct: 328 YDEVEKYYQRALDIYESKLGPDDPNVAKTKNNLAGCYLKQGRYTEAEILYKQVL 381


>gi|195160695|ref|XP_002021210.1| GL24935 [Drosophila persimilis]
 gi|198464928|ref|XP_001353418.2| GA18878 [Drosophila pseudoobscura pseudoobscura]
 gi|194118323|gb|EDW40366.1| GL24935 [Drosophila persimilis]
 gi|198149940|gb|EAL30925.2| GA18878 [Drosophila pseudoobscura pseudoobscura]
          Length = 508

 Score = 37.0 bits (84), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 35/174 (20%)

Query: 52  VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
           V + L  +A+V + Q++  EA   +     +  K   E       +L+N+ + LY K G+
Sbjct: 227 VATMLNILALVYRDQNKYKEAANLLNDALSIRGKTLGENHPAVAATLNNLAV-LYGKRGK 285

Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
             +   L K+ L +               +  GK      KQ     L NL      +G 
Sbjct: 286 YKDAEPLCKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGK 327

Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
           Y   E  Y++A  I        DP+  K   NL+ C +KQ RYTEA  + + VL
Sbjct: 328 YDEVEKYYQRALDIYESKLGPDDPNVAKTKNNLAGCYLKQGRYTEAEILYKQVL 381


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.132    0.385 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,274,884,084
Number of Sequences: 23463169
Number of extensions: 163823994
Number of successful extensions: 540664
Number of sequences better than 100.0: 840
Number of HSP's better than 100.0 without gapping: 220
Number of HSP's successfully gapped in prelim test: 620
Number of HSP's that attempted gapping in prelim test: 538787
Number of HSP's gapped (non-prelim): 2141
length of query: 287
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 146
effective length of database: 9,050,888,538
effective search space: 1321429726548
effective search space used: 1321429726548
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)