BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023081
(287 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|118489433|gb|ABK96519.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 291
Score = 468 bits (1204), Expect = e-129, Method: Compositional matrix adjust.
Identities = 226/280 (80%), Positives = 244/280 (87%)
Query: 8 PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
PYHV+HKLPPGDSPYVRAKHVQLVEKDP AAI LFWKAINAGDRVDSALKDMAVV+KQQD
Sbjct: 12 PYHVLHKLPPGDSPYVRAKHVQLVEKDPVAAIALFWKAINAGDRVDSALKDMAVVMKQQD 71
Query: 68 RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
R +EA+EAIK+FR CSKQAQESLDNVLIDLYKKCG+++EQIELLKQKLRMI+ GEAFNG
Sbjct: 72 RAEEAIEAIKAFRDRCSKQAQESLDNVLIDLYKKCGKVEEQIELLKQKLRMIHQGEAFNG 131
Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
K TKTARSHG+KFQVTVKQETSRILGNLGWAYMQKGNY +AEVVYRKAQ IDPDANKACN
Sbjct: 132 KATKTARSHGRKFQVTVKQETSRILGNLGWAYMQKGNYLAAEVVYRKAQSIDPDANKACN 191
Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
L CLIKQ RY EA+ VL+D+ GKL GS E K+ NR +ELL ELE QS L S
Sbjct: 192 LGLCLIKQTRYAEAQVVLDDIFRGKLLGSDEPKSRNRAQELLCELETHQSSSMFLEPSGS 251
Query: 248 SLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
SLED F+EGLD L +QWTPYRSRRLPIFEEIS FRDQLAC
Sbjct: 252 SLEDVFVEGLDQLTSQWTPYRSRRLPIFEEISSFRDQLAC 291
>gi|224072650|ref|XP_002303823.1| predicted protein [Populus trichocarpa]
gi|222841255|gb|EEE78802.1| predicted protein [Populus trichocarpa]
Length = 291
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 225/280 (80%), Positives = 243/280 (86%)
Query: 8 PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
PYHV+HKLPPGDSPYVRAKHVQLVEKDP AAI LFWKAINAGDRVDSALKDMAVV+KQQD
Sbjct: 12 PYHVLHKLPPGDSPYVRAKHVQLVEKDPGAAIALFWKAINAGDRVDSALKDMAVVMKQQD 71
Query: 68 RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
R +EA+EAIK+FR CSKQAQESLDNVLIDLYKKCG ++EQIELLKQKLRMI+ GEAFNG
Sbjct: 72 RAEEAIEAIKAFRDRCSKQAQESLDNVLIDLYKKCGNIEEQIELLKQKLRMIHQGEAFNG 131
Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
K TKTARSHG+KFQVTVKQETSRILGNLGWAYMQKGNY +AEVVYRKAQ IDPDANKACN
Sbjct: 132 KATKTARSHGRKFQVTVKQETSRILGNLGWAYMQKGNYLAAEVVYRKAQSIDPDANKACN 191
Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
L CLIKQ RY EA++VL+D+ GKL GS E K+ NR +ELL ELE QS S
Sbjct: 192 LGLCLIKQTRYAEAQAVLDDIFRGKLLGSDEPKSRNRAQELLCELETSQSSSMFSEPSGS 251
Query: 248 SLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
SLED F+EGLD L +QWTPYRSRRLPIFEEIS FRDQLAC
Sbjct: 252 SLEDVFVEGLDQLTSQWTPYRSRRLPIFEEISSFRDQLAC 291
>gi|255554172|ref|XP_002518126.1| conserved hypothetical protein [Ricinus communis]
gi|223542722|gb|EEF44259.1| conserved hypothetical protein [Ricinus communis]
Length = 296
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 226/282 (80%), Positives = 250/282 (88%), Gaps = 5/282 (1%)
Query: 8 PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
P+HV+HKLPPGDSPYVRAKHVQLV KDPE AIVLFWKAINAGDRVDSALKDMA+V+KQQD
Sbjct: 18 PFHVIHKLPPGDSPYVRAKHVQLVHKDPEGAIVLFWKAINAGDRVDSALKDMAIVMKQQD 77
Query: 68 RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
R +EA+EAIKSFR CSKQAQESLDNVLIDLYKKCG+++EQIELLKQKLRMIY GEAFNG
Sbjct: 78 RAEEAIEAIKSFRDRCSKQAQESLDNVLIDLYKKCGKIEEQIELLKQKLRMIYEGEAFNG 137
Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
K TKTARSHGKKFQVT+KQETSRILGNLGWAYMQ+G+Y +AEVVYRKAQ IDPDANKACN
Sbjct: 138 KRTKTARSHGKKFQVTIKQETSRILGNLGWAYMQQGSYLAAEVVYRKAQSIDPDANKACN 197
Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
L CLIKQ RYTEA SVL DVL KLSGS + K+ +RVKELL ELE QS S+ T S
Sbjct: 198 LCLCLIKQTRYTEAHSVLNDVLQSKLSGSEDPKSRSRVKELLHELETCQS---SVVTSPS 254
Query: 248 --SLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
SLEDAF+EGL+ LM+QW+P+RS+RLPIFEEI+P RDQLAC
Sbjct: 255 GLSLEDAFVEGLEQLMSQWSPHRSKRLPIFEEITPCRDQLAC 296
>gi|225463848|ref|XP_002265262.1| PREDICTED: uncharacterized protein LOC100244941 [Vitis vinifera]
gi|296088770|emb|CBI38220.3| unnamed protein product [Vitis vinifera]
Length = 292
Score = 459 bits (1181), Expect = e-127, Method: Compositional matrix adjust.
Identities = 219/283 (77%), Positives = 246/283 (86%), Gaps = 4/283 (1%)
Query: 5 RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
+ +PYHV+HKLPPGD+PYVRAKH+QLVEKDPEAAI LFW+AIN GDRVDSALKDMAVV+K
Sbjct: 14 KQEPYHVIHKLPPGDTPYVRAKHLQLVEKDPEAAIALFWEAINVGDRVDSALKDMAVVMK 73
Query: 65 QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
QQ+RV+EA+EAIKSFR CSKQAQESLDNVL+DLYKKCG+L+EQIELLKQKLRMIY GEA
Sbjct: 74 QQNRVEEAIEAIKSFRDRCSKQAQESLDNVLMDLYKKCGKLEEQIELLKQKLRMIYQGEA 133
Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
FN KPTKTARSHG+KFQVT++QETSRILGNLGWAYMQ+ NY +AEVVYR+AQ IDPDANK
Sbjct: 134 FNRKPTKTARSHGRKFQVTIQQETSRILGNLGWAYMQQTNYATAEVVYREAQTIDPDANK 193
Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
ACNL CLIKQARY EARSVLEDVL GK GS + K+ NR ELL+E+EPWQS +T
Sbjct: 194 ACNLGLCLIKQARYDEARSVLEDVLHGKFCGSNDPKSRNRAHELLEEVEPWQSEAVFPST 253
Query: 245 KKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
+ SLED F E LM Q +P R+RRLPIFEEISPFRDQLAC
Sbjct: 254 SEVSLEDCFFE----LMKQSSPSRTRRLPIFEEISPFRDQLAC 292
>gi|147809817|emb|CAN73753.1| hypothetical protein VITISV_002585 [Vitis vinifera]
Length = 292
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 219/283 (77%), Positives = 245/283 (86%), Gaps = 4/283 (1%)
Query: 5 RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
+ +PYHV+HKLPPGD+PYVRAKH QLVEKDPEAAI LFW+AIN GDRVDSALKDMAVV+K
Sbjct: 14 KQEPYHVIHKLPPGDTPYVRAKHXQLVEKDPEAAIALFWEAINVGDRVDSALKDMAVVMK 73
Query: 65 QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
QQ+RV+EA+EAIKSFR CSKQAQESLDNVL+DLYKKCG+L+EQIELLKQKLRMIY GEA
Sbjct: 74 QQNRVEEAIEAIKSFRDRCSKQAQESLDNVLMDLYKKCGKLEEQIELLKQKLRMIYQGEA 133
Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
FN KPTKTARSHG+KFQVT++QETSRILGNLGWAYMQ+ NY +AEVVYR+AQ IDPDANK
Sbjct: 134 FNRKPTKTARSHGRKFQVTIQQETSRILGNLGWAYMQQTNYATAEVVYREAQTIDPDANK 193
Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
ACNL CLIKQARY EARSVLEDVL GK GS + K+ NR ELL+E+EPWQS +T
Sbjct: 194 ACNLGLCLIKQARYDEARSVLEDVLHGKFCGSNDPKSRNRAHELLEEVEPWQSEAVFPST 253
Query: 245 KKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
+ SLED F E LM Q +P R+RRLPIFEEISPFRDQLAC
Sbjct: 254 SEVSLEDCFFE----LMKQSSPSRTRRLPIFEEISPFRDQLAC 292
>gi|224057581|ref|XP_002299278.1| predicted protein [Populus trichocarpa]
gi|222846536|gb|EEE84083.1| predicted protein [Populus trichocarpa]
Length = 292
Score = 452 bits (1162), Expect = e-124, Method: Compositional matrix adjust.
Identities = 219/282 (77%), Positives = 243/282 (86%), Gaps = 2/282 (0%)
Query: 8 PYH-VVHKLPP-GDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQ 65
PYH V+HKLPP GDSPYVRAKH QLVEKDPEAAI LFWKAINAGDRVDSALKDMAVV+KQ
Sbjct: 11 PYHHVIHKLPPPGDSPYVRAKHAQLVEKDPEAAIALFWKAINAGDRVDSALKDMAVVMKQ 70
Query: 66 QDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAF 125
QDR +EA+EA+K+FR CSKQAQESLDNVLIDLYKKCG+++EQI+LLKQKL+MI+ GEAF
Sbjct: 71 QDRAEEAIEAVKAFRDRCSKQAQESLDNVLIDLYKKCGKIEEQIDLLKQKLKMIHQGEAF 130
Query: 126 NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
N K TKTARSHG+KFQVTVKQETSRILGNLGWAYMQ+GNY +AE VY KAQ DPDANKA
Sbjct: 131 NRKATKTARSHGRKFQVTVKQETSRILGNLGWAYMQRGNYMAAEAVYHKAQSFDPDANKA 190
Query: 186 CNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTK 245
CNL CLIKQ RY EA+SV++D+L GKL GS E K+ NR +ELL+ELE QS
Sbjct: 191 CNLCLCLIKQTRYAEAKSVIDDILQGKLLGSDEPKSRNRAQELLRELETCQSSSMFSEPS 250
Query: 246 KSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
SLEDAF+EGLD LM+QWTPYRSRRLPIFEEISPF DQLAC
Sbjct: 251 GLSLEDAFVEGLDQLMSQWTPYRSRRLPIFEEISPFMDQLAC 292
>gi|225464225|ref|XP_002265748.1| PREDICTED: uncharacterized protein LOC100252818 [Vitis vinifera]
gi|297744303|emb|CBI37273.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 211/283 (74%), Positives = 245/283 (86%), Gaps = 5/283 (1%)
Query: 8 PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
P+ V+HKLP GD+PYVRAKH QLVEKDPE AIVLFWKAINAGDRVDSALKDMAVV+KQ D
Sbjct: 17 PFCVLHKLPSGDTPYVRAKHAQLVEKDPEVAIVLFWKAINAGDRVDSALKDMAVVMKQLD 76
Query: 68 RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
R +EA+EAIKSFR LCSKQAQESLDNVLIDLYKKCGR+DEQI+LLKQKLR+IY GEAFNG
Sbjct: 77 RTEEAIEAIKSFRSLCSKQAQESLDNVLIDLYKKCGRIDEQIDLLKQKLRLIYQGEAFNG 136
Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
KPTKTARSHGKKFQV++KQETSRILGNLGWAYMQK NY +AEVVY+KAQ+IDPDANKACN
Sbjct: 137 KPTKTARSHGKKFQVSIKQETSRILGNLGWAYMQKSNYMAAEVVYKKAQMIDPDANKACN 196
Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEP-WQSIPPSLTTK- 245
L+ CLIKQAR +EARS+L +VL G + GS + K NR +EL+ E+EP W +PPS K
Sbjct: 197 LALCLIKQARNSEARSILNEVLQGNIPGSEDCKAQNRAQELMVEVEPRW--LPPSTKMKI 254
Query: 246 -KSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
+ LED ++GL+ L+N+W P R++RLPIFEEIS +R+QLAC
Sbjct: 255 ARFDLEDDLIDGLEKLLNEWAPSRTKRLPIFEEISSYRNQLAC 297
>gi|147814975|emb|CAN74743.1| hypothetical protein VITISV_025585 [Vitis vinifera]
Length = 297
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 211/283 (74%), Positives = 245/283 (86%), Gaps = 5/283 (1%)
Query: 8 PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
P+ V+HKLP GD+PYVRAKH QLVEKDPE AIVLFWKAINAGDRVDSALKDMAVV+KQ D
Sbjct: 17 PFCVLHKLPSGDTPYVRAKHAQLVEKDPEVAIVLFWKAINAGDRVDSALKDMAVVMKQLD 76
Query: 68 RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
R +EA+EAIKSFR LCSKQAQESLDNVLIDLYKKCGR+DEQI+LLKQKLR+IY GEAFNG
Sbjct: 77 RTEEAIEAIKSFRSLCSKQAQESLDNVLIDLYKKCGRIDEQIDLLKQKLRLIYQGEAFNG 136
Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
KPTKTARSHGKKFQV++KQETSRILGNLGWAYMQK NY +AEVVY+KAQ+IDPDANKACN
Sbjct: 137 KPTKTARSHGKKFQVSIKQETSRILGNLGWAYMQKSNYMAAEVVYKKAQMIDPDANKACN 196
Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEP-WQSIPPSLTTK- 245
L+ CLIKQAR +EARS+L +VL G + GS + K NR +EL+ E+EP W +PPS K
Sbjct: 197 LALCLIKQARNSEARSILNEVLQGNIPGSEDCKAQNRAQELMLEVEPRW--LPPSTKMKI 254
Query: 246 -KSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
+ LED ++GL+ L+N+W P R++RLPIFEEIS +R+QLAC
Sbjct: 255 ARFDLEDDLIDGLEKLLNEWAPSRTKRLPIFEEISSYRNQLAC 297
>gi|356567884|ref|XP_003552145.1| PREDICTED: uncharacterized protein LOC100805188 [Glycine max]
Length = 296
Score = 430 bits (1105), Expect = e-118, Method: Compositional matrix adjust.
Identities = 206/283 (72%), Positives = 239/283 (84%), Gaps = 2/283 (0%)
Query: 5 RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
+D YHV+HK+P GDSPYVRAKH QLVEKDPEAAIVLFWKAIN+GD+VDSALKDMAVV+K
Sbjct: 16 KDDLYHVLHKVPYGDSPYVRAKHAQLVEKDPEAAIVLFWKAINSGDKVDSALKDMAVVMK 75
Query: 65 QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
Q DR DEA+EAI+SFR LCSKQ+QESLDNVLIDLYKKCG++DEQIE+LK+KL++IY GEA
Sbjct: 76 QLDRSDEAIEAIRSFRSLCSKQSQESLDNVLIDLYKKCGKIDEQIEMLKRKLKLIYQGEA 135
Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
FNGK TKTARSHGKKFQV++KQETSR+LGNLGWAYMQK NY AEVVYRKAQ+IDPD NK
Sbjct: 136 FNGKLTKTARSHGKKFQVSIKQETSRLLGNLGWAYMQKMNYMMAEVVYRKAQIIDPDCNK 195
Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
ACNL CLI+QARY EA+ VLEDVL G L GS ++K R ++L EL +PP +
Sbjct: 196 ACNLGLCLIRQARYEEAQLVLEDVLKGNLPGSDDSKARKRAQDLRTELRSM--LPPPHFS 253
Query: 245 KKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
L+D F++GL+ LMN+W P RS+RLPIFEEIS FRDQLAC
Sbjct: 254 DLLGLDDEFIKGLEQLMNEWGPIRSKRLPIFEEISSFRDQLAC 296
>gi|449432444|ref|XP_004134009.1| PREDICTED: uncharacterized protein LOC101216846 [Cucumis sativus]
gi|449487532|ref|XP_004157673.1| PREDICTED: uncharacterized protein LOC101223392 [Cucumis sativus]
Length = 298
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 212/291 (72%), Positives = 242/291 (83%), Gaps = 8/291 (2%)
Query: 3 QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVV 62
Q + P+HVVHKLP GDSPYVRAKHVQLV+KDPEAAIVLFWKAINAGDRVDSALKDMAVV
Sbjct: 10 QNLETPFHVVHKLPAGDSPYVRAKHVQLVQKDPEAAIVLFWKAINAGDRVDSALKDMAVV 69
Query: 63 LKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHG 122
+KQQDR +EA+EAI SFR CSKQAQESLDNVLIDLYKKCGR++EQI+LLKQKLRMI G
Sbjct: 70 MKQQDRAEEAIEAINSFRDRCSKQAQESLDNVLIDLYKKCGRVEEQIDLLKQKLRMINQG 129
Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
EAFNGK TKTARSHGKKFQVT++QETSRILGNLGWAYMQ+ N+ +AEVVY+KAQ+IDPDA
Sbjct: 130 EAFNGKATKTARSHGKKFQVTIRQETSRILGNLGWAYMQQENHKAAEVVYQKAQIIDPDA 189
Query: 183 NKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSL 242
NKACNLS CL+KQARY+EAR+VLE VL K+ GS + K+ R +EL++ELE +S L
Sbjct: 190 NKACNLSLCLMKQARYSEARAVLEQVLHDKVGGSNDQKSRKRAEELMKELEEAESANKLL 249
Query: 243 TTKKSS-----LEDAFLEGLDDLMNQWTPYR-SRRLPIFEEISPFRDQLAC 287
SS +D F+ L + NQ +P R SRRLPIFEEIS FRDQLAC
Sbjct: 250 MMGLSSGGSEDYDDGFINQL--VTNQRSPLRSSRRLPIFEEISQFRDQLAC 298
>gi|225464227|ref|XP_002265934.1| PREDICTED: uncharacterized protein LOC100254562 [Vitis vinifera]
gi|297744304|emb|CBI37274.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 427 bits (1099), Expect = e-117, Method: Compositional matrix adjust.
Identities = 201/279 (72%), Positives = 239/279 (85%), Gaps = 1/279 (0%)
Query: 8 PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
P+ V+HK+P GD+PYVRAKH QLVEKDPE AIVLFWKAINAGDR+DSALKDMAVV+KQ +
Sbjct: 17 PFCVIHKVPSGDTPYVRAKHAQLVEKDPETAIVLFWKAINAGDRIDSALKDMAVVMKQLN 76
Query: 68 RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
R +EA+EAIKSFR LCSKQAQE LDNVLIDLYKKCGR+D+QI+LLKQKLR+IY GEAFNG
Sbjct: 77 RTEEAIEAIKSFRGLCSKQAQEPLDNVLIDLYKKCGRMDQQIDLLKQKLRLIYQGEAFNG 136
Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
KPTKTARSHGKKFQV++KQETSRILGNLGWAYMQK + +AEVVY+KAQ+IDPDANKACN
Sbjct: 137 KPTKTARSHGKKFQVSIKQETSRILGNLGWAYMQKSKFIAAEVVYKKAQMIDPDANKACN 196
Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEP-WQSIPPSLTTKK 246
L+ CLIKQAR EA VL +VL GKL GS + +T NR +EL+ ELEP W P ++ +
Sbjct: 197 LARCLIKQARNVEAHLVLNEVLQGKLLGSKDCQTQNRAQELMLELEPKWLPPPETVKITR 256
Query: 247 SSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQL 285
LED F++ L+ L+N+W+P R++RLPIFEEIS +R+QL
Sbjct: 257 FDLEDDFIDELEKLLNEWSPLRTKRLPIFEEISSYRNQL 295
>gi|147783587|emb|CAN61444.1| hypothetical protein VITISV_044239 [Vitis vinifera]
Length = 297
Score = 426 bits (1095), Expect = e-117, Method: Compositional matrix adjust.
Identities = 202/279 (72%), Positives = 237/279 (84%), Gaps = 1/279 (0%)
Query: 8 PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
P+ V+HK+P GD+PYVRAKH QLVEKDPE AIVLFWKAINAGDRVDSALKDMAVV+KQ +
Sbjct: 17 PFCVIHKVPSGDTPYVRAKHAQLVEKDPETAIVLFWKAINAGDRVDSALKDMAVVMKQLN 76
Query: 68 RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
R +EA+EAIKSFR LCSKQAQE LDNVLIDLYKKCGR+D+QI+LLKQKLR+IY GEAFNG
Sbjct: 77 RTEEAIEAIKSFRGLCSKQAQEPLDNVLIDLYKKCGRMDQQIDLLKQKLRLIYQGEAFNG 136
Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
KPTKTARSHGKKFQV++KQETSRILGNLGWAYMQK + +AEVVY+KAQ+IDPDANKACN
Sbjct: 137 KPTKTARSHGKKFQVSIKQETSRILGNLGWAYMQKSKFMAAEVVYKKAQMIDPDANKACN 196
Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEP-WQSIPPSLTTKK 246
L+ CLIKQAR EA VL +VL GKL GS + T NR +EL+ ELEP W P ++
Sbjct: 197 LARCLIKQARNVEAHLVLNEVLQGKLLGSEDCXTQNRAQELMLELEPKWLPPPETVKITG 256
Query: 247 SSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQL 285
LED F++ L+ L+N+W+P R++RLPIFEEIS +R+QL
Sbjct: 257 FDLEDDFIDELEKLLNEWSPLRTKRLPIFEEISSYRNQL 295
>gi|115464761|ref|NP_001055980.1| Os05g0506000 [Oryza sativa Japonica Group]
gi|52353366|gb|AAU43934.1| unknown protein [Oryza sativa Japonica Group]
gi|113579531|dbj|BAF17894.1| Os05g0506000 [Oryza sativa Japonica Group]
gi|215692779|dbj|BAG88193.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704367|dbj|BAG93801.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 299
Score = 423 bits (1088), Expect = e-116, Method: Compositional matrix adjust.
Identities = 196/279 (70%), Positives = 239/279 (85%), Gaps = 1/279 (0%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HVVHK+P GDSPYVRAKH+QLV+KDPE AIV FWKAIN+ D+VDSALKDMAVV+KQQDR
Sbjct: 22 FHVVHKVPAGDSPYVRAKHLQLVDKDPETAIVWFWKAINSRDKVDSALKDMAVVMKQQDR 81
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
EA+EAI+SFRHLCS+QAQESLDN+LIDLYKKCG++DEQI+LLKQKL+MIY GEAFNGK
Sbjct: 82 AKEAIEAIRSFRHLCSRQAQESLDNLLIDLYKKCGKVDEQIDLLKQKLKMIYLGEAFNGK 141
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TKTARSHGKKFQV+++QETSRILGNLGWAYMQ+ NY++AE+VYRKAQ I+PDAN+ACNL
Sbjct: 142 ATKTARSHGKKFQVSIQQETSRILGNLGWAYMQQSNYSAAELVYRKAQSIEPDANRACNL 201
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
CLIKQ+R+ EAR VL DV+L ++SGS + K + R K+LL ELEP + S S
Sbjct: 202 GLCLIKQSRHDEARQVLHDVVLRRISGSEDDKVVARAKQLLHELEPVTHV-TSPNNAGLS 260
Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
+ + +E LD ++N+WTP+RSRRLP+FEEI+ RDQ+AC
Sbjct: 261 VSEEIMERLDLVLNEWTPFRSRRLPVFEEIATLRDQIAC 299
>gi|388491310|gb|AFK33721.1| unknown [Lotus japonicus]
Length = 301
Score = 423 bits (1087), Expect = e-116, Method: Compositional matrix adjust.
Identities = 200/283 (70%), Positives = 236/283 (83%), Gaps = 2/283 (0%)
Query: 5 RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
+D YHV+HK+P GD+PYVRAKH QLVEKDPE AI LFW+AIN GD+VDSALKDMAVV+K
Sbjct: 21 KDDSYHVLHKVPYGDTPYVRAKHAQLVEKDPEGAIALFWRAINTGDKVDSALKDMAVVMK 80
Query: 65 QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
Q DR +EA+EAI SFR LCSKQ+QESLDNVLIDLYKKCG++DEQIELLK+KL++IY GEA
Sbjct: 81 QLDRSEEAIEAICSFRGLCSKQSQESLDNVLIDLYKKCGKIDEQIELLKRKLKLIYEGEA 140
Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
FNGK TKTARSHGKKFQV++KQETSR+LGNLGWAYMQK NY AE VYRKAQ+IDPD NK
Sbjct: 141 FNGKLTKTARSHGKKFQVSIKQETSRLLGNLGWAYMQKMNYVMAEAVYRKAQMIDPDCNK 200
Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
ACNL CLIKQARY EA++++EDVL G GS ++K+ R ++LL EL +PP +
Sbjct: 201 ACNLGLCLIKQARYEEAQTIIEDVLRGNYPGSDDSKSKKRAQDLLTELRSM--LPPPHFS 258
Query: 245 KKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
L+D F++GL+ L+N+W P RSRRLPIFEEIS FRDQLAC
Sbjct: 259 DLLGLDDEFIKGLEQLINEWGPIRSRRLPIFEEISSFRDQLAC 301
>gi|3859116|gb|AAC72543.1| unknown [Oryza sativa Japonica Group]
Length = 321
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 195/279 (69%), Positives = 239/279 (85%), Gaps = 1/279 (0%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HVVHK+P G+SPYVRAKH+QLV+KDPE AIV FWKAIN+ D+VDSALKDMAVV+KQQDR
Sbjct: 44 FHVVHKVPAGNSPYVRAKHLQLVDKDPETAIVWFWKAINSRDKVDSALKDMAVVMKQQDR 103
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
EA+EAI+SFRHLCS+QAQESLDN+LIDLYKKCG++DEQI+LLKQKL+MIY GEAFNGK
Sbjct: 104 AKEAIEAIRSFRHLCSRQAQESLDNLLIDLYKKCGKVDEQIDLLKQKLKMIYLGEAFNGK 163
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TKTARSHGKKFQV+++QETSRILGNLGWAYMQ+ NY++AE+VYRKAQ I+PDAN+ACNL
Sbjct: 164 ATKTARSHGKKFQVSIQQETSRILGNLGWAYMQQSNYSAAELVYRKAQSIEPDANRACNL 223
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
CLIKQ+R+ EAR VL DV+L ++SGS + K + R K+LL ELEP + S S
Sbjct: 224 GLCLIKQSRHDEARQVLHDVVLRRISGSEDDKVVARAKQLLHELEPVTHV-TSPNNAGLS 282
Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
+ + +E LD ++N+WTP+RSRRLP+FEEI+ RDQ+AC
Sbjct: 283 VSEEIMERLDLVLNEWTPFRSRRLPVFEEIATLRDQIAC 321
>gi|224065324|ref|XP_002301774.1| predicted protein [Populus trichocarpa]
gi|222843500|gb|EEE81047.1| predicted protein [Populus trichocarpa]
Length = 309
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 200/282 (70%), Positives = 239/282 (84%), Gaps = 4/282 (1%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HV+HK+P GD PYV+AKH QLV+KDPEAAIV FWKAINAGD+VDSALKDMAVV+KQ DR
Sbjct: 29 FHVIHKVPAGDGPYVKAKHAQLVQKDPEAAIVWFWKAINAGDKVDSALKDMAVVMKQIDR 88
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
+EA+EA+KSFR CSKQAQESLDNVLIDLYKKCG+++EQIEL+K+KLR+IY GEAFNGK
Sbjct: 89 TEEAIEAVKSFRGRCSKQAQESLDNVLIDLYKKCGKVEEQIELIKRKLRLIYQGEAFNGK 148
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
PTKTARSHGKKFQV+VKQETSR+LGNLGWAYMQK N+ +AEVVY+KAQ+IDPDANKACNL
Sbjct: 149 PTKTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNFIAAEVVYQKAQMIDPDANKACNL 208
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS- 247
+ CLIKQARY EARSVL DV G+L GS + K+ +R +ELL E++ ++ P LT
Sbjct: 209 ALCLIKQARYDEARSVLLDVSQGRLPGSGDVKSRSRAEELLMEVKSRET-PDELTDMLGF 267
Query: 248 --SLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
+D F++GL+ LM++W P RS+RLPIFEEIS FRD L C
Sbjct: 268 NLDEDDDFVKGLEKLMSEWAPSRSKRLPIFEEISSFRDPLTC 309
>gi|357505367|ref|XP_003622972.1| Male sterility 5 family protein [Medicago truncatula]
gi|355497987|gb|AES79190.1| Male sterility 5 family protein [Medicago truncatula]
Length = 300
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 197/279 (70%), Positives = 235/279 (84%), Gaps = 2/279 (0%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
YHV++K+P GDSPYVRAKH QLVEKDPEAAIVLFWKAINAGD+VDSALKDMAVV+KQ DR
Sbjct: 24 YHVLYKVPYGDSPYVRAKHAQLVEKDPEAAIVLFWKAINAGDKVDSALKDMAVVMKQLDR 83
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
+EA+EAI SFR LCSKQ+QESLDNVLIDLYKKCG++DE+I+LLKQKL++IY GEAFNGK
Sbjct: 84 SEEAIEAISSFRGLCSKQSQESLDNVLIDLYKKCGKIDEEIDLLKQKLKLIYQGEAFNGK 143
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TKTARSHGKKFQV++KQETSR+LGNLGWAYMQK NY AE VYRKAQ+IDPD NKACNL
Sbjct: 144 LTKTARSHGKKFQVSIKQETSRLLGNLGWAYMQKMNYVMAEAVYRKAQMIDPDCNKACNL 203
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
CLI+QARY EA+ +++D+L G+L GS + K+ R ++LL+EL P L
Sbjct: 204 GLCLIRQARYEEAQLIIDDILKGELPGSDDIKSKKRAQDLLEELRSLLPTPCPLDI--LV 261
Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
++D F++G++ LMN+W P RS+RLPIFEEIS RDQLAC
Sbjct: 262 MDDEFIKGIEQLMNEWGPVRSKRLPIFEEISSCRDQLAC 300
>gi|242088403|ref|XP_002440034.1| hypothetical protein SORBIDRAFT_09g024870 [Sorghum bicolor]
gi|241945319|gb|EES18464.1| hypothetical protein SORBIDRAFT_09g024870 [Sorghum bicolor]
Length = 298
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 193/279 (69%), Positives = 234/279 (83%), Gaps = 1/279 (0%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HVVHK+P GDSPYV AKH+QLVEK P+ AIV FWKAIN+GDRVDSALKDMAVV+KQQDR
Sbjct: 21 FHVVHKVPAGDSPYVVAKHLQLVEKQPDMAIVWFWKAINSGDRVDSALKDMAVVMKQQDR 80
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
+EA+EAI+SFRHLCSKQAQESLDN+LIDLYKKCG+++EQIELLKQKL+MIY GEAFNGK
Sbjct: 81 SEEAIEAIRSFRHLCSKQAQESLDNLLIDLYKKCGKVEEQIELLKQKLKMIYLGEAFNGK 140
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TK ARSHGKKFQV+++QETSRILGNLGWAYMQ+ N+ +AE+VYRKAQ I+PDAN+ACNL
Sbjct: 141 ATKKARSHGKKFQVSIQQETSRILGNLGWAYMQQNNFEAAELVYRKAQTIEPDANRACNL 200
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
CLIKQ R+ EAR LEDV L ++ GS + K + R ++LL+EL P + + S
Sbjct: 201 GLCLIKQGRHEEARQALEDVRLRRIYGSEDEKVVARAEQLLRELNPLKCVSSPFEVGLSV 260
Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
E+ + LD +MN+WTP+RSRRLP+FEEI+ FRDQ+AC
Sbjct: 261 HEE-IMGKLDLMMNEWTPFRSRRLPVFEEIATFRDQMAC 298
>gi|356567886|ref|XP_003552146.1| PREDICTED: uncharacterized protein LOC100805731 [Glycine max]
Length = 296
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 194/283 (68%), Positives = 236/283 (83%), Gaps = 2/283 (0%)
Query: 5 RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
+D YHV+HK+P GDSPYV+AKH QLV+KDPEAAIVLFWKAINAGD+VDSALKDMAVV+K
Sbjct: 16 KDDLYHVIHKVPYGDSPYVKAKHAQLVDKDPEAAIVLFWKAINAGDKVDSALKDMAVVMK 75
Query: 65 QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
Q DR +EA+EAIKSFR LCSK +QESLDNVL+DLYKKCG+++EQIELLK+KLR+IY GEA
Sbjct: 76 QLDRSEEAIEAIKSFRGLCSKHSQESLDNVLLDLYKKCGKIEEQIELLKRKLRLIYQGEA 135
Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
FNG+ T+TARSHGKKFQV++KQET+R+LGNLGWAYMQK NY AEVV++KAQ++D DANK
Sbjct: 136 FNGRTTRTARSHGKKFQVSIKQETARLLGNLGWAYMQKENYMMAEVVFKKAQMVDADANK 195
Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
ACNL CL++Q RY EA +LE+VL GK GS E K+ R +ELL+EL ++P
Sbjct: 196 ACNLGLCLMRQCRYEEAYYILEEVLQGKHQGSDEIKSRKRAEELLEELNA--NLPQPEFM 253
Query: 245 KKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
L+D F++G+DD+++ W RSRRLPIFEEIS FRDQLAC
Sbjct: 254 ADLDLDDDFVKGIDDMLDAWNTNRSRRLPIFEEISSFRDQLAC 296
>gi|357133078|ref|XP_003568155.1| PREDICTED: uncharacterized protein LOC100821103 [Brachypodium
distachyon]
Length = 293
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 194/279 (69%), Positives = 234/279 (83%), Gaps = 4/279 (1%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HVVHK+P GDSPYVRAKH+QLVEKD EAAIV FWKAIN+GDRVDSALKDMAVV+KQQDR
Sbjct: 19 FHVVHKVPAGDSPYVRAKHLQLVEKDAEAAIVWFWKAINSGDRVDSALKDMAVVMKQQDR 78
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
+EA+EAI+SFRHLCS+ AQESLDN+LIDLYKKCG+++EQIELLKQKL+MI+ GEAFNGK
Sbjct: 79 AEEAIEAIRSFRHLCSRHAQESLDNLLIDLYKKCGKVEEQIELLKQKLKMIHLGEAFNGK 138
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TKTARSHGKKFQV+++QE SRILGNLGWAYMQ+ NY +AE+VYRKAQ I PDAN+ACNL
Sbjct: 139 ITKTARSHGKKFQVSIQQEMSRILGNLGWAYMQQSNYEAAELVYRKAQTIVPDANRACNL 198
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
CLIKQ R EA +L+DVLLG++SG + + + R ++LL EL P S P + S
Sbjct: 199 GLCLIKQGRNDEATRILQDVLLGRISGLDDNRAVARAEQLLSELGPMTS-PFDVDL---S 254
Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
+ +E LD +MN+WTP+RSRRLP+FEE+ FRDQ+AC
Sbjct: 255 FTEEIMERLDLVMNEWTPFRSRRLPVFEELDAFRDQMAC 293
>gi|255560193|ref|XP_002521114.1| conserved hypothetical protein [Ricinus communis]
gi|223539683|gb|EEF41265.1| conserved hypothetical protein [Ricinus communis]
Length = 291
Score = 413 bits (1061), Expect = e-113, Method: Compositional matrix adjust.
Identities = 193/283 (68%), Positives = 236/283 (83%), Gaps = 6/283 (2%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HV++K+P GD PYV+AKH QLV+KDPEAAIV FWKAINAGDRVDSALKDMAVV+KQ DR
Sbjct: 11 FHVIYKVPSGDGPYVKAKHAQLVQKDPEAAIVWFWKAINAGDRVDSALKDMAVVMKQVDR 70
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
+EA+EAIKSFR CS+ AQESLDNVLIDLYKKCG+++EQI+LLK+KLR+IY GEAFNGK
Sbjct: 71 TEEAIEAIKSFRGRCSRNAQESLDNVLIDLYKKCGKVEEQIDLLKRKLRLIYQGEAFNGK 130
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
PTKTARSHGKKFQV+V+QETSR+LGNLGWAYMQK N+ +AEVVY+KAQ+IDPDANKA NL
Sbjct: 131 PTKTARSHGKKFQVSVEQETSRLLGNLGWAYMQKSNFMAAEVVYKKAQMIDPDANKAYNL 190
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
CLI+QARY EAR +L++VL G+ GS + K+ R +ELL E+E +PP T +
Sbjct: 191 GFCLIRQARYDEARQILQNVLEGRFPGSNDCKSRKRAQELLMEME--SKLPPPELTNRIG 248
Query: 249 L----EDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
+ +D F++G++ +MN+W P R +RLPIFEEIS RDQLAC
Sbjct: 249 INVDGDDDFVKGIEQMMNKWAPSRPKRLPIFEEISSLRDQLAC 291
>gi|195640432|gb|ACG39684.1| pollenless3 [Zea mays]
Length = 298
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 191/279 (68%), Positives = 230/279 (82%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HVVHK+P GDSPYV AKH+QLVEK P+ AIV FWKAIN+GDRVDSALKDMA+V+KQQDR
Sbjct: 20 FHVVHKVPAGDSPYVVAKHLQLVEKQPDVAIVWFWKAINSGDRVDSALKDMAMVMKQQDR 79
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
+EA+EAI+SFRHLCSKQAQESLDN+LIDLYKKCG+++EQIELLKQKL+ IY GEAFNGK
Sbjct: 80 SEEAIEAIRSFRHLCSKQAQESLDNLLIDLYKKCGKVEEQIELLKQKLKSIYLGEAFNGK 139
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TK ARSHGKKFQV+++QETSRILGNLGWAYMQ+ N+ +AE+VYRKAQ I+PDAN+ACNL
Sbjct: 140 ATKKARSHGKKFQVSIQQETSRILGNLGWAYMQQNNFEAAELVYRKAQAIEPDANRACNL 199
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
CLIKQ R+ EAR LEDV L ++ GS + K + R ++LL+EL P Q + S
Sbjct: 200 GLCLIKQGRHEEARQALEDVRLRRIYGSEDAKVVARAEQLLRELNPLQCVSSPFQVGLSV 259
Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
E E +MN+WTP+RSRRLP+FEEI+ FRDQ+AC
Sbjct: 260 HEGIMGEPDLVVMNEWTPFRSRRLPVFEEIATFRDQMAC 298
>gi|356524529|ref|XP_003530881.1| PREDICTED: uncharacterized protein LOC100788542 [Glycine max]
Length = 297
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 193/284 (67%), Positives = 237/284 (83%), Gaps = 3/284 (1%)
Query: 5 RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
+D YHV+HK+P GDSPYV+AKH QLV+KDPEAAIVLFWKAIN GD+VDSALKDMAVV+K
Sbjct: 16 KDDLYHVIHKVPYGDSPYVKAKHAQLVDKDPEAAIVLFWKAINCGDKVDSALKDMAVVMK 75
Query: 65 QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
Q DR +EA+EAIKSFR LCSK +QESLDNVL+DLYKKCG+++EQIELLK+KLR+IY GEA
Sbjct: 76 QLDRSEEAIEAIKSFRGLCSKHSQESLDNVLLDLYKKCGKIEEQIELLKRKLRLIYQGEA 135
Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
FNG+ T+TARSHGKKFQV++KQET+R+LGNLGWAYMQK NY AEVV++KAQ++D DANK
Sbjct: 136 FNGRTTRTARSHGKKFQVSIKQETARLLGNLGWAYMQKENYMMAEVVFKKAQMVDADANK 195
Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
ACNL CL++Q+RY EA +LE+VL+G + GS E K+ R +ELL+EL ++P
Sbjct: 196 ACNLGLCLMRQSRYKEAYYILEEVLMGIIPGSDEIKSRKRAEELLEELNA--NLPQPEFM 253
Query: 245 KKSSLEDAFLEGLDDLMNQW-TPYRSRRLPIFEEISPFRDQLAC 287
L+D F++G+DDL++ W T R RRLPIFEEIS FRDQLAC
Sbjct: 254 DALGLDDDFVKGIDDLLDAWNTNNRPRRLPIFEEISSFRDQLAC 297
>gi|225464230|ref|XP_002266290.1| PREDICTED: uncharacterized protein LOC100263213 [Vitis vinifera]
Length = 293
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 196/284 (69%), Positives = 236/284 (83%), Gaps = 6/284 (2%)
Query: 8 PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
P+ V+HKLP GD+PY RAK V +VEKDPEAAIVLFW AINA DRV+SALKDMAVV+K+ D
Sbjct: 12 PFCVIHKLPSGDAPYARAKRVLMVEKDPEAAIVLFWNAINARDRVESALKDMAVVMKRLD 71
Query: 68 RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
R +EA+EAIKSFR+LC KQAQESLDN+LIDLYKKCGR +E I++LKQKLR+IY GEAFNG
Sbjct: 72 RSEEAIEAIKSFRNLCPKQAQESLDNILIDLYKKCGRTNEHIDMLKQKLRLIYQGEAFNG 131
Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
KPT+TARS+ KKFQ+T++QETSRILGNLGW YMQK NY +AEVVY+KAQ+IDPDANKACN
Sbjct: 132 KPTRTARSNRKKFQITIEQETSRILGNLGWVYMQKSNYMAAEVVYKKAQMIDPDANKACN 191
Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEP-WQSIPPSLTTKK 246
L+ CLI Q RYTEA SVL++VL GKL S + K NR +EL+ E+EP W +PPS T K
Sbjct: 192 LALCLINQGRYTEAHSVLKEVLQGKLPNSEDCKAQNRAQELMLEVEPKW--LPPSETIKL 249
Query: 247 SS--LEDAFLEGLDDLMNQWTPYRSRRLPIF-EEISPFRDQLAC 287
LED F +G + ++N W P+R++RLPIF EEIS +R+QLAC
Sbjct: 250 EGFDLEDDFSDGFEKVLNIWAPFRTKRLPIFVEEISSYRNQLAC 293
>gi|226533206|ref|NP_001141923.1| uncharacterized protein LOC100274072 [Zea mays]
gi|194706464|gb|ACF87316.1| unknown [Zea mays]
gi|195640848|gb|ACG39892.1| pollenless3 [Zea mays]
gi|413945947|gb|AFW78596.1| pollenless3 [Zea mays]
Length = 298
Score = 406 bits (1044), Expect = e-111, Method: Compositional matrix adjust.
Identities = 193/281 (68%), Positives = 232/281 (82%), Gaps = 4/281 (1%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HVVHK+P GDSPYV AKH+QLVEK P+ AIV FWKAIN+GDRVDSALKDMA+V+KQQDR
Sbjct: 20 FHVVHKVPAGDSPYVVAKHLQLVEKQPDVAIVWFWKAINSGDRVDSALKDMAMVMKQQDR 79
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
+EA+EAI+SFRHLCSKQAQESLDN+LIDLYKKCG+++EQIELLKQKL+ IY GEAFNGK
Sbjct: 80 SEEAIEAIRSFRHLCSKQAQESLDNLLIDLYKKCGKVEEQIELLKQKLKSIYLGEAFNGK 139
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TK ARSHGKKFQV+++QETSRILGNLGWAYMQ+ N+ +AE+VYRKAQ I+PDAN+ACNL
Sbjct: 140 ATKKARSHGKKFQVSIQQETSRILGNLGWAYMQQNNFEAAELVYRKAQAIEPDANRACNL 199
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
CLIKQ R+ EAR LEDV L ++ GS + K + R ++LL+EL P Q + S
Sbjct: 200 GLCLIKQGRHDEARQALEDVRLRRIYGSEDGKVVARAEQLLRELNPLQCVSSPFQVGLSV 259
Query: 249 LEDAFLEGLDDL--MNQWTPYRSRRLPIFEEISPFRDQLAC 287
E + G DL MN+WTP+RSRRLP+FEEI+ FRDQ+AC
Sbjct: 260 HEG--IMGKPDLVVMNEWTPFRSRRLPVFEEIATFRDQMAC 298
>gi|297744306|emb|CBI37276.3| unnamed protein product [Vitis vinifera]
Length = 296
Score = 406 bits (1043), Expect = e-111, Method: Compositional matrix adjust.
Identities = 196/284 (69%), Positives = 236/284 (83%), Gaps = 6/284 (2%)
Query: 8 PYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
P+ V+HKLP GD+PY RAK V +VEKDPEAAIVLFW AINA DRV+SALKDMAVV+K+ D
Sbjct: 15 PFCVIHKLPSGDAPYARAKRVLMVEKDPEAAIVLFWNAINARDRVESALKDMAVVMKRLD 74
Query: 68 RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
R +EA+EAIKSFR+LC KQAQESLDN+LIDLYKKCGR +E I++LKQKLR+IY GEAFNG
Sbjct: 75 RSEEAIEAIKSFRNLCPKQAQESLDNILIDLYKKCGRTNEHIDMLKQKLRLIYQGEAFNG 134
Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
KPT+TARS+ KKFQ+T++QETSRILGNLGW YMQK NY +AEVVY+KAQ+IDPDANKACN
Sbjct: 135 KPTRTARSNRKKFQITIEQETSRILGNLGWVYMQKSNYMAAEVVYKKAQMIDPDANKACN 194
Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEP-WQSIPPSLTTKK 246
L+ CLI Q RYTEA SVL++VL GKL S + K NR +EL+ E+EP W +PPS T K
Sbjct: 195 LALCLINQGRYTEAHSVLKEVLQGKLPNSEDCKAQNRAQELMLEVEPKW--LPPSETIKL 252
Query: 247 SS--LEDAFLEGLDDLMNQWTPYRSRRLPIF-EEISPFRDQLAC 287
LED F +G + ++N W P+R++RLPIF EEIS +R+QLAC
Sbjct: 253 EGFDLEDDFSDGFEKVLNIWAPFRTKRLPIFVEEISSYRNQLAC 296
>gi|30678993|ref|NP_171969.2| male sterility MS5 family protein [Arabidopsis thaliana]
gi|22655080|gb|AAM98131.1| unknown protein [Arabidopsis thaliana]
gi|30984518|gb|AAP42722.1| At1g04770 [Arabidopsis thaliana]
gi|332189620|gb|AEE27741.1| male sterility MS5 family protein [Arabidopsis thaliana]
Length = 303
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 198/284 (69%), Positives = 235/284 (82%), Gaps = 5/284 (1%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
Y+VVHKLP GDSPYVRAKHVQLVEKD EAAI LFW AI A DRVDSALKDMA+++KQQ+R
Sbjct: 20 YNVVHKLPHGDSPYVRAKHVQLVEKDAEAAIELFWIAIKARDRVDSALKDMALLMKQQNR 79
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
+EA++AI+SFR LCS+QAQESLDNVLIDLYKKCGR++EQ+ELLKQKL MIY GEAFNGK
Sbjct: 80 AEEAIDAIQSFRDLCSRQAQESLDNVLIDLYKKCGRIEEQVELLKQKLWMIYQGEAFNGK 139
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
PTKTARSHGKKFQVTV++ETSRILGNLGWAYMQ +YT+AE VYRKAQLI+PDANKACNL
Sbjct: 140 PTKTARSHGKKFQVTVEKETSRILGNLGWAYMQLMDYTAAEAVYRKAQLIEPDANKACNL 199
Query: 189 SHCLIKQARYTEARSVL-EDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
CLIKQ ++ EARS+L DVL+ GS + + + RV+ELL EL+P + + + +
Sbjct: 200 CTCLIKQGKHDEARSILFRDVLMENKEGSGDPRLMARVQELLSELKPQEEEAAASVSVEC 259
Query: 248 SL---EDAFLEGLDDLMNQW-TPYRSRRLPIFEEISPFRDQLAC 287
+ E A +EGLD+ + +W PYR+RRLPIFEEI P RDQLAC
Sbjct: 260 EVGIDEIAVVEGLDEFVKEWRRPYRTRRLPIFEEILPLRDQLAC 303
>gi|7211988|gb|AAF40459.1|AC004809_17 Strong similarity to the beta-9 tubulin gene (178 gene product)
gb|AF060248 [Arabidopsis thaliana]
Length = 364
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 198/284 (69%), Positives = 235/284 (82%), Gaps = 5/284 (1%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
Y+VVHKLP GDSPYVRAKHVQLVEKD EAAI LFW AI A DRVDSALKDMA+++KQQ+R
Sbjct: 81 YNVVHKLPHGDSPYVRAKHVQLVEKDAEAAIELFWIAIKARDRVDSALKDMALLMKQQNR 140
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
+EA++AI+SFR LCS+QAQESLDNVLIDLYKKCGR++EQ+ELLKQKL MIY GEAFNGK
Sbjct: 141 AEEAIDAIQSFRDLCSRQAQESLDNVLIDLYKKCGRIEEQVELLKQKLWMIYQGEAFNGK 200
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
PTKTARSHGKKFQVTV++ETSRILGNLGWAYMQ +YT+AE VYRKAQLI+PDANKACNL
Sbjct: 201 PTKTARSHGKKFQVTVEKETSRILGNLGWAYMQLMDYTAAEAVYRKAQLIEPDANKACNL 260
Query: 189 SHCLIKQARYTEARSVL-EDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
CLIKQ ++ EARS+L DVL+ GS + + + RV+ELL EL+P + + + +
Sbjct: 261 CTCLIKQGKHDEARSILFRDVLMENKEGSGDPRLMARVQELLSELKPQEEEAAASVSVEC 320
Query: 248 SL---EDAFLEGLDDLMNQW-TPYRSRRLPIFEEISPFRDQLAC 287
+ E A +EGLD+ + +W PYR+RRLPIFEEI P RDQLAC
Sbjct: 321 EVGIDEIAVVEGLDEFVKEWRRPYRTRRLPIFEEILPLRDQLAC 364
>gi|357505377|ref|XP_003622977.1| hypothetical protein MTR_7g059010 [Medicago truncatula]
gi|355497992|gb|AES79195.1| hypothetical protein MTR_7g059010 [Medicago truncatula]
Length = 387
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 194/279 (69%), Positives = 232/279 (83%), Gaps = 2/279 (0%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
YHV+HK+P GD+PYV+AKH QLV+KDPE AIV FWKAINAGD+VDSALKDMAVV+KQ DR
Sbjct: 19 YHVIHKVPYGDTPYVKAKHAQLVDKDPEVAIVYFWKAINAGDKVDSALKDMAVVMKQLDR 78
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
+EA+EAIKSFR LC+K +QESLDNVL+DLYKKCGR++EQIELLK+KLR+IY GEAFNG+
Sbjct: 79 AEEAIEAIKSFRGLCNKHSQESLDNVLLDLYKKCGRVEEQIELLKRKLRLIYQGEAFNGR 138
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TKTARSHGKKFQV++KQET+R+LGNLGWAYMQK NY AEVV++KAQ+ID DANKA NL
Sbjct: 139 TTKTARSHGKKFQVSIKQETARLLGNLGWAYMQKTNYMMAEVVFKKAQMIDADANKALNL 198
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
+ CL++Q+RY EA +LE VL GKL GS E K+ NR +ELL EL ++P
Sbjct: 199 ALCLMRQSRYEEAYLILEQVLQGKLPGSDEIKSRNRAEELLVELSA--NLPQPKFMDDLG 256
Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
L+D L+G+D L+N W+P RSRRLPIFEEIS FRDQLA
Sbjct: 257 LDDDLLKGIDGLLNVWSPTRSRRLPIFEEISSFRDQLAA 295
>gi|110742853|dbj|BAE99325.1| hypothetical protein [Arabidopsis thaliana]
Length = 303
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 198/284 (69%), Positives = 235/284 (82%), Gaps = 5/284 (1%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
Y+VVHKLP GDSPYVRAKHVQLVEKD EAAI LFW AI A DRVDSALKDMA+++KQQ+R
Sbjct: 20 YNVVHKLPHGDSPYVRAKHVQLVEKDAEAAIELFWIAIKARDRVDSALKDMALLMKQQNR 79
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
+EA++AI+SFR LCS+QAQESLDNVLIDLYKKCGR++EQ+ELLKQKL MIY GEAFNGK
Sbjct: 80 AEEAIDAIQSFRDLCSRQAQESLDNVLIDLYKKCGRIEEQVELLKQKLWMIYQGEAFNGK 139
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
PTKTARSHGKKFQVTV++ETSRILGNLGWAYMQ +YT+AE VYRKAQLI+PDANKACNL
Sbjct: 140 PTKTARSHGKKFQVTVEKETSRILGNLGWAYMQLMDYTAAEAVYRKAQLIEPDANKACNL 199
Query: 189 SHCLIKQARYTEARSVL-EDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
CLIKQ ++ EARS+L DVL+ GS + + + RV+ELL EL+P + + + +
Sbjct: 200 CTCLIKQGKHDEARSILFRDVLMENKEGSGDPRLMARVQELLSELKPQEEEAAASVSVEC 259
Query: 248 SL---EDAFLEGLDDLMNQW-TPYRSRRLPIFEEISPFRDQLAC 287
+ E A +EGLD+ + +W PYR+RRLPIFEEI P RDQLAC
Sbjct: 260 EVGIDEIAVVEGLDEFVKEWRRPYRTRRLPIFEEILPLRDQLAC 303
>gi|297848718|ref|XP_002892240.1| male sterility MS5 family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297338082|gb|EFH68499.1| male sterility MS5 family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 305
Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust.
Identities = 197/286 (68%), Positives = 233/286 (81%), Gaps = 7/286 (2%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
Y+VVHKLP GDSPYVRAKHVQLVEKD EAAI LFW+AI A DRVDSALKDMA+++KQQ+R
Sbjct: 20 YNVVHKLPHGDSPYVRAKHVQLVEKDAEAAIELFWRAIEARDRVDSALKDMALLMKQQNR 79
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
+EA++AI+SFR LCS+QAQESLDNVLIDLYKKCGR++EQ+ELLKQKL MIY GEAFNGK
Sbjct: 80 AEEAIDAIQSFRDLCSRQAQESLDNVLIDLYKKCGRIEEQVELLKQKLWMIYQGEAFNGK 139
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
PTKTARSHGKKFQVTV++ETSRILGNLGWAYMQ +YT+AE VYRKAQLI+PDANKACNL
Sbjct: 140 PTKTARSHGKKFQVTVEKETSRILGNLGWAYMQLMDYTAAEAVYRKAQLIEPDANKACNL 199
Query: 189 SHCLIKQARYTEARSVL-EDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
CLIKQ + EAR+++ DVL GS + + + RV+ELL EL+P + +L +
Sbjct: 200 CTCLIKQGKQDEARTIIFRDVLTENKEGSGDPRLMARVQELLSELKPQEEEAAALASVSV 259
Query: 248 SL-----EDAFLEGLDDLMNQW-TPYRSRRLPIFEEISPFRDQLAC 287
E A +EGLD+ + +W PYR+RRLPIFEEI P RDQLAC
Sbjct: 260 ECEVGMDEIAVVEGLDEFVKEWRRPYRTRRLPIFEEILPLRDQLAC 305
>gi|449468732|ref|XP_004152075.1| PREDICTED: uncharacterized protein LOC101204990 [Cucumis sativus]
Length = 293
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 188/286 (65%), Positives = 234/286 (81%), Gaps = 1/286 (0%)
Query: 3 QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVV 62
+ +D+ +HV+HK+PPGD+PYVRAK+ QL++KDPE+AI LFW+AIN GDRV+SALKDMAVV
Sbjct: 8 ERQDELFHVIHKVPPGDTPYVRAKYAQLIKKDPESAIALFWEAINKGDRVESALKDMAVV 67
Query: 63 LKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHG 122
+KQ DR +EA+ +++FR LCSK +Q SLDNVLIDL+KKCGR++EQIELLK+KLRMIY G
Sbjct: 68 MKQIDRAEEAIHILQTFRFLCSKHSQNSLDNVLIDLFKKCGRIEEQIELLKRKLRMIYQG 127
Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
EAFNGKPT+TARSHGKKFQV+VKQETSR+LGNLGWAYMQK NY AE VY+KAQ+IDPDA
Sbjct: 128 EAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYMQKPNYMMAEAVYKKAQMIDPDA 187
Query: 183 NKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSL 242
NKACNL CL+KQ R +EA VLE V ++ GS+E K R +LL E+ QS+P S+
Sbjct: 188 NKACNLGLCLMKQGRLSEAIFVLEQVQQAQIPGSSEIKAQKRSADLLTEIRSRQSLPDSI 247
Query: 243 TTKKSSLEDAFLEGLDDLMNQWTPY-RSRRLPIFEEISPFRDQLAC 287
S++ FL GL+ L+N+ P+ RS+RLP+FEEIS FRDQLAC
Sbjct: 248 DLLGLSVDVDFLNGLELLVNKKGPFSRSKRLPVFEEISSFRDQLAC 293
>gi|125552911|gb|EAY98620.1| hypothetical protein OsI_20545 [Oryza sativa Indica Group]
Length = 284
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 184/279 (65%), Positives = 226/279 (81%), Gaps = 16/279 (5%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HVVHKL V+KDPE AIV FWKAIN+ D+VDSALKDMAVV+KQQDR
Sbjct: 22 FHVVHKL---------------VDKDPETAIVWFWKAINSRDKVDSALKDMAVVMKQQDR 66
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
+EA+EAI+SFRHLCS+QAQESLDN+LIDLYKKCG++DEQI+LLKQKL+MIY GEAFNGK
Sbjct: 67 AEEAIEAIRSFRHLCSRQAQESLDNLLIDLYKKCGKVDEQIDLLKQKLKMIYLGEAFNGK 126
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TKTARSHGKKFQV+++QETSRILGNLGWAYMQ+ NY++AE+VYRKAQ I+PDAN+ACNL
Sbjct: 127 ATKTARSHGKKFQVSIQQETSRILGNLGWAYMQQSNYSAAELVYRKAQSIEPDANRACNL 186
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
CLIKQ+R+ EAR VL DV+L ++SGS + K + R K+LL ELEP + S S
Sbjct: 187 GLCLIKQSRHDEARQVLHDVVLRRISGSEDEKVVARAKQLLHELEPVTHV-TSPNNAGLS 245
Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
+ + +E LD ++N+WTP+RSRRLP+FEEI+ RDQ+AC
Sbjct: 246 VSEEIMERLDLVLNEWTPFRSRRLPVFEEIATLRDQIAC 284
>gi|222632158|gb|EEE64290.1| hypothetical protein OsJ_19127 [Oryza sativa Japonica Group]
Length = 284
Score = 385 bits (990), Expect = e-105, Method: Compositional matrix adjust.
Identities = 184/279 (65%), Positives = 225/279 (80%), Gaps = 16/279 (5%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HVVHKL V+KDPE AIV FWKAIN+ D+VDSALKDMAVV+KQQDR
Sbjct: 22 FHVVHKL---------------VDKDPETAIVWFWKAINSRDKVDSALKDMAVVMKQQDR 66
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
EA+EAI+SFRHLCS+QAQESLDN+LIDLYKKCG++DEQI+LLKQKL+MIY GEAFNGK
Sbjct: 67 AKEAIEAIRSFRHLCSRQAQESLDNLLIDLYKKCGKVDEQIDLLKQKLKMIYLGEAFNGK 126
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TKTARSHGKKFQV+++QETSRILGNLGWAYMQ+ NY++AE+VYRKAQ I+PDAN+ACNL
Sbjct: 127 ATKTARSHGKKFQVSIQQETSRILGNLGWAYMQQSNYSAAELVYRKAQSIEPDANRACNL 186
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSS 248
CLIKQ+R+ EAR VL DV+L ++SGS + K + R K+LL ELEP + S S
Sbjct: 187 GLCLIKQSRHDEARQVLHDVVLRRISGSEDDKVVARAKQLLHELEPVTHV-TSPNNAGLS 245
Query: 249 LEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
+ + +E LD ++N+WTP+RSRRLP+FEEI+ RDQ+AC
Sbjct: 246 VSEEIMERLDLVLNEWTPFRSRRLPVFEEIATLRDQIAC 284
>gi|297795599|ref|XP_002865684.1| male sterility MS5 family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297311519|gb|EFH41943.1| male sterility MS5 family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 306
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 179/285 (62%), Positives = 222/285 (77%), Gaps = 5/285 (1%)
Query: 6 DQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQ 65
D+ +HV+HK+P GD+PYVRAKH QL+EK+PE AIV FWKAIN GDRVDSALKDMAVV+KQ
Sbjct: 24 DEMFHVIHKVPCGDTPYVRAKHAQLIEKNPEMAIVWFWKAINTGDRVDSALKDMAVVMKQ 83
Query: 66 QDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAF 125
DR +EA+EAIKSFR CSK +Q+SLDNVLIDLYKKCGR++EQ+ELLK+KLR IY GEAF
Sbjct: 84 LDRSEEAIEAIKSFRPRCSKNSQDSLDNVLIDLYKKCGRMEEQVELLKRKLRQIYQGEAF 143
Query: 126 NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
NGKPTKTARSHGKKFQVTV+QE SR+LGNLGWAYMQ+ Y SAE VYRKAQ+++PDANK+
Sbjct: 144 NGKPTKTARSHGKKFQVTVQQEISRLLGNLGWAYMQQAKYLSAEAVYRKAQMVEPDANKS 203
Query: 186 CNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRV---KELLQELEPWQSIPPSL 242
CNL+ CLIKQ R+ E R VL+DVL ++ G+ + +T R L+ P
Sbjct: 204 CNLAMCLIKQGRFEEGRLVLDDVLESRVLGADDCRTRQRAEELLSELESSLPRMGDAEME 263
Query: 243 TTKKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
+ L+D F+ GL+ + + T ++S+RLPIFE+IS FR+ L C
Sbjct: 264 DVLGNILDDDFVLGLEQMTS--TSFKSKRLPIFEQISSFRNTLVC 306
>gi|9758884|dbj|BAB09438.1| unnamed protein product [Arabidopsis thaliana]
Length = 326
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 179/285 (62%), Positives = 223/285 (78%), Gaps = 5/285 (1%)
Query: 6 DQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQ 65
D+ +HV+HK+P GD+PYVRAKH QL+EK+PE AIV FWKAIN GDRVDSALKDMAVV+KQ
Sbjct: 44 DELFHVIHKVPCGDTPYVRAKHAQLIEKNPEMAIVWFWKAINTGDRVDSALKDMAVVMKQ 103
Query: 66 QDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAF 125
DR +EA+EAIKSFR CSK +Q+SLDNVLIDLYKKCGR++EQ+ELLK+KLR IY GEAF
Sbjct: 104 LDRSEEAIEAIKSFRPRCSKNSQDSLDNVLIDLYKKCGRMEEQVELLKRKLRQIYQGEAF 163
Query: 126 NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
NGKPTKTARSHGKKFQVTV+QE SR+LGNLGWAYMQ+ Y SAE VYRKAQ+++PDANK+
Sbjct: 164 NGKPTKTARSHGKKFQVTVQQEISRLLGNLGWAYMQQAKYLSAEAVYRKAQMVEPDANKS 223
Query: 186 CNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRV---KELLQELEPWQSIPPSL 242
CNL+ CLIKQ R+ E R VL+DVL ++ G+ + +T R L+ P
Sbjct: 224 CNLAMCLIKQGRFEEGRLVLDDVLEYRVLGADDCRTRQRAEELLSELESSLPRMRDAEME 283
Query: 243 TTKKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
+ L+D F+ GL+++ + T ++S+RLPIFE+IS FR+ L C
Sbjct: 284 DVLGNILDDDFVLGLEEMTS--TSFKSKRLPIFEQISSFRNTLVC 326
>gi|30695554|ref|NP_199696.2| tetratricopeptide repeat domain-containing protein [Arabidopsis
thaliana]
gi|26451131|dbj|BAC42669.1| unknown protein [Arabidopsis thaliana]
gi|28950875|gb|AAO63361.1| At5g48850 [Arabidopsis thaliana]
gi|332008350|gb|AED95733.1| tetratricopeptide repeat domain-containing protein [Arabidopsis
thaliana]
Length = 306
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 179/285 (62%), Positives = 223/285 (78%), Gaps = 5/285 (1%)
Query: 6 DQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQ 65
D+ +HV+HK+P GD+PYVRAKH QL+EK+PE AIV FWKAIN GDRVDSALKDMAVV+KQ
Sbjct: 24 DELFHVIHKVPCGDTPYVRAKHAQLIEKNPEMAIVWFWKAINTGDRVDSALKDMAVVMKQ 83
Query: 66 QDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAF 125
DR +EA+EAIKSFR CSK +Q+SLDNVLIDLYKKCGR++EQ+ELLK+KLR IY GEAF
Sbjct: 84 LDRSEEAIEAIKSFRPRCSKNSQDSLDNVLIDLYKKCGRMEEQVELLKRKLRQIYQGEAF 143
Query: 126 NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
NGKPTKTARSHGKKFQVTV+QE SR+LGNLGWAYMQ+ Y SAE VYRKAQ+++PDANK+
Sbjct: 144 NGKPTKTARSHGKKFQVTVQQEISRLLGNLGWAYMQQAKYLSAEAVYRKAQMVEPDANKS 203
Query: 186 CNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRV---KELLQELEPWQSIPPSL 242
CNL+ CLIKQ R+ E R VL+DVL ++ G+ + +T R L+ P
Sbjct: 204 CNLAMCLIKQGRFEEGRLVLDDVLEYRVLGADDCRTRQRAEELLSELESSLPRMRDAEME 263
Query: 243 TTKKSSLEDAFLEGLDDLMNQWTPYRSRRLPIFEEISPFRDQLAC 287
+ L+D F+ GL+++ + T ++S+RLPIFE+IS FR+ L C
Sbjct: 264 DVLGNILDDDFVLGLEEMTS--TSFKSKRLPIFEQISSFRNTLVC 306
>gi|212722260|ref|NP_001132763.1| uncharacterized protein LOC100194250 [Zea mays]
gi|194695332|gb|ACF81750.1| unknown [Zea mays]
gi|413957078|gb|AFW89727.1| hypothetical protein ZEAMMB73_680631 [Zea mays]
Length = 304
Score = 333 bits (854), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 168/286 (58%), Positives = 214/286 (74%), Gaps = 8/286 (2%)
Query: 10 HVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRV 69
HV HK+P GD PY RAKH QLVEKD +A+I FWKAI GD+VDSALKDMAVV+KQ+ +
Sbjct: 19 HVAHKIPAGDGPYARAKHFQLVEKDLDASIAWFWKAIGTGDKVDSALKDMAVVMKQRGYL 78
Query: 70 DEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKP 129
EA+ A++S RHLC KQ+QESLDN+L+DLYK GR E+IELLKQKLR IY GEAF+GK
Sbjct: 79 AEAIGAVRSLRHLCPKQSQESLDNILLDLYKASGRTKEEIELLKQKLRKIYLGEAFHGKT 138
Query: 130 TKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLS 189
TK ARSHG+K V++KQETSR+LGNL WAYMQ+ N+ +AEVVYRKAQ+IDPDANKACNL+
Sbjct: 139 TKRARSHGRKIHVSIKQETSRVLGNLAWAYMQQRNFMAAEVVYRKAQMIDPDANKACNLA 198
Query: 190 HCLIKQARYTEARSVLEDVLLGKLSG--STETKTINRVKELLQEL--EPWQSIPPSLTTK 245
CLI+QAR +A+ VL DVL G+ + K + +V+ELL + + WQ +
Sbjct: 199 LCLIEQARLADAQLVLTDVLAGRYQARDQQDGKIVRKVEELLARIMAQTWQGGGGGRRRR 258
Query: 246 KSSLEDAFLEG-LDDLMNQWTPYR--SRRLPIFEEISP-FRDQLAC 287
S +D ++E + L++ PYR SRRLP+FEEISP +++Q+AC
Sbjct: 259 DESDDDDWVENQMLALLDVAVPYRKTSRRLPVFEEISPIYKEQVAC 304
>gi|242036819|ref|XP_002465804.1| hypothetical protein SORBIDRAFT_01g046120 [Sorghum bicolor]
gi|241919658|gb|EER92802.1| hypothetical protein SORBIDRAFT_01g046120 [Sorghum bicolor]
Length = 312
Score = 326 bits (836), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 168/297 (56%), Positives = 213/297 (71%), Gaps = 23/297 (7%)
Query: 10 HVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRV 69
HV HK+P GD PY RAKH QLVEKD +A+I FWKAI+ GD+VDSALKDMAVV+KQ+ +
Sbjct: 20 HVAHKIPAGDGPYARAKHFQLVEKDLDASIAWFWKAISTGDKVDSALKDMAVVMKQRGYL 79
Query: 70 DEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKP 129
EA++AI+S RHLC KQ+QESLDN+L+DLYK GR E+IELLKQKLR IY GEAF+GK
Sbjct: 80 TEAIDAIRSLRHLCPKQSQESLDNILLDLYKASGRTKEEIELLKQKLRKIYLGEAFHGKT 139
Query: 130 TKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLS 189
TK ARSHG+K V+V+QETSRILGNL WAYMQ+ N+ +AEVVYRKAQ+IDPDANKACNL+
Sbjct: 140 TKRARSHGRKIHVSVRQETSRILGNLAWAYMQQRNFMAAEVVYRKAQMIDPDANKACNLA 199
Query: 190 HCLIKQARYTEARSVLEDVLLGKLSG---STETKTINRVKELLQEL--EPWQSIPPSLTT 244
CLI+Q R+ +A VL DVL G+ + K + +V+ELL + + W +
Sbjct: 200 LCLIEQTRFADAELVLADVLAGRYLARDQQQDGKIVRKVEELLARIVAQTWPGGGANSGR 259
Query: 245 KKSS-----------LEDAFLEGLDDLMNQWTPYR--SRRLPIFEEISP-FRDQLAC 287
++ +E+ L LD + PYR S+RLP+FEEISP +R+Q+AC
Sbjct: 260 RRDESGSESDDDDDWVENQMLALLDVAV----PYRKSSKRLPVFEEISPVYREQVAC 312
>gi|115450941|ref|NP_001049071.1| Os03g0165900 [Oryza sativa Japonica Group]
gi|108706358|gb|ABF94153.1| TPR Domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113547542|dbj|BAF10985.1| Os03g0165900 [Oryza sativa Japonica Group]
gi|125585045|gb|EAZ25709.1| hypothetical protein OsJ_09542 [Oryza sativa Japonica Group]
gi|215717030|dbj|BAG95393.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 315
Score = 323 bits (829), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 168/295 (56%), Positives = 212/295 (71%), Gaps = 18/295 (6%)
Query: 10 HVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRV 69
HV +K+P GD PY RAKH QLVEKD +A+I FWKAIN+GD+VDSALKDMAVV+KQ+
Sbjct: 19 HVAYKIPAGDGPYARAKHYQLVEKDLDASIAWFWKAINSGDKVDSALKDMAVVMKQRGYH 78
Query: 70 DEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKP 129
+A++AIKS RHLC KQ+Q+SLDN+L+DLYK GR E+IELLKQKLR IY GEAF+GK
Sbjct: 79 ADAIDAIKSLRHLCPKQSQDSLDNILLDLYKASGRTKEEIELLKQKLRKIYLGEAFHGKT 138
Query: 130 TKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLS 189
TK ARSHG+K V+VKQETSR+LGNL WAYMQ+GN+ +AEVVYRKAQ++DPDANKACNL+
Sbjct: 139 TKRARSHGRKIHVSVKQETSRVLGNLAWAYMQQGNFMAAEVVYRKAQMVDPDANKACNLA 198
Query: 190 HCLIKQARYTEARSVLEDVLLGKLSG--------STETKTINRVKELLQEL------EPW 235
CLI+Q R +A +VL VL G+ T K + +V+EL+ +
Sbjct: 199 LCLIEQRRPADAEAVLAGVLAGRYHARDDDHGAPHTGGKILAKVEELMARITGEVDGRSN 258
Query: 236 QSIPPSLTTKKSSLEDAFLEGLDDLMNQW-TPYR--SRRLPIFEEISPF-RDQLA 286
S S +ED +E LD ++ QW PYR +RRLP+FEEI+P R+Q+A
Sbjct: 259 SSGGSSSDDDDRDVEDEMVELLDVVVRQWAAPYRRSNRRLPVFEEITPVCREQMA 313
>gi|125542541|gb|EAY88680.1| hypothetical protein OsI_10155 [Oryza sativa Indica Group]
Length = 315
Score = 323 bits (828), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 168/295 (56%), Positives = 212/295 (71%), Gaps = 18/295 (6%)
Query: 10 HVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRV 69
HV +K+P GD PY RAKH QLVEKD +A+I FWKAIN+GD+VDSALKDMAVV+KQ+
Sbjct: 19 HVAYKIPAGDGPYARAKHYQLVEKDLDASIAWFWKAINSGDKVDSALKDMAVVMKQRGYH 78
Query: 70 DEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKP 129
+A++AIKS RHLC KQ+Q+SLDN+L+DLYK GR E+IELLKQKLR IY GEAF+GK
Sbjct: 79 ADAIDAIKSLRHLCPKQSQDSLDNILLDLYKASGRTKEEIELLKQKLRKIYLGEAFHGKT 138
Query: 130 TKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLS 189
TK ARSHG+K V+VKQETSR+LGNL WAYMQ+GN+ +AEVVYRKAQ++DPDANKACNL+
Sbjct: 139 TKRARSHGRKIHVSVKQETSRVLGNLAWAYMQQGNFMAAEVVYRKAQMVDPDANKACNLA 198
Query: 190 HCLIKQARYTEARSVLEDVLLGKLSG--------STETKTINRVKELLQEL------EPW 235
CLI+Q R +A +VL VL G+ T K + +V+EL+ +
Sbjct: 199 LCLIEQRRPADAEAVLAGVLAGRYHARDDDHGAPHTGGKILAKVEELMARITGEVDGRSN 258
Query: 236 QSIPPSLTTKKSSLEDAFLEGLDDLMNQW-TPYR--SRRLPIFEEISPF-RDQLA 286
S S +ED +E LD ++ QW PYR +RRLP+FEEI+P R+Q+A
Sbjct: 259 SSGGSSSDDDDRDVEDEMVELLDVVVRQWAAPYRRSNRRLPVFEEITPVCREQMA 313
>gi|357113902|ref|XP_003558740.1| PREDICTED: uncharacterized protein LOC100831382 [Brachypodium
distachyon]
Length = 305
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/295 (55%), Positives = 213/295 (72%), Gaps = 15/295 (5%)
Query: 6 DQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQ 65
D HV HK+P GD PY RAKH QLVEKD +A+IV FWKAI GD+VDSALKDMAVV+KQ
Sbjct: 13 DAAAHVAHKIPSGDGPYARAKHYQLVEKDLDASIVWFWKAIETGDKVDSALKDMAVVMKQ 72
Query: 66 QDRVDEAVEAIKSFRHLC--SKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
+ + +A++AIKS RHLC S+Q+QESLDN+L+DLYK GR E+I+LLKQKLR I+HGE
Sbjct: 73 RGYLKDAIDAIKSLRHLCNPSRQSQESLDNILLDLYKASGRTREEIDLLKQKLRRIFHGE 132
Query: 124 AF-NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
AF GK TK ARSHG+K V+VKQETSR+LGNL WAYMQ+ N+ +AE VYRKAQ++DPDA
Sbjct: 133 AFPRGKSTKRARSHGRKIHVSVKQETSRVLGNLAWAYMQERNFMAAEAVYRKAQMVDPDA 192
Query: 183 NKACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKELLQELEPWQSI 238
NKACNL+ CL++Q R +A VL+ VL G ++ G K + + +EL++ + +
Sbjct: 193 NKACNLALCLVEQRRMGDAEKVLDGVLSGVYVEQIGGG--EKAVRKAEELMERIRAGKGG 250
Query: 239 PPSLTTKKSSLE-DAFLEGLDDLMNQWT-PYR--SRRLPIFEEISPF--RDQLAC 287
++ +E D E LD ++ +W PYR RRLP+FEEI+PF R Q+AC
Sbjct: 251 DGVEEDQEDGVEADEMAELLDVVVKEWARPYRRSDRRLPVFEEITPFCGRGQIAC 305
>gi|226531990|ref|NP_001151272.1| pollenless3 [Zea mays]
gi|195645436|gb|ACG42186.1| pollenless3 [Zea mays]
gi|414865004|tpg|DAA43561.1| TPA: pollenless3 [Zea mays]
Length = 312
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 164/288 (56%), Positives = 206/288 (71%), Gaps = 18/288 (6%)
Query: 10 HVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRV 69
HV HK+PPGD PY RAKH QLVEKD +A+I FWKAI+AGD++DSALKDMAVV+KQ+ +
Sbjct: 22 HVAHKIPPGDGPYARAKHFQLVEKDLDASIAWFWKAISAGDKLDSALKDMAVVMKQRGYL 81
Query: 70 DEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKP 129
EA++AI+S RHLC KQ+QE LDN+L+DLYK GR E+IELLKQKLR IY GEAF+G+
Sbjct: 82 AEAIDAIRSLRHLCPKQSQEPLDNILLDLYKASGRTKEEIELLKQKLRKIYLGEAFHGRT 141
Query: 130 TKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLS 189
TK ARSHG+K V+VKQETSR+LGNL WAYMQ+ N+ +AEVVYRKAQ+IDPDANKACNL+
Sbjct: 142 TKRARSHGRKIHVSVKQETSRVLGNLAWAYMQQRNFMAAEVVYRKAQMIDPDANKACNLA 201
Query: 190 HCLIKQARYTEARSVLEDVLLGKL---SGSTETKTINRVKELLQELEPWQSIP------- 239
CLI+QAR +A VL DVL G+ + K + +V+ELL + + P
Sbjct: 202 LCLIEQARLADAELVLADVLAGRYLQARDQQDGKIVRKVEELLARIMAHTTWPGGGGGDG 261
Query: 240 ------PSLTTKKSSLEDAFLEGLDDLMNQWTPYR--SRRLPIFEEIS 279
S + +S +D + L++ PYR SRRLP+FEEIS
Sbjct: 262 RRRPADKSGSGSESDDDDWVENQMLALLDVAVPYRKKSRRLPVFEEIS 309
>gi|168001142|ref|XP_001753274.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695560|gb|EDQ81903.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 296
Score = 312 bits (799), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 147/233 (63%), Positives = 186/233 (79%), Gaps = 3/233 (1%)
Query: 4 TRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVL 63
R +HVVHK+P GD+PYV+AKHVQLV+KDP+ AI LFW AINAGDRVDSALKDMA+V+
Sbjct: 25 ARADSFHVVHKVPVGDTPYVKAKHVQLVDKDPDRAIALFWAAINAGDRVDSALKDMAIVM 84
Query: 64 KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
KQQ+R EA+EAIKS R CS QAQESLDNVL+DLYK+CGRLD+QI+LLK KL +I+ G
Sbjct: 85 KQQNRPQEAIEAIKSLRSRCSDQAQESLDNVLLDLYKRCGRLDDQIDLLKHKLHLIHQGM 144
Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
AFNGK TKTARS GKKFQV+++QE +R+LGNLGWA+MQ+ N+ +AE VYRKA I+PD N
Sbjct: 145 AFNGKRTKTARSQGKKFQVSIEQEATRLLGNLGWAHMQQSNFVAAEAVYRKALSIEPDNN 204
Query: 184 KACNLSHCLIKQARYTEARSVLEDVLL---GKLSGSTETKTINRVKELLQELE 233
K CNL CL+KQ R EA+++L+ V + + + K+ +R +E+LQ+LE
Sbjct: 205 KVCNLGICLMKQGRLEEAKAMLQSVTRCNDNRWASDSHLKSYDRAQEMLQDLE 257
>gi|168001098|ref|XP_001753252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695538|gb|EDQ81881.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 298
Score = 312 bits (799), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 148/233 (63%), Positives = 185/233 (79%), Gaps = 3/233 (1%)
Query: 4 TRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVL 63
R +HVVHK+P GD+PYV+AKHVQLV+KDP+ AI LFW AINAGDRVDSALKDMA+V+
Sbjct: 25 ARADSFHVVHKVPVGDTPYVKAKHVQLVDKDPDRAIALFWAAINAGDRVDSALKDMAIVM 84
Query: 64 KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
KQQ+R EA+EAIKS R CS QAQESLDNVL+DLYK+CGRLD+QI+LLK KL +I+ G
Sbjct: 85 KQQNRPQEAIEAIKSLRSRCSDQAQESLDNVLLDLYKRCGRLDDQIDLLKHKLHLIHQGM 144
Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
AFNGK TKTARS GKKFQV+++QE +R+LGNLGWA MQ+ N+ +AE VYRKA I+PD N
Sbjct: 145 AFNGKRTKTARSQGKKFQVSIEQEATRLLGNLGWACMQQSNFVAAEAVYRKALSIEPDNN 204
Query: 184 KACNLSHCLIKQARYTEARSVLEDVLL---GKLSGSTETKTINRVKELLQELE 233
K CNL CL+KQ R EA+++L+ V + + + K+ +R +E+LQELE
Sbjct: 205 KVCNLGICLMKQGRLEEAKAMLQSVTRCNDNRWASDSHLKSYDRAQEMLQELE 257
>gi|242045418|ref|XP_002460580.1| hypothetical protein SORBIDRAFT_02g031200 [Sorghum bicolor]
gi|241923957|gb|EER97101.1| hypothetical protein SORBIDRAFT_02g031200 [Sorghum bicolor]
Length = 516
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 150/237 (63%), Positives = 182/237 (76%), Gaps = 4/237 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
M +T +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 37 MLRTHSDSFHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWSAINAGDRVDSALKDMA 96
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL++I+
Sbjct: 97 IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQLIH 156
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE YR+A LI P
Sbjct: 157 QGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALLIGP 216
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
D NK CNL CL+KQ R EA+ VL+ V + G + K R +E+L++LE
Sbjct: 217 DNNKMCNLGICLMKQGRVLEAKDVLKQVRPAAVDGLRGADSHLKAYERAQEMLRDLE 273
>gi|115480407|ref|NP_001063797.1| Os09g0538500 [Oryza sativa Japonica Group]
gi|50726663|dbj|BAD34381.1| putative pollenless3 [Oryza sativa Japonica Group]
gi|113632030|dbj|BAF25711.1| Os09g0538500 [Oryza sativa Japonica Group]
gi|125606466|gb|EAZ45502.1| hypothetical protein OsJ_30159 [Oryza sativa Japonica Group]
gi|215740583|dbj|BAG97239.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215768313|dbj|BAH00542.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215768379|dbj|BAH00608.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 515
Score = 311 bits (797), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 150/237 (63%), Positives = 182/237 (76%), Gaps = 4/237 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
M +T +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 46 MLRTNSDSFHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWAAINAGDRVDSALKDMA 105
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL++I+
Sbjct: 106 IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQLIH 165
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE YR+A LI P
Sbjct: 166 QGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALLIGP 225
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
D NK CNL CL+KQ R EA+ VL+ V + G + K R +E+L++LE
Sbjct: 226 DNNKMCNLGICLMKQGRVLEAKDVLKQVRPAGVDGLRGADSHLKAYERAQEMLRDLE 282
>gi|125564523|gb|EAZ09903.1| hypothetical protein OsI_32198 [Oryza sativa Indica Group]
Length = 513
Score = 311 bits (797), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 150/237 (63%), Positives = 182/237 (76%), Gaps = 4/237 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
M +T +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 46 MLRTNSDSFHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWAAINAGDRVDSALKDMA 105
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL++I+
Sbjct: 106 IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQLIH 165
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE YR+A LI P
Sbjct: 166 QGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALLIGP 225
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
D NK CNL CL+KQ R EA+ VL+ V + G + K R +E+L++LE
Sbjct: 226 DNNKMCNLGICLMKQGRVLEAKDVLKQVRPAGVDGLRGADSHLKAYERAQEMLRDLE 282
>gi|357159691|ref|XP_003578528.1| PREDICTED: uncharacterized protein LOC100833530 [Brachypodium
distachyon]
Length = 508
Score = 311 bits (796), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 164/297 (55%), Positives = 203/297 (68%), Gaps = 18/297 (6%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
M +T +HV K+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 40 MLRTNSDAFHVAFKVPVGDTPYVRAKRVQLVDKDPEKAIALFWAAINAGDRVDSALKDMA 99
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EA+KS R CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL++I+
Sbjct: 100 IVMKQQNRAEEAIEAVKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQLIH 159
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE YR+A LI P
Sbjct: 160 QGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALLIGP 219
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
D NK CNL CL+KQ R EA+ VL+ V + G + K R +E+L++LE
Sbjct: 220 DNNKMCNLGICLMKQGRVLEAKDVLKQVRPAGVDGLRGADSHLKAYERAQEMLRDLEIKL 279
Query: 237 SIPPSLTTKKSSLEDAFLEGLDDLM-----NQWTPYRSRRLPIFEEISP--FRDQLA 286
P+ T L+ ++L D LM + W P P E + P RDQ A
Sbjct: 280 VGRPAWATAGDQLDKSWL--FDALMLGSSSSIWQPQ-----PCVEHLLPPLPRDQFA 329
>gi|226497904|ref|NP_001141607.1| hypothetical protein [Zea mays]
gi|194705256|gb|ACF86712.1| unknown [Zea mays]
gi|414888334|tpg|DAA64348.1| TPA: hypothetical protein ZEAMMB73_633017 [Zea mays]
Length = 485
Score = 311 bits (796), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 156/253 (61%), Positives = 188/253 (74%), Gaps = 5/253 (1%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA+V+KQQ+R
Sbjct: 39 FHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWSAINAGDRVDSALKDMAIVMKQQNR 98
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
+EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL++I G AFNGK
Sbjct: 99 AEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQLINQGHAFNGK 158
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE YR+A LI PD NK CNL
Sbjct: 159 RTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALLIGPDNNKMCNL 218
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEP-WQSIPPSLT 243
CL+KQAR EA+ VL+ V + G + K R +E+L++LE PP+
Sbjct: 219 GICLMKQARVLEAKDVLKQVRPAAVDGLRGADSHLKAYERAQEMLRDLETKLIGRPPADQ 278
Query: 244 TKKSSLEDAFLEG 256
+ L DA L G
Sbjct: 279 LDTNWLFDALLLG 291
>gi|302818051|ref|XP_002990700.1| hypothetical protein SELMODRAFT_451580 [Selaginella moellendorffii]
gi|300141622|gb|EFJ08332.1| hypothetical protein SELMODRAFT_451580 [Selaginella moellendorffii]
Length = 538
Score = 311 bits (796), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 147/237 (62%), Positives = 186/237 (78%), Gaps = 4/237 (1%)
Query: 4 TRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVL 63
+R +HV+HK+P GDSPYVRAKHVQLV+K+P+ AI LFW AINAGDRVDSALKDMA+V+
Sbjct: 36 SRADSFHVIHKVPLGDSPYVRAKHVQLVDKEPDKAIALFWAAINAGDRVDSALKDMAIVM 95
Query: 64 KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
KQQ+R +EA+EAIKS R CS QAQESLDNVL+DLYK+CGRLD+QI LLK KL++IY G
Sbjct: 96 KQQNRPEEAIEAIKSLRGRCSDQAQESLDNVLLDLYKRCGRLDDQIALLKHKLQLIYAGL 155
Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
AFNGK TKTARS GKKFQV+++QE +R+LGNLGWAYMQ+ ++ +AE VYRKA ++PD N
Sbjct: 156 AFNGKRTKTARSQGKKFQVSIEQEATRLLGNLGWAYMQQSDFIAAEAVYRKALSMEPDNN 215
Query: 184 KACNLSHCLIKQARYTEARSVLEDVLL----GKLSGSTETKTINRVKELLQELEPWQ 236
K CNL CL+KQ R EA+ +L+ V+ + + K+ R +E+L E+E Q
Sbjct: 216 KMCNLGICLMKQGRVDEAKVLLKSVMPASSDARWGADSHLKSYERAQEMLAEMEQGQ 272
>gi|302771231|ref|XP_002969034.1| hypothetical protein SELMODRAFT_451579 [Selaginella moellendorffii]
gi|300163539|gb|EFJ30150.1| hypothetical protein SELMODRAFT_451579 [Selaginella moellendorffii]
Length = 533
Score = 310 bits (795), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 147/237 (62%), Positives = 186/237 (78%), Gaps = 4/237 (1%)
Query: 4 TRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVL 63
+R +HV+HK+P GDSPYVRAKHVQLV+K+P+ AI LFW AINAGDRVDSALKDMA+V+
Sbjct: 36 SRADSFHVIHKVPLGDSPYVRAKHVQLVDKEPDKAIALFWAAINAGDRVDSALKDMAIVM 95
Query: 64 KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
KQQ+R +EA+EAIKS R CS QAQESLDNVL+DLYK+CGRLD+QI LLK KL++IY G
Sbjct: 96 KQQNRPEEAIEAIKSLRGRCSDQAQESLDNVLLDLYKRCGRLDDQIALLKHKLQLIYAGL 155
Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
AFNGK TKTARS GKKFQV+++QE +R+LGNLGWAYMQ+ ++ +AE VYRKA ++PD N
Sbjct: 156 AFNGKRTKTARSQGKKFQVSIEQEATRLLGNLGWAYMQQSDFIAAEAVYRKALSMEPDNN 215
Query: 184 KACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKELLQELEPWQ 236
K CNL CL+KQ R EA+ +L+ V+ + + K+ R +E+L E+E Q
Sbjct: 216 KMCNLGICLMKQGRVDEAKVLLKSVMPASSDTRWGADSHLKSYERAQEMLAEMEQGQ 272
>gi|168005546|ref|XP_001755471.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693178|gb|EDQ79531.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 299
Score = 310 bits (795), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 147/233 (63%), Positives = 186/233 (79%), Gaps = 4/233 (1%)
Query: 4 TRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVL 63
+R P+HVVHK+P GDSPYV+AKHVQLV+KDP+ AI LFW AIN+GDRVDSALKDMA+V+
Sbjct: 26 SRGDPFHVVHKVPVGDSPYVKAKHVQLVDKDPDRAIALFWAAINSGDRVDSALKDMAIVM 85
Query: 64 KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
+QQ+R EA+EAIKS R CS QAQESLDNVL+DLYK+CGRLD+QI+LL+ KL +I+ G
Sbjct: 86 RQQNRPQEAIEAIKSLRSRCSDQAQESLDNVLLDLYKRCGRLDDQIDLLRHKLHLIHQGL 145
Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
AFNGK TKTARS GKKFQV+VKQE +R+LGNLGWAYMQ+ N+ +AE VYRKA I+ D N
Sbjct: 146 AFNGKRTKTARSQGKKFQVSVKQEATRLLGNLGWAYMQQLNFVAAEAVYRKALSIETDIN 205
Query: 184 KACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKELLQEL 232
K CNL CL+KQ R EA++ L+ V + + + ++ K+ R ++LL+EL
Sbjct: 206 KVCNLGICLMKQGRLEEAKAALQRVTVAYSGIRWASDSQLKSYERAQDLLKEL 258
>gi|414590076|tpg|DAA40647.1| TPA: hypothetical protein ZEAMMB73_529703 [Zea mays]
Length = 547
Score = 310 bits (793), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 150/237 (63%), Positives = 181/237 (76%), Gaps = 4/237 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
M +T +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 75 MLRTHSDSFHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWSAINAGDRVDSALKDMA 134
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL+ I+
Sbjct: 135 IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQRIH 194
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE YR+A LI P
Sbjct: 195 QGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALLIGP 254
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
D NK CNL CL+KQ R EA+ VL+ V + G + K R +E+L++LE
Sbjct: 255 DNNKMCNLGICLMKQGRVLEAKDVLKQVRPAGVDGLRGADSHLKAYERAQEMLRDLE 311
>gi|293333705|ref|NP_001170740.1| uncharacterized protein LOC100384832 [Zea mays]
gi|238007284|gb|ACR34677.1| unknown [Zea mays]
Length = 509
Score = 310 bits (793), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 150/237 (63%), Positives = 181/237 (76%), Gaps = 4/237 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
M +T +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 37 MLRTHSDSFHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWSAINAGDRVDSALKDMA 96
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL+ I+
Sbjct: 97 IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQRIH 156
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE YR+A LI P
Sbjct: 157 QGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALLIGP 216
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
D NK CNL CL+KQ R EA+ VL+ V + G + K R +E+L++LE
Sbjct: 217 DNNKMCNLGICLMKQGRVLEAKDVLKQVRPAGVDGLRGADSHLKAYERAQEMLRDLE 273
>gi|326515846|dbj|BAK07169.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 316
Score = 309 bits (792), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 160/292 (54%), Positives = 200/292 (68%), Gaps = 14/292 (4%)
Query: 10 HVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRV 69
HV K+P G PY RAKH QLVEKD +A+I FWKAI GD+VDSALKDMAVV+KQ+ +
Sbjct: 25 HVALKIPSGGGPYARAKHYQLVEKDLDASIAWFWKAIETGDKVDSALKDMAVVMKQRGYL 84
Query: 70 DEAVEAIKSFRHLC-SKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAF-NG 127
D+AV+AI+S RHLC KQ+QESLDN+L+DLYK GR E+IELLK KLR IYHG+AF G
Sbjct: 85 DDAVDAIRSLRHLCPGKQSQESLDNILLDLYKASGRTKEEIELLKHKLRRIYHGQAFPAG 144
Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
K TK ARSHG+K V+V+QETSR+LGNL WAYMQ+ N+ +AE VYRKAQ+++PDANKACN
Sbjct: 145 KSTKRARSHGRKIHVSVQQETSRVLGNLAWAYMQQRNFMAAEAVYRKAQMVEPDANKACN 204
Query: 188 LSHCLIKQARYTEARSVLEDVLLGKLSGSTE-----TKTINRVKELLQELEPWQSIPPSL 242
L+ CL++Q R +A VL V+ G E K + + +ELL+ +
Sbjct: 205 LAVCLVEQGRLGDAEGVLAYVVAGAFRDGREREHGGGKVVRKAEELLERIRAEMGGGGGE 264
Query: 243 TTKKSSLE---DAFLEGLDDLMNQWT-PYRS--RRLPIFEEISPF-RDQLAC 287
D E LD + QW PYR RRLP+FEEI+PF R+Q+AC
Sbjct: 265 KEAGEEDGAEADEMSELLDAVARQWAPPYRKSHRRLPVFEEITPFGREQMAC 316
>gi|224071329|ref|XP_002303407.1| predicted protein [Populus trichocarpa]
gi|222840839|gb|EEE78386.1| predicted protein [Populus trichocarpa]
Length = 238
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 150/236 (63%), Positives = 179/236 (75%), Gaps = 4/236 (1%)
Query: 2 SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
S+TR + +H +HK+P GDSPYVRAK+VQLV+KDPE AI LFW AINAGDRVDSALKDMA+
Sbjct: 1 SRTRSESFHALHKVPVGDSPYVRAKNVQLVDKDPEKAIPLFWAAINAGDRVDSALKDMAI 60
Query: 62 VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYH 121
V+KQQ+R DEA+EAIKS RH CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL MI
Sbjct: 61 VMKQQNRADEAIEAIKSLRHRCSDQAQESLDNILLDLYKRCGRLDDQIALLKHKLYMIQQ 120
Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY AE YR+A I PD
Sbjct: 121 GLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYIEAEDAYRRALAIAPD 180
Query: 182 ANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
NK CNL CL+KQ R +EA+ L V G + K R +++L +LE
Sbjct: 181 NNKMCNLGICLMKQGRISEAKETLRRVKPAVADGPRGVDSHLKAYERAQQMLNDLE 236
>gi|326532454|dbj|BAK05156.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 539
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 149/237 (62%), Positives = 182/237 (76%), Gaps = 4/237 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
M +T +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 45 MLRTHSDSFHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWGAINAGDRVDSALKDMA 104
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL++I+
Sbjct: 105 IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQLIH 164
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE YR+A +I P
Sbjct: 165 QGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALVIGP 224
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
D NK CNL CL+KQ R EA+ VL+ V + G + K R +E+L++LE
Sbjct: 225 DNNKMCNLGICLMKQGRVLEAKDVLKQVRPAGVDGLRGADSHLKAYERAQEMLRDLE 281
>gi|302779988|ref|XP_002971769.1| hypothetical protein SELMODRAFT_441590 [Selaginella moellendorffii]
gi|300160901|gb|EFJ27518.1| hypothetical protein SELMODRAFT_441590 [Selaginella moellendorffii]
Length = 439
Score = 308 bits (790), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 146/233 (62%), Positives = 185/233 (79%), Gaps = 1/233 (0%)
Query: 7 QPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQ 66
+ +HV+HK+P GD+PYV+AKHVQLV+KDP+ AI LFW AINAGDRVDSALKDMA+V+KQQ
Sbjct: 15 ESFHVLHKIPVGDTPYVKAKHVQLVDKDPDRAIALFWAAINAGDRVDSALKDMAIVMKQQ 74
Query: 67 DRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFN 126
+R +EA+EAIKS R C+ QAQE+LDNVL+DLYK+CGRLD+QI LLK+KL +I+ G AFN
Sbjct: 75 NRPEEAIEAIKSLRDRCTDQAQEALDNVLLDLYKRCGRLDDQIALLKRKLHLIHEGLAFN 134
Query: 127 GKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC 186
GK TKTARS G+KFQV+++QE SR+LGNLGWAYMQ+ NY +AE VYRKA I+PD+NK C
Sbjct: 135 GKRTKTARSQGRKFQVSIEQEASRLLGNLGWAYMQQSNYIAAEAVYRKALSIEPDSNKVC 194
Query: 187 NLSHCLIKQARYTEARSVLEDVLLGKLSGS-TETKTINRVKELLQELEPWQSI 238
NL C KQ + +AR LE V + S ++ KT R +E+L EL +S+
Sbjct: 195 NLGICFQKQGKLHDARVTLESVAPPAWNASPSQRKTYERAQEVLVELREMKSV 247
>gi|302760791|ref|XP_002963818.1| hypothetical protein SELMODRAFT_405259 [Selaginella moellendorffii]
gi|300169086|gb|EFJ35689.1| hypothetical protein SELMODRAFT_405259 [Selaginella moellendorffii]
Length = 439
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 146/233 (62%), Positives = 185/233 (79%), Gaps = 1/233 (0%)
Query: 7 QPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQ 66
+ +HV+HK+P GD+PYV+AKHVQLV+KDP+ AI LFW AINAGDRVDSALKDMA+V+KQQ
Sbjct: 15 ESFHVLHKIPVGDTPYVKAKHVQLVDKDPDRAIALFWAAINAGDRVDSALKDMAIVMKQQ 74
Query: 67 DRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFN 126
+R +EA+EAIKS R C+ QAQE+LDNVL+DLYK+CGRLD+QI LLK+KL +I+ G AFN
Sbjct: 75 NRPEEAIEAIKSLRDRCTDQAQEALDNVLLDLYKRCGRLDDQIALLKRKLHLIHEGLAFN 134
Query: 127 GKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC 186
GK TKTARS G+KFQV+++QE SR+LGNLGWAYMQ+ NY +AE VYRKA I+PD+NK C
Sbjct: 135 GKRTKTARSQGRKFQVSIEQEASRLLGNLGWAYMQQSNYIAAEAVYRKALSIEPDSNKVC 194
Query: 187 NLSHCLIKQARYTEARSVLEDVLLGKLSGS-TETKTINRVKELLQELEPWQSI 238
NL C KQ + +AR LE V + S ++ KT R +E+L EL +S+
Sbjct: 195 NLGICFQKQGKLHDARVTLESVAPPAWNASPSQRKTYERAQEVLVELREMKSV 247
>gi|255577530|ref|XP_002529643.1| conserved hypothetical protein [Ricinus communis]
gi|223530869|gb|EEF32730.1| conserved hypothetical protein [Ricinus communis]
Length = 482
Score = 307 bits (786), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 152/258 (58%), Positives = 188/258 (72%), Gaps = 4/258 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
+S+TR + +H +HK+P GD+PYVRAK+VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 28 VSRTRSESFHAIHKVPVGDTPYVRAKNVQLVDKDPEKAIPLFWAAINAGDRVDSALKDMA 87
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL +I
Sbjct: 88 IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLYLIQ 147
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY AE YR+A I P
Sbjct: 148 QGLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYVEAEDAYRRALSIAP 207
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
D NK CNL CL+KQ R +EA+ L V G + K R +++L++LE
Sbjct: 208 DNNKMCNLGICLMKQGRISEAKDTLRRVKPAVADGPRGVDSHLKAYERAQQMLKDLESEM 267
Query: 237 SIPPSLTTKKSSLEDAFL 254
++S L DAFL
Sbjct: 268 MNKGGDRVEQSRLFDAFL 285
>gi|449434138|ref|XP_004134853.1| PREDICTED: uncharacterized protein LOC101220568 [Cucumis sativus]
Length = 500
Score = 305 bits (781), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 149/237 (62%), Positives = 180/237 (75%), Gaps = 4/237 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
+S+ R PYHV HK+P GD+PYVRAK+VQLV KDP+ AI LFW AINAGDRVDSALKDMA
Sbjct: 28 VSRLRPDPYHVTHKVPVGDTPYVRAKNVQLVAKDPDKAIPLFWAAINAGDRVDSALKDMA 87
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R+ CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL +I
Sbjct: 88 IVMKQQNRAEEAIEAIKSLRNRCSDQAQESLDNILLDLYKRCGRLDDQIALLKHKLFLIQ 147
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY AE YRKA I P
Sbjct: 148 QGLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYVEAEDAYRKALTIAP 207
Query: 181 DANKACNLSHCLIKQARYTEA----RSVLEDVLLGKLSGSTETKTINRVKELLQELE 233
D NK CNL CL+KQ R +EA R V V+ G + K R +++L++LE
Sbjct: 208 DNNKMCNLGICLMKQGRISEAKENLRRVKPAVIDGPRGTDSHLKAYERAQQMLKDLE 264
>gi|224071331|ref|XP_002303408.1| predicted protein [Populus trichocarpa]
gi|222840840|gb|EEE78387.1| predicted protein [Populus trichocarpa]
Length = 233
Score = 305 bits (780), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 148/233 (63%), Positives = 176/233 (75%), Gaps = 4/233 (1%)
Query: 5 RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
R + +H +HK+P GDSPYVRAK+VQLV+KDPE AI LFW AINAGDRVDSALKDMA+V+K
Sbjct: 1 RSESFHALHKVPVGDSPYVRAKNVQLVDKDPEKAIPLFWAAINAGDRVDSALKDMAIVMK 60
Query: 65 QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
QQ+R DEA+EAIKS RH CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL MI G A
Sbjct: 61 QQNRADEAIEAIKSLRHRCSDQAQESLDNILLDLYKRCGRLDDQIALLKHKLYMIQQGLA 120
Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
FNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY AE YR+A I PD NK
Sbjct: 121 FNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYIEAEDAYRRALAIAPDNNK 180
Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
CNL CL+KQ R +EA+ L V G + K R +++L +LE
Sbjct: 181 MCNLGICLMKQGRISEAKETLRRVKPAVADGPRGVDSHLKAYERAQQMLNDLE 233
>gi|449491318|ref|XP_004158859.1| PREDICTED: uncharacterized LOC101220568 [Cucumis sativus]
Length = 409
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 149/237 (62%), Positives = 180/237 (75%), Gaps = 4/237 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
+S+ R PYHV HK+P GD+PYVRAK+VQLV KDP+ AI LFW AINAGDRVDSALKDMA
Sbjct: 28 VSRLRPDPYHVTHKVPVGDTPYVRAKNVQLVAKDPDKAIPLFWAAINAGDRVDSALKDMA 87
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R+ CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL +I
Sbjct: 88 IVMKQQNRAEEAIEAIKSLRNRCSDQAQESLDNILLDLYKRCGRLDDQIALLKHKLFLIQ 147
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY AE YRKA I P
Sbjct: 148 QGLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYVEAEDAYRKALTIAP 207
Query: 181 DANKACNLSHCLIKQARYTEA----RSVLEDVLLGKLSGSTETKTINRVKELLQELE 233
D NK CNL CL+KQ R +EA R V V+ G + K R +++L++LE
Sbjct: 208 DNNKMCNLGICLMKQGRISEAKENLRRVKPAVIDGPRGTDSHLKAYERAQQMLKDLE 264
>gi|356559262|ref|XP_003547919.1| PREDICTED: uncharacterized protein LOC100788369 [Glycine max]
Length = 501
Score = 304 bits (778), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 147/235 (62%), Positives = 178/235 (75%), Gaps = 4/235 (1%)
Query: 3 QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVV 62
+TR + +HV HK+P GD+PYVRAK+VQLV KDPE AI LFW AINAGDRVDSALKDMA+V
Sbjct: 31 RTRSETFHVAHKVPIGDTPYVRAKNVQLVNKDPERAIPLFWAAINAGDRVDSALKDMAIV 90
Query: 63 LKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHG 122
+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL +I G
Sbjct: 91 MKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQIALLKHKLYLIQQG 150
Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY AE YR+A LI PD
Sbjct: 151 LAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYIEAEEAYRRALLIAPDN 210
Query: 183 NKACNLSHCLIKQARYTEARSVL----EDVLLGKLSGSTETKTINRVKELLQELE 233
NK CNL CL+KQ R EA+ L V+ G + K R +++L++LE
Sbjct: 211 NKMCNLGICLMKQGRIGEAKETLYRVKPAVMDGPRGSDSHLKAYERAQQMLKDLE 265
>gi|147797595|emb|CAN75795.1| hypothetical protein VITISV_024890 [Vitis vinifera]
Length = 471
Score = 304 bits (778), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 151/258 (58%), Positives = 185/258 (71%), Gaps = 4/258 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
+S+TR + +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 24 VSRTRSESFHVTHKVPVGDTPYVRAKKVQLVDKDPEKAIPLFWAAINAGDRVDSALKDMA 83
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQ+SLDN+L+DLYK+CGRLD+QI LL+ KL +I
Sbjct: 84 IVMKQQNRAEEAIEAIKSLRSRCSDQAQDSLDNILLDLYKRCGRLDDQIALLRHKLFLIQ 143
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS GKKFQV+V QE +R+LGNLGWA MQ+ NY AE YR+A + P
Sbjct: 144 QGMAFNGKRTKTARSQGKKFQVSVGQEATRLLGNLGWALMQRNNYIEAEDAYRRALSMTP 203
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
D NK CNL CL+KQ R EA+ L V G + K R +++L +LE
Sbjct: 204 DNNKMCNLGICLMKQGRILEAKETLRRVKPAVADGLRGVDSHLKAFERARQMLLDLESEM 263
Query: 237 SIPPSLTTKKSSLEDAFL 254
S ++S L DAFL
Sbjct: 264 LXKGSDPVEQSRLFDAFL 281
>gi|225431084|ref|XP_002265165.1| PREDICTED: uncharacterized protein LOC100257355 [Vitis vinifera]
Length = 474
Score = 302 bits (774), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 152/258 (58%), Positives = 184/258 (71%), Gaps = 4/258 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
+S+TR +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 27 VSRTRSDSFHVTHKVPVGDTPYVRAKKVQLVDKDPEKAIPLFWSAINAGDRVDSALKDMA 86
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LL+ KL +I
Sbjct: 87 IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQIALLRHKLFLIQ 146
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS GKKFQV+V QE +R+LGNLGWA MQ+ NY AE YR+A + P
Sbjct: 147 QGMAFNGKRTKTARSQGKKFQVSVGQEATRLLGNLGWALMQQNNYIEAEDAYRRALSMAP 206
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
D NK CNL CL+KQ R EA+ L V G + K R +++L +LE
Sbjct: 207 DNNKMCNLGICLMKQGRILEAKETLRRVKPAVADGPRGVDSHLKAFERARQMLLDLESEM 266
Query: 237 SIPPSLTTKKSSLEDAFL 254
S ++S L DAFL
Sbjct: 267 LNKGSDPVEQSRLFDAFL 284
>gi|359476811|ref|XP_002265051.2| PREDICTED: uncharacterized protein LOC100245548 [Vitis vinifera]
Length = 1169
Score = 302 bits (774), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 151/258 (58%), Positives = 185/258 (71%), Gaps = 4/258 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
+S+TR + +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 99 VSRTRSESFHVTHKVPVGDTPYVRAKKVQLVDKDPEKAIPLFWAAINAGDRVDSALKDMA 158
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQ+SLDN+L+DLYK+CGRLD+QI LL+ KL +I
Sbjct: 159 IVMKQQNRAEEAIEAIKSLRSRCSDQAQDSLDNILLDLYKRCGRLDDQIALLRHKLFLIQ 218
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS GKKFQV+V QE +R+LGNLGWA MQ+ NY AE YR+A + P
Sbjct: 219 QGMAFNGKRTKTARSQGKKFQVSVGQEATRLLGNLGWALMQRNNYIEAEDAYRRALSMTP 278
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
D NK CNL CL+KQ R EA+ L V G + K R +++L +LE
Sbjct: 279 DNNKMCNLGICLMKQGRILEAKETLRRVKPAVADGLRGVDSHLKAFERARQMLLDLESEM 338
Query: 237 SIPPSLTTKKSSLEDAFL 254
S ++S L DAFL
Sbjct: 339 LNKGSDPVEQSRLFDAFL 356
>gi|297734982|emb|CBI17344.3| unnamed protein product [Vitis vinifera]
Length = 395
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 151/258 (58%), Positives = 185/258 (71%), Gaps = 4/258 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
+S+TR + +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 27 VSRTRSESFHVTHKVPVGDTPYVRAKKVQLVDKDPEKAIPLFWAAINAGDRVDSALKDMA 86
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQ+SLDN+L+DLYK+CGRLD+QI LL+ KL +I
Sbjct: 87 IVMKQQNRAEEAIEAIKSLRSRCSDQAQDSLDNILLDLYKRCGRLDDQIALLRHKLFLIQ 146
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS GKKFQV+V QE +R+LGNLGWA MQ+ NY AE YR+A + P
Sbjct: 147 QGMAFNGKRTKTARSQGKKFQVSVGQEATRLLGNLGWALMQRNNYIEAEDAYRRALSMTP 206
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
D NK CNL CL+KQ R EA+ L V G + K R +++L +LE
Sbjct: 207 DNNKMCNLGICLMKQGRILEAKETLRRVKPAVADGLRGVDSHLKAFERARQMLLDLESEM 266
Query: 237 SIPPSLTTKKSSLEDAFL 254
S ++S L DAFL
Sbjct: 267 LNKGSDPVEQSRLFDAFL 284
>gi|356502717|ref|XP_003520163.1| PREDICTED: uncharacterized protein LOC100803414 [Glycine max]
Length = 500
Score = 302 bits (773), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 151/256 (58%), Positives = 186/256 (72%), Gaps = 4/256 (1%)
Query: 3 QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVV 62
+TR + +HV HK+P GD+PYVRAK+VQLV+KDPE AI LFW AINAGDRVDSALKDMA+V
Sbjct: 31 RTRSESFHVAHKVPVGDTPYVRAKNVQLVDKDPERAIPLFWAAINAGDRVDSALKDMAIV 90
Query: 63 LKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHG 122
+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL +I G
Sbjct: 91 MKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQIGLLKHKLYLIQQG 150
Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY AE YR+A I PD
Sbjct: 151 LAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYIEAEDAYRRALSIAPDN 210
Query: 183 NKACNLSHCLIKQARYTEARSVL----EDVLLGKLSGSTETKTINRVKELLQELEPWQSI 238
NK CNL CL+KQ R EA+ L V+ G + K R +++L++LE
Sbjct: 211 NKMCNLGICLMKQGRIGEAKETLYRVKPAVMDGPRGSDSHLKAYERAQQMLKDLESEMMN 270
Query: 239 PPSLTTKKSSLEDAFL 254
++S L +AFL
Sbjct: 271 KGVDRIEQSRLFEAFL 286
>gi|297734984|emb|CBI17346.3| unnamed protein product [Vitis vinifera]
Length = 449
Score = 302 bits (773), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 152/258 (58%), Positives = 184/258 (71%), Gaps = 4/258 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
+S+TR +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA
Sbjct: 24 VSRTRSDSFHVTHKVPVGDTPYVRAKKVQLVDKDPEKAIPLFWSAINAGDRVDSALKDMA 83
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LL+ KL +I
Sbjct: 84 IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQIALLRHKLFLIQ 143
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS GKKFQV+V QE +R+LGNLGWA MQ+ NY AE YR+A + P
Sbjct: 144 QGMAFNGKRTKTARSQGKKFQVSVGQEATRLLGNLGWALMQQNNYIEAEDAYRRALSMAP 203
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
D NK CNL CL+KQ R EA+ L V G + K R +++L +LE
Sbjct: 204 DNNKMCNLGICLMKQGRILEAKETLRRVKPAVADGPRGVDSHLKAFERARQMLLDLESEM 263
Query: 237 SIPPSLTTKKSSLEDAFL 254
S ++S L DAFL
Sbjct: 264 LNKGSDPVEQSRLFDAFL 281
>gi|224133684|ref|XP_002327655.1| predicted protein [Populus trichocarpa]
gi|222836740|gb|EEE75133.1| predicted protein [Populus trichocarpa]
Length = 454
Score = 302 bits (773), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 150/258 (58%), Positives = 185/258 (71%), Gaps = 4/258 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
+S+TR + + +HK+P GDSPYVRAK+VQLV+KDPE A+ LFW AINAGDRVDSALKDMA
Sbjct: 28 VSRTRSESFQAIHKVPVGDSPYVRAKNVQLVDKDPEKAVPLFWAAINAGDRVDSALKDMA 87
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS RH CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL +I
Sbjct: 88 IVMKQQNRAEEAIEAIKSLRHRCSDQAQESLDNILLDLYKRCGRLDDQIALLKHKLFLIQ 147
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY AE YR+A P
Sbjct: 148 QGLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYIEAEDAYRRALATAP 207
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
D NK CNL CL+KQ R EA+ L V G + K R +++L++LE
Sbjct: 208 DNNKMCNLGICLMKQGRIGEAKETLRRVKPAVADGPRGVDSHLKAYERAQQMLKDLESEM 267
Query: 237 SIPPSLTTKKSSLEDAFL 254
++ L DAFL
Sbjct: 268 MSKGGDRVEQRRLFDAFL 285
>gi|357517969|ref|XP_003629273.1| hypothetical protein MTR_8g075260 [Medicago truncatula]
gi|355523295|gb|AET03749.1| hypothetical protein MTR_8g075260 [Medicago truncatula]
Length = 479
Score = 300 bits (767), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 143/235 (60%), Positives = 177/235 (75%), Gaps = 4/235 (1%)
Query: 3 QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVV 62
+TR + +H+ HK+P GD+PYVRAK+VQLV+KDPE AI LFW AINAGDRVDSALKDMA+V
Sbjct: 30 RTRSESFHITHKVPIGDTPYVRAKNVQLVDKDPERAIPLFWAAINAGDRVDSALKDMAIV 89
Query: 63 LKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHG 122
+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LL+ KL +I G
Sbjct: 90 MKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQIALLRHKLYLIQQG 149
Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ NY AE YR+A + PD
Sbjct: 150 LAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQQNNYIEAEEAYRRALCLAPDN 209
Query: 183 NKACNLSHCLIKQARYTEARSVLE----DVLLGKLSGSTETKTINRVKELLQELE 233
NK CNL CL+KQ R EA+ L V G + K R +++L++LE
Sbjct: 210 NKMCNLGICLMKQGRIAEAKETLHRVKPAVTDGPRGSDSHLKAYERAQQMLKDLE 264
>gi|15230439|ref|NP_190696.1| tetratricopeptide repeat domain-containing protein [Arabidopsis
thaliana]
gi|6562280|emb|CAB62650.1| MS5-like protein [Arabidopsis thaliana]
gi|30102656|gb|AAP21246.1| At3g51280 [Arabidopsis thaliana]
gi|110735963|dbj|BAE99956.1| MS5 like protein [Arabidopsis thaliana]
gi|332645252|gb|AEE78773.1| tetratricopeptide repeat domain-containing protein [Arabidopsis
thaliana]
Length = 430
Score = 300 bits (767), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 146/236 (61%), Positives = 179/236 (75%), Gaps = 4/236 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
+S+T+ + +H +HK+P GDSPYVRAK+VQLVEKDPE AI LFWKAINAGDRVDSALKDMA
Sbjct: 22 ISRTQSESFHAIHKVPVGDSPYVRAKNVQLVEKDPERAIPLFWKAINAGDRVDSALKDMA 81
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL +I
Sbjct: 82 IVMKQQNRAEEAIEAIKSLRVRCSDQAQESLDNILLDLYKRCGRLDDQIGLLKHKLFLIQ 141
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ N+ AE YR+A I P
Sbjct: 142 KGLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQRDNFVEAEDAYRRALSIAP 201
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQEL 232
D NK CNL CL+KQ R EA+ L V + G + K R +++L +L
Sbjct: 202 DNNKMCNLGICLMKQGRIDEAKETLRRVKPAVVDGPRGVDSHLKAYERAQQMLNDL 257
>gi|297816392|ref|XP_002876079.1| hypothetical protein ARALYDRAFT_485479 [Arabidopsis lyrata subsp.
lyrata]
gi|297321917|gb|EFH52338.1| hypothetical protein ARALYDRAFT_485479 [Arabidopsis lyrata subsp.
lyrata]
Length = 432
Score = 297 bits (760), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 146/236 (61%), Positives = 178/236 (75%), Gaps = 4/236 (1%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
+S+T+ + +H +HK+P GDSPYVRAK+VQLVEKDPE AI LFWKAINAGDRVDSALKDMA
Sbjct: 22 ISRTQSESFHAIHKVPVGDSPYVRAKNVQLVEKDPERAIPLFWKAINAGDRVDSALKDMA 81
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK CGRLD+QI LLK KL +I
Sbjct: 82 IVMKQQNRAEEAIEAIKSLRVRCSDQAQESLDNILLDLYKWCGRLDDQIGLLKHKLFLIQ 141
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ N+ AE YR+A I P
Sbjct: 142 KGLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQRDNFVEAEDAYRRALSIAP 201
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQEL 232
D NK CNL CL+KQ R EA+ L V + G + K R +++L +L
Sbjct: 202 DNNKMCNLGICLMKQGRIDEAKETLRRVKPAVVDGPRGVDSHLKAYERAQQMLNDL 257
>gi|194698136|gb|ACF83152.1| unknown [Zea mays]
gi|414888333|tpg|DAA64347.1| TPA: hypothetical protein ZEAMMB73_633017 [Zea mays]
Length = 516
Score = 295 bits (755), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 158/294 (53%), Positives = 191/294 (64%), Gaps = 38/294 (12%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINAGDRVDSALKDMA+V+KQQ+R
Sbjct: 39 FHVAHKVPVGDTPYVRAKRVQLVDKDPEKAIALFWSAINAGDRVDSALKDMAIVMKQQNR 98
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYK---------------------------- 100
+EA+EAIKS R CS QAQESLDN+L+DLYK
Sbjct: 99 AEEAIEAIKSLRSRCSDQAQESLDNILLDLYKVRARTSSTLISLHVRSIHASGSETVLNL 158
Query: 101 ---KCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGW 157
+CGRLD+QI LLK KL++I G AFNGK TKTARS G+KFQVT++QE +R+LGNLGW
Sbjct: 159 NAQRCGRLDDQISLLKHKLQLINQGHAFNGKRTKTARSQGRKFQVTLEQEATRLLGNLGW 218
Query: 158 AYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG-- 215
A MQK NYT AE YR+A LI PD NK CNL CL+KQAR EA+ VL+ V + G
Sbjct: 219 ALMQKENYTEAEGAYRRALLIGPDNNKMCNLGICLMKQARVLEAKDVLKQVRPAAVDGLR 278
Query: 216 --STETKTINRVKELLQELEP-WQSIPPSLTTKKSSLEDAFLEGLDDLMNQWTP 266
+ K R +E+L++LE PP+ + L DA L G + W P
Sbjct: 279 GADSHLKAYERAQEMLRDLETKLIGRPPADQLDTNWLFDALLLGSSSSI--WQP 330
>gi|118487530|gb|ABK95592.1| unknown [Populus trichocarpa]
Length = 690
Score = 288 bits (738), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 142/230 (61%), Positives = 177/230 (76%), Gaps = 7/230 (3%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HVVHK+P GDSPYVRAKHVQL+EKDP A+ LFW AINAGDRVDSALKDMAVV+KQ DR
Sbjct: 53 FHVVHKVPAGDSPYVRAKHVQLIEKDPSKAVSLFWAAINAGDRVDSALKDMAVVMKQLDR 112
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
DEA+EAIKSFRHLC +QES+DNVL++LYK+ GR++E+IE+L+ KL++I G AF+GK
Sbjct: 113 ADEAIEAIKSFRHLCPCDSQESIDNVLVELYKRSGRIEEEIEMLQCKLKLIEEGIAFSGK 172
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TKTARSHG+K Q+TV+QE SRILGNL WAY+Q +Y AE YRKA ++PD NK CNL
Sbjct: 173 KTKTARSHGRKIQITVEQERSRILGNLAWAYLQHHDYGLAEQHYRKALSLEPDQNKQCNL 232
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGS-----TETKTINRVKELLQELE 233
+ CL+ R EA+S+L+ V LSGS + K+ R ++L E E
Sbjct: 233 AICLMHMNRIPEAKSLLQTV--KALSGSKPMDDSYAKSFERACQILAEFE 280
>gi|224122150|ref|XP_002330553.1| predicted protein [Populus trichocarpa]
gi|222872111|gb|EEF09242.1| predicted protein [Populus trichocarpa]
Length = 232
Score = 288 bits (738), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 142/230 (61%), Positives = 177/230 (76%), Gaps = 7/230 (3%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HVVHK+P GDSPYVRAKHVQL+EKDP A+ LFW AINAGDRVDSALKDMAVV+KQ DR
Sbjct: 5 FHVVHKVPAGDSPYVRAKHVQLIEKDPSKAVSLFWAAINAGDRVDSALKDMAVVMKQLDR 64
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
DEA+EAIKSFRHLC +QES+DNVL++LYK+ GR++E+IE+L+ KL++I G AF+GK
Sbjct: 65 ADEAIEAIKSFRHLCPCDSQESIDNVLVELYKRSGRIEEEIEMLQCKLKLIEEGIAFSGK 124
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TKTARSHG+K Q+TV+QE SRILGNL WAY+Q +Y AE YRKA ++PD NK CNL
Sbjct: 125 KTKTARSHGRKIQITVEQERSRILGNLAWAYLQHHDYGLAEQHYRKALSLEPDQNKQCNL 184
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGS-----TETKTINRVKELLQELE 233
+ CL+ R EA+S+L+ V LSGS + K+ R ++L E E
Sbjct: 185 AICLMHMNRIPEAKSLLQTV--KALSGSKPMDDSYAKSFERACQILAEFE 232
>gi|297740866|emb|CBI31048.3| unnamed protein product [Vitis vinifera]
Length = 673
Score = 287 bits (735), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 137/241 (56%), Positives = 181/241 (75%), Gaps = 3/241 (1%)
Query: 4 TRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVL 63
+D +HVVHK+P GDSPYVRAK VQL++KDP AI LFW AIN+GDRVDSALKDMAVV+
Sbjct: 41 NKDDLFHVVHKVPAGDSPYVRAKQVQLIDKDPNRAISLFWAAINSGDRVDSALKDMAVVM 100
Query: 64 KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
KQ +R DEA+EAIKSFRHLC +++QESLDNVL++LYK+ GRLDEQIE+L+ KL+ I G
Sbjct: 101 KQLNRSDEAIEAIKSFRHLCPQESQESLDNVLVELYKRSGRLDEQIEMLQYKLKNIDEGS 160
Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
AF GK TK ARS GKK Q++++QE SR+LGNL WAY+Q+GNY +A +Y++A +DPD N
Sbjct: 161 AFGGKRTKIARSQGKKIQISIEQEKSRLLGNLAWAYLQQGNYKTAGELYKQALALDPDRN 220
Query: 184 KACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTE---TKTINRVKELLQELEPWQSIPP 240
K CNL+ CL+ + EA+++L + + +G + K+ R ++L ELE I P
Sbjct: 221 KECNLAICLMYMNKIKEAKAMLYAIQVSSQNGRMDDSYVKSFERASQVLTELEANSVIDP 280
Query: 241 S 241
+
Sbjct: 281 N 281
>gi|359483722|ref|XP_002266141.2| PREDICTED: uncharacterized protein LOC100242406 [Vitis vinifera]
Length = 941
Score = 287 bits (734), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 137/241 (56%), Positives = 181/241 (75%), Gaps = 3/241 (1%)
Query: 4 TRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVL 63
+D +HVVHK+P GDSPYVRAK VQL++KDP AI LFW AIN+GDRVDSALKDMAVV+
Sbjct: 41 NKDDLFHVVHKVPAGDSPYVRAKQVQLIDKDPNRAISLFWAAINSGDRVDSALKDMAVVM 100
Query: 64 KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
KQ +R DEA+EAIKSFRHLC +++QESLDNVL++LYK+ GRLDEQIE+L+ KL+ I G
Sbjct: 101 KQLNRSDEAIEAIKSFRHLCPQESQESLDNVLVELYKRSGRLDEQIEMLQYKLKNIDEGS 160
Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
AF GK TK ARS GKK Q++++QE SR+LGNL WAY+Q+GNY +A +Y++A +DPD N
Sbjct: 161 AFGGKRTKIARSQGKKIQISIEQEKSRLLGNLAWAYLQQGNYKTAGELYKQALALDPDRN 220
Query: 184 KACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTE---TKTINRVKELLQELEPWQSIPP 240
K CNL+ CL+ + EA+++L + + +G + K+ R ++L ELE I P
Sbjct: 221 KECNLAICLMYMNKIKEAKAMLYAIQVSSQNGRMDDSYVKSFERASQVLTELEANSVIDP 280
Query: 241 S 241
+
Sbjct: 281 N 281
>gi|224061607|ref|XP_002300564.1| predicted protein [Populus trichocarpa]
gi|222847822|gb|EEE85369.1| predicted protein [Populus trichocarpa]
Length = 281
Score = 283 bits (724), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 138/230 (60%), Positives = 175/230 (76%), Gaps = 7/230 (3%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HV+HK+P GDSPYVRAKHVQL+EKDP A+ LFW AIN+GDRVDSALKDMAVV+KQ DR
Sbjct: 53 FHVIHKVPAGDSPYVRAKHVQLIEKDPSKAVSLFWAAINSGDRVDSALKDMAVVMKQLDR 112
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
DEA+EAIKSFRHLC +QES+DNVL++LYK+ GR++E+IE+L++KL+ I G AF+GK
Sbjct: 113 ADEAIEAIKSFRHLCPYDSQESIDNVLVELYKRSGRIEEEIEMLQRKLKNIEEGIAFSGK 172
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TKTARS G+K Q+TV+QE SRILGNL WAY+Q +Y AE YRK ++PD NK CNL
Sbjct: 173 KTKTARSQGRKIQITVEQERSRILGNLAWAYLQHHDYGLAEQYYRKGLSMEPDQNKQCNL 232
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGS-----TETKTINRVKELLQELE 233
+ CL+ R EA+S+L+ V SGS + K+ R ++L ELE
Sbjct: 233 AICLMHMNRIPEAKSLLQTVKAS--SGSKPMDDSYAKSFERACQILTELE 280
>gi|357448553|ref|XP_003594552.1| hypothetical protein MTR_2g030510 [Medicago truncatula]
gi|355483600|gb|AES64803.1| hypothetical protein MTR_2g030510 [Medicago truncatula]
Length = 617
Score = 277 bits (709), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 138/257 (53%), Positives = 184/257 (71%), Gaps = 6/257 (2%)
Query: 2 SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
S ++D +HV+HK+P GDSPYV+AK VQLV+KDP AI LFW AINAGDRV+SALKDMA+
Sbjct: 44 SVSKDDLFHVIHKVPSGDSPYVKAKQVQLVDKDPGKAISLFWAAINAGDRVESALKDMAL 103
Query: 62 VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYH 121
V+KQ +R DEA+EAIKSFRHLC +QESLDN+L++LYK+ GR+DE+I +L QKL+ I
Sbjct: 104 VMKQLNRSDEAIEAIKSFRHLCPSDSQESLDNILVELYKRSGRVDEEIGMLHQKLKQIED 163
Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
G + G+ TK ARS GKK Q++ +QE SRILGNL WAY+QKG+Y +AE YRKA + D
Sbjct: 164 GMTYVGRTTKHARSQGKKIQISAEQEISRILGNLAWAYLQKGDYKTAEEHYRKALSFEVD 223
Query: 182 ANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTE---TKTINRVKELLQELEPWQSI 238
NK CNL+ CL++ + TEAR +L+ V + + K+ R ++LQE+E S
Sbjct: 224 RNKQCNLAICLMQMNKVTEARFLLQAVTAATKNRKMDDSFVKSYERATQMLQEME---ST 280
Query: 239 PPSLTTKKSSLEDAFLE 255
PS+ + K ++ F E
Sbjct: 281 APSVDSVKDKGDNKFNE 297
>gi|147797600|emb|CAN75800.1| hypothetical protein VITISV_024895 [Vitis vinifera]
Length = 402
Score = 273 bits (697), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 144/258 (55%), Positives = 175/258 (67%), Gaps = 13/258 (5%)
Query: 1 MSQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMA 60
+S+TR +HV HK+P GD+PYVRAK VQLV+KDPE AI LFW AINA DRVDSALKDMA
Sbjct: 24 VSRTRSDSFHVTHKVPVGDTPYVRAKKVQLVDKDPEKAIPLFWAAINAXDRVDSALKDMA 83
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
+V+KQQ+R +EA+EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LL+ KL +I
Sbjct: 84 IVMKQQNRAEEAIEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQIALLRHKLFLIQ 143
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G AFNGK TKTARS GKKFQV+V QE +R+L N NY AE YR+A + P
Sbjct: 144 QGMAFNGKRTKTARSQGKKFQVSVGQEATRLLQN---------NYIEAEDAYRRALSMAP 194
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELEPWQ 236
D NK CNL CL+KQ R EA+ L V G + K R +++L +LE
Sbjct: 195 DNNKMCNLGICLMKQGRILEAKETLRRVKPAVADGPRGVDSHLKAFERARQMLLDLESEM 254
Query: 237 SIPPSLTTKKSSLEDAFL 254
S ++S L DAFL
Sbjct: 255 LNXGSDPVEQSRLFDAFL 272
>gi|449435896|ref|XP_004135730.1| PREDICTED: uncharacterized protein LOC101215262 [Cucumis sativus]
Length = 683
Score = 269 bits (688), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 132/228 (57%), Positives = 168/228 (73%), Gaps = 3/228 (1%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HV+HK+P GDSPYV+AK VQL+EKDP A+ LFW AINAGDRVDSALKDMAVV+KQ DR
Sbjct: 52 FHVIHKVPAGDSPYVKAKQVQLIEKDPSRAVSLFWAAINAGDRVDSALKDMAVVMKQLDR 111
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
DEA+EAIKSFRHLC +QES+DNVLI+LYK+ GR++E+I++L++KL+ I G F GK
Sbjct: 112 SDEAIEAIKSFRHLCPYDSQESIDNVLIELYKRSGRIEEEIDMLQRKLKQIEDGTIFGGK 171
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TK ARS GKK Q+T++QE SR+LGNL WA++Q N AE YRKA ++ D NK CNL
Sbjct: 172 RTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLNNIYVAEDYYRKALSLEADNNKKCNL 231
Query: 189 SHCLIKQARYTEARSVLEDVLL---GKLSGSTETKTINRVKELLQELE 233
+ C I R TEA+S+L+ V GK + + K+ R +L E E
Sbjct: 232 AICQILTNRLTEAKSLLQSVRASSGGKPTEESYAKSFERAFHMLTEKE 279
>gi|449488574|ref|XP_004158093.1| PREDICTED: uncharacterized protein LOC101226286 [Cucumis sativus]
Length = 297
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 131/226 (57%), Positives = 166/226 (73%), Gaps = 3/226 (1%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+HV+HK+P GDSPYV+AK VQL+EKDP A+ LFW AINAGDRVDSALKDMAVV+KQ DR
Sbjct: 52 FHVIHKVPAGDSPYVKAKQVQLIEKDPSRAVSLFWAAINAGDRVDSALKDMAVVMKQLDR 111
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
DEA+EAIKSFRHLC +QES+DNVLI+LYK+ GR++E+I++L++KL+ I G F GK
Sbjct: 112 SDEAIEAIKSFRHLCPYDSQESIDNVLIELYKRSGRIEEEIDMLQRKLKQIEDGTIFGGK 171
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TK ARS GKK Q+T++QE SR+LGNL WA++Q N AE YRKA ++ D NK CNL
Sbjct: 172 RTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLNNIYVAEDYYRKALSLEADNNKKCNL 231
Query: 189 SHCLIKQARYTEARSVLEDVLL---GKLSGSTETKTINRVKELLQE 231
+ C I R TEA+S+L+ V GK + K+ R +L E
Sbjct: 232 AICQILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERAFHMLTE 277
>gi|147809582|emb|CAN66650.1| hypothetical protein VITISV_043263 [Vitis vinifera]
Length = 217
Score = 266 bits (680), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 142/261 (54%), Positives = 175/261 (67%), Gaps = 47/261 (18%)
Query: 30 LVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE 89
+VEKDPEAAIVLFW AINA DRV+SALKDMAVV+K+ DR +EA+EAIK+ HL + +
Sbjct: 1 MVEKDPEAAIVLFWNAINARDRVESALKDMAVVMKRLDRSEEAIEAIKN--HLITSSSTY 58
Query: 90 SLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETS 149
TKTARS+ KKFQ+T++QETS
Sbjct: 59 ----------------------------------------TKTARSNRKKFQITIEQETS 78
Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
RILGNLGW YMQK NY +AEVVY+KAQ+IDPDANKACNL+ CLI Q RYTEA SVL++VL
Sbjct: 79 RILGNLGWVYMQKSNYMAAEVVYKKAQMIDPDANKACNLALCLINQGRYTEAHSVLKEVL 138
Query: 210 LGKLSGSTETKTINRVKELLQELEP-WQSIPPSLTTKKSS--LEDAFLEGLDDLMNQWTP 266
GKL S + K NR +EL+ E+EP W +PPS K LED F +G + ++N W P
Sbjct: 139 QGKLPNSEDCKAQNRAQELMLEVEPKW--LPPSEXIKLEGFDLEDDFSDGFEKVLNIWAP 196
Query: 267 YRSRRLPIFEEISPFRDQLAC 287
+R++RLPIFEEIS +R+QLAC
Sbjct: 197 FRTKRLPIFEEISSYRNQLAC 217
>gi|356532429|ref|XP_003534775.1| PREDICTED: uncharacterized protein LOC100792980 [Glycine max]
Length = 657
Score = 261 bits (666), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 125/228 (54%), Positives = 166/228 (72%), Gaps = 3/228 (1%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+H++HK+P GDSPYV+AK VQLV+KDP A+ LFW AINAGDRV+SALKDMA+V+KQ +R
Sbjct: 107 FHIIHKVPAGDSPYVKAKQVQLVDKDPGRAVSLFWAAINAGDRVESALKDMALVMKQLNR 166
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
DEA+EAI+SFRHLC +Q+SLDN+L++LYK+ GR+DE+I +L KL+ I G F G+
Sbjct: 167 SDEAIEAIRSFRHLCPSDSQDSLDNILVELYKRSGRVDEEIAMLCHKLKQIEDGLTFVGR 226
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TK ARS GKK Q+T +QE SRILGNL WAY+QKG+Y +AE YRKA + D NK CNL
Sbjct: 227 TTKQARSQGKKIQITAEQEISRILGNLAWAYLQKGDYKTAEEHYRKALSFEVDRNKQCNL 286
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTE---TKTINRVKELLQELE 233
+ CLI + EA+ +L+ V + + K+ R ++L E+E
Sbjct: 287 AICLIHMNKIKEAKFLLQAVRTATKNRKMDDSFAKSFERASQMLIEIE 334
>gi|102139799|gb|ABF69984.1| male sterility protein-related [Musa acuminata]
Length = 778
Score = 259 bits (661), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 125/222 (56%), Positives = 163/222 (73%), Gaps = 2/222 (0%)
Query: 19 DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS 78
DSPY+RAKH Q+V+KD + A+ LFW AIN GDRVDSALKDMA+ +KQ +R +EA+EAIKS
Sbjct: 53 DSPYLRAKHAQMVDKDLQKAVPLFWAAINCGDRVDSALKDMALAMKQVNRAEEAIEAIKS 112
Query: 79 FRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGK 138
FRHLCS + QES+DNVL++LYKKCGR+D+QIELL KL+MI G AF G+ TK RS GK
Sbjct: 113 FRHLCSPKTQESIDNVLLELYKKCGRIDDQIELLNFKLKMIDEGLAFGGRRTKLTRSKGK 172
Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARY 198
KF V++ E SR+LGNL WAYMQ NY +AE +Y KA I+ D NK CNL+ CL+K R
Sbjct: 173 KFHVSLDHEKSRLLGNLAWAYMQSENYETAETLYWKALAIEQDYNKQCNLAICLMKTGRL 232
Query: 199 TEARSVLEDV--LLGKLSGSTETKTINRVKELLQELEPWQSI 238
EARS+L+ V + K+ + ++L+E+EP +++
Sbjct: 233 EEARSILQVVKRASSNICDQFFVKSFKQASQMLKEIEPQENL 274
>gi|356558069|ref|XP_003547331.1| PREDICTED: uncharacterized protein LOC100794684 [Glycine max]
Length = 595
Score = 258 bits (658), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 124/228 (54%), Positives = 165/228 (72%), Gaps = 3/228 (1%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
+H++HK+P GDSPYV+AK VQLV+KDP AI LFW AINA DRV+SALKDMA+V+KQ +R
Sbjct: 44 FHIIHKVPAGDSPYVKAKQVQLVDKDPGRAISLFWAAINARDRVESALKDMALVMKQLNR 103
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
DEA+EAI+SFRHLC +++SLDN+L++LYK+ GR+DE+I +L KL+ I G F G+
Sbjct: 104 SDEAIEAIRSFRHLCPSDSRDSLDNILVELYKRSGRIDEEIAMLHHKLKQIEDGLTFVGR 163
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL 188
TK ARS GKK Q+T +QE SRILGNL WAY+QKG+Y +AE YRKA + D NK CNL
Sbjct: 164 TTKQARSQGKKIQITAEQEISRILGNLAWAYLQKGDYKAAEEHYRKALSFEVDRNKQCNL 223
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTE---TKTINRVKELLQELE 233
+ CLI + EA+ +L+ V + + K+ R ++L E+E
Sbjct: 224 AICLIHMNKIKEAKFLLQAVRTATKNRKMDDSFAKSFERASQMLIEIE 271
>gi|297799972|ref|XP_002867870.1| male sterility MS5 [Arabidopsis lyrata subsp. lyrata]
gi|297313706|gb|EFH44129.1| male sterility MS5 [Arabidopsis lyrata subsp. lyrata]
Length = 426
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 123/236 (52%), Positives = 169/236 (71%), Gaps = 5/236 (2%)
Query: 2 SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
S+ RD P+H+VHK+P GDSPYVRAKH QL++KDP AI LFW AINAGDRVDSALKDM V
Sbjct: 44 SEKRD-PFHIVHKVPSGDSPYVRAKHAQLIDKDPNRAISLFWTAINAGDRVDSALKDMVV 102
Query: 62 VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYH 121
V+KQ DR DE +EAI+SFR+LCS ++Q+S+DN+L++LYKK GR++E+ ELL+ KL+ +
Sbjct: 103 VMKQLDRSDEGIEAIRSFRYLCSFESQDSIDNLLLELYKKSGRIEEEAELLEHKLKTLEQ 162
Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
G F G+ + R GK +T++QE +R+LGNLGW ++Q NY AE YR+A ++ D
Sbjct: 163 GMGFGGRVIRAKRVQGKHVTMTIEQEKARVLGNLGWVHLQLHNYGIAEQHYRRALGLERD 222
Query: 182 ANKACNLSHCLIKQARYTEARSVLEDV----LLGKLSGSTETKTINRVKELLQELE 233
NK CNL+ CL++ R EA+S+L+DV + K+ +R E+L E+E
Sbjct: 223 KNKQCNLAICLMRMGRIPEAKSLLDDVRDSPTESECGDEPFAKSYDRAVEMLAEIE 278
>gi|4028970|gb|AAC97106.1| pollenless3 [Arabidopsis thaliana]
Length = 434
Score = 254 bits (648), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 123/236 (52%), Positives = 168/236 (71%), Gaps = 4/236 (1%)
Query: 2 SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
S R P+H+VHK+P GDSPYVRAKH QL++KDP AI LFW AINAGDRVDSALKDMAV
Sbjct: 44 SSERRDPFHIVHKVPSGDSPYVRAKHAQLIDKDPNRAISLFWTAINAGDRVDSALKDMAV 103
Query: 62 VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYH 121
V+KQ R DE +EAIKSFR+LCS ++Q+S+DN+L++LYKK GR++E+ LL+ KL+ +
Sbjct: 104 VMKQLGRSDEGIEAIKSFRYLCSFESQDSIDNLLLELYKKSGRIEEEAVLLEHKLQTLEQ 163
Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
G F G+ ++ R GK +T++QE +RILGNLGW ++Q NY AE YR+A ++ D
Sbjct: 164 GMGFGGRVSRAKRVQGKHVIMTIEQEKARILGNLGWVHLQLHNYGIAEQHYRRALGLERD 223
Query: 182 ANKACNLSHCLIKQARYTEARSVLEDV----LLGKLSGSTETKTINRVKELLQELE 233
NK CNL+ CL++ +R EA+S+L+DV + K+ +R E+L E+E
Sbjct: 224 KNKLCNLAICLMRMSRIPEAKSLLDDVRDSPAESECGDEPFAKSYDRAVEMLAEIE 279
>gi|3859112|gb|AAC72541.1| male sterility MS5 [Arabidopsis thaliana]
Length = 434
Score = 254 bits (648), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 123/236 (52%), Positives = 168/236 (71%), Gaps = 4/236 (1%)
Query: 2 SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
S R P+H+VHK+P GDSPYVRAKH QL++KDP AI LFW AINAGDRVDSALKDMAV
Sbjct: 44 SSERRDPFHIVHKVPSGDSPYVRAKHAQLIDKDPNRAISLFWTAINAGDRVDSALKDMAV 103
Query: 62 VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYH 121
V+KQ R DE +EAIKSFR+LCS ++Q+S+DN+L++LYKK GR++E+ LL+ KL+ +
Sbjct: 104 VMKQLGRSDEGIEAIKSFRYLCSFESQDSIDNLLLELYKKSGRIEEEAVLLEHKLQTLEQ 163
Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
G F G+ ++ R GK +T++QE +RILGNLGW ++Q NY AE YR+A ++ D
Sbjct: 164 GMGFGGRVSRAKRVQGKHVIMTIEQEKARILGNLGWVHLQLHNYGIAEQHYRRALGLERD 223
Query: 182 ANKACNLSHCLIKQARYTEARSVLEDV----LLGKLSGSTETKTINRVKELLQELE 233
NK CNL+ CL++ +R EA+S+L+DV + K+ +R E+L E+E
Sbjct: 224 KNKLCNLAICLMRMSRIPEAKSLLDDVRDSPAESECGDEPFAKSYDRAVEMLAEIE 279
>gi|449532994|ref|XP_004173462.1| PREDICTED: uncharacterized protein LOC101227090, partial [Cucumis
sativus]
Length = 188
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 123/188 (65%), Positives = 148/188 (78%), Gaps = 1/188 (0%)
Query: 101 KCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYM 160
KCGR++EQIELLK+KLRMIY GEAFNGKPT+TARSHGKKFQV+VKQETSR+LGNLGWAYM
Sbjct: 1 KCGRIEEQIELLKRKLRMIYQGEAFNGKPTRTARSHGKKFQVSVKQETSRLLGNLGWAYM 60
Query: 161 QKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETK 220
QK NY AE VY+KAQ+IDPDANKACNL CL+KQ R +EA VLE V ++ GS+E K
Sbjct: 61 QKPNYMMAEAVYKKAQMIDPDANKACNLGLCLMKQGRLSEAIFVLEQVQQAQIPGSSEIK 120
Query: 221 TINRVKELLQELEPWQSIPPSLTTKKSSLEDAFLEGLDDLMNQWTPY-RSRRLPIFEEIS 279
R +LL E+ QS+P S+ S++ FL GL+ L+N+ P+ RS+RLP+FEEIS
Sbjct: 121 AQKRSADLLTEIRSRQSLPDSIDLLGLSVDVDFLNGLELLVNKKGPFSRSKRLPVFEEIS 180
Query: 280 PFRDQLAC 287
FRDQLAC
Sbjct: 181 SFRDQLAC 188
>gi|15233432|ref|NP_193822.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
gi|5262780|emb|CAB45885.1| putative protein [Arabidopsis thaliana]
gi|7268886|emb|CAB79090.1| putative protein [Arabidopsis thaliana]
gi|332658973|gb|AEE84373.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
Length = 450
Score = 243 bits (621), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 123/252 (48%), Positives = 168/252 (66%), Gaps = 20/252 (7%)
Query: 2 SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
S R P+H+VHK+P GDSPYVRAKH QL++KDP AI LFW AINAGDRVDSALKDMAV
Sbjct: 44 SSERRDPFHIVHKVPSGDSPYVRAKHAQLIDKDPNRAISLFWTAINAGDRVDSALKDMAV 103
Query: 62 VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYH 121
V+KQ R DE +EAIKSFR+LCS ++Q+S+DN+L++LYKK GR++E+ LL+ KL+ +
Sbjct: 104 VMKQLGRSDEGIEAIKSFRYLCSFESQDSIDNLLLELYKKSGRIEEEAVLLEHKLQTLEQ 163
Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYR-------- 173
G F G+ ++ R GK +T++QE +RILGNLGW ++Q NY AE YR
Sbjct: 164 GMGFGGRVSRAKRVQGKHVIMTIEQEKARILGNLGWVHLQLHNYGIAEQHYRFGFVTKIP 223
Query: 174 --------KAQLIDPDANKACNLSHCLIKQARYTEARSVLEDV----LLGKLSGSTETKT 221
+A ++ D NK CNL+ CL++ +R EA+S+L+DV + K+
Sbjct: 224 NIDYCLVMRALGLERDKNKLCNLAICLMRMSRIPEAKSLLDDVRDSPAESECGDEPFAKS 283
Query: 222 INRVKELLQELE 233
+R E+L E+E
Sbjct: 284 YDRAVEMLAEIE 295
>gi|326504242|dbj|BAJ90953.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 639
Score = 241 bits (616), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 119/232 (51%), Positives = 161/232 (69%), Gaps = 2/232 (0%)
Query: 16 PPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA 75
P DSPY RAK QL+EKDP A+ LFW AIN+G+R++SALKDMA VLKQ DR +EA+EA
Sbjct: 70 PAADSPYARAKQAQLIEKDPNKAVPLFWAAINSGERIESALKDMATVLKQADRAEEAIEA 129
Query: 76 IKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARS 135
I+SFR C +AQESLDN+L+DLYKKCGR EQIE+L KLRMI A TK ++S
Sbjct: 130 IRSFRDRCPNEAQESLDNILLDLYKKCGRTKEQIEMLTVKLRMIDEDLASGRWKTKLSKS 189
Query: 136 HGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQ 195
HG+ F ++++ E +R+LGNL WAYMQ NY AE++YR+A I+ D NK CNL+ C++K
Sbjct: 190 HGRVFYLSLRDEKARLLGNLAWAYMQSENYEEAEMLYRQALAIETDYNKECNLAICMMKM 249
Query: 196 ARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
+ EA+ +L+ + + ++ R E+L+ELE Q++P +T KS
Sbjct: 250 GKVAEAKYLLQAIPY-NCNDENHVRSFARATEVLRELES-QALPSPITQMKS 299
>gi|297794971|ref|XP_002865370.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311205|gb|EFH41629.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 470
Score = 236 bits (603), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 116/223 (52%), Positives = 155/223 (69%), Gaps = 3/223 (1%)
Query: 14 KLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV 73
++P GDSPYVRAKH QLV KDP+ AI LFW A+NAGDRVDSALK+M VVLK DR DE +
Sbjct: 49 RVPTGDSPYVRAKHAQLVSKDPDRAISLFWAAVNAGDRVDSALKNMVVVLKHLDRSDEGI 108
Query: 74 EAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTA 133
EAIKSFR+LC ++Q+S+DN+L++LYKK GR+ E+ ELL+ KLR + H + G+
Sbjct: 109 EAIKSFRYLCPFESQDSIDNLLLELYKKSGRITEEAELLEHKLRTLEHDTHYGGRMKIAK 168
Query: 134 RSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLI 193
RSHG++ +T++QE ++ILGNL W ++Q NY AE YR A ++PD NK CNL+ CL+
Sbjct: 169 RSHGEQNNMTIEQEKAQILGNLAWVHLQLHNYGIAEQYYRNALSLEPDNNKLCNLAICLM 228
Query: 194 KQARYTEARSVLEDV---LLGKLSGSTETKTINRVKELLQELE 233
+ R EA+ +LEDV L + G + R E+L E E
Sbjct: 229 RMDRIPEAKPLLEDVRQSLGNQWKGEPFRNSFERATEMLGERE 271
>gi|357144581|ref|XP_003573343.1| PREDICTED: uncharacterized protein LOC100825322 [Brachypodium
distachyon]
Length = 660
Score = 234 bits (598), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 118/236 (50%), Positives = 163/236 (69%), Gaps = 4/236 (1%)
Query: 14 KLPPG--DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDE 71
+ PP DSPYVRAK QL+EKDP A+ LFW AIN+G+R++SALKDMA VLKQ +R +E
Sbjct: 88 RAPPAVVDSPYVRAKQAQLIEKDPNKAVPLFWAAINSGERIESALKDMATVLKQANRAEE 147
Query: 72 AVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTK 131
A+EAI+SFR C +AQ+SLDNVL+DLYKKCGR EQIE+L KLRM+ A TK
Sbjct: 148 AIEAIRSFRDRCPNEAQDSLDNVLLDLYKKCGRTKEQIEMLTIKLRMVDEDLASGRWKTK 207
Query: 132 TARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHC 191
++SHG+ ++++ E +R+LGNL WA+MQ NY AE++YR+A I+ D NK CNL+ C
Sbjct: 208 LSKSHGRVVYLSLRDEKARLLGNLAWAHMQSENYEEAEMLYRQALAIEADYNKECNLAVC 267
Query: 192 LIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
L+K + EA+ +L+ + S ++ R E+++ELE Q++P +T KS
Sbjct: 268 LMKTGKVAEAKYLLQAIPYNS-SDEKHVRSFARATEMIKELES-QALPSPITQMKS 321
>gi|242080455|ref|XP_002444996.1| hypothetical protein SORBIDRAFT_07g002510 [Sorghum bicolor]
gi|241941346|gb|EES14491.1| hypothetical protein SORBIDRAFT_07g002510 [Sorghum bicolor]
Length = 712
Score = 234 bits (597), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 123/246 (50%), Positives = 166/246 (67%), Gaps = 6/246 (2%)
Query: 22 YVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRH 81
YVRAK Q+VEKDP A+ LFW AIN+GDR +SALKDMA VLKQ +R +EA+EAI+SFR
Sbjct: 117 YVRAKQAQIVEKDPNKAVPLFWAAINSGDRTESALKDMANVLKQANRAEEAIEAIRSFRD 176
Query: 82 LCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQ 141
C +AQESLDN+L+DLYKKCGR DEQIE+L KLR++ A TK ++SHG+
Sbjct: 177 RCPYEAQESLDNILLDLYKKCGRTDEQIEMLTIKLRIVDEELASGRWKTKMSKSHGRVVY 236
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEA 201
++++ E +R+LGNL WAYMQ NY AE++YR+A I+ D NK CNL+ CL+K + EA
Sbjct: 237 LSLRDEKARLLGNLAWAYMQSENYEGAEMLYRQALAIEADYNKECNLAICLMKTGKVAEA 296
Query: 202 RSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAFL----EGL 257
+ +++ + + K+++R E+L+ELE QS+P +T KS FL E L
Sbjct: 297 KYLIQAIPY-NCDDESHVKSLSRATEMLRELE-LQSLPSPITQAKSKESQIFLADDVEML 354
Query: 258 DDLMNQ 263
DL Q
Sbjct: 355 VDLQPQ 360
>gi|125560036|gb|EAZ05484.1| hypothetical protein OsI_27700 [Oryza sativa Indica Group]
Length = 813
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 112/215 (52%), Positives = 154/215 (71%), Gaps = 1/215 (0%)
Query: 19 DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS 78
DSPYVRAK Q++EKDP A+ LFW AIN+GDR++SALKDMA VLKQ +R +EA+EAI+S
Sbjct: 85 DSPYVRAKQAQVIEKDPNKAVPLFWAAINSGDRIESALKDMATVLKQANRAEEAIEAIRS 144
Query: 79 FRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGK 138
FR C +AQESLDN+L+DLYKKCGR EQIE+L KLR++ A TK ++SHG+
Sbjct: 145 FRDRCPNEAQESLDNILLDLYKKCGRTKEQIEMLTLKLRIVDEELASGRWKTKLSKSHGR 204
Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARY 198
++++ E +R+LGNL WA+MQ NY AE++YR+A I+ D NK CNL+ CLIK +
Sbjct: 205 VVYLSLRDEKARLLGNLAWAHMQSENYDEAEMLYRQALAIEADYNKECNLAICLIKTGKV 264
Query: 199 TEARSVLEDVLLGKLSGSTETKTINRVKELLQELE 233
EA+ +L+ + S + +++ R +E+L ELE
Sbjct: 265 AEAKYLLQSI-PDNCSDESHVRSLARAREMLMELE 298
>gi|29467532|dbj|BAC66721.1| putative pollenless3 [Oryza sativa Japonica Group]
Length = 815
Score = 233 bits (595), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 112/215 (52%), Positives = 154/215 (71%), Gaps = 1/215 (0%)
Query: 19 DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS 78
DSPYVRAK Q++EKDP A+ LFW AIN+GDR++SALKDMA VLKQ +R +EA+EAI+S
Sbjct: 85 DSPYVRAKQAQVIEKDPNKAVPLFWAAINSGDRIESALKDMATVLKQANRAEEAIEAIRS 144
Query: 79 FRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGK 138
FR C +AQESLDN+L+DLYKKCGR EQIE+L KLR++ A TK ++SHG+
Sbjct: 145 FRDRCPNEAQESLDNILLDLYKKCGRTKEQIEMLTLKLRIVDEELASGRWKTKLSKSHGR 204
Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARY 198
++++ E +R+LGNL WA+MQ NY AE++YR+A I+ D NK CNL+ CLIK +
Sbjct: 205 VVYLSLRDEKARLLGNLAWAHMQSENYDEAEMLYRQALAIEADYNKECNLAICLIKTGKV 264
Query: 199 TEARSVLEDVLLGKLSGSTETKTINRVKELLQELE 233
EA+ +L+ + S + +++ R +E+L ELE
Sbjct: 265 AEAKYLLQSI-PDNCSDESHVRSLARAREMLMELE 298
>gi|297607922|ref|NP_001060921.2| Os08g0130300 [Oryza sativa Japonica Group]
gi|255678123|dbj|BAF22835.2| Os08g0130300 [Oryza sativa Japonica Group]
Length = 769
Score = 233 bits (595), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 112/215 (52%), Positives = 154/215 (71%), Gaps = 1/215 (0%)
Query: 19 DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS 78
DSPYVRAK Q++EKDP A+ LFW AIN+GDR++SALKDMA VLKQ +R +EA+EAI+S
Sbjct: 85 DSPYVRAKQAQVIEKDPNKAVPLFWAAINSGDRIESALKDMATVLKQANRAEEAIEAIRS 144
Query: 79 FRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGK 138
FR C +AQESLDN+L+DLYKKCGR EQIE+L KLR++ A TK ++SHG+
Sbjct: 145 FRDRCPNEAQESLDNILLDLYKKCGRTKEQIEMLTLKLRIVDEELASGRWKTKLSKSHGR 204
Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARY 198
++++ E +R+LGNL WA+MQ NY AE++YR+A I+ D NK CNL+ CLIK +
Sbjct: 205 VVYLSLRDEKARLLGNLAWAHMQSENYDEAEMLYRQALAIEADYNKECNLAICLIKTGKV 264
Query: 199 TEARSVLEDVLLGKLSGSTETKTINRVKELLQELE 233
EA+ +L+ + S + +++ R +E+L ELE
Sbjct: 265 AEAKYLLQSI-PDNCSDESHVRSLARAREMLMELE 298
>gi|222639859|gb|EEE67991.1| hypothetical protein OsJ_25931 [Oryza sativa Japonica Group]
Length = 1338
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 112/215 (52%), Positives = 154/215 (71%), Gaps = 1/215 (0%)
Query: 19 DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS 78
DSPYVRAK Q++EKDP A+ LFW AIN+GDR++SALKDMA VLKQ +R +EA+EAI+S
Sbjct: 610 DSPYVRAKQAQVIEKDPNKAVPLFWAAINSGDRIESALKDMATVLKQANRAEEAIEAIRS 669
Query: 79 FRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGK 138
FR C +AQESLDN+L+DLYKKCGR EQIE+L KLR++ A TK ++SHG+
Sbjct: 670 FRDRCPNEAQESLDNILLDLYKKCGRTKEQIEMLTLKLRIVDEELASGRWKTKLSKSHGR 729
Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARY 198
++++ E +R+LGNL WA+MQ NY AE++YR+A I+ D NK CNL+ CLIK +
Sbjct: 730 VVYLSLRDEKARLLGNLAWAHMQSENYDEAEMLYRQALAIEADYNKECNLAICLIKTGKV 789
Query: 199 TEARSVLEDVLLGKLSGSTETKTINRVKELLQELE 233
EA+ +L+ + S + +++ R +E+L ELE
Sbjct: 790 AEAKYLLQSI-PDNCSDESHVRSLARAREMLMELE 823
>gi|326522530|dbj|BAK07727.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 641
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 115/229 (50%), Positives = 157/229 (68%), Gaps = 2/229 (0%)
Query: 19 DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS 78
DSPY+RAK QL+EKDP A+ LFW AIN+G+R++SALKDMA VLKQ +R +EA+EAI+S
Sbjct: 72 DSPYLRAKQAQLIEKDPNKAVPLFWAAINSGERIESALKDMATVLKQANRAEEAIEAIRS 131
Query: 79 FRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGK 138
FR C +AQESLDN+L+DLYKKCGR EQIE+L KLRM+ A K ++SHG+
Sbjct: 132 FRDRCPNEAQESLDNILLDLYKKCGRTKEQIEMLTVKLRMVDEDLASGRWKAKLSKSHGR 191
Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARY 198
+ ++ E +R+LGNL WAYMQ NY AE++YR+A ++ D NK CNL CL+K +
Sbjct: 192 VVYLYLRDEKARLLGNLAWAYMQSENYEEAEMLYRQALAMEADYNKECNLGICLMKMGKV 251
Query: 199 TEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKS 247
EA+ +L+ + + K+ R E+L+ELE Q++P +T KS
Sbjct: 252 AEAKYLLQPIPY-NCNDENHVKSFARATEMLRELES-QALPSPVTQMKS 298
>gi|449488572|ref|XP_004158092.1| PREDICTED: uncharacterized protein LOC101226064 [Cucumis sativus]
Length = 719
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 148/207 (71%), Gaps = 3/207 (1%)
Query: 30 LVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE 89
L++KDP A+ LFW AINAGDRVDSALKDMAVV+KQ DR DEA+EAIKSFRHLC +QE
Sbjct: 9 LIDKDPNRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSDEAIEAIKSFRHLCPYDSQE 68
Query: 90 SLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETS 149
S+DNVLI+LYK+ GR++E+I++L+ KL+ I G F GK TK ARS GKK Q+TV+QE S
Sbjct: 69 SIDNVLIELYKRSGRIEEEIDMLQCKLKQIEDGTVFGGKRTKAARSQGKKVQITVEQEKS 128
Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
R+LGNL WA++Q N AE YRKA ++ D NK CNL+ CLI R TEA+S+L+ V
Sbjct: 129 RVLGNLAWAFLQLDNIYIAEEYYRKALSLESDNNKKCNLAICLILTNRLTEAKSLLQSVR 188
Query: 210 L---GKLSGSTETKTINRVKELLQELE 233
GK + K+ R +L E E
Sbjct: 189 ASSGGKPMEESYAKSFERASHMLAEKE 215
>gi|413941821|gb|AFW74470.1| hypothetical protein ZEAMMB73_887441 [Zea mays]
Length = 690
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 120/246 (48%), Positives = 166/246 (67%), Gaps = 6/246 (2%)
Query: 22 YVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRH 81
YVRAK Q+VEKDP A+ LFW AIN+GDR++SALKDMA VLKQ +R +EA+EAI+SFR
Sbjct: 96 YVRAKQAQIVEKDPNKAVPLFWAAINSGDRIESALKDMANVLKQANRSEEAIEAIRSFRD 155
Query: 82 LCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQ 141
C +AQESLDN+L+DLYKKCGR DEQIE+L KLR++ A TK ++SHG+
Sbjct: 156 RCPYEAQESLDNILLDLYKKCGRTDEQIEMLTLKLRIVDEELASGRWKTKLSKSHGRVVY 215
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEA 201
++++ E +R+LGNL WAYMQ NY AE++YR+A I+ D NK CNL+ CL+K + EA
Sbjct: 216 LSLRDEKARLLGNLAWAYMQSENYEEAEMLYRQALAIEADYNKECNLAICLMKTGKLAEA 275
Query: 202 RSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF----LEGL 257
+ ++ + + + K+++R E+L+E + QS+P +T KS F +E L
Sbjct: 276 KYLIHAIPY-NCNDESHVKSLSRATEMLREFD-LQSLPSPITQAKSKEPRNFVADDVEML 333
Query: 258 DDLMNQ 263
DL Q
Sbjct: 334 VDLQPQ 339
>gi|15241471|ref|NP_199246.1| tetratricopeptide repeat domain-containing protein [Arabidopsis
thaliana]
gi|10176888|dbj|BAB10118.1| male sterility MS5; pollenless3 [Arabidopsis thaliana]
gi|67633856|gb|AAY78852.1| male sterility MS5 family protein [Arabidopsis thaliana]
gi|332007714|gb|AED95097.1| tetratricopeptide repeat domain-containing protein [Arabidopsis
thaliana]
Length = 469
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 117/223 (52%), Positives = 151/223 (67%), Gaps = 3/223 (1%)
Query: 14 KLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV 73
++ GDSPYVRAKH QLV KDP AI LFW AINAGDRVDSALKDM VVLKQ +R DE +
Sbjct: 49 RVRTGDSPYVRAKHAQLVSKDPNRAISLFWAAINAGDRVDSALKDMVVVLKQLNRFDEGI 108
Query: 74 EAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTA 133
EAIKSFR+LC ++Q+S+DN+L++LY K GR+ E ELL+ KLR + + + G+
Sbjct: 109 EAIKSFRYLCPFESQDSIDNLLLELYMKSGRITEVAELLEHKLRTLEQDKHYGGRIKIAK 168
Query: 134 RSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLI 193
RSH ++ T++QE +RILGNL W ++Q NY AE YR A ++PD NK CNL+ CLI
Sbjct: 169 RSHEEQNNKTIEQEKARILGNLAWVHLQLHNYGIAEQYYRNALSLEPDNNKLCNLAICLI 228
Query: 194 KQARYTEARSVLEDV---LLGKLSGSTETKTINRVKELLQELE 233
+ R EA+S+LEDV L + K+ R E+L E E
Sbjct: 229 RMERTHEAKSLLEDVKQSLGNQWKNEPFCKSFERATEMLAERE 271
>gi|116794097|gb|ABK27005.1| unknown [Picea sitchensis]
Length = 248
Score = 220 bits (561), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 108/182 (59%), Positives = 140/182 (76%), Gaps = 8/182 (4%)
Query: 59 MAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM 118
MAVV+ QQ+R EA+EAIKSFRHLCSKQAQE LDNVLIDL KKCGR DEQI LLK KLR+
Sbjct: 1 MAVVMNQQNRPLEAIEAIKSFRHLCSKQAQEPLDNVLIDLLKKCGRFDEQISLLKHKLRL 60
Query: 119 IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI 178
I G AFNGK TKTARSHG+KFQV++K+ET+R+LGNLG A+MQ+ +Y AE+VYRKAQ++
Sbjct: 61 IQEGAAFNGKSTKTARSHGRKFQVSIKKETTRLLGNLGLAFMQQHDYCYAEIVYRKAQVL 120
Query: 179 DPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGST--------ETKTINRVKELLQ 230
+PD NK CNLS CL++Q + EA +L+ +L + ++ +++RV+ LL+
Sbjct: 121 EPDDNKVCNLSVCLMRQGKVEEAMGLLQGLLNNNKNSMNYKHNNSRGKSNSLDRVEALLK 180
Query: 231 EL 232
E+
Sbjct: 181 EI 182
>gi|168001280|ref|XP_001753343.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695629|gb|EDQ81972.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 608
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 106/233 (45%), Positives = 139/233 (59%), Gaps = 56/233 (24%)
Query: 5 RDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLK 64
R+ +H+VH +P G+SPYV+AKHVQLV+KDP+ AI LFW+AIN+GDRVDSALKDMA+V+K
Sbjct: 91 RNDAFHIVHNVPMGNSPYVKAKHVQLVDKDPDRAIALFWEAINSGDRVDSALKDMAIVMK 150
Query: 65 QQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEA 124
Q+ R EA+E IKS R CS +AQESLDNVL+DLYK
Sbjct: 151 QKSRPHEAIEVIKSLRSRCSDRAQESLDNVLLDLYK------------------------ 186
Query: 125 FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
GNLGWAYMQ+ + +AE VYRKA LI+ D NK
Sbjct: 187 ----------------------------GNLGWAYMQQSKFLAAEAVYRKALLIESDVNK 218
Query: 185 ACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKELLQELE 233
CNL CL+KQ R EA+S+L V+L + + + K+ R +E++++LE
Sbjct: 219 VCNLGICLMKQGRLEEAKSILRSVILPCNDRRWTSDSHLKSFERAQEMMEKLE 271
>gi|326515788|dbj|BAK07140.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 423
Score = 200 bits (509), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 98/165 (59%), Positives = 121/165 (73%), Gaps = 4/165 (2%)
Query: 73 VEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKT 132
EAIKS R CS QAQESLDN+L+DLYK+CGRLD+QI LLK KL++I+ G AFNGK TKT
Sbjct: 1 TEAIKSLRSRCSDQAQESLDNILLDLYKRCGRLDDQISLLKHKLQLIHQGHAFNGKRTKT 60
Query: 133 ARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCL 192
ARS G+KFQVT++QE +R+LGNLGWA MQK NYT AE YR+A +I PD NK CNL CL
Sbjct: 61 ARSQGRKFQVTLEQEATRLLGNLGWALMQKENYTEAEGAYRRALVIGPDNNKMCNLGICL 120
Query: 193 IKQARYTEARSVLEDVLLGKLSG----STETKTINRVKELLQELE 233
+KQ R EA+ VL+ V + G + K R +E+L++LE
Sbjct: 121 MKQGRVLEAKDVLKQVRPAGVDGLRGADSHLKAYERAQEMLRDLE 165
>gi|168005732|ref|XP_001755564.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693271|gb|EDQ79624.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 141
Score = 197 bits (501), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 93/144 (64%), Positives = 114/144 (79%), Gaps = 3/144 (2%)
Query: 30 LVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE 89
LV+KDP+ AI LFW A+N GDRVDSALKDMA+V+KQQ+R EA+EAIKS R+ CS +AQE
Sbjct: 1 LVDKDPDRAISLFWTAVNVGDRVDSALKDMAIVMKQQNRPQEAIEAIKSLRNRCSDEAQE 60
Query: 90 SLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETS 149
SLDNVL+ LYK+CGRLD+QI++L KL +++ G K +KTARS GKKFQV + QE S
Sbjct: 61 SLDNVLLGLYKRCGRLDDQIDVLTHKLHLVHQGIV---KRSKTARSQGKKFQVLISQEAS 117
Query: 150 RILGNLGWAYMQKGNYTSAEVVYR 173
ILGNLGWAYMQ+ N+ AE YR
Sbjct: 118 SILGNLGWAYMQQSNFVGAEAAYR 141
>gi|224171113|ref|XP_002339460.1| predicted protein [Populus trichocarpa]
gi|222875155|gb|EEF12286.1| predicted protein [Populus trichocarpa]
Length = 130
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 76/131 (58%), Positives = 97/131 (74%), Gaps = 4/131 (3%)
Query: 160 MQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTET 219
MQK N+ +AEVVY+KAQ+IDPDANKACNL+ CLIKQARY EARSVL DV G+L GS +
Sbjct: 1 MQKPNFIAAEVVYQKAQMIDPDANKACNLALCLIKQARYDEARSVLLDVSQGRLPGSGDV 60
Query: 220 KTINRVKELLQELEPWQSIPPSLTTKKS---SLEDAFLEGLDDLMNQWTPYRSRRLPIFE 276
K+ +R +ELL E++ ++ P LT +D F++GL+ LM++W P RS+RLPIFE
Sbjct: 61 KSRSRAEELLMEVKSRET-PDELTDMLGFNLDEDDDFVKGLEKLMSEWAPSRSKRLPIFE 119
Query: 277 EISPFRDQLAC 287
EIS FRD L C
Sbjct: 120 EISSFRDPLTC 130
>gi|3859114|gb|AAC72542.1| MS5-like protein [Arabidopsis thaliana]
Length = 307
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 64/125 (51%), Positives = 80/125 (64%), Gaps = 4/125 (3%)
Query: 112 LKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVV 171
LK KL +I G AFNGK TKTARS GKKFQV+V+QE +R+LGNLGWA MQ+ N+ AE
Sbjct: 1 LKHKLFLIQKGLAFNGKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQRDNFVEAEDA 60
Query: 172 YRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG----STETKTINRVKE 227
YR+A I PD NK CNL CL+KQ R EA+ L V + G + K R ++
Sbjct: 61 YRRALSIAPDNNKMCNLGICLMKQGRIDEAKETLRRVKPAVVDGPRGVDSHLKAYERAQQ 120
Query: 228 LLQEL 232
+L +L
Sbjct: 121 MLNDL 125
>gi|148657770|ref|YP_001277975.1| cold-shock protein, DNA-binding [Roseiflexus sp. RS-1]
gi|148569880|gb|ABQ92025.1| Cold-shock protein, DNA-binding [Roseiflexus sp. RS-1]
Length = 1555
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 75/204 (36%), Positives = 113/204 (55%), Gaps = 28/204 (13%)
Query: 9 YHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDR 68
Y V +P G +PY RAK VQLVEKD E A L AI GD V+SA+KD+A +L Q R
Sbjct: 477 YARVSGIPKGSNPYARAKRVQLVEKDLERAAQLLRDAIRQGDNVESAVKDLAALLVQLGR 536
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
DEA++ ++ R S Q+S+DN+LI+ Y+ G+ D+ I LL ++L+
Sbjct: 537 PDEAIQVLEKNRSRISN--QQSVDNMLINFYQNAGQHDKAISLLHKQLQ----------- 583
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-- 186
Q + + ++IL + Y++K +Y AE +++ + PD NK+
Sbjct: 584 ------------QANTETKKAQILWQIAIGYLRKEDYVRAEQTFQELRRAQPD-NKSIQR 630
Query: 187 NLSHCLIKQARYTEARSVLEDVLL 210
N++ CL KQ RY EAR +L+ +LL
Sbjct: 631 NIALCLFKQERYEEARKILDGILL 654
>gi|222625343|gb|EEE59475.1| hypothetical protein OsJ_11684 [Oryza sativa Japonica Group]
Length = 241
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/162 (38%), Positives = 84/162 (51%), Gaps = 41/162 (25%)
Query: 14 KLPPG--DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDE 71
+ PP DSPYVRAK Q++EKDP A+ LFW AIN+GDR++SALKDMA
Sbjct: 105 RAPPSSVDSPYVRAKQAQVIEKDPNKAVPLFWAAINSGDRIESALKDMAT---------- 154
Query: 72 AVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTK 131
KC R EQIE+L KL + A TK
Sbjct: 155 -----------------------------KCDRTKEQIEMLTLKLIFVDEELASGRWKTK 185
Query: 132 TARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYR 173
++SHG+ ++++ E + +LGNL WA+MQ NY AE++YR
Sbjct: 186 LSKSHGRVVYLSLRDEKAWLLGNLAWAHMQSENYDGAEMLYR 227
>gi|255564367|ref|XP_002523180.1| hypothetical protein RCOM_1336590 [Ricinus communis]
gi|223537587|gb|EEF39211.1| hypothetical protein RCOM_1336590 [Ricinus communis]
Length = 618
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 49/64 (76%), Positives = 54/64 (84%)
Query: 2 SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
S R +HVVHK+P GDSPYVRAKHVQL++KDP AI LFW AINAGDRVDSALKDMAV
Sbjct: 40 SNCRRDLFHVVHKVPSGDSPYVRAKHVQLIDKDPSKAISLFWAAINAGDRVDSALKDMAV 99
Query: 62 VLKQ 65
V+KQ
Sbjct: 100 VMKQ 103
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 58/101 (57%), Gaps = 11/101 (10%)
Query: 138 KKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQAR 197
K V +KQ ILGNL WAY+Q +Y AE YRKA ++PD NK CNL+ CL+ +
Sbjct: 95 KDMAVVMKQ----ILGNLAWAYLQHHDYGLAEQHYRKALSVEPDYNKQCNLALCLMHMNK 150
Query: 198 YTEARSVLEDVLLGKLSGSTE-----TKTINRVKELLQELE 233
EA+S+L+ V GSTE K+ R E+L +LE
Sbjct: 151 IPEAKSLLQAV--SDSCGSTEMDESYAKSFERAVEMLNDLE 189
>gi|238481350|ref|NP_001154733.1| uncharacterized protein [Arabidopsis thaliana]
gi|332005697|gb|AED93080.1| uncharacterized protein [Arabidopsis thaliana]
Length = 237
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/72 (72%), Positives = 61/72 (84%)
Query: 81 HLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKF 140
H+ QAQESL+NVLIDLYKK GR +EQ+ELLK +L MIY EAFNGKP K ARSHG+KF
Sbjct: 66 HISYGQAQESLENVLIDLYKKGGRTEEQVELLKLQLWMIYQEEAFNGKPAKIARSHGRKF 125
Query: 141 QVTVKQETSRIL 152
QVTV++ETSR+L
Sbjct: 126 QVTVEKETSRML 137
>gi|328953610|ref|YP_004370944.1| cold-shock protein DNA-binding protein [Desulfobacca acetoxidans
DSM 11109]
gi|328453934|gb|AEB09763.1| Cold-shock protein DNA-binding protein [Desulfobacca acetoxidans
DSM 11109]
Length = 1568
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/208 (32%), Positives = 109/208 (52%), Gaps = 26/208 (12%)
Query: 3 QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVV 62
Q + + Y V +P G +PY RAK QL+EKD + A+ L +AI GD +SA+KD+A V
Sbjct: 475 QNKWREYTRVTGVPKGSTPYARAKRAQLIEKDLDKAVQLLREAIKRGDNTESAVKDLASV 534
Query: 63 LKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHG 122
L QQ + +EA++ ++S + + S +S+DN+LI Y+ G+ D I LL++KL+
Sbjct: 535 LVQQGKPEEAIKTLESHKQVSSDP--KSIDNLLITSYQNAGQYDHAITLLQKKLK----- 587
Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD- 181
Q + + + L + ++KG+Y +E +RK PD
Sbjct: 588 ------------------QADNEIKRAPYLFQIALMNLKKGDYGQSEQFFRKILTKQPDN 629
Query: 182 ANKACNLSHCLIKQARYTEARSVLEDVL 209
N++ CL KQ Y EA+ +L+ +L
Sbjct: 630 VTVQRNIAICLSKQGHYDEAKKLLQKIL 657
>gi|242095292|ref|XP_002438136.1| hypothetical protein SORBIDRAFT_10g008600 [Sorghum bicolor]
gi|241916359|gb|EER89503.1| hypothetical protein SORBIDRAFT_10g008600 [Sorghum bicolor]
Length = 203
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 78/210 (37%), Positives = 103/210 (49%), Gaps = 49/210 (23%)
Query: 59 MAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM 118
MAVV+KQQD ++EA+EAI+SFR+LCSKQ+QESL N+LIDL+K L QIELLK+ L +
Sbjct: 1 MAVVMKQQDPLEEAIEAIRSFRYLCSKQSQESLGNLLIDLHK----LRVQIELLKKNLEI 56
Query: 119 IYHG---------EAFNGKPTKTARSHG------------------KKFQVTVKQETSRI 151
+ G E KPT G + F T + +I
Sbjct: 57 QWEGDEESTDPWQEVSGLKPTGDVSHSGTICASWFLLMLITYLPIDQVFSCTQLVSSLKI 116
Query: 152 LGNLGWAYMQKGNY-----------TSAEVVYRKAQLIDPDANKACNLSHCLIKQ---AR 197
L Y G + E+VYR AQ I+ DAN+ACNL CLI Q R
Sbjct: 117 LP----VYHVTGQSRQGEHAVQQLPSCTEMVYRNAQTIESDANRACNLGLCLINQRVGKR 172
Query: 198 YTEARSVLEDVLLGKLSGSTETKTINRVKE 227
E R L +LL ++ S K + R ++
Sbjct: 173 RHEKRFQLRRILLYEIRRSMHEKVVARAEQ 202
>gi|50582704|gb|AAT78774.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108709730|gb|ABF97525.1| hypothetical protein LOC_Os03g40840 [Oryza sativa Japonica Group]
Length = 274
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 71/143 (49%), Gaps = 39/143 (27%)
Query: 30 LVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE 89
++EKDP A+ LFW AIN+GDR++SALKDMA
Sbjct: 166 VIEKDPNKAVPLFWAAINSGDRIESALKDMAT---------------------------- 197
Query: 90 SLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETS 149
KC R EQIE+L KL + A TK ++SHG+ ++++ E +
Sbjct: 198 -----------KCDRTKEQIEMLTLKLIFVDEELASGRWKTKLSKSHGRVVYLSLRDEKA 246
Query: 150 RILGNLGWAYMQKGNYTSAEVVY 172
+LGNL WA+MQ NY AE++Y
Sbjct: 247 WLLGNLAWAHMQSENYDGAEMLY 269
>gi|334187848|ref|NP_001154732.2| uncharacterized protein [Arabidopsis thaliana]
gi|332005696|gb|AED93079.1| uncharacterized protein [Arabidopsis thaliana]
Length = 201
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/72 (62%), Positives = 54/72 (75%), Gaps = 7/72 (9%)
Query: 81 HLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKF 140
H+ QAQESL+N K GR +EQ+ELLK +L MIY EAFNGKP K ARSHG+KF
Sbjct: 66 HISYGQAQESLEN-------KGGRTEEQVELLKLQLWMIYQEEAFNGKPAKIARSHGRKF 118
Query: 141 QVTVKQETSRIL 152
QVTV++ETSR+L
Sbjct: 119 QVTVEKETSRML 130
>gi|297808269|ref|XP_002872018.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297317855|gb|EFH48277.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 84
Score = 83.2 bits (204), Expect = 1e-13, Method: Composition-based stats.
Identities = 40/62 (64%), Positives = 50/62 (80%), Gaps = 3/62 (4%)
Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD---ANK 184
KPT TARSHGKKF VTV+++T+RILGN GWAYMQ +YT+A+ VY KAQ+I+P+ AN
Sbjct: 10 KPTNTARSHGKKFMVTVEKKTTRILGNFGWAYMQLRDYTAAKDVYWKAQVIEPEGMLANI 69
Query: 185 AC 186
C
Sbjct: 70 KC 71
>gi|40538969|gb|AAR87226.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 160
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 36/56 (64%), Positives = 44/56 (78%), Gaps = 2/56 (3%)
Query: 14 KLPPG--DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
+ PP DSPYVRAK Q++EKDP A+ LFW AIN+GDR++SALKDMA VLKQ +
Sbjct: 105 RAPPSSVDSPYVRAKQAQVIEKDPNKAVPLFWAAINSGDRIESALKDMATVLKQAN 160
>gi|297837461|ref|XP_002886612.1| hypothetical protein ARALYDRAFT_893498 [Arabidopsis lyrata subsp.
lyrata]
gi|297332453|gb|EFH62871.1| hypothetical protein ARALYDRAFT_893498 [Arabidopsis lyrata subsp.
lyrata]
Length = 123
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 42/84 (50%), Positives = 49/84 (58%), Gaps = 27/84 (32%)
Query: 119 IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRIL-------------------------- 152
+Y GEAFNGKPT T RSHG+KFQVTV++ETSRIL
Sbjct: 17 LYQGEAFNGKPTNTTRSHGRKFQVTVEKETSRILMLIKKCGMKIVNSAKPVEQMKLLINT 76
Query: 153 -GNLGWAYMQKGNYTSAEVVYRKA 175
GN GWAYMQ +YT+A+ VY KA
Sbjct: 77 KGNFGWAYMQLKDYTTAKDVYLKA 100
>gi|218193290|gb|EEC75717.1| hypothetical protein OsI_12552 [Oryza sativa Indica Group]
Length = 132
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 33/48 (68%), Positives = 40/48 (83%)
Query: 19 DSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQ 66
DSPYVRAK Q++EKDP A+ LFW AIN+GDR++SALKDMA VL +Q
Sbjct: 76 DSPYVRAKQAQVIEKDPNKAVPLFWAAINSGDRIESALKDMATVLNKQ 123
>gi|116792779|gb|ABK26494.1| unknown [Picea sitchensis]
Length = 151
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 52/79 (65%), Gaps = 6/79 (7%)
Query: 160 MQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGST-- 217
MQ +Y SAE+VYRKAQ+++PD +KACNLS CLIKQ + EA +L+ VL
Sbjct: 1 MQHDDYCSAEIVYRKAQVLEPDDHKACNLSVCLIKQGKVEEAMGLLQGVLNSNDDDHNNN 60
Query: 218 ----ETKTINRVKELLQEL 232
++K+++R + LL+E+
Sbjct: 61 NSRGKSKSLDRAEALLKEI 79
>gi|222616416|gb|EEE52548.1| hypothetical protein OsJ_34783 [Oryza sativa Japonica Group]
Length = 102
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 34/60 (56%), Positives = 45/60 (75%), Gaps = 1/60 (1%)
Query: 44 KAINAGDRVDSALKDMAVVLKQQDRVDEAVEAI-KSFRHLCSKQAQESLDNVLIDLYKKC 102
KAIN+ D+VDSALKDM VV+KQQD +E +EAI + S++ QESL+N+LID+Y C
Sbjct: 22 KAINSRDKVDSALKDMVVVMKQQDHAEEEIEAISRQIIQARSRKTQESLNNLLIDMYMAC 81
>gi|303290781|ref|XP_003064677.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226453703|gb|EEH51011.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 316
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/105 (40%), Positives = 56/105 (53%), Gaps = 3/105 (2%)
Query: 22 YVRAKHVQLVEKDPEAAIVLFWKAI-NAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFR 80
Y AKH QL KD A+ L I G+ SALKD+ +LKQ R D+AV AI+ +R
Sbjct: 145 YAEAKHAQLTRKDLPTALTLMVAEIERRGECAQSALKDLVCILKQMGRHDDAVRAIERYR 204
Query: 81 HL--CSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGE 123
++ QESLDN+L+DLYK L + +R + GE
Sbjct: 205 GAWPMDERLQESLDNMLLDLYKHSRDLGAFSSITLVPVRPRWRGE 249
>gi|153869950|ref|ZP_01999452.1| Cold-shock protein, DNA-binding [Beggiatoa sp. PS]
gi|152073584|gb|EDN70548.1| Cold-shock protein, DNA-binding [Beggiatoa sp. PS]
Length = 501
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 57/92 (61%), Gaps = 3/92 (3%)
Query: 7 QPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQ 66
Q Y+ LP G +P+ A+ +L EKD A+ L+ +AI D +SA+K++A VL Q
Sbjct: 412 QVYNNPPPLPSGSNPFACAERAKL-EKDLSKAVKLYRRAIKQKDHFESAVKNLASVLAQL 470
Query: 67 DRVDEAVEAIKSFRHLCSKQAQESLDNVLIDL 98
DRV++A+E ++ +R Q+S+DN+L++
Sbjct: 471 DRVEDAIEVLQQYRDEIDN--QKSIDNMLVNF 500
>gi|171702827|dbj|BAG16369.1| male sterility 5 family protein [Brassica rapa var. perviridis]
Length = 77
Score = 56.6 bits (135), Expect = 1e-05, Method: Composition-based stats.
Identities = 26/37 (70%), Positives = 30/37 (81%), Gaps = 1/37 (2%)
Query: 2 SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAA 38
S+ RD P+H+VHK+P GDSPYVRAKH QLV KDP A
Sbjct: 42 SEKRD-PFHIVHKVPSGDSPYVRAKHAQLVSKDPNRA 77
>gi|171702812|dbj|BAG16362.1| male sterility 5 family protein [Brassica oleracea var. italica]
Length = 75
Score = 55.8 bits (133), Expect = 2e-05, Method: Composition-based stats.
Identities = 24/37 (64%), Positives = 28/37 (75%)
Query: 2 SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAA 38
S + P+H+VHK+P GDSPYVRAKH QLV KDP A
Sbjct: 39 SSEKCDPFHIVHKVPSGDSPYVRAKHAQLVSKDPNRA 75
>gi|196232461|ref|ZP_03131314.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
gi|196223533|gb|EDY18050.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
Length = 760
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 80/175 (45%), Gaps = 11/175 (6%)
Query: 44 KAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCG 103
+A+ R +A ++ +V + RVDEA+EA + R L + A + L +L ++ G
Sbjct: 65 RALAIEPRNGAAYSNLGLVYRSLGRVDEAMEAYR--RALQLQPALPEPYHNLANLLRQTG 122
Query: 104 RLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQ-VTVKQETSRILGNLGWAYMQK 162
RL+E + L+Q +R+ HG + + G+K + + QE R+ N AY
Sbjct: 123 RLNEAVGWLQQAIRLRPHGVELHKNLGDVLSAAGRKDEAIAAYQEAIRLNPNFAEAYNNL 182
Query: 163 GNYTSAE-------VVYRKAQLIDPD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
GN E V+ +AQ + PD A NL+ L ++ A + + L
Sbjct: 183 GNILRGERRLSEAITVFGEAQRLLPDSAEIHNNLAAALADDGQFAHADAAYQRAL 237
>gi|238478814|ref|NP_001154416.1| uncharacterized protein [Arabidopsis thaliana]
gi|332194374|gb|AEE32495.1| uncharacterized protein [Arabidopsis thaliana]
Length = 221
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 38/69 (55%), Gaps = 18/69 (26%)
Query: 118 MIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQL 177
MIY EAFNGKP K ARSHG+ F+ +++ LG+ + + Q+
Sbjct: 1 MIYQEEAFNGKPAKIARSHGRNFRSRSRRKP------LGYCRVS------------ETQV 42
Query: 178 IDPDANKAC 186
I+PDANKAC
Sbjct: 43 IEPDANKAC 51
>gi|334183195|ref|NP_001185185.1| uncharacterized protein [Arabidopsis thaliana]
gi|332194375|gb|AEE32496.1| uncharacterized protein [Arabidopsis thaliana]
Length = 211
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 38/69 (55%), Gaps = 18/69 (26%)
Query: 118 MIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQL 177
MIY EAFNGKP K ARSHG+ F+ +++ LG+ + + Q+
Sbjct: 1 MIYQEEAFNGKPAKIARSHGRNFRSRSRRKP------LGYCRVS------------ETQV 42
Query: 178 IDPDANKAC 186
I+PDANKAC
Sbjct: 43 IEPDANKAC 51
>gi|168049632|ref|XP_001777266.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671368|gb|EDQ57921.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 460
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 4/65 (6%)
Query: 173 RKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKEL 228
RKA LI+ D NK CN CL+KQ R EA +VL+ V + + + + K+ R ++L
Sbjct: 16 RKALLIESDDNKVCNFGVCLMKQGRLEEAEAVLQRVTVACSGIRWASDSHLKSYERAQDL 75
Query: 229 LQELE 233
L+ELE
Sbjct: 76 LKELE 80
>gi|357142312|ref|XP_003572530.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC-like [Brachypodium
distachyon]
Length = 1001
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 57/228 (25%), Positives = 93/228 (40%), Gaps = 38/228 (16%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P+ AI+ + +A+ A A ++A + +Q ++D AV +C Q E+ +N
Sbjct: 297 PQDAIMSYQRALQARPDYAMAYGNLATIYYEQGQLDMAVRCYNQ-AIVCDPQFIEAYNN- 354
Query: 95 LIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNG--------KPTKTARSHGKKFQVTVK 145
+ + K GR++E I + L + H +A TA S K ++V
Sbjct: 355 MGNALKDSGRVEEAINCYRSCLALQANHPQALTNLGNIYMEWNLVTTAASFYKA-AISVT 413
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVL 205
S L NL Y Q+GNY A Y + IDP A
Sbjct: 414 SGLSSPLNNLALIYKQQGNYADAITCYTEVLRIDPTA----------------------- 450
Query: 206 EDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
D L+ + G+T K I RV E +Q+ +I P++ ++L A+
Sbjct: 451 ADALVNR--GNT-FKEIGRVNEAIQDYVQAATIRPTMAEAHANLASAY 495
>gi|15222846|ref|NP_175416.1| uncharacterized protein [Arabidopsis thaliana]
gi|332194373|gb|AEE32494.1| uncharacterized protein [Arabidopsis thaliana]
Length = 230
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 38/69 (55%), Gaps = 18/69 (26%)
Query: 118 MIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQL 177
MIY EAFNGKP K ARSHG+ F+ +++ LG+ + + Q+
Sbjct: 1 MIYQEEAFNGKPAKIARSHGRNFRSRSRRKP------LGYCRVS------------ETQV 42
Query: 178 IDPDANKAC 186
I+PDANKAC
Sbjct: 43 IEPDANKAC 51
>gi|168056376|ref|XP_001780196.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668346|gb|EDQ54955.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 214
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 38/65 (58%), Gaps = 4/65 (6%)
Query: 173 RKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKEL 228
RKA I+ D NK CN S CL+KQ R EA +VL+ V + + + K+ R ++L
Sbjct: 26 RKAWSIEFDVNKVCNFSVCLMKQGRLEEAEAVLQRVTVACTGIRWPSDSHLKSYERAQDL 85
Query: 229 LQELE 233
L+ELE
Sbjct: 86 LKELE 90
>gi|198282345|ref|YP_002218666.1| hypothetical protein Lferr_0200 [Acidithiobacillus ferrooxidans
ATCC 53993]
gi|198246866|gb|ACH82459.1| TPR repeat-containing protein [Acidithiobacillus ferrooxidans ATCC
53993]
Length = 505
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 87/184 (47%), Gaps = 16/184 (8%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSK-QAQESLDNVLI 96
AI F KA+ AL + V L +QDR+DEA ++ + L K E+L N+ +
Sbjct: 274 AIGTFRKALTIKPDYLEALMRLGVALGRQDRMDEAETVLR--KALAIKPDYPEALMNLGV 331
Query: 97 DLYKKCGRLDEQIELLKQKLR--------MIYHGEAFN--GKPTKTARSHGKKFQVTVKQ 146
L + GR +E LL+Q L ++ G A + G+ A + ++ + ++
Sbjct: 332 TLGHE-GRWEETETLLRQALTIKPDYPEALMNLGAALSRQGRCLDEAENILRR-TLAIQP 389
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEARSVL 205
+ L NLG ++G + AE + R+A I P+ A+ NL L KQ R EA ++L
Sbjct: 390 DHPDALVNLGATLNKQGRWNEAETILRQAVAIKPEHADALINLGAVLSKQGRLDEAETIL 449
Query: 206 EDVL 209
L
Sbjct: 450 RQAL 453
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 82/182 (45%), Gaps = 24/182 (13%)
Query: 40 VLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQ--ESLDNVLID 97
+ F KA++ AL ++ +L Q RVD EAI +FR + + E+L + +
Sbjct: 242 IAFRKALSIKPAYPEALMNLGAILSDQGRVD---EAIGTFRKALTIKPDYLEALMRLGVA 298
Query: 98 LYKKCGRLDEQIELLKQKLRM------------IYHGEAFNGKPTKTARSHGKKFQVTVK 145
L ++ R+DE +L++ L + + G + T+T +T+K
Sbjct: 299 LGRQ-DRMDEAETVLRKALAIKPDYPEALMNLGVTLGHEGRWEETETLLRQA----LTIK 353
Query: 146 QETSRILGNLGWAYMQKGN-YTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARS 203
+ L NLG A ++G AE + R+ I PD A NL L KQ R+ EA +
Sbjct: 354 PDYPEALMNLGAALSRQGRCLDEAENILRRTLAIQPDHPDALVNLGATLNKQGRWNEAET 413
Query: 204 VL 205
+L
Sbjct: 414 IL 415
>gi|254445366|ref|ZP_05058842.1| tetratricopeptide repeat domain protein [Verrucomicrobiae bacterium
DG1235]
gi|198259674|gb|EDY83982.1| tetratricopeptide repeat domain protein [Verrucomicrobiae bacterium
DG1235]
Length = 463
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 73/165 (44%), Gaps = 17/165 (10%)
Query: 61 VVLKQ---QDRVDEAVE-AIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQK- 115
VV+KQ Q ++D E A+ + A + D + + Y + G L I+ Q
Sbjct: 63 VVVKQVLDQLQLDNNKETALATILRANGPDATAAYDFIAANFYAEKGDLKNAIKYYDQAA 122
Query: 116 ------LRMIYHGEAF---NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYT 166
LR + +G G+ + K + K T+ +G LG Y+ Y
Sbjct: 123 KKFPSFLRAVRNGAIMKVKEGRYEAARKDFTKAINLGAKDTTT--MGLLGLCYVNTEKYF 180
Query: 167 SAEVVYRKAQLIDPDANK-ACNLSHCLIKQARYTEARSVLEDVLL 210
SAE YR+A ++DPD L+ L++Q +Y E +VLE +L+
Sbjct: 181 SAETAYREAIVLDPDVKDWQVGLAKSLLQQRKYEEGIAVLEQILV 225
>gi|47848089|dbj|BAD21873.1| putative O-linked N-acetyl glucosamine transferase [Oryza sativa
Japonica Group]
gi|47848095|dbj|BAD21878.1| putative O-linked N-acetyl glucosamine transferase [Oryza sativa
Japonica Group]
Length = 953
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/224 (22%), Positives = 91/224 (40%), Gaps = 36/224 (16%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI+ + +A+ A A ++A + +Q ++D A+ +C Q E+ +N + +
Sbjct: 301 AIISYQRAVQARPDYAMAYGNLATIYYEQGQLDIAIRCYNQ-AIICDPQFVEAYNN-MGN 358
Query: 98 LYKKCGRLDEQIELLKQKLRMIYH--------GEAFNGKPTKTARSHGKKFQVTVKQETS 149
K GR++E I + L + + G + +A + K ++V S
Sbjct: 359 ALKDAGRVEEAINCYRSCLALQANHPQALTNLGNIYMEWNLISAAASFYKAAISVTSGLS 418
Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
L NL Y Q+GNY A Y + +DP A D L
Sbjct: 419 SPLNNLAVIYKQQGNYADAITCYTEVLRVDPTA-----------------------ADAL 455
Query: 210 LGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
+ + G+T K I RV E +Q+ +I P++ ++L A+
Sbjct: 456 VNR--GNT-FKEIGRVNEAIQDYIQAATIRPTMAEAHANLASAY 496
>gi|218190772|gb|EEC73199.1| hypothetical protein OsI_07263 [Oryza sativa Indica Group]
Length = 1004
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 52/224 (23%), Positives = 91/224 (40%), Gaps = 36/224 (16%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI+ + +A+ A A ++A + +Q ++D A+ +C Q E+ +N + +
Sbjct: 301 AIISYQRAVQARPDYAMAYGNLATIYYEQGQLDIAIRCYNQ-AIICDPQFVEAYNN-MGN 358
Query: 98 LYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTK-------TARSHGKKFQVTVKQETS 149
K GR++E I + L + H +A +A + K ++V S
Sbjct: 359 ALKDAGRVEEAINCYRSCLALQANHPQALTNLGNIYMEWNLISAAASFYKAAISVTSGLS 418
Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
L NL Y Q+GNY A Y + +DP A D L
Sbjct: 419 SPLNNLAVIYKQQGNYADAITCYTEVLRVDPTA-----------------------ADAL 455
Query: 210 LGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
+ + G+T K I RV E +Q+ +I P++ ++L A+
Sbjct: 456 VNR--GNT-FKEIGRVNEAIQDYIQAATIRPTMAEAHANLASAY 496
>gi|224140549|ref|XP_002323645.1| predicted protein [Populus trichocarpa]
gi|222868275|gb|EEF05406.1| predicted protein [Populus trichocarpa]
Length = 682
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 61/229 (26%), Positives = 101/229 (44%), Gaps = 32/229 (13%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDN- 93
P+ A+ LF+K +N G + D M VL R+ A+E +L + E LDN
Sbjct: 291 PKEALDLFFKMLNEGLKPDC--YAMVGVLCSCARLG-ALELGDWASNLIN--GNEFLDNS 345
Query: 94 ----VLIDLYKKCGRLDEQIEL---LKQKLRMIYH----GEAFNGKPTKTARSHGKKFQV 142
LID+Y KCGR+D E+ +++K R++++ G A +G G+ +
Sbjct: 346 VLGTALIDMYAKCGRMDRAWEVFRGMRKKDRVVWNAAISGLAMSGHVKDALGLFGQMEKS 405
Query: 143 TVKQETSRILGNL------GWAYMQKGNYTSAEVVY-------RKAQLIDPDANKAC-NL 188
+K + + +G L G + + S E V+ ++D C +
Sbjct: 406 GIKPDRNTFVGLLCACTHAGLVEEGRRYFNSMECVFTLTPEIEHYGCMVDLLGRAGCLDE 465
Query: 189 SHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQS 237
+H LIK EA +++ LLG +T+ + V + L LEPW S
Sbjct: 466 AHQLIKSMP-MEANAIVWGALLGGCRLHRDTQLVEVVLKKLIALEPWHS 513
>gi|222622881|gb|EEE57013.1| hypothetical protein OsJ_06783 [Oryza sativa Japonica Group]
Length = 1004
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 52/224 (23%), Positives = 90/224 (40%), Gaps = 36/224 (16%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI+ + +A+ A A ++A + +Q ++D A+ +C Q E+ +N + +
Sbjct: 301 AIISYQRAVQARPDYAMAYGNLATIYYEQGQLDIAIRCYNQ-AIICDPQFVEAYNN-MGN 358
Query: 98 LYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTK-------TARSHGKKFQVTVKQETS 149
K GR+ E I + L + H +A +A + K ++V S
Sbjct: 359 ALKDAGRVKEAINCYRSCLALQANHPQALTNLGNIYMEWNLISAAASFYKAAISVTSGLS 418
Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
L NL Y Q+GNY A Y + +DP A D L
Sbjct: 419 SPLNNLAVIYKQQGNYADAITCYTEVLRVDPTA-----------------------ADAL 455
Query: 210 LGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
+ + G+T K I RV E +Q+ +I P++ ++L A+
Sbjct: 456 VNR--GNT-FKEIGRVNEAIQDYIQAATIRPTMAEAHANLASAY 496
>gi|242065108|ref|XP_002453843.1| hypothetical protein SORBIDRAFT_04g019560 [Sorghum bicolor]
gi|241933674|gb|EES06819.1| hypothetical protein SORBIDRAFT_04g019560 [Sorghum bicolor]
Length = 1011
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 52/227 (22%), Positives = 92/227 (40%), Gaps = 36/227 (15%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P+ AI+ + +A+ A A ++A + +Q ++D A+ + Q E+ +N
Sbjct: 307 PQDAIMCYQRALQARPDYAMAYGNLATIYYEQGQLDMAIRCYNQ-AIVYDPQFVEAYNN- 364
Query: 95 LIDLYKKCGRLDEQIELLKQKLRMIYH--------GEAFNGKPTKTARSHGKKFQVTVKQ 146
+ + K GR++E I + L + + G + +A + K ++V
Sbjct: 365 MGNALKDAGRVEEAINCYRSCLALQANHPQALTNLGNIYMEWSMISAAASFYKAAISVTS 424
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
S L NL Y Q+GNY A Y + IDP A
Sbjct: 425 GLSSPLNNLAVIYKQQGNYADAITCYTEVLRIDPTA-----------------------A 461
Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
D L+ + G+T K I RV E +Q+ +I P++ ++L A+
Sbjct: 462 DALVNR--GNT-FKEIGRVNEAIQDYVQAATIRPNMAEAHANLASAY 505
>gi|420251802|ref|ZP_14754961.1| Tfp pilus assembly protein PilF [Burkholderia sp. BT03]
gi|398057091|gb|EJL49069.1| Tfp pilus assembly protein PilF [Burkholderia sp. BT03]
Length = 617
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 75/163 (46%), Gaps = 15/163 (9%)
Query: 55 ALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQ 114
A ++ VVL+Q R EA +A K + A ++ L L + GR+DE E+ +
Sbjct: 158 AYNNLGVVLRQLGRPGEAEQAHKKAIATDPEHAGAHIN--LARLLESLGRVDEA-EVAYR 214
Query: 115 KLRMIY--HGEAFNGKPTKTARSHGK--------KFQVTVKQETSRILGNLGWAYMQKGN 164
+ +Y H E N ++ G+ + + +K + + NLG +
Sbjct: 215 RGLCLYPEHAETHNNL-GAMLQAQGRMPDAETCYRRSLAIKPDFPKACFNLGIVLQTQKR 273
Query: 165 YTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLE 206
Y AE++YR+A + PD +A NL+H L+ ++TE V E
Sbjct: 274 YREAELLYRRALQVKPDMLEARLNLAHLLLATGQFTEGWQVFE 316
>gi|168017832|ref|XP_001761451.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687457|gb|EDQ73840.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 538
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 38/65 (58%), Gaps = 4/65 (6%)
Query: 173 RKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKEL 228
++A I+ D NK CN CL+KQ R EA +VL+ V + + + + K+ R ++L
Sbjct: 87 KRALSIESDVNKVCNFGVCLMKQGRLEEAEAVLQRVTVAYSGIRWASDSHLKSYERAQDL 146
Query: 229 LQELE 233
L+ELE
Sbjct: 147 LKELE 151
>gi|425437156|ref|ZP_18817582.1| Similar to tr|Q7NLR0|Q7NLR0 [Microcystis aeruginosa PCC 9432]
gi|389677907|emb|CCH93194.1| Similar to tr|Q7NLR0|Q7NLR0 [Microcystis aeruginosa PCC 9432]
Length = 776
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 73/167 (43%), Gaps = 35/167 (20%)
Query: 59 MAVVLKQQDRVDEA----VEAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGRLDEQIEL 111
MA + + Q R EA EA+ F+ SL+N+ LY+ GR E L
Sbjct: 1 MAFLYQYQGRYSEAEPLYKEALFIFKQQLGNNHPSTAASLNNLAF-LYRIQGRYSEAEPL 59
Query: 112 LKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVV 171
LKQ L + + + +H +T++ L NL Y+ +G Y+ AE +
Sbjct: 60 LKQSLAI---------RKQQLGDNH---------PDTAQSLNNLAELYLSQGRYSEAEPL 101
Query: 172 Y------RKAQLID--PD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
Y RK QL D PD A NL+ + Q RY+EA +L+ L
Sbjct: 102 YKQALTIRKQQLGDNHPDVAQSLNNLAELYLSQGRYSEAEPLLKQSL 148
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 69/171 (40%), Gaps = 35/171 (20%)
Query: 55 ALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQ-------AQESLDNVLIDLYKKCGRLDE 107
+L ++A + Q R EA K + +Q +SL+N L +LY GR E
Sbjct: 81 SLNNLAELYLSQGRYSEAEPLYKQALTIRKQQLGDNHPDVAQSLNN-LAELYLSQGRYSE 139
Query: 108 QIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTS 167
LLKQ L + + + +H +T++ L NL Y +G Y+
Sbjct: 140 AEPLLKQSLAI---------RKQQLGDNH---------PDTAQSLNNLAGLYQSQGRYSE 181
Query: 168 AEVVYRKAQLI--------DPD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
AE +Y++A I PD A NL+ Q +Y EA + L
Sbjct: 182 AEPLYKEALFIFKQQLGNNHPDVAQTLNNLATLYWNQGKYPEADELFSQGL 232
>gi|413936917|gb|AFW71468.1| hypothetical protein ZEAMMB73_864118 [Zea mays]
Length = 517
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 52/227 (22%), Positives = 92/227 (40%), Gaps = 36/227 (15%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P+ AI+ + +A+ A A ++A + +Q ++D A+ + Q E+ +N
Sbjct: 2 PQDAIMCYQRALQARPDYAMAYGNLATIYYEQGQLDMAIRCYNQ-AIVYDPQFIEAYNN- 59
Query: 95 LIDLYKKCGRLDEQIELLKQKLRMIYH--------GEAFNGKPTKTARSHGKKFQVTVKQ 146
+ + K GR++E I + L + + G + +A + K ++V
Sbjct: 60 MGNALKDAGRVEEAINCYRSCLALQANHPQALTNLGNIYMEWNMISAATSFYKAAISVTS 119
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
S L NL Y Q+GNY A Y + IDP A
Sbjct: 120 GLSSPLNNLAVIYKQQGNYADAITCYTEVLRIDPTA-----------------------A 156
Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
D L+ + G+T K I RV E +Q+ +I P++ ++L A+
Sbjct: 157 DALVNR--GNT-FKEIGRVNEAIQDYVQAATIRPNMPEAHANLASAY 200
>gi|434398683|ref|YP_007132687.1| Glycosyltransferase AER61, uncharacterized [Stanieria cyanosphaera
PCC 7437]
gi|428269780|gb|AFZ35721.1| Glycosyltransferase AER61, uncharacterized [Stanieria cyanosphaera
PCC 7437]
Length = 900
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 43/171 (25%), Positives = 78/171 (45%), Gaps = 22/171 (12%)
Query: 34 DPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVE-----------AIKSFRHL 82
D + AI + + I + ++ V+ Q+ ++ EA+E IK++ +L
Sbjct: 120 DWQGAIAAYRRTIALKPGYAAVYFNLGVLYDQRTQLTEAIENYYQAIKLQPNYIKAYSNL 179
Query: 83 CSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQV 142
S A++ N I++Y++ +LD L L ++ FN +P + S F+
Sbjct: 180 GSIFAKQKKLNAAIEIYQQGLKLDPTWGTLHNNLGQVF---WFNEQPDRALAS----FET 232
Query: 143 TVKQETSRILG--NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHC 191
V E + L NLG + Q+GN+ +A Y+K +I+ + N SHC
Sbjct: 233 AVIVEPNMALAHHNLGKLWQQQGNFNNAIAHYQK--VIELEPNNIWAYSHC 281
>gi|224123642|ref|XP_002319130.1| predicted protein [Populus trichocarpa]
gi|222857506|gb|EEE95053.1| predicted protein [Populus trichocarpa]
Length = 923
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 51/227 (22%), Positives = 88/227 (38%), Gaps = 36/227 (15%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P+ AIV + +A+ A + A ++A ++ ++D A+ K C ++ E+ +N
Sbjct: 222 PQEAIVCYQQAVQARPKYAMAFGNLASTYYERGQLDLAILHYKQ-AIACDQRFLEAYNN- 279
Query: 95 LIDLYKKCGRLDEQIELLKQKLRM--------IYHGEAFNGKPTKTARSHGKKFQVTVKQ 146
L + K GR+DE I+ Q L + G + A + K + V
Sbjct: 280 LGNALKDVGRVDEAIQCYNQCLSLQPNHPQALTNLGNIYMEWNMSAAAASCYKATLAVTT 339
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
S NL Y Q+GNY+ A Y + I+P A
Sbjct: 340 GLSAPFSNLAVIYKQQGNYSDAISCYNEVLRIEPLAAD---------------------- 377
Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
G ++ K I RV E +Q+ +I P++ ++L A+
Sbjct: 378 ----GLVNRGNTYKEIGRVSEAIQDYINAITIRPNMAEAHANLASAY 420
>gi|430747186|ref|YP_007206315.1| O-linked N-acetylglucosamine transferase, SPINDLY family
[Singulisphaera acidiphila DSM 18658]
gi|430018906|gb|AGA30620.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Singulisphaera acidiphila DSM 18658]
Length = 827
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 80/179 (44%), Gaps = 15/179 (8%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E A+V F AI A +M+VVLKQ R+DEA+ + + L A N L
Sbjct: 191 EEAVVRFQAAIRLRPEFAGAYSNMSVVLKQLGRLDEAITSGREAVRLDPGFA--GAHNNL 248
Query: 96 IDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGKKFQ---------VTVK 145
+ +K G DE + LR+ EA+ + +R +F+ +T+
Sbjct: 249 GVILEKEGGWDEATTCFHEALRLDPRFVEAYYNLGSVLSRL--GRFEDAESICRQAITLD 306
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEARS 203
+++ NL +A ++G T AE YR+A + P+ + NL+ L K + EA +
Sbjct: 307 PDSAEAHHNLAFALSERGQLTEAETNYRRAIQLKPEFVDPYVNLTSVLGKFGKLDEAEA 365
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 46/176 (26%), Positives = 82/176 (46%), Gaps = 13/176 (7%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AIV + + + L D+ ++L+ +++EAV ++ L + A + N+ +
Sbjct: 159 AIVALEQVLRIKPELPETLNDLGLLLEMTGQLEEAVVRFQAAIRLRPEFAG-AYSNMSVV 217
Query: 98 LYKKCGRLDEQIELLKQKLRM--IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI---- 151
L K+ GRLDE I ++ +R+ + G A N + G T E R+
Sbjct: 218 L-KQLGRLDEAITSGREAVRLDPGFAG-AHNNLGVILEKEGGWDEATTCFHEALRLDPRF 275
Query: 152 ---LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARS 203
NLG + G + AE + R+A +DPD+ +A NL+ L ++ + TEA +
Sbjct: 276 VEAYYNLGSVLSRLGRFEDAESICRQAITLDPDSAEAHHNLAFALSERGQLTEAET 331
>gi|409993902|ref|ZP_11277028.1| hypothetical protein APPUASWS_22358 [Arthrospira platensis str.
Paraca]
gi|409935250|gb|EKN76788.1| hypothetical protein APPUASWS_22358 [Arthrospira platensis str.
Paraca]
Length = 917
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 82/174 (47%), Gaps = 21/174 (12%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AI + KAI ++ A DMA+ L ++ ++ A++ ++ R C + + N L
Sbjct: 123 EGAIGCYQKAIALNPKLGEAYLDMALRLNERGDINTAIKVLQEGRINCPNFKE--IFNTL 180
Query: 96 IDLYKKCGRLDEQIELLKQKLR------MIYH--GEAF--NGKPTKTARSHGKKFQVTVK 145
L + ++DE I + ++ L ++Y+ G AF GK ++ ++ K +++K
Sbjct: 181 GYLQLQQNQIDEAIAIFQEALSIDPTEPLVYNNLGWAFARQGKLSEAIAAYHKA--ISLK 238
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-------DANKACNLSHCL 192
+ + NLG + K N+ A ++KA I+P D +C + CL
Sbjct: 239 PDLAIAYSNLGKLWQHKNNHRQAITYFQKAIAIEPDNIMFYSDCGNSCLIIGCL 292
>gi|293334567|ref|NP_001167956.1| uncharacterized protein LOC100381672 [Zea mays]
gi|223945123|gb|ACN26645.1| unknown [Zea mays]
Length = 1009
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 52/227 (22%), Positives = 92/227 (40%), Gaps = 36/227 (15%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P+ AI+ + +A+ A A ++A + +Q ++D A+ + Q E+ +N
Sbjct: 305 PQDAIMCYQRALQARPDYAMAYGNLATIYYEQGQLDMAIRCYNQ-AIVYDPQFIEAYNN- 362
Query: 95 LIDLYKKCGRLDEQIELLKQKLRMIYH--------GEAFNGKPTKTARSHGKKFQVTVKQ 146
+ + K GR++E I + L + + G + +A + K ++V
Sbjct: 363 MGNALKDAGRVEEAINCYRSCLALQANHPQALTNLGNIYMEWNMISAATSFYKAAISVTS 422
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
S L NL Y Q+GNY A Y + IDP A
Sbjct: 423 GLSSPLNNLAVIYKQQGNYADAITCYTEVLRIDPTA-----------------------A 459
Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
D L+ + G+T K I RV E +Q+ +I P++ ++L A+
Sbjct: 460 DALVNR--GNT-FKEIGRVNEAIQDYVQAATIRPNMPEAHANLASAY 503
>gi|413936918|gb|AFW71469.1| hypothetical protein ZEAMMB73_864118 [Zea mays]
Length = 706
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 52/227 (22%), Positives = 92/227 (40%), Gaps = 36/227 (15%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P+ AI+ + +A+ A A ++A + +Q ++D A+ + Q E+ +N
Sbjct: 2 PQDAIMCYQRALQARPDYAMAYGNLATIYYEQGQLDMAIRCYNQ-AIVYDPQFIEAYNN- 59
Query: 95 LIDLYKKCGRLDEQIELLKQKLRMIYH--------GEAFNGKPTKTARSHGKKFQVTVKQ 146
+ + K GR++E I + L + + G + +A + K ++V
Sbjct: 60 MGNALKDAGRVEEAINCYRSCLALQANHPQALTNLGNIYMEWNMISAATSFYKAAISVTS 119
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
S L NL Y Q+GNY A Y + IDP A
Sbjct: 120 GLSSPLNNLAVIYKQQGNYADAITCYTEVLRIDPTA-----------------------A 156
Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
D L+ + G+T K I RV E +Q+ +I P++ ++L A+
Sbjct: 157 DALVNR--GNT-FKEIGRVNEAIQDYVQAATIRPNMPEAHANLASAY 200
>gi|291569288|dbj|BAI91560.1| TPR domain protein [Arthrospira platensis NIES-39]
Length = 917
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 82/174 (47%), Gaps = 21/174 (12%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AI + KAI ++ A DMA+ L ++ ++ A++ ++ R C + + N L
Sbjct: 123 EGAIGCYQKAIALNPKLGEAYLDMALRLNERGDINTAIKVLQQGRINCPNFKE--IFNTL 180
Query: 96 IDLYKKCGRLDEQIELLKQKLR------MIYH--GEAF--NGKPTKTARSHGKKFQVTVK 145
L + ++DE I + ++ L ++Y+ G AF GK ++ ++ K +++K
Sbjct: 181 GYLQLQQNQIDEAIAIFQEALSIDPTEPLVYNNLGWAFARQGKLSEAIAAYHKA--ISLK 238
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-------DANKACNLSHCL 192
+ + NLG + K N+ A ++KA I+P D +C + CL
Sbjct: 239 PDLAIAYSNLGKLWQHKNNHRQAITYFQKAIAIEPDNIMFYSDCGNSCLIIGCL 292
>gi|294952719|ref|XP_002787430.1| hypothetical protein Pmar_PMAR028692 [Perkinsus marinus ATCC
50983]
gi|239902402|gb|EER19226.1| hypothetical protein Pmar_PMAR028692 [Perkinsus marinus ATCC
50983]
Length = 293
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 40/71 (56%), Gaps = 1/71 (1%)
Query: 15 LPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVE 74
+P DS Y +AK ++ + D + A F +AI GDRVDS+LKD+A +L V AV
Sbjct: 25 MPSYDSWYGQAKRLEF-DGDLKGAFSAFMEAIARGDRVDSSLKDVAGLLNMVGHVHAAVR 83
Query: 75 AIKSFRHLCSK 85
++ R L S
Sbjct: 84 FLEDHRGLASN 94
>gi|375011677|ref|YP_004988665.1| cytochrome c biogenesis factor [Owenweeksia hongkongensis DSM
17368]
gi|359347601|gb|AEV32020.1| cytochrome c biogenesis factor [Owenweeksia hongkongensis DSM
17368]
Length = 657
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 50/109 (45%), Gaps = 29/109 (26%)
Query: 103 GRLDEQIELLKQKLR-MIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQ 161
G+LDE + L K+ L+ H EA N NLG+ Y+Q
Sbjct: 556 GQLDEAVTLFKKLLKEQPQHAEALN---------------------------NLGYTYLQ 588
Query: 162 KGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
GN T A + + KA +DPD +A N + LI Q + EA+ L++VL
Sbjct: 589 MGNITLAGLNFDKALAVDPDYLQAWLNKASLLIMQENFAEAKKALKEVL 637
>gi|344198703|ref|YP_004783029.1| hypothetical protein Acife_0495 [Acidithiobacillus ferrivorans SS3]
gi|343774147|gb|AEM46703.1| Tetratricopeptide TPR_2 repeat-containing protein
[Acidithiobacillus ferrivorans SS3]
Length = 545
Score = 45.1 bits (105), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 57/220 (25%), Positives = 95/220 (43%), Gaps = 24/220 (10%)
Query: 2 SQTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV 61
S+ D V+ +P + + +K A + F KA++ AL ++ V
Sbjct: 244 SEAEDAIRKVLSLMPDNAESIFVLGAISVGKKCLSEAEMAFRKALSIKPAYPEALMNLGV 303
Query: 62 VLKQQDRVDEAVEAIKSFRH--LCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM- 118
+L Q R+D EAI +FR + +E+L + + L + GR+DE L++ L +
Sbjct: 304 MLSDQGRMD---EAIGTFRKALIIKPGYREALMRLGVALGYQ-GRMDEAESTLRKALAIK 359
Query: 119 -----------IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN-YT 166
+ G + T+T + +K + L NLG A ++G
Sbjct: 360 SDYPEAIMNLGVILGHEGRWEETETLLRQA----LAIKPDYPEALMNLGAALSKQGRCLD 415
Query: 167 SAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEARSVL 205
AE + R+A I PD A+ NL L KQ R+ EA ++L
Sbjct: 416 EAETILRQALAIQPDHADALVNLGATLNKQGRWDEAETIL 455
Score = 40.4 bits (93), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 48/211 (22%), Positives = 80/211 (37%), Gaps = 42/211 (19%)
Query: 41 LFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYK 100
+ + A+N + +AL +++V+L Q+R EA +AI+ L A+ I + K
Sbjct: 215 VLYDALNINENNVAALHNLSVLLLYQNRYSEAEDAIRKVLSLMPDNAESIFVLGAISVGK 274
Query: 101 KC--------------------------------GRLDEQIELLKQKL--------RMIY 120
KC GR+DE I ++ L ++
Sbjct: 275 KCLSEAEMAFRKALSIKPAYPEALMNLGVMLSDQGRMDEAIGTFRKALIIKPGYREALMR 334
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G A + + + +K + + NLG +G + E + R+A I P
Sbjct: 335 LGVALGYQGRMDEAESTLRKALAIKSDYPEAIMNLGVILGHEGRWEETETLLRQALAIKP 394
Query: 181 DANKAC-NLSHCLIKQAR-YTEARSVLEDVL 209
D +A NL L KQ R EA ++L L
Sbjct: 395 DYPEALMNLGAALSKQGRCLDEAETILRQAL 425
>gi|224104413|ref|XP_002313429.1| predicted protein [Populus trichocarpa]
gi|222849837|gb|EEE87384.1| predicted protein [Populus trichocarpa]
Length = 88
Score = 45.1 bits (105), Expect = 0.037, Method: Composition-based stats.
Identities = 22/40 (55%), Positives = 29/40 (72%)
Query: 111 LLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSR 150
+L+ KL+ I G AF+GK TKTARS G+K Q+TV+ E R
Sbjct: 1 MLQCKLKNIEKGIAFSGKKTKTARSQGRKIQITVEHERPR 40
>gi|294055795|ref|YP_003549453.1| hypothetical protein [Coraliomargarita akajimensis DSM 45221]
gi|293615128|gb|ADE55283.1| TPR repeat-containing protein [Coraliomargarita akajimensis DSM
45221]
Length = 453
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 35/62 (56%), Gaps = 1/62 (1%)
Query: 149 SRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK-ACNLSHCLIKQARYTEARSVLED 207
R G LG+AY+ G Y +A+ YR+A L+ PD L+ CL++ RY A ++ +
Sbjct: 160 GRSYGLLGYAYLTGGRYYAADAAYRQAILMQPDVKDWKIGLARCLMETERYDVAIALFDT 219
Query: 208 VL 209
+L
Sbjct: 220 LL 221
>gi|289548588|ref|YP_003473576.1| hypothetical protein Thal_0816 [Thermocrinis albus DSM 14484]
gi|289182205|gb|ADC89449.1| TPR repeat-containing protein [Thermocrinis albus DSM 14484]
Length = 132
Score = 45.1 bits (105), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 38/66 (57%), Gaps = 2/66 (3%)
Query: 138 KKFQVTVKQETSRILG--NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQ 195
K+++ +K++ L NLG Y +KG+YT AE YR+A + D + NL++ L KQ
Sbjct: 41 KEYKKALKKDPKNFLALFNLGNVYAKKGDYTVAEKFYREALAVKEDPDVLNNLAYVLYKQ 100
Query: 196 ARYTEA 201
R EA
Sbjct: 101 GRREEA 106
>gi|376005011|ref|ZP_09782581.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375326605|emb|CCE18334.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 530
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 66/153 (43%), Gaps = 14/153 (9%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A + ++ L QQ ++DEA+ A + L A S DN L
Sbjct: 21 AIASYNQAIAASPKSAWYYHNLGEALSQQGKIDEAIAAYRQATELNPNSAW-SYDN-LGT 78
Query: 98 LYKKCGRLDEQIELLKQKLRM------IYHGEAF----NGKPTKTARSHGKKFQVTVKQE 147
L + G L E + +Q + + YH A G P + K ++T +
Sbjct: 79 LLNQQGNLPEAVRCFRQAIELDPYFWQFYHNLALVLIKQGHPEEAVSLLQKAIELTA--D 136
Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
+ + +LG AY Q+ Y+ A YR+ ++P
Sbjct: 137 DAELYHSLGKAYHQQQQYSEAVTAYRQGLELNP 169
>gi|302037002|ref|YP_003797324.1| hypothetical protein NIDE1664 [Candidatus Nitrospira defluvii]
gi|300605066|emb|CBK41399.1| exported protein of unknown function, contains TPR repeats
[Candidatus Nitrospira defluvii]
Length = 531
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 15/82 (18%)
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI--------DPD-ANKACNLSHCLIKQA 196
QE +R L NL YM + +T AE +YR++ +I +P A NL+H + Q
Sbjct: 243 QEVARTLNNLANVYMYQARFTDAESLYRRSLVITEKVLGKENPKVAATLTNLAHMSLDQG 302
Query: 197 RYTEARSV------LEDVLLGK 212
R+TEA S+ +E+ + GK
Sbjct: 303 RFTEAESLNQRALAIEEKVFGK 324
>gi|423065536|ref|ZP_17054326.1| TPR containing protein [Arthrospira platensis C1]
gi|406712979|gb|EKD08154.1| TPR containing protein [Arthrospira platensis C1]
Length = 530
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 66/153 (43%), Gaps = 14/153 (9%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A + ++ L QQ ++DEA+ A + L A S DN L
Sbjct: 21 AIASYNQAIAASPKSAWYYHNLGEALSQQGKIDEAIAAYRQATELNPNSAW-SYDN-LGT 78
Query: 98 LYKKCGRLDEQIELLKQKLRM------IYHGEAF----NGKPTKTARSHGKKFQVTVKQE 147
L + G L E + +Q + + YH A G P + K ++T +
Sbjct: 79 LLNQQGNLPEAVRCFRQAIELDPYFWQFYHNLALVLIKQGHPEEAVSLLQKAIELTA--D 136
Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
+ + +LG AY Q+ Y+ A YR+ ++P
Sbjct: 137 DAELYHSLGKAYHQQQQYSEAVTAYRQGLELNP 169
>gi|209527528|ref|ZP_03276030.1| TPR repeat-containing protein [Arthrospira maxima CS-328]
gi|209492016|gb|EDZ92369.1| TPR repeat-containing protein [Arthrospira maxima CS-328]
Length = 530
Score = 44.7 bits (104), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 66/153 (43%), Gaps = 14/153 (9%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A + ++ L QQ ++DEA+ A + L A S DN L
Sbjct: 21 AIASYNQAIAASPKSAWYYHNLGEALSQQGKIDEAIAAYRQATELNPNSAW-SYDN-LGT 78
Query: 98 LYKKCGRLDEQIELLKQKLRM------IYHGEAF----NGKPTKTARSHGKKFQVTVKQE 147
L + G L E + +Q + + YH A G P + K ++T +
Sbjct: 79 LLNQQGNLPEAVRCFRQAIELDPYFWQFYHNLALVLIKQGHPEEAVSLLQKAIELTA--D 136
Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
+ + +LG AY Q+ Y+ A YR+ ++P
Sbjct: 137 DAELYHSLGKAYHQQQQYSEAVTAYRQGLELNP 169
>gi|257059198|ref|YP_003137086.1| peptidase S1 and S6 chymotrypsin/Hap [Cyanothece sp. PCC 8802]
gi|256589364|gb|ACV00251.1| peptidase S1 and S6 chymotrypsin/Hap [Cyanothece sp. PCC 8802]
Length = 810
Score = 44.7 bits (104), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 77/177 (43%), Gaps = 19/177 (10%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSK--QAQESLDNVL 95
AI + KAI A ++ L Q +++EA+ A + L QA +L N L
Sbjct: 82 AIAAYQKAIQLNPNDADAYNNLGNALYYQGKLEEAIAAYQKAIQLNPNFAQAYNNLGNAL 141
Query: 96 IDLYKKCGRLDEQIELLKQKLRM------IYH--GEAFN--GKPTKTARSHGKKFQVTVK 145
D G+L+E I ++ +++ Y+ G A + GK + ++ K Q+
Sbjct: 142 SD----QGKLEEAIAAYQKAIQLNPNFTQAYYNLGIALSDQGKLEEAIAAYQKAIQLNPN 197
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEA 201
+ NLG A +G A Y+KA +DP DAN NL L KQ + EA
Sbjct: 198 YADAYY--NLGVALFDQGKLDEAIAAYQKAIQLDPNDANAYNNLGAALYKQGKLEEA 252
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 78/175 (44%), Gaps = 11/175 (6%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AI + KAI A ++ V L Q ++DEA+ A + L A + +N+
Sbjct: 182 EEAIAAYQKAIQLNPNYADAYYNLGVALFDQGKLDEAIAAYQKAIQLDPNDA-NAYNNLG 240
Query: 96 IDLYKKCGRLDEQIELLKQKLRMIYH-GEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
LYK+ G+L+E I ++ +++ + EA+N + + + Q+ ++ N
Sbjct: 241 AALYKQ-GKLEEAIAAYQKAIQLNPNLAEAYNNLGVALSDQGKRDEAIAAYQKAIQLNPN 299
Query: 155 LGWAYMQKGNYTS-------AEVVYRKAQLIDPDANKACN-LSHCLIKQARYTEA 201
AY GN S A Y+KA ++P+ A N L + L Q + EA
Sbjct: 300 FALAYNGLGNALSDQGKRDEAIAAYQKAIQLNPNFALAYNGLGNALSDQGKRDEA 354
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 44/177 (24%), Positives = 79/177 (44%), Gaps = 15/177 (8%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AI + KAI A ++ + L Q +++EA+ A + L A ++ N+
Sbjct: 148 EEAIAAYQKAIQLNPNFTQAYYNLGIALSDQGKLEEAIAAYQKAIQLNPNYA-DAYYNLG 206
Query: 96 IDLYKKCGRLDEQIELLKQKLRMIYH-GEAFN---------GKPTKTARSHGKKFQVTVK 145
+ L+ + G+LDE I ++ +++ + A+N GK + ++ K Q+
Sbjct: 207 VALFDQ-GKLDEAIAAYQKAIQLDPNDANAYNNLGAALYKQGKLEEAIAAYQKAIQLN-- 263
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIKQARYTEA 201
+ NLG A +G A Y+KA ++P+ A N L + L Q + EA
Sbjct: 264 PNLAEAYNNLGVALSDQGKRDEAIAAYQKAIQLNPNFALAYNGLGNALSDQGKRDEA 320
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 75/179 (41%), Gaps = 19/179 (10%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSK--QAQESLDN 93
E AI + KAI + A ++ V L Q + DEA+ A + L A L N
Sbjct: 250 EEAIAAYQKAIQLNPNLAEAYNNLGVALSDQGKRDEAIAAYQKAIQLNPNFALAYNGLGN 309
Query: 94 VLIDLYKKCGRLDEQIELLKQKLR------MIYH--GEAFN--GKPTKTARSHGKKFQVT 143
L D G+ DE I ++ ++ + Y+ G A + GK + ++ K Q+
Sbjct: 310 ALSD----QGKRDEAIAAYQKAIQLNPNFALAYNGLGNALSDQGKRDEAIAAYQKAIQLD 365
Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEA 201
+ NLG A +G A Y+KA ++P+ A NL + L Q + EA
Sbjct: 366 PNDANA--YNNLGLALRNQGKRDEAITAYQKAIQLNPNFALAYNNLGNALYSQGKREEA 422
>gi|67920262|ref|ZP_00513782.1| TPR repeat:Sel1-like repeat:Sel1-like repeat [Crocosphaera watsonii
WH 8501]
gi|67857746|gb|EAM52985.1| TPR repeat:Sel1-like repeat:Sel1-like repeat [Crocosphaera watsonii
WH 8501]
Length = 353
Score = 44.7 bits (104), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 49/177 (27%), Positives = 80/177 (45%), Gaps = 15/177 (8%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E A + KAI A +M L++Q +++EA+ A K L A + +N+
Sbjct: 77 EEATAAYKKAIELDPNYSFAYNNMGNALRKQGKLEEAIAAYKKAIELDPNDAF-AYNNMG 135
Query: 96 IDLYKKCGRLDEQIELLKQKLRM------IYH--GEAFN--GKPTKTARSHGKKFQVTVK 145
+ L G+L+E I K+ + + Y+ G A N GK + ++ K + +
Sbjct: 136 LAL-DDQGKLEEAIAAYKKAIELDPNYATAYYNMGNALNRQGKLEEAIAAYKKA--IELD 192
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEA 201
S N+G A ++G Y A Y+KA I+P+ A N+ L KQ +Y EA
Sbjct: 193 PNYSFAYNNMGVALRKQGKYDEAIAAYKKAIEINPNYAFAYNNMGVALRKQGKYDEA 249
Score = 43.9 bits (102), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 75/178 (42%), Gaps = 17/178 (9%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE--SLDN 93
E AI + KAI A +M + L Q +++EA+ A K L A ++ N
Sbjct: 111 EEAIAAYKKAIELDPNDAFAYNNMGLALDDQGKLEEAIAAYKKAIELDPNYATAYYNMGN 170
Query: 94 VLIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGK--------KFQVTV 144
L + G+L+E I K+ + + + A+N R GK K + +
Sbjct: 171 AL----NRQGKLEEAIAAYKKAIELDPNYSFAYNNMGV-ALRKQGKYDEAIAAYKKAIEI 225
Query: 145 KQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEA 201
+ N+G A ++G Y A Y+KA I+P DA N+ L Q +Y EA
Sbjct: 226 NPNYAFAYNNMGVALRKQGKYDEAIAAYKKAIEINPNDAFGYNNMGLALDDQGKYDEA 283
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 60/146 (41%), Gaps = 28/146 (19%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AI + KAI A +M V L++Q + DEA+ A K + A + +N+
Sbjct: 179 EEAIAAYKKAIELDPNYSFAYNNMGVALRKQGKYDEAIAAYKKAIEINPNYAF-AYNNMG 237
Query: 96 IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNL 155
+ L +K G+ DE I K+ + I +AF N+
Sbjct: 238 VAL-RKQGKYDEAIAAYKKAIE-INPNDAFG-------------------------YNNM 270
Query: 156 GWAYMQKGNYTSAEVVYRKAQLIDPD 181
G A +G Y A ++KA IDP+
Sbjct: 271 GLALDDQGKYDEAIAAHKKALEIDPN 296
>gi|411119538|ref|ZP_11391918.1| TPR repeat-containing protein [Oscillatoriales cyanobacterium JSC-12]
gi|410711401|gb|EKQ68908.1| TPR repeat-containing protein [Oscillatoriales cyanobacterium JSC-12]
Length = 2384
Score = 44.3 bits (103), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 68/158 (43%), Gaps = 10/158 (6%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + + A D ++AV L +Q ++DEA+ + + L K N +
Sbjct: 927 AIAYYQNTLKANDNHLDTYNNLAVALHEQGKLDEAMPYYQ--KALALKPDNPDAHNNYAN 984
Query: 98 LYKKCGRLDEQIELLKQKL--RMIYH------GEAFNGKPTKTARSHGKKFQVTVKQETS 149
L ++ RLDE I +Q + R Y G A+ K + + + + K
Sbjct: 985 LLRERSRLDEAIYHYQQAIAARPDYPDAYNNLGLAYYAKGNFASAAEAYRQAIERKPHFP 1044
Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
+ L +LG A + GN+ A Y++A + PD KA N
Sbjct: 1045 QALNHLGNALKELGNFAEAARYYQQAIALKPDYAKAYN 1082
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 44/175 (25%), Positives = 81/175 (46%), Gaps = 11/175 (6%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AI + +A+N A ++ + K Q ++DEA+ + L A E+ N+
Sbjct: 122 EEAIAAYQQALNLNQAYAEAHNNLGIAYKDQGKLDEALACYREAIRLKPNYA-EAHHNMG 180
Query: 96 IDLYKKCGRLDEQIELLKQKLRMIYHG-EAFNG-KPTKTARSHGKK----FQ--VTVKQE 147
I L ++ +LD+ I +Q + + H +A+ T + +G++ +Q VT+K
Sbjct: 181 IVL-RQQNKLDDAIHYFRQAIALKPHYIDAYTSLGSTLQQQGNGEEAIACYQQVVTLKPN 239
Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEA 201
+ NLG A +G A +++A + P+ C NL + L++ R EA
Sbjct: 240 YAEGFNNLGLALQHQGKLEEAIATFQQALALQPNFPGVCNNLGNLLLEVNRVDEA 294
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 42/169 (24%), Positives = 76/169 (44%), Gaps = 17/169 (10%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + KA+ AL ++ VLK Q +++ AV ++ L A+ + N L +
Sbjct: 328 AITHYQKALELRPNFVEALSNLGAVLKDQHKLEAAVSYLEQAVSLGPSYAE--IHNNLGN 385
Query: 98 LYKKCGRLDEQIEL------LKQKLRMIYHGEA----FNGKPTKTARSHGKKFQVTVKQE 147
Y++ R+DE I LK ++ ++ + G+ + A H +K + ++ +
Sbjct: 386 AYQEQKRVDEAIACYRTAVALKPEMAEVHSNLGNMLQYIGE-FEEAFEHFRK-AIEIQPD 443
Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD---ANKACNLSHCLI 193
+ + NLG A+ G A Y KA + PD A+ L+H L+
Sbjct: 444 FAGVYNNLGIAHRNAGQVQEAFAAYSKALELKPDFVEAHWNTALNHLLL 492
>gi|444912555|ref|ZP_21232716.1| hypothetical protein D187_04652 [Cystobacter fuscus DSM 2262]
gi|444716773|gb|ELW57614.1| hypothetical protein D187_04652 [Cystobacter fuscus DSM 2262]
Length = 588
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 79/176 (44%), Gaps = 26/176 (14%)
Query: 23 VRAKHVQLVEK--DPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFR 80
+R + + VE+ +PE A+ L +A+ R L+ A LK+Q R EA+ A++
Sbjct: 360 LRIQQARAVERGGEPERAVALLREALGR-KRTPELLEAFASTLKRQGRPGEALAALRE-- 416
Query: 81 HLCSKQAQES------LDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTAR 134
QA E L VL+D+ G L +L+ + H A N A+
Sbjct: 417 --AISQAPEDAAPRYVLATVLLDMGDVPGALYSMRGVLRLE---PDHAAAMNFIGYLLAQ 471
Query: 135 SHGKKFQ---------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
HG+ F + ++ +T L +LGW + Q+G+Y SA +A ++P+
Sbjct: 472 -HGRDFAEAERLVRRALALRPDTGSFLDSLGWIHYQRGDYPSAVRTLARAAELEPE 526
>gi|441504257|ref|ZP_20986254.1| Flp pilus assembly protein TadD [Photobacterium sp. AK15]
gi|441428430|gb|ELR65895.1| Flp pilus assembly protein TadD [Photobacterium sp. AK15]
Length = 365
Score = 43.9 bits (102), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 35/67 (52%), Gaps = 1/67 (1%)
Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEAR 202
V ++ + L NLG++Y GN AEV+ R+A + PD +A NL IK RY +A
Sbjct: 219 VDGKSQKALTNLGYSYYLSGNLRQAEVINRRASTLYPDNQRAWSNLGLTYIKAKRYDDAN 278
Query: 203 SVLEDVL 209
++
Sbjct: 279 DAFSRIM 285
>gi|82702040|ref|YP_411606.1| hypothetical protein Nmul_A0911 [Nitrosospira multiformis ATCC
25196]
gi|82410105|gb|ABB74214.1| Tetratricopeptide TPR_4 [Nitrosospira multiformis ATCC 25196]
Length = 875
Score = 43.9 bits (102), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 78/167 (46%), Gaps = 11/167 (6%)
Query: 55 ALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQ 114
AL + ++ + R+D +E IK R L K N L + ++ G+L++ I +
Sbjct: 79 ALHFLGLLARDAGRIDIGIELIK--RALRFKPNYVEAHNNLGNTLRQQGKLNDAIASYRT 136
Query: 115 KLRMIYHGEAFNGKPTKTARSHGK--------KFQVTVKQETSRILGNLGWAYMQKGNYT 166
+++ G R G+ + + ++ + + + N+G + ++G+
Sbjct: 137 AVKLEPRFAEAYGNLGNALREQGRLDDAMINYRKALGIQPQLAEMHCNIGIVHREQGDLE 196
Query: 167 SAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEARSVLEDVLLGK 212
+A +RKA L+ PD+ +A NL + L++Q ++ EA S +L K
Sbjct: 197 NAVSSFRKALLLKPDSAEAFNNLGNVLVEQGKFEEAVSSFGKAILYK 243
>gi|394988448|ref|ZP_10381283.1| hypothetical protein SCD_00848 [Sulfuricella denitrificans skB26]
gi|393791827|dbj|GAB70922.1| hypothetical protein SCD_00848 [Sulfuricella denitrificans skB26]
Length = 735
Score = 43.9 bits (102), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 74/161 (45%), Gaps = 11/161 (6%)
Query: 58 DMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLR 117
++ VVL +Q R+ EA ++ + A ++ +N+ I L K+ RL E +Q L+
Sbjct: 89 NLGVVLVEQGRLSEAEANLRRALEIKPNHA-DAHNNLGIALMKQ-ERLSEAEPCFQQALK 146
Query: 118 MI--YHGEAFNGKPTKTARSHGKKFQVT------VKQETSRILGNLGWAYMQKGNYTSAE 169
+ YH N T + + +++ +K + L NLG Q+G +E
Sbjct: 147 IKPDYHEALNNLGSLLTEQGLLAEAEISCVEALKIKPDYVPALSNLGVCLQQQGRLAESE 206
Query: 170 VVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
+R+A I PD+ +A C L L Q R+ EA + L L
Sbjct: 207 ACFRRALEIKPDSAEALCGLGITLNDQGRFPEAEANLRRAL 247
Score = 38.5 bits (88), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 42/89 (47%), Gaps = 1/89 (1%)
Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
G A +G+ + + + + ++ + NLG A ++ AE YR+A I P+
Sbjct: 23 GTALSGQGKSSEALDALQRAIALSRDDAEAHSNLGNALKNLRRFSEAETSYRQAIKIRPN 82
Query: 182 -ANKACNLSHCLIKQARYTEARSVLEDVL 209
A CNL L++Q R +EA + L L
Sbjct: 83 FAEAQCNLGVVLVEQGRLSEAEANLRRAL 111
>gi|398346185|ref|ZP_10530888.1| hypothetical protein Lbro5_02905 [Leptospira broomii str. 5399]
Length = 689
Score = 43.9 bits (102), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 7/86 (8%)
Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEARSVLE 206
+S LGN+ + +G+Y SAE +R+A + P+ KA NLS L K + EA LE
Sbjct: 279 SSAYLGNIAY---NRGDYGSAEHYFREAATLSPNDAKALYNLSVVLKKNGKMEEALKYLE 335
Query: 207 DVLLGKLSGSTETKTINRVKELLQEL 232
+ KL+G+++ + + E ++L
Sbjct: 336 ---MAKLAGASDPEIFRSIAESFEQL 358
>gi|390568922|ref|ZP_10249213.1| TPR repeat-containing protein [Burkholderia terrae BS001]
gi|389939128|gb|EIN00966.1| TPR repeat-containing protein [Burkholderia terrae BS001]
Length = 702
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 72/163 (44%), Gaps = 15/163 (9%)
Query: 55 ALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQ 114
A ++ VVL+Q R EA +A K + A ++ L L + GR+DE E+ +
Sbjct: 243 AYNNLGVVLRQLGRPGEAKQAHKKAIATDPEHAGAHIN--LARLLESLGRVDE-AEVAYR 299
Query: 115 KLRMIY--HGEAFNGKPTKTARSHGK--------KFQVTVKQETSRILGNLGWAYMQKGN 164
+ +Y H E N ++ G+ + + +K + NLG +
Sbjct: 300 RGLCLYPDHAETHNNL-GAMLQAQGRMPDAETCYRRSLAIKPDFPNACFNLGIVLQAQKR 358
Query: 165 YTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLE 206
Y AE +YR+A PD +A NL+H L+ ++TE V E
Sbjct: 359 YHEAERLYRRALQAKPDMLEARLNLAHLLLATGQFTEGWQVFE 401
>gi|115477811|ref|NP_001062501.1| Os08g0559300 [Oryza sativa Japonica Group]
gi|75325353|sp|Q6YZI0.1|SPY_ORYSJ RecName: Full=Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY
gi|45736106|dbj|BAD13137.1| putative gibberellin action negative regulator SPY [Oryza sativa
Japonica Group]
gi|113624470|dbj|BAF24415.1| Os08g0559300 [Oryza sativa Japonica Group]
gi|215686860|dbj|BAG89710.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 927
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 66/151 (43%), Gaps = 30/151 (19%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AIV + A++ R A ++ V+ K +D +D+AVE + + +Q SL+N+
Sbjct: 303 EMAIVFYELALHFNPRCAEACNNLGVIYKDRDNLDKAVECYQMALSIKPNFSQ-SLNNLG 361
Query: 96 IDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
+ +Y G++D ++++ + + EA+N N
Sbjct: 362 V-VYTVQGKMDAASSMIQKAIFANSTYAEAYN---------------------------N 393
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
LG Y G+ TSA Y K IDPD+ A
Sbjct: 394 LGVLYRDAGSITSAVQAYEKCLQIDPDSRNA 424
>gi|218201606|gb|EEC84033.1| hypothetical protein OsI_30264 [Oryza sativa Indica Group]
Length = 947
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 66/151 (43%), Gaps = 30/151 (19%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AIV + A++ R A ++ V+ K +D +D+AVE + + +Q SL+N+
Sbjct: 323 EMAIVFYELALHFNPRCAEACNNLGVIYKDRDNLDKAVECYQMALSIKPNFSQ-SLNNLG 381
Query: 96 IDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
+ +Y G++D ++++ + + EA+N N
Sbjct: 382 V-VYTVQGKMDAASSMIQKAIFANSTYAEAYN---------------------------N 413
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
LG Y G+ TSA Y K IDPD+ A
Sbjct: 414 LGVLYRDAGSITSAVQAYEKCLQIDPDSRNA 444
>gi|118391306|ref|XP_001028412.1| TPR Domain containing protein [Tetrahymena thermophila]
gi|89281566|gb|EAR80749.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
Length = 318
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTE 200
V + + S GNLG+++MQK Y A ++KA +DP D+ NL + +K+ Y +
Sbjct: 13 VQLDPKDSWAFGNLGYSFMQKEMYDDAITFFQKAVQLDPRDSWAFGNLGYSFMKKKMYDD 72
Query: 201 ARSVLE 206
A + L+
Sbjct: 73 AITFLQ 78
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 46/198 (23%), Positives = 80/198 (40%), Gaps = 18/198 (9%)
Query: 14 KLPPGDSPYVRAKHVQLVEKDP-EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEA 72
+L P DS R ++K+ + AI F KA+ R A ++ ++ D+A
Sbjct: 116 QLDPKDSCAFRCMGYSFMKKEMYDDAITFFQKAVQLDPRDSWAFGNLGYSFMKKKMYDDA 175
Query: 73 VEAIK-----------SFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYH 121
+ + +F +L ++ + + I ++K +LD + KL
Sbjct: 176 ITFFQKAVQLDPKDSWAFGNLGYSFMKKKMYDDAITFFQKAVQLDPKDSWAFGKL----- 230
Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
G +F K + V + + S G LG+++MQK Y A +KA +DPD
Sbjct: 231 GYSFMQKQMYDDAITFFQKAVQLDPKDSWAFGKLGYSFMQKEMYDDAITFSQKAVQLDPD 290
Query: 182 ANKA-CNLSHCLIKQARY 198
+ NL K+ RY
Sbjct: 291 VKENLLNLGIAFQKKGRY 308
>gi|222641017|gb|EEE69149.1| hypothetical protein OsJ_28275 [Oryza sativa Japonica Group]
Length = 947
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 66/151 (43%), Gaps = 30/151 (19%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AIV + A++ R A ++ V+ K +D +D+AVE + + +Q SL+N+
Sbjct: 323 EMAIVFYELALHFNPRCAEACNNLGVIYKDRDNLDKAVECYQMALSIKPNFSQ-SLNNLG 381
Query: 96 IDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
+ +Y G++D ++++ + + EA+N N
Sbjct: 382 V-VYTVQGKMDAASSMIQKAIFANSTYAEAYN---------------------------N 413
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
LG Y G+ TSA Y K IDPD+ A
Sbjct: 414 LGVLYRDAGSITSAVQAYEKCLQIDPDSRNA 444
>gi|223934537|ref|ZP_03626458.1| Tetratricopeptide TPR_2 repeat protein [bacterium Ellin514]
gi|223897000|gb|EEF63440.1| Tetratricopeptide TPR_2 repeat protein [bacterium Ellin514]
Length = 718
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 80/181 (44%), Gaps = 15/181 (8%)
Query: 32 EKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESL 91
+ P+ A+ + K+I D A ++ VL ++ + +EA+ S L +A+ +
Sbjct: 484 QGKPDEALANYQKSIEINPNYDEAQNNIGYVLAKKGKPEEAIPYYFSSLRLNPNRAE--V 541
Query: 92 DNVLIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTA---------RSHGKKFQ 141
N L + Y G+LDE I + L++ + EA+NG A R +
Sbjct: 542 HNNLGNAYADLGKLDEAIREYQAALKINTNYAEAYNGIGISLAKKGDLAGATRWLSDAIR 601
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTE 200
+ K +S NLG Y +G + A + Y+ +P DA NL++ L +Q + E
Sbjct: 602 LNPKNVSSH--SNLGNVYAMQGKFDLAAIEYKLVLKQNPDDALTHNNLANLLSEQGKLDE 659
Query: 201 A 201
A
Sbjct: 660 A 660
>gi|254409909|ref|ZP_05023689.1| Tetratricopeptide repeat family [Coleofasciculus chthonoplastes PCC
7420]
gi|196182945|gb|EDX77929.1| Tetratricopeptide repeat family [Coleofasciculus chthonoplastes PCC
7420]
Length = 956
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 67/169 (39%), Gaps = 33/169 (19%)
Query: 56 LKDMAVVLKQQDRVDEAVE-AIKSFRHLCSKQAQESLD-----NVLIDLYKKCGRLDEQI 109
LK L Q + +EA+ A + R S ++ LD N L LY+ GR E
Sbjct: 60 LKQQVEQLYNQGKYNEAIPLAERMLRLYQSVYGEDHLDITYSLNYLGILYRNQGRYTEAE 119
Query: 110 ELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAE 169
L +Q L M K+ + L NL + Y +G YT AE
Sbjct: 120 PLYRQALEM------------------KKRLLGEEHPHVATSLNNLAYLYESQGRYTEAE 161
Query: 170 VVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSVLEDVL 209
+YR+A +L+ + A NL++ Q RYTEA +L L
Sbjct: 162 PLYRQALEMYKRLLGEEHPLVATSLNNLAYLYESQGRYTEAEPLLRQAL 210
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 73/174 (41%), Gaps = 35/174 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEAV----EAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
V ++L ++A + + Q R EA +A++ ++ L ++ SL+N L LY+ GR
Sbjct: 140 VATSLNNLAYLYESQGRYTEAEPLYRQALEMYKRLLGEEHPLVATSLNN-LAYLYESQGR 198
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
E LL+Q L M R G+ L NL +Y +G
Sbjct: 199 YTEAEPLLRQALEM-------------RKRLLGQDHPAVATS-----LNNLAGSYQSQGR 240
Query: 165 YTSAEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSVLEDVL 209
YT AE + R+A +L+ + A NL+ Q RY+EA + L
Sbjct: 241 YTEAEPLLRQALEMSKRLLGEEHPHVALSLNNLAALYDNQGRYSEAEPLYRQAL 294
>gi|297180994|gb|ADI17196.1| FOG: tpr repeat-protein [uncultured Rhodobacterales bacterium
HF0070_10D05]
Length = 734
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 44/84 (52%), Gaps = 2/84 (2%)
Query: 33 KDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLD 92
K+ + A+V + KAI+ +A +M LK Q ++DEA+EA + LC K +
Sbjct: 346 KNYDDAVVFYNKAISIKPDYATAYNNMGNALKDQGKLDEAIEAFN--KALCIKPDYATAY 403
Query: 93 NVLIDLYKKCGRLDEQIELLKQKL 116
N + + K G+LDE IE + L
Sbjct: 404 NNMGNALKDQGKLDEAIEAFNKAL 427
>gi|255562045|ref|XP_002522031.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
communis]
gi|223538835|gb|EEF40435.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
communis]
Length = 979
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 55/247 (22%), Positives = 92/247 (37%), Gaps = 38/247 (15%)
Query: 16 PPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA 75
P Y+ +V P+ AIV + +A+ A ++A ++ ++D A+
Sbjct: 256 PTFPDAYLNLGNVYRALGMPQEAIVCYQRAVQTRPNYAVAFGNLASTYYERGQLDLAIHH 315
Query: 76 IKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNG------- 127
K C + E+ +N L + K GR++E I+ Q L + H +A
Sbjct: 316 YKQ-AIACDGRFLEAYNN-LGNALKDVGRVEEAIQCYNQCLALQPSHPQALTNLGNIYME 373
Query: 128 -KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC 186
TA S+ K + V S NL Y Q+GNY A Y + IDP A
Sbjct: 374 WNMASTAASYYKA-TLAVTTGLSAPFNNLAVIYKQQGNYADAISCYNEVLRIDPLAAD-- 430
Query: 187 NLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKK 246
G ++ K I RV + +Q+ +I P++
Sbjct: 431 ------------------------GLVNRGNTYKEIGRVSDAIQDYIRAITIRPTMAEAH 466
Query: 247 SSLEDAF 253
++L A+
Sbjct: 467 ANLASAY 473
>gi|67902940|ref|XP_681726.1| hypothetical protein AN8457.2 [Aspergillus nidulans FGSC A4]
gi|40747923|gb|EAA67079.1| hypothetical protein AN8457.2 [Aspergillus nidulans FGSC A4]
gi|259484403|tpe|CBF80593.1| TPA: Pfs, NB-ARC and TPR domain protein (JCVI) [Aspergillus nidulans
FGSC A4]
Length = 1131
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 93/243 (38%), Gaps = 46/243 (18%)
Query: 54 SALKDMAVVLKQQDRVDEAVE----AIKSFRHLCSKQAQESLDNV--LIDLYKKCGRLDE 107
+ + ++ + Q ++ EA E A+ + +LD V L +LY GRL E
Sbjct: 819 NTVNNLGNLYSDQGKLREAEEMYQQALAGYEKALGPAHTSTLDTVNNLGNLYSDQGRLKE 878
Query: 108 QIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTS 167
E+ +Q A GK H T +GNLG Y +G
Sbjct: 879 AEEMYQQ---------ALAGKEKALGLDH---------TSTLDTVGNLGLLYRDQGKLRE 920
Query: 168 AEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSVLEDVLLGKLS--GS 216
AE ++++A + + PD N NL + Q R EA + + L GK G
Sbjct: 921 AEEMFQRALTGKEKALGPDHTSTLNTVNNLGNLYSDQGRLKEAEEMYQRALAGKEKALGL 980
Query: 217 TETKTINRVKEL---------LQELEPWQSIPPSLTTKKSSLEDAFLEGLDDLMNQWTPY 267
T T+N V L L+E E + +LT K+ +L LD + N Y
Sbjct: 981 DHTSTLNTVNNLGLLYRDQGKLKEAE--EMFQRALTGKEKALGPDHTSTLDTVGNLGLLY 1038
Query: 268 RSR 270
R +
Sbjct: 1039 RDQ 1041
>gi|427707117|ref|YP_007049494.1| hypothetical protein Nos7107_1704 [Nostoc sp. PCC 7107]
gi|427359622|gb|AFY42344.1| Tetratricopeptide TPR_1 repeat-containing protein [Nostoc sp. PCC
7107]
Length = 552
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 105/247 (42%), Gaps = 42/247 (17%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSK--QAQESLDNVL 95
A+ + AI ++ +A + VL+ Q ++DEA+ K L K A + NVL
Sbjct: 81 ALAAYQTAIKINPKLANAYNGLGNVLRDQGKLDEALATYKIAIKLDGKLANAYNGMGNVL 140
Query: 96 IDLYKKCGRLDEQIELLKQKLRM------IYHGEA----FNGKPTKTARSHGKKFQVTVK 145
+ G+L+E I ++ +++ Y+G + GK + S+ K Q K
Sbjct: 141 ----SEQGKLNESIAAYQKSIQLDPKNALPYNGMGNVLIYQGKLDEAIASYRKAIQFDPK 196
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQ-------AR 197
+ NLG A + A Y+KA IDP A +L L +Q A+
Sbjct: 197 YAVT--YHNLGLALYNQKKLDEALAAYKKAIQIDPKYTSAYVSLGLALSEQGKLDEAMAK 254
Query: 198 YTEARSVLEDVLLGKLSGSTETKTI------------NRVKELLQELEPWQSIPPSLTTK 245
Y +A S+ ED K + T T+ ++KE ++E + SI + T
Sbjct: 255 YRQALSLPED----KSATPTTVHTLAHNNLGFALQRQGKLKEAIEEYKQAISIDSNFVTA 310
Query: 246 KSSLEDA 252
+++L++A
Sbjct: 311 QTNLKEA 317
>gi|254410083|ref|ZP_05023863.1| Tetratricopeptide repeat family [Coleofasciculus chthonoplastes PCC
7420]
gi|196183119|gb|EDX78103.1| Tetratricopeptide repeat family [Coleofasciculus chthonoplastes PCC
7420]
Length = 955
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 67/169 (39%), Gaps = 33/169 (19%)
Query: 56 LKDMAVVLKQQDRVDEAVE-AIKSFRHLCSKQAQESLD-----NVLIDLYKKCGRLDEQI 109
LK L Q + +EA+ A + R S ++ LD N L LY+ GR E
Sbjct: 59 LKQQVEQLYNQGKYNEAIPLAERMLRLYQSVYGEDHLDITYSLNYLGILYRNQGRYTEAE 118
Query: 110 ELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAE 169
L +Q L M K+ + L NL + Y +G YT AE
Sbjct: 119 PLYRQALEM------------------KKRLLGEEHPHVATSLNNLAYLYESQGRYTEAE 160
Query: 170 VVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSVLEDVL 209
+YR+A +L+ + A NL++ Q RYTEA +L L
Sbjct: 161 PLYRQALEMYKRLLGEEHPLVATSLNNLAYLYESQGRYTEAEPLLRQAL 209
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 73/174 (41%), Gaps = 35/174 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEAV----EAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
V ++L ++A + + Q R EA +A++ ++ L ++ SL+N L LY+ GR
Sbjct: 139 VATSLNNLAYLYESQGRYTEAEPLYRQALEMYKRLLGEEHPLVATSLNN-LAYLYESQGR 197
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
E LL+Q L M R G+ L NL +Y +G
Sbjct: 198 YTEAEPLLRQALEM-------------RKRLLGQDHPAVATS-----LNNLAGSYQSQGR 239
Query: 165 YTSAEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSVLEDVL 209
YT AE + R+A +L+ + A NL+ Q RY+EA + L
Sbjct: 240 YTEAEPLLRQALEMSKRLLGEEHPHVALSLNNLAALYDNQGRYSEAEPLYRQAL 293
>gi|423065545|ref|ZP_17054335.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
gi|406712988|gb|EKD08163.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
Length = 917
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 37/156 (23%), Positives = 74/156 (47%), Gaps = 14/156 (8%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AI + KAI ++ A DM + L ++ +D A++ ++ C + + N L
Sbjct: 124 EGAIGCYEKAIALNPKLGQAYLDMGLRLNERGDIDTAIKVLQQGGINCPNF--KEIFNTL 181
Query: 96 IDLYKKCGRLDEQIELLKQKLR------MIYH----GEAFNGKPTKTARSHGKKFQVTVK 145
L + ++DE I + ++ L ++Y+ + GK ++ ++ K +++K
Sbjct: 182 GYLQLQQNQIDEAIAIFQEALNIDPTEPLVYNNLGWAWTYQGKLSEAIAAYQKA--ISLK 239
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
+ + NLG + QK N+ A ++KA I+PD
Sbjct: 240 PDLAIAYSNLGKLWQQKNNHRQAISYFQKAIAIEPD 275
>gi|397688051|ref|YP_006525370.1| hypothetical protein PSJM300_14750 [Pseudomonas stutzeri DSM 10701]
gi|395809607|gb|AFN79012.1| TPR domain-containing protein [Pseudomonas stutzeri DSM 10701]
Length = 575
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 30/59 (50%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
IL +LGWA+ + GN AE RKA PD A +L L + EAR+V D L
Sbjct: 495 ILDSLGWAHYRLGNLQEAEAQLRKAHARFPDHEVAAHLGEVLWVSGQRKEARTVWADAL 553
>gi|113477766|ref|YP_723827.1| hypothetical protein Tery_4362 [Trichodesmium erythraeum IMS101]
gi|110168814|gb|ABG53354.1| Tetratricopeptide TPR_2 [Trichodesmium erythraeum IMS101]
Length = 725
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 84/188 (44%), Gaps = 48/188 (25%)
Query: 52 VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIEL 111
+DS + +A+ + +DR E+ ++L N+ I +Y+ G + I
Sbjct: 147 IDSFQQQLAITQETKDRQSES----------------KALGNLGI-VYQSLGHFERSINY 189
Query: 112 LKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVV 171
+Q+L + T+ ++ +Q S+ LGNLG Y +G Y+ AE +
Sbjct: 190 FQQQLAI-----------TR---------EIKDRQSESKALGNLGICYENQGQYSKAEPL 229
Query: 172 YRKA-----QLID---PD-ANKACNLSHCLIKQARYTEARSV-LEDVLLGK-LSGSTETK 220
Y +A QL+ PD A+ NL+ Q RY+EA + LE + + K L G+T +
Sbjct: 230 YLEALKMNKQLLGSTHPDVASNLNNLAGLYYSQGRYSEAEPLYLEALKMNKQLLGATHPE 289
Query: 221 TINRVKEL 228
+ + L
Sbjct: 290 IASNLNNL 297
>gi|209528249|ref|ZP_03276714.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
gi|209491321|gb|EDZ91711.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
Length = 915
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 37/156 (23%), Positives = 74/156 (47%), Gaps = 14/156 (8%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AI + KAI ++ A DM + L ++ +D A++ ++ C + + N L
Sbjct: 122 EGAIGCYEKAIALNPKLGQAYLDMGLRLNERGDIDTAIKVLQQGGINCPNF--KEIFNTL 179
Query: 96 IDLYKKCGRLDEQIELLKQKLR------MIYH----GEAFNGKPTKTARSHGKKFQVTVK 145
L + ++DE I + ++ L ++Y+ + GK ++ ++ K +++K
Sbjct: 180 GYLQLQQNQIDEAIAIFQEALNIDPTEPLVYNNLGWAWTYQGKLSEAIAAYQKA--ISLK 237
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
+ + NLG + QK N+ A ++KA I+PD
Sbjct: 238 PDLAIAYSNLGKLWQQKNNHRQAISYFQKAIAIEPD 273
>gi|406870824|gb|EKD21860.1| hypothetical protein ACD_87C00299G0002, partial [uncultured
bacterium]
Length = 637
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 68/171 (39%), Gaps = 23/171 (13%)
Query: 26 KHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSK 85
KH Q VE + KAIN G + K++A Q + EA I + L SK
Sbjct: 463 KHAQSVEN--------YQKAINLGAKDPQLHKNLAQAYSQMGKTKEA---IPMYEKLASK 511
Query: 86 QAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM----------IYHGEAFNGKPTKTARS 135
+ + NVL D Y K D+ I + K+ + M + G +
Sbjct: 512 KPTAEVLNVLADAYMKEKLYDKAIMIYKKLIEMNPKKAAYFASAAYAYGLKGDSDRQIEY 571
Query: 136 HGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC 186
+ + + E + + +LG AY +KG Y A Y A ++PDA +
Sbjct: 572 Y--RLSLKFDPEDADVYASLGAAYEKKGLYAEALKAYTDAYELNPDATQVA 620
>gi|157108260|ref|XP_001650149.1| kinesin light chain 1 and [Aedes aegypti]
gi|108868572|gb|EAT32797.1| AAEL014967-PA [Aedes aegypti]
Length = 472
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 73/175 (41%), Gaps = 35/175 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEAVEAIKSFRHL-------CSKQAQESLDNVLIDLYKKCGR 104
V + L + +V + Q+ + EA++ + + C +L+N+ + LY K G
Sbjct: 234 VATMLSILTMVYRDQNNLPEAIKHMNEALGIWVRCLGECHPSVAAALNNLAV-LYGKNGN 292
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
E L K+ A + R H + ++ L NL +G
Sbjct: 293 YKEAESLCKR---------ALANRENVLGRYH---------PDVAKQLNNLALLCQNQGK 334
Query: 165 YTSAEVVYRKA--------QLIDPDANKA-CNLSHCLIKQARYTEARSVLEDVLL 210
+ E+ R+A +IDP+A K CNL+ C +K +Y EA +L DVL+
Sbjct: 335 HGEVELYIRRALEIFESQLGVIDPNAIKTKCNLAACCLKLRKYKEADQLLRDVLV 389
>gi|376005002|ref|ZP_09782572.1| hypothetical protein ARTHRO_370013 [Arthrospira sp. PCC 8005]
gi|375326596|emb|CCE18325.1| hypothetical protein ARTHRO_370013 [Arthrospira sp. PCC 8005]
Length = 915
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 74/156 (47%), Gaps = 14/156 (8%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AI + KAI ++ A DM + L ++ +D A++ ++ C + + N L
Sbjct: 122 EGAIGCYEKAIALNPKLGQAYLDMGLRLNERGDIDTAIKVLQQGGINCPNF--QEIFNTL 179
Query: 96 IDLYKKCGRLDEQIELLKQKLR------MIYH----GEAFNGKPTKTARSHGKKFQVTVK 145
L + ++DE I + ++ L ++Y+ A GK ++ ++ K +++K
Sbjct: 180 GYLQLQQNQIDEAIAIFQESLNIDPTEPLVYNNLGWALACQGKLSEAIAAYEKA--ISLK 237
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
+ + NLG + QK N+ A ++KA I+PD
Sbjct: 238 PDLAIAYSNLGKLWQQKNNHRQAISYFQKAIAIEPD 273
>gi|297833104|ref|XP_002884434.1| hypothetical protein ARALYDRAFT_896453 [Arabidopsis lyrata subsp.
lyrata]
gi|297330274|gb|EFH60693.1| hypothetical protein ARALYDRAFT_896453 [Arabidopsis lyrata subsp.
lyrata]
Length = 977
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 58/251 (23%), Positives = 93/251 (37%), Gaps = 46/251 (18%)
Query: 16 PPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA 75
P Y+ +V P AI+ + A+ A ++A + +Q ++D A
Sbjct: 254 PAFPDAYLNLGNVYKALGRPTEAIMCYQHALQMRPNCAMAFGNIASIYYEQGQLDLA--- 310
Query: 76 IKSFRHLCSK-----QAQESLDNVLIDLYKKCGRLDEQIELLKQKLR--------MIYHG 122
I+ ++ S+ +A +L N L D+ GR+DE I Q L M G
Sbjct: 311 IRHYKQAISRDPRFLEAYNNLGNALKDI----GRVDEAIRCYNQCLALQPNHPQAMANLG 366
Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
+ S K + V S NL Y Q+GNY+ A Y + IDP A
Sbjct: 367 NIYMEWNMMGPASSLFKATLAVTTGLSAPFNNLAIIYKQQGNYSDAISCYNEVLRIDPLA 426
Query: 183 NKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSL 242
D L+ + G+T K I RV E +Q+ + P++
Sbjct: 427 -----------------------ADALVNR--GNT-YKEIGRVTEAIQDYMHAINFRPTM 460
Query: 243 TTKKSSLEDAF 253
++L A+
Sbjct: 461 AEAHANLASAY 471
>gi|452003920|gb|EMD96377.1| hypothetical protein COCHEDRAFT_1201158 [Cochliobolus heterostrophus
C5]
Length = 1255
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 52/207 (25%), Positives = 86/207 (41%), Gaps = 42/207 (20%)
Query: 43 WKAINAGDRVD-SALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQAQESL--DNVL 95
+KA+ G+ + + ++ ++ K++ R +A +EA++ F + + +L N L
Sbjct: 854 YKALEPGNLLTFKTVANLGLLYKERSRFADAERMYIEALQGFEQVLGSNHKTTLRTANNL 913
Query: 96 IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNL 155
LYK GRL + + KQ L+ + + G K T++ + +L
Sbjct: 914 GILYKNQGRLVDAERIHKQVLK-------------RKEDALGSKHLSTLQ-----TIDSL 955
Query: 156 GWAYMQKGNYTSAEVVYRKA------------QLIDPDANKACNLSHCLIKQARYTEARS 203
G YM + N T AE +Y KA L NL HC Q R +A
Sbjct: 956 GVVYMNQSNLTKAEELYEKALNGKEEVLGSKHHLTLQTVGNLGNLYHC---QGRLVDAER 1012
Query: 204 VLEDVLLG--KLSGSTETKTINRVKEL 228
+LE VL G + G T+N + L
Sbjct: 1013 MLERVLQGYKDVFGHKHPSTLNTINNL 1039
>gi|449449593|ref|XP_004142549.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC-like [Cucumis
sativus]
Length = 975
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 52/227 (22%), Positives = 88/227 (38%), Gaps = 36/227 (15%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P+ AIV + +AI A ++A +Q ++D A+ K C + E+ +N
Sbjct: 281 PQEAIVCYQRAIQMRPNYAIAYGNLASTYYEQSQLDMAILHYKQ-AITCDPRFLEAYNN- 338
Query: 95 LIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPT-----KTARSHGKKFQVTVKQET 148
L + K+ GR++E I+ Q L + H +A + ++ T++ T
Sbjct: 339 LGNALKEFGRVEEAIQCYNQCLALQPSHPQALTNLGNIYMEWNMVPAAASYYKATLRVTT 398
Query: 149 --SRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
S NL Y Q+GNY A Y + IDP A
Sbjct: 399 GLSAPFNNLAIIYKQQGNYADAISCYNEVLRIDPLAAD---------------------- 436
Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
G ++ K I RV E +Q+ +I P++ ++L A+
Sbjct: 437 ----GLVNRGNTYKEIGRVSEAIQDYIRAINIRPTMAEAHANLASAY 479
>gi|406981058|gb|EKE02578.1| hypothetical protein ACD_20C00357G0001 [uncultured bacterium]
Length = 536
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 68/163 (41%), Gaps = 37/163 (22%)
Query: 54 SALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQA--QESLDNVLIDLYKKCGRLDEQIEL 111
+A K + + ++Q D A+E + HL A +L N+L Y K G+ DE I
Sbjct: 278 TAYKSLCSIYEEQSDYDSAIEMYRKLIHLNPNDAVFYSNLANLL---YLK-GKFDEAISA 333
Query: 112 LKQKLRMIYHGEAFNGKPTK-----TARSHGKKFQ-------VTVKQETSRILGN----- 154
+ + + P K A++ G FQ + + L N
Sbjct: 334 YQTAVTL---------NPNKRWTSVIAQTLGYIFQESKENVDAAISSYQNATLLNPEDID 384
Query: 155 ----LGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCL 192
LG A+ KG+Y +A +VYR A IDP +A CNL + L
Sbjct: 385 TYISLGSAFYDKGDYNNALIVYRTALEIDPNNARIHCNLGYLL 427
>gi|186681165|ref|YP_001864361.1| hypothetical protein Npun_R0666 [Nostoc punctiforme PCC 73102]
gi|186463617|gb|ACC79418.1| TPR repeat-containing protein [Nostoc punctiforme PCC 73102]
Length = 532
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 53/231 (22%), Positives = 101/231 (43%), Gaps = 22/231 (9%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + KA+ +A ++ L Q ++ EA+ A + + A N+ +
Sbjct: 287 AIAAYQKALQIDPNYVNAHCNLGKALHTQGKLSEAMAAYQRALRVDPNYASAHC-NLGVT 345
Query: 98 LYKKCGRLDEQIELLKQKLRM--------IYHGEAFN--GKPTKTARSHGKKFQVTVKQE 147
LY + G+L E I ++ L++ G+A + GK ++ ++ + +V
Sbjct: 346 LYHQ-GKLSEAIAAYQKALQIDPNYVNAHCNLGKALHTQGKLSEAMAAYQRALRVDPNYA 404
Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVLE 206
++ NLG +G + A Y++A +DP DA+ CNL L Q + +EA + +
Sbjct: 405 SAHC--NLGVTLYHQGKLSEAIAAYQRALRLDPNDADTHCNLGIALHDQGKLSEAIAAYQ 462
Query: 207 DVLL-------GKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLE 250
LL + K +++E + ELE + P+ T +++LE
Sbjct: 463 RALLIDPNDADAHCNLGIALKGQGKLEEAIAELEIAVRLNPNNTVIRNNLE 513
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 82/191 (42%), Gaps = 21/191 (10%)
Query: 33 KDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLD 92
K E A+V + A++ A + VVL Q ++ EA+ A + + Q +
Sbjct: 180 KKLEEAVVKYKAALSLDPNSVYAHNALGVVLHTQGKLSEAIAAYQK-----ALQIDPNYV 234
Query: 93 NVLIDLYKKC---GRLDEQIELLKQKLRM--------IYHGEAFN--GKPTKTARSHGKK 139
N +L K G+L E + ++ LR+ G A + GK ++ ++ K
Sbjct: 235 NAHCNLGKALHTQGKLSEAMAAYQRALRLDPNDADTHCNLGIALHDQGKLSEAIAAYQKA 294
Query: 140 FQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARY 198
Q+ + NLG A +G + A Y++A +DP+ A+ CNL L Q +
Sbjct: 295 LQIDPNYVNAHC--NLGKALHTQGKLSEAMAAYQRALRVDPNYASAHCNLGVTLYHQGKL 352
Query: 199 TEARSVLEDVL 209
+EA + + L
Sbjct: 353 SEAIAAYQKAL 363
>gi|225430656|ref|XP_002270163.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC [Vitis vinifera]
gi|296085167|emb|CBI28662.3| unnamed protein product [Vitis vinifera]
Length = 986
Score = 42.4 bits (98), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 53/227 (23%), Positives = 83/227 (36%), Gaps = 36/227 (15%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P+ AIV + +A+ A +MA +Q ++D A+ K C E+ +N
Sbjct: 282 PQEAIVCYQRALQTRPEYAMAYGNMAGTYYEQGQMDMAIVHYKQAIE-CDSGFLEAYNN- 339
Query: 95 LIDLYKKCGRLDEQIELLKQKLRM--------IYHGEAFNGKPTKTARSHGKKFQVTVKQ 146
L + K GR+DE I+ Q L + G + A + K + V
Sbjct: 340 LGNALKDVGRIDEAIQCYHQCLALQPNHPQALTNLGNIYMEWNMVAAAATYYKATLAVTT 399
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
S NL Y Q+GNY A Y + IDP A
Sbjct: 400 GLSAPFSNLAIIYKQQGNYADAISCYNEVLRIDPLAAD---------------------- 437
Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
G ++ K I RV E +Q+ +I P++ ++L A+
Sbjct: 438 ----GLVNRGNTFKEIGRVSEAIQDYIHAITIRPTMAEAHANLASAY 480
>gi|148264105|ref|YP_001230811.1| TPR repeat-containing protein [Geobacter uraniireducens Rf4]
gi|146397605|gb|ABQ26238.1| Tetratricopeptide TPR_2 repeat protein [Geobacter uraniireducens
Rf4]
Length = 573
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 88/212 (41%), Gaps = 20/212 (9%)
Query: 14 KLPPGDSPYVRA-KHVQLVEKD---PEAAIVLFWKAI-NAGDRVDSALKDMAVVLKQQDR 68
K+PP + Y A H+ + K+ PE I L +AI N +R++ L ++A + + D+
Sbjct: 343 KIPPSSAHYFDALGHLGFLYKENGEPEKGIALLKEAITNQPNRIELYL-NLAGLYESMDQ 401
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLK----------QKLRM 118
E + + + S + LY K G DE I +K Q L
Sbjct: 402 FAEGLRVLTDVEGNFPNDPRLSFRMGV--LYDKMGNKDESIARMKKVIALAPNDAQALNY 459
Query: 119 IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEV-VYRKAQL 177
+ + A G A + K V ++ + IL +LGWAY + Y A + R QL
Sbjct: 460 LGYTYAELGVNLDEALQYLNK-AVLLRPDDGFILDSLGWAYYKMKRYDQAVFHLERAVQL 518
Query: 178 IDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
+D DA +L+ Y +A + VL
Sbjct: 519 VDEDATIIGHLADAYFANREYRKALTRYRRVL 550
>gi|153868963|ref|ZP_01998678.1| TPR repeat containing protein [Beggiatoa sp. PS]
gi|152074473|gb|EDN71324.1| TPR repeat containing protein [Beggiatoa sp. PS]
Length = 838
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 51/89 (57%), Gaps = 12/89 (13%)
Query: 26 KHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQ---QDRVDEAVEAIKSFRHL 82
K+ QL EK + +AI+ DR +SA+KD+A +L+Q Q+ EA++ ++ +R
Sbjct: 482 KNFQLAEK-------YYREAISHNDRKESAIKDLASLLQQKRTQEDTKEAIKILEKYRPQ 534
Query: 83 CSKQAQESLDNVLIDLYKKCGRLDEQIEL 111
QA S+DN+L+ Y+ R + IEL
Sbjct: 535 MYDQA--SVDNMLVVYYRLDERYHDVIEL 561
>gi|323452442|gb|EGB08316.1| putative O-linked GlcNAc transferase [Aureococcus anophagefferens]
Length = 1151
Score = 42.0 bits (97), Expect = 0.28, Method: Composition-based stats.
Identities = 43/161 (26%), Positives = 68/161 (42%), Gaps = 37/161 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV----EAIKSFRHLCSKQAQESLDN 93
AI + A+ R +A ++ +VLK++ VD+A+ EAI Q ++ N
Sbjct: 385 AIQCYATAVGLAPRFAAAHSNLGLVLKERGTVDDALAHYREAIA-----VDPQFADAYSN 439
Query: 94 VLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILG 153
+ + YK GRLD+ I +GEA +P + +
Sbjct: 440 -MGNAYKDLGRLDDAIRC---------YGEALKLRP-----------------DFADACS 472
Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLI 193
NL AY G + A YR+A + PD + A NL+H L+
Sbjct: 473 NLAAAYKDGGRHAEAVACYRRALELKPDFSDAFSNLAHSLV 513
>gi|196232460|ref|ZP_03131313.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
gi|196223532|gb|EDY18049.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
Length = 792
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 40/174 (22%), Positives = 77/174 (44%), Gaps = 17/174 (9%)
Query: 44 KAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESL-DNVLIDLYKKC 102
+A+ A ++ VL + +++EA + S+R Q ++ +N L ++ ++
Sbjct: 96 RAVEVAPGFADAHYNLGAVLAELGQMEEA---LASYRRALEIQPTHAVAENNLGNILREL 152
Query: 103 GRLDEQIELLKQKLRM-IYHGEAFN---------GKPTKTARSHGKKFQVTVKQETSRIL 152
RLDE I ++ +++ + +A N GK + ++G+ ++ K + + +
Sbjct: 153 RRLDEAIAAYRRAIQLQPAYADAHNNLGVALSEQGKSDEAIAAYGRALEL--KPDGNAVH 210
Query: 153 GNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEARSVL 205
NLG A G Y A V YR++ P C L L+ R+ EA V
Sbjct: 211 ANLGNALRASGRYAEAVVAYRRSLQSSPARLDICQGLGEALVLLGRFDEAGEVF 264
>gi|118357157|ref|XP_001011828.1| TPR Domain containing protein [Tetrahymena thermophila]
gi|89293595|gb|EAR91583.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
Length = 494
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 77/192 (40%), Gaps = 44/192 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEA------------------------- 72
AI F A+ G + S L ++ ++ Q R++EA
Sbjct: 193 AINCFLNALEIGLELYSPLINLGIIYSQMGRLEEAQSCYLKILKTHPQDWNALLGLAKLF 252
Query: 73 -----VEAIKSFRHLCSKQAQESLDNV--LIDLYKKCGRLDEQIELLKQKLRM----IYH 121
+E K F CS DN ++ Y + G ++E I + L+ ++H
Sbjct: 253 TKRGMIEEAKFFLQKCSLIYNLDQDNCDDIVYCYCQLGMIEEAIIWYENALKFMPDSVFH 312
Query: 122 ----GEAF--NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
G+ NG K+ K ++ Q S IL NLG+AY +G+Y+ A Y+++
Sbjct: 313 LIIIGQLHLRNGNIEKSKIFFEKILKIRPNQ--SYILNNLGFAYYLEGDYSKAISYYQQS 370
Query: 176 QLIDPDANKACN 187
Q I+P+ N
Sbjct: 371 QEINPNVYDTFN 382
>gi|388841092|gb|AFK79142.1| hypothetical protein PCC8801_1300 [uncultured bacterium F39-01]
Length = 645
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 72/165 (43%), Gaps = 11/165 (6%)
Query: 54 SALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLK 113
+A ++ L + R ++++EA+++ L A + +N+ LYK GR E IE
Sbjct: 252 TAYSNLGYALDRLGRSNDSIEALRNAVRLKGDDA-VAYNNLGASLYK-AGRYQEAIEAFG 309
Query: 114 QKLRMIYHG-EAFN--GKPTKTARSHGKKFQ-----VTVKQETSRILGNLGWAYMQKGNY 165
+R+ + EA N G + + Q V VK ++ NLG AY G Y
Sbjct: 310 NAVRLNPNDVEALNNLGAAYYVTAQYDRALQNFQQAVRVKADSPDAQYNLGNAYYMTGKY 369
Query: 166 TSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
A YR+A + D +A NL LI EA + L + +
Sbjct: 370 REATAAYRQAIQLKADYVEARTNLGSLLIALGENQEAITELNESI 414
>gi|294140783|ref|YP_003556761.1| hypothetical protein SVI_2012 [Shewanella violacea DSS12]
gi|293327252|dbj|BAJ01983.1| TPR domain protein [Shewanella violacea DSS12]
Length = 343
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 43/73 (58%), Gaps = 3/73 (4%)
Query: 140 FQVTVKQE--TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQA 196
+++ +++E ++ I+ NLG+++ GNYTSAE R+A DP+ ++ NL +++
Sbjct: 216 YKLALQKEPHSALIITNLGYSHYLTGNYTSAENYLRQAIREDPNFDRGWTNLGLVYVRKG 275
Query: 197 RYTEARSVLEDVL 209
YT A + E +
Sbjct: 276 LYTRALATFEHTM 288
>gi|168056412|ref|XP_001780214.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668364|gb|EDQ54973.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 511
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 33/60 (55%), Gaps = 4/60 (6%)
Query: 178 IDPDANKACNLSHCLIKQARYTEARSVLEDVLLG----KLSGSTETKTINRVKELLQELE 233
I+ D K CN CL+KQ R EA +VL+ V + + + K+ R ++LL+ELE
Sbjct: 73 IESDVKKICNFGVCLMKQGRLEEAEAVLQRVTVACSGIRWPSDSHLKSYERAQDLLKELE 132
>gi|339486353|ref|YP_004700881.1| putative glycosyl transferase [Pseudomonas putida S16]
gi|338837196|gb|AEJ12001.1| putative glycosyl transferase [Pseudomonas putida S16]
Length = 1147
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 49/104 (47%), Gaps = 6/104 (5%)
Query: 148 TSRILGNLGWAYMQKGNYTSA-EVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
++ + LG +YM+ G+++ A + AQL +P A N +HCL K R EA VL
Sbjct: 238 SAEVFNRLGGSYMEDGHFSIALQYFNAAAQLPNPPVWTAFNRAHCLAKLHRLDEAIQVLS 297
Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLE 250
D + + + + R+ E + W+ + L T+ L+
Sbjct: 298 DGIASNPNHRPQQDELERLAE-----QKWRDLHAKLMTRVDMLD 336
>gi|224144666|ref|XP_002325368.1| predicted protein [Populus trichocarpa]
gi|222862243|gb|EEE99749.1| predicted protein [Populus trichocarpa]
Length = 873
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 48/227 (21%), Positives = 86/227 (37%), Gaps = 36/227 (15%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P+ AI+ + +A+ A ++A ++ +V+ A+ K C ++ E+ +N
Sbjct: 169 PQEAIMCYQRAVQTRPNYAMAFGNLASTCYERGQVELAILHYKQ-AIACDQRFLEAYNN- 226
Query: 95 LIDLYKKCGRLDEQIELLKQKLR--------MIYHGEAFNGKPTKTARSHGKKFQVTVKQ 146
L + K GR+DE ++ Q L + G + A + K + V
Sbjct: 227 LGNALKDVGRVDEALQCYNQCLSIQPNHPQALTNLGNIYMEWNMSAAAASCYKATLAVTT 286
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
S NL Y Q+GNY+ A Y + I+P A
Sbjct: 287 GLSAPFNNLAVIYKQQGNYSDAISCYNEVLRIEPMAAD---------------------- 324
Query: 207 DVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
G ++ K I RV E +Q+ +I P++ ++L A+
Sbjct: 325 ----GLVNRGNTYKEIGRVSEAIQDYIHAINIRPTMAEAHANLASAY 367
Score = 37.4 bits (85), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 63/149 (42%), Gaps = 31/149 (20%)
Query: 45 AINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGR 104
A+ G + + ++AV+ KQQ +A+ + A + L N + YK+ GR
Sbjct: 283 AVTTG--LSAPFNNLAVIYKQQGNYSDAISCYNEVLRI-EPMAADGLVN-RGNTYKEIGR 338
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
+ E I+ + A N +PT A +H NL AY G+
Sbjct: 339 VSEAIQ---------DYIHAINIRPT-MAEAHA----------------NLASAYKDSGH 372
Query: 165 YTSAEVVYRKAQLIDPDANKA-CNLSHCL 192
+A YR+A L+ PD +A CNL H L
Sbjct: 373 VEAAIKSYRQALLLRPDFPEATCNLLHTL 401
>gi|254445427|ref|ZP_05058903.1| tetratricopeptide repeat domain protein [Verrucomicrobiae bacterium
DG1235]
gi|198259735|gb|EDY84043.1| tetratricopeptide repeat domain protein [Verrucomicrobiae bacterium
DG1235]
Length = 568
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 97/243 (39%), Gaps = 35/243 (14%)
Query: 12 VHKLPPGDS-PYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVD 70
+ ++ P DS P++ + K A+ + KAI G S + L + +V+
Sbjct: 266 MEEIDPTDSNPHMLRGYAYYQVKRYGQAVPHYRKAIALGGDTASLYYYLTTCLSESGQVE 325
Query: 71 EAVEAIKSFRHLCSKQAQESLDN---------VLIDLYKKCGRLDEQIELLKQKLRMIYH 121
EA + A+E L+ VL D+ K G DE +ELL+Q L +
Sbjct: 326 EA-----------ERMAREGLERAPQTVDLMVVLSDIVLKRGLRDEGVELLQQALALAPA 374
Query: 122 GEAFNGKPTKTARSHGKKFQVTVKQETSRILG--------NLGWAYMQKGNYTSAEVVYR 173
+ N K GKK + E R L LG Y++KG++ +A V
Sbjct: 375 HISANRTLAKLYWEEGKKEAALDQLEMLRTLQPTDFATRMTLGNYYLEKGDFEAAIAVVE 434
Query: 174 KAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLS------GSTETKTINRVKE 227
+A+ IDP+ NL + + EAR + + S S E +N+VK
Sbjct: 435 EARRIDPENTDIRNLLALSLMRQGNEEAREGAFEKAVAVYSRAIEADASLEEAYLNKVKV 494
Query: 228 LLQ 230
L+
Sbjct: 495 LMH 497
>gi|381206067|ref|ZP_09913138.1| hypothetical protein SclubJA_10624 [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 524
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 36/55 (65%), Gaps = 1/55 (1%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDV 208
LG++Y ++ ++ AE +YR++ ++P+ K+ L LIK+ RY +ARS LE V
Sbjct: 413 LGFSYEKQRDFNLAESLYRESLQLNPENPKSGLRLGTVLIKKGRYEDARSFLESV 467
>gi|163781658|ref|ZP_02176658.1| hypothetical protein HG1285_02208 [Hydrogenivirga sp. 128-5-R1-1]
gi|159882878|gb|EDP76382.1| hypothetical protein HG1285_02208 [Hydrogenivirga sp. 128-5-R1-1]
Length = 159
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 36/68 (52%), Gaps = 1/68 (1%)
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEARSV 204
Q T++ LGWAY K +Y A +KA IDPD N ++ LI+Q R+ EA
Sbjct: 38 QPTAKAYTFLGWAYSMKKDYDKAIEYCKKAIEIDPDFGNPYNDIGAYLIEQGRFEEAIPW 97
Query: 205 LEDVLLGK 212
L+ ++ K
Sbjct: 98 LKQAIVAK 105
>gi|326912317|ref|XP_003202500.1| PREDICTED: transmembrane and TPR repeat-containing protein 1-like
[Meleagris gallopavo]
Length = 794
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 77/188 (40%), Gaps = 22/188 (11%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLC--SKQAQESLDN 93
E A++L +I G A +A +L +Q+R+ EA E K+ C S +
Sbjct: 477 EEAVILLRDSIKYGSDFADAYSSLASLLAEQERLKEAEEVYKAGIENCPESSDLHNNYGV 536
Query: 94 VLIDLYKKCGRLDEQIELLKQKLRM--IYHGEAFN--------GKPTKTARSHGKKFQVT 143
L+D G + + +Q + + +H N G+ + + + +V+
Sbjct: 537 FLVD----TGAPERAVSHYRQAIHLSPTHHVAMVNLGRLHRSLGQNKEAEAWYKRALKVS 592
Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN--LSHCLIKQARYTEA 201
K E IL LG Y G Y A VYR+A L+ P +NK L+ L R EA
Sbjct: 593 RKAE---ILSPLGALYYNTGRYEEALQVYREAALLQP-SNKETRLALAQVLAMMGRTKEA 648
Query: 202 RSVLEDVL 209
+ +L
Sbjct: 649 EKMTNHIL 656
>gi|224088480|ref|XP_002308458.1| predicted protein [Populus trichocarpa]
gi|222854434|gb|EEE91981.1| predicted protein [Populus trichocarpa]
Length = 934
Score = 41.6 bits (96), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 37/151 (24%), Positives = 64/151 (42%), Gaps = 30/151 (19%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AIV + A + A ++ V+ K +D +D+AVE ++ + +Q SL+N+
Sbjct: 317 EMAIVFYELAFHFNPHCAEACNNLGVIYKDRDNLDKAVECYQATLSIKPNFSQ-SLNNLG 375
Query: 96 IDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
+ +Y G++D ++++ + + EA+N N
Sbjct: 376 V-VYTVQGKMDAAASMIEKAIMANPTYAEAYN---------------------------N 407
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
LG Y GN T A Y + IDPD+ A
Sbjct: 408 LGVLYRDVGNITMAISAYEQCLEIDPDSRNA 438
>gi|152997603|ref|YP_001342438.1| TPR repeat-containing protein [Marinomonas sp. MWYL1]
gi|150838527|gb|ABR72503.1| Tetratricopeptide TPR_2 repeat protein [Marinomonas sp. MWYL1]
Length = 603
Score = 41.6 bits (96), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 30/59 (50%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
I+ +LGW + KG + + +KA I PDA A + L KQ Y EA +V E L
Sbjct: 515 IIDSLGWGFFLKGEFEQSSYYLKKAWSILPDAEIAAHYGESLWKQRHYKEAIAVWEAAL 573
>gi|242081821|ref|XP_002445679.1| hypothetical protein SORBIDRAFT_07g024110 [Sorghum bicolor]
gi|241942029|gb|EES15174.1| hypothetical protein SORBIDRAFT_07g024110 [Sorghum bicolor]
Length = 910
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 64/150 (42%), Gaps = 28/150 (18%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AIV + A++ R A ++ V+ K +D +D+AVE + + AQ SL+N+
Sbjct: 303 EMAIVFYELALHFNPRCAEACNNLGVIYKDRDNLDKAVECYQMALSIKPLFAQ-SLNNLG 361
Query: 96 IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNL 155
+ +Y G++D ++++ A + PT + NL
Sbjct: 362 V-VYTVQGKMDSAASMIEK---------AIHANPT-----------------YAEAYNNL 394
Query: 156 GWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
G Y G+ T A Y + IDPD+ A
Sbjct: 395 GVLYRDAGSITLAIHAYERCLQIDPDSRNA 424
>gi|91204003|emb|CAJ71656.1| hypothetical protein kustc0911 [Candidatus Kuenenia
stuttgartiensis]
Length = 470
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 34/60 (56%), Gaps = 1/60 (1%)
Query: 153 GNLGWAYMQKGNYTSAEVVYRKAQLIDPDA-NKACNLSHCLIKQARYTEARSVLEDVLLG 211
G LG+AY N+ SAE +R A L+DP + L+ L KQ RY EA S+ + ++ G
Sbjct: 180 GLLGFAYSSVENHLSAESSFRMAVLLDPATIDWKMGLARSLFKQERYAEAVSLCDSLISG 239
>gi|406835251|ref|ZP_11094845.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 751
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 30/54 (55%), Gaps = 1/54 (1%)
Query: 153 GNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVL 205
G LG Y ++ N+T AE YR+A +DP N A L L+ Q R+ EAR L
Sbjct: 481 GTLGICYARQENFTRAEDCYRRALTLDPHQNFARIALGRTLLAQDRFDEAREEL 534
>gi|414870070|tpg|DAA48627.1| TPA: hypothetical protein ZEAMMB73_519907, partial [Zea mays]
gi|414870071|tpg|DAA48628.1| TPA: hypothetical protein ZEAMMB73_519907, partial [Zea mays]
gi|414870072|tpg|DAA48629.1| TPA: hypothetical protein ZEAMMB73_519907, partial [Zea mays]
Length = 786
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 64/150 (42%), Gaps = 28/150 (18%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AIV + A++ R A ++ V+ K +D +D+AVE + + AQ SL+N+
Sbjct: 303 EMAIVFYELALHFNPRCAEACNNLGVIYKDRDNLDKAVECYQMALSIKPLFAQ-SLNNLG 361
Query: 96 IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNL 155
+ +Y G++D ++++ A + PT + NL
Sbjct: 362 V-VYTVQGKMDSAASMIEK---------AIHANPTY-----------------AEAYNNL 394
Query: 156 GWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
G Y G+ T A Y + IDPD+ A
Sbjct: 395 GVLYRDAGSITLAIHAYERCLQIDPDSRNA 424
>gi|354603236|ref|ZP_09021235.1| hypothetical protein HMPREF9450_00150 [Alistipes indistinctus YIT
12060]
gi|353349113|gb|EHB93379.1| hypothetical protein HMPREF9450_00150 [Alistipes indistinctus YIT
12060]
Length = 268
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 33/60 (55%)
Query: 156 GWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSG 215
G A ++ NY AE+ YRKAQ P NL+ L KQ RY E ++L+ + +++G
Sbjct: 38 GNALYEQENYPEAEIRYRKAQEQSPTYEGLFNLADALYKQKRYDEVLNILKQISRDEVAG 97
>gi|118385554|ref|XP_001025906.1| TPR Domain containing protein [Tetrahymena thermophila]
gi|89307673|gb|EAS05661.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
Length = 1032
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 85/185 (45%), Gaps = 23/185 (12%)
Query: 16 PPGDSPY----VRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDE 71
P DS Y + K L++K AI + K + + DS ++ K + VDE
Sbjct: 845 PENDSCYYNLGIACKSKGLLDK----AIQSYQKCLEIHPKNDSCYYNLGKAYKSKGLVDE 900
Query: 72 AVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM------IYH--GE 123
A+++ +S + K ++ N L + YK G LDE I+ + L++ Y+ G+
Sbjct: 901 AIKSYQSSIEINPKV--DAYYNSLGNAYKVKGLLDEAIKSYQNCLKINPNYNSCYYKLGQ 958
Query: 124 AFNGKP--TKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
A+ K + +S+ K ++ K ++ NLG AY KG A Y+K ++P+
Sbjct: 959 AYKSKGLLDEAIKSYQKYLEINPKNDSCYY--NLGLAYKSKGLLDEAIKSYQKCLSLNPN 1016
Query: 182 ANKAC 186
NK C
Sbjct: 1017 -NKNC 1020
>gi|333899123|ref|YP_004472996.1| hypothetical protein Psefu_0924 [Pseudomonas fulva 12-X]
gi|333114388|gb|AEF20902.1| Tetratricopeptide TPR_2 repeat-containing protein [Pseudomonas
fulva 12-X]
Length = 597
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 31/63 (49%)
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
E IL +LGWA + GN T AE R+A PD A +L L Q + EAR +
Sbjct: 513 EDPAILDSLGWANYRLGNLTDAEKWLRQAFERFPDHEVAAHLGEALWAQGKQREARKIWR 572
Query: 207 DVL 209
D L
Sbjct: 573 DAL 575
>gi|222423063|dbj|BAH19513.1| AT3G04240 [Arabidopsis thaliana]
Length = 750
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 57/251 (22%), Positives = 93/251 (37%), Gaps = 46/251 (18%)
Query: 16 PPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA 75
P Y+ +V P AI+ + A+ A ++A + +Q ++D A
Sbjct: 254 PAFPDAYLNLGNVYKALGRPTEAIMCYQHALQMRPNSAMAFGNIASIYYEQGQLDLA--- 310
Query: 76 IKSFRHLCSK-----QAQESLDNVLIDLYKKCGRLDEQIELLKQKLR--------MIYHG 122
I+ ++ S+ +A +L N L D+ GR+DE + Q L M G
Sbjct: 311 IRHYKQALSRDPRFLEAYNNLGNALKDI----GRVDEAVRCYNQCLALQPNHPQAMANLG 366
Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
+ S K + V S NL Y Q+GNY+ A Y + IDP A
Sbjct: 367 NIYMEWNMMGPASSLFKATLAVTTGLSAPFNNLAIIYKQQGNYSDAISCYNEVLRIDPLA 426
Query: 183 NKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSL 242
D L+ + G+T K I RV E +Q+ + P++
Sbjct: 427 -----------------------ADALVNR--GNT-YKEIGRVTEAIQDYMHAINFRPTM 460
Query: 243 TTKKSSLEDAF 253
++L A+
Sbjct: 461 AEAHANLASAY 471
>gi|218439610|ref|YP_002377939.1| hypothetical protein PCC7424_2659 [Cyanothece sp. PCC 7424]
gi|218172338|gb|ACK71071.1| Tetratricopeptide TPR_2 repeat protein [Cyanothece sp. PCC 7424]
Length = 565
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 47/90 (52%), Gaps = 5/90 (5%)
Query: 126 NGKPTKTARSHGKKFQV----TVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
N + T+ +++ + F + TV+ + + L LG ++KGN + A + Y+ A IDP
Sbjct: 85 NQQFTEALKAYNQAFLINPDLTVQSDLASDLTRLGAVLLKKGNVSEAIIAYQIAVAIDPL 144
Query: 182 ANKAC-NLSHCLIKQARYTEARSVLEDVLL 210
A +A NL L +Q R+ EA V+L
Sbjct: 145 AIEANHNLGKALAQQQRWQEAGQAFNQVIL 174
>gi|408421469|ref|YP_006762883.1| hypothetical protein TOL2_C40240 [Desulfobacula toluolica Tol2]
gi|405108682|emb|CCK82179.1| tetratricopeptide repeat protein [Desulfobacula toluolica Tol2]
Length = 1572
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 31/61 (50%), Gaps = 1/61 (1%)
Query: 145 KQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEARS 203
K + RI NLG G YT AE YRKA P +K C NL L +Q ++ E R+
Sbjct: 578 KDQLYRIYNNLGSLMNSSGEYTDAETWYRKAMKTCPTKSKPCTNLIKVLHRQNKHNEIRT 637
Query: 204 V 204
V
Sbjct: 638 V 638
>gi|159901659|ref|YP_001547905.1| hypothetical protein Haur_5149 [Herpetosiphon aurantiacus DSM 785]
gi|159894698|gb|ABX07777.1| Tetratricopeptide TPR_2 repeat protein [Herpetosiphon aurantiacus
DSM 785]
Length = 1105
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 81/187 (43%), Gaps = 43/187 (22%)
Query: 44 KAINAGDRVDSAL--KDMAVVLKQQDRVDEA----------VEAIKSFRHLCSKQAQESL 91
+A+ D D+A+ ++A VL+ Q R EA EA+ H + ++ +L
Sbjct: 694 EAVLGLDHPDTAVSVNNLASVLESQGRYSEARGLYERALEVTEAVLGREHPDTARSVNNL 753
Query: 92 DNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI 151
+VL + GR E L +Q L + EA G R H +T+R
Sbjct: 754 ASVL----ARQGRYSEAQPLYEQALAV---NEAVLG------REH---------PDTARS 791
Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLI--------DPD-ANKACNLSHCLIKQARYTEAR 202
+ NL +G Y+ A+ +Y +A + PD A NL+ L+ Q RY+EA+
Sbjct: 792 VNNLASVLESQGRYSEAQPLYEQALAVREAVLGENHPDTAMSMNNLAMVLLNQGRYSEAQ 851
Query: 203 SVLEDVL 209
+LE L
Sbjct: 852 GLLERTL 858
>gi|327399475|ref|YP_004340344.1| hypothetical protein Hipma_1328 [Hippea maritima DSM 10411]
gi|327182104|gb|AEA34285.1| Tetratricopeptide TPR_1 repeat-containing protein [Hippea maritima
DSM 10411]
Length = 260
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 80/193 (41%), Gaps = 23/193 (11%)
Query: 30 LVEK-DPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQ 88
LVE+ + E AIV F A+ + A ++ Q D D+A C+K +
Sbjct: 15 LVEQGECEKAIVYFLNAVKENPSFEDAYIEIGYCYAQLDNFDDA-------EDYCNKAIE 67
Query: 89 ESLDNV-----LIDLYKKCGRLDEQIELLKQKL-RM------IYH--GEAFNGKPTKTAR 134
+ +N+ L +Y K G +++IE L + + R+ IY G A+
Sbjct: 68 INPNNLEAYNTLAMIYHKFGFFEDEIEALNEIIIRLDEPDASIYLNIGNAYYELGENDRA 127
Query: 135 SHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLI 193
+ ++ + + N+G AYM K Y A Y++A IDP+ ++ NL
Sbjct: 128 IEFYDMAIGMEPDFAEAYANMGNAYMAKDEYIKATEAYKQALQIDPNMSDVYLNLGIVYG 187
Query: 194 KQARYTEARSVLE 206
+ Y EA E
Sbjct: 188 ELGSYDEAVKYFE 200
>gi|82617834|gb|ABB84829.1| VIT-vWFA-RpoN multidomain protein [uncultured delta proteobacterium
DeepAnt-1F12]
Length = 1156
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 28/49 (57%)
Query: 161 QKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
Q ++ E + + QL DP A +L CL+ QARY E +++EDV+
Sbjct: 851 QGNDFEYVEEILERVQLFDPPGVGARDLRECLLIQARYIELGAIVEDVI 899
>gi|392409193|ref|YP_006445800.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Desulfomonile tiedjei DSM 6799]
gi|390622329|gb|AFM23536.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Desulfomonile tiedjei DSM 6799]
Length = 583
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 1/72 (1%)
Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQAR 197
K +T + + + +L NLG A Q G A +V +KA IDP+ + A NL H K+ R
Sbjct: 63 KHALTYEPKNASVLSNLGNALNQIGLVDEAVMVLKKAIEIDPNHDLAHDNLGHIYSKRER 122
Query: 198 YTEARSVLEDVL 209
Y +A+ +LE L
Sbjct: 123 YDQAKELLEMAL 134
>gi|302559445|ref|ZP_07311787.1| LOW QUALITY PROTEIN: regulatory protein AfsR [Streptomyces
griseoflavus Tu4000]
gi|302477063|gb|EFL40156.1| LOW QUALITY PROTEIN: regulatory protein AfsR [Streptomyces
griseoflavus Tu4000]
Length = 649
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 58/137 (42%), Gaps = 26/137 (18%)
Query: 48 AGDRVDSA--LKDMAVVLKQQDRVDEAVE----AIKSFRHLCSKQAQESLDNVLIDLYKK 101
AGDR A L D A L+ R DEA+E A++ L +A+ S+ + L D Y
Sbjct: 416 AGDREGEAQSLSDTAAALRSSGRPDEAIELLHEALELNEELGDGEARSSVISNLGDAYLH 475
Query: 102 CGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQ 161
GRLD+ +E ++ L + R G + + L NLG AY +
Sbjct: 476 AGRLDKSVEYTRRGLAL--------------DRVAGSVWGEGIA------LSNLGDAYQR 515
Query: 162 KGNYTSAEVVYRKAQLI 178
G Y A +A ++
Sbjct: 516 MGRYDEALGCLERALVV 532
>gi|113474770|ref|YP_720831.1| hypothetical protein Tery_0977 [Trichodesmium erythraeum IMS101]
gi|110165818|gb|ABG50358.1| TPR repeat [Trichodesmium erythraeum IMS101]
Length = 1215
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 73/166 (43%), Gaps = 35/166 (21%)
Query: 52 VDSALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
V ++L ++A + ++Q R EA ++A+ + L + SL+N L +LY+ GR
Sbjct: 126 VATSLNNLAFLYQRQGRYTEAEPLYIQALDMIKKLLGAEHPLVATSLNN-LAELYRVQGR 184
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
E L +Q L+M KK + + L +L Y +G
Sbjct: 185 YTEAEPLYQQALKM------------------RKKLLGAKHPDVATSLNSLALLYKDQGR 226
Query: 165 YTSAEVVYRKA-----QLID---PD-ANKACNLSHCLIKQARYTEA 201
YT AE +Y +A +L+ PD A+ NL+ Q RYTEA
Sbjct: 227 YTEAEPLYIQALEMRKKLLGAEHPDVASSLNNLAELYRSQGRYTEA 272
>gi|118362609|ref|XP_001014531.1| TPR Domain containing protein [Tetrahymena thermophila]
gi|89296298|gb|EAR94286.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
Length = 1163
Score = 41.2 bits (95), Expect = 0.58, Method: Composition-based stats.
Identities = 41/177 (23%), Positives = 79/177 (44%), Gaps = 19/177 (10%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS-----------FRHLCSKQ 86
AI+++ K++N ++D ++ V + + R+DEA++ + F +L +
Sbjct: 896 AILIYEKSLNLNIKIDVCCLNLGVCYEIKGRIDEAIKKYQQSIEINPANDVCFLNLGNAY 955
Query: 87 AQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQ 146
+ + + I Y+KC +L+ + E L +Y G+ K + + K + K+
Sbjct: 956 LNKGMFDEAIQAYQKCLQLNPKKEACYLNLGNVYQ---IKGELDKAIKCYQKCIILNPKK 1012
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC--NLSHCLIKQARYTEA 201
+ + NLG AY KGN + Y+K ++P N C NL + + EA
Sbjct: 1013 DICYL--NLGNAYQNKGNLEESIKNYQKCLNLNPK-NDTCLENLGNAFKNKGMIEEA 1066
>gi|383772223|ref|YP_005451289.1| hypothetical protein S23_39820 [Bradyrhizobium sp. S23321]
gi|381360347|dbj|BAL77177.1| hypothetical protein S23_39820 [Bradyrhizobium sp. S23321]
Length = 695
Score = 41.2 bits (95), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 69/166 (41%), Gaps = 15/166 (9%)
Query: 58 DMAVVLKQQDRVDEAVEAIKSF--RHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQK 115
++AVVL +Q R+ EA EA ++ R + +L N+L+D RLDE + +
Sbjct: 148 NLAVVLAEQGRLTEAEEAYRAVIAREPAYRGVWLNLGNLLMDQT----RLDEAVTAFRCA 203
Query: 116 LRMIYHGEAFNGKPTKTARSHG--------KKFQVTVKQETSRILGNLGWAYMQKGNYTS 167
+ G G + V ++ + + L LG + G+
Sbjct: 204 IEADPDDPGLPGNLGAALYRQGLLDDAIVQYRRAVALQPDNAPALRLLGLVLHEAGHLPE 263
Query: 168 AEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVLEDVLLGK 212
A +YR++ +DP D A NL CL + EA + E LL K
Sbjct: 264 AAEMYRQSFALDPADHLIASNLGACLSELGTLAEAIAACEQALLLK 309
>gi|418066009|ref|ZP_12703377.1| Tetratricopeptide TPR_2 repeat protein [Geobacter metallireducens
RCH3]
gi|373561242|gb|EHP87481.1| Tetratricopeptide TPR_2 repeat protein [Geobacter metallireducens
RCH3]
Length = 643
Score = 41.2 bits (95), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 83/192 (43%), Gaps = 31/192 (16%)
Query: 18 GDSPYVRAKHVQ--LVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA 75
G++P + K + L K+P+ A+ + A+ A +++A + ++ ++ +EA +
Sbjct: 400 GNNPVLHLKLARALLGAKNPKEAMASYQDALKIAPDNLEARRELAALYRKGNQNEEAEKQ 459
Query: 76 IKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARS 135
K + A+ N+L LY K + DE I LLK + + P A S
Sbjct: 460 YKEILRIKKDDAEAR--NILTALYVKSKKYDELITLLKDGVELA---------PNDPA-S 507
Query: 136 HGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIK 194
H K LG Y K +Y +AE Y+KA + D K+ N L +K
Sbjct: 508 HYK----------------LGLIYEFKKDYDAAEQEYKKAVELKGDHAKSLNALGRIYLK 551
Query: 195 QARYTEARSVLE 206
+ +EA+ LE
Sbjct: 552 TGKISEAKGALE 563
>gi|327401541|ref|YP_004342380.1| hypothetical protein Arcve_1665 [Archaeoglobus veneficus SNP6]
gi|327317049|gb|AEA47665.1| Tetratricopeptide TPR_1 repeat-containing protein [Archaeoglobus
veneficus SNP6]
Length = 222
Score = 41.2 bits (95), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 43/211 (20%), Positives = 80/211 (37%), Gaps = 20/211 (9%)
Query: 33 KDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLD 92
++PE + F K + L ++ + L + R+DEA+ I L A +
Sbjct: 16 RNPEEEVEFFSKCLEMEPENIYVLNNLGIALYELGRIDEAISYIDRALELNPDYADAWYN 75
Query: 93 NVLIDLYKKCGRLDEQIELLKQ-------------KLRMIYHGEAFNGKPTKTARSHGKK 139
++ G+ DE I ++ + + Y+ GK + R
Sbjct: 76 RGIV--LSDAGKYDEAIACFEKAIALNPDDAAAWNNMGLAYYESGNMGKAIECYRKC--- 130
Query: 140 FQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYT 199
V++ +E + N+G AY + G + AE ++KA +D + NL K +Y
Sbjct: 131 --VSIDEEHAAAWYNMGLAYYESGRFNKAEESFKKALELDESVDTLNNLGIVYGKLRQYE 188
Query: 200 EARSVLEDVLLGKLSGSTETKTINRVKELLQ 230
A +L S T + +K L++
Sbjct: 189 MAMECFNRILEIDPENSAATYNLLILKRLME 219
>gi|403363486|gb|EJY81487.1| TPR repeat-containing protein [Oxytricha trifallax]
Length = 746
Score = 41.2 bits (95), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 68/163 (41%), Gaps = 38/163 (23%)
Query: 56 LKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDL---YKKCGRLDEQIELL 112
+M VV +Q D+A E + LC++ ++ + +LI+L ++K +D IEL
Sbjct: 601 FNEMGVVYYRQKNYDQARECFANALSLCNESNSKTYETILINLAHCHRKLKDMDSAIELY 660
Query: 113 KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVY 172
++ L + N K T S LG+AY KG Y A Y
Sbjct: 661 EKCLTL-------NPKSPSTYTS-------------------LGYAYHLKGEYRQALTCY 694
Query: 173 RKAQLI---DPDANKAC-----NLSHCLIKQARYTEARSVLED 207
KA + DP + +++ C I+QA Y A + L D
Sbjct: 695 HKASFLKNEDPLTEELVQRALQDINECTIEQA-YLGAGNELGD 736
>gi|406912813|gb|EKD52347.1| hypothetical protein ACD_62C00048G0002 [uncultured bacterium]
Length = 390
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 79/206 (38%), Gaps = 28/206 (13%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AI KA ++ A ++ +V + D VDEA++ + L E L
Sbjct: 91 EKAIFHLKKAGQLSPQLSDAHVNLGLVYLKMDNVDEAIKTY--YDALMIDPVNEKAYFGL 148
Query: 96 IDLYKKCGRLDEQIELLKQKLRMI-YHGEAFNGKPTKTARSHGKKFQVTVKQETSRI--- 151
Y++ GR +E I+ + + + +A+N A SH ++ + I
Sbjct: 149 GVAYQRMGRYEEAIDAYSEAIMISPAFSDAYNN--LGIALSHVGLYEEAITSYLIAIKIR 206
Query: 152 -------LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARS 203
NLG Y G A Y A L+DP+ KA CNL H ++ Y EA
Sbjct: 207 PDLSEAAYNNLGITYQSVGQDDKAVEAYTNAILLDPEYTKAYCNLGHVRQQKGNYDEAMW 266
Query: 204 VLEDVL------------LGKLSGST 217
++ L LG GST
Sbjct: 267 AYQEALKIDPAFADAHYGLGNAYGST 292
>gi|357162940|ref|XP_003579571.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC-like [Brachypodium
distachyon]
Length = 983
Score = 40.8 bits (94), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 49/226 (21%), Positives = 88/226 (38%), Gaps = 36/226 (15%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AI + +A+ A A ++A + +Q ++D ++ S LC + E+ +N +
Sbjct: 280 EEAIACYQRALQARPDYAMAYGNLATIYYEQRQLDMSIHCY-SQAILCDPRFVEAHNN-M 337
Query: 96 IDLYKKCGRLDEQIELLKQKL--------RMIYHGEAFNGKPTKTARSHGKKFQVTVKQE 147
+ K GR++E I + L + G + +A + K + V
Sbjct: 338 GNALKDAGRVEEAINCFQSCLILQANHPQALTNLGNIYMEWNMASAAASFYKAAIAVTSG 397
Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLED 207
+ NL Y Q+G+Y A Y + IDP A D
Sbjct: 398 LTSPFNNLAVIYKQQGSYADAIACYTEVLRIDPTA-----------------------AD 434
Query: 208 VLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
L+ + G+T K RV E +Q+ +I P++ ++L A+
Sbjct: 435 ALVNR--GNT-FKEFGRVAEAIQDYMQAATIMPTMAEAHANLASAY 477
>gi|239787366|emb|CAX83842.1| TPR repeat protein [uncultured bacterium]
Length = 1036
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 47/187 (25%), Positives = 85/187 (45%), Gaps = 26/187 (13%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
EA++ L +KA+ R +++ L++ R+DEAV A K + AQ+S N+L
Sbjct: 66 EASLRLLYKALEIDPRQPGMWQNLGETLRRLKRLDEAVGAFK-------RAAQDS--NIL 116
Query: 96 ID-------LYKKCGRLDEQIE----LLKQKLRMIYHGEAFNGKPTKTARSHGKKFQV-- 142
+ K+ G+LDE + LL ++ + AF+ + + +
Sbjct: 117 LAAVSGLALALKEAGQLDEALSAMETLLAKEPGNVAAWAAFSDVLFDAGQDDAAQLALER 176
Query: 143 TVKQETSRILGNLGWAY--MQKGNYTSAEVVYRKA--QLIDPDANKACNLSHCLIKQARY 198
++ + + +L A+ Q+G Y +AE +YR Q D A+ L+ LI Q R
Sbjct: 177 AIRLKPTSAGHHLRMAFHQSQQGQYQTAEAIYRALLRQGGDSLASVHTGLAQALISQGRL 236
Query: 199 TEARSVL 205
EA+ ++
Sbjct: 237 EEAQPLI 243
>gi|15229778|ref|NP_187761.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY [Arabidopsis
thaliana]
gi|75332921|sp|Q96301.1|SPY_ARATH RecName: Full=Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY
gi|12322895|gb|AAG51433.1|AC008153_6 spindly (gibberellin signal transduction protein); 75377-80082
[Arabidopsis thaliana]
gi|1589778|gb|AAC49446.1| SPINDLY [Arabidopsis thaliana]
gi|62319977|dbj|BAD94086.1| spindly [Arabidopsis thaliana]
gi|332641539|gb|AEE75060.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY [Arabidopsis
thaliana]
Length = 914
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 62/151 (41%), Gaps = 30/151 (19%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
+ AIV + A + A ++ V+ K +D +D+AVE + + AQ SL+N+
Sbjct: 312 DMAIVFYELAFHFNPHCAEACNNLGVLYKDRDNLDKAVECYQMALSIKPNFAQ-SLNNLG 370
Query: 96 IDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
+ +Y G++D ++ K L + EAFN N
Sbjct: 371 V-VYTVQGKMDAAASMIEKAILANPTYAEAFN---------------------------N 402
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
LG Y GN T A Y + IDPD+ A
Sbjct: 403 LGVLYRDAGNITMAIDAYEECLKIDPDSRNA 433
>gi|83311235|ref|YP_421499.1| TPR repeat-containing protein [Magnetospirillum magneticum AMB-1]
gi|82946076|dbj|BAE50940.1| TPR repeat [Magnetospirillum magneticum AMB-1]
Length = 548
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 35/153 (22%), Positives = 69/153 (45%), Gaps = 11/153 (7%)
Query: 58 DMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLR 117
++ + L+ R+DEA ++ R + + + NVL D+ + GRLDE + L++ ++
Sbjct: 114 NLGLALRGLGRLDEAAAVLERCRDM--DPSVPAPWNVLGDIRRAQGRLDEAVTCLERAIQ 171
Query: 118 M-IYHGEAFNGKPTKTARSHGKKFQVTV-------KQETSRILGNLGWAYMQKGNYTSAE 169
+ H A+N + ++V + + + +LG + + A
Sbjct: 172 LDPGHSMAYNNLGVALQAQDRTEEAISVLERGLALQPDDPELHYSLGCSLQEATRTADAT 231
Query: 170 VVYRKAQLIDPDANKA-CNLSHCLIKQARYTEA 201
YR+A + PD A NLSH L++ ++E
Sbjct: 232 ASYRRAIALRPDYGAAHWNLSHSLLRGGDFSEG 264
>gi|297829706|ref|XP_002882735.1| hypothetical protein ARALYDRAFT_478494 [Arabidopsis lyrata subsp.
lyrata]
gi|297328575|gb|EFH58994.1| hypothetical protein ARALYDRAFT_478494 [Arabidopsis lyrata subsp.
lyrata]
Length = 897
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 62/151 (41%), Gaps = 30/151 (19%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
+ AIV + A + A ++ V+ K +D +D+AVE + + AQ SL+N+
Sbjct: 295 DMAIVFYELAFHFNPHCAEACNNLGVLYKDRDNLDKAVECYQMALSIKPNFAQ-SLNNLG 353
Query: 96 IDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
+ +Y G++D ++ K L + EAFN N
Sbjct: 354 V-VYTVQGKMDAAASMIEKAILANPTYAEAFN---------------------------N 385
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
LG Y GN T A Y + IDPD+ A
Sbjct: 386 LGVLYRDAGNITMAIDAYEECLKIDPDSRNA 416
>gi|401408423|ref|XP_003883660.1| hypothetical protein NCLIV_034150 [Neospora caninum Liverpool]
gi|325118077|emb|CBZ53628.1| hypothetical protein NCLIV_034150 [Neospora caninum Liverpool]
Length = 987
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 67/169 (39%), Gaps = 17/169 (10%)
Query: 34 DPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDN 93
DP A+ ++ +A+ R A +M V K ++++A+ A C+ Q SL N
Sbjct: 307 DPHTALQMYREAVRINPRYVEAYNNMGAVCKNLGKLEDAI-AFYEKALACNPNYQLSLSN 365
Query: 94 VLIDLYKKCGRLDEQIELLKQKLRMI--------YHGEAFNGKPTKTARSH-------GK 138
+ + L G + E K+ + + Y+ +A+ A SH
Sbjct: 366 MAVAL-TDLGTQQKTFEGAKKAISLYKKALIYNPYYSDAYYNLGVAYADSHKFDKALVNY 424
Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
+ V + N+G + + N A V Y KA I+PD ++ N
Sbjct: 425 QLAVAFNPRCAEAYNNMGVIHKDRENTDQAIVCYNKALEINPDFSQTLN 473
>gi|254468929|ref|ZP_05082335.1| hypothetical protein KB13_1154 [beta proteobacterium KB13]
gi|207087739|gb|EDZ65022.1| hypothetical protein KB13_1154 [beta proteobacterium KB13]
Length = 564
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 40/84 (47%)
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEA 201
++++ + IL ++GW + + GNY A KA I D A +L L KQ + EA
Sbjct: 479 LSIEPQNHYILDSMGWVHFRLGNYDIAYEFITKAYAIQEDPEIAAHLGEVLWKQGKENEA 538
Query: 202 RSVLEDVLLGKLSGSTETKTINRV 225
SV + L S +T NR+
Sbjct: 539 MSVWQGSLEKFPENSVLIETKNRL 562
>gi|325954652|ref|YP_004238312.1| hypothetical protein [Weeksella virosa DSM 16922]
gi|323437270|gb|ADX67734.1| Tetratricopeptide TPR_2 repeat-containing protein [Weeksella virosa
DSM 16922]
Length = 286
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 32/52 (61%), Gaps = 4/52 (7%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEA 201
ILGN + +KGNY AE Y+KA +P++ KA NL + L +Q RY E+
Sbjct: 31 ILGNEAF---EKGNYEQAEFRYKKAATEEPNSVKANYNLGNTLFEQKRYKES 79
>gi|15229253|ref|NP_187074.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC [Arabidopsis
thaliana]
gi|75336082|sp|Q9M8Y0.1|SEC_ARATH RecName: Full=Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC; AltName:
Full=Protein SECRET AGENT
gi|6721161|gb|AAF26789.1|AC016829_13 putative O-linked GlcNAc transferase [Arabidopsis thaliana]
gi|18139887|gb|AAL60196.1|AF441079_1 O-linked N-acetyl glucosamine transferase [Arabidopsis thaliana]
gi|20259324|gb|AAM13988.1| putative O-linked GlcNAc transferase [Arabidopsis thaliana]
gi|21436429|gb|AAM51415.1| putative O-linked GlcNAc transferase [Arabidopsis thaliana]
gi|110742062|dbj|BAE98963.1| O-linked GlcNAc transferase like protein [Arabidopsis thaliana]
gi|332640535|gb|AEE74056.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC [Arabidopsis
thaliana]
Length = 977
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 57/251 (22%), Positives = 93/251 (37%), Gaps = 46/251 (18%)
Query: 16 PPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA 75
P Y+ +V P AI+ + A+ A ++A + +Q ++D A
Sbjct: 254 PAFPDAYLNLGNVYKALGRPTEAIMCYQHALQMRPNSAMAFGNIASIYYEQGQLDLA--- 310
Query: 76 IKSFRHLCSK-----QAQESLDNVLIDLYKKCGRLDEQIELLKQKLR--------MIYHG 122
I+ ++ S+ +A +L N L D+ GR+DE + Q L M G
Sbjct: 311 IRHYKQALSRDPRFLEAYNNLGNALKDI----GRVDEAVRCYNQCLALQPNHPQAMANLG 366
Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
+ S K + V S NL Y Q+GNY+ A Y + IDP A
Sbjct: 367 NIYMEWNMMGPASSLFKATLAVTTGLSAPFNNLAIIYKQQGNYSDAISCYNEVLRIDPLA 426
Query: 183 NKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSL 242
D L+ + G+T K I RV E +Q+ + P++
Sbjct: 427 -----------------------ADALVNR--GNT-YKEIGRVTEAIQDYMHAINFRPTM 460
Query: 243 TTKKSSLEDAF 253
++L A+
Sbjct: 461 AEAHANLASAY 471
>gi|225620550|ref|YP_002721807.1| TPR domain-containing protein [Brachyspira hyodysenteriae WA1]
gi|225215369|gb|ACN84103.1| TPR domain-containing protein [Brachyspira hyodysenteriae WA1]
Length = 767
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 69/173 (39%), Gaps = 38/173 (21%)
Query: 32 EKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESL 91
+KD AI F KAI + D A ++A+ + + +EA+E F H +SL
Sbjct: 121 KKDYTNAIENFNKAIEINPKYDKAFNNLALFYYKNKKYNEAIEF---FEH------SKSL 171
Query: 92 DNVLIDLYKKCG-------RLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHG------- 137
D + Y G D+ IE + L+ +NGK KTA + G
Sbjct: 172 DERVFKAYDMLGMSYYNINNYDKAIECFTKFLQ-------YNGKSCKTANTLGAVYSFLK 224
Query: 138 ------KKFQVT--VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
K F + + + + NL Y + + A + + KA+ +D +A
Sbjct: 225 DYDNAIKYFNIAIDINPKYANAYNNLALVYFNRKMFDKAALYFDKAKKLDVNA 277
>gi|340376830|ref|XP_003386934.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
[Amphimedon queenslandica]
Length = 1029
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 68/153 (44%), Gaps = 10/153 (6%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AI + +AI A ++A LK+Q +V EA E + S A +SL+N L
Sbjct: 254 ELAIDTYKRAIELQPHFPDAYCNLANALKEQGKVAEAEECYNIALKMNSSHA-DSLNN-L 311
Query: 96 IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVK-QETSRILGN 154
++ ++ G +DE I+L K+ L ++ A + + GK + +E RI N
Sbjct: 312 ANIKREQGHIDEAIKLYKRALEIMPEFAAAHSNLASILQMQGKLQDALLHYKEAIRIHPN 371
Query: 155 LGWAYMQKGN-------YTSAEVVYRKAQLIDP 180
AY GN + A Y +A I+P
Sbjct: 372 FADAYSNMGNTLKEMQDFQGALQCYSRAIQINP 404
>gi|39998281|ref|NP_954232.1| hypothetical protein GSU3191 [Geobacter sulfurreducens PCA]
gi|409913632|ref|YP_006892097.1| hypothetical protein KN400_3128 [Geobacter sulfurreducens KN400]
gi|39985227|gb|AAR36582.1| TPR domain protein [Geobacter sulfurreducens PCA]
gi|298507217|gb|ADI85940.1| TPR domain protein [Geobacter sulfurreducens KN400]
Length = 638
Score = 40.8 bits (94), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 69/154 (44%), Gaps = 31/154 (20%)
Query: 55 ALKDMAVVLKQQDRVDEAVEAIKSFRHLCS-KQAQESLDNVLIDLYKKCGRLDEQIELLK 113
A +++A V ++ +++D+A K +R + K+ N+L +Y K + DE + LL+
Sbjct: 437 AHRELAAVYRKLNQMDDAS---KQYREVLRIKKDDAEARNILTAIYVKEKKYDELVPLLQ 493
Query: 114 QKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYR 173
+ + + A SH K LG + + +Y SAEV YR
Sbjct: 494 EGVEL----------APNDAMSHYK----------------LGLIHEFRKDYDSAEVSYR 527
Query: 174 KAQLIDPDANKACN-LSHCLIKQARYTEARSVLE 206
KA + D KA N L +K + TEA+ LE
Sbjct: 528 KATELKDDHAKALNALGRIYLKTGKLTEAKEALE 561
>gi|302878311|ref|YP_003846875.1| hypothetical protein Galf_1081 [Gallionella capsiferriformans ES-2]
gi|302581100|gb|ADL55111.1| Tetratricopeptide TPR_1 repeat-containing protein [Gallionella
capsiferriformans ES-2]
Length = 961
Score = 40.8 bits (94), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 31/58 (53%), Gaps = 1/58 (1%)
Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDVLL 210
NLG + G T AE +R+A PD +A CNL + L+ AR +EA S VLL
Sbjct: 413 NLGIVQQELGRLTDAEASFRQALQFSPDLLEAHCNLGNVLLGAARLSEAESCYRHVLL 470
Score = 37.4 bits (85), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 73/171 (42%), Gaps = 33/171 (19%)
Query: 34 DPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDN 93
D E A + +A+ +AL ++A +L+ R+DEA ++ K + +S D
Sbjct: 253 DREEAEQCYRRALQIKPGYGAALSNLANLLQMLGRLDEAAACCRTIL----KSSPDSADV 308
Query: 94 V--LIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI 151
+ L ++ K+ G+L E + LR FN ++ +I
Sbjct: 309 LFNLANILKRLGQLAEAEASYRVALR-------FN-------------------PDSVQI 342
Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEA 201
GNLG + G + AE +R+A I+PD +A CNL + R EA
Sbjct: 343 HGNLGITLKELGRFEEAESSFRQALRINPDYAQAHCNLGVMFKELDRLDEA 393
>gi|312282561|dbj|BAJ34146.1| unnamed protein product [Thellungiella halophila]
Length = 762
Score = 40.8 bits (94), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 62/151 (41%), Gaps = 30/151 (19%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
+ AIV + A + A ++ V+ K +D +D+AVE + + AQ SL+N+
Sbjct: 174 DMAIVFYELAFHFNPHCAEACNNLGVLYKDRDNLDKAVECYQMALSIKPNFAQ-SLNNLG 232
Query: 96 IDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
+ +Y G++D ++ K L + EAFN N
Sbjct: 233 V-VYTVQGKMDAAASMIEKAILANPTYAEAFN---------------------------N 264
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
LG Y GN T A Y + IDPD+ A
Sbjct: 265 LGVLYRDAGNITMAIDAYEECLKIDPDSRNA 295
>gi|400601016|gb|EJP68684.1| tetratricopeptide-like protein [Beauveria bassiana ARSEF 2860]
Length = 626
Score = 40.8 bits (94), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 28/51 (54%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVL 205
L + GNYT A K LI+ +A+ HCLIKQ+R+ EA +VL
Sbjct: 46 LAQVHFATGNYTRAHAFLAKQDLINRNASCRYLAGHCLIKQSRFEEALAVL 96
>gi|373469271|ref|ZP_09560471.1| tetratricopeptide repeat protein, partial [Lachnospiraceae
bacterium oral taxon 082 str. F0431]
gi|371764715|gb|EHO53097.1| tetratricopeptide repeat protein, partial [Lachnospiraceae
bacterium oral taxon 082 str. F0431]
Length = 174
Score = 40.8 bits (94), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 41/79 (51%), Gaps = 6/79 (7%)
Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDP---DANKACNLSHCLIKQARYTEARSVLEDVL- 209
+LG YM + +Y SA V KA +DP DA K L+ K R +AR+ LE VL
Sbjct: 20 DLGSKYMAELDYESAIVALNKAIKLDPKNVDAYKM--LAEVYEKSGRLEDARATLEKVLD 77
Query: 210 LGKLSGSTETKTINRVKEL 228
L LS E + NR+K L
Sbjct: 78 LDDLSSDNEDEINNRIKNL 96
>gi|431926271|ref|YP_007239305.1| Flp pilus assembly protein TadD [Pseudomonas stutzeri RCH2]
gi|431824558|gb|AGA85675.1| Flp pilus assembly protein TadD [Pseudomonas stutzeri RCH2]
Length = 575
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 28/54 (51%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSV 204
IL +LGW + GN T AE + RKA PD A +L L +EAR+V
Sbjct: 495 ILDSLGWVNFRLGNLTEAEALLRKALQRFPDHEVAAHLGEVLWTMGERSEARAV 548
>gi|159028254|emb|CAO88064.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length = 939
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 50/174 (28%), Positives = 72/174 (41%), Gaps = 35/174 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQAQE---SLDNVLIDLYKKCGR 104
V ++L ++A++ + Q R EA +EA+ + L + SL+N L +LY+ GR
Sbjct: 709 VATSLNNLALLYESQGRYTEAEPLYLEALDLTKQLLGDNHPDVATSLNN-LAELYRSQGR 767
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
E L Q L + N T+ L NL Y +G
Sbjct: 768 YTEAEPLYLQALDLYKQLLGDNHPYVATS------------------LNNLAALYYSQGR 809
Query: 165 YTSAEVVY------RKAQLID--PD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
YT AE +Y RK L D PD A+ NL+ Q RYTEA + + L
Sbjct: 810 YTEAEPLYLQALDLRKRLLGDNHPDVASSLNNLAELYYSQGRYTEAEPLYLEAL 863
>gi|330502031|ref|YP_004378900.1| hypothetical protein [Pseudomonas mendocina NK-01]
gi|328916317|gb|AEB57148.1| TPR repeat-containing protein [Pseudomonas mendocina NK-01]
Length = 569
Score = 40.4 bits (93), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 30/59 (50%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
IL +LGW + GN AE + R+A PD A +L L Q + EARSV D L
Sbjct: 489 ILDSLGWVNYRLGNLDEAERLLRQALEKFPDHEVAAHLGEVLWAQGKQREARSVWRDAL 547
>gi|407716992|ref|YP_006838272.1| hypothetical protein Q91_1735 [Cycloclasticus sp. P1]
gi|407257328|gb|AFT67769.1| TPR domain protein [Cycloclasticus sp. P1]
Length = 572
Score = 40.4 bits (93), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 45/89 (50%), Gaps = 1/89 (1%)
Query: 122 GEAFNGKPTKTARSHGKKFQ-VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
G A K + + G Q +++K + IL ++GW ++G+Y SA V+ ++A P
Sbjct: 467 GYALASKTFRFKEARGYLTQALSLKPDDPAILDSMGWLDYREGHYESALVLLKRAYSQTP 526
Query: 181 DANKACNLSHCLIKQARYTEARSVLEDVL 209
+ A +L L R EAR++ E+ L
Sbjct: 527 EGEIAAHLGQTLWMLGRQKEARTIWEEAL 555
>gi|326498109|dbj|BAJ94917.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 986
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 51/225 (22%), Positives = 88/225 (39%), Gaps = 38/225 (16%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
A+ + +A+ A A ++A + +Q ++D A+ +C + E+ +N + +
Sbjct: 285 AVACYQRALQARPDYAMAYGNLATIYYEQRQLDMAIHCYNQ-AIICDSRFVEAYNN-MGN 342
Query: 98 LYKKCGRLDEQIELLKQKLRM-IYHGEAFNG--------KPTKTARSHGKKFQVTVKQET 148
K GR++E I + L + H +A TA S K + V
Sbjct: 343 ALKDAGRVEEAINCFQSCLVLQANHPQALTNLGNIYMEWNMISTAASFYKA-AIAVTSGL 401
Query: 149 SRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDV 208
S L NL Y Q+G+Y A Y + IDP A D
Sbjct: 402 SSPLNNLAVIYKQQGSYADAIACYTEVLRIDPTA-----------------------ADA 438
Query: 209 LLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
L+ + G+T K RV E +Q+ +I P++ ++L A+
Sbjct: 439 LVNR--GNT-FKEFGRVAEAIQDYIQAVTIRPNMAEAHANLASAY 480
>gi|146184699|ref|XP_001029949.2| SLEI family protein [Tetrahymena thermophila]
gi|146143045|gb|EAR82286.2| SLEI family protein [Tetrahymena thermophila SB210]
Length = 2342
Score = 40.4 bits (93), Expect = 0.86, Method: Composition-based stats.
Identities = 37/183 (20%), Positives = 76/183 (41%), Gaps = 16/183 (8%)
Query: 11 VVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVD 70
+V P + + + L +K E AI + K + + AL ++ + + Q D
Sbjct: 443 IVQLSPNSEYDFFSLGELYLTKKIYEEAIKCYKKTLEINPQYIKALNNLGLAYEYQQMFD 502
Query: 71 EAVEAIK-----------SFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMI 119
+A+E K ++ + A + + + I+ YKK ++ Q + +
Sbjct: 503 QAIECYKKAIEIDPNYHLAYYNCGISYASKKMVDEAIECYKKVLEINPQYLNASTNMGYL 562
Query: 120 YHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLID 179
Y + K + +S + V + + +IL NLG+AY + + A +Y++ ID
Sbjct: 563 YSQQKMYDKAIECYQS-----ALQVNENSLKILNNLGYAYYKSNMHDQAIEIYKRVIQID 617
Query: 180 PDA 182
P +
Sbjct: 618 PKS 620
>gi|359799757|ref|ZP_09302310.1| tetratricopeptide repeat family protein 11 [Achromobacter
arsenitoxydans SY8]
gi|359362183|gb|EHK63927.1| tetratricopeptide repeat family protein 11 [Achromobacter
arsenitoxydans SY8]
Length = 628
Score = 40.4 bits (93), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 40/160 (25%), Positives = 73/160 (45%), Gaps = 13/160 (8%)
Query: 60 AVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM- 118
A +L+ DRV +AV +++ + + L LY++ RL + +L+Q + +
Sbjct: 452 AQILRDADRVGQAVSVLEAADQALPDTVEIKYE--LAMLYERQNRLADLERMLRQVIALD 509
Query: 119 IYHGEAFNGKPTKTARSHGKKFQ-----VTVKQETSR----ILGNLGWAYMQKGNYTSAE 169
H A+N A H ++ +T E S IL ++GW + G+ T+A
Sbjct: 510 PDHAHAYNALGYTLA-DHNQRLPEALDLITQALELSPNDPFILDSMGWVKFRMGDTTAAA 568
Query: 170 VVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
R+A + P+A+ A +L+ L Q + +A +L L
Sbjct: 569 DYLRRAYSVRPEADIAAHLAEVLWSQGKRDQATDLLRAAL 608
>gi|186476952|ref|YP_001858422.1| TPR repeat-containing protein [Burkholderia phymatum STM815]
gi|184193411|gb|ACC71376.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia phymatum
STM815]
Length = 556
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 63/131 (48%), Gaps = 17/131 (12%)
Query: 87 AQESLDNVLIDLYKKCGRLDEQIELLKQKLRM-------IYH-GEAF--NGKPTKTARSH 136
A+ +L +L+D RLDE + + ++ L + +Y+ G A + +P + A +
Sbjct: 141 ARNNLGTLLMD----ESRLDESVAMFRETLEIDGTHIHALYNLGMALLRSVRPAEAATAF 196
Query: 137 GKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQ 195
+ + V+ + L NLG A G+Y AE VYR+ + PD A+ NL+ L+ Q
Sbjct: 197 RQA--LAVQPDHRDALHNLGTALKLNGHYDEAEAVYRQTLALAPDFADVQWNLAVLLLTQ 254
Query: 196 ARYTEARSVLE 206
R E + E
Sbjct: 255 GRLAEGWPLAE 265
>gi|157129230|ref|XP_001661650.1| kinesin light chain 1 and [Aedes aegypti]
gi|108872282|gb|EAT36507.1| AAEL011410-PA [Aedes aegypti]
Length = 472
Score = 40.4 bits (93), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 72/175 (41%), Gaps = 35/175 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEAVEAIKSFRHL-------CSKQAQESLDNVLIDLYKKCGR 104
V + L + +V + Q+ + EA++ + + C +L+N+ + LY K G
Sbjct: 234 VATMLSILTMVYRDQNNLPEAIKHMNEALGIWVRCLGECHPSVAAALNNLAV-LYGKNGN 292
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
E L K+ A + R H + ++ L NL +G
Sbjct: 293 YKEAESLCKR---------ALANRENVLGRYH---------PDVAKQLNNLALLCQNQGK 334
Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVLL 210
+ E+ R+A I DP+A K CNL+ C +K +Y EA +L DVL+
Sbjct: 335 HGEVELYIRRALEIFESQLGANDPNAIKTKCNLAACCLKLRKYKEADQLLRDVLV 389
>gi|429859839|gb|ELA34601.1| anaphase-promoting complex subunit cut9 [Colletotrichum
gloeosporioides Nara gc5]
Length = 622
Score = 40.4 bits (93), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLED 207
L + GNYT A+ K L+ + + +HCL+KQ+R+ EA SVL D
Sbjct: 41 LAQVHFATGNYTRAQTFLAKQDLVSRNVSCRYLAAHCLVKQSRFEEALSVLGD 93
>gi|388259165|ref|ZP_10136339.1| hypothetical protein O59_003560 [Cellvibrio sp. BR]
gi|387937123|gb|EIK43680.1| hypothetical protein O59_003560 [Cellvibrio sp. BR]
Length = 579
Score = 40.4 bits (93), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 46/194 (23%), Positives = 77/194 (39%), Gaps = 46/194 (23%)
Query: 35 PEAAIVLFWKAI--NAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLD 92
P + +LF +A+ N R+ + +D+ +LKQ + V A+ S ++ + + Q
Sbjct: 427 PSSTRLLFARAMLNNERQRLPATERDLRQILKQDS---DNVAALNSLGYILADRTQ---- 479
Query: 93 NVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRIL 152
R E ELL++ LR+ KP A I+
Sbjct: 480 -----------RYSEAYELLEKALRL---------KPDDAA-----------------II 502
Query: 153 GNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGK 212
++GW Y + G Y A R+A P+A +L L EAR V + +
Sbjct: 503 DSMGWLYYRTGKYPEALSHLRQAYEASPNAEIGAHLGEVLWVIGETEEARRVWREAQQLE 562
Query: 213 LSGSTETKTINRVK 226
+T +T+NR+K
Sbjct: 563 PDNATIRETLNRLK 576
>gi|196232945|ref|ZP_03131794.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
gi|196222923|gb|EDY17444.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
Length = 808
Score = 40.4 bits (93), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 63/151 (41%), Gaps = 10/151 (6%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AIV +AI A ++ V K++ R DEAV A + L A+ N L +
Sbjct: 235 AIVAQRRAIELRPDFAEAYNNLGVACKERGRTDEAVAAFRRSIELQPDFAETH--NNLGN 292
Query: 98 LYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVT--------VKQETS 149
++CGR DE I ++ L + A T RS G + T ++ S
Sbjct: 293 TLQECGRADEAIAAYRRALALQPEYAAAANNLASTLRSVGLLDEATAAIRRALELQPALS 352
Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
I NLG +G+ A YR+A ++P
Sbjct: 353 DIRNNLGNILKDQGDVEGAIGAYREALQLEP 383
>gi|320590030|gb|EFX02475.1| cell division protein [Grosmannia clavigera kw1407]
Length = 936
Score = 40.4 bits (93), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 31/51 (60%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVL 205
L + GNYT A+ + K+ LI +A+ +HCLIKQ+R+ +A +VL
Sbjct: 41 LAQVHFAMGNYTRAQALLSKSNLIARNASCRYLAAHCLIKQSRFDDALAVL 91
>gi|157812772|gb|ABV81131.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase [Thulinius stephaniae]
Length = 289
Score = 40.4 bits (93), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 38/160 (23%), Positives = 63/160 (39%), Gaps = 37/160 (23%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV----EAIKSFRHLCSKQAQESLDN 93
A+ L++KA+ +A ++A VL+QQ ++ E++ EAI+ +
Sbjct: 73 AVRLYYKALEVLPEFAAAHSNLASVLQQQGKLQESIAHYKEAIR-------------ISP 119
Query: 94 VLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILG 153
D Y G +++ G R + + +T+ +
Sbjct: 120 TFADAYSNMGNTLKEM-----------------GDXQGAIRCYSRA--ITINPAFADAHS 160
Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCL 192
NL + GN T A YR A + PD A CNL+HCL
Sbjct: 161 NLASIHKDTGNITEAVASYRTALKLKPDFPDAFCNLAHCL 200
>gi|423014093|ref|ZP_17004814.1| tetratricopeptide repeat family protein 11 [Achromobacter
xylosoxidans AXX-A]
gi|338783024|gb|EGP47393.1| tetratricopeptide repeat family protein 11 [Achromobacter
xylosoxidans AXX-A]
Length = 628
Score = 40.4 bits (93), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 42/176 (23%), Positives = 78/176 (44%), Gaps = 13/176 (7%)
Query: 60 AVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM- 118
A +L+ DRV +AV +++ + + L LY++ RL + +L+Q + +
Sbjct: 452 AQILRDADRVGQAVSVLEAADQALPDTVEIKYE--LAMLYERQDRLADLERMLRQVIALD 509
Query: 119 IYHGEAFNGKPTKTARSHGKKFQ-----VTVKQETSR----ILGNLGWAYMQKGNYTSAE 169
H A+N A H ++ +T E S IL ++GW + G+ +A
Sbjct: 510 PEHAHAYNALGYTLA-DHNQRLPEALDLITQALELSPNDPFILDSMGWVKFRMGDQVAAA 568
Query: 170 VVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRV 225
R+A + P+A+ A +L+ L Q + +A +L LL T + + R+
Sbjct: 569 DYLRRAYSVRPEADIAAHLAEVLWTQGKRDQAVELLRAALLKDPKNKTVLEVVKRL 624
>gi|404498065|ref|YP_006722171.1| TPR domain-containing protein [Geobacter metallireducens GS-15]
gi|78195662|gb|ABB33429.1| TPR domain protein [Geobacter metallireducens GS-15]
Length = 643
Score = 40.4 bits (93), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 46/178 (25%), Positives = 76/178 (42%), Gaps = 29/178 (16%)
Query: 30 LVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE 89
L K+P+ A+ + A+ A +++A + ++ ++ +EA + K + A+
Sbjct: 414 LGAKNPKEAMASYQDALKIAPDNLEARRELAALYRKGNQNEEAEKQYKEILRIKKDDAEA 473
Query: 90 SLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETS 149
N+L LY K + DE I LLK + + P A SH K
Sbjct: 474 R--NILTALYVKSKKYDELITLLKDGVELA---------PNDPA-SHYK----------- 510
Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIKQARYTEARSVLE 206
LG Y K +Y +AE Y+KA + D K+ N L +K + +EA+ LE
Sbjct: 511 -----LGLIYEFKKDYDAAEQEYKKAVELKGDHAKSLNALGRIYLKTGKISEAKGALE 563
>gi|300864445|ref|ZP_07109316.1| tetratricopeptide TPR_2 [Oscillatoria sp. PCC 6506]
gi|300337589|emb|CBN54464.1| tetratricopeptide TPR_2 [Oscillatoria sp. PCC 6506]
Length = 1129
Score = 40.4 bits (93), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 59/126 (46%), Gaps = 11/126 (8%)
Query: 95 LIDLYKKCGRLDEQIELLKQKLRMIYH--GEAFNGKPTKTARSHGKK------FQ--VTV 144
L +++K G+ E I ++ + H G F+ T + GK+ +Q + V
Sbjct: 425 LGKVFQKQGKSAEAIAYFQKTSDINPHVVGADFHFNLANTLLTEGKRDEAIQSYQRAIAV 484
Query: 145 KQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARS 203
K + + N+G A MQ+GN A YRKA I P N+++ L+ Q +Y EA S
Sbjct: 485 KPDWAEAYANIGSARMQQGNLEEAIQYYRKAIAIKPQLEALHFNIANALLHQGKYEEAIS 544
Query: 204 VLEDVL 209
++ +
Sbjct: 545 NYQEAI 550
>gi|383770004|ref|YP_005449067.1| hypothetical protein S23_17390 [Bradyrhizobium sp. S23321]
gi|381358125|dbj|BAL74955.1| hypothetical protein S23_17390 [Bradyrhizobium sp. S23321]
Length = 1062
Score = 40.4 bits (93), Expect = 1.0, Method: Composition-based stats.
Identities = 52/191 (27%), Positives = 80/191 (41%), Gaps = 34/191 (17%)
Query: 36 EAAIVLFWKAINAGD-RVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQES--LD 92
E A+ L A A D R+ L + + + + EA ++ R L + QA + LD
Sbjct: 134 ERALALGTSAFGANDGRLAGTLSALGGLARDNENFKEAAGYLE--RALSALQAANAPPLD 191
Query: 93 -----NVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQE 147
+ L D+Y GR D+ LLKQ + ++ RS GK + T +
Sbjct: 192 VASAMDDLGDVYGLLGRFDDGERLLKQGIELL-------------DRSFGKNAETTPNYD 238
Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLI------DPDANKAC---NLSHCLIKQARY 198
++L +LG Y+ G AE R+A I D N A NL+ L +AR+
Sbjct: 239 --KMLNDLGNLYLDAGRLPDAEAAMRRALAITRARNGDAHPNVAATMGNLATVLKHEARH 296
Query: 199 TEARSVLEDVL 209
EA + + L
Sbjct: 297 AEAEKLYQQTL 307
>gi|398342815|ref|ZP_10527518.1| hypothetical protein LinasL1_07018 [Leptospira inadai serovar Lyme
str. 10]
Length = 689
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 47/86 (54%), Gaps = 7/86 (8%)
Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEARSVLE 206
++ LGN+ + +G+Y SAE +R+A + P+ KA NLS L K + EA LE
Sbjct: 279 SNAYLGNITY---NRGDYGSAEHYFREAATLSPNDAKALYNLSVVLKKNGKMEEALKYLE 335
Query: 207 DVLLGKLSGSTETKTINRVKELLQEL 232
+ KL+G+++ + + + ++L
Sbjct: 336 ---MAKLAGASDPEIFRSIADSFEQL 358
>gi|373469728|ref|ZP_09560896.1| tetratricopeptide repeat protein [Lachnospiraceae bacterium oral
taxon 082 str. F0431]
gi|371763886|gb|EHO52335.1| tetratricopeptide repeat protein [Lachnospiraceae bacterium oral
taxon 082 str. F0431]
Length = 250
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 41/79 (51%), Gaps = 6/79 (7%)
Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDP---DANKACNLSHCLIKQARYTEARSVLEDVL- 209
+LG YM + +Y SA V KA IDP DA K L+ K R +AR+ LE VL
Sbjct: 34 DLGSKYMAELDYESAIVALNKAIKIDPKNVDAYKM--LAEVYEKSGRLEDARATLEKVLD 91
Query: 210 LGKLSGSTETKTINRVKEL 228
+ LS E + NR+K L
Sbjct: 92 IDDLSSDNEDEINNRIKNL 110
>gi|218246150|ref|YP_002371521.1| hypothetical protein PCC8801_1300 [Cyanothece sp. PCC 8801]
gi|218166628|gb|ACK65365.1| TPR repeat-containing protein [Cyanothece sp. PCC 8801]
Length = 878
Score = 40.0 bits (92), Expect = 1.1, Method: Composition-based stats.
Identities = 46/179 (25%), Positives = 76/179 (42%), Gaps = 19/179 (10%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE--SLDN 93
E AI + KAI A ++ + L Q +++EA+ A + L A +L N
Sbjct: 148 EEAIAAYQKAIQLNPNFTQAYYNLGIALSDQGKLEEAIAAYQKAIQLNPNYADAYYNLGN 207
Query: 94 VLIDLYKKCGRLDEQIELLKQKLRMIYH-GEAFN---------GKPTKTARSHGKKFQVT 143
L D G+LDE I ++ +++ + A+N GK + ++ K Q+
Sbjct: 208 ALFDQ----GKLDEAIAAYQKAIQLDPNDANAYNNLGAALYKQGKLEEAIAAYQKAIQLN 263
Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEA 201
+ NLG A +G A Y+KA ++P+ +A NL L Q + EA
Sbjct: 264 --PNLAEAYNNLGVALSDQGKRDEAIAAYQKAIQLNPNLAEAYNNLGVALSDQGKRDEA 320
Score = 39.3 bits (90), Expect = 1.8, Method: Composition-based stats.
Identities = 48/177 (27%), Positives = 79/177 (44%), Gaps = 15/177 (8%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
+ AI + KAI A ++ L Q +++EA+ A + L A ++ N+
Sbjct: 80 DEAIAAYQKAIQLNPNDADAYNNLGNALSDQGKLEEAIAAYQKAIQLNPNYA-DAYYNLG 138
Query: 96 IDLYKKCGRLDEQIELLKQKLRM------IYH--GEAFN--GKPTKTARSHGKKFQVTVK 145
I L + G+L+E I ++ +++ Y+ G A + GK + ++ K Q+
Sbjct: 139 IALSDQ-GKLEEAIAAYQKAIQLNPNFTQAYYNLGIALSDQGKLEEAIAAYQKAIQLNPN 197
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEA 201
+ NLG A +G A Y+KA +DP DAN NL L KQ + EA
Sbjct: 198 YADAYY--NLGNALFDQGKLDEAIAAYQKAIQLDPNDANAYNNLGAALYKQGKLEEA 252
>gi|405362768|ref|ZP_11025821.1| hypothetical protein A176_1960 [Chondromyces apiculatus DSM 436]
gi|397090228|gb|EJJ21102.1| hypothetical protein A176_1960 [Myxococcus sp. (contaminant ex DSM
436)]
Length = 624
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 46/176 (26%), Positives = 81/176 (46%), Gaps = 26/176 (14%)
Query: 23 VRAKHVQLVEKD--PEAAIVLFWKAINAGDRVDSALKD-MAVVLKQQDRVDEAV----EA 75
+RA+ + +E+ P A L +A++AG +AL D +A +L +Q R DEA+ +A
Sbjct: 396 LRAQFARALERGGTPGRAEALLREALDAGP--SAALYDALAAMLHRQGRGDEALRLLGDA 453
Query: 76 IKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY-HGEAFN-------- 126
+ F E L VL +++ G + + ++ L + H A N
Sbjct: 454 VARFPR------DEDLLYVLGAAHERQGDVPGALARMRAVLAVSPDHAAALNFLGYLLAQ 507
Query: 127 -GKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
G+ A ++ + ++ ET L +LGW Y ++G+YT A +A + PD
Sbjct: 508 AGQNLDEAERRVRR-ALELRPETGAYLDSLGWVYFRRGDYTRAVDALERASSLSPD 562
>gi|395212900|ref|ZP_10400010.1| hypothetical protein O71_04708 [Pontibacter sp. BAB1700]
gi|394456977|gb|EJF11191.1| hypothetical protein O71_04708 [Pontibacter sp. BAB1700]
Length = 437
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 61/143 (42%), Gaps = 13/143 (9%)
Query: 96 IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILG-- 153
+ YKKC L+ + E Q++ IY E T T R FQ + ++ +
Sbjct: 154 VKYYKKCIELNVENEAAMQEI--IYCMEI-----TGTMREELPFFQAFIDKDPYSYVAWF 206
Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEARSVLEDVLLGK 212
NLG Y + G+Y A Y A +I PD A N+++ + YT+A +L
Sbjct: 207 NLGNVYNKLGSYEKAIAAYDYATIIKPDFITAYNNMANAYVFIGEYTKAIEAFNGML--- 263
Query: 213 LSGSTETKTINRVKELLQELEPW 235
GS + + E ++++ W
Sbjct: 264 EHGSPSAEVFCNIGECYEKMQQW 286
>gi|346974193|gb|EGY17645.1| anaphase-promoting complex subunit cut9 [Verticillium dahliae
VdLs.17]
Length = 618
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 28/53 (52%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLED 207
L + GNYT A+ K LI + + +HCLIKQ+RY EA +L D
Sbjct: 41 LAQVHFATGNYTRAQTFLSKQDLITRNTSCRYLAAHCLIKQSRYDEALGLLGD 93
>gi|440680764|ref|YP_007155559.1| Tetratricopeptide TPR_2 repeat-containing protein [Anabaena
cylindrica PCC 7122]
gi|428677883|gb|AFZ56649.1| Tetratricopeptide TPR_2 repeat-containing protein [Anabaena
cylindrica PCC 7122]
Length = 737
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 52/207 (25%), Positives = 83/207 (40%), Gaps = 53/207 (25%)
Query: 18 GDSPYVRAKHV---QLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVE 74
+S ++RA + QL E P+ A +L ++A++ K + R +EA
Sbjct: 331 AESLFLRALKIHEEQLGENHPDTATIL---------------NNLALLYKNEGRYEEAES 375
Query: 75 AIKSFRHLCSKQAQES-------LDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
+C +Q E+ L+N+ I +Y+ GR +E L + L++ E G
Sbjct: 376 LYLRALKICEEQLGENHPHTATILNNLAI-VYQNEGRYEEAESLFLRDLKIC--EEQLGG 432
Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVY------RKAQLIDPD 181
TA S L NL Y KG Y AE +Y R+ QL +
Sbjct: 433 NHPDTANS----------------LNNLAELYRNKGRYEEAESLYLRALKIREEQLGENH 476
Query: 182 ANKACNLSHCLI---KQARYTEARSVL 205
+ A L++ I + RY EA S+
Sbjct: 477 PHTATILNNLAIVYQNEGRYEEAESLF 503
>gi|118366677|ref|XP_001016554.1| TPR Domain containing protein [Tetrahymena thermophila]
gi|89298321|gb|EAR96309.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
Length = 564
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/132 (24%), Positives = 65/132 (49%), Gaps = 8/132 (6%)
Query: 77 KSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAF-NGKPTKTARS 135
++FR L + ++ + + I L++K +LD++ KL G +F K A +
Sbjct: 42 QAFRQLGYQFLKKQMYDDAITLFQKAIQLDDKDSWAFGKL-----GYSFLKKKMYDDAIT 96
Query: 136 HGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIK 194
+K + + + S G LG+++++K Y A ++KA +DP D++ NL + K
Sbjct: 97 FFQK-AIQLNDKDSWAFGKLGYSFLKKKMYDDAFTFFQKAAQLDPQDSSAFANLGYLFYK 155
Query: 195 QARYTEARSVLE 206
+ Y +A + +
Sbjct: 156 KEMYDDAITFFQ 167
>gi|398802709|ref|ZP_10561912.1| Tfp pilus assembly protein PilF [Polaromonas sp. CF318]
gi|398098947|gb|EJL89220.1| Tfp pilus assembly protein PilF [Polaromonas sp. CF318]
Length = 794
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 80/186 (43%), Gaps = 15/186 (8%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P+ A F +A+ +A + + LK R DEA A + L A E+L+N
Sbjct: 94 PDEAEAAFRRALALDPEDAAAHNNFGIFLKDNRRPDEAEAAYRRALALRPDYA-EALNN- 151
Query: 95 LIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNG---------KPTKTARSHGKKFQVTV 144
L +L +K GRLDE ++ L++ + EA N + + S G+ T
Sbjct: 152 LGNLLRKSGRLDEAEAAYRRALQLRPDYAEAHNNLGILLKSSWRMREAGASQGRATPSTP 211
Query: 145 KQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIK-QARYTEARS 203
+R L +LG + + AE YR+A P+ +A N L+K + R EA +
Sbjct: 212 GH--ARALNDLGNRHQKNRRSIEAEAAYREALEAWPEYAEAHNNLGVLLKSEGRSPEAEA 269
Query: 204 VLEDVL 209
V L
Sbjct: 270 VYRRAL 275
>gi|75318847|sp|O82422.1|SPY_HORVU RecName: Full=Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY; AltName:
Full=HvSPY
gi|3617837|gb|AAC36055.1| gibberellin action negative regulator SPY [Hordeum vulgare subsp.
vulgare]
gi|326515544|dbj|BAK07018.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 944
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 38/151 (25%), Positives = 64/151 (42%), Gaps = 30/151 (19%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AIV + A++ R A ++ V+ K +D +D+AVE + + +Q SL+N+
Sbjct: 303 EMAIVFYELALHFNPRCAEACNNLGVIYKDRDNLDKAVECYQMALSIKPNFSQ-SLNNLG 361
Query: 96 IDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
+ +Y G++D ++ K L + EA+N N
Sbjct: 362 V-VYTVQGKMDAAASMIEKAILANPTYAEAYN---------------------------N 393
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
LG Y G+ T + Y + IDPD+ A
Sbjct: 394 LGVLYRDAGSITLSVQAYERCLQIDPDSRNA 424
>gi|218439885|ref|YP_002378214.1| hypothetical protein PCC7424_2942 [Cyanothece sp. PCC 7424]
gi|218172613|gb|ACK71346.1| Tetratricopeptide TPR_2 repeat protein [Cyanothece sp. PCC 7424]
Length = 632
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 57/244 (23%), Positives = 95/244 (38%), Gaps = 27/244 (11%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AI + KAI A + + L Q +++EA+ A + A E N+
Sbjct: 145 EEAIAAYNKAIEINPNYAFAYIGLGIALYNQGKLEEAIAAYNKAIEINPNYA-EVYSNLG 203
Query: 96 IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI---- 151
LY + G+L+E I + I +AF A S+ K + + + I
Sbjct: 204 FALYNQ-GKLEEAIAAYNTAIE-INPNDAFAYNNLGIALSNQGKLEEAIAAYNTAIEINP 261
Query: 152 -----LGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVL 205
NLG A +G A Y A I+P DA L L Q + EA +
Sbjct: 262 NDAFAYNNLGVALYNQGKLEEAIAAYNTAIEINPNDAFAYIGLGIALHDQGKLEEAIAAY 321
Query: 206 EDVL-----------LGKLSGSTETKTI---NRVKELLQELEPWQSIPPSLTTKKSSLED 251
L + L+ +T + +++E + E E I P+ TT +++L++
Sbjct: 322 NKTLSLADKKADRASVHTLAHTTLGYALQQQGKLEEAIAEYEKALKIDPNNTTAQNNLKE 381
Query: 252 AFLE 255
A ++
Sbjct: 382 ALIQ 385
>gi|224140075|ref|XP_002323412.1| predicted protein [Populus trichocarpa]
gi|222868042|gb|EEF05173.1| predicted protein [Populus trichocarpa]
Length = 917
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 64/154 (41%), Gaps = 32/154 (20%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK--SFRHLCSK-QAQESLD 92
E AIV + A N A ++ V+ K +D +D+AVE + S L K +SL+
Sbjct: 317 EMAIVFYELAFNFNPHCAEACNNLGVIYKDRDNLDKAVECYQANSDTSLSIKPNFSQSLN 376
Query: 93 NVLIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI 151
N+ + +Y G++D ++++ + + EA+N
Sbjct: 377 NLGV-VYTVQGKMDAAASMIEKAIMANPTYAEAYN------------------------- 410
Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
NLG Y GN + A Y + IDPD+ A
Sbjct: 411 --NLGVLYRDAGNISMAISAYEQCLEIDPDSRNA 442
>gi|253997194|ref|YP_003049258.1| hypothetical protein Mmol_1827 [Methylotenera mobilis JLW8]
gi|253983873|gb|ACT48731.1| Tetratricopeptide TPR_2 repeat protein [Methylotenera mobilis JLW8]
Length = 591
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 39/85 (45%)
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEA 201
+T+ +L +LGWA+ +KGN A +A I+PD A +L L ++ + +A
Sbjct: 506 LTLAPNDHYMLDSLGWAHYRKGNLDKAIQYLEQAFRINPDPEIAAHLGEVLWQKGEFEKA 565
Query: 202 RSVLEDVLLGKLSGSTETKTINRVK 226
+ V D L T N+ K
Sbjct: 566 KKVWSDALTADPDNEILLTTANKFK 590
>gi|339495311|ref|YP_004715604.1| TPR domain-containing protein [Pseudomonas stutzeri ATCC 17588 =
LMG 11199]
gi|338802683|gb|AEJ06515.1| TPR domain-containing protein [Pseudomonas stutzeri ATCC 17588 =
LMG 11199]
Length = 575
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 29/54 (53%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSV 204
IL +LGW + GN + AE + RKA PD A +L L + +EAR+V
Sbjct: 495 ILDSLGWVNFRLGNLSEAEALLRKALQRFPDHEVAAHLGEVLWTKGERSEARAV 548
>gi|386021936|ref|YP_005939961.1| hypothetical protein PSTAA_3350 [Pseudomonas stutzeri DSM 4166]
gi|327481909|gb|AEA85219.1| TPR domain-containing protein [Pseudomonas stutzeri DSM 4166]
Length = 575
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 29/54 (53%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSV 204
IL +LGW + GN + AE + RKA PD A +L L + +EAR+V
Sbjct: 495 ILDSLGWVNFRLGNLSEAEALLRKALQRFPDHEVAAHLGEVLWTKGERSEARAV 548
>gi|452747685|ref|ZP_21947478.1| TPR domain-containing protein [Pseudomonas stutzeri NF13]
gi|452008429|gb|EME00669.1| TPR domain-containing protein [Pseudomonas stutzeri NF13]
Length = 575
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 28/54 (51%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSV 204
IL +LGW + GN T AE + RKA D A +L L Q +EAR+V
Sbjct: 495 ILDSLGWVNFRLGNLTEAEALLRKALQRFADHEVAAHLGEVLWAQGERSEARAV 548
>gi|307151943|ref|YP_003887327.1| NB-ARC domain-containing protein [Cyanothece sp. PCC 7822]
gi|306982171|gb|ADN14052.1| NB-ARC domain protein [Cyanothece sp. PCC 7822]
Length = 1023
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 70/174 (40%), Gaps = 35/174 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
V ++L ++A + QDR EA EA++ + + SL+N+ + LY GR
Sbjct: 841 VTTSLNNLAELYCSQDRYTEAEPLYTEALEMTKRMFDGNHPSVAGSLNNLAV-LYYNQGR 899
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
E L + L M F+G A S L NL Y +G
Sbjct: 900 YTEAEPLYNEALEMT--KRMFDGDHPSVATS----------------LNNLALLYKSQGR 941
Query: 165 YTSAEVVYRKAQLI--------DPD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
YT AE +Y +A + PD A NL+ Q RY EA+S+ + L
Sbjct: 942 YTEAEPLYTEALEMRRRIFTGDHPDVALSLNNLAELYYSQDRYDEAKSLYTEAL 995
>gi|307731322|ref|YP_003908546.1| hypothetical protein [Burkholderia sp. CCGE1003]
gi|307585857|gb|ADN59255.1| Tetratricopeptide TPR_1 repeat-containing protein [Burkholderia sp.
CCGE1003]
Length = 602
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 41/167 (24%), Positives = 65/167 (38%), Gaps = 18/167 (10%)
Query: 55 ALKDMAVVLKQQDRVDEAVE----AIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIE 110
A + V L+ Q ++EAVE AI S L A +L L K GR E +
Sbjct: 124 ARNNFGVALQAQGALNEAVEQYRLAIASNPALVD--AHLNLGTAL----GKLGRFAEALA 177
Query: 111 LLKQKLRM--------IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQK 162
+ L++ G A N + A + + ++ + + NLG A ++
Sbjct: 178 CYRNALQLDPASAEAHFNAGNAHNARGEHEAAVASFERALALRPDYAEAHVNLGSAIGKR 237
Query: 163 GNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
G+Y AE YR+A + P+ + L Q R E + L
Sbjct: 238 GDYAGAEAHYRRAVELKPNPTNLVCMGGALGAQGRLAEEEKFYREAL 284
>gi|425434676|ref|ZP_18815141.1| TPR repeat-containing protein [Microcystis aeruginosa PCC 9432]
gi|389675804|emb|CCH95098.1| TPR repeat-containing protein [Microcystis aeruginosa PCC 9432]
Length = 310
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 62/247 (25%), Positives = 96/247 (38%), Gaps = 50/247 (20%)
Query: 3 QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV- 61
+ RD ++ K P ++ + L +KD E A+ F A R+D + A+
Sbjct: 34 EARDAFEEILEKDPGSKQAHLGIGSICLKQKDYEGALTHF----GAARRIDPMMAKAALA 89
Query: 62 ---VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM 118
V +Q+ +D A+EA K D V ID G L LLKQ
Sbjct: 90 VGNVYYKQNELDSAIEAFK--------------DAVNIDPTTPAGYLGMGRVLLKQ---- 131
Query: 119 IYHGEAFNGKPTKTARSHGKKFQVTVKQET-SRILGNLGWAYMQKGNYTSAEVVYRKAQL 177
K A+ +K V Q T R+L + Y ++GN +A A
Sbjct: 132 ---------KKYPQAKEQVQKALVFNPQLTPGRLL--MSQIYQEQGNTKAAIDEIESALR 180
Query: 178 IDPDANKAC-NLSHCLIKQARYTEARSVLEDVL-----------LGKLSGSTETKTINRV 225
++P A A +L + +KQ +Y AR ED +G + E+ +N
Sbjct: 181 MNPTAWSAYQSLGNIYLKQKQYNLARKNFEDAQQLNPKIPVVAKMGYIEALIESNALNEA 240
Query: 226 KELLQEL 232
E+L+EL
Sbjct: 241 SEILREL 247
>gi|358392689|gb|EHK42093.1| hypothetical protein TRIATDRAFT_126880 [Trichoderma atroviride IMI
206040]
Length = 619
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 28/53 (52%), Gaps = 4/53 (7%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVL 205
L + GNYT A+ K LI N AC HCLIKQ+R+ EA +VL
Sbjct: 41 LAQVHFATGNYTRAQAFLSKQDLIGR--NPACRYLAGHCLIKQSRFEEALAVL 91
>gi|119486282|ref|ZP_01620341.1| hypothetical protein L8106_16549 [Lyngbya sp. PCC 8106]
gi|119456495|gb|EAW37625.1| hypothetical protein L8106_16549 [Lyngbya sp. PCC 8106]
Length = 855
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 53/190 (27%), Positives = 78/190 (41%), Gaps = 41/190 (21%)
Query: 52 VDSALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
V S+L ++A + Q R EA +A+ ++ L SL+N L LYK GR
Sbjct: 644 VASSLNNLAGLCHSQGRNREAEPLYQQALAMYQRLFDGDHPDVASSLNN-LAYLYKSQGR 702
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
E L +Q L M F G A S L NL + Y +G
Sbjct: 703 NSEAEPLYQQALEMT--QRLFEGDHPDVATS----------------LNNLAYLYKSQGR 744
Query: 165 YTSAEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSVLEDVL------ 209
+ AE +Y++A +L + D A NL++ Q RY+EA + + L
Sbjct: 745 NSEAEPLYQQALEMRQRLFEGDHPDVATSLNNLAYLYESQGRYSEAEPLFQQALFMLVKT 804
Query: 210 LGKLSGSTET 219
LGK +T+T
Sbjct: 805 LGKDHPNTQT 814
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 73/179 (40%), Gaps = 37/179 (20%)
Query: 49 GDRVD--SALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSK---QAQESLDNVLIDLY 99
GD D ++L ++A + K Q R EA +A+ ++ L SL+N L LY
Sbjct: 513 GDHPDVATSLNNLAYLYKSQGRNSEAEPLLQQALAMYQRLFEGDHPNIATSLNN-LAYLY 571
Query: 100 KKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAY 159
+ GR E L +Q L M F G A S L NLG Y
Sbjct: 572 ESQGRYSEAEALFQQALEM--RQRLFEGDHPSVATS----------------LNNLGLLY 613
Query: 160 MQKGNYTSAEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSVLEDVL 209
+G Y+ AE +Y++A +L + D A+ NL+ Q R EA + + L
Sbjct: 614 NSQGRYSQAEPLYQQALAMRQRLFEGDHPNVASSLNNLAGLCHSQGRNREAEPLYQQAL 672
>gi|298241367|ref|ZP_06965174.1| transcriptional regulator, XRE family [Ktedonobacter racemifer DSM
44963]
gi|297554421|gb|EFH88285.1| transcriptional regulator, XRE family [Ktedonobacter racemifer DSM
44963]
Length = 850
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 42/86 (48%), Gaps = 11/86 (12%)
Query: 154 NLGWAYMQKGNYTSAEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSV 204
NLG+ Y +G YT AE Y++A Q+ P+ AN NL+ IKQ RY E +
Sbjct: 614 NLGFLYYCQGRYTQAEHYYQRALDLSRQISGPEHPDTANVLNNLALTYIKQERYAECERI 673
Query: 205 LEDVL--LGKLSGSTETKTINRVKEL 228
L + G+ + TIN ++ +
Sbjct: 674 YGQALAIYHRTLGAEHSHTINALQNV 699
>gi|345860131|ref|ZP_08812457.1| tetratricopeptide repeat family protein [Desulfosporosinus sp. OT]
gi|344326772|gb|EGW38224.1| tetratricopeptide repeat family protein [Desulfosporosinus sp. OT]
Length = 234
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 35/67 (52%), Gaps = 3/67 (4%)
Query: 136 HGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIK 194
+ +K Q +R+ NLG+ Y KGN A + + +IDP D + NLS LIK
Sbjct: 77 YKQKVQTEPNNTEARV--NLGYTYFLKGNDGQAIQEFNQVLVIDPKDFDAYYNLSLVLIK 134
Query: 195 QARYTEA 201
+ RY EA
Sbjct: 135 EKRYNEA 141
>gi|46123429|ref|XP_386268.1| hypothetical protein FG06092.1 [Gibberella zeae PH-1]
Length = 616
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 28/55 (50%), Gaps = 4/55 (7%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVLED 207
L + GNYT A+ LI N +C HCLIKQ+R+ EA SVL D
Sbjct: 41 LAQVHFATGNYTRAQAFLSSQNLISR--NPSCRYLAGHCLIKQSRFDEALSVLGD 93
>gi|408396447|gb|EKJ75605.1| hypothetical protein FPSE_04248 [Fusarium pseudograminearum CS3096]
Length = 616
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 28/55 (50%), Gaps = 4/55 (7%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVLED 207
L + GNYT A+ LI N +C HCLIKQ+R+ EA SVL D
Sbjct: 41 LAQVHFATGNYTRAQAFLSSQNLISR--NPSCRYLAGHCLIKQSRFDEALSVLGD 93
>gi|383810579|ref|ZP_09966073.1| tetratricopeptide repeat / tetratricopeptide repeat multi-domain
protein [Prevotella sp. oral taxon 306 str. F0472]
gi|383356947|gb|EID34437.1| tetratricopeptide repeat / tetratricopeptide repeat multi-domain
protein [Prevotella sp. oral taxon 306 str. F0472]
Length = 230
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 34/68 (50%), Gaps = 5/68 (7%)
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC---NLSHCLIKQARY 198
VT+ +TS L G NY AEV YRKA ID + N A NL+ CL Q +
Sbjct: 17 VTLNAQTSNQLIRKGNKNFSSKNYAQAEVFYRKA--IDKEPNNAIGNYNLARCLQAQHKN 74
Query: 199 TEARSVLE 206
EA+ + E
Sbjct: 75 AEAKKIYE 82
>gi|260804643|ref|XP_002597197.1| hypothetical protein BRAFLDRAFT_66323 [Branchiostoma floridae]
gi|229282460|gb|EEN53209.1| hypothetical protein BRAFLDRAFT_66323 [Branchiostoma floridae]
Length = 2044
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 39/92 (42%), Gaps = 14/92 (15%)
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLID-----PDANKA-------CNLSHCLI 193
Q R L NLG AYM ++A + +A +D PD N CNL H L
Sbjct: 1176 QNVVRALNNLGVAYMALREDSAAAAFFTEA--LDTLRNAPDENAGRVIVACVCNLGHVLY 1233
Query: 194 KQARYTEARSVLEDVLLGKLSGSTETKTINRV 225
Y AR E++ K G TET ++ V
Sbjct: 1234 NLGDYQAARGHYEEIANKKRRGETETADMDAV 1265
>gi|356535232|ref|XP_003536152.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC-like [Glycine max]
Length = 988
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 62/156 (39%), Gaps = 10/156 (6%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P+ AI + A+ A ++A + +Q ++D A+ K C + E+ +N
Sbjct: 284 PQEAIACYQHALQTRPNYGMAYGNLASIYYEQGQLDMAILHYKQ-AVACDPRFLEAYNN- 341
Query: 95 LIDLYKKCGRLDEQIELLKQKLRM--------IYHGEAFNGKPTKTARSHGKKFQVTVKQ 146
L + K GR++E I+ Q L + G + A + K + V
Sbjct: 342 LGNALKDVGRVEEAIQCYNQCLTLQPNHPQALTNLGNIYMEWNMVAAAAQYYKATLNVTT 401
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
S NL Y Q+GNY A Y + IDP A
Sbjct: 402 GLSAPYNNLAIIYKQQGNYVDAISCYNEVLRIDPLA 437
>gi|260592518|ref|ZP_05857976.1| BatC protein [Prevotella veroralis F0319]
gi|260535564|gb|EEX18181.1| BatC protein [Prevotella veroralis F0319]
Length = 230
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 34/68 (50%), Gaps = 5/68 (7%)
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC---NLSHCLIKQARY 198
VT+ +TS L G NY AEV YRKA ID + N A NL+ CL Q +
Sbjct: 17 VTLNAQTSNQLIRKGNKNFSSKNYAQAEVFYRKA--IDKEPNNAIGNYNLARCLQAQHKN 74
Query: 199 TEARSVLE 206
EA+ + E
Sbjct: 75 AEAKKIYE 82
>gi|262193489|ref|YP_003264698.1| hypothetical protein [Haliangium ochraceum DSM 14365]
gi|262076836|gb|ACY12805.1| TPR repeat-containing protein [Haliangium ochraceum DSM 14365]
Length = 963
Score = 39.7 bits (91), Expect = 1.6, Method: Composition-based stats.
Identities = 43/169 (25%), Positives = 78/169 (46%), Gaps = 15/169 (8%)
Query: 22 YVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRH 81
YV A+ V L ++D + A+ F I AG + ++ + QD +DEA+ + + +
Sbjct: 750 YVSAE-VALRQRDVDTALEHFRALIRAGGDSFDVRGRLGMIARLQDDLDEAIRQLCAAKR 808
Query: 82 LCSKQAQESLDNVLIDLYKKCGRLDEQIE------LLKQ----KLRMIYHGEAFNGKPTK 131
L +++ + L D+YK+ GR+DE + +L+Q LR + A + K
Sbjct: 809 LDPERSYPYSE--LYDVYKEAGRMDEALAELETYVMLEQMQYGPLRELVGAYAERARWDK 866
Query: 132 TARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
R +G + V + + + LG AY++ G+ A + A L P
Sbjct: 867 V-RVYG-EMAVHINPSDAELFLMLGQAYLETGDPAQALFTFDSALLAKP 913
>gi|196233636|ref|ZP_03132477.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
gi|196222306|gb|EDY16835.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
Length = 752
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 45/184 (24%), Positives = 77/184 (41%), Gaps = 18/184 (9%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHL--CSKQAQESLD 92
P+ A+ F +A+ AL ++ + L R+DEA+ + + + +L
Sbjct: 192 PQEAMTAFRRALEINPDYADALNNLGITLADAGRLDEAIACYRRALQINPAGAETNTNLG 251
Query: 93 NVLIDLYKKCGRLDEQ-------IELLKQKLRMIYH--GEAFNGKPTKTARSHGKKFQVT 143
N L +L RLDE IEL K L Y+ G A + S +
Sbjct: 252 NALFELQ----RLDEAAAAFRAVIEL-KPDLAQAYNNLGNALREQGALNEASAEFLHALA 306
Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEAR 202
++ ++ NLG A +G +A YR+A + P D+ N + L+ Q R + R
Sbjct: 307 IEPNSADFHNNLGNALKDRGEIDAALDAYRRAMELAPDDSGPWTNFVYTLLFQPRVDD-R 365
Query: 203 SVLE 206
+++E
Sbjct: 366 ALIE 369
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 50/206 (24%), Positives = 86/206 (41%), Gaps = 16/206 (7%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCS--KQAQESLD 92
PE A+ L +A+NA + +A ++ L + R++EA +A L QA +L
Sbjct: 56 PEIAVELIQQALNAAPQHVAAHFNLGNALSELGRMEEAADAFGRATELQPDYAQAHHNLG 115
Query: 93 NVLIDLYKKCGRLDEQIELLKQKLRM------IYH--GEAFNGKPTKTARSHGKKFQVTV 144
+ L K GR DE I ++ + + Y+ G A + + + + +
Sbjct: 116 SAL----AKRGRFDEAIAAFQRAIELKPDYASAYNNLGLALKAQARRDEALAAFQQAIAL 171
Query: 145 KQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEARS 203
+ + + NLG + + A +R+A I+PD A+ NL L R EA +
Sbjct: 172 QPDHAEAHFNLGNIFREWARPQEAMTAFRRALEINPDYADALNNLGITLADAGRLDEAIA 231
Query: 204 VLEDVLLGKLSGS-TETKTINRVKEL 228
L +G+ T T N + EL
Sbjct: 232 CYRRALQINPAGAETNTNLGNALFEL 257
>gi|418065486|ref|ZP_12702859.1| Tetratricopeptide TPR_2 repeat protein [Geobacter metallireducens
RCH3]
gi|373562226|gb|EHP88443.1| Tetratricopeptide TPR_2 repeat protein [Geobacter metallireducens
RCH3]
Length = 592
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 51/220 (23%), Positives = 91/220 (41%), Gaps = 20/220 (9%)
Query: 6 DQPYHVVHKLPPGDSPYVRA-KHVQLVEKD---PEAAIVLFWKAINAG-DRVDSALKDMA 60
D+ K+PPG + YV A H+ + +D PE I + AI A D++D L +A
Sbjct: 354 DKAIEEFSKIPPGTANYVEALGHIAFMYRDQEKPEKGIQILTDAITANPDKLDLYLY-LA 412
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLK------- 113
+ + D+ E + +K ++ + L + + K G +E I +K
Sbjct: 413 GLYESMDKFSEGLAVLKGVEGKFAEDPR--LHFRMGTILDKMGNKEESIARMKRVIAITP 470
Query: 114 ---QKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSA-E 169
Q L + + A G A + KK V ++ IL +LGW Y + Y A
Sbjct: 471 DDAQALNYLGYTYAEMGIKLDEALQYLKK-AVALRPNDGFILDSLGWVYFKMKRYDEAVP 529
Query: 170 VVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
++ R ++++ D +L+ Y A + + +L
Sbjct: 530 LLERSLKVVEDDLTVMEHLADAYAANHEYRNALKLYKKIL 569
>gi|404496093|ref|YP_006720199.1| TPR domain-containing protein [Geobacter metallireducens GS-15]
gi|78193702|gb|ABB31469.1| TPR domain protein [Geobacter metallireducens GS-15]
Length = 573
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 51/220 (23%), Positives = 91/220 (41%), Gaps = 20/220 (9%)
Query: 6 DQPYHVVHKLPPGDSPYVRA-KHVQLVEKD---PEAAIVLFWKAINAG-DRVDSALKDMA 60
D+ K+PPG + YV A H+ + +D PE I + AI A D++D L +A
Sbjct: 335 DKAIEEFSKIPPGTANYVEALGHIAFMYRDQEKPEKGIQILTDAITANPDKLDLYLY-LA 393
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLK------- 113
+ + D+ E + +K ++ + L + + K G +E I +K
Sbjct: 394 GLYESMDKFSEGLAVLKGVEGKFAEDPR--LHFRMGTILDKMGNKEESIARMKRVIAITP 451
Query: 114 ---QKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSA-E 169
Q L + + A G A + KK V ++ IL +LGW Y + Y A
Sbjct: 452 DDAQALNYLGYTYAEMGIKLDEALQYLKK-AVALRPNDGFILDSLGWVYFKMKRYDEAVP 510
Query: 170 VVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
++ R ++++ D +L+ Y A + + +L
Sbjct: 511 LLERSLKVVEDDLTVMEHLADAYAANHEYRNALKLYKKIL 550
>gi|67923685|ref|ZP_00517153.1| TPR repeat:TPR repeat [Crocosphaera watsonii WH 8501]
gi|67854485|gb|EAM49776.1| TPR repeat:TPR repeat [Crocosphaera watsonii WH 8501]
Length = 530
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 80/194 (41%), Gaps = 17/194 (8%)
Query: 27 HVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQ 86
H Q E EAAI + KA+ D A + + L R D+A+ + L +
Sbjct: 252 HTQQAEY--EAAIASYDKALQLTPDYDLAWNNRGIALANVGRFDKAIASYDKALQLTPDK 309
Query: 87 AQESLDNVLIDLYKKCGRLDEQIELLKQKLRM------IYHGEAFN----GKPTKTARSH 136
E+ N I L+ + GR DE I + L++ ++ + G+ + S+
Sbjct: 310 -DEAWCNRGIALFNR-GRSDEAIASFDKALQLKPDDHQAWNNRGYALRQLGRSDEAIASY 367
Query: 137 GKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQ 195
K Q+ K + + N G+A Q G + A Y KA + PD +A N L K
Sbjct: 368 DKALQL--KPDDHQAWNNRGYALRQLGRFDEAIASYYKALQLKPDYYEAWHNRGIALRKL 425
Query: 196 ARYTEARSVLEDVL 209
R+ EA + + L
Sbjct: 426 GRFDEAIASYDKAL 439
>gi|387126389|ref|YP_006294994.1| hypothetical protein Q7A_501 [Methylophaga sp. JAM1]
gi|386273451|gb|AFI83349.1| hypothetical protein Q7A_501 [Methylophaga sp. JAM1]
Length = 591
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 65/156 (41%), Gaps = 19/156 (12%)
Query: 60 AVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMI 119
A L++ D +E+VE + L + L + + GRL E+L+Q LR I
Sbjct: 417 AAFLREHDMPEESVEVLTE--ALTNHPGNLDLLYSRAMVAESIGRL----EMLEQDLRTI 470
Query: 120 Y-----HGEAFNG-KPTKTARS--HGKKFQVTVKQETSR-----ILGNLGWAYMQKGNYT 166
+ +A N T T R+ H + Q+ K R L +LGW Y + GN
Sbjct: 471 LKIDPENAQAMNALGYTLTDRTDRHEEALQLINKALELRPNDPYYLDSLGWVYYRMGNLE 530
Query: 167 SAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEAR 202
AE R+A I D +L L Q + EAR
Sbjct: 531 KAEYYLREAVEIQADVEFVAHLGEVLWMQNKKNEAR 566
>gi|118578955|ref|YP_900205.1| hypothetical protein Ppro_0516 [Pelobacter propionicus DSM 2379]
gi|118501665|gb|ABK98147.1| Tetratricopeptide TPR_2 repeat protein [Pelobacter propionicus DSM
2379]
Length = 673
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 85/208 (40%), Gaps = 42/208 (20%)
Query: 7 QPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRV-------DSALKDM 59
+ Y KL P DSP ++ K +++ K E ++ AI+A D V A +++
Sbjct: 396 EQYSEFLKLKP-DSPDIQLKLARILAKKKETSL-----AIDAYDAVLKSAPDNPEANREI 449
Query: 60 AVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMI 119
A + K + D AV + L A + L+ LY K + DE ELLK + +
Sbjct: 450 AALYKAKGMNDRAVAHYRKALELRKDDADTR--SALVSLYVKNRQYDEITELLKGAVELF 507
Query: 120 YHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLID 179
+ A +H K LG + K Y SA Y+KA +
Sbjct: 508 ----------PEDANNHYK----------------LGLIHEFKKEYGSAIACYQKAAELR 541
Query: 180 PDANKACN-LSHCLIKQARYTEARSVLE 206
PD +A N L +K R +EAR LE
Sbjct: 542 PDHARALNALGRMYMKTDRISEAREALE 569
>gi|116200818|ref|XP_001226221.1| hypothetical protein CHGG_10954 [Chaetomium globosum CBS 148.51]
gi|88175668|gb|EAQ83136.1| hypothetical protein CHGG_10954 [Chaetomium globosum CBS 148.51]
Length = 1002
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 38/72 (52%), Gaps = 9/72 (12%)
Query: 149 SRILGNLGWAYMQKGNYTSAEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYT 199
+R++ NLG Y ++G Y AE + ++A ++ P+ + NL H I Q RYT
Sbjct: 737 ARVIHNLGNLYAEQGRYKEAEALLKRALERNEEVWGPEREWTLSTVSNLGHVYIYQQRYT 796
Query: 200 EARSVLEDVLLG 211
EA ++ + L G
Sbjct: 797 EAEALYDRALEG 808
>gi|351730791|ref|ZP_08948482.1| tetratricopeptide tpr_1 repeat-containing protein [Acidovorax
radicis N35]
Length = 809
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 46/191 (24%), Positives = 83/191 (43%), Gaps = 25/191 (13%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P A+ + +AI R+ +A + L+Q DR++EA++ C +QA +LD+
Sbjct: 137 PLDAVASYDQAIALQPRLAAAFNNKGNALRQLDRLEEALK--------CYEQA-FALDSD 187
Query: 95 LID-------LYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHG-----KKFQ 141
+D L+ GR DE ++ + + + H EA NGK A+ F+
Sbjct: 188 DVDACQNMGMLHADAGRQDEALKCFDRVIALRPQHAEAHNGKGAILAQREQWTQAIAHFE 247
Query: 142 VTVKQETSRILG--NLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARY 198
++ + NLG A ++A ++KA + P D L+ L++ RY
Sbjct: 248 AAIQGNEKLVQAHKNLGLAQRSLFQLSNAVTAFQKAAQLSPQDVEILSFLALSLLEAGRY 307
Query: 199 TEARSVLEDVL 209
EA + + +
Sbjct: 308 AEALAAYDQAI 318
>gi|302416639|ref|XP_003006151.1| anaphase-promoting complex subunit cut9 [Verticillium albo-atrum
VaMs.102]
gi|261355567|gb|EEY17995.1| anaphase-promoting complex subunit cut9 [Verticillium albo-atrum
VaMs.102]
Length = 620
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLED 207
L + GNYT A+ K L+ + + +HCLIKQ+RY EA +L D
Sbjct: 43 LAQVHFATGNYTRAQTFLSKQDLVTRNTSCRYLAAHCLIKQSRYDEALGLLGD 95
>gi|389874415|ref|YP_006373771.1| hypothetical protein TMO_c0179 [Tistrella mobilis KA081020-065]
gi|388531595|gb|AFK56789.1| TPR repeat-containing protein [Tistrella mobilis KA081020-065]
Length = 579
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 73/170 (42%), Gaps = 18/170 (10%)
Query: 50 DRVDSALKDMAVVL----KQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID--LYKKCG 103
+RV + KD A+ L ++Q+R D+A A S E D Y++ G
Sbjct: 367 ERVAAGQKDAAISLAELYRRQERWDDAARAYSSALAALVDPGPEHWDLFYARGVAYERAG 426
Query: 104 RLDE-QIELLK-------QKLRMIYHGEAF--NGKPTKTARSHGKKFQVTVKQETSRILG 153
R E + + LK Q L + Y G ++ G AR+ +K VT++ I+
Sbjct: 427 RWPEAESDFLKALQLSPDQPLVLNYLGYSWVEKGMNLDRARAMLEK-AVTLRPRDGFIVD 485
Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEAR 202
+LGWA Q G+Y A +A L+ P D +L + R EAR
Sbjct: 486 SLGWALFQMGDYEGAVARLEQAVLLQPNDPTINDHLGDAYWRVGREREAR 535
>gi|113474879|ref|YP_720940.1| hypothetical protein Tery_1102 [Trichodesmium erythraeum IMS101]
gi|110165927|gb|ABG50467.1| Tetratricopeptide TPR_2 [Trichodesmium erythraeum IMS101]
Length = 1507
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 67/170 (39%), Gaps = 35/170 (20%)
Query: 56 LKDMAVVLKQQDRVDEA-------VEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQ 108
L ++A + K Q R EA +E K Q L+N L LY+ GR E
Sbjct: 460 LNNLAGLYKSQGRYSEAEPLYKQAIEIHKVALPANHPQRASGLNN-LAGLYRAQGRYSEA 518
Query: 109 IELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSA 168
LLKQ + IY K A F T L NL Y +G Y+ A
Sbjct: 519 EPLLKQAIE-IY----------KVALPANHPFLATN-------LNNLAELYRAQGRYSEA 560
Query: 169 EVVYRKAQLID--------PD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
E +Y++A ID P+ A NL+ Q RY+EA + + +
Sbjct: 561 EPLYKQAIEIDNIALPANHPELATNLNNLAELYRAQGRYSEAEPLYKQAI 610
>gi|390438134|ref|ZP_10226629.1| NB-ARC domain protein [Microcystis sp. T1-4]
gi|389838454|emb|CCI30753.1| NB-ARC domain protein [Microcystis sp. T1-4]
Length = 896
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 72/174 (41%), Gaps = 35/174 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
V S+L +AV+ Q R +EA ++A++ + L + SL+N L LYK G+
Sbjct: 549 VASSLNGLAVLYHLQGRYEEAEPLYLQALELRKQLLGENHPDVATSLNN-LAGLYKSQGK 607
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
+E L Q L + K+ + + L NL Y +G
Sbjct: 608 YEETEPLYLQALEL------------------RKRLLGENHTDFATSLNNLAELYRSQGR 649
Query: 165 YTSAEVVY------RKAQLIDPD---ANKACNLSHCLIKQARYTEARSVLEDVL 209
YT AE +Y RK L D A NL+ Q +YTEA S+L + L
Sbjct: 650 YTEAEPLYLEALDLRKRLLGDNHPKVAQSLNNLAGLYYSQGKYTEAESLLLEAL 703
>gi|158341205|ref|YP_001522431.1| TPR repeat-containing serine/threonin protein kinase [Acaryochloris
marina MBIC11017]
gi|158311446|gb|ABW33057.1| serine/threonine protein kinase with TPR repeats [Acaryochloris
marina MBIC11017]
Length = 578
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 38/88 (43%), Gaps = 13/88 (14%)
Query: 149 SRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDV 208
+R+L NLGWA + G+Y AE ++A DP+ A HCLI Q + L
Sbjct: 493 ARLLKNLGWAEFKLGDYRQAEGRLQQAIRSDPEMAAA----HCLIAQVYQEQGNDTL--- 545
Query: 209 LLGKLSGSTETKTINRVKELLQELEPWQ 236
K R+ L E+E W+
Sbjct: 546 ------AQQSWKACLRLDSFLPEVEEWK 567
>gi|358382374|gb|EHK20046.1| hypothetical protein TRIVIDRAFT_47920 [Trichoderma virens Gv29-8]
Length = 620
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 28/53 (52%), Gaps = 4/53 (7%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVL 205
L + GNYT A+ K LI N AC HCLIKQ+R+ EA +VL
Sbjct: 41 LAQVHFATGNYTRAQSFLSKQDLISR--NPACRYLAGHCLIKQSRFDEALAVL 91
>gi|146306092|ref|YP_001186557.1| hypothetical protein Pmen_1058 [Pseudomonas mendocina ymp]
gi|145574293|gb|ABP83825.1| Tetratricopeptide TPR_2 repeat protein [Pseudomonas mendocina ymp]
Length = 575
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 30/60 (50%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLL 210
IL +LGW + GN AE + R+A PD A +L L Q + EAR V D L+
Sbjct: 495 ILDSLGWVNYRLGNLEEAERLLRQALEKFPDHEVAAHLGEVLWAQGKQREARRVWRDALV 554
>gi|255600764|ref|XP_002537530.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
communis]
gi|223516024|gb|EEF24853.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
communis]
Length = 450
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTE 200
V++ E+ L LG + N E YR+A +DPD +A NLS+ L++QAR+ E
Sbjct: 100 VSLAPESPTPLSQLGAVLVCMHNEPEGEDCYRRALALDPDHARAQFNLSYLLLRQARFDE 159
Query: 201 ARSVLE 206
+L+
Sbjct: 160 GWRMLD 165
>gi|90411197|ref|ZP_01219210.1| hypothetical protein P3TCK_06512 [Photobacterium profundum 3TCK]
gi|90328043|gb|EAS44364.1| hypothetical protein P3TCK_06512 [Photobacterium profundum 3TCK]
Length = 361
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI-DPDANKACNLSHCLIKQARYTEAR 202
+ ++ + L NLG++Y G+ AEV+ R+A I D NL I+ RY++A
Sbjct: 218 INHKSRQALTNLGYSYYLSGDLKQAEVINRRATTIYQTDKRAWSNLGLVYIRSKRYSDAL 277
Query: 203 SVLEDVL 209
L++V+
Sbjct: 278 DALQNVM 284
>gi|428212874|ref|YP_007086018.1| capsular polysaccharide biosynthesis protein [Oscillatoria
acuminata PCC 6304]
gi|428001255|gb|AFY82098.1| capsular polysaccharide biosynthesis protein [Oscillatoria
acuminata PCC 6304]
Length = 791
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 81/186 (43%), Gaps = 17/186 (9%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCS--KQAQESLDN 93
E A F KAI + + + +L+QQ+ ++ A++ KS L +A +L
Sbjct: 195 EEAFSCFQKAIMLQPNLAAGHFKLGYLLQQQNELESAIDCFKSTIELQPDWNEAHNNLGL 254
Query: 94 VLIDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGK--------KFQVTV 144
VL +K R +E I K+ + + EA+ T T + GK + + +
Sbjct: 255 VL----RKINREEEAISSFKKAIEINPNFAEAYRNLGT-TLQQQGKLEAAAACLRDCIKI 309
Query: 145 KQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARS 203
+ + GNLG+ Q+G A+ R A ++PD A NL + L ++ E+ S
Sbjct: 310 QPNFALAHGNLGYVLEQQGKLDEAKASLRHAIALEPDLAMAYGNLGNILHREGELEESIS 369
Query: 204 VLEDVL 209
++ +
Sbjct: 370 CFQNAI 375
>gi|323489123|ref|ZP_08094357.1| Tetratricopeptide repeat protein [Planococcus donghaensis MPA1U2]
gi|323397246|gb|EGA90058.1| Tetratricopeptide repeat protein [Planococcus donghaensis MPA1U2]
Length = 1382
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 72/155 (46%), Gaps = 22/155 (14%)
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSV 204
++T + LG M++ +T AE YRK+ +DP D N++H L+KQ ++ E
Sbjct: 720 EKTDVLHSYLGHIAMEQEQFTEAETEYRKSLELDPTDLFTVTNIAHTLLKQEKHVELNKF 779
Query: 205 LEDVLLGKLSGSTETKTINRVKELL-QELEPWQSIPPSLTTKKSSLEDAFLEGLDDLMNQ 263
V+ K +G +T NR +L QE E + ++ +S +E L L ++ Q
Sbjct: 780 ---VMQYKETG--DTFYFNRTAAMLWQESEDDLAKTLAIDLLESGMERDTL-NLREIAEQ 833
Query: 264 WTP------YRSRRLPIFEEISPFRD-----QLAC 287
+ +RSR +FE FR+ QL C
Sbjct: 834 YAEFGEAPQFRSR---MFERFKRFRETAPDSQLLC 865
>gi|363728118|ref|XP_416374.3| PREDICTED: transmembrane and TPR repeat-containing protein 1
[Gallus gallus]
Length = 797
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 76/188 (40%), Gaps = 22/188 (11%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLC--SKQAQESLDN 93
E A++L +I G A +A +L +Q+R+ EA E K+ C S +
Sbjct: 480 EEAVILLRDSIKYGPDFADAYSSLASLLAEQERLKEAEEVYKAGIENCPESSDLHNNYGV 539
Query: 94 VLIDLYKKCGRLDEQIELLKQKLRM--IYHGEAFN--------GKPTKTARSHGKKFQVT 143
L+D G + + +Q + + +H N G+ + + + +V+
Sbjct: 540 FLVD----TGAPERAVSHYRQAIHLSPTHHVAMVNLGRLHRSLGQNKEAEAWYKRALKVS 595
Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN--LSHCLIKQARYTEA 201
K E IL LG Y G Y A VYR+A + P +NK L+ L R EA
Sbjct: 596 RKAE---ILSPLGALYYNTGRYEEALQVYREAASLQP-SNKETRLALAQVLAMMGRTKEA 651
Query: 202 RSVLEDVL 209
+ VL
Sbjct: 652 EKMTNHVL 659
>gi|386851106|ref|YP_006269119.1| cold-shock protein, DNA-binding protein [Actinoplanes sp. SE50/110]
gi|359838610|gb|AEV87051.1| cold-shock protein, DNA-binding protein [Actinoplanes sp. SE50/110]
Length = 1462
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 51/96 (53%), Gaps = 3/96 (3%)
Query: 22 YVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAI-KSFR 80
Y +AK+++ KD E A + + AI +SA+KD+A + ++ D + A++ I + F
Sbjct: 369 YDQAKYLEHRAKDLERAKMAYRDAIRKNINAESAIKDLAWLTRRVDGPEAALQVIEEEFA 428
Query: 81 HLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKL 116
A SLD +LID Y R ++ + LL+ L
Sbjct: 429 DRLPPSA--SLDQILIDFYMGSQRYEDALRLLEPML 462
>gi|301094298|ref|XP_002896255.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase,
putative [Phytophthora infestans T30-4]
gi|262109650|gb|EEY67702.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase,
putative [Phytophthora infestans T30-4]
Length = 977
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 91/230 (39%), Gaps = 22/230 (9%)
Query: 42 FWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCS--KQAQESLDNVLIDLY 99
+ +AI A ++A +LK ++D AVE + L A +L N L
Sbjct: 195 YAQAIRVKPSFAIAWSNLAGLLKDDGQLDAAVEHYREAIRLAPDFADAYSNLGNAL---- 250
Query: 100 KKCGRLDEQIELLKQKLRM-----IYHGE---AFNGKPTKTARSHGKKFQVTVKQETSRI 151
K+ GR+DE I+ K L++ I HG + H + + ++
Sbjct: 251 KESGRVDEAIQAYKSALQIRPNFAIAHGNLASCYYDAGQMELAIHTFRHAIQLEPNFPDA 310
Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHC-----LIKQAR--YTEARS 203
NLG A + G+ A YR A + PD A NL + L+K+A YT A
Sbjct: 311 YNNLGNALRECGHLEQAVTCYRTALQLKPDHPHAYNNLGNALKDKGLVKEALHCYTTAAR 370
Query: 204 VLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAF 253
+L + + K ++ + L + +I P+ S++ + F
Sbjct: 371 LLPQFAAAHSNIGSVLKEQGKLDQALAHYQQAITIDPNFADAYSNMGNVF 420
>gi|367036871|ref|XP_003648816.1| hypothetical protein THITE_2106692 [Thielavia terrestris NRRL 8126]
gi|346996077|gb|AEO62480.1| hypothetical protein THITE_2106692 [Thielavia terrestris NRRL 8126]
Length = 1035
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 51/128 (39%), Gaps = 27/128 (21%)
Query: 93 NVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRIL 152
N L ++Y+K GRL E + ++ A G+ R H Q +
Sbjct: 539 NNLGNVYRKRGRLTEAESMFQR---------ALEGQENALGRDHPSTLQT---------V 580
Query: 153 GNLGWAYMQKGNYTSAEVVYRKA------QLIDPDA---NKACNLSHCLIKQARYTEARS 203
NLG Y +G T AE +++ A L D + C+L KQ R TEA S
Sbjct: 581 HNLGIFYANQGRLTEAESMFQWALEGQEKALAQDDVSTLDTVCSLGSVYWKQGRLTEAES 640
Query: 204 VLEDVLLG 211
+ + L G
Sbjct: 641 MYQRALEG 648
>gi|410914070|ref|XP_003970511.1| PREDICTED: LOW QUALITY PROTEIN: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
[Takifugu rubripes]
Length = 1036
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 46/198 (23%), Positives = 76/198 (38%), Gaps = 45/198 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A ++A LK++ V EA E + LC A +SL+N L +
Sbjct: 268 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 325
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E I+L ++ L ++++ EA PT
Sbjct: 326 IKREQGNIEEAIQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 385
Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
+ G + + + + NL + GN A YR A
Sbjct: 386 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 445
Query: 176 QLIDPDANKA-CNLSHCL 192
+ PD A CNL+HCL
Sbjct: 446 LKLKPDFPDAYCNLAHCL 463
>gi|351700623|gb|EHB03542.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit [Heterocephalus glaber]
Length = 1046
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 67/162 (41%), Gaps = 37/162 (22%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV----EAIKSFRHLCSKQAQESL 91
E A+ L+ KA+ +A ++A VL+QQ ++ E + EAI+ +
Sbjct: 344 EEAVRLYRKALEGFPDFAAAHSNLASVLQQQGKLQETLMHYKEAIR-------------I 390
Query: 92 DNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI 151
D Y G + ++++ ++ L+ + A PT A +H
Sbjct: 391 SPTFADAYSNMGNILKEMQDVQGALQC--YTRAIQINPT-FADAH--------------- 432
Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCL 192
NL +M GN A YR A + PD A CNL+HCL
Sbjct: 433 -SNLASIHMDSGNIPEAIASYRTALKLKPDFPDAYCNLAHCL 473
>gi|254445786|ref|ZP_05059262.1| tetratricopeptide repeat domain protein [Verrucomicrobiae bacterium
DG1235]
gi|198260094|gb|EDY84402.1| tetratricopeptide repeat domain protein [Verrucomicrobiae bacterium
DG1235]
Length = 453
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 29/56 (51%), Gaps = 11/56 (19%)
Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL------SHCLIKQARYTEA 201
G LG+ + +GNY SAEV Y +A L +PD NL + +K RY EA
Sbjct: 161 FGMLGYCHYAEGNYISAEVAYNRAVLAEPD-----NLDWLEGKAQTYLKAERYVEA 211
>gi|209886536|ref|YP_002290393.1| hypothetical protein OCAR_7425 [Oligotropha carboxidovorans OM5]
gi|337739942|ref|YP_004631670.1| TPR repeat-containing protein [Oligotropha carboxidovorans OM5]
gi|386028960|ref|YP_005949735.1| TPR repeat protein [Oligotropha carboxidovorans OM4]
gi|209874732|gb|ACI94528.1| TPR repeat protein [Oligotropha carboxidovorans OM5]
gi|336094028|gb|AEI01854.1| TPR repeat protein [Oligotropha carboxidovorans OM4]
gi|336097606|gb|AEI05429.1| TPR repeat protein [Oligotropha carboxidovorans OM5]
Length = 267
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 41/81 (50%), Gaps = 3/81 (3%)
Query: 127 GKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKA 185
GKP + R + ++ + + +L NLG +Y+ AE R+AQ + P D+
Sbjct: 151 GKPDEARRYYANALKI--RPDEPSVLSNLGMSYVLTKELPRAEATLRRAQQLAPNDSRIR 208
Query: 186 CNLSHCLIKQARYTEARSVLE 206
NL L Q R+TEA ++++
Sbjct: 209 QNLGLALGLQGRFTEAEAIVQ 229
>gi|72383071|ref|YP_292426.1| TPR repeat-containing protein [Prochlorococcus marinus str. NATL2A]
gi|72002921|gb|AAZ58723.1| TPR repeat [Prochlorococcus marinus str. NATL2A]
Length = 909
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 1/69 (1%)
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTE 200
+ +K + + + NLG GN AE YRKA I PD A+ NL + L + + +T+
Sbjct: 401 IQIKPDYAEVYSNLGNVLKDLGNLQDAEFSYRKAIQIKPDYADAYSNLGNILKELSNFTD 460
Query: 201 ARSVLEDVL 209
A + +D L
Sbjct: 461 AINQFKDAL 469
>gi|47222947|emb|CAF99103.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1037
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 46/198 (23%), Positives = 76/198 (38%), Gaps = 45/198 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A ++A LK++ V EA E + LC A +SL+N L +
Sbjct: 269 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 326
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E I+L ++ L ++++ EA PT
Sbjct: 327 IKREQGNIEEAIQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 386
Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
+ G + + + + NL + GN A YR A
Sbjct: 387 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 446
Query: 176 QLIDPDANKA-CNLSHCL 192
+ PD A CNL+HCL
Sbjct: 447 LKLKPDFPDAYCNLAHCL 464
>gi|347734615|ref|ZP_08867638.1| TPR repeat-containing protein [Azospirillum amazonense Y2]
gi|346922358|gb|EGY02771.1| TPR repeat-containing protein [Azospirillum amazonense Y2]
Length = 579
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 34/63 (53%), Gaps = 1/63 (1%)
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTE 200
V +K + S + GNLG A + G +AE YR+A + PD A+ NL + L R+ E
Sbjct: 95 VALKPDDSAMHGNLGIALAKLGRLDAAEGAYRRAIALRPDNADAHSNLGNVLRHHGRWDE 154
Query: 201 ARS 203
A +
Sbjct: 155 AEA 157
>gi|340519385|gb|EGR49624.1| cell division control protein 16 [Trichoderma reesei QM6a]
Length = 616
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 28/53 (52%), Gaps = 4/53 (7%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVL 205
L + GNYT A+ K LI N AC HCLIKQ+R+ EA +VL
Sbjct: 41 LAQVHFATGNYTRAQSFLSKQDLISR--NPACRYLAGHCLIKQSRFEEALAVL 91
>gi|220933709|ref|YP_002512608.1| Tetratricopeptide TPR_2 repeat-containing protein [Thioalkalivibrio
sulfidophilus HL-EbGr7]
gi|219995019|gb|ACL71621.1| Tetratricopeptide TPR_2 repeat protein [Thioalkalivibrio
sulfidophilus HL-EbGr7]
Length = 574
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 31/63 (49%)
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
E + IL +LGW + G AE R+A D A NL+ L + + EAR VLE
Sbjct: 494 EDAAILDSLGWVLYRMGRLDEAETYLRQAYDTMYDPEIASNLAMLLWDRGQRDEARRVLE 553
Query: 207 DVL 209
D L
Sbjct: 554 DAL 556
>gi|428215261|ref|YP_007088405.1| Tfp pilus assembly protein PilF [Oscillatoria acuminata PCC 6304]
gi|428003642|gb|AFY84485.1| Tfp pilus assembly protein PilF [Oscillatoria acuminata PCC 6304]
Length = 371
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 95/217 (43%), Gaps = 14/217 (6%)
Query: 28 VQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQA 87
V L ++D + A+ F + + R A + + L QQ R+DEAV A++ L +Q
Sbjct: 159 VLLRQEDYDGALQAFQQVTSRDTRYWQAYESIGTALLQQGRIDEAVTALQQAAALAPRQG 218
Query: 88 QESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQE 147
S+ +L + G +E +E KQ ++ K + + + V +
Sbjct: 219 --SIQMILGVAFLTQGNTNEALETFKQAAQLEPRNAQLQLAIGKLLQDQEQMIEALVTYQ 276
Query: 148 TSRIL--------GNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARY 198
+ L + ++Q+ +Y A V YR+ + PD A NL+ L + R
Sbjct: 277 RAASLAPDLIEPQAAIAAIHLQQEDYLQAIVAYRRLTELAPDNGDAYYNLALALRGRERI 336
Query: 199 TEARSVLEDV-LLGKLSGSTETKTINRVKELLQELEP 234
+EA L+ L + G ET+ + +V+ LL EL+P
Sbjct: 337 SEAIEQLQKAQALYQEQG--ETERLEKVEALLSELKP 371
>gi|427399267|ref|ZP_18890505.1| hypothetical protein HMPREF9710_00101 [Massilia timonae CCUG 45783]
gi|425721758|gb|EKU84666.1| hypothetical protein HMPREF9710_00101 [Massilia timonae CCUG 45783]
Length = 722
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDVLLGKL 213
LG + +KG+Y AE +R + ++PDA A NLS L +Q R EA VL++ L +
Sbjct: 489 LGELHARKGDYIKAEQAFRDSIRLEPDAVYAYANLSAVLQRQGREKEALQVLQEGLQVRP 548
Query: 214 SG 215
SG
Sbjct: 549 SG 550
>gi|62319217|dbj|BAD94413.1| spindly [Arabidopsis thaliana]
gi|62319734|dbj|BAD95289.1| spindly [Arabidopsis thaliana]
Length = 535
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 62/151 (41%), Gaps = 30/151 (19%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
+ AIV + A + A ++ V+ K +D +D+AVE + + AQ SL+N+
Sbjct: 312 DMAIVFYELAFHFNPHCAEACNNLGVLYKDRDNLDKAVECYQMALSIKPNFAQ-SLNNLG 370
Query: 96 IDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
+ +Y G++D ++ K L + EAFN N
Sbjct: 371 V-VYTVQGKMDAAASMIEKAILANPTYAEAFN---------------------------N 402
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
LG Y GN T A Y + IDPD+ A
Sbjct: 403 LGVLYRDAGNITMAIDAYEECLKIDPDSRNA 433
>gi|329895776|ref|ZP_08271152.1| TPR repeat-containing protein [gamma proteobacterium IMCC3088]
gi|328922138|gb|EGG29495.1| TPR repeat-containing protein [gamma proteobacterium IMCC3088]
Length = 807
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 67/150 (44%), Gaps = 28/150 (18%)
Query: 98 LYKKCGRLDEQIELLKQKLRM------IYHGEAFNGKPTKTARSHGKKFQ--VTVKQETS 149
L +K GRL E +++ ++ + + +Y+ A N K F+ +++K E S
Sbjct: 155 LVQKSGRLSESLDIHRKAIDLAPEDADVYYNLAVNLADLKRLDDAVSMFKHALSLKPEFS 214
Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN----LSHCLIK----------- 194
+ L NLG Q G AE+ + KA I P+ K N H L K
Sbjct: 215 QALTNLGVVLEQLGEVEDAEICFIKAIRISPNEAKIYNNLGIAQHSLGKLLASESNYRIA 274
Query: 195 ---QARYTEARSVLEDVL--LGKLSGSTET 219
+ ++ EA + L +VL LGKL+ + E+
Sbjct: 275 LTLEPQFAEANTNLGNVLRRLGKLTEAAES 304
>gi|385203261|ref|ZP_10030131.1| Flp pilus assembly protein TadD [Burkholderia sp. Ch1-1]
gi|385183152|gb|EIF32426.1| Flp pilus assembly protein TadD [Burkholderia sp. Ch1-1]
Length = 596
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIKQARYTEARSVLEDVL 209
NLG ++ G + A+ V+R+ ++P+ +ACN L+ L+K RY EA S + L
Sbjct: 174 NLGMVLLKTGRHQEAQSVFRRVLELEPEHCEACNGLAVALMKDYRYEEAESFVRRAL 230
>gi|302911406|ref|XP_003050485.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256731422|gb|EEU44772.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 616
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 38/87 (43%), Gaps = 6/87 (6%)
Query: 123 EAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
+A N ++A G K +T L + GNYT A+ LI
Sbjct: 11 DALNKAQYESAIFIGDKLLALTNDDTDAFW--LAQVHFATGNYTRAQAFLSSQDLISR-- 66
Query: 183 NKACNL--SHCLIKQARYTEARSVLED 207
N +C HCLIKQ+R+ EA +VL D
Sbjct: 67 NPSCRYLAGHCLIKQSRFDEALAVLGD 93
>gi|408418139|ref|YP_006759553.1| tetratricopeptide repeat protein, presursor [Desulfobacula
toluolica Tol2]
gi|405105352|emb|CCK78849.1| tetratricopeptide repeat protein, presursor [Desulfobacula
toluolica Tol2]
Length = 762
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 79/183 (43%), Gaps = 10/183 (5%)
Query: 12 VHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDE 71
+ K P P + + +K PE A + F KAI+ +A +A +L Q R E
Sbjct: 228 ISKKPGTAEPLILLANFYAGQKKPEKAQISFLKAIDTDPENLNAYMLLARLLNSQKRTAE 287
Query: 72 AVEAIKSFRHLCSKQAQESLDNVLIDLY---KKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
A + I+ + L + S+ N D + K + + ++ + +K + GK
Sbjct: 288 AEKFIQ--KALTIEPDNFSVQNAYADFHFSNKNILKAETLVDEILEKRPNYLPAKMLKGK 345
Query: 129 PTKTARSHGKK---FQVTVKQETSRILGN--LGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
T + H K FQ VK+E + N LG ++ +K ++ A++ KA +P+
Sbjct: 346 ILATRKDHDKAIEIFQDLVKEEPESAMFNFLLGSSFFEKKDFKQAKIFLSKALEKNPNLF 405
Query: 184 KAC 186
KA
Sbjct: 406 KAL 408
>gi|428173589|gb|EKX42490.1| hypothetical protein GUITHDRAFT_88045 [Guillardia theta CCMP2712]
Length = 315
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 48/98 (48%), Gaps = 8/98 (8%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVLEDVL---- 209
LG Y+++ + AE +R+A I+P ++ C L L+ + Y +A +VL +
Sbjct: 163 LGTIYLKQEKFQLAEYHFRRALEINPRNSVLHCYLGMALLSSSCYDDAIAVLNRAIKMDP 222
Query: 210 ---LGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
L KL + +NR +E L+EL Q I P +T
Sbjct: 223 NNPLAKLRKAIALSQLNRNEEALEELISLQQIAPREST 260
>gi|322700030|gb|EFY91787.1| putative anaphase control protein cut9 [Metarhizium acridum CQMa
102]
Length = 613
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 27/53 (50%), Gaps = 4/53 (7%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVL 205
L + GNYT A+ K LI N +C HCLIKQ+RY EA VL
Sbjct: 41 LAQVHFATGNYTRAQAFLSKQDLISR--NPSCRYLAGHCLIKQSRYEEALVVL 91
>gi|421504252|ref|ZP_15951195.1| hypothetical protein A471_13280 [Pseudomonas mendocina DLHK]
gi|400345022|gb|EJO93389.1| hypothetical protein A471_13280 [Pseudomonas mendocina DLHK]
Length = 575
Score = 38.9 bits (89), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 29/59 (49%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
IL +LGW + GN AE + R+A PD A +L L Q + EAR V D L
Sbjct: 495 ILDSLGWVNYRLGNLEEAERLLRQALEKFPDHEVAAHLGEVLWAQGKQREARRVWRDAL 553
>gi|421486676|ref|ZP_15934212.1| tetratricopeptide repeat family protein 11 [Achromobacter
piechaudii HLE]
gi|400194981|gb|EJO27981.1| tetratricopeptide repeat family protein 11 [Achromobacter
piechaudii HLE]
Length = 628
Score = 38.9 bits (89), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 41/176 (23%), Positives = 76/176 (43%), Gaps = 13/176 (7%)
Query: 60 AVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM- 118
A +L+ DR+ +AV +++ + + L LY++ RL + +L+Q + +
Sbjct: 452 AQILRDADRIQQAVSTLEAADQALPDTVEIKYE--LAMLYERQNRLADLERMLRQVIALD 509
Query: 119 IYHGEAFNGKPTKTARSHGKKFQ-----VTVKQETSR----ILGNLGWAYMQKGNYTSAE 169
H A+N A H ++ +T E S IL ++GW + G SA
Sbjct: 510 PDHAHAYNALGYTLA-DHNQRLPEALDLITQALELSPNDPFILDSMGWVKFRMGESESAA 568
Query: 170 VVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRV 225
R+A + P+A+ A +L+ L Q + +A +L L+ T + R+
Sbjct: 569 EYLRRAYSVRPEADIAAHLAEVLWSQGKRDQATELLRAALMKDPKNKTVQDVVKRL 624
>gi|322712900|gb|EFZ04473.1| anaphase control protein cut9 [Metarhizium anisopliae ARSEF 23]
Length = 613
Score = 38.9 bits (89), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 27/53 (50%), Gaps = 4/53 (7%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVL 205
L + GNYT A+ K LI N +C HCLIKQ+RY EA VL
Sbjct: 41 LAQVHFATGNYTRAQAFLSKQDLISR--NPSCRYLAGHCLIKQSRYEEALVVL 91
>gi|357441257|ref|XP_003590906.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
gi|355479954|gb|AES61157.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
Length = 744
Score = 38.9 bits (89), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 62/156 (39%), Gaps = 10/156 (6%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P+ AI + A+ A ++A + +Q ++D A+ K C + E+ +N
Sbjct: 40 PQEAIACYQHALQTRPNYGMAYGNLASIHYEQGQLDMAILHYKQ-AIACDPRFLEAYNN- 97
Query: 95 LIDLYKKCGRLDEQIELLKQKLRM--------IYHGEAFNGKPTKTARSHGKKFQVTVKQ 146
L + K GR++E I+ Q L + G + A + K + V
Sbjct: 98 LGNALKDVGRVEEAIQCYNQCLSLQPNHPQALTNLGNIYMEWNMVAAAASYYKATLNVTT 157
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
S NL Y Q+GNY A Y + IDP A
Sbjct: 158 GLSAPYNNLAIIYKQQGNYADAISCYNEVLRIDPLA 193
>gi|344257396|gb|EGW13500.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit [Cricetulus griseus]
Length = 1444
Score = 38.9 bits (89), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 64/162 (39%), Gaps = 37/162 (22%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E A+ L+ KA+ +A ++A VL+QQ ++ EA+ K + K A
Sbjct: 288 EEAVRLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPKFA-------- 339
Query: 96 IDLYKKCGRLDEQIELLKQKL----RMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI 151
D Y G ++++ ++ L R I AF A +H
Sbjct: 340 -DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAF-------ADAHS-------------- 377
Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCL 192
NL + GN A YR A + PD A CNL+HCL
Sbjct: 378 --NLASIHKDSGNIPEAIASYRTALKLKPDFPDAYCNLAHCL 417
>gi|186681150|ref|YP_001864346.1| hypothetical protein Npun_R0649 [Nostoc punctiforme PCC 73102]
gi|186463602|gb|ACC79403.1| TPR repeat-containing protein [Nostoc punctiforme PCC 73102]
Length = 411
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 34/147 (23%), Positives = 64/147 (43%), Gaps = 12/147 (8%)
Query: 74 EAIKSFRHLCSKQAQESLDNV-LIDLYKKCGRLDEQIELLKQKLRMIYHGEAF------- 125
EA +R L + ++ S + L D+ G++ E I +Q L++ +
Sbjct: 45 EAANIWRSLIERDSKNSYAYIKLADILSSQGKIAETIAAYRQGLQLTPDAAIYLKLGNFL 104
Query: 126 --NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN 183
G+ + + + ++ K +T+ I+ L + GN A V YR+A I+PD +
Sbjct: 105 VEKGRTAEAIAAFRQAVKLDAKSDTASII--LAMNLIAMGNPEEAVVAYRQAIKIEPDDD 162
Query: 184 KACNLSHCLIKQARYTEARSVLEDVLL 210
NL+ L K + EA + + L+
Sbjct: 163 NYNNLADTLFKIGKREEAIAAYREALI 189
>gi|421617611|ref|ZP_16058598.1| TPR domain-containing protein [Pseudomonas stutzeri KOS6]
gi|409780391|gb|EKN60022.1| TPR domain-containing protein [Pseudomonas stutzeri KOS6]
Length = 575
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 28/54 (51%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSV 204
IL +LGW + GN + AE + RKA PD A +L L +EAR+V
Sbjct: 495 ILDSLGWVNFRLGNLSEAEALLRKALQRFPDHEVAAHLGEVLWTLGERSEARAV 548
>gi|424914183|ref|ZP_18337547.1| Prophage tail length tape measure protein [Rhizobium leguminosarum
bv. trifolii WSM597]
gi|392850359|gb|EJB02880.1| Prophage tail length tape measure protein [Rhizobium leguminosarum
bv. trifolii WSM597]
Length = 1528
Score = 38.5 bits (88), Expect = 3.1, Method: Composition-based stats.
Identities = 33/85 (38%), Positives = 45/85 (52%), Gaps = 7/85 (8%)
Query: 138 KKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQA 196
K F VT+ Q+T++ L A M ++AE + R+ LID A A NLS Q
Sbjct: 570 KNFGVTIGQDTAKAGETL--AQMFADPASAAEALSRQYGLIDAKTAEYATNLS----GQN 623
Query: 197 RYTEARSVLEDVLLGKLSGSTETKT 221
R TEA+SVL D L +L+ + E T
Sbjct: 624 RLTEAQSVLLDALPNRLAKAAEATT 648
>gi|334121332|ref|ZP_08495403.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
vaginatus FGP-2]
gi|333455148|gb|EGK83806.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
vaginatus FGP-2]
Length = 1075
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 51/185 (27%), Positives = 75/185 (40%), Gaps = 19/185 (10%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQ--AQESLDNVL 95
AI F KA+ + A + + L R EA+ A + + K A + L N L
Sbjct: 269 AIAAFEKALEIDPKEHIAWHGLGITLNALGRNSEAIAAFEKALEIDPKAHIAWKGLGNAL 328
Query: 96 IDLYKKCGRLDEQIELLKQKLRM---IYH-----GEAFN--GKPTKTARSHGKKFQVTVK 145
L GR E I K+ L + +H G N G+ ++ + K ++ K
Sbjct: 329 NAL----GRNSEAIAAFKKALEIDPKFHHAWNGLGAPLNDLGRYSEAIAAFEKALEIDPK 384
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIKQARYTEARSV 204
+ LG G Y+ A VY KA IDP + A N L + L + RY+EA +
Sbjct: 385 FHFA--WHGLGNVLNALGRYSEAIAVYEKALEIDPKFHFAWNGLGNALGELGRYSEAIAA 442
Query: 205 LEDVL 209
E L
Sbjct: 443 YEKAL 447
>gi|297735486|emb|CBI17926.3| unnamed protein product [Vitis vinifera]
Length = 602
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 41/91 (45%), Gaps = 18/91 (19%)
Query: 90 SLDNVLIDLYKKCGRLDEQIELLKQKLR-------MIYHGEAFNGKPTKTARSHGKKFQV 142
SLD+ LID+Y KCG +D+ +++ ++ R I G A +G+ K K +
Sbjct: 360 SLDSSLIDMYSKCGDIDKAVQIFEESTRRDLFTWTSIVCGLAMHGRGEKALHYFSKMKEA 419
Query: 143 TVKQETSRILGNL-----------GWAYMQK 162
V+ + ++G L GW Y Q
Sbjct: 420 QVQPDDVTMVGVLSACAHAGLLDQGWWYFQS 450
>gi|357441255|ref|XP_003590905.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
gi|355479953|gb|AES61156.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
Length = 986
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 62/156 (39%), Gaps = 10/156 (6%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P+ AI + A+ A ++A + +Q ++D A+ K C + E+ +N
Sbjct: 282 PQEAIACYQHALQTRPNYGMAYGNLASIHYEQGQLDMAILHYKQ-AIACDPRFLEAYNN- 339
Query: 95 LIDLYKKCGRLDEQIELLKQKLRM--------IYHGEAFNGKPTKTARSHGKKFQVTVKQ 146
L + K GR++E I+ Q L + G + A + K + V
Sbjct: 340 LGNALKDVGRVEEAIQCYNQCLSLQPNHPQALTNLGNIYMEWNMVAAAASYYKATLNVTT 399
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDA 182
S NL Y Q+GNY A Y + IDP A
Sbjct: 400 GLSAPYNNLAIIYKQQGNYADAISCYNEVLRIDPLA 435
>gi|399519468|ref|ZP_10760263.1| TPR repeat-containing protein [Pseudomonas pseudoalcaligenes CECT
5344]
gi|399112564|emb|CCH36821.1| TPR repeat-containing protein [Pseudomonas pseudoalcaligenes CECT
5344]
Length = 575
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 30/59 (50%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
IL +LGW ++GN AE + R+A PD A +L L Q + +AR V D L
Sbjct: 495 ILDSLGWVNYRQGNLDEAERLLRQALEKFPDHEVAAHLGEVLWAQGKQRDARRVWRDAL 553
>gi|425465802|ref|ZP_18845109.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389831882|emb|CCI24962.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 830
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 92/214 (42%), Gaps = 47/214 (21%)
Query: 52 VDSALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
V ++L ++A++ + Q R EA ++A+ ++ L SL+N L +LY+ GR
Sbjct: 609 VANSLNNLALLYQSQGRYTEAEPLYLQALDLYKRLLGDNHPLVAASLNN-LAELYRSQGR 667
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
E L + EA + + +H + + L NL Y +G
Sbjct: 668 YTEAEPL---------YLEALDLRKQLLGDNH---------PDVATSLNNLALLYKSQGR 709
Query: 165 YTSAEVVYRKA-----QLIDPD----ANKACNLSHCLIKQARYTEARSV------LEDVL 209
YT AE +Y +A +L+ + A NL++ Q RYTEA + L L
Sbjct: 710 YTEAEPLYLEALDLYKRLLGDNHPHVATSLNNLAYLYESQGRYTEAEPLYLEALDLRKRL 769
Query: 210 LGKLSGSTETKTINRVKELLQELEP-----WQSI 238
LG +T+T N ++ L Q+ +P WQ I
Sbjct: 770 LGDNHPNTKT-CRNNLQRLRQQQKPTSLTWWQWI 802
>gi|147771783|emb|CAN60259.1| hypothetical protein VITISV_007741 [Vitis vinifera]
Length = 602
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 41/91 (45%), Gaps = 18/91 (19%)
Query: 90 SLDNVLIDLYKKCGRLDEQIELLKQKLR-------MIYHGEAFNGKPTKTARSHGKKFQV 142
SLD+ LID+Y KCG +D+ +++ ++ R I G A +G+ K K +
Sbjct: 360 SLDSSLIDMYSKCGDIDKAVQIFEESTRRDLFTWTSIVCGLAMHGRGEKALHYFSKMKEA 419
Query: 143 TVKQETSRILGNL-----------GWAYMQK 162
V+ + ++G L GW Y Q
Sbjct: 420 QVQPDDVTMVGVLSACAHAGLLDQGWWYFQS 450
>gi|346325254|gb|EGX94851.1| anaphase-promoting complex subunit Cut9 [Cordyceps militaris CM01]
Length = 626
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 28/53 (52%), Gaps = 4/53 (7%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVL 205
L + GNYT A K LI+ N +C HCLIKQ+R+ EA +VL
Sbjct: 46 LAQVHFATGNYTRAHAFLAKQDLINR--NPSCRYLAGHCLIKQSRFEEALAVL 96
>gi|229593982|ref|XP_001025913.2| TPR Domain containing protein [Tetrahymena thermophila]
gi|225567182|gb|EAS05668.2| TPR Domain containing protein [Tetrahymena thermophila SB210]
Length = 804
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 36/156 (23%), Positives = 69/156 (44%), Gaps = 16/156 (10%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK-----------SFRHLCS 84
+ AI + K + + DS ++ + K + +DEA++A + F +L
Sbjct: 418 DEAIKSYKKCLEINPKEDSCYYNLGIAYKDKGMLDEAIKAYQKCLEINPKKEICFYNLGI 477
Query: 85 KQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTV 144
+ L + I Y+KC ++ + + L + Y + G + +S+ K ++
Sbjct: 478 AYKAKGLIDEAIQSYQKCLEINPEKDTCLHNLGIAYKAK---GLLDEAIKSYQKCIEINP 534
Query: 145 KQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP 180
K++ + NLG AYM+KG A Y+K I+P
Sbjct: 535 KKDIYYM--NLGLAYMEKGMLNEAIKQYQKCIEINP 568
>gi|322421231|ref|YP_004200454.1| hypothetical protein GM18_3751 [Geobacter sp. M18]
gi|320127618|gb|ADW15178.1| Tetratricopeptide repeat [Geobacter sp. M18]
Length = 636
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 47/200 (23%), Positives = 83/200 (41%), Gaps = 30/200 (15%)
Query: 9 YHVVHKLPPGDSPY-VRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQD 67
Y + K+ P D+ ++ + K+ ++AI + AI A +++A +L++++
Sbjct: 394 YQELAKMNPSDAATQLKLARALVNTKELDSAITAYQGAIKLDAESPEAHRELANLLRKRN 453
Query: 68 RVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNG 127
+DEA + L K+ + L +Y K D LLK+ + +
Sbjct: 454 MMDEAANQYQEVLRL--KKDDQDARTALTAIYVKNKNYDSLANLLKEGVELA-------- 503
Query: 128 KPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN 187
P+ A +H K LG Y + NY +A Y++A + PD KA N
Sbjct: 504 -PS-DANAHYK----------------LGLVYEFQKNYPAATAEYKEAVNLKPDHAKALN 545
Query: 188 -LSHCLIKQARYTEARSVLE 206
+ +K R EA+ LE
Sbjct: 546 AMGRVQMKDGRIAEAKESLE 565
>gi|432879797|ref|XP_004073552.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
[Oryzias latipes]
Length = 1016
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A ++A LK++ V EA E + LC A +SL+N L +
Sbjct: 246 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 303
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E ++L ++ L ++++ EA PT
Sbjct: 304 IKREQGNIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 363
Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
+ G + + + + NL + GN A YR A
Sbjct: 364 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 423
Query: 176 QLIDPDANKA-CNLSHCL 192
+ PD A CNL+HCL
Sbjct: 424 LKLKPDFPDAYCNLAHCL 441
>gi|66347871|ref|NP_001018116.1| O-linked N-acetylglucosamine transferase isoform 3 [Danio rerio]
gi|56967376|gb|AAW31872.1| O-GlcNAc transferase variant 3 [Danio rerio]
Length = 1046
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A ++A LK++ V EA E + LC A +SL+N L +
Sbjct: 278 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 335
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E ++L ++ L ++++ EA PT
Sbjct: 336 IKREQGNIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 395
Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
+ G + + + + NL + GN A YR A
Sbjct: 396 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 455
Query: 176 QLIDPDANKA-CNLSHCL 192
+ PD A CNL+HCL
Sbjct: 456 LKLKPDFPDAYCNLAHCL 473
>gi|351707941|gb|EHB10860.1| Kinesin light chain 4 [Heterocephalus glaber]
Length = 619
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 37/72 (51%), Gaps = 9/72 (12%)
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI-----DPD----ANKACNLSHCLIKQAR 197
+ ++ L NL +G Y + E Y++A I PD A NL+ C +KQ +
Sbjct: 335 DVAKQLNNLALLCQNQGKYEAVERYYQRALAIYEGQLGPDNPNVARTKNNLASCYLKQGK 394
Query: 198 YTEARSVLEDVL 209
YTEA ++ +D+L
Sbjct: 395 YTEAETLYKDIL 406
>gi|288819196|ref|YP_003433544.1| hypothetical protein HTH_1899 [Hydrogenobacter thermophilus TK-6]
gi|384129941|ref|YP_005512554.1| Sporulation domain-containing protein [Hydrogenobacter thermophilus
TK-6]
gi|288788596|dbj|BAI70343.1| hypothetical protein HTH_1899 [Hydrogenobacter thermophilus TK-6]
gi|308752778|gb|ADO46261.1| Sporulation domain protein [Hydrogenobacter thermophilus TK-6]
Length = 359
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 1/61 (1%)
Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDV 208
++ LG AY + + AE ++KA IDP +A NL K YT+A+++LED
Sbjct: 61 KVWNALGLAYTEAKEFQKAESSFQKALEIDPAYTEAKMNLGILYYKAKDYTKAKNILEDA 120
Query: 209 L 209
L
Sbjct: 121 L 121
>gi|358400335|gb|EHK49666.1| hypothetical protein TRIATDRAFT_280830 [Trichoderma atroviride IMI
206040]
Length = 1080
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 52/127 (40%), Gaps = 25/127 (19%)
Query: 154 NLGWAYMQKGNYTSAEVVYRKA-----QLIDPDANKAC----NLSHCLIKQARYTEARSV 204
NLG Y ++ + AE +Y +A +++ P+ NL H QARY EA ++
Sbjct: 860 NLGLLYYEQDRFREAEAMYERALQGQEKVLGPEHPATLITVGNLRHVYASQARYEEAEAM 919
Query: 205 LEDVLLG--KLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDA---FLEGLDD 259
E L G K+ G T+N V L SL T ++ E+A + L
Sbjct: 920 YERALQGFEKVLGPENPATLNTVGHL-----------GSLYTSQARFEEAEAMYKRTLKG 968
Query: 260 LMNQWTP 266
W P
Sbjct: 969 FEKAWGP 975
>gi|291613082|ref|YP_003523239.1| hypothetical protein Slit_0612 [Sideroxydans lithotrophicus ES-1]
gi|291583194|gb|ADE10852.1| TPR repeat-containing protein [Sideroxydans lithotrophicus ES-1]
Length = 834
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 67/162 (41%), Gaps = 15/162 (9%)
Query: 59 MAVVLKQQDRVDEAVEAIKSFRHLCSK--QAQESLDNVLIDLYKKCGRLDEQIELLKQKL 116
+ VVL+ R +EA++ ++ L K QA +L N L L GRLDE ++ L
Sbjct: 188 LGVVLQHLGRNEEALQPMQRAVELAPKDAQAHSNLGNTLSYL----GRLDEAETSFRRAL 243
Query: 117 RMIYHGEAFNGKPTKTARSHGK--------KFQVTVKQETSRILGNLGWAYMQKGNYTSA 168
++ + T G+ + + +K + NLG +G A
Sbjct: 244 KINKDFAEAHLNLGATLHDLGRFGEAEVSYRCAIQLKPGLAEAHYNLGNTLKSQGKLEEA 303
Query: 169 EVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
YRKA I P + NL L Q + EA ++L +VL
Sbjct: 304 VASYRKALQIAPGLVGASSNLGAALQAQGKLAEAETILRNVL 345
>gi|440490240|gb|ELQ69816.1| anaphase-promoting complex subunit cut9 [Magnaporthe oryzae P131]
Length = 664
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 4/55 (7%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVLED 207
L + G+YT A+ + K+ L+ N +C +HCLIKQ +Y EA +VL D
Sbjct: 41 LAQVHFASGSYTRAQALLSKSDLVSR--NPSCRYLSAHCLIKQGQYDEALNVLGD 93
>gi|342885471|gb|EGU85469.1| hypothetical protein FOXB_03953 [Fusarium oxysporum Fo5176]
Length = 616
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 28/55 (50%), Gaps = 4/55 (7%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVLED 207
L + GNYT A+ LI N +C HCLIKQ+R+ EA S+L D
Sbjct: 41 LAQVHFATGNYTRAQAFLSSQDLIAR--NPSCRYLAGHCLIKQSRFDEALSILGD 93
>gi|389627366|ref|XP_003711336.1| anaphase-promoting complex subunit cut9 [Magnaporthe oryzae 70-15]
gi|351643668|gb|EHA51529.1| anaphase-promoting complex subunit cut9 [Magnaporthe oryzae 70-15]
gi|440469019|gb|ELQ38146.1| anaphase-promoting complex subunit cut9 [Magnaporthe oryzae Y34]
Length = 664
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 4/55 (7%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNL--SHCLIKQARYTEARSVLED 207
L + G+YT A+ + K+ L+ N +C +HCLIKQ +Y EA +VL D
Sbjct: 41 LAQVHFASGSYTRAQALLSKSDLVS--RNPSCRYLSAHCLIKQGQYDEALNVLGD 93
>gi|432328845|ref|YP_007246989.1| hypothetical protein AciM339_0950 [Aciduliprofundum sp. MAR08-339]
gi|432135554|gb|AGB04823.1| hypothetical protein AciM339_0950 [Aciduliprofundum sp. MAR08-339]
Length = 377
Score = 38.5 bits (88), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 43/95 (45%), Gaps = 11/95 (11%)
Query: 121 HGEAFNGKPTKTARSHGKKFQVTVK--------QETSRILGNLGWAYMQKGNYTSAEVVY 172
+ FNG A H K+F + K +E+ NLG Y +KG+ A VY
Sbjct: 261 YSSVFNG--MGVALMHMKRFDESEKYLREALRIRESDEYRTNLGNLYYRKGDINKAMEVY 318
Query: 173 RKAQLIDPDANKAC-NLSHCLIKQARYTEARSVLE 206
+ +PD A NL CL+++ + EAR LE
Sbjct: 319 KSVLKRNPDCYLALNNLGRCLMRKGKIDEARRYLE 353
>gi|190340092|gb|AAI63923.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase) [Danio rerio]
Length = 1046
Score = 38.5 bits (88), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A ++A LK++ V EA E + LC A +SL+N L +
Sbjct: 278 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 335
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E ++L ++ L ++++ EA PT
Sbjct: 336 IKREQGNIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 395
Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
+ G + + + + NL + GN A YR A
Sbjct: 396 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 455
Query: 176 QLIDPDANKA-CNLSHCL 192
+ PD A CNL+HCL
Sbjct: 456 LKLKPDFPDAYCNLAHCL 473
>gi|62821820|ref|NP_001017359.1| O-linked N-acetylglucosamine transferase isoform 1 [Danio rerio]
gi|56967372|gb|AAW31870.1| O-GlcNAc transferase variant 1 [Danio rerio]
Length = 1062
Score = 38.5 bits (88), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A ++A LK++ V EA E + LC A +SL+N L +
Sbjct: 278 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 335
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E ++L ++ L ++++ EA PT
Sbjct: 336 IKREQGNIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 395
Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
+ G + + + + NL + GN A YR A
Sbjct: 396 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 455
Query: 176 QLIDPDANKA-CNLSHCL 192
+ PD A CNL+HCL
Sbjct: 456 LKLKPDFPDAYCNLAHCL 473
>gi|209525653|ref|ZP_03274190.1| sulfotransferase [Arthrospira maxima CS-328]
gi|209493822|gb|EDZ94140.1| sulfotransferase [Arthrospira maxima CS-328]
Length = 729
Score = 38.5 bits (88), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 43/183 (23%), Positives = 74/183 (40%), Gaps = 17/183 (9%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK-----------SFRHLCS 84
E AI + KAI + +AV+L + R+DEA++ + S HL
Sbjct: 162 EQAITAYQKAIELKPDFSWSYHHLAVILTRLGRIDEAIDNYRQEIELNPDFYWSHFHLGD 221
Query: 85 KQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKF---- 140
+Q++ I Y+K ++ Q Q+L I + G+ + +
Sbjct: 222 LLSQQNQPEEAIAAYRKAIAINPQQPEAHQRLTEILSRDQKTGEDALLGGRYEEAIAIYR 281
Query: 141 -QVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARY 198
V K + S LG A + ++ A V+ +A ++PD + NL +CL KQ
Sbjct: 282 DMVAAKPDYSWGYYGLGLALLNLRQWSEAIEVFNQAINLNPDCFWSYNNLGYCLFKQGEI 341
Query: 199 TEA 201
+A
Sbjct: 342 PQA 344
>gi|449482042|ref|XP_004175980.1| PREDICTED: transmembrane and TPR repeat-containing protein 1
[Taeniopygia guttata]
Length = 837
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 46/184 (25%), Positives = 75/184 (40%), Gaps = 14/184 (7%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLC--SKQAQESLDN 93
E A++L +I G A +A +L +Q+R+ EA E K+ C S +
Sbjct: 520 EEAVILLRDSIKYGPEFADAYSSLASLLAEQERLKEAEEVYKAGIENCPESSDLHNNYGV 579
Query: 94 VLIDLYKKCGRLDEQIELLKQKLRM--IYHGEAFNGKPTKTARSHGKKFQVTVKQ--ETS 149
L+D G + + +Q + + +H N + K+ +V K+ + S
Sbjct: 580 FLVD----TGSPERAMSHYRQAILLSPTHHVAMVNLGRLHRSLGQNKEAEVWYKRALKVS 635
Query: 150 R---ILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVL 205
R IL LG Y G Y A VYR+A + P + + L+ L R EA +
Sbjct: 636 RKAEILSPLGALYYNTGRYEEALQVYREAAALQPSNKDIRLALAQVLAMMGRTKEAEKMT 695
Query: 206 EDVL 209
+L
Sbjct: 696 NHIL 699
>gi|390949283|ref|YP_006413042.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiocystis violascens DSM 198]
gi|390425852|gb|AFL72917.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiocystis violascens DSM 198]
Length = 972
Score = 38.1 bits (87), Expect = 3.9, Method: Composition-based stats.
Identities = 54/219 (24%), Positives = 91/219 (41%), Gaps = 23/219 (10%)
Query: 56 LKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQK 115
L D +L++ R +EA ++ + R L A L +L ++ G LDE +E
Sbjct: 328 LSDQGNILRELGRFEEARDSYR--RALAIDPANALAHTNLGNLLRELGHLDEALEHHAAA 385
Query: 116 LRMIY-HGEAFNGK--------PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYT 166
LR+ + E + + AR+H + +++ ++ GNLG + +
Sbjct: 386 LRIAPDYAEGYCNAGLVLQDLGRLEEARAHYSQ-ALSINPNLAQAHGNLGNYWQELKRCH 444
Query: 167 SAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEAR-------SVLEDVLLGKLSGSTE 218
A YR+A I+P A N+ L++Q + EAR S+ D + L+ T
Sbjct: 445 EALECYRRALAIEPRFAEAHNNMGLVLLEQGNFDEARERFEQALSIRPDYVDAYLNLGTC 504
Query: 219 TKTINRVKELLQELEPWQSIPPSLTTKKSSL---EDAFL 254
+ R + L + I P L T K L +AFL
Sbjct: 505 HGRVGRYDKALDCFDRALRISPDLATLKPGLVKVHNAFL 543
>gi|22298814|ref|NP_682061.1| hypothetical protein tlr1271 [Thermosynechococcus elongatus BP-1]
gi|22294995|dbj|BAC08823.1| tlr1271 [Thermosynechococcus elongatus BP-1]
Length = 624
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 36/135 (26%), Positives = 60/135 (44%), Gaps = 15/135 (11%)
Query: 86 QAQESLDNVLIDL---------YKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSH 136
+A +L N +DL Y+K L+ + L + H + GK T+ +
Sbjct: 111 EAHSNLGNAYLDLGDTEGAIACYQKALSLNPDLPTTLYNLGLCLHAQ---GKLTEATACY 167
Query: 137 GKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQ 195
+ + Q + + NLG AY++ NYT+A V ++ A +P+ A NL + L Q
Sbjct: 168 EQSLYLEPGQ--ADVHNNLGSAYLELKNYTAATVHFQAALGANPELLVAHYNLGYALHLQ 225
Query: 196 ARYTEARSVLEDVLL 210
AR ++VLL
Sbjct: 226 GNLAAARDRYQEVLL 240
>gi|425461303|ref|ZP_18840783.1| Tetratricopeptide TPR_2 repeat protein [Microcystis aeruginosa PCC
9808]
gi|389825870|emb|CCI24060.1| Tetratricopeptide TPR_2 repeat protein [Microcystis aeruginosa PCC
9808]
Length = 310
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 96/247 (38%), Gaps = 50/247 (20%)
Query: 3 QTRDQPYHVVHKLPPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAV- 61
+ RD ++ K P ++ + L +KD E A+ F A R+D + A+
Sbjct: 34 EARDAFEEILEKDPGSKQAHLGIGSICLKQKDYEGALTHF----GAARRIDPMMAKAALA 89
Query: 62 ---VLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM 118
V +Q+ +D A+EA K D V ID G L LLKQ
Sbjct: 90 VGNVYYKQNELDSAIEAFK--------------DAVNIDPTTPAGYLGMGRVLLKQ---- 131
Query: 119 IYHGEAFNGKPTKTARSHGKKFQVTVKQET-SRILGNLGWAYMQKGNYTSAEVVYRKAQL 177
K A+ +K V Q T R+L + Y ++GN +A A
Sbjct: 132 ---------KKYPQAKEQVQKALVFNPQLTPGRLL--MSQIYQEQGNTKAAIDEIESALR 180
Query: 178 IDPDANKAC-NLSHCLIKQARYTEARSVLEDVL-----------LGKLSGSTETKTINRV 225
++P A A +L + +KQ +Y AR ED +G + E+ +N
Sbjct: 181 MNPTAWSAYQSLGNIYLKQKQYNLARKNFEDAQQLNPKIPVVAKMGYIEALIESNALNEA 240
Query: 226 KELLQEL 232
E+L++L
Sbjct: 241 SEILRDL 247
>gi|154318674|ref|XP_001558655.1| hypothetical protein BC1G_02726 [Botryotinia fuckeliana B05.10]
Length = 586
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 27/51 (52%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVL 205
L + GNYT A+ K LI + + HCLIKQ+R+ EA S+L
Sbjct: 41 LAQVHFSTGNYTRAQSFLTKQDLIARNPSCRYLAGHCLIKQSRFEEALSIL 91
>gi|66347879|ref|NP_001018117.1| O-linked N-acetylglucosamine transferase isoform 4 [Danio rerio]
gi|56967378|gb|AAW31873.1| O-GlcNAc transferase variant 4 [Danio rerio]
Length = 1036
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A ++A LK++ V EA E + LC A +SL+N L +
Sbjct: 268 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 325
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E ++L ++ L ++++ EA PT
Sbjct: 326 IKREQGNIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 385
Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
+ G + + + + NL + GN A YR A
Sbjct: 386 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 445
Query: 176 QLIDPDANKA-CNLSHCL 192
+ PD A CNL+HCL
Sbjct: 446 LKLKPDFPDAYCNLAHCL 463
>gi|418295012|ref|ZP_12906887.1| hypothetical protein PstZobell_16958 [Pseudomonas stutzeri ATCC
14405 = CCUG 16156]
gi|379066370|gb|EHY79113.1| hypothetical protein PstZobell_16958 [Pseudomonas stutzeri ATCC
14405 = CCUG 16156]
Length = 575
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 35/75 (46%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLL 210
IL +LGW + G AEV+ RKA PD A +L L + +EAR+V L
Sbjct: 495 ILDSLGWINFRLGKLAEAEVLLRKALQRFPDHEVAAHLGEVLWTKGERSEARAVWSKALK 554
Query: 211 GKLSGSTETKTINRV 225
+ +TI R+
Sbjct: 555 LQPDSPILLETIKRL 569
>gi|63095209|gb|AAY32335.1| SPY [Phyllostachys praecox]
Length = 781
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 39/152 (25%), Positives = 63/152 (41%), Gaps = 32/152 (21%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQA-QESLDNV 94
E AIV + A++ R A + V+ K +D +D+AVE L K + +SL+N+
Sbjct: 159 EMAIVFYELALHFNPRCAEACNSLGVIYKDRDNLDKAVECY--LLALSIKPSFSQSLNNL 216
Query: 95 LIDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILG 153
+ +Y G++D ++ K + + EA+N
Sbjct: 217 GV-VYTVQGKMDAAASMIEKAIIANPTYAEAYN--------------------------- 248
Query: 154 NLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
NLG Y G+ T A Y + IDPD+ A
Sbjct: 249 NLGVLYRDAGSITLAVQAYERCLQIDPDSRNA 280
>gi|148253060|ref|YP_001237645.1| hypothetical protein BBta_1521 [Bradyrhizobium sp. BTAi1]
gi|146405233|gb|ABQ33739.1| hypothetical protein BBta_1521 [Bradyrhizobium sp. BTAi1]
Length = 120
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 20/65 (30%), Positives = 33/65 (50%)
Query: 136 HGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQ 195
H + + ET++IL NLG++YM +G T+A + +A +PD N + L
Sbjct: 50 HAYGQAIKLGGETTQILNNLGYSYMLRGKLTAARTKFMEAYRREPDNPTVLNNINLLNSS 109
Query: 196 ARYTE 200
R+ E
Sbjct: 110 YRFIE 114
>gi|91202613|emb|CAJ72252.1| Hypothetical Protein kustd1507 [Candidatus Kuenenia
stuttgartiensis]
Length = 700
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 35/69 (50%), Gaps = 1/69 (1%)
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTE 200
+ + ++ NLG AY++KGN +A YRKA I P +A NL + R+ E
Sbjct: 536 IQLNMHKPQVFNNLGSAYLKKGNPDAAIAQYRKALYIYPGYAEAHSNLGFIYTETNRFEE 595
Query: 201 ARSVLEDVL 209
A S L+ L
Sbjct: 596 ALSELKKAL 604
>gi|146283511|ref|YP_001173664.1| hypothetical protein PST_3184 [Pseudomonas stutzeri A1501]
gi|145571716|gb|ABP80822.1| TPR domain protein [Pseudomonas stutzeri A1501]
Length = 594
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 35/75 (46%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLL 210
IL +LGW + G AEV+ RKA PD A +L L + +EAR+V L
Sbjct: 514 ILDSLGWINFRLGKLAEAEVLLRKALQRFPDHEVAAHLGEVLWTKGERSEARAVWSKALK 573
Query: 211 GKLSGSTETKTINRV 225
+ +TI R+
Sbjct: 574 LQPDSPILLETIKRL 588
>gi|347830633|emb|CCD46330.1| similar to cell division cycle protein [Botryotinia fuckeliana]
Length = 612
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 27/51 (52%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVL 205
L + GNYT A+ K LI + + HCLIKQ+R+ EA S+L
Sbjct: 41 LAQVHFSTGNYTRAQSFLTKQDLIARNPSCRYLAGHCLIKQSRFEEALSIL 91
>gi|262066446|ref|ZP_06026058.1| tetratricopeptide repeat family protein [Fusobacterium
periodonticum ATCC 33693]
gi|291379873|gb|EFE87391.1| tetratricopeptide repeat family protein [Fusobacterium
periodonticum ATCC 33693]
Length = 349
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 61/142 (42%), Gaps = 13/142 (9%)
Query: 67 DRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFN 126
++V+E + + +L S Q + L K GR DE I L +Y +
Sbjct: 148 EKVNEKIFIVSELGYLYSVQGKNEEALKYFYLAKDLGRNDEWIYL------HLY----YT 197
Query: 127 GKPTKTARSHGKKF--QVTVKQETSRILGNLGWAYM-QKGNYTSAEVVYRKAQLIDPDAN 183
K +K K F Q ++ + + +L LG YM + NY +AE VY+K + D
Sbjct: 198 IKASKGEEEALKYFEEQAKIEDKNTVLLTALGNIYMLEPANYDAAEKVYQKVFALSGDGQ 257
Query: 184 KACNLSHCLIKQARYTEARSVL 205
+ N L+ +Y EA VL
Sbjct: 258 QLYNRGRALVGLKKYKEAVEVL 279
>gi|117923677|ref|YP_864294.1| hypothetical protein Mmc1_0362 [Magnetococcus marinus MC-1]
gi|117607433|gb|ABK42888.1| Tetratricopeptide TPR_2 repeat protein [Magnetococcus marinus MC-1]
Length = 804
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 71/158 (44%), Gaps = 11/158 (6%)
Query: 61 VVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY 120
++LKQ R DEA + ++ L S + L LY + G+L+E ++L++ + +
Sbjct: 405 LLLKQPGREDEAEQILREAMALDSNNIPPRTE--LGRLYMQQGKLEEAEQVLREVIGLDS 462
Query: 121 HGEAFNGKPTKTARSHGKKFQV-TVKQETSRILGN-------LGWAYMQKGNYTSAEVVY 172
+ + + GK + V QE + N LG YMQ+G AE V
Sbjct: 463 NDIHVRTELGRLYMQQGKLEEAEQVLQEAMALDSNNIPPRTELGRLYMQQGKLKEAEQVL 522
Query: 173 RKAQLIDP-DANKACNLSHCLIKQARYTEARSVLEDVL 209
++A +D + L ++Q + EAR VLE VL
Sbjct: 523 QEAMALDSNNIPPRTELGRLYMQQGKREEARWVLEQVL 560
>gi|386812651|ref|ZP_10099876.1| peptidase [planctomycete KSU-1]
gi|386404921|dbj|GAB62757.1| peptidase [planctomycete KSU-1]
Length = 619
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 50/192 (26%), Positives = 75/192 (39%), Gaps = 11/192 (5%)
Query: 30 LVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE 89
L +D E A+ F KA R A + + + R EA+EA K R +
Sbjct: 246 LAAEDYENALRCFEKAGKENPRYSLAYFYIGYCMDKLGRYSEAIEAYK--RAIRINATFL 303
Query: 90 SLDNVLIDLYKKCGRLDEQIELLKQKLRM------IYH--GEAFNGKPTKTARSHGKKFQ 141
+ L + Y + G E I + KQ +R+ I++ GE + H K
Sbjct: 304 EVHYNLGEDYDRLGCYGEAIGIYKQIIRIQPKNARIHYKLGEDYRILEHYPEAIHAFKKA 363
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTE 200
+ K + +LG G Y+ A Y++ I+P KA L KQ YTE
Sbjct: 364 IDSKPDFVEAYSSLGLVCFNLGYYSEAIEAYQQVITINPHDTKAHIMLGSAYSKQGCYTE 423
Query: 201 ARSVLEDVLLGK 212
A V + V+ K
Sbjct: 424 AIDVFKKVIYSK 435
>gi|423067050|ref|ZP_17055840.1| putative sulfotransferase [Arthrospira platensis C1]
gi|406711336|gb|EKD06537.1| putative sulfotransferase [Arthrospira platensis C1]
Length = 729
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 43/183 (23%), Positives = 74/183 (40%), Gaps = 17/183 (9%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK-----------SFRHLCS 84
E AI + KAI + +AV+L + R+DEA++ + S HL
Sbjct: 162 EQAITAYQKAIELKPDFSWSYHHLAVILTRLGRIDEAIDNYRQEIELNPDFYWSHFHLGD 221
Query: 85 KQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKF---- 140
+Q++ I Y+K ++ Q Q+L I + G+ + +
Sbjct: 222 LLSQQNQPEEAIAAYRKAIAINPQQPEAHQRLTEILSRDQKTGEDALLGGRYEEAIAIYR 281
Query: 141 -QVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARY 198
V K + S LG A + ++ A V+ +A ++PD + NL +CL KQ
Sbjct: 282 DMVAAKPDYSWGYYGLGLALLNLRQWSEAIEVFNQAINLNPDCFWSYNNLGYCLFKQGEI 341
Query: 199 TEA 201
+A
Sbjct: 342 PQA 344
>gi|357148919|ref|XP_003574938.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY-like
[Brachypodium distachyon]
Length = 959
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 38/151 (25%), Positives = 63/151 (41%), Gaps = 30/151 (19%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AIV + A++ R A ++ V+ K +D +D AVE + + +Q SL+N+
Sbjct: 308 EMAIVSYELALHFNPRCAEACNNLGVIYKDRDNLDRAVECYQMALTIKPNFSQ-SLNNLG 366
Query: 96 IDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
+ +Y G++D ++ K L + EA+N N
Sbjct: 367 V-VYTVQGKMDAAASMIEKAILANPTYAEAYN---------------------------N 398
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
LG Y G+ T + Y + IDPD+ A
Sbjct: 399 LGVLYRDAGSITLSVQAYERCLQIDPDSRNA 429
>gi|291566679|dbj|BAI88951.1| TPR domain protein [Arthrospira platensis NIES-39]
Length = 729
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 44/187 (23%), Positives = 77/187 (41%), Gaps = 17/187 (9%)
Query: 32 EKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK-----------SFR 80
+K E AI + KAI + +AV+L + RV+EA+ A + S
Sbjct: 158 QKQDEQAITAYRKAIELKPDFCWSYHHLAVILTRLGRVEEAIAAYRQEITLNPDFYWSHF 217
Query: 81 HLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKF 140
HL +Q++ I+ Y++ ++ Q Q+L I G+ + +
Sbjct: 218 HLGDLLSQQNQPEEAIEAYRQAIAINPQQPEAHQRLTEILSRHQKTGEDALLTGRYEEAI 277
Query: 141 -----QVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIK 194
V + + S LG A + + + A V+ +A I+PD + N L +CL K
Sbjct: 278 AIYRDMVAARPDYSWGYYGLGLALLNQRQWREAIDVFNQAISINPDCFWSYNHLGYCLFK 337
Query: 195 QARYTEA 201
Q + + A
Sbjct: 338 QGKISPA 344
>gi|301117834|ref|XP_002906645.1| Mitochondrial Protein Translocase (MPT) Family [Phytophthora
infestans T30-4]
gi|262107994|gb|EEY66046.1| Mitochondrial Protein Translocase (MPT) Family [Phytophthora
infestans T30-4]
Length = 1104
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 30/63 (47%)
Query: 21 PYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFR 80
P++ A V + DP AI F +A+ R SA D+A VL Q R DEA E +
Sbjct: 1005 PFLNAGRVYVGTNDPMRAIAHFKQALEVDPRCSSAHLDIAQVLFAQGRTDEAFEHFEVAA 1064
Query: 81 HLC 83
C
Sbjct: 1065 SCC 1067
>gi|66347873|ref|NP_001018115.1| O-linked N-acetylglucosamine transferase isoform 2 [Danio rerio]
gi|56967374|gb|AAW31871.1| O-GlcNAc transferase variant 2 [Danio rerio]
Length = 1052
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A ++A LK++ V EA E + LC A +SL+N L +
Sbjct: 268 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 325
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E ++L ++ L ++++ EA PT
Sbjct: 326 IKREQGNIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 385
Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
+ G + + + + NL + GN A YR A
Sbjct: 386 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 445
Query: 176 QLIDPDANKA-CNLSHCL 192
+ PD A CNL+HCL
Sbjct: 446 LKLKPDFPDAYCNLAHCL 463
>gi|328874522|gb|EGG22887.1| hypothetical protein DFA_05017 [Dictyostelium fasciculatum]
Length = 1297
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 41/84 (48%), Gaps = 9/84 (10%)
Query: 198 YTEARSVLEDVLLGKLSGS-TETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDAFLEG 256
Y ++ S +E +L L ++T TIN + E+L + W+ K + D FL G
Sbjct: 537 YMKSISQVEKMLKDSLEADKSQTNTINSITEILDNWDIWRK-------NKENAPD-FLSG 588
Query: 257 LDDLMNQWTPYRSRRLPIFEEISP 280
L D++ Q PY P+FE+ P
Sbjct: 589 LIDIVVQEVPYTEESKPMFEQFIP 612
>gi|291301344|ref|YP_003512622.1| SARP family transcriptional regulator [Stackebrandtia nassauensis
DSM 44728]
gi|290570564|gb|ADD43529.1| transcriptional regulator, SARP family [Stackebrandtia nassauensis
DSM 44728]
Length = 935
Score = 38.1 bits (87), Expect = 4.6, Method: Composition-based stats.
Identities = 46/175 (26%), Positives = 77/175 (44%), Gaps = 25/175 (14%)
Query: 55 ALKDMAVVLKQQDRVDEAV----EAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIE 110
AL+D+ + EA EA+K R + + N L D++++ G D I
Sbjct: 714 ALRDLGQAQRAMKLRPEATTTLTEALKVCRDNGFHERVPGILNFLADVHRQEGEYDTAIG 773
Query: 111 LLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEV 170
LL++ L H ++ ++ + +L +LG+ Y Q G + A
Sbjct: 774 LLREGL--------------AACDEHVADERIAIEIKNG-LLTSLGFNYEQTGQFDHAAE 818
Query: 171 VYRKAQLI---DPDANKAC--NLSHCLIKQARYTEARSVLEDVL-LGKLSGSTET 219
+R+A I DP A+ NL+H L +Q T+A +LE+ L L SG++ T
Sbjct: 819 AHREAIAISHGDPPAHAIALGNLAHVLRRQGEATQAIPLLEEALQLLNDSGNSRT 873
>gi|300676810|gb|ADK26686.1| kinesin light chain 4 [Zonotrichia albicollis]
Length = 622
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 50/183 (27%), Positives = 76/183 (41%), Gaps = 33/183 (18%)
Query: 52 VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIEL 111
V + L +A+V + Q++ EA + L S Q +L+N+ + LY K G+ E L
Sbjct: 250 VATMLNILALVYRDQNKYKEAAHLLND--KLLSLQVAATLNNLAV-LYGKRGKYKEAEPL 306
Query: 112 LKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVV 171
K+ L + + GK KQ L NL +G Y E
Sbjct: 307 CKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGKYDEVEYY 348
Query: 172 YRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVLLG---KLSGSTET 219
Y +A I DP+ K NL+ C +KQ +Y +A + +++L K GS E
Sbjct: 349 YCRALEIYESCLGPDDPNVAKTKNNLASCYLKQGKYKDAEVLYKEILTRAHVKEFGSDEH 408
Query: 220 KTI 222
K I
Sbjct: 409 KPI 411
>gi|409993097|ref|ZP_11276252.1| sulfotransferase [Arthrospira platensis str. Paraca]
gi|409936022|gb|EKN77531.1| sulfotransferase [Arthrospira platensis str. Paraca]
Length = 729
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 44/187 (23%), Positives = 77/187 (41%), Gaps = 17/187 (9%)
Query: 32 EKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK-----------SFR 80
+K E AI + KAI + +AV+L + RV+EA+ A + S
Sbjct: 158 QKQDEQAITAYRKAIELKPDFCWSYHHLAVILTRLGRVEEAIAAYRQEITLNPDFYWSHF 217
Query: 81 HLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKF 140
HL +Q++ I+ Y++ ++ Q Q+L I G+ + +
Sbjct: 218 HLGDLLSQQNQPEEAIEAYRQAIAINPQQPEAHQRLTEILSRHQKTGEDALLTGRYEEAI 277
Query: 141 -----QVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIK 194
V + + S LG A + + + A V+ +A I+PD + N L +CL K
Sbjct: 278 AIYRDMVAARPDYSWGYYGLGLALLNQRQWREAIDVFNQAISINPDCFWSYNHLGYCLFK 337
Query: 195 QARYTEA 201
Q + + A
Sbjct: 338 QGKISPA 344
>gi|153004759|ref|YP_001379084.1| hypothetical protein Anae109_1897 [Anaeromyxobacter sp. Fw109-5]
gi|152028332|gb|ABS26100.1| Tetratricopeptide TPR_2 repeat protein [Anaeromyxobacter sp.
Fw109-5]
Length = 268
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 47/205 (22%), Positives = 85/205 (41%), Gaps = 21/205 (10%)
Query: 23 VRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHL 82
VR + L DPE A V F A+ AL + +V +++ +D A + L
Sbjct: 47 VRRGYAYLEAADPERATVAFSHALEFDPDFPEALNGLGIVARREGDLDTARRRFERAVRL 106
Query: 83 CS--KQAQESLDNVLIDLYKKCG---------RLDEQIELLKQKLR--MIYHGEAFNGKP 129
+ +L L+ ++ G R+D + +Q L ++ HG P
Sbjct: 107 APDFAEGHANLGETLLAAGREGGGTDALRAALRIDPDLADARQNLARALLRHG---LDSP 163
Query: 130 TKTAR--SHGKKFQVTVKQETSRILGNLGWAYMQ--KGNYTSAEVVYRKAQLIDPDANKA 185
+ AR +++ ++ R + A+M +G + AE YR+A ++P +++A
Sbjct: 164 DRGARWAEARREYLHLLEAAPDRAAAHHDLAFMDYVEGRFERAEAGYRRAAELEPGSHEA 223
Query: 186 CN-LSHCLIKQARYTEARSVLEDVL 209
+ L L++ AR EA E L
Sbjct: 224 LHGLCVSLVRLARCDEAVRACERCL 248
>gi|376295888|ref|YP_005167118.1| hypothetical protein DND132_1101 [Desulfovibrio desulfuricans
ND132]
gi|323458449|gb|EGB14314.1| Tetratricopeptide TPR_1 repeat-containing protein [Desulfovibrio
desulfuricans ND132]
Length = 208
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 1/58 (1%)
Query: 153 GNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
GN+G+ YMQ+G+ A +KA +DPD +A +S LI + EA +L+ +L
Sbjct: 97 GNIGFVYMQQGDNDKAHKSLKKALKLDPDFAQAQATMSSVLIAMNDFEEADKLLKTLL 154
>gi|348516826|ref|XP_003445938.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
3 [Oreochromis niloticus]
Length = 1064
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A ++A LK++ V EA E + LC A +SL+N L +
Sbjct: 278 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 335
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E ++L ++ L ++++ EA PT
Sbjct: 336 IKREQGYIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 395
Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
+ G + + + + NL + GN A YR A
Sbjct: 396 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 455
Query: 176 QLIDPDANKA-CNLSHCL 192
+ PD A CNL+HCL
Sbjct: 456 LKLKPDFPDAYCNLAHCL 473
>gi|428182889|gb|EKX51748.1| hypothetical protein GUITHDRAFT_161407 [Guillardia theta CCMP2712]
Length = 458
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 36/66 (54%), Gaps = 2/66 (3%)
Query: 138 KKFQVTVKQE-TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQ 195
K F + +K+E T+ +L NL Y + G++T A +Y++A +D DA NL L K
Sbjct: 262 KYFSMALKEEVTTGVLNNLAIVYQEMGSHTEAVQMYKRALKLDQDDATAHYNLGTALEKA 321
Query: 196 ARYTEA 201
Y EA
Sbjct: 322 GSYKEA 327
>gi|348516822|ref|XP_003445936.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Oreochromis niloticus]
Length = 1048
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A ++A LK++ V EA E + LC A +SL+N L +
Sbjct: 278 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 335
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E ++L ++ L ++++ EA PT
Sbjct: 336 IKREQGYIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 395
Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
+ G + + + + NL + GN A YR A
Sbjct: 396 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 455
Query: 176 QLIDPDANKA-CNLSHCL 192
+ PD A CNL+HCL
Sbjct: 456 LKLKPDFPDAYCNLAHCL 473
>gi|262304971|gb|ACY45078.1| acetylglucosaminyl-transferase [Carcinoscorpius rotundicauda]
Length = 290
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 79/199 (39%), Gaps = 47/199 (23%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
A+ + +AI A ++A LK++ +V EA E + HLC A +SL+N L +
Sbjct: 5 AVDTYRRAIELQPNFPDAYCNLANALKEKGQVSEAEECYNTALHLCPTHA-DSLNN-LAN 62
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E L + L ++++ EA PT
Sbjct: 63 IKREQGYIEEATRLYLKALEVFPEFAAAHSNLASVLQQQGKLNEALMHYKEAIRISPTFA 122
Query: 131 -------KTARSHGKKFQVTVKQETSRILGNLGWA---------YMQKGNYTSAEVVYRK 174
T + G Q ++ T I N G+A + GN A YR
Sbjct: 123 DAYSNMGNTLKEMG-DIQGALQCYTRAIQINPGFADAHSNLASIHKDSGNIPEAIASYRT 181
Query: 175 AQLIDPDANKA-CNLSHCL 192
A + P+ A CNL+HCL
Sbjct: 182 ALKLKPEFPDAYCNLAHCL 200
>gi|348516824|ref|XP_003445937.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Oreochromis niloticus]
Length = 1038
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A ++A LK++ V EA E + LC A +SL+N L +
Sbjct: 268 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 325
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E ++L ++ L ++++ EA PT
Sbjct: 326 IKREQGYIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 385
Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
+ G + + + + NL + GN A YR A
Sbjct: 386 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 445
Query: 176 QLIDPDANKA-CNLSHCL 192
+ PD A CNL+HCL
Sbjct: 446 LKLKPDFPDAYCNLAHCL 463
>gi|293607618|ref|ZP_06689952.1| tetratricopeptide TPR_2 [Achromobacter piechaudii ATCC 43553]
gi|292814051|gb|EFF73198.1| tetratricopeptide TPR_2 [Achromobacter piechaudii ATCC 43553]
Length = 628
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 76/176 (43%), Gaps = 13/176 (7%)
Query: 60 AVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM- 118
A +L+ DRV +AV +++ + + L LY++ RL + +L+Q + +
Sbjct: 452 AQILRDADRVPQAVSTLEAADQALPDTVEIKYE--LAMLYERQNRLADLERMLRQVIALD 509
Query: 119 IYHGEAFNGKPTKTARSHGKKFQ-----VTVKQETSR----ILGNLGWAYMQKGNYTSAE 169
H A+N A H ++ +T E S IL ++GW + G SA
Sbjct: 510 PDHAHAYNALGYTLA-DHNQRLPEALDLITQALELSPNDPFILDSMGWVKYRMGESESAA 568
Query: 170 VVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRV 225
R+A + P+A+ A +L+ L Q + +A +L L+ T I R+
Sbjct: 569 QYLRRAYSVRPEADIAAHLAEVLWSQGKRDQAVELLRVALMKDPKNKTVQDVIKRL 624
>gi|415884106|ref|ZP_11546135.1| acetyl-CoA carboxylase carboxyltransferase subunit alpha [Bacillus
methanolicus MGA3]
gi|387591901|gb|EIJ84218.1| acetyl-CoA carboxylase carboxyltransferase subunit alpha [Bacillus
methanolicus MGA3]
Length = 325
Score = 37.7 bits (86), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 45/99 (45%), Gaps = 7/99 (7%)
Query: 25 AKHVQLVEKD------PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKS 78
H+ ++E PE A L WK + R +K A LK+ +DE + +K
Sbjct: 210 GNHIHMLENSTYSVISPEGAAALLWKDASQAKRAAETMKITAPDLKELGIIDEIIPEVKG 269
Query: 79 FRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLR 117
H K+ E +D +LI K+ ++ ++ EL+ + +
Sbjct: 270 GAHKDVKKQSEYIDQILIKSLKELVKMSDE-ELVDHRYK 307
>gi|406934936|gb|EKD69050.1| TPR Domain containing protein [uncultured bacterium]
Length = 488
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 41/188 (21%), Positives = 83/188 (44%), Gaps = 21/188 (11%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA-IKSFRHLCSKQAQESLDNV 94
+AAI ++ + + G ++ V+ ++++ + +A+E + S++ + E ++N+
Sbjct: 148 DAAIFMYSRVVKTGKGSAQLYNNLGVLYEKKNELKKALECYLNSYKM--DGKCTEVINNI 205
Query: 95 ------------LIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQV 142
ID Y K ++ + + L +IY GK + + K V
Sbjct: 206 GLVYAKLNNYNKAIDSYLKTLDIEPENKKALNNLGLIY---GLTGKFYDSISTWKKLLNV 262
Query: 143 TVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTEA 201
++ NLG AY+ GN T+A + A I+P + +A N +++ARY EA
Sbjct: 263 DSGDYSTHF--NLGVAYLNVGNKTNARYYFESALKINPRSAEALMNSGSLYVQEARYDEA 320
Query: 202 RSVLEDVL 209
+ V+
Sbjct: 321 VGEFKKVI 328
>gi|157812758|gb|ABV81124.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase [Limulus polyphemus]
Length = 290
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 79/199 (39%), Gaps = 47/199 (23%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
A+ + +AI A ++A LK++ +V EA E + HLC A +SL+N L +
Sbjct: 5 AVDTYRRAIELQPNFPDAYCNLANALKEKGQVSEAEECYNTALHLCPTHA-DSLNN-LAN 62
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E L + L ++++ EA PT
Sbjct: 63 IKREQGYIEEATRLYLKALEVFPEFAAAHSNLASVLQQQGKLNEALMHYKEAIRISPTFA 122
Query: 131 -------KTARSHGKKFQVTVKQETSRILGNLGWA---------YMQKGNYTSAEVVYRK 174
T + G Q ++ T I N G+A + GN A YR
Sbjct: 123 DAYSNMGNTLKEMG-DIQGALQCYTRAIQINPGFADAHSNLASIHKDSGNIPEAIASYRT 181
Query: 175 AQLIDPDANKA-CNLSHCL 192
A + P+ A CNL+HCL
Sbjct: 182 ALKLKPEFPDAYCNLAHCL 200
>gi|124008635|ref|ZP_01693326.1| tetratricopeptide repeat domain protein [Microscilla marina ATCC
23134]
gi|123985879|gb|EAY25743.1| tetratricopeptide repeat domain protein [Microscilla marina ATCC
23134]
Length = 309
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSH-CLIKQ 195
+L N+GW + KGN A+ +KA + P DA+ NL H C+IKQ
Sbjct: 205 VLMNIGWINLVKGNLKEADTYIKKAHQLAPYDAHTLMNLGHICMIKQ 251
>gi|428319715|ref|YP_007117597.1| glycosyl transferase family 2 [Oscillatoria nigro-viridis PCC 7112]
gi|428243395|gb|AFZ09181.1| glycosyl transferase family 2 [Oscillatoria nigro-viridis PCC 7112]
Length = 1533
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 45/177 (25%), Positives = 78/177 (44%), Gaps = 17/177 (9%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA-IKSFR---HLCSKQAQESLDN 93
A+ + KAI A ++ A +L Q + +EA EA ++F C+ + E+L
Sbjct: 361 AVSAYEKAIALKPDFAGAFRNFAKLLTQLGKSEEAAEAWYRAFAIDPKSCTAEEHENLAK 420
Query: 94 VLIDLYKKCGRLDEQIELLKQKLRM------IYH--GEAFNGKPTKTARSHGKKFQVTVK 145
LI+ G++D+ IE ++ + + YH GE G+ A +
Sbjct: 421 TLIEQ----GKVDKGIECYRRAVELNPNAGAAYHELGEILKGQEQWEAAVDAYTNAIRNN 476
Query: 146 QETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQARYTEA 201
S NL + ++ + A YRKA ++PD + + NL+ L+K R+ EA
Sbjct: 477 PNLSWSHNNLAESLVKLERWEEAVNAYRKAIELNPDFSWSHNNLADVLLKLERWEEA 533
>gi|297738270|emb|CBI27471.3| unnamed protein product [Vitis vinifera]
Length = 574
Score = 37.7 bits (86), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 51/101 (50%), Gaps = 9/101 (8%)
Query: 18 GDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK 77
GD R Q+ ++D EA + LF + AG ++D ++A+V Q D V
Sbjct: 167 GDIKSARLLFDQVDQRDTEA-LNLFHRMQTAGIKLD----NVALVSTLQACADLGVLDQG 221
Query: 78 SFRHLCSKQAQESLD----NVLIDLYKKCGRLDEQIELLKQ 114
+ H K+ + +D VLID+Y KCG L+E IE+ ++
Sbjct: 222 KWIHAYIKKHEIEIDPILGCVLIDMYAKCGDLEEAIEVFRK 262
>gi|427415284|ref|ZP_18905469.1| putative transcriptional regulator [Leptolyngbya sp. PCC 7375]
gi|425756049|gb|EKU96908.1| putative transcriptional regulator [Leptolyngbya sp. PCC 7375]
Length = 725
Score = 37.7 bits (86), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 9/72 (12%)
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI------DPDANKACNLSHCLI---KQAR 197
E + L NLG YM +G Y+ AE + ++A ++ D N A NL++ + KQ R
Sbjct: 543 EVALSLNNLGMLYMHQGRYSEAEPLLKRALVMSKRFLGDEHPNVATNLNNLALFYSKQER 602
Query: 198 YTEARSVLEDVL 209
Y+EA + ++ L
Sbjct: 603 YSEAEPLYQESL 614
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 36/73 (49%), Gaps = 9/73 (12%)
Query: 147 ETSRILGNLGWAYMQKGNYTSAE------VVYRKAQLIDPD---ANKACNLSHCLIKQAR 197
+ +R L NLGW Y ++G Y AE +V RK L D A NL+ KQ R
Sbjct: 627 DVARSLNNLGWLYGKQGRYREAEPLLKEAIVLRKQLLGDEHPSVARSLTNLAMIYDKQER 686
Query: 198 YTEARSVLEDVLL 210
Y EA + ++ ++
Sbjct: 687 YDEAEILYQEAVV 699
>gi|442319830|ref|YP_007359851.1| hypothetical protein MYSTI_02851 [Myxococcus stipitatus DSM 14675]
gi|441487472|gb|AGC44167.1| hypothetical protein MYSTI_02851 [Myxococcus stipitatus DSM 14675]
Length = 353
Score = 37.7 bits (86), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD-ANKACNLSHCLIKQARYTEAR 202
QE + NLG Y ++G Y A +R+A ++PD + NL++ L+K +Y EA+
Sbjct: 94 FNQEHLQAYQNLGSLYNEEGAYGKAHDNFRRALKVNPDNIDTRNNLAYTLMKMGKYEEAK 153
Query: 203 SVLEDVL 209
L +L
Sbjct: 154 KELRTIL 160
>gi|348516828|ref|XP_003445939.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
4 [Oreochromis niloticus]
Length = 1054
Score = 37.7 bits (86), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 45/198 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A ++A LK++ V EA E + LC A +SL+N L +
Sbjct: 268 AIDTYRRAIELQPHFPDAYCNLANALKEKGNVSEAEECYNTALRLCPTHA-DSLNN-LAN 325
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E ++L ++ L ++++ EA PT
Sbjct: 326 IKREQGYIEEAVQLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 385
Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
+ G + + + + NL + GN A YR A
Sbjct: 386 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 445
Query: 176 QLIDPDANKA-CNLSHCL 192
+ PD A CNL+HCL
Sbjct: 446 LKLKPDFPDAYCNLAHCL 463
>gi|91793507|ref|YP_563158.1| TPR repeat-containing protein [Shewanella denitrificans OS217]
gi|91715509|gb|ABE55435.1| TPR repeat [Shewanella denitrificans OS217]
Length = 448
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 47/97 (48%), Gaps = 11/97 (11%)
Query: 120 YHGEAF----NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
YHG + P ++ K Q+ V +I LG+ Y+Q ++ SA V Y++A
Sbjct: 124 YHGLGLVQLQSDNPIAATKNFAKALQLGVN--DPQIYSFLGYGYIQSEHFHSAVVAYQQA 181
Query: 176 QLIDPD---ANKACNLSHCLIKQARYTEARSVLEDVL 209
+L +PD N+A ++ + + A S+LE +L
Sbjct: 182 KLFNPDDAQLNQALLYAYNMA--GHHDAALSLLEQML 216
>gi|417411996|gb|JAA52415.1| Putative kinesin light chain, partial [Desmodus rotundus]
Length = 624
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 9/72 (12%)
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI-----DPD----ANKACNLSHCLIKQAR 197
+ ++ L NL +G Y + E YR+A I PD A NL+ C +KQ +
Sbjct: 340 DVAKQLNNLALLCQNQGKYEAVECYYRRALAIYEGQLGPDNPNVARTKNNLASCYLKQGK 399
Query: 198 YTEARSVLEDVL 209
Y EA ++ +++L
Sbjct: 400 YAEAETLYKEIL 411
>gi|315637376|ref|ZP_07892591.1| conserved hypothetical protein [Arcobacter butzleri JV22]
gi|315478350|gb|EFU69068.1| conserved hypothetical protein [Arcobacter butzleri JV22]
Length = 503
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 40/175 (22%), Positives = 84/175 (48%), Gaps = 15/175 (8%)
Query: 41 LFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYK 100
L+ +AI D+ A ++AV+L ++ +DEA + + + K + + +N+ + L K
Sbjct: 292 LYKEAIKIDDKYSKAYNNLAVLLSEKGEMDEAEKLFREAIKIDDKNSN-TYNNLAL-LLK 349
Query: 101 KCGRLDEQIELLKQKLRM------IYHGEAF----NGKPTKTARSHGKKFQVTVKQETSR 150
G +DE +L ++ +++ Y+ A G+ + + + K + + + S+
Sbjct: 350 NKGEIDEAEKLYREAIKIDDKNSNTYNNLALLLANKGEIDEAEKLY--KEAIKIDDKNSK 407
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKA-QLIDPDANKACNLSHCLIKQARYTEARSV 204
NL + +KG AE +YR+A ++ D D++ NL+ L + EA +
Sbjct: 408 AYNNLAFLLSEKGEIGEAEKLYREAIKIDDKDSDIYYNLALLLKNKGDIDEAEKL 462
Score = 37.7 bits (86), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 93/208 (44%), Gaps = 33/208 (15%)
Query: 19 DSPYVRAKH---VQLVEK-DPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV- 73
D Y +A + V L EK + + A LF +AI D+ + ++A++LK + +DEA
Sbjct: 300 DDKYSKAYNNLAVLLSEKGEMDEAEKLFREAIKIDDKNSNTYNNLALLLKNKGEIDEAEK 359
Query: 74 ---EAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRM------IYHGEA 124
EAIK + + +N+ + L K G +DE +L K+ +++ Y+ A
Sbjct: 360 LYREAIK-----IDDKNSNTYNNLALLLANK-GEIDEAEKLYKEAIKIDDKNSKAYNNLA 413
Query: 125 F-------NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA-Q 176
F G+ K R + + + S I NL KG+ AE +YR+A +
Sbjct: 414 FLLSEKGEIGEAEKLYRE-----AIKIDDKDSDIYYNLALLLKNKGDIDEAEKLYREAIK 468
Query: 177 LIDPDANKACNLSHCLIKQARYTEARSV 204
+ D D + NL+ L + EA +
Sbjct: 469 IDDKDLDVYNNLAVLLANKGDIDEAEKL 496
>gi|91202600|emb|CAJ72239.1| hypothetical protein kustd1494 [Candidatus Kuenenia
stuttgartiensis]
Length = 847
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 42/158 (26%), Positives = 68/158 (43%), Gaps = 10/158 (6%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
E AI + KA+ A A ++ V + +++ + AVEA +S L QA N L
Sbjct: 648 EDAIAAYKKAVAADPLNSDAYYNLGNVYESKNQFELAVEAYQS--ALAIDQALAYAHNNL 705
Query: 96 IDLYKKCGRLDEQIELLKQKLRM-IYHGEAFNGKPTKTARSHGKKFQVTVKQETSRIL-- 152
LY K G LD+ IE +Q ++ + A N A+ ++ QE +L
Sbjct: 706 GALYDKKGILDKAIEEYRQAIKYDPLYPYAHNNLGASLAKKGDMDSALSEFQEAVHLLPD 765
Query: 153 -----GNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
NLG+ +++ GN A + + I P +A
Sbjct: 766 NPDFRFNLGYVFLRMGNNALALQAFEETIRIKPSHTEA 803
>gi|54302280|ref|YP_132273.1| hypothetical protein PBPRB0600 [Photobacterium profundum SS9]
gi|46915702|emb|CAG22473.1| hypothetical protein PBPRB0600 [Photobacterium profundum SS9]
Length = 361
Score = 37.7 bits (86), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 20/67 (29%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI-DPDANKACNLSHCLIKQARYTEAR 202
+ ++ + L NLG++Y G+ AE++ R+A I D NL I+ RY++A
Sbjct: 218 INHKSRQALTNLGYSYYLSGDLKQAEMINRRATTIYQTDKRAWSNLGLVYIRSKRYSDAL 277
Query: 203 SVLEDVL 209
L++V+
Sbjct: 278 DALQNVM 284
>gi|308500566|ref|XP_003112468.1| hypothetical protein CRE_30752 [Caenorhabditis remanei]
gi|308267036|gb|EFP10989.1| hypothetical protein CRE_30752 [Caenorhabditis remanei]
Length = 526
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 74/174 (42%), Gaps = 35/174 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
V + L +A+V + Q++ EA + + K E +L+N+ I + ++ GR
Sbjct: 243 VATMLNVLAMVYRDQEKYKEATNFLTRALQIREKHYGENHPYVAATLNNLAIIIGRR-GR 301
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
LDE L K+ L++ EA GK + + ++ L NLG G
Sbjct: 302 LDEAQTLCKRSLKI---REAVYGKDSP---------------DVAKQLNNLGLLCQNLGK 343
Query: 165 YTSAEVVYRKAQLI---DPDANKA------CNLSHCLIKQARYTEARSVLEDVL 209
Y AE Y+KA I DAN + +LS +KQ Y EA + VL
Sbjct: 344 YEEAEDYYKKALKIYEMKLDANHSETVKTRNHLSSLYMKQGNYKEAEGFYKQVL 397
>gi|310794305|gb|EFQ29766.1| tetratricopeptide [Glomerella graminicola M1.001]
Length = 615
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 28/51 (54%)
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVL 205
L + GNYT A+ K L+ + + +HCLIKQ+R+ EA +VL
Sbjct: 41 LAQVHFATGNYTRAQTFLAKQDLVARNVSCRYLAAHCLIKQSRFDEAVAVL 91
>gi|189426262|ref|YP_001953439.1| hypothetical protein Glov_3213 [Geobacter lovleyi SZ]
gi|189422521|gb|ACD96919.1| Tetratricopeptide TPR_2 repeat protein [Geobacter lovleyi SZ]
Length = 639
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 48/186 (25%), Positives = 79/186 (42%), Gaps = 36/186 (19%)
Query: 22 YVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRH 81
+V+ K++ L E+ +A + L A D + A +++A V + + D+AVE
Sbjct: 419 FVKNKNLNLAEESYKAVLKL------APDNPE-ANRELAAVYRAKGATDKAVEHYTKALE 471
Query: 82 LCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQ 141
L + + N L+ +Y K + DE ELL++ + + A +H K
Sbjct: 472 LQEEDNESR--NALVAIYVKDKKYDELAELLQEAVEL----------APDDANNHYK--- 516
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACN-LSHCLIKQARYTE 200
LG Y K Y +A Y+KA + PD +A + L +K R +E
Sbjct: 517 -------------LGLIYDFKKEYDNAIASYKKAAELKPDHARALHALGRVYMKTGRLSE 563
Query: 201 ARSVLE 206
AR LE
Sbjct: 564 AREALE 569
>gi|119486472|ref|ZP_01620530.1| kinesin light chain-like protein [Lyngbya sp. PCC 8106]
gi|119456374|gb|EAW37505.1| kinesin light chain-like protein [Lyngbya sp. PCC 8106]
Length = 1127
Score = 37.7 bits (86), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 46/187 (24%), Positives = 76/187 (40%), Gaps = 43/187 (22%)
Query: 56 LKDMAVVLKQQDRVDEAVEAIKSFRHLCSK-------QAQESLDNVLIDLYKKCGRLDEQ 108
L+ AV L QQ + +EA+ +K + + +SL+N+ I LY+ GR E
Sbjct: 111 LEQQAVQLYQQGKYNEAIPILKRVLEIIERLLGENHPDVAQSLNNLAI-LYRDQGRYSEA 169
Query: 109 IELLKQKLRMIYH----------GEAFN---------GKPTKTARSHGKKFQVTVKQ--- 146
L ++ L IY ++ N G+ ++ H + + K
Sbjct: 170 EPLFQRSL-TIYEKALGENHPDVAQSLNNLAQLYYSQGRYSEAEPLHQRSLAIREKALGE 228
Query: 147 ---ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI--------DPD-ANKACNLSHCLIK 194
+ + L NL Y +G Y+ AE +YR++ I PD A NL+ +
Sbjct: 229 NHPDVATSLNNLAQLYDSQGRYSEAEPLYRRSLAILEKALGENHPDVATSLNNLATLYVS 288
Query: 195 QARYTEA 201
Q RY +A
Sbjct: 289 QGRYDKA 295
>gi|189426401|ref|YP_001953578.1| hypothetical protein Glov_3352 [Geobacter lovleyi SZ]
gi|189422660|gb|ACD97058.1| TPR repeat-containing protein [Geobacter lovleyi SZ]
Length = 404
Score = 37.7 bits (86), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 63/151 (41%), Gaps = 10/151 (6%)
Query: 96 IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN- 154
ID YK ++D+ + + L +Y NG+ ++ K + +KQ L
Sbjct: 81 IDAYKMSLKVDKTQDTVYTTLAGVYID---NGQKSEAE----KVLKDGIKQNNQNTLAYY 133
Query: 155 -LGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVLEDVLLGK 212
LG Q G+YTSAE +R+ ++P D N L L Q +Y EA L+ K
Sbjct: 134 MLGQLQAQNGDYTSAEANFRQVIKLEPKDGNGYYALGMALNGQEKYDEAVEALQKATDLK 193
Query: 213 LSGSTETKTINRVKELLQELEPWQSIPPSLT 243
S + R L E + Q I +LT
Sbjct: 194 ADFSPALLELGRAYAGLGEKDKAQEIVDTLT 224
>gi|171912400|ref|ZP_02927870.1| Redoxin domain protein [Verrucomicrobium spinosum DSM 4136]
Length = 776
Score = 37.7 bits (86), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 33/66 (50%), Gaps = 4/66 (6%)
Query: 120 YHGEAFNGKPTKTA---RSHGK-KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
+HGEAF+ P + A GK KF VT K E ++ G + Y AE +R+A
Sbjct: 25 FHGEAFDDGPRQAAVLMEGCGKVKFPVTTKNELAQKFFTQGIGQIHGFWYYEAERSFRQA 84
Query: 176 QLIDPD 181
+DPD
Sbjct: 85 ADLDPD 90
>gi|451982583|ref|ZP_21930892.1| exported hypothetical protein, contains TPR repeats [Nitrospina
gracilis 3/211]
gi|451760229|emb|CCQ92187.1| exported hypothetical protein, contains TPR repeats [Nitrospina
gracilis 3/211]
Length = 395
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
Query: 131 KTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLS 189
K SH K + + + ++ L +LGWA Q G+ +AE R++ ++ D NL
Sbjct: 61 KAGASHLKD-TIRLNKNHTQALRDLGWALYQMGDLAAAEKWLRQSYRLNAKDPKTVANLG 119
Query: 190 HCLIKQARYTEARSVLEDVLLG 211
LI + RY+ A +VL++ + G
Sbjct: 120 AVLIARERYSAAVAVLDNSVPG 141
>gi|350551992|ref|ZP_08921201.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiorhodospira
sibirica ATCC 700588]
gi|349795635|gb|EGZ49431.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiorhodospira
sibirica ATCC 700588]
Length = 581
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 28/59 (47%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
I +LGW + G AE+ R+A + DA A NL+ L + Y EA +L L
Sbjct: 503 ITDSLGWVLFRLGRLDEAEIYLRRAHAMLEDAEIASNLAALLWARGHYDEAWEILRTAL 561
>gi|302344515|ref|YP_003809044.1| hypothetical protein Deba_3097 [Desulfarculus baarsii DSM 2075]
gi|301641128|gb|ADK86450.1| Tetratricopeptide TPR_2 repeat protein [Desulfarculus baarsii DSM
2075]
Length = 803
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 41/84 (48%), Gaps = 5/84 (5%)
Query: 150 RILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVLEDV 208
R+L LGW +G + SA +A+ + P DA+ +L L K R EA+ L D
Sbjct: 720 RVLDTLGWVLSLRGAHASALPELEEARKLAPTDASVYYHLGSTLAKMGRANEAKKALRDG 779
Query: 209 LLGKLSGSTETKTINRVKELLQEL 232
L K + T +++LLQEL
Sbjct: 780 LAQK----GDQTTRQEIQKLLQEL 799
>gi|300313205|ref|YP_003777297.1| TPR repeat containing protein [Herbaspirillum seropedicae SmR1]
gi|300075990|gb|ADJ65389.1| TPR repeat containing protein [Herbaspirillum seropedicae SmR1]
Length = 600
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 29/62 (46%)
Query: 147 ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLE 206
E + I +LGW + G AE R+A + PDA +L L + + EAR +
Sbjct: 518 EDAFIADSLGWVLFRMGRLDEAETQMRRAYGLRPDAEIGVHLGEVLWTRGKQDEARKIWR 577
Query: 207 DV 208
DV
Sbjct: 578 DV 579
>gi|254492569|ref|ZP_05105741.1| hypothetical protein MDMS009_2911 [Methylophaga thiooxidans DMS010]
gi|224462461|gb|EEF78738.1| hypothetical protein MDMS009_2911 [Methylophaga thiooxydans DMS010]
Length = 532
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 32/140 (22%), Positives = 59/140 (42%), Gaps = 29/140 (20%)
Query: 42 FWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKK 101
+ KA+ A+ ++ VVL++Q R++EAVEA + L +Q ++ N+ L K
Sbjct: 136 YQKAVELQPGFYEAMANLGVVLQEQGRLEEAVEAYN--KALAVQQDAQTFFNLGTAL-KN 192
Query: 102 CGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQ 161
G+L + I+ Q L + + + + N+G
Sbjct: 193 QGKLGDAIDAYNQAL--------------------------VINPDYAEVHSNIGEVLRD 226
Query: 162 KGNYTSAEVVYRKAQLIDPD 181
+G Y + Y++A +DPD
Sbjct: 227 QGRYDESVKAYKQALTLDPD 246
>gi|153006532|ref|YP_001380857.1| hypothetical protein Anae109_3694 [Anaeromyxobacter sp. Fw109-5]
gi|152030105|gb|ABS27873.1| Tetratricopeptide TPR_2 repeat protein [Anaeromyxobacter sp.
Fw109-5]
Length = 638
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 31/126 (24%), Positives = 55/126 (43%), Gaps = 11/126 (8%)
Query: 68 RVDEAVEAIKSFRH-LCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY-HGEAF 125
R +A EA+ + R L S ++L L Y + G+LD + ++ L ++ H EA
Sbjct: 449 RAGKAAEAVSALRSALASDPRDQALLYALGATYHRAGQLDAAVAQMQALLALVPDHAEAL 508
Query: 126 NGKPTKTARSHGK--------KFQVTVKQETSRILGNLGWAYMQKGNYT-SAEVVYRKAQ 176
N A + + V ++ + + +LGW ++G Y +AE + +
Sbjct: 509 NFMGYALAERGTRLDEAERLVRRAVELRPRSGHVRDSLGWVLFRRGEYARAAEALEQADA 568
Query: 177 LIDPDA 182
L PDA
Sbjct: 569 LAGPDA 574
>gi|189347525|ref|YP_001944054.1| hypothetical protein Clim_2046 [Chlorobium limicola DSM 245]
gi|189341672|gb|ACD91075.1| Tetratricopeptide TPR_2 repeat protein [Chlorobium limicola DSM
245]
Length = 208
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 28/120 (23%), Positives = 52/120 (43%), Gaps = 11/120 (9%)
Query: 83 CSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQV 142
C+ + + +DN+ ++K G D L G AF S + +
Sbjct: 30 CASPSSKEIDNLQQQVWKNSGDADAWFRL----------GNAFARNEQYRKASEAYREAL 79
Query: 143 TVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDAN-KACNLSHCLIKQARYTEA 201
++ E ++L LG A +GNY A V + K Q + PD + + ++ + L++ Y +A
Sbjct: 80 SIDPEKEQLLAALGAASFNQGNYREALVYFTKYQALAPDDSLRNYDIGNVLLQMREYDKA 139
>gi|332711060|ref|ZP_08430995.1| hypothetical protein LYNGBM3L_60180 [Moorea producens 3L]
gi|332350186|gb|EGJ29791.1| hypothetical protein LYNGBM3L_60180 [Moorea producens 3L]
Length = 419
Score = 37.4 bits (85), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 35/81 (43%), Gaps = 20/81 (24%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQA-------------- 196
+L NLGWA +++ Y AE R A +D + +HCL+ Q
Sbjct: 318 LLKNLGWARLKQERYAEAETYLRDAIELDKTFTRTPAAAHCLLAQVIENKPEKDPDNALA 377
Query: 197 ------RYTEARSVLEDVLLG 211
RYT+ R+ ED+ +G
Sbjct: 378 EWKICLRYTDIRNPDEDIWVG 398
>gi|344337618|ref|ZP_08768552.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiocapsa marina
5811]
gi|343802571|gb|EGV20511.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiocapsa marina
5811]
Length = 607
Score = 37.4 bits (85), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 77/200 (38%), Gaps = 33/200 (16%)
Query: 18 GDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK 77
GD R QL ++ PE A++L+ VL+ R DEA+
Sbjct: 398 GDVQRAREILQQLRDRGPENAVLLYLVETE--------------VLESVGRADEAMSVFD 443
Query: 78 SFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIY-----HGEAFNG----- 127
S + +N+L +LD +I L +Q LR I H +A N
Sbjct: 444 SALQVFPDD-----ENLLYARALSAVKLD-RIALAEQDLRRIIEIDPEHADALNALGYTL 497
Query: 128 --KPTKTARSHGK-KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANK 184
+ + A + G + +K E IL ++GW + + G+Y +A R+A D
Sbjct: 498 ADRTDRYAEAKGYIEKAYAIKPEEPAILDSMGWIHYRLGDYETAVDYLRRALARMSDGEI 557
Query: 185 ACNLSHCLIKQARYTEARSV 204
A +L L R EA V
Sbjct: 558 AAHLGEVLWAMNRREEALEV 577
>gi|321465641|gb|EFX76641.1| hypothetical protein DAPPUDRAFT_54951 [Daphnia pulex]
Length = 825
Score = 37.4 bits (85), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 34/134 (25%), Positives = 63/134 (47%), Gaps = 21/134 (15%)
Query: 44 KAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCG 103
KA++ + + ++ ++ ++Q R +EA+++ +S H + A L+ L+ G
Sbjct: 502 KALSYRNNMADVHYNLGILYQEQKRYEEAIQSYRSAVHYRPRMAMAHLNMGLV--LALMG 559
Query: 104 RLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI--LGNLGWAYMQ 161
DE IE+ ++ ++ +G K R+H ET++I L NLG +
Sbjct: 560 MKDEAIEVYRRCSQL-------DGSGLKDPRTH----------ETTKISALFNLGRLHAD 602
Query: 162 KGNYTSAEVVYRKA 175
G YT A VY +A
Sbjct: 603 DGQYTKAIDVYNEA 616
>gi|332234364|ref|XP_003266379.1| PREDICTED: LOW QUALITY PROTEIN: kinesin light chain 4 [Nomascus
leucogenys]
Length = 606
Score = 37.4 bits (85), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 65/150 (43%), Gaps = 31/150 (20%)
Query: 69 VDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGK 128
V+E SFRH C A +L+N+ + LY K G+ E L ++ L +
Sbjct: 275 VEEGGIRSPSFRH-CPVAA--TLNNLAV-LYGKRGKYKEAEPLCQRALEIR--------- 321
Query: 129 PTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI-----DPD-- 181
+K T + ++ L NL +G Y + E Y++A I PD
Sbjct: 322 ---------EKVLGTNHPDVAKQLNNLALLCQNQGKYEAVERYYQRALAIYEGQLGPDNP 372
Query: 182 --ANKACNLSHCLIKQARYTEARSVLEDVL 209
A NL+ C +KQ +Y EA ++ +++L
Sbjct: 373 NVARTKNNLASCYLKQGKYAEAETLYKEIL 402
>gi|322420030|ref|YP_004199253.1| hypothetical protein GM18_2524 [Geobacter sp. M18]
gi|320126417|gb|ADW13977.1| Tetratricopeptide TPR_1 repeat-containing protein [Geobacter sp.
M18]
Length = 572
Score = 37.4 bits (85), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 39/155 (25%), Positives = 68/155 (43%), Gaps = 27/155 (17%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P A+ + KA++ + AL +M + + Q + +A+++ K + A ++
Sbjct: 199 PREALTYYKKAVDLKPDFEQALIEMGISQETQGLIPDAIDSYKDLLEINPNNA--NVIQH 256
Query: 95 LIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHG-------------KKFQ 141
L LY + RLDE + LL+QK GK +T+R G K FQ
Sbjct: 257 LAQLYIQQKRLDEALALLQQK----------GGKSLETSRKIGLLFLELERYDEAIKTFQ 306
Query: 142 --VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRK 174
+ V+ + ++ L AY +K + A V +RK
Sbjct: 307 DILKVEPDAQQVRFYLASAYEEKEDVDQAIVEFRK 341
>gi|411118571|ref|ZP_11390952.1| methyltransferase, FkbM family [Oscillatoriales cyanobacterium
JSC-12]
gi|410712295|gb|EKQ69801.1| methyltransferase, FkbM family [Oscillatoriales cyanobacterium
JSC-12]
Length = 928
Score = 37.4 bits (85), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 51/196 (26%), Positives = 80/196 (40%), Gaps = 32/196 (16%)
Query: 18 GDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIK 77
G+ P + Q+VE DPE FW +A+V QQ R EA+ +
Sbjct: 17 GNLPDAETAYRQVVEDDPEHIEAWFW---------------LALVTDQQGRPMEAIAHYQ 61
Query: 78 SFRHL--CSKQAQESLDNVLIDLYKKCGRLDEQI-------ELLKQKLRMIYHGEA--FN 126
L S +A +L +V + K R DE I EL+ Q + Y+ +
Sbjct: 62 KVLQLQPNSAEAHGNLGSVWL----KLRRFDEAIAHHRKSVELMPQNAKAHYNLAIALYE 117
Query: 127 GKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC 186
A ++ ++ V + E + NLG A ++G A Y+KA ++P+ A
Sbjct: 118 NNQVDEAITYYQQ-AVALMPEYANAHHNLGMALYRQGKADEAITHYQKAIALEPNHASAR 176
Query: 187 N-LSHCLIKQARYTEA 201
N L L +Q + EA
Sbjct: 177 NSLGVALYQQGKIDEA 192
>gi|307154916|ref|YP_003890300.1| hypothetical protein Cyan7822_5141 [Cyanothece sp. PCC 7822]
gi|306985144|gb|ADN17025.1| TPR repeat-containing protein [Cyanothece sp. PCC 7822]
Length = 965
Score = 37.4 bits (85), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 65/173 (37%), Gaps = 33/173 (19%)
Query: 52 VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV------LIDLYKKCGRL 105
V ++L ++A + Q R +EA K L + ++ NV L LY GR
Sbjct: 698 VATSLNNLAALYDSQGRYEEAEPLYKQALELSKRLLGDNHPNVATSLNNLAALYDSQGRY 757
Query: 106 DEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNY 165
+E L KQ L +I K+ + + L NL Y +G Y
Sbjct: 758 EEAEPLYKQALELI------------------KRLLGDNHPDVATSLNNLAALYDSQGRY 799
Query: 166 TSAEVVY------RKAQLIDPDANKAC---NLSHCLIKQARYTEARSVLEDVL 209
AE +Y RK L D N A NL+ Q RY EA + + L
Sbjct: 800 EEAEPLYKQALELRKRLLGDNHPNVASSLNNLAGLYDSQGRYEEAEPLYKQAL 852
>gi|113474349|ref|YP_720410.1| hypothetical protein Tery_0481 [Trichodesmium erythraeum IMS101]
gi|110165397|gb|ABG49937.1| TPR repeat [Trichodesmium erythraeum IMS101]
Length = 3145
Score = 37.4 bits (85), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 79/171 (46%), Gaps = 11/171 (6%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
+ AI L + IN A ++A ++K++ R++EA+ + L + +
Sbjct: 1044 DIAIQLLNQVINLYPGFTEAYSNLAKLMKKEGRLEEAIAHYQKAISLEPNNSSNYSNLGF 1103
Query: 96 IDLYKKCGRLDEQIELLKQKLRM--------IYHGEAFNGKPTKTARSHGKKFQVTVKQE 147
I L K G+++ I ++ + + G A+ K + S + + ++ +
Sbjct: 1104 IFLEK--GQIESAIINSEKSIEINPNNSQGNFNLGFAWAEKGDLSKASTYYQKAINLQPD 1161
Query: 148 TSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCLIKQAR 197
++ NLG + +KGN + A Y++A I+P+ +A CNL L+KQ +
Sbjct: 1162 YAQAHNNLGLIFQEKGNLSKASNYYQQALEINPNYAEAWCNLGVILLKQGQ 1212
>gi|381158787|ref|ZP_09868020.1| hypothetical protein Thi970DRAFT_02489 [Thiorhodovibrio sp. 970]
gi|380880145|gb|EIC22236.1| hypothetical protein Thi970DRAFT_02489 [Thiorhodovibrio sp. 970]
Length = 338
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 64/256 (25%), Positives = 97/256 (37%), Gaps = 49/256 (19%)
Query: 16 PPGDSPYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEA 75
P PY+R+ + + + E A F + +R + V ++ D+ ++AV+A
Sbjct: 83 PTHVQPYLRSGRIYFQQGELEKAREAFLNVLRVNERSAIGHAALGFVYQRSDQANQAVDA 142
Query: 76 IKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARS 135
FR L L L + +K GR + + K LR+ +PT
Sbjct: 143 W--FRALSFNPRMVGLRKRLALILQKLGRQSDAMAQAKAALRI---------RPTDP--- 188
Query: 136 HGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA---CNLSHCL 192
E I G L ++ + A YRKA +DPDA K L+
Sbjct: 189 -----------EAHAINGRL---HLLAKEFDPALRAYRKAIDLDPDAKKLGIRLGLAEAC 234
Query: 193 IKQARYTEARSVLEDV--------LLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTT 244
I EA +VL V LL KL G T+ KE ++E ++ T
Sbjct: 235 ICAGHLDEAETVLSAVPESDQFSALLHKLWGDIYTEK-GLHKEAMEEYR------GAVLT 287
Query: 245 KKSSLEDAFLEGLDDL 260
+E +EGLDDL
Sbjct: 288 SGQEVE---IEGLDDL 300
>gi|148262768|ref|YP_001229474.1| TPR repeat-containing protein [Geobacter uraniireducens Rf4]
gi|146396268|gb|ABQ24901.1| Tetratricopeptide TPR_2 repeat protein [Geobacter uraniireducens
Rf4]
Length = 632
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 46/178 (25%), Positives = 78/178 (43%), Gaps = 22/178 (12%)
Query: 19 DSPYVRAK--HVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAI 76
D+P + K V + KD AAI + + A +D+A + +++++ +EA +
Sbjct: 404 DNPLIHFKLAKVYVNSKDYPAAISEYLETTKLDPDNIEAHRDLAALFRKKNQNEEAEKEY 463
Query: 77 KSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQ-------------KLRMIYHGE 123
+S L K+ L +Y K DE I LLK+ KL +IY +
Sbjct: 464 RSI--LRMKKDDVEARTALTSIYVKNKNYDELINLLKEGVELNPKDPNSHYKLGLIYEFK 521
Query: 124 AFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
K A S K+ V +K + ++ L +G AYM+ G + A+ A+ DP+
Sbjct: 522 ----KDYDAAISQYKE-SVALKSDHAKALNAMGRAYMKSGRISEAKEALETAKKADPE 574
>gi|406953657|gb|EKD82827.1| TPR Domain containing protein, partial [uncultured bacterium]
Length = 1487
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 57/114 (50%), Gaps = 21/114 (18%)
Query: 86 QAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAF-NGKPT------KTARSHGK 138
QA + LDN ++LY +CG+L ++++L+++ LR EA KP + A +G
Sbjct: 185 QAVKELDNSNVELYYRCGKLYQRLDLMEEALREF---EAIARMKPDWLEIRYELATLYGD 241
Query: 139 K--FQVTVKQ---------ETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPD 181
K +++ +K+ ++ ++ NL +Y G+ A V Y+ A PD
Sbjct: 242 KGEYELQIKEYEAIAAYDTTSAEVMKNLAASYAANGDLNKAIVSYKHALESTPD 295
>gi|170691501|ref|ZP_02882666.1| TPR repeat-containing protein [Burkholderia graminis C4D1M]
gi|170143706|gb|EDT11869.1| TPR repeat-containing protein [Burkholderia graminis C4D1M]
Length = 613
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 22/76 (28%), Positives = 36/76 (47%)
Query: 151 ILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVLL 210
I+ ++GW + G+ T A V RKA I P+A +L L K +AR+ +
Sbjct: 534 IMDSVGWVKYRMGDTTDAIKVLRKAYDIQPNAEIGAHLGEVLWKSGNQDQARAAFREARK 593
Query: 211 GKLSGSTETKTINRVK 226
+ T KT+ R++
Sbjct: 594 LEPDNETLVKTLQRLQ 609
>gi|427706474|ref|YP_007048851.1| NB-ARC domain-containing protein [Nostoc sp. PCC 7107]
gi|427358979|gb|AFY41701.1| NB-ARC domain protein [Nostoc sp. PCC 7107]
Length = 980
Score = 37.4 bits (85), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 47/166 (28%), Positives = 74/166 (44%), Gaps = 35/166 (21%)
Query: 52 VDSALKDMAVVLKQQDRVDEA----VEAIKSFRHLCSKQ---AQESLDNVLIDLYKKCGR 104
V ++L ++A + + QDR EA ++A+ R L ++ ++L+N+ + LY+ GR
Sbjct: 626 VANSLNNLAELYRSQDRYSEAEALHIQALALRRKLLGEEHPDVAQTLNNLAV-LYRFQGR 684
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
E L H +A + H TS L NL Y +G
Sbjct: 685 YSEAEAL---------HIQALALRRKLLGEEHPDVL-------TS--LNNLALLYYGQGK 726
Query: 165 YTSAEVVY------RKAQLID--PD-ANKACNLSHCLIKQARYTEA 201
Y+ AE +Y R+ L D PD AN NL++ Q+RY+EA
Sbjct: 727 YSEAEPLYIQALKLRRKLLGDEHPDVANSLHNLAYLYRSQSRYSEA 772
>gi|39996647|ref|NP_952598.1| hypothetical protein GSU1547 [Geobacter sulfurreducens PCA]
gi|409912070|ref|YP_006890535.1| hypothetical protein KN400_1571 [Geobacter sulfurreducens KN400]
gi|39983528|gb|AAR34921.1| TPR domain protein [Geobacter sulfurreducens PCA]
gi|298505660|gb|ADI84383.1| TPR domain protein [Geobacter sulfurreducens KN400]
Length = 473
Score = 37.4 bits (85), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHCLIKQARYTE 200
+ ++ ++ NLG + AE YR A +DPD ++A NLS+ L++Q R+ E
Sbjct: 105 IELEPDSPAAWSNLGVLQACRKQEREAEESYRTALRLDPDYHRASFNLSYLLLRQGRFDE 164
Query: 201 ARSVLE 206
S+LE
Sbjct: 165 GWSLLE 170
>gi|294138897|ref|YP_003554875.1| cellulose synthase operon C protein [Shewanella violacea DSS12]
gi|293325366|dbj|BAJ00097.1| cellulose synthase operon C protein, putative [Shewanella violacea
DSS12]
Length = 1195
Score = 37.4 bits (85), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 21/83 (25%), Positives = 40/83 (48%), Gaps = 15/83 (18%)
Query: 139 KFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCL------ 192
++ +T + + +LG +G Y++ GN T A ++KAQ +D D + A +
Sbjct: 273 RYALTTRPDDPELLGAIGTIYLRGGNQTQALAYFKKAQELDKDPDAASKWRSLIKVSSYW 332
Query: 193 ---------IKQARYTEARSVLE 206
+K+ RYT+A+ + E
Sbjct: 333 AYLKQGDTALKENRYTDAKHLYE 355
>gi|298492037|ref|YP_003722214.1| hypothetical protein Aazo_3466 ['Nostoc azollae' 0708]
gi|298233955|gb|ADI65091.1| TPR repeat-containing protein ['Nostoc azollae' 0708]
Length = 374
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 75/172 (43%), Gaps = 29/172 (16%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
EAAI + +A+ A ++ +A ++ V+L +++R EA+ A + +L A N+
Sbjct: 148 EAAIAAYRQALAANPKMAAAQYNLGVILYEEERCQEAIAAYQEAINLDRNNANAYF-NLA 206
Query: 96 IDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNL 155
I L ++ G+L++ I +Q L++ E + NL
Sbjct: 207 IAL-QQEGQLEQAIATYRQILKL--------------------------NPENTVAYNNL 239
Query: 156 GWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTEARSVLE 206
G + +G + A +Y+KA +P +A NL L Q EA + L+
Sbjct: 240 GSLMVIQGQPSEAIAIYQKAIGQNPKNALAYYNLGVTLYNQGNLKEANAALK 291
>gi|193215104|ref|YP_001996303.1| tetratricopeptide domain-containing protein [Chloroherpeton
thalassium ATCC 35110]
gi|193088581|gb|ACF13856.1| Tetratricopeptide TPR_2 repeat protein [Chloroherpeton thalassium
ATCC 35110]
Length = 638
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 82/189 (43%), Gaps = 45/189 (23%)
Query: 29 QLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV----EAIKSFRHLCS 84
+L E++P+A ++ V +L ++A+V+ QQ R +EA +A+K
Sbjct: 271 KLAEENPDAYLL----------DVAESLYNLAIVMYQQGRYEEAEPLHRQALKIREEKLG 320
Query: 85 K---QAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQ 141
SL+N+ I + K GR +E L ++ L++ + K H
Sbjct: 321 DDHPDVASSLNNLAI-VMSKQGRYEEAEPLYQRALKI---------REEKLGDDH----- 365
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLI--------DPDANKAC-NLSHCL 192
+ ++ L NL Q+G Y AE ++R+A I PD K+ NL+ +
Sbjct: 366 ----PDVAKSLNNLAIVMHQQGRYEEAEPLHRQALKIREEKLGDDHPDVAKSLYNLASVM 421
Query: 193 IKQARYTEA 201
+Q RY EA
Sbjct: 422 YEQGRYEEA 430
Score = 37.0 bits (84), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 70/165 (42%), Gaps = 33/165 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYK------KCGRL 105
V S+L ++A+V+ QQ R +EA + + ++ + +V LY K GR
Sbjct: 452 VASSLNNLAIVMHQQGRYEEAESLYQRAIKILEEKLGDGHPDVATSLYNLANVMSKQGRY 511
Query: 106 DEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNY 165
+E L +Q L+ I+ + + P + + L NL ++G Y
Sbjct: 512 EEAEPLYRQALK-IWEEKLGDDHP-----------------DVASSLNNLAIVMSKQGRY 553
Query: 166 TSAEVVYRKAQLIDPD---------ANKACNLSHCLIKQARYTEA 201
AE ++R+A I + A+ NL+ + KQ RY EA
Sbjct: 554 EEAEPLHRQALKIREEKLGDDHPYVAHSLNNLALMMSKQGRYEEA 598
>gi|410664414|ref|YP_006916785.1| hypothetical protein M5M_09350 [Simiduia agarivorans SA1 = DSM
21679]
gi|409026771|gb|AFU99055.1| hypothetical protein M5M_09350 [Simiduia agarivorans SA1 = DSM
21679]
Length = 577
Score = 37.4 bits (85), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 38/145 (26%), Positives = 60/145 (41%), Gaps = 27/145 (18%)
Query: 60 AVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMI 119
A+V +Q DR+D A + ++ A +L+ + L + R DE +L+ Q
Sbjct: 435 ALVNEQLDRLDLAEQDLRKVIRYEPNNAT-ALNALGYTLADRTDRYDEAYQLISQ----- 488
Query: 120 YHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLID 179
A + KP A I+ +LGW + + GNY A + R+A +
Sbjct: 489 ----AHSLKPGDAA-----------------IIDSLGWVHYRLGNYEEALLRLREALQLF 527
Query: 180 PDANKACNLSHCLIKQARYTEARSV 204
PD A +L L EAR+V
Sbjct: 528 PDPEIAAHLGEVLWVTGNQAEARAV 552
>gi|302391517|ref|YP_003827337.1| hypothetical protein [Acetohalobium arabaticum DSM 5501]
gi|302203594|gb|ADL12272.1| Tetratricopeptide TPR_2 repeat protein [Acetohalobium arabaticum
DSM 5501]
Length = 251
Score = 37.4 bits (85), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 29/123 (23%), Positives = 53/123 (43%), Gaps = 28/123 (22%)
Query: 58 DMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLR 117
++ V +QD++D+A+E K + + + Q + NVL Y+K ++ E I+ K+
Sbjct: 106 NLGRVYLKQDKIDQAIEITK--QAIETNQKIDDAYNVLGRAYRKAEKIKEAIDSFKK--- 160
Query: 118 MIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQL 177
V + E +L NLG+ Y+Q Y A V ++A
Sbjct: 161 -----------------------AVELNDENYYVLNNLGYTYIQTNQYKEAISVLKQAVA 197
Query: 178 IDP 180
++P
Sbjct: 198 LNP 200
>gi|443242367|ref|YP_007375592.1| TPR-domain containing protein [Nonlabens dokdonensis DSW-6]
gi|442799766|gb|AGC75571.1| TPR-domain containing protein [Nonlabens dokdonensis DSW-6]
Length = 1000
Score = 37.4 bits (85), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 48/202 (23%), Positives = 83/202 (41%), Gaps = 24/202 (11%)
Query: 59 MAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKL-- 116
+ + + D V+ A E S + E L DL K G+ DE I+L K++L
Sbjct: 172 LGYIAYENDEVETATELFNQVGE--SPEDDEKLSYYKADLNYKSGKFDEAIKLAKEQLPK 229
Query: 117 ----------RMIYHGEA-FNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNY 165
R+I G++ FN + A + ++++ T + + LG+AY ++GN+
Sbjct: 230 SNRREKSQLNRII--GQSLFNQEKYDEALPYLQEYEGTRGKWNNNDYYQLGYAYYKQGNF 287
Query: 166 TSAEVVYRKAQLIDPDANKACN----LSHCLIKQARYTEARSVLEDVLLGKLSGS-TETK 220
A + K +ID A N L IK R +A + + L +E
Sbjct: 288 EKAIETFNK--IIDGANATAQNAYYHLGQSYIKLDRNEDALNAFKKASEMDLDAQISEDA 345
Query: 221 TINRVKELLQELEPWQSIPPSL 242
+ N K + P+ S+P +
Sbjct: 346 SYNYAKLSYENGNPYDSVPAVI 367
>gi|119487755|ref|ZP_01621264.1| TPR repeat protein [Lyngbya sp. PCC 8106]
gi|119455588|gb|EAW36725.1| TPR repeat protein [Lyngbya sp. PCC 8106]
Length = 998
Score = 37.4 bits (85), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 21/73 (28%), Positives = 40/73 (54%), Gaps = 3/73 (4%)
Query: 140 FQVTVKQETSRILG--NLGWAYMQKGNYTSAEVVYRKAQLIDPDA-NKACNLSHCLIKQA 196
+Q ++ + S + NLG Q+G+ T AE +YRKA ++P+ NL + L++Q
Sbjct: 343 YQTAIELDNSLVYAYQNLGNLIKQQGSLTEAEAIYRKAIEVNPNHWGSYLNLGNILLEQT 402
Query: 197 RYTEARSVLEDVL 209
+ +A ++ + L
Sbjct: 403 HFEDAIAIYQTAL 415
>gi|3023156|gb|AAD13354.1| decahistidyl-kinesin light chain [Expression vector pPK121]
Length = 524
Score = 37.4 bits (85), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 35/174 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
V + L +A+V + Q++ EA + + K E +L+N+ + LY K G+
Sbjct: 227 VATMLNILALVYRDQNKYKEAANLLNDALSIRGKTLGENHPAVAATLNNLAV-LYGKRGK 285
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
+ L K+ L + + GK KQ L NL +G
Sbjct: 286 YKDAEPLCKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGK 327
Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
Y E Y++A I DP+ K NL+ C +KQ RYTEA + + VL
Sbjct: 328 YDEVEKYYQRALDIYESKLGPDDPNVAKTKNNLAGCYLKQGRYTEAEILYKQVL 381
>gi|195589902|ref|XP_002084688.1| GD12703 [Drosophila simulans]
gi|194196697|gb|EDX10273.1| GD12703 [Drosophila simulans]
Length = 634
Score = 37.4 bits (85), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 35/174 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
V + L +A+V + Q++ EA + + K E +L+N+ + LY K G+
Sbjct: 227 VATMLNILALVYRDQNKYKEAANLLNDALSIRGKTLGENHPAVAATLNNLAV-LYGKRGK 285
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
+ L K+ L + + GK KQ L NL +G
Sbjct: 286 YKDAEPLCKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGK 327
Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
Y E Y++A I DP+ K NL+ C +KQ RYTEA + + VL
Sbjct: 328 YDEVEKYYQRALDIYESKLGPDDPNVAKTKNNLAGCYLKQGRYTEAEILYKQVL 381
>gi|297539283|ref|YP_003675052.1| hypothetical protein M301_2106 [Methylotenera versatilis 301]
gi|297258630|gb|ADI30475.1| Tetratricopeptide TPR_2 repeat protein [Methylotenera versatilis
301]
Length = 579
Score = 37.4 bits (85), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 19/68 (27%), Positives = 34/68 (50%)
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEA 201
+T+ +L +LGWAY +KG A +A ++ D A +L L ++ ++ EA
Sbjct: 494 LTISPNDHYMLDSLGWAYYRKGKLDKAITYLEQAYQVNQDPEIAAHLGEVLWQKGKHDEA 553
Query: 202 RSVLEDVL 209
+ + D L
Sbjct: 554 KKIWTDAL 561
>gi|78067598|ref|YP_370367.1| hypothetical protein Bcep18194_A6129 [Burkholderia sp. 383]
gi|77968343|gb|ABB09723.1| TPR repeat protein [Burkholderia sp. 383]
Length = 607
Score = 37.4 bits (85), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 22/85 (25%), Positives = 39/85 (45%)
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEA 201
+T+ + I+ +LGW + G+ A V R+A + P+A +L L K +A
Sbjct: 519 MTLAPNDAYIMDSLGWVKYRMGDTAGATKVLRRAYELQPNAEIGAHLGEVLWKSGAQDDA 578
Query: 202 RSVLEDVLLGKLSGSTETKTINRVK 226
RS D + T +T+ R++
Sbjct: 579 RSAWRDAQKLEPDNDTLVQTLKRLQ 603
>gi|339499059|ref|YP_004697094.1| hypothetical protein Spica_0422 [Spirochaeta caldaria DSM 7334]
gi|338833408|gb|AEJ18586.1| Tetratricopeptide TPR_1 repeat-containing protein [Spirochaeta
caldaria DSM 7334]
Length = 875
Score = 37.4 bits (85), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 45/176 (25%), Positives = 76/176 (43%), Gaps = 15/176 (8%)
Query: 58 DMAVVLKQQDRVD--EAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCG-RLDEQIELLKQ 114
D A++L D VD E + A ++ L E + KKC RLD I+
Sbjct: 616 DNALLLLNDDAVDDPEGLLAHQAGTILYRADRLEEAKAYFAEALKKCPERLDYLIDQANC 675
Query: 115 KLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRK 174
+ + +GEA T A+++ + +E +L +G+ +KG + AE YR
Sbjct: 676 LINLGLYGEA----DTLLAKAYDR-------EENEAVLDLIGYIAYKKGEFPRAEAAYRL 724
Query: 175 AQLIDPDANKACN-LSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELL 229
A DPD + L+ I AR+T A ++ + S E +++ ++E L
Sbjct: 725 ALERDPDNTSFLSALAWIYITMARWTSAEDCIQKLEHLVAPESVEYESVQELRERL 780
>gi|255035013|ref|YP_003085634.1| hypothetical protein Dfer_1220 [Dyadobacter fermentans DSM 18053]
gi|254947769|gb|ACT92469.1| Tetratricopeptide TPR_2 repeat protein [Dyadobacter fermentans DSM
18053]
Length = 399
Score = 37.0 bits (84), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 2/72 (2%)
Query: 35 PEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNV 94
P+ A++ + KA+ D AL +A V QQ A+ A+K L K + L+N+
Sbjct: 250 PDTALIFYQKALAIQPDYDEALNALATVAMQQRAYPAAIPALKKL--LRKKPQSKELNNM 307
Query: 95 LIDLYKKCGRLD 106
L Y+K G LD
Sbjct: 308 LGYAYEKSGELD 319
>gi|299132292|ref|ZP_07025487.1| Tetratricopeptide TPR_2 repeat protein [Afipia sp. 1NLS2]
gi|298592429|gb|EFI52629.1| Tetratricopeptide TPR_2 repeat protein [Afipia sp. 1NLS2]
Length = 269
Score = 37.0 bits (84), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 40/81 (49%), Gaps = 3/81 (3%)
Query: 127 GKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKA 185
GKP + R + ++ + + +L NLG +Y+ AE R+AQ + P D+
Sbjct: 152 GKPEEARRYYASALKI--RPDDPSVLSNLGLSYVLTKELPQAEATLRRAQKLAPNDSRIR 209
Query: 186 CNLSHCLIKQARYTEARSVLE 206
NL L Q R+ EA ++++
Sbjct: 210 QNLGLALGLQGRFAEAEAIVQ 230
>gi|325983627|ref|YP_004296029.1| hypothetical protein NAL212_3105 [Nitrosomonas sp. AL212]
gi|325533146|gb|ADZ27867.1| hypothetical protein NAL212_3105 [Nitrosomonas sp. AL212]
Length = 578
Score = 37.0 bits (84), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 40/91 (43%), Gaps = 9/91 (9%)
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEA 201
V + E I+ +LGW Y + GN T A + PDA A +L L Q +A
Sbjct: 488 VELAPEDPYIMDSLGWVYYRMGNLTEGLNYLNLAFSVRPDAEIAAHLGEVLWMQGAKNDA 547
Query: 202 ----RSVLE-----DVLLGKLSGSTETKTIN 223
RS LE +VLL L T+ K I+
Sbjct: 548 LNIWRSALEKEPGNEVLLETLQRLTKKKVID 578
>gi|325283109|ref|YP_004255650.1| hypothetical protein Deipr_0877 [Deinococcus proteolyticus MRP]
gi|324314918|gb|ADY26033.1| hypothetical protein Deipr_0877 [Deinococcus proteolyticus MRP]
Length = 271
Score = 37.0 bits (84), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 41/160 (25%), Positives = 64/160 (40%), Gaps = 20/160 (12%)
Query: 69 VDEAVEAIKSFRHLCSKQAQESLD--------NVLIDLYKKCGRLDEQIELLKQKLRMIY 120
V VEA+K + +AQ++L + L L + GRLDE L+ L IY
Sbjct: 83 VQSGVEAVKPLDTKIA-EAQQALSRSDTLANRSSLAHLLTRAGRLDEAQSTLQPLLTGIY 141
Query: 121 HGEAF-----------NGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAE 169
+ G+P + G+ T ++ L L A Q+GN + AE
Sbjct: 142 RDDPVVLLTSAALDLAQGQPALAEQKLGQVELQTSAATRTQALTLLAEAQAQQGNLSGAE 201
Query: 170 VVYRKAQLIDPDANKACNLSHCLIKQARYTEARSVLEDVL 209
Y++A + L+KQ R +A+ VL +L
Sbjct: 202 ARYQEAMHAATTEEPRARYAEFLLKQGRTEQAQEVLNRLL 241
>gi|442632013|ref|NP_001261781.1| kinesin light chain, isoform C [Drosophila melanogaster]
gi|440215713|gb|AGB94474.1| kinesin light chain, isoform C [Drosophila melanogaster]
Length = 507
Score = 37.0 bits (84), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 35/174 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
V + L +A+V + Q++ EA + + K E +L+N+ + LY K G+
Sbjct: 227 VATMLNILALVYRDQNKYKEAANLLNDALSIRGKTLGENHPAVAATLNNLAV-LYGKRGK 285
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
+ L K+ L + + GK KQ L NL +G
Sbjct: 286 YKDAEPLCKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGK 327
Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
Y E Y++A I DP+ K NL+ C +KQ RYTEA + + VL
Sbjct: 328 YDEVEKYYQRALDIYESKLGPDDPNVAKTKNNLAGCYLKQGRYTEAEILYKQVL 381
>gi|292620687|ref|XP_001921543.2| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Danio rerio]
Length = 1102
Score = 37.0 bits (84), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 65/162 (40%), Gaps = 37/162 (22%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAV----EAIKSFRHLCSKQAQESL 91
E A+ L+ KA+ +A ++A VL+QQ ++ EA+ EAI+ +
Sbjct: 400 EEAVQLYRKALEVFPDFAAAHSNLASVLQQQGKLQEALMHYEEAIR-------------I 446
Query: 92 DNVLIDLYKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI 151
D Y G ++++ ++ LR + A P A +H
Sbjct: 447 SPTFADAYSNMGNTLKEMQDIQGALRC--YTRAIQINPA-FADAH--------------- 488
Query: 152 LGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKA-CNLSHCL 192
NL + GN A YR A + PD A CNL+HCL
Sbjct: 489 -SNLASIHKDSGNIPEAIASYRTALKLKPDFPDAYCNLAHCL 529
>gi|113477254|ref|YP_723315.1| hypothetical protein Tery_3796 [Trichodesmium erythraeum IMS101]
gi|110168302|gb|ABG52842.1| Tetratricopeptide TPR_2 [Trichodesmium erythraeum IMS101]
Length = 385
Score = 37.0 bits (84), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 50/111 (45%), Gaps = 3/111 (2%)
Query: 99 YKKCGRLDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQ--VTVKQETSRILGNLG 156
Y++ R+ +Q L K IY G A+ + ++ +Q + + ++ LG
Sbjct: 78 YQEAIRIYQQAVRLDPKNATIYSGIAYLEAVQENFQAAALFYQQAIAIDPHNAKFQYGLG 137
Query: 157 WAYMQKGNYTSAEVVYRKAQLID-PDANKACNLSHCLIKQARYTEARSVLE 206
+A + NY +A YR+A L+D + N L L + +Y EAR E
Sbjct: 138 YAMAKLENYQAAVEAYRRASLLDRNNVNTHLGLGAALFRLEKYGEARVAYE 188
>gi|304392377|ref|ZP_07374318.1| tetratricopeptide TPR_4 containing protein [Ahrensia sp. R2A130]
gi|303295481|gb|EFL89840.1| tetratricopeptide TPR_4 containing protein [Ahrensia sp. R2A130]
Length = 260
Score = 37.0 bits (84), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 54/114 (47%), Gaps = 9/114 (7%)
Query: 127 GKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC 186
G P + + K Q+ K +L NLG +Y+ +G+ ++E Y K + P A+
Sbjct: 148 GDPKQARLQYRKALQL--KPGEPSVLSNLGMSYLLEGDLRASET-YLKQAIGRPGADSRV 204
Query: 187 --NLSHCLIKQARYTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSI 238
NL+ + Q R+ EA E V G+L + + ++++LQE W+ +
Sbjct: 205 RQNLALVIGLQGRFPEA----EKVAAGELPPAEAQANVAFLRQMLQEQNAWKEL 254
>gi|168061951|ref|XP_001782948.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162665566|gb|EDQ52246.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 899
Score = 37.0 bits (84), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 35/151 (23%), Positives = 64/151 (42%), Gaps = 30/151 (19%)
Query: 36 EAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVL 95
+ A+V++ A++ A ++ V+ K +D +D+AVE + + +Q SL+N+
Sbjct: 292 DMAVVMYELALHFNPHCAEACNNLGVIYKDRDNLDKAVECYQMALQIKPNFSQ-SLNNLG 350
Query: 96 IDLYKKCGRLDEQIELL-KQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGN 154
+ +Y G++D ++ K L + EA+N N
Sbjct: 351 V-VYTVQGKMDSAASMIEKAILANPSYAEAYN---------------------------N 382
Query: 155 LGWAYMQKGNYTSAEVVYRKAQLIDPDANKA 185
LG + GN A Y + +IDPD+ A
Sbjct: 383 LGVLHRDAGNIPLAIDAYERCLVIDPDSRNA 413
>gi|414075408|ref|YP_006994726.1| TPR repeat domain-containing protein [Anabaena sp. 90]
gi|413968824|gb|AFW92913.1| TPR repeat domain-containing protein [Anabaena sp. 90]
Length = 755
Score = 37.0 bits (84), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 87/211 (41%), Gaps = 22/211 (10%)
Query: 14 KLPPGDS-PYVRAKHVQLVEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEA 72
+L P D+ YV +V + P+ A+ + KAI + A + L Q+++ EA
Sbjct: 92 QLNPNDAEAYVGIGNVLNAQGKPDEAVAAYRKAIEFDPKYAKAYNSLGNALYDQEKLKEA 151
Query: 73 VEAIK---SFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLRMI-YHGEAFNGK 128
V A + F H + A +L NVL + LDE + ++ + + + A+N
Sbjct: 152 VAAYRKAIEFDHKYAA-AYYNLGNVLYE----QKELDEAVAAYRKAIELNPKYATAYNN- 205
Query: 129 PTKTARSHGKKFQ--VTVKQETSRI-------LGNLGWAYMQKGNYTSAEVVYRKAQLID 179
A S KK V QE ++ NLG A + A Y+KA +D
Sbjct: 206 -LGNALSDQKKLDEAVAAYQEAIKLNPKDATAYNNLGIALSDQKKLDEAVAAYQKAIELD 264
Query: 180 PD-ANKACNLSHCLIKQARYTEARSVLEDVL 209
P A NL + L Q + EA + + +
Sbjct: 265 PKYATAYYNLGNALSDQKKLDEAVAAYQKAI 295
>gi|333891999|ref|YP_004465874.1| hypothetical protein ambt_02590 [Alteromonas sp. SN2]
gi|332992017|gb|AEF02072.1| tetratricopeptide TPR_2 [Alteromonas sp. SN2]
Length = 928
Score = 37.0 bits (84), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 37/73 (50%), Gaps = 1/73 (1%)
Query: 142 VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDP-DANKACNLSHCLIKQARYTE 200
+++ T +IL GW ++GNYT A RKA D D + +L L+KQ ++ +
Sbjct: 841 LSLSPNTPQILDTYGWILARQGNYTEALPNLRKAYSFDAVDPSIMYHLGFVLLKQGKHQQ 900
Query: 201 ARSVLEDVLLGKL 213
A+ L + L L
Sbjct: 901 AKVELANALASDL 913
>gi|398942265|ref|ZP_10670203.1| Flp pilus assembly protein TadD [Pseudomonas sp. GM41(2012)]
gi|398160817|gb|EJM49072.1| Flp pilus assembly protein TadD [Pseudomonas sp. GM41(2012)]
Length = 574
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 29/61 (47%)
Query: 144 VKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKACNLSHCLIKQARYTEARS 203
+ E +L +LGW + GN+ AE + R+A PD A +L L + EAR
Sbjct: 487 INPEDPAVLDSLGWVNFRMGNFDEAERLLRQALERFPDQEVAAHLGEVLWANGKQREARQ 546
Query: 204 V 204
+
Sbjct: 547 I 547
>gi|326918804|ref|XP_003205676.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like,
partial [Meleagris gallopavo]
Length = 1038
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 75/198 (37%), Gaps = 45/198 (22%)
Query: 38 AIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLID 97
AI + +AI A ++A LK++ V EA E + LC A +SL+N L +
Sbjct: 271 AIDTYRRAIELQPHFPDAYCNLANALKEKGSVAEAEECYNTALRLCPTHA-DSLNN-LAN 328
Query: 98 LYKKCGRLDEQIELLKQKLR-------------------------MIYHGEAFNGKPT-- 130
+ ++ G ++E + L ++ L ++++ EA PT
Sbjct: 329 IKREQGNIEEAVRLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFA 388
Query: 131 KTARSHGKKFQ---------------VTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKA 175
+ G + + + + NL + GN A YR A
Sbjct: 389 DAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTA 448
Query: 176 QLIDPDANKA-CNLSHCL 192
+ PD A CNL+HCL
Sbjct: 449 LKLKPDFPDAYCNLAHCL 466
>gi|241998476|ref|XP_002433881.1| O-linked N-acetylglucosamine transferase, OGT, putative [Ixodes
scapularis]
gi|215495640|gb|EEC05281.1| O-linked N-acetylglucosamine transferase, OGT, putative [Ixodes
scapularis]
Length = 832
Score = 37.0 bits (84), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 39/161 (24%), Positives = 73/161 (45%), Gaps = 25/161 (15%)
Query: 58 DMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQESLDNVLIDLYKKCGRLDEQIELLKQKLR 117
++ ++L++Q+R++EA+++ K + A L+ L+ GR DE +++ +
Sbjct: 524 NLGILLQEQNRLEEALQSYKLAIQFRPRLAMAHLNMGLV--LGIMGRKDEAVDVYR---- 577
Query: 118 MIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRI--LGNLGWAYMQKGNYTSAEVVYRKA 175
H + K ++H E+++I L NLG Y +G Y A VY++A
Sbjct: 578 ---HCAQLDSAGLKDPKTH----------ESTKISALFNLGRLYADEGKYKEAIRVYQEA 624
Query: 176 QLIDPDANKACNLSHCL----IKQARYTEARSVLEDVLLGK 212
PD + +L + + K YTEA ++ L K
Sbjct: 625 VAKMPDHYQPQSLYNMMGEAYFKLGEYTEAERWYKEALRAK 665
>gi|195327213|ref|XP_002030316.1| GM24634 [Drosophila sechellia]
gi|194119259|gb|EDW41302.1| GM24634 [Drosophila sechellia]
Length = 498
Score = 37.0 bits (84), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 35/174 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
V + L +A+V + Q++ EA + + K E +L+N+ + LY K G+
Sbjct: 217 VATMLNILALVYRDQNKYKEAANLLNDALSIRGKTLGENHPAVAATLNNLAV-LYGKRGK 275
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
+ L K+ L + + GK KQ L NL +G
Sbjct: 276 YKDAEPLCKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGK 317
Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
Y E Y++A I DP+ K NL+ C +KQ RYTEA + + VL
Sbjct: 318 YDEVEKYYQRALDIYESKLGPDDPNVAKTKNNLAGCYLKQGRYTEAEILYKQVL 371
>gi|17647555|ref|NP_524049.1| kinesin light chain, isoform A [Drosophila melanogaster]
gi|442632011|ref|NP_001261780.1| kinesin light chain, isoform B [Drosophila melanogaster]
gi|1170679|sp|P46824.1|KLC_DROME RecName: Full=Kinesin light chain; Short=KLC
gi|157782|gb|AAA02481.1| kinesin light chain [Drosophila melanogaster]
gi|157814|gb|AAA28669.1| kinesin light chain [Drosophila melanogaster]
gi|16768986|gb|AAL28712.1| LD13018p [Drosophila melanogaster]
gi|23093578|gb|AAF49890.2| kinesin light chain, isoform A [Drosophila melanogaster]
gi|220943240|gb|ACL84163.1| Klc-PA [synthetic construct]
gi|220953428|gb|ACL89257.1| Klc-PA [synthetic construct]
gi|440215712|gb|AGB94473.1| kinesin light chain, isoform B [Drosophila melanogaster]
Length = 508
Score = 37.0 bits (84), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 35/174 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
V + L +A+V + Q++ EA + + K E +L+N+ + LY K G+
Sbjct: 227 VATMLNILALVYRDQNKYKEAANLLNDALSIRGKTLGENHPAVAATLNNLAV-LYGKRGK 285
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
+ L K+ L + + GK KQ L NL +G
Sbjct: 286 YKDAEPLCKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGK 327
Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
Y E Y++A I DP+ K NL+ C +KQ RYTEA + + VL
Sbjct: 328 YDEVEKYYQRALDIYESKLGPDDPNVAKTKNNLAGCYLKQGRYTEAEILYKQVL 381
>gi|348687602|gb|EGZ27416.1| hypothetical protein PHYSODRAFT_553979 [Phytophthora sojae]
Length = 986
Score = 37.0 bits (84), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 54/241 (22%), Positives = 97/241 (40%), Gaps = 22/241 (9%)
Query: 31 VEKDPEAAIVLFWKAINAGDRVDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCS--KQAQ 88
V+ E A + + +AI A A ++A +LK+ +++ A++ + L A
Sbjct: 184 VQGRVEDAKLCYEQAIRAKPSFAIAWSNLAGLLKEDGQLEAAIDHYREAIRLAPDFADAY 243
Query: 89 ESLDNVLIDLYKKCGRLDEQIELLKQKLRM-----IYHGE---AFNGKPTKTARSHGKKF 140
+L N L K+ GR++E I+ K L++ I HG + H +
Sbjct: 244 SNLGNAL----KEAGRVEEAIQAYKSALQIRPNFAIAHGNLASCYYDAGQMELAIHTFRH 299
Query: 141 QVTVKQETSRILGNLGWAYMQKGNYTSAEVVYRKAQLIDPDANKAC-NLSHC-----LIK 194
+ ++ NLG A + G A YR A + PD A NL + L+K
Sbjct: 300 AIQLEPNFPDAYNNLGNALRECGQLEQAVTCYRTALQLKPDHPHAYNNLGNALKDKGLVK 359
Query: 195 QAR--YTEARSVLEDVLLGKLSGSTETKTINRVKELLQELEPWQSIPPSLTTKKSSLEDA 252
+A YT A +L + + K ++ + L + +I P+ S++ +
Sbjct: 360 EALHCYTTAARLLPQFAAAHSNIGSVLKEQGKLDQALAHYQQAITIDPNFADAYSNMGNV 419
Query: 253 F 253
F
Sbjct: 420 F 420
>gi|194752059|ref|XP_001958340.1| GF10870 [Drosophila ananassae]
gi|190625622|gb|EDV41146.1| GF10870 [Drosophila ananassae]
Length = 508
Score = 37.0 bits (84), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 35/174 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
V + L +A+V + Q++ EA + + K E +L+N+ + LY K G+
Sbjct: 227 VATMLNILALVYRDQNKYKEAANLLNDALSIRGKTLGENHPAVAATLNNLAV-LYGKRGK 285
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
+ L K+ L + + GK KQ L NL +G
Sbjct: 286 YKDAEPLCKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGK 327
Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
Y E Y++A I DP+ K NL+ C +KQ RYTEA + + VL
Sbjct: 328 YDEVEKYYQRALDIYESKLGPDDPNVAKTKNNLAGCYLKQGRYTEAEILYKQVL 381
>gi|195160695|ref|XP_002021210.1| GL24935 [Drosophila persimilis]
gi|198464928|ref|XP_001353418.2| GA18878 [Drosophila pseudoobscura pseudoobscura]
gi|194118323|gb|EDW40366.1| GL24935 [Drosophila persimilis]
gi|198149940|gb|EAL30925.2| GA18878 [Drosophila pseudoobscura pseudoobscura]
Length = 508
Score = 37.0 bits (84), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 35/174 (20%)
Query: 52 VDSALKDMAVVLKQQDRVDEAVEAIKSFRHLCSKQAQE-------SLDNVLIDLYKKCGR 104
V + L +A+V + Q++ EA + + K E +L+N+ + LY K G+
Sbjct: 227 VATMLNILALVYRDQNKYKEAANLLNDALSIRGKTLGENHPAVAATLNNLAV-LYGKRGK 285
Query: 105 LDEQIELLKQKLRMIYHGEAFNGKPTKTARSHGKKFQVTVKQETSRILGNLGWAYMQKGN 164
+ L K+ L + + GK KQ L NL +G
Sbjct: 286 YKDAEPLCKRALEI-------------REKVLGKDHPDVAKQ-----LNNLALLCQNQGK 327
Query: 165 YTSAEVVYRKAQLI--------DPDANKA-CNLSHCLIKQARYTEARSVLEDVL 209
Y E Y++A I DP+ K NL+ C +KQ RYTEA + + VL
Sbjct: 328 YDEVEKYYQRALDIYESKLGPDDPNVAKTKNNLAGCYLKQGRYTEAEILYKQVL 381
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.132 0.385
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,274,884,084
Number of Sequences: 23463169
Number of extensions: 163823994
Number of successful extensions: 540664
Number of sequences better than 100.0: 840
Number of HSP's better than 100.0 without gapping: 220
Number of HSP's successfully gapped in prelim test: 620
Number of HSP's that attempted gapping in prelim test: 538787
Number of HSP's gapped (non-prelim): 2141
length of query: 287
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 146
effective length of database: 9,050,888,538
effective search space: 1321429726548
effective search space used: 1321429726548
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)