BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy15362
(200 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|242023889|ref|XP_002432363.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
putative [Pediculus humanus corporis]
gi|212517786|gb|EEB19625.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
putative [Pediculus humanus corporis]
Length = 1041
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 174/200 (87%), Positives = 189/200 (94%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ+QTSVNGIV+QNGLATNQTN K ATGEEVPQSIVIT+RQQY LP+DA+VYCNFN
Sbjct: 788 MIASGQIQTSVNGIVVQNGLATNQTNNKAATGEEVPQSIVITTRQQYNLPDDAVVYCNFN 847
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQMW+N+LK VPN+ILWLL+FPAVGE+NI ATAQ LG+ RILFSNVAAK
Sbjct: 848 QLYKIDPITLQMWMNILKHVPNAILWLLRFPAVGESNIHATAQQLGVSPARILFSNVAAK 907
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 908 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 967
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IAR+ ++YQDIAIRLGTDR+
Sbjct: 968 IARSRQDYQDIAIRLGTDRE 987
>gi|357612087|gb|EHJ67792.1| hypothetical protein KGM_21815 [Danaus plexippus]
Length = 1054
Score = 367 bits (943), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 172/200 (86%), Positives = 186/200 (93%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQVQTSVNG++LQNGLAT QTN K ATGEEVPQSIVIT+RQQYGLP+DA+VYCNFN
Sbjct: 802 MIASGQVQTSVNGVILQNGLATTQTNNKAATGEEVPQSIVITTRQQYGLPDDAVVYCNFN 861
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MWV +LK VPNS+LWLL+FPAVGE N+QATAQ LGL RI+FSNVAAK
Sbjct: 862 QLYKIDPLTLHMWVYILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLPPGRIIFSNVAAK 921
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMD+LWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 922 EEHVRRGQLADVCLDTPLCNGHTTSMDILWTGTPVVTLPGETLASRVAASQLNTLGCPEL 981
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART +EYQDIA+RLGTDR+
Sbjct: 982 IARTRQEYQDIAVRLGTDRE 1001
>gi|270004555|gb|EFA01003.1| hypothetical protein TcasGA2_TC003916 [Tribolium castaneum]
Length = 1054
Score = 367 bits (941), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 171/200 (85%), Positives = 188/200 (94%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+RQQYGLP+DA+VYCNFN
Sbjct: 800 MIASGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRQQYGLPDDAVVYCNFN 859
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MWVN+LKAVPNS+LWLL+FPAVGE N+QATAQ LGL RILFSNVAAK
Sbjct: 860 QLYKIDPMTLHMWVNILKAVPNSVLWLLRFPAVGEPNLQATAQQLGLPPGRILFSNVAAK 919
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT PGETLASRVAASQLATLGCPEL
Sbjct: 920 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTFPGETLASRVAASQLATLGCPEL 979
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IAR+ +EYQ+IAI+LGTD++
Sbjct: 980 IARSRQEYQEIAIKLGTDKE 999
>gi|189235894|ref|XP_967579.2| PREDICTED: similar to AGAP006254-PA [Tribolium castaneum]
Length = 1086
Score = 366 bits (940), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 171/200 (85%), Positives = 188/200 (94%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+RQQYGLP+DA+VYCNFN
Sbjct: 832 MIASGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRQQYGLPDDAVVYCNFN 891
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MWVN+LKAVPNS+LWLL+FPAVGE N+QATAQ LGL RILFSNVAAK
Sbjct: 892 QLYKIDPMTLHMWVNILKAVPNSVLWLLRFPAVGEPNLQATAQQLGLPPGRILFSNVAAK 951
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT PGETLASRVAASQLATLGCPEL
Sbjct: 952 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTFPGETLASRVAASQLATLGCPEL 1011
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IAR+ +EYQ+IAI+LGTD++
Sbjct: 1012 IARSRQEYQEIAIKLGTDKE 1031
>gi|383866051|ref|XP_003708485.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
[Megachile rotundata]
Length = 1094
Score = 363 bits (931), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 171/200 (85%), Positives = 184/200 (92%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+IVIT+RQQYGLPEDA+VYCNFN
Sbjct: 832 MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIVITTRQQYGLPEDAVVYCNFN 891
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL RILFSNVAAK
Sbjct: 892 QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 951
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 952 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 1011
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART +EYQDIAIRLGTDR+
Sbjct: 1012 IARTRQEYQDIAIRLGTDRE 1031
>gi|350396796|ref|XP_003484668.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
[Bombus impatiens]
Length = 1065
Score = 362 bits (930), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 170/200 (85%), Positives = 184/200 (92%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+I+IT+RQQYGLPEDA+VYCNFN
Sbjct: 802 MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIMITTRQQYGLPEDAVVYCNFN 861
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL RILFSNVAAK
Sbjct: 862 QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 921
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 922 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 981
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART +EYQDIAIRLGTDR+
Sbjct: 982 IARTRQEYQDIAIRLGTDRE 1001
>gi|328777929|ref|XP_003249419.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform 1
[Apis mellifera]
Length = 1065
Score = 362 bits (930), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 170/200 (85%), Positives = 184/200 (92%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+I+IT+RQQYGLPEDA+VYCNFN
Sbjct: 802 MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIMITTRQQYGLPEDAVVYCNFN 861
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL RILFSNVAAK
Sbjct: 862 QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 921
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 922 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 981
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART +EYQDIAIRLGTDR+
Sbjct: 982 IARTRQEYQDIAIRLGTDRE 1001
>gi|380019749|ref|XP_003693765.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 2 [Apis florea]
Length = 1065
Score = 362 bits (930), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 170/200 (85%), Positives = 184/200 (92%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+I+IT+RQQYGLPEDA+VYCNFN
Sbjct: 802 MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIMITTRQQYGLPEDAVVYCNFN 861
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL RILFSNVAAK
Sbjct: 862 QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 921
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 922 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 981
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART +EYQDIAIRLGTDR+
Sbjct: 982 IARTRQEYQDIAIRLGTDRE 1001
>gi|340715916|ref|XP_003396453.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 1 [Bombus terrestris]
Length = 1065
Score = 362 bits (930), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 170/200 (85%), Positives = 184/200 (92%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+I+IT+RQQYGLPEDA+VYCNFN
Sbjct: 802 MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIMITTRQQYGLPEDAVVYCNFN 861
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL RILFSNVAAK
Sbjct: 862 QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 921
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 922 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 981
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART +EYQDIAIRLGTDR+
Sbjct: 982 IARTRQEYQDIAIRLGTDRE 1001
>gi|158295621|ref|XP_316319.4| AGAP006254-PA [Anopheles gambiae str. PEST]
gi|157016124|gb|EAA10760.4| AGAP006254-PA [Anopheles gambiae str. PEST]
Length = 1120
Score = 362 bits (929), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 169/200 (84%), Positives = 187/200 (93%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IV+T+RQQYGLP+DAIVYCNFN
Sbjct: 868 MIASGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVVTTRQQYGLPDDAIVYCNFN 927
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQ WV +LK VPNS+LWLL+FPAVGEANIQATAQ +G+ RI+FSNVAAK
Sbjct: 928 QLYKIDPLTLQSWVTILKHVPNSVLWLLRFPAVGEANIQATAQQMGIAAGRIIFSNVAAK 987
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP ETLASRVAASQLATLGCPEL
Sbjct: 988 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPAETLASRVAASQLATLGCPEL 1047
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA++ +EYQDIAI+LGTDR+
Sbjct: 1048 IAKSRQEYQDIAIKLGTDRE 1067
>gi|340715918|ref|XP_003396454.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 2 [Bombus terrestris]
Length = 1095
Score = 362 bits (929), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 170/200 (85%), Positives = 184/200 (92%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+I+IT+RQQYGLPEDA+VYCNFN
Sbjct: 832 MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIMITTRQQYGLPEDAVVYCNFN 891
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL RILFSNVAAK
Sbjct: 892 QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 951
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 952 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 1011
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART +EYQDIAIRLGTDR+
Sbjct: 1012 IARTRQEYQDIAIRLGTDRE 1031
>gi|380019747|ref|XP_003693764.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 1 [Apis florea]
Length = 1095
Score = 362 bits (929), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 170/200 (85%), Positives = 184/200 (92%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+I+IT+RQQYGLPEDA+VYCNFN
Sbjct: 832 MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIMITTRQQYGLPEDAVVYCNFN 891
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL RILFSNVAAK
Sbjct: 892 QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 951
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 952 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 1011
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART +EYQDIAIRLGTDR+
Sbjct: 1012 IARTRQEYQDIAIRLGTDRE 1031
>gi|307193762|gb|EFN76443.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110
kDa subunit [Harpegnathos saltator]
Length = 1180
Score = 362 bits (929), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 170/200 (85%), Positives = 184/200 (92%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+IVIT+RQQYGLPEDA+VYCNFN
Sbjct: 918 MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIVITTRQQYGLPEDAVVYCNFN 977
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL RILFSNVAAK
Sbjct: 978 QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 1037
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 1038 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 1097
Query: 181 IARTHKEYQDIAIRLGTDRD 200
+ART +EYQDIAIRLGTDR+
Sbjct: 1098 VARTRQEYQDIAIRLGTDRE 1117
>gi|110760723|ref|XP_623820.2| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform 2
[Apis mellifera]
Length = 1095
Score = 362 bits (928), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 170/200 (85%), Positives = 184/200 (92%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+I+IT+RQQYGLPEDA+VYCNFN
Sbjct: 832 MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIMITTRQQYGLPEDAVVYCNFN 891
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL RILFSNVAAK
Sbjct: 892 QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 951
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 952 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 1011
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART +EYQDIAIRLGTDR+
Sbjct: 1012 IARTRQEYQDIAIRLGTDRE 1031
>gi|332029693|gb|EGI69572.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110
kDa subunit [Acromyrmex echinatior]
Length = 1087
Score = 361 bits (927), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 169/200 (84%), Positives = 184/200 (92%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+IVIT+RQQYGLPEDA+VYCNFN
Sbjct: 824 MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIVITTRQQYGLPEDAVVYCNFN 883
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL RILFSNVAAK
Sbjct: 884 QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 943
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 944 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 1003
Query: 181 IARTHKEYQDIAIRLGTDRD 200
+ART +EYQDI+IRLGTDR+
Sbjct: 1004 VARTRQEYQDISIRLGTDRE 1023
>gi|328700630|ref|XP_001950113.2| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 1 [Acyrthosiphon pisum]
Length = 1045
Score = 359 bits (922), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 171/201 (85%), Positives = 187/201 (93%), Gaps = 1/201 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLAT-NQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
MIASGQ QTSVNG+VLQNGLA + N K ATGEEVPQ+IVITSRQQY LPEDA+VYCNF
Sbjct: 791 MIASGQAQTSVNGVVLQNGLAVVDNGNNKAATGEEVPQNIVITSRQQYSLPEDAVVYCNF 850
Query: 60 NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
NQLYKIDP+TL MWVN+LKAVPNS+LWLL+FPAVGE +IQ+TA +LG++ RILFSNVAA
Sbjct: 851 NQLYKIDPNTLLMWVNILKAVPNSVLWLLRFPAVGETHIQSTAASLGIEPGRILFSNVAA 910
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE
Sbjct: 911 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 970
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
LIART +EYQDIAIRLGTD++
Sbjct: 971 LIARTREEYQDIAIRLGTDKE 991
>gi|328700632|ref|XP_003241333.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 2 [Acyrthosiphon pisum]
Length = 1090
Score = 359 bits (922), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 171/201 (85%), Positives = 187/201 (93%), Gaps = 1/201 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLAT-NQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
MIASGQ QTSVNG+VLQNGLA + N K ATGEEVPQ+IVITSRQQY LPEDA+VYCNF
Sbjct: 836 MIASGQAQTSVNGVVLQNGLAVVDNGNNKAATGEEVPQNIVITSRQQYSLPEDAVVYCNF 895
Query: 60 NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
NQLYKIDP+TL MWVN+LKAVPNS+LWLL+FPAVGE +IQ+TA +LG++ RILFSNVAA
Sbjct: 896 NQLYKIDPNTLLMWVNILKAVPNSVLWLLRFPAVGETHIQSTAASLGIEPGRILFSNVAA 955
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE
Sbjct: 956 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 1015
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
LIART +EYQDIAIRLGTD++
Sbjct: 1016 LIARTREEYQDIAIRLGTDKE 1036
>gi|312381341|gb|EFR27109.1| hypothetical protein AND_06385 [Anopheles darlingi]
Length = 833
Score = 359 bits (922), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 167/200 (83%), Positives = 187/200 (93%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IV+T+RQQYGLP+DAIVYCNFN
Sbjct: 580 MIASGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVVTTRQQYGLPDDAIVYCNFN 639
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQ W+ +LK VPNS+LWLL+FPAVGEANIQATAQ +G+ RI+FSNVAAK
Sbjct: 640 QLYKIDPLTLQSWIYILKHVPNSVLWLLRFPAVGEANIQATAQQMGIAAGRIIFSNVAAK 699
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP ETLASRVAASQLATLGCPEL
Sbjct: 700 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPAETLASRVAASQLATLGCPEL 759
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA++ +EYQDIA++LGTDR+
Sbjct: 760 IAKSRQEYQDIAVKLGTDRE 779
>gi|157106778|ref|XP_001649477.1| o-linked n-acetylglucosamine transferase, ogt [Aedes aegypti]
gi|108868774|gb|EAT32999.1| AAEL014746-PA [Aedes aegypti]
Length = 793
Score = 359 bits (922), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 167/200 (83%), Positives = 185/200 (92%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ+QTS+NG+V+QNGLAT QTN K ATGEEVPQ+IV+T+RQQYGLP+DA+VYCNFN
Sbjct: 539 MIASGQIQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVVTTRQQYGLPDDAVVYCNFN 598
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL WVN+LK VPNS+LWLL+FPAVGE NIQA AQ LGL RI+FSNVAAK
Sbjct: 599 QLYKIDPHTLASWVNILKHVPNSVLWLLRFPAVGETNIQAAAQQLGLSPGRIIFSNVAAK 658
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP ETLASRVAASQLATLGCPEL
Sbjct: 659 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPAETLASRVAASQLATLGCPEL 718
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IAR+ +EYQDIAI+LGTD++
Sbjct: 719 IARSRQEYQDIAIKLGTDKE 738
>gi|229577290|ref|NP_001153341.1| O-glycosyltransferase [Nasonia vitripennis]
Length = 1061
Score = 357 bits (917), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 167/200 (83%), Positives = 182/200 (91%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+IVIT+RQQYGLPEDA+VYCNFN
Sbjct: 799 MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIVITTRQQYGLPEDAVVYCNFN 858
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MW +LK VPNS+LWLL+FPAVGE N+Q TAQ LGL RILFSNVAAK
Sbjct: 859 QLYKIDPLTLHMWAYILKNVPNSVLWLLRFPAVGEQNLQTTAQQLGLAPGRILFSNVAAK 918
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 919 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 978
Query: 181 IARTHKEYQDIAIRLGTDRD 200
+ART +EYQ+IA+RLGTDR+
Sbjct: 979 VARTRQEYQEIAVRLGTDRE 998
>gi|307173081|gb|EFN64211.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110
kDa subunit [Camponotus floridanus]
Length = 1092
Score = 355 bits (912), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 165/200 (82%), Positives = 182/200 (91%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ Q SVNG+V+QNG+ T Q N K ATGEEVPQ+IVIT+RQQYGLPEDA+VYCNFN
Sbjct: 830 MIASGQCQMSVNGVVVQNGMTTTQVNNKAATGEEVPQNIVITTRQQYGLPEDAVVYCNFN 889
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MW ++LK VPN++LWLL+FPAVGE N+Q+TAQ LGL RILFSNVAAK
Sbjct: 890 QLYKIDPLTLHMWAHILKHVPNAVLWLLRFPAVGEPNLQSTAQQLGLTPGRILFSNVAAK 949
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCP+L
Sbjct: 950 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPDL 1009
Query: 181 IARTHKEYQDIAIRLGTDRD 200
+ART +EYQDIAIRLGTDR+
Sbjct: 1010 VARTRQEYQDIAIRLGTDRE 1029
>gi|170036979|ref|XP_001846338.1| o-linked N-acetylglucosamine transferase, ogt [Culex
quinquefasciatus]
gi|167879966|gb|EDS43349.1| o-linked N-acetylglucosamine transferase, ogt [Culex
quinquefasciatus]
Length = 838
Score = 355 bits (912), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 164/200 (82%), Positives = 185/200 (92%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQ+QTS+NG+V+QNGLAT QTN K ATGEEVPQ+IV+T+RQQYGLP+DA++YCNFN
Sbjct: 579 MIASGQIQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVVTTRQQYGLPDDAVIYCNFN 638
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL W N+LK VPN++LWLL+FPAVGEANIQA AQ LG+ RI+FSNVAAK
Sbjct: 639 QLYKIDPHTLTSWCNILKHVPNAVLWLLRFPAVGEANIQAAAQQLGVAPGRIIFSNVAAK 698
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP ETLASRVAASQLATLGCPEL
Sbjct: 699 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPAETLASRVAASQLATLGCPEL 758
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IAR+ +EYQDIAI+LGTD++
Sbjct: 759 IARSRQEYQDIAIKLGTDKE 778
>gi|321463604|gb|EFX74619.1| hypothetical protein DAPPUDRAFT_324191 [Daphnia pulex]
Length = 1043
Score = 353 bits (905), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 162/200 (81%), Positives = 183/200 (91%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI SGQVQTSVNG+V+QNGLATNQ NT+ ATGEEVPQSIV+T+RQQYGLP+DAI+YCNFN
Sbjct: 789 MIGSGQVQTSVNGVVVQNGLATNQANTRAATGEEVPQSIVVTTRQQYGLPDDAIIYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQMWVN+LK VPN++LWLL+FP VGE N+ ATAQALGL RILFSNVAAK
Sbjct: 849 QLYKIDPPTLQMWVNILKRVPNAVLWLLRFPTVGETNMLATAQALGLGSGRILFSNVAAK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLAD+CLDTPLCNGHTT MDVLW GTP+VTLPGETLASRVAASQL T+GCPEL
Sbjct: 909 EEHVRRGQLADICLDTPLCNGHTTGMDVLWAGTPMVTLPGETLASRVAASQLTTIGCPEL 968
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA + +EY+DIA++LGTD +
Sbjct: 969 IASSPQEYEDIAVKLGTDSE 988
>gi|195430888|ref|XP_002063480.1| GK21381 [Drosophila willistoni]
gi|194159565|gb|EDW74466.1| GK21381 [Drosophila willistoni]
Length = 1059
Score = 348 bits (893), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 163/200 (81%), Positives = 182/200 (91%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DAIVYCNFN
Sbjct: 806 MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAIVYCNFN 865
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQ WV +LK VP S+LWLL+FPAVGE NI+ + LG+ RI+FSNVAAK
Sbjct: 866 QLYKIDPQTLQSWVIILKNVPKSVLWLLRFPAVGEQNIKKSVSDLGISPDRIIFSNVAAK 925
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 926 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 985
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART +EYQDIAIRLGT+R+
Sbjct: 986 IARTREEYQDIAIRLGTERE 1005
>gi|195175533|ref|XP_002028500.1| GL13214 [Drosophila persimilis]
gi|194103718|gb|EDW25761.1| GL13214 [Drosophila persimilis]
Length = 881
Score = 347 bits (891), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 162/200 (81%), Positives = 179/200 (89%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DAIVYCNFN
Sbjct: 628 MIASGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAIVYCNFN 687
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL+ WV +LK VP S+LWLL+FPAVGE NI+ T LG+ R++FSNVAAK
Sbjct: 688 QLYKIDPQTLESWVEILKNVPKSVLWLLRFPAVGEQNIKKTVSDLGISPDRVIFSNVAAK 747
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 748 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLVTLGCPEL 807
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART EYQDIAIRLGT R+
Sbjct: 808 IARTRDEYQDIAIRLGTQRE 827
>gi|198456072|ref|XP_002136393.1| GA29107 [Drosophila pseudoobscura pseudoobscura]
gi|198142772|gb|EDY71489.1| GA29107 [Drosophila pseudoobscura pseudoobscura]
Length = 652
Score = 347 bits (890), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 162/200 (81%), Positives = 179/200 (89%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DAIVYCNFN
Sbjct: 399 MIASGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAIVYCNFN 458
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL+ WV +LK VP S+LWLL+FPAVGE NI+ T LG+ R++FSNVAAK
Sbjct: 459 QLYKIDPQTLESWVEILKNVPKSVLWLLRFPAVGEQNIKKTVSDLGISPDRVIFSNVAAK 518
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 519 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLVTLGCPEL 578
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART EYQDIAIRLGT R+
Sbjct: 579 IARTRDEYQDIAIRLGTQRE 598
>gi|195382543|ref|XP_002049989.1| GJ20442 [Drosophila virilis]
gi|194144786|gb|EDW61182.1| GJ20442 [Drosophila virilis]
Length = 1050
Score = 345 bits (885), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 159/200 (79%), Positives = 181/200 (90%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIASGQVQTS+NG+++QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DAIVYCNFN
Sbjct: 800 MIASGQVQTSLNGVLVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAIVYCNFN 859
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQ WV +LK VP S+LWLL+FPAVGE NI+ + LG+ R++FSNVAAK
Sbjct: 860 QLYKIDPQTLQSWVVILKNVPKSVLWLLRFPAVGEQNIKKSVSDLGVSSERVIFSNVAAK 919
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPV+TLPGETLASRVAASQLATLGCPEL
Sbjct: 920 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVITLPGETLASRVAASQLATLGCPEL 979
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA + +EYQDIAIRLGTD++
Sbjct: 980 IANSREEYQDIAIRLGTDKE 999
>gi|195122698|ref|XP_002005848.1| GI18858 [Drosophila mojavensis]
gi|193910916|gb|EDW09783.1| GI18858 [Drosophila mojavensis]
Length = 1052
Score = 345 bits (885), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 160/200 (80%), Positives = 182/200 (91%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DAIVYCNFN
Sbjct: 802 MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAIVYCNFN 861
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQ WV +LK VP S+LWLL+FPAVGE NI+ + LG+ R++FSNVAAK
Sbjct: 862 QLYKIDPQTLQSWVIILKNVPKSVLWLLRFPAVGEQNIKKSVSDLGISSDRVIFSNVAAK 921
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 922 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 981
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA++ +EYQDIAIRLGT+R+
Sbjct: 982 IAQSREEYQDIAIRLGTERE 1001
>gi|195356295|ref|XP_002044614.1| GM11101 [Drosophila sechellia]
gi|194132318|gb|EDW53892.1| GM11101 [Drosophila sechellia]
Length = 333
Score = 343 bits (881), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 158/200 (79%), Positives = 180/200 (90%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DA+VYCNFN
Sbjct: 83 MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAVVYCNFN 142
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL+ WV +LK VP S+LWLL+FPAVGE NI+ T G+ R++FSNVAAK
Sbjct: 143 QLYKIDPQTLESWVEILKNVPKSVLWLLRFPAVGEQNIKKTVSDFGISPDRVIFSNVAAK 202
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 203 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 262
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART +EYQ+IAIRLGT ++
Sbjct: 263 IARTREEYQNIAIRLGTKKE 282
>gi|17647755|ref|NP_523620.1| super sex combs, isoform B [Drosophila melanogaster]
gi|24585827|ref|NP_724406.1| super sex combs, isoform A [Drosophila melanogaster]
gi|24585829|ref|NP_724407.1| super sex combs, isoform C [Drosophila melanogaster]
gi|6942068|gb|AAF32311.1|AF217788_1 O-glycosyltransferase [Drosophila melanogaster]
gi|7302245|gb|AAF57338.1| super sex combs, isoform B [Drosophila melanogaster]
gi|10728167|gb|AAG22338.1| super sex combs, isoform A [Drosophila melanogaster]
gi|10728168|gb|AAG22339.1| super sex combs, isoform C [Drosophila melanogaster]
gi|189182206|gb|ACD81879.1| SD06381p [Drosophila melanogaster]
Length = 1059
Score = 343 bits (880), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 158/200 (79%), Positives = 180/200 (90%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DA+VYCNFN
Sbjct: 809 MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAVVYCNFN 868
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL+ WV +LK VP S+LWLL+FPAVGE NI+ T G+ R++FSNVAAK
Sbjct: 869 QLYKIDPQTLESWVEILKNVPKSVLWLLRFPAVGEQNIKKTVSDFGISPDRVIFSNVAAK 928
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 929 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 988
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART +EYQ+IAIRLGT ++
Sbjct: 989 IARTREEYQNIAIRLGTKKE 1008
>gi|16769108|gb|AAL28773.1| LD16758p [Drosophila melanogaster]
Length = 333
Score = 343 bits (880), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 158/200 (79%), Positives = 180/200 (90%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DA+VYCNFN
Sbjct: 83 MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAVVYCNFN 142
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL+ WV +LK VP S+LWLL+FPAVGE NI+ T G+ R++FSNVAAK
Sbjct: 143 QLYKIDPQTLESWVEILKNVPKSVLWLLRFPAVGEQNIKKTVSDFGISPDRVIFSNVAAK 202
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 203 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 262
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IART +EYQ+IAIRLGT ++
Sbjct: 263 IARTREEYQNIAIRLGTKKE 282
>gi|5052534|gb|AAD38597.1|AF145622_1 BcDNA.GH04245 [Drosophila melanogaster]
Length = 1011
Score = 342 bits (877), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 158/197 (80%), Positives = 178/197 (90%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DA+VYCNFN
Sbjct: 809 MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAVVYCNFN 868
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL+ WV +LK VP S+LWLL+FPAVGE NI+ T G+ R++FSNVAAK
Sbjct: 869 QLYKIDPQTLESWVEILKNVPKSVLWLLRFPAVGEQNIKKTVSDFGISPDRVIFSNVAAK 928
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 929 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 988
Query: 181 IARTHKEYQDIAIRLGT 197
IART +EYQ+IAIRLGT
Sbjct: 989 IARTREEYQNIAIRLGT 1005
>gi|195093725|ref|XP_001997748.1| GH22221 [Drosophila grimshawi]
gi|193905681|gb|EDW04548.1| GH22221 [Drosophila grimshawi]
Length = 666
Score = 342 bits (877), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 159/200 (79%), Positives = 180/200 (90%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DAIVYCNFN
Sbjct: 435 MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAIVYCNFN 494
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQ WV +LK VP S+LWLL+FPAVGE NI+ + LG+ RI+FSNVAAK
Sbjct: 495 QLYKIDPQTLQSWVIILKNVPKSVLWLLRFPAVGEQNIKKSVSDLGISADRIIFSNVAAK 554
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 555 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 614
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA + + YQ+IAIRLGT+R+
Sbjct: 615 IAHSRESYQEIAIRLGTERE 634
>gi|195028406|ref|XP_001987067.1| GH21711 [Drosophila grimshawi]
gi|193903067|gb|EDW01934.1| GH21711 [Drosophila grimshawi]
Length = 1053
Score = 341 bits (874), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 159/200 (79%), Positives = 180/200 (90%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DAIVYCNFN
Sbjct: 803 MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAIVYCNFN 862
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQ WV +LK VP S+LWLL+FPAVGE NI+ + LG+ RI+FSNVAAK
Sbjct: 863 QLYKIDPQTLQSWVIILKNVPKSVLWLLRFPAVGEQNIKKSVSDLGISADRIIFSNVAAK 922
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 923 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 982
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA + + YQ+IAIRLGT+R+
Sbjct: 983 IAHSRESYQEIAIRLGTERE 1002
>gi|427794599|gb|JAA62751.1| Putative o-linked n-acetylglucosamine transferase ogt, partial
[Rhipicephalus pulchellus]
Length = 1026
Score = 335 bits (860), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 155/201 (77%), Positives = 180/201 (89%), Gaps = 1/201 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI+SGQVQ VNG+++ NGLA+ Q NT+ ATGEE P++I++T RQQYGLPEDA+VYCNFN
Sbjct: 776 MISSGQVQAQVNGVLVHNGLASTQVNTRAATGEEAPRAILVTCRQQYGLPEDAVVYCNFN 835
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQH-RILFSNVAA 119
QLYK+DP+TL+MWVN+L+ VPNS+LWLL+FPAVGEA++QA AQA GL Q RI+FSNVAA
Sbjct: 836 QLYKLDPATLRMWVNILQRVPNSVLWLLRFPAVGEAHVQAWAQAEGLTQPGRIIFSNVAA 895
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTPVVT PGETLASRVAASQL LG PE
Sbjct: 896 KEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPVVTWPGETLASRVAASQLHCLGVPE 955
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
L+AR+ +EYQDIA+RLGTDRD
Sbjct: 956 LVARSRREYQDIAVRLGTDRD 976
>gi|387016114|gb|AFJ50176.1| O-linked GlcNAc transferase [Crotalus adamanteus]
Length = 933
Score = 330 bits (847), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 148/198 (74%), Positives = 173/198 (87%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPED++VYCNFN
Sbjct: 676 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDSVVYCNFN 735
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 736 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 795
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGCPEL
Sbjct: 796 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCPEL 855
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 856 IAKSRQEYEDIAVKLGTD 873
>gi|363732735|ref|XP_003641145.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Gallus gallus]
Length = 1045
Score = 328 bits (842), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 788 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 847
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ LGL Q+RI+FS VA K
Sbjct: 848 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNLGLSQNRIIFSPVAPK 907
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 908 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 967
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 968 IAKSRQEYEDIAVKLGTD 985
>gi|118089398|ref|XP_001232519.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Gallus gallus]
Length = 1035
Score = 328 bits (842), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 778 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 837
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ LGL Q+RI+FS VA K
Sbjct: 838 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNLGLSQNRIIFSPVAPK 897
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 898 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 957
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 958 IAKSRQEYEDIAVKLGTD 975
>gi|326918804|ref|XP_003205676.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like,
partial [Meleagris gallopavo]
Length = 1038
Score = 328 bits (841), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 781 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 840
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ LGL Q+RI+FS VA K
Sbjct: 841 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNLGLSQNRIIFSPVAPK 900
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 901 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 960
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 961 IAKSRQEYEDIAVKLGTD 978
>gi|149042178|gb|EDL95885.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase), isoform CRA_a [Rattus
norvegicus]
gi|149042179|gb|EDL95886.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase), isoform CRA_a [Rattus
norvegicus]
Length = 1046
Score = 328 bits (841), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 969 IAKSRQEYEDIAVKLGTD 986
>gi|354502726|ref|XP_003513433.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit
[Cricetulus griseus]
Length = 1046
Score = 328 bits (841), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 969 IAKSRQEYEDIAVKLGTD 986
>gi|449267896|gb|EMC78787.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit, partial [Columba livia]
Length = 1036
Score = 328 bits (841), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 779 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 838
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ LGL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNLGLSQNRIIFSPVAPK 898
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 959 IAKSRQEYEDIAVKLGTD 976
>gi|431914414|gb|ELK15671.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit [Pteropus alecto]
Length = 1046
Score = 328 bits (840), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 969 IAKSRQEYEDIAVKLGTD 986
>gi|46909607|ref|NP_631883.2| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit [Mus musculus]
gi|146325019|sp|Q8CGY8.2|OGT1_MOUSE RecName: Full=UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit;
AltName: Full=O-GlcNAc transferase subunit p110;
AltName: Full=O-linked N-acetylglucosamine transferase
110 kDa subunit; Short=OGT
gi|34785719|gb|AAH57319.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase) [Mus musculus]
gi|148682191|gb|EDL14138.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase), isoform CRA_b [Mus
musculus]
Length = 1046
Score = 328 bits (840), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 969 IAKSRQEYEDIAVKLGTD 986
>gi|139948535|ref|NP_058803.2| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit [Rattus norvegicus]
gi|149042180|gb|EDL95887.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase), isoform CRA_b [Rattus
norvegicus]
gi|149042181|gb|EDL95888.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase), isoform CRA_b [Rattus
norvegicus]
Length = 1036
Score = 328 bits (840), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 959 IAKSRQEYEDIAVKLGTD 976
>gi|13775066|gb|AAK39123.1|AF363030_1 UDP-N-acetylglucosaminyltransferase [Mus musculus]
Length = 1046
Score = 328 bits (840), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 969 IAKSRQEYEDIAVKLGTD 986
>gi|27499606|gb|AAO17363.1| O-linked GlcNAc transferase [Mus musculus]
Length = 1046
Score = 328 bits (840), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 969 IAKSRQEYEDIAVKLGTD 986
>gi|395858889|ref|XP_003801787.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Otolemur garnettii]
Length = 1046
Score = 328 bits (840), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 969 IAKSRQEYEDIAVKLGTD 986
>gi|148682190|gb|EDL14137.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase), isoform CRA_a [Mus
musculus]
Length = 1036
Score = 328 bits (840), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 959 IAKSRQEYEDIAVKLGTD 976
>gi|395858891|ref|XP_003801788.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Otolemur garnettii]
Length = 1036
Score = 328 bits (840), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 959 IAKSRQEYEDIAVKLGTD 976
>gi|417412996|gb|JAA52852.1| Putative o-linked n-acetylglucosamine transferase ogt, partial
[Desmodus rotundus]
Length = 874
Score = 328 bits (840), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 617 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 676
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 677 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 736
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 737 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 796
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 797 IAKSRQEYEDIAVKLGTD 814
>gi|444707375|gb|ELW48655.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit [Tupaia chinensis]
Length = 1007
Score = 328 bits (840), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 750 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 809
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 810 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 869
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 870 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 929
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 930 IAKNRQEYEDIAVKLGTD 947
>gi|440901878|gb|ELR52744.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit, partial [Bos grunniens mutus]
Length = 1037
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 780 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 839
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 840 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 899
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 900 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 959
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 960 IAKNRQEYEDIAVKLGTD 977
>gi|297304121|ref|XP_002806323.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit [Macaca
mulatta]
Length = 963
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 706 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 765
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 766 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 825
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 826 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 885
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 886 IAKNRQEYEDIAVKLGTD 903
>gi|301791309|ref|XP_002930623.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 2 [Ailuropoda melanoleuca]
gi|410988810|ref|XP_004000670.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Felis catus]
gi|355708635|gb|AES03331.1| O-linked N-acetylglucosamine transferase [Mustela putorius furo]
Length = 1046
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986
>gi|31873825|emb|CAD97853.1| hypothetical protein [Homo sapiens]
Length = 1046
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986
>gi|432118087|gb|ELK37988.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit [Myotis davidii]
Length = 1046
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986
>gi|151554489|gb|AAI49783.1| OGT protein [Bos taurus]
Length = 908
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 651 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 710
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 711 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 770
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 771 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 830
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 831 IAKNRQEYEDIAVKLGTD 848
>gi|344282022|ref|XP_003412774.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Loxodonta africana]
Length = 1046
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986
>gi|403305181|ref|XP_003943148.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Saimiri boliviensis boliviensis]
Length = 1046
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986
>gi|281337757|gb|EFB13341.1| hypothetical protein PANDA_021118 [Ailuropoda melanoleuca]
Length = 1037
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 780 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 839
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 840 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 899
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 900 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 959
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 960 IAKNRQEYEDIAVKLGTD 977
>gi|32307148|ref|NP_858058.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit isoform 1 [Homo sapiens]
gi|397498834|ref|XP_003820179.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Pan paniscus]
gi|426396356|ref|XP_004064413.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Gorilla gorilla gorilla]
gi|68067509|sp|O15294.3|OGT1_HUMAN RecName: Full=UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit;
AltName: Full=O-GlcNAc transferase subunit p110;
AltName: Full=O-linked N-acetylglucosamine transferase
110 kDa subunit; Short=OGT
gi|18250915|emb|CAC86128.1| UDP-N-acetylglucosamine: polypeptide-N-acetylglucosaminyl
transferase [Homo sapiens]
gi|23315618|gb|AAH38180.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase) [Homo sapiens]
gi|30268372|emb|CAD89970.1| hypothetical protein [Homo sapiens]
gi|119625691|gb|EAX05286.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase), isoform CRA_b [Homo
sapiens]
gi|410227760|gb|JAA11099.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase) [Pan troglodytes]
gi|410227764|gb|JAA11101.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase) [Pan troglodytes]
gi|410257116|gb|JAA16525.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase) [Pan troglodytes]
gi|410342219|gb|JAA40056.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase) [Pan troglodytes]
Length = 1046
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986
>gi|301791307|ref|XP_002930622.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 1 [Ailuropoda melanoleuca]
gi|410988812|ref|XP_004000671.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Felis catus]
Length = 1036
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 959 IAKNRQEYEDIAVKLGTD 976
>gi|149758509|ref|XP_001493422.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Equus caballus]
gi|291407677|ref|XP_002720149.1| PREDICTED: O-linked GlcNAc transferase isoform 2 [Oryctolagus
cuniculus]
gi|296235743|ref|XP_002763023.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Callithrix jacchus]
gi|348570518|ref|XP_003471044.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 2 [Cavia porcellus]
gi|402910496|ref|XP_003917911.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Papio anubis]
gi|426257214|ref|XP_004022227.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Ovis aries]
gi|378405191|sp|P81436.2|OGT1_RABIT RecName: Full=UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit;
AltName: Full=O-GlcNAc transferase subunit p110;
AltName: Full=O-linked N-acetylglucosamine transferase
110 kDa subunit; Short=OGT
gi|355757457|gb|EHH60982.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit [Macaca fascicularis]
gi|380817296|gb|AFE80522.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit isoform 1 [Macaca mulatta]
gi|383422239|gb|AFH34333.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit isoform 1 [Macaca mulatta]
gi|384949880|gb|AFI38545.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit isoform 1 [Macaca mulatta]
Length = 1046
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986
>gi|344282024|ref|XP_003412775.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Loxodonta africana]
Length = 1036
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 959 IAKNRQEYEDIAVKLGTD 976
>gi|149758511|ref|XP_001493438.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Equus caballus]
gi|291407675|ref|XP_002720148.1| PREDICTED: O-linked GlcNAc transferase isoform 1 [Oryctolagus
cuniculus]
gi|296235745|ref|XP_002763024.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Callithrix jacchus]
gi|348570516|ref|XP_003471043.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 1 [Cavia porcellus]
gi|402910498|ref|XP_003917912.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Papio anubis]
gi|426257212|ref|XP_004022226.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Ovis aries]
gi|380817298|gb|AFE80523.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit isoform 2 [Macaca mulatta]
gi|383422241|gb|AFH34334.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit isoform 2 [Macaca mulatta]
Length = 1036
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 959 IAKNRQEYEDIAVKLGTD 976
>gi|403305183|ref|XP_003943149.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Saimiri boliviensis boliviensis]
Length = 1036
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 959 IAKNRQEYEDIAVKLGTD 976
>gi|32307150|ref|NP_858059.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit isoform 2 [Homo sapiens]
gi|397498836|ref|XP_003820180.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Pan paniscus]
gi|426396358|ref|XP_004064414.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Gorilla gorilla gorilla]
gi|15680175|gb|AAH14434.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase) [Homo sapiens]
gi|18250914|emb|CAC86127.1| UDP-N-acatylglucosamine: polypeptide-N-acetylglucosaminyl
transferase [Homo sapiens]
gi|119625692|gb|EAX05287.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase), isoform CRA_c [Homo
sapiens]
gi|123993895|gb|ABM84549.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase) [synthetic construct]
gi|124000677|gb|ABM87847.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase) [synthetic construct]
gi|224487755|dbj|BAH24112.1| O-linked N-acetylglucosamine (GlcNAc) transferase [synthetic
construct]
gi|410227762|gb|JAA11100.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase) [Pan troglodytes]
gi|410257114|gb|JAA16524.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase) [Pan troglodytes]
Length = 1036
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 959 IAKNRQEYEDIAVKLGTD 976
>gi|332265872|ref|XP_003281937.1| PREDICTED: LOW QUALITY PROTEIN: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit
[Nomascus leucogenys]
Length = 1050
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 793 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 852
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 853 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 912
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 913 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 972
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 973 IAKNRQEYEDIAVKLGTD 990
>gi|126342739|ref|XP_001362317.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Monodelphis domestica]
Length = 1035
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 778 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 837
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 838 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 897
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 898 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 957
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 958 IAKNRQEYEDIAVKLGTD 975
>gi|147899141|ref|NP_001091539.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit [Bos taurus]
gi|146186901|gb|AAI40543.1| OGT protein [Bos taurus]
gi|296470806|tpg|DAA12921.1| TPA: O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase) [Bos taurus]
Length = 1036
Score = 327 bits (839), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 959 IAKNRQEYEDIAVKLGTD 976
>gi|410056634|ref|XP_521123.4| PREDICTED: LOW QUALITY PROTEIN: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit [Pan
troglodytes]
Length = 1022
Score = 327 bits (838), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 765 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 824
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 825 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 884
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 885 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 944
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 945 IAKNRQEYEDIAVKLGTD 962
>gi|126342737|ref|XP_001362233.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Monodelphis domestica]
Length = 1045
Score = 327 bits (838), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 788 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 847
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 848 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 907
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 908 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 967
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 968 IAKNRQEYEDIAVKLGTD 985
>gi|449499306|ref|XP_002189786.2| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit, partial
[Taeniopygia guttata]
Length = 1038
Score = 327 bits (838), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 172/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 781 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 840
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ LGL Q+RI+FS VA K
Sbjct: 841 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNLGLAQNRIIFSPVAPK 900
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 901 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 960
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 961 IAKSRQEYEDIAVKLGTD 978
>gi|2266994|gb|AAB63466.1| O-linked GlcNAc transferase [Homo sapiens]
gi|6911265|gb|AAF31458.1| HRNT1 [Homo sapiens]
gi|18250916|emb|CAC86129.1| UDP-N-acetylglucosamine: polypeptide-N-acetylglucosaminyl
transferase [Homo sapiens]
gi|119625690|gb|EAX05285.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase), isoform CRA_a [Homo
sapiens]
Length = 920
Score = 327 bits (838), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 663 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 722
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 723 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 782
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 783 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 842
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 843 IAKNRQEYEDIAVKLGTD 860
>gi|319443727|pdb|3PE3|A Chain A, Structure Of Human O-Glcnac Transferase And Its Complex
With A Peptide Substrate
gi|319443728|pdb|3PE3|B Chain B, Structure Of Human O-Glcnac Transferase And Its Complex
With A Peptide Substrate
gi|319443729|pdb|3PE3|C Chain C, Structure Of Human O-Glcnac Transferase And Its Complex
With A Peptide Substrate
gi|319443730|pdb|3PE3|D Chain D, Structure Of Human O-Glcnac Transferase And Its Complex
With A Peptide Substrate
gi|319443731|pdb|3PE4|A Chain A, Structure Of Human O-Glcnac Transferase And Its Complex
With A Peptide Substrate
gi|319443733|pdb|3PE4|C Chain C, Structure Of Human O-Glcnac Transferase And Its Complex
With A Peptide Substrate
gi|357380651|pdb|3TAX|A Chain A, A Neutral Diphosphate Mimic Crosslinks The Active Site Of
Human O- Glcnac Transferase
gi|357380653|pdb|3TAX|C Chain C, A Neutral Diphosphate Mimic Crosslinks The Active Site Of
Human O- Glcnac Transferase
gi|409973752|pdb|4AY5|A Chain A, Human O-Glcnac Transferase (Ogt) In Complex With Udp And
Glycopeptide
gi|409973753|pdb|4AY5|B Chain B, Human O-Glcnac Transferase (Ogt) In Complex With Udp And
Glycopeptide
gi|409973754|pdb|4AY5|C Chain C, Human O-Glcnac Transferase (Ogt) In Complex With Udp And
Glycopeptide
gi|409973755|pdb|4AY5|D Chain D, Human O-Glcnac Transferase (Ogt) In Complex With Udp And
Glycopeptide
gi|409973760|pdb|4AY6|A Chain A, Human O-Glcnac Transferase (Ogt) In Complex With
Udp-5sglcnac And Substrate Peptide
gi|409973761|pdb|4AY6|B Chain B, Human O-Glcnac Transferase (Ogt) In Complex With
Udp-5sglcnac And Substrate Peptide
gi|409973762|pdb|4AY6|C Chain C, Human O-Glcnac Transferase (Ogt) In Complex With
Udp-5sglcnac And Substrate Peptide
gi|409973763|pdb|4AY6|D Chain D, Human O-Glcnac Transferase (Ogt) In Complex With
Udp-5sglcnac And Substrate Peptide
gi|410563064|pdb|4GYW|A Chain A, Crystal Structure Of Human O-Glcnac Transferase In Complex
With Udp And A Glycopeptide
gi|410563066|pdb|4GYW|C Chain C, Crystal Structure Of Human O-Glcnac Transferase In Complex
With Udp And A Glycopeptide
gi|410563068|pdb|4GYY|A Chain A, Crystal Structure Of Human O-Glcnac Transferase With
Udp-5sglcnac And A Peptide Substrate
gi|410563070|pdb|4GYY|C Chain C, Crystal Structure Of Human O-Glcnac Transferase With
Udp-5sglcnac And A Peptide Substrate
gi|410563101|pdb|4GZ3|A Chain A, Crystal Structure Of Human O-Glcnac Transferase With Udp
And A Thioglycopeptide
gi|410563103|pdb|4GZ3|C Chain C, Crystal Structure Of Human O-Glcnac Transferase With Udp
And A Thioglycopeptide
gi|410563105|pdb|4GZ5|A Chain A, Crystal Structure Of Human O-Glcnac Transferase With
Udp-Glcnac
gi|410563106|pdb|4GZ5|B Chain B, Crystal Structure Of Human O-Glcnac Transferase With
Udp-Glcnac
gi|410563107|pdb|4GZ5|C Chain C, Crystal Structure Of Human O-Glcnac Transferase With
Udp-Glcnac
gi|410563108|pdb|4GZ5|D Chain D, Crystal Structure Of Human O-Glcnac Transferase With
Udp-Glcnac
gi|410563109|pdb|4GZ6|A Chain A, Crystal Structure Of Human O-Glcnac Transferase With
Udp-5sglcnac
gi|410563110|pdb|4GZ6|B Chain B, Crystal Structure Of Human O-Glcnac Transferase With
Udp-5sglcnac
gi|410563111|pdb|4GZ6|C Chain C, Crystal Structure Of Human O-Glcnac Transferase With
Udp-5sglcnac
gi|410563112|pdb|4GZ6|D Chain D, Crystal Structure Of Human O-Glcnac Transferase With
Udp-5sglcnac
Length = 723
Score = 327 bits (838), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 471 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 530
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 531 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 590
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 591 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 650
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 651 IAKNRQEYEDIAVKLGTD 668
>gi|395546237|ref|XP_003774996.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit
[Sarcophilus harrisii]
Length = 1075
Score = 327 bits (838), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 818 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 877
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 878 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 937
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 938 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 997
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 998 IAKNRQEYEDIAVKLGTD 1015
>gi|6562168|emb|CAB62528.1| hypothetical protein [Homo sapiens]
Length = 665
Score = 327 bits (838), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 408 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 467
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 468 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 527
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 528 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 587
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 588 IAKNRQEYEDIAVKLGTD 605
>gi|328908909|gb|AEB61122.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kda subunit-like protein, partial [Equus caballus]
Length = 313
Score = 327 bits (837), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 56 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 115
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 116 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 175
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 176 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 235
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 236 IAKNRQEYEDIAVKLGTD 253
>gi|351710822|gb|EHB13741.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110
kDa subunit [Heterocephalus glaber]
Length = 1270
Score = 326 bits (836), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 148/198 (74%), Positives = 170/198 (85%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP +I++T+R QYGLP+DAIVYCNFN
Sbjct: 1013 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPHTIIVTTRSQYGLPDDAIVYCNFN 1072
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 1073 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 1132
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 1133 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 1192
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 1193 IAKNRQEYEDIAVKLGTD 1210
Score = 286 bits (732), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 131/171 (76%), Positives = 148/171 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 740 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 799
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 800 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 859
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
EEHVRRGQLADVCLDT LCNGHTT MDVLW GTP+VT+PGETLASRVAASQ
Sbjct: 860 EEHVRRGQLADVCLDTLLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQ 910
>gi|74007686|ref|XP_538075.2| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Canis lupus familiaris]
Length = 1036
Score = 326 bits (835), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 148/198 (74%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ ++Y+DIA++LGTD
Sbjct: 959 IAKNRQDYEDIAVKLGTD 976
>gi|66347879|ref|NP_001018117.1| O-linked N-acetylglucosamine transferase isoform 4 [Danio rerio]
gi|56967378|gb|AAW31873.1| O-GlcNAc transferase variant 4 [Danio rerio]
Length = 1036
Score = 326 bits (835), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 147/200 (73%), Positives = 171/200 (85%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++IV+T+R QYGLPED+IVYCNFN
Sbjct: 777 MINQGQIQVTINGFTVSNGLATTQINNKAATGEEVPRTIVVTTRSQYGLPEDSIVYCNFN 836
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQMW N+LK VPNS++WLL+FPAVGE NIQ AQ LGL RI+FS VA K
Sbjct: 837 QLYKIDPPTLQMWANILKRVPNSVIWLLRFPAVGEPNIQQYAQNLGLPASRIIFSPVAPK 896
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGCPEL
Sbjct: 897 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCPEL 956
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA++ +EY+D+A++LGTD +
Sbjct: 957 IAQSRQEYEDVAVKLGTDME 976
>gi|89886173|ref|NP_001034837.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit [Sus scrofa]
gi|122142735|sp|Q27HV0.1|OGT1_PIG RecName: Full=UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit;
AltName: Full=O-GlcNAc transferase subunit p110;
AltName: Full=O-linked N-acetylglucosamine transferase
110 kDa subunit; Short=OGT
gi|89114276|gb|ABD61726.1| O-linked N-acetylglucosamine transferase [Sus scrofa]
Length = 1046
Score = 326 bits (835), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 148/198 (74%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +E++DIA++LGTD
Sbjct: 969 IAKNRQEFEDIAVKLGTD 986
>gi|74007670|ref|XP_849392.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Canis lupus familiaris]
Length = 1046
Score = 326 bits (835), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 148/198 (74%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ ++Y+DIA++LGTD
Sbjct: 969 IAKNRQDYEDIAVKLGTD 986
>gi|66347871|ref|NP_001018116.1| O-linked N-acetylglucosamine transferase isoform 3 [Danio rerio]
gi|56967376|gb|AAW31872.1| O-GlcNAc transferase variant 3 [Danio rerio]
Length = 1046
Score = 326 bits (835), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 147/200 (73%), Positives = 171/200 (85%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++IV+T+R QYGLPED+IVYCNFN
Sbjct: 787 MINQGQIQVTINGFTVSNGLATTQINNKAATGEEVPRTIVVTTRSQYGLPEDSIVYCNFN 846
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQMW N+LK VPNS++WLL+FPAVGE NIQ AQ LGL RI+FS VA K
Sbjct: 847 QLYKIDPPTLQMWANILKRVPNSVIWLLRFPAVGEPNIQQYAQNLGLPASRIIFSPVAPK 906
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGCPEL
Sbjct: 907 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCPEL 966
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA++ +EY+D+A++LGTD +
Sbjct: 967 IAQSRQEYEDVAVKLGTDME 986
>gi|190340092|gb|AAI63923.1| O-linked N-acetylglucosamine (GlcNAc) transferase
(UDP-N-acetylglucosamine:polypeptide-N-
acetylglucosaminyl transferase) [Danio rerio]
Length = 1046
Score = 325 bits (834), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 147/200 (73%), Positives = 171/200 (85%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++IV+T+R QYGLPED+IVYCNFN
Sbjct: 787 MINQGQIQVTINGFTVSNGLATTQINNKAATGEEVPRTIVVTTRSQYGLPEDSIVYCNFN 846
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQMW N+LK VPNS++WLL+FPAVGE NIQ AQ LGL RI+FS VA K
Sbjct: 847 QLYKIDPPTLQMWANILKRVPNSVIWLLRFPAVGEPNIQQYAQNLGLPASRIIFSPVAPK 906
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGCPEL
Sbjct: 907 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCPEL 966
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA++ +EY+D+A++LGTD +
Sbjct: 967 IAQSRQEYEDVAVKLGTDME 986
>gi|327287038|ref|XP_003228236.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 2 [Anolis carolinensis]
Length = 1036
Score = 325 bits (834), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 147/198 (74%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 778 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 837
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP+TLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q RI+FS VA K
Sbjct: 838 QLYKIDPATLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQSRIIFSPVAPK 897
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 898 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 957
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 958 IAKSRQEYEDIAVKLGTD 975
>gi|327287040|ref|XP_003228237.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 3 [Anolis carolinensis]
Length = 1046
Score = 325 bits (834), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 147/198 (74%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 788 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 847
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP+TLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q RI+FS VA K
Sbjct: 848 QLYKIDPATLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQSRIIFSPVAPK 907
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 908 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 967
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 968 IAKSRQEYEDIAVKLGTD 985
>gi|66793439|ref|NP_001019747.1| O-linked N-acetylglucosamine (GlcNAc) transferase [Xenopus
(Silurana) tropicalis]
gi|60618530|gb|AAH90599.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
[Xenopus (Silurana) tropicalis]
Length = 1035
Score = 325 bits (832), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 146/198 (73%), Positives = 170/198 (85%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 778 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 837
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q RI+FS VA K
Sbjct: 838 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQSRIIFSPVAPK 897
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PG+TLASRVAASQL LGCPEL
Sbjct: 898 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGDTLASRVAASQLTCLGCPEL 957
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ ++Y+D A++LGTD
Sbjct: 958 IAKGRQDYEDTAVKLGTD 975
>gi|377657074|gb|AFB74087.1| O-linked N-acetylglucosamine transferase [Bubalus bubalis]
Length = 1046
Score = 324 bits (831), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 148/198 (74%), Positives = 170/198 (85%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCL TPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLGTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986
>gi|224587609|gb|ACN58689.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit [Salmo salar]
Length = 401
Score = 323 bits (827), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 147/200 (73%), Positives = 168/200 (84%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++IV+T+R QYGLPED+IVYCNFN
Sbjct: 142 MINQGQIQVTINGFTVSNGLATTQLNNKAATGEEVPRTIVVTTRSQYGLPEDSIVYCNFN 201
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ GL RI+FS VA K
Sbjct: 202 QLYKIDPPTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNFGLPGSRIIFSPVAPK 261
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGCPEL
Sbjct: 262 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLQCLGCPEL 321
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA T +EY+D+A++LG D +
Sbjct: 322 IAHTRQEYEDVAVKLGCDME 341
>gi|3914191|sp|P56558.1|OGT1_RAT RecName: Full=UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit;
AltName: Full=O-GlcNAc transferase subunit p110;
AltName: Full=O-linked N-acetylglucosamine transferase
110 kDa subunit; Short=OGT
gi|1931579|gb|AAC53121.1| O-GlcNAc transferase, p110 subunit [Rattus norvegicus]
Length = 1036
Score = 323 bits (827), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 148/198 (74%), Positives = 171/198 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQM N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMGANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958
Query: 181 IARTHKEYQDIAIRLGTD 198
IA++ +EY+DIA++LGTD
Sbjct: 959 IAKSRQEYEDIAVKLGTD 976
>gi|348516824|ref|XP_003445937.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Oreochromis niloticus]
Length = 1038
Score = 320 bits (821), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 144/200 (72%), Positives = 172/200 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q + K ATGEEVP++IV+T+R QYGLPED+IVYCNFN
Sbjct: 779 MINQGQIQVTINGFTVSNGLATTQISNKAATGEEVPRTIVVTTRSQYGLPEDSIVYCNFN 838
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQMWVN+LK VPNS+LWLL+FPAVGE NIQ AQ +GL RI+FS VA K
Sbjct: 839 QLYKIDPPTLQMWVNILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPGSRIIFSPVAPK 898
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGCP+L
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLNCLGCPDL 958
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA++ ++Y+D+A++LG+D +
Sbjct: 959 IAQSRQDYEDVAVKLGSDME 978
>gi|348516822|ref|XP_003445936.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
1 [Oreochromis niloticus]
Length = 1048
Score = 320 bits (821), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 144/200 (72%), Positives = 172/200 (86%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q + K ATGEEVP++IV+T+R QYGLPED+IVYCNFN
Sbjct: 789 MINQGQIQVTINGFTVSNGLATTQISNKAATGEEVPRTIVVTTRSQYGLPEDSIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQMWVN+LK VPNS+LWLL+FPAVGE NIQ AQ +GL RI+FS VA K
Sbjct: 849 QLYKIDPPTLQMWVNILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPGSRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGCP+L
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLNCLGCPDL 968
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA++ ++Y+D+A++LG+D +
Sbjct: 969 IAQSRQDYEDVAVKLGSDME 988
>gi|292620687|ref|XP_001921543.2| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform 1
[Danio rerio]
Length = 1102
Score = 320 bits (819), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 146/200 (73%), Positives = 169/200 (84%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEV ++IV+T+R QYGLP+D+IVYCNFN
Sbjct: 843 MINQGQIQVTINGFTVSNGLATTQINNKAATGEEVLKTIVVTTRSQYGLPDDSIVYCNFN 902
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQMW N+LK VPNS++WLL+FPAVGE NIQ AQ LGL RI+FS VA K
Sbjct: 903 QLYKIDPPTLQMWANILKRVPNSVIWLLRFPAVGEPNIQQYAQNLGLPASRIIFSPVAPK 962
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLAS VAASQL LGCPEL
Sbjct: 963 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASCVAASQLTCLGCPEL 1022
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IAR+ +EY+DIA++LGTD +
Sbjct: 1023 IARSRQEYEDIAVKLGTDME 1042
>gi|351700623|gb|EHB03542.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit [Heterocephalus glaber]
Length = 1046
Score = 319 bits (818), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 146/198 (73%), Positives = 168/198 (84%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP +I++T+R QYGLP+DAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPHTIIVTTRSQYGLPDDAIVYCNFN 848
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDT LCNGHTT MDVLW GTP+VT+PGETLASRVA SQL LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTLLCNGHTTGMDVLWAGTPMVTMPGETLASRVATSQLTCLGCLEL 968
Query: 181 IARTHKEYQDIAIRLGTD 198
IA+ +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986
>gi|189527635|ref|XP_001921570.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
2 [Danio rerio]
Length = 1045
Score = 319 bits (818), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 146/200 (73%), Positives = 169/200 (84%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEV ++IV+T+R QYGLP+D+IVYCNFN
Sbjct: 786 MINQGQIQVTINGFTVSNGLATTQINNKAATGEEVLKTIVVTTRSQYGLPDDSIVYCNFN 845
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQMW N+LK VPNS++WLL+FPAVGE NIQ AQ LGL RI+FS VA K
Sbjct: 846 QLYKIDPPTLQMWANILKRVPNSVIWLLRFPAVGEPNIQQYAQNLGLPASRIIFSPVAPK 905
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLAS VAASQL LGCPEL
Sbjct: 906 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASCVAASQLTCLGCPEL 965
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IAR+ +EY+DIA++LGTD +
Sbjct: 966 IARSRQEYEDIAVKLGTDME 985
>gi|47222947|emb|CAF99103.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1037
Score = 319 bits (817), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 144/200 (72%), Positives = 169/200 (84%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q K ATGEEV +++V+T+R QYGLPEDAIVYCNFN
Sbjct: 780 MINQGQIQVTINGFTVSNGLATTQITNKAATGEEVLRTVVVTTRSQYGLPEDAIVYCNFN 839
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQMW N+L VPNS+LWLL+FPAVGE NIQ AQ +GL RI+FS VA K
Sbjct: 840 QLYKIDPPTLQMWANILTRVPNSVLWLLRFPAVGEPNIQQYAQKVGLPASRIIFSPVAPK 899
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVA SQL+ LGCPEL
Sbjct: 900 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVATSQLSCLGCPEL 959
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA++H+EY+DIA++LG+D +
Sbjct: 960 IAQSHEEYEDIAVKLGSDME 979
>gi|148230096|ref|NP_001087833.1| O-linked N-acetylglucosamine (GlcNAc) transferase [Xenopus laevis]
gi|51949982|gb|AAH82353.1| Ogt-prov protein [Xenopus laevis]
Length = 1063
Score = 317 bits (812), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 147/216 (68%), Positives = 172/216 (79%), Gaps = 18/216 (8%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 788 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 847
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q RI+FS VA K
Sbjct: 848 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQSRIIFSPVAPK 907
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP------------------GET 162
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+P G+T
Sbjct: 908 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGKESAESLGLHNKDCFLSGDT 967
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
LASRVAASQL LGCPELIA++ ++Y+DIA++LGTD
Sbjct: 968 LASRVAASQLTCLGCPELIAKSRQDYEDIAVKLGTD 1003
>gi|260833062|ref|XP_002611476.1| hypothetical protein BRAFLDRAFT_117202 [Branchiostoma floridae]
gi|229296847|gb|EEN67486.1| hypothetical protein BRAFLDRAFT_117202 [Branchiostoma floridae]
Length = 1022
Score = 317 bits (812), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 142/200 (71%), Positives = 169/200 (84%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
M+ GQV T +NG+V+QNGLAT+Q N K ATGEE P ++++T+R QYGLPED++VYCNFN
Sbjct: 771 MVTQGQVSTCMNGVVVQNGLATSQVNNKAATGEEPPANVIVTTRVQYGLPEDSVVYCNFN 830
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP+TLQMWVN+L VPNS+LWLL+FPAVGE N+ A +GL RI+FS+VA K
Sbjct: 831 QLYKIDPATLQMWVNILNRVPNSVLWLLRFPAVGEQNVLNAATQMGLSPGRIIFSHVAPK 890
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLD+PLCNGHTT MDVLW GTP++TLPGETLASRVAASQL LGCPEL
Sbjct: 891 EEHVRRGQLADVCLDSPLCNGHTTGMDVLWAGTPMITLPGETLASRVAASQLGCLGCPEL 950
Query: 181 IARTHKEYQDIAIRLGTDRD 200
+A T EY+DIA++LG D D
Sbjct: 951 VANTRAEYEDIAVQLGNDAD 970
>gi|432879797|ref|XP_004073552.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
[Oryzias latipes]
Length = 1016
Score = 316 bits (810), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 144/200 (72%), Positives = 168/200 (84%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++N + NGLAT Q N K ATGEEVP++IV+T+R QYGLPED+IVYCNFN
Sbjct: 757 MINQGQIQVTINNFTVSNGLATTQINNKAATGEEVPRTIVVTTRSQYGLPEDSIVYCNFN 816
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQMW N+LK V NS+LWLL+FPAVGE NIQ AQ LGL RI+FS VA K
Sbjct: 817 QLYKIDPPTLQMWANILKRVSNSVLWLLRFPAVGEPNIQQYAQNLGLPASRIIFSPVAPK 876
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGCPEL
Sbjct: 877 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLQCLGCPEL 936
Query: 181 IARTHKEYQDIAIRLGTDRD 200
IA+ ++Y++IA++LG+D +
Sbjct: 937 IAQNRQDYEEIAVKLGSDME 956
>gi|66347873|ref|NP_001018115.1| O-linked N-acetylglucosamine transferase isoform 2 [Danio rerio]
gi|56967374|gb|AAW31871.1| O-GlcNAc transferase variant 2 [Danio rerio]
Length = 1052
Score = 315 bits (808), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 147/216 (68%), Positives = 171/216 (79%), Gaps = 16/216 (7%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQT----------------NTKTATGEEVPQSIVITSR 44
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++IV+T+R
Sbjct: 777 MINQGQIQVTINGFTVSNGLATTQIFTVEEVVVSGTVALQINNKAATGEEVPRTIVVTTR 836
Query: 45 QQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA 104
QYGLPED+IVYCNFNQLYKIDP TLQMW N+LK VPNS++WLL+FPAVGE NIQ AQ
Sbjct: 837 SQYGLPEDSIVYCNFNQLYKIDPPTLQMWANILKRVPNSVIWLLRFPAVGEPNIQQYAQN 896
Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
LGL RI+FS VA KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLA
Sbjct: 897 LGLPASRIIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLA 956
Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRVAASQL LGCPELIA++ +EY+D+A++LGTD +
Sbjct: 957 SRVAASQLTCLGCPELIAQSRQEYEDVAVKLGTDME 992
>gi|62821820|ref|NP_001017359.1| O-linked N-acetylglucosamine transferase isoform 1 [Danio rerio]
gi|56967372|gb|AAW31870.1| O-GlcNAc transferase variant 1 [Danio rerio]
Length = 1062
Score = 315 bits (807), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 147/216 (68%), Positives = 171/216 (79%), Gaps = 16/216 (7%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQT----------------NTKTATGEEVPQSIVITSR 44
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++IV+T+R
Sbjct: 787 MINQGQIQVTINGFTVSNGLATTQIFTVEEVVVSGTVALQINNKAATGEEVPRTIVVTTR 846
Query: 45 QQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA 104
QYGLPED+IVYCNFNQLYKIDP TLQMW N+LK VPNS++WLL+FPAVGE NIQ AQ
Sbjct: 847 SQYGLPEDSIVYCNFNQLYKIDPPTLQMWANILKRVPNSVIWLLRFPAVGEPNIQQYAQN 906
Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
LGL RI+FS VA KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLA
Sbjct: 907 LGLPASRIIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLA 966
Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRVAASQL LGCPELIA++ +EY+D+A++LGTD +
Sbjct: 967 SRVAASQLTCLGCPELIAQSRQEYEDVAVKLGTDME 1002
>gi|327287036|ref|XP_003228235.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 1 [Anolis carolinensis]
Length = 1066
Score = 314 bits (804), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 148/218 (67%), Positives = 171/218 (78%), Gaps = 20/218 (9%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 788 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 847
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP+TLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q RI+FS VA K
Sbjct: 848 QLYKIDPATLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQSRIIFSPVAPK 907
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT--------------------LPG 160
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT LPG
Sbjct: 908 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGEALRGSVYLFANDPFLPLPG 967
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ETLASRVAASQL LGC ELIA++ +EY+DIA++LGTD
Sbjct: 968 ETLASRVAASQLTCLGCLELIAKSRQEYEDIAVKLGTD 1005
>gi|410914070|ref|XP_003970511.1| PREDICTED: LOW QUALITY PROTEIN: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
[Takifugu rubripes]
Length = 1036
Score = 314 bits (804), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 142/200 (71%), Positives = 168/200 (84%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q + NG + NGLAT Q + K ATGEEV ++IV+T+R QYGLPED+IVYCNFN
Sbjct: 779 MINQGQIQVTXNGFTVSNGLATTQISNKAATGEEVLRTIVVTTRSQYGLPEDSIVYCNFN 838
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQMW N+LK VPNS+LWLL+FPAVGE NIQ AQ +GL R++FS VA K
Sbjct: 839 QLYKIDPPTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPASRVIFSPVAPK 898
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVA SQL LGCPEL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVATSQLNCLGCPEL 958
Query: 181 IARTHKEYQDIAIRLGTDRD 200
+A++ +EY+DIA++LG+D +
Sbjct: 959 VAQSRQEYEDIAVKLGSDME 978
>gi|348516828|ref|XP_003445939.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform
4 [Oreochromis niloticus]
Length = 1054
Score = 310 bits (794), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 144/216 (66%), Positives = 171/216 (79%), Gaps = 16/216 (7%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTN----------------TKTATGEEVPQSIVITSR 44
MI GQ+Q ++NG + NGLAT Q K ATGEEVP++IV+T+R
Sbjct: 779 MINQGQIQVTINGFTVSNGLATTQVRVTFRRTLLASSASCKLNKAATGEEVPRTIVVTTR 838
Query: 45 QQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA 104
QYGLPED+IVYCNFNQLYKIDP TLQMWVN+LK VPNS+LWLL+FPAVGE NIQ AQ
Sbjct: 839 SQYGLPEDSIVYCNFNQLYKIDPPTLQMWVNILKRVPNSVLWLLRFPAVGEPNIQQYAQN 898
Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
+GL RI+FS VA KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLA
Sbjct: 899 MGLPGSRIIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLA 958
Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRVAASQL LGCP+LIA++ ++Y+D+A++LG+D +
Sbjct: 959 SRVAASQLNCLGCPDLIAQSRQDYEDVAVKLGSDME 994
>gi|348516826|ref|XP_003445938.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit isoform 3
[Oreochromis niloticus]
Length = 1064
Score = 310 bits (794), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 144/216 (66%), Positives = 171/216 (79%), Gaps = 16/216 (7%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTN----------------TKTATGEEVPQSIVITSR 44
MI GQ+Q ++NG + NGLAT Q K ATGEEVP++IV+T+R
Sbjct: 789 MINQGQIQVTINGFTVSNGLATTQVRVTFRRTLLASSASCKLNKAATGEEVPRTIVVTTR 848
Query: 45 QQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA 104
QYGLPED+IVYCNFNQLYKIDP TLQMWVN+LK VPNS+LWLL+FPAVGE NIQ AQ
Sbjct: 849 SQYGLPEDSIVYCNFNQLYKIDPPTLQMWVNILKRVPNSVLWLLRFPAVGEPNIQQYAQN 908
Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
+GL RI+FS VA KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLA
Sbjct: 909 MGLPGSRIIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLA 968
Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRVAASQL LGCP+LIA++ ++Y+D+A++LG+D +
Sbjct: 969 SRVAASQLNCLGCPDLIAQSRQDYEDVAVKLGSDME 1004
>gi|443704233|gb|ELU01378.1| hypothetical protein CAPTEDRAFT_181048 [Capitella teleta]
Length = 1019
Score = 308 bits (790), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 140/200 (70%), Positives = 169/200 (84%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI G+ ++ I +QNGL TNQTN K+ATGEE+P SI++++RQQYGL DA+VYCNFN
Sbjct: 766 MIQEGKPTCTLEDIGIQNGLTTNQTNNKSATGEEIPPSIILSARQQYGLSNDAVVYCNFN 825
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP+TL++WVN+LKAVPNS++WLL+FPAVGE N+ A+ LGL +ILFS VA K
Sbjct: 826 QLYKIDPATLEVWVNILKAVPNSVMWLLRFPAVGETNVVDFAKRLGLTADKILFSPVAPK 885
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+P ETLASRVAASQL LGCPEL
Sbjct: 886 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPNETLASRVAASQLHCLGCPEL 945
Query: 181 IARTHKEYQDIAIRLGTDRD 200
+A+ +EYQ IAI+LGTD++
Sbjct: 946 VAKNREEYQRIAIKLGTDKE 965
>gi|391328604|ref|XP_003738777.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit
[Metaseiulus occidentalis]
Length = 1034
Score = 304 bits (779), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 141/199 (70%), Positives = 168/199 (84%), Gaps = 1/199 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
++A + V+G+VLQ+G A N + ATGEE P SI++T+RQQY LPEDAIVYCNFN
Sbjct: 786 VLAGTEEAADVDGVVLQSGAAAT-ANPREATGEEAPSSILVTTRQQYELPEDAIVYCNFN 844
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TLQMW+N+LKAVPNS+LWLL+FPAVGE I A A GL+++R++FSNVAAK
Sbjct: 845 QLYKIDPDTLQMWINILKAVPNSVLWLLRFPAVGEQQILAQVIAAGLEKNRVIFSNVAAK 904
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VTLP E+LASRVA+SQL LGCPEL
Sbjct: 905 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTLPLESLASRVASSQLTCLGCPEL 964
Query: 181 IARTHKEYQDIAIRLGTDR 199
IA+ K+Y++IAIRLGTD+
Sbjct: 965 IAKDRKDYENIAIRLGTDQ 983
>gi|390344140|ref|XP_784504.3| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
[Strongylocentrotus purpuratus]
Length = 1041
Score = 301 bits (772), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 141/203 (69%), Positives = 169/203 (83%), Gaps = 5/203 (2%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNT-----KTATGEEVPQSIVITSRQQYGLPEDAIV 55
+I SGQVQ ++NGI + NGLA Q K+ATGEEVP +IT+R YGLP+DAIV
Sbjct: 771 LIQSGQVQGTINGITVCNGLALTQVGDYKSIPKSATGEEVPSYPLITTRNMYGLPDDAIV 830
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFS 115
+CNFNQLYKIDP+TL+MWVN+++ VPNS+LWLL+FPA GE ++ ATA LG+ + ++FS
Sbjct: 831 FCNFNQLYKIDPATLRMWVNIIQKVPNSVLWLLRFPAAGEPHLLATATQLGMPKGSLVFS 890
Query: 116 NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATL 175
VA+KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP++TLPGETLASRVAASQL TL
Sbjct: 891 AVASKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMITLPGETLASRVAASQLQTL 950
Query: 176 GCPELIARTHKEYQDIAIRLGTD 198
GCPELIA + +EY+DIAIRLGTD
Sbjct: 951 GCPELIASSKQEYEDIAIRLGTD 973
>gi|324502359|gb|ADY41039.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
[Ascaris suum]
Length = 1100
Score = 301 bits (771), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 139/201 (69%), Positives = 165/201 (82%), Gaps = 1/201 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLAT-NQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
MI SG + +V G+ +QNGL T NQT+ K ATGEEVP +I++TSRQQYGLPEDAIV+CNF
Sbjct: 836 MIGSGMIAGNVEGVPVQNGLTTLNQTHVKAATGEEVPHTILVTSRQQYGLPEDAIVFCNF 895
Query: 60 NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
NQLYKIDPSTL MW +LK VPNS+LWLL+FP GEAN+ G+D R++FSNVAA
Sbjct: 896 NQLYKIDPSTLTMWCEILKLVPNSVLWLLRFPFHGEANVIRFCAERGIDARRVVFSNVAA 955
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP++T+P ETLASRVA+SQL LG PE
Sbjct: 956 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMITMPLETLASRVASSQLVALGVPE 1015
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
L+A+ ++Y IA RLGTDR+
Sbjct: 1016 LVAKDREDYIRIAARLGTDRE 1036
>gi|339242851|ref|XP_003377351.1| putative tetratricopeptide repeat-containing domain protein
[Trichinella spiralis]
gi|316973855|gb|EFV57404.1| putative tetratricopeptide repeat-containing domain protein
[Trichinella spiralis]
Length = 1062
Score = 300 bits (769), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 138/199 (69%), Positives = 166/199 (83%), Gaps = 1/199 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGL-ATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
M +GQVQTSV G+ +QNG+ A QT+ K ATGEEVPQSI+ITSR QYGLPEDAIVYCNF
Sbjct: 734 MFYTGQVQTSVEGVPVQNGVTALTQTHIKAATGEEVPQSILITSRHQYGLPEDAIVYCNF 793
Query: 60 NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
NQ+YKIDP TL+MW +L+ VPNS+LWLL+FP+ E N+ ++ G++ RI F+NVAA
Sbjct: 794 NQMYKIDPMTLKMWCEILQRVPNSVLWLLRFPSHAEPNVLKFCESQGVNTKRIKFNNVAA 853
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLAD+CLDTPLCNGHTT MD+LW GTP+VTLP ET ASRVA+SQL LGCPE
Sbjct: 854 KEEHVRRGQLADLCLDTPLCNGHTTGMDMLWAGTPMVTLPLETFASRVASSQLTALGCPE 913
Query: 180 LIARTHKEYQDIAIRLGTD 198
LIA+T ++Y +IA+RLGTD
Sbjct: 914 LIAKTREDYVNIAVRLGTD 932
>gi|405973204|gb|EKC37930.1| hypothetical protein CGI_10009704 [Crassostrea gigas]
Length = 269
Score = 300 bits (767), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 137/199 (68%), Positives = 164/199 (82%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
M +G + +N +V+QNG+ TNQTN K ATGEE+PQ+I+I++R QYGLPEDA++YCNFN
Sbjct: 12 MAQTGLSHSCINDVVIQNGVTTNQTNNKAATGEEIPQNIIISARSQYGLPEDAVIYCNFN 71
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
QLYKIDP TL MW+ +LK VP+ +LWLL+FPAVGE N+ TA GL RI+FS VA K
Sbjct: 72 QLYKIDPQTLHMWIEILKQVPDGVLWLLRFPAVGETNVLQTAANAGLSPGRIIFSPVAPK 131
Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VTL GETLASRVA+SQL TLGCPEL
Sbjct: 132 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTLLGETLASRVASSQLNTLGCPEL 191
Query: 181 IARTHKEYQDIAIRLGTDR 199
IA+T ++Y IA +LGTD+
Sbjct: 192 IAKTSEDYIRIASKLGTDK 210
>gi|393909951|gb|EFO25409.2| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase [Loa
loa]
Length = 1094
Score = 296 bits (759), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 137/201 (68%), Positives = 164/201 (81%), Gaps = 1/201 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLAT-NQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
MI SG + +V G+ +QNGL T NQT+ K ATGEEVP +I++TSRQQYGLP+DAIV+CNF
Sbjct: 832 MIGSGMIAGNVEGVPVQNGLTTLNQTHVKAATGEEVPHTILVTSRQQYGLPDDAIVFCNF 891
Query: 60 NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
NQLYKIDP TL MW ++LK+VPNSILWLL+FP GE N+ +D RI+FSNVAA
Sbjct: 892 NQLYKIDPPTLSMWCDILKSVPNSILWLLRFPYHGEPNVMRFCAEHNIDTRRIVFSNVAA 951
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP++T+P ETLASRVA+SQL LG PE
Sbjct: 952 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMITMPLETLASRVASSQLYALGVPE 1011
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
L+A+ ++Y IA RLGTDR+
Sbjct: 1012 LVAKDREDYIKIAKRLGTDRE 1032
>gi|170571769|ref|XP_001891856.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
[Brugia malayi]
gi|158603386|gb|EDP39326.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
putative [Brugia malayi]
Length = 1136
Score = 296 bits (758), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 137/201 (68%), Positives = 163/201 (81%), Gaps = 1/201 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLAT-NQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
MI SG + +V G+ +QNGL T NQT+ K ATGEEVP +I++TSRQQYGLP+DAIV+CNF
Sbjct: 874 MIGSGMIAGNVEGVPVQNGLTTLNQTHVKAATGEEVPHTILVTSRQQYGLPDDAIVFCNF 933
Query: 60 NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
NQLYKIDP TL MW ++LK VPNSILWLL+FP GE N+ +D RI+FSNVAA
Sbjct: 934 NQLYKIDPPTLSMWCDILKLVPNSILWLLRFPYHGEPNVMRFCAERNIDTRRIVFSNVAA 993
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP++T+P ETLASRVA+SQL LG PE
Sbjct: 994 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMITMPLETLASRVASSQLYALGVPE 1053
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
L+A+ ++Y IA RLGTDR+
Sbjct: 1054 LVAKDREDYIKIAKRLGTDRE 1074
>gi|402582718|gb|EJW76663.1| hypothetical protein WUBG_12430 [Wuchereria bancrofti]
Length = 284
Score = 294 bits (753), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 137/201 (68%), Positives = 163/201 (81%), Gaps = 1/201 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLAT-NQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
MI SG + +V G+ +QNGL T NQT+ K ATGEEVP +I++TSRQQYGLP+DAIV+CNF
Sbjct: 22 MIGSGMIAGNVEGVPVQNGLTTLNQTHVKAATGEEVPHTILVTSRQQYGLPDDAIVFCNF 81
Query: 60 NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
NQLYKIDP TL MW ++LK VPNSILWLL+FP GE N+ +D RI+FSNVAA
Sbjct: 82 NQLYKIDPPTLSMWCDILKLVPNSILWLLRFPYHGEPNVMRFCAERNIDTRRIVFSNVAA 141
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP++T+P ETLASRVA+SQL LG PE
Sbjct: 142 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMITMPLETLASRVASSQLYALGVPE 201
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
L+A+ ++Y IA RLGTDR+
Sbjct: 202 LVAKDREDYIKIAKRLGTDRE 222
>gi|358341529|dbj|GAA37010.2| polypeptide N-acetylglucosaminyltransferase [Clonorchis sinensis]
Length = 1056
Score = 291 bits (746), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 133/202 (65%), Positives = 161/202 (79%), Gaps = 2/202 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI + ++NG+ + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFN
Sbjct: 785 MIRTNSPSCAINGVTIHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFN 844
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD--QHRILFSNVA 118
QLYKIDPST++MWV +LK VPNS+LWLL+FPA GEA + A +GL Q RI+FSNVA
Sbjct: 845 QLYKIDPSTMRMWVEILKNVPNSVLWLLRFPAAGEAGALSAATEMGLQQVQRRIIFSNVA 904
Query: 119 AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCP 178
KEEHVRRGQ+ADVCLDTPLCNGHTT MDVLW G PVVTLP ETLASRVAASQL LGCP
Sbjct: 905 PKEEHVRRGQVADVCLDTPLCNGHTTGMDVLWAGCPVVTLPLETLASRVAASQLHALGCP 964
Query: 179 ELIARTHKEYQDIAIRLGTDRD 200
EL+A T ++Y IA +LG +R+
Sbjct: 965 ELVANTQEDYVRIASKLGNNRE 986
>gi|2266992|gb|AAB63465.1| O-linked GlcNAc transferase [Caenorhabditis elegans]
Length = 1151
Score = 285 bits (729), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 133/201 (66%), Positives = 161/201 (80%), Gaps = 1/201 (0%)
Query: 1 MIASGQVQTSV-NGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
MI +GQ+ +V + +QNGL +Q + K ATGEE+P S+++TSR QY LP+DAIV+CNF
Sbjct: 893 MIMTGQMTMNVMEDMNVQNGLGQSQMHHKAATGEEIPNSVLLTSRAQYQLPDDAIVFCNF 952
Query: 60 NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
NQLYKIDPSTL MW+ +L+ VP SILWLL+FP GE +I+ GLD RI+FSNVAA
Sbjct: 953 NQLYKIDPSTLDMWIKILENVPKSILWLLRFPYQGEEHIRKYCVERGLDPSRIVFSNVAA 1012
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP+VT+P E+LASRVA SQL LG PE
Sbjct: 1013 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMVTMPLESLASRVATSQLYALGVPE 1072
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
L+A+T +EY IA+RLGTD D
Sbjct: 1073 LVAKTRQEYVSIAVRLGTDAD 1093
>gi|115532690|ref|NP_001040860.1| Protein OGT-1, isoform a [Caenorhabditis elegans]
gi|33112401|sp|O18158.2|OGT1_CAEEL RecName: Full=UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase; AltName: Full=O-GlcNAc;
AltName: Full=OGT
gi|351065633|emb|CCD61614.1| Protein OGT-1, isoform a [Caenorhabditis elegans]
Length = 1151
Score = 285 bits (729), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 133/201 (66%), Positives = 161/201 (80%), Gaps = 1/201 (0%)
Query: 1 MIASGQVQTSV-NGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
MI +GQ+ +V + +QNGL +Q + K ATGEE+P S+++TSR QY LP+DAIV+CNF
Sbjct: 893 MIMTGQMTMNVMEDMNVQNGLGQSQMHHKAATGEEIPNSVLLTSRAQYQLPDDAIVFCNF 952
Query: 60 NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
NQLYKIDPSTL MW+ +L+ VP SILWLL+FP GE +I+ GLD RI+FSNVAA
Sbjct: 953 NQLYKIDPSTLDMWIKILENVPKSILWLLRFPYQGEEHIRKYCVERGLDPSRIVFSNVAA 1012
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP+VT+P E+LASRVA SQL LG PE
Sbjct: 1013 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMVTMPLESLASRVATSQLYALGVPE 1072
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
L+A+T +EY IA+RLGTD D
Sbjct: 1073 LVAKTRQEYVSIAVRLGTDAD 1093
>gi|115532692|ref|NP_001040861.1| Protein OGT-1, isoform b [Caenorhabditis elegans]
gi|351065634|emb|CCD61615.1| Protein OGT-1, isoform b [Caenorhabditis elegans]
Length = 973
Score = 285 bits (729), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 133/201 (66%), Positives = 161/201 (80%), Gaps = 1/201 (0%)
Query: 1 MIASGQVQTSV-NGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
MI +GQ+ +V + +QNGL +Q + K ATGEE+P S+++TSR QY LP+DAIV+CNF
Sbjct: 715 MIMTGQMTMNVMEDMNVQNGLGQSQMHHKAATGEEIPNSVLLTSRAQYQLPDDAIVFCNF 774
Query: 60 NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
NQLYKIDPSTL MW+ +L+ VP SILWLL+FP GE +I+ GLD RI+FSNVAA
Sbjct: 775 NQLYKIDPSTLDMWIKILENVPKSILWLLRFPYQGEEHIRKYCVERGLDPSRIVFSNVAA 834
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP+VT+P E+LASRVA SQL LG PE
Sbjct: 835 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMVTMPLESLASRVATSQLYALGVPE 894
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
L+A+T +EY IA+RLGTD D
Sbjct: 895 LVAKTRQEYVSIAVRLGTDAD 915
>gi|198434517|ref|XP_002131769.1| PREDICTED: similar to O-linked N-acetylglucosamine transferase
[Ciona intestinalis]
Length = 1042
Score = 285 bits (728), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 131/185 (70%), Positives = 155/185 (83%), Gaps = 1/185 (0%)
Query: 17 QNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNV 76
QNGLA Q + K ATGE+ P I+++SR QY LP +AIV+CNFNQLYKIDPSTL MW N+
Sbjct: 808 QNGLALVQIDAKAATGEKPPSGIIVSSRSQYRLPNNAIVFCNFNQLYKIDPSTLIMWCNI 867
Query: 77 LKAVPNSILWLLKFPAVGEANIQATA-QALGLDQHRILFSNVAAKEEHVRRGQLADVCLD 135
LK VPNS+LWLL+FPAVGEAN++ A Q G++ +RI+FS VA KEEHVRRGQLAD+CLD
Sbjct: 868 LKRVPNSVLWLLRFPAVGEANVKKFARQTCGINANRIIFSPVAPKEEHVRRGQLADICLD 927
Query: 136 TPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
TPLCNGHTT+MDVLW G P+VTLP ETLASRVA+SQLA LGCPELIA ++Y+DIA+RL
Sbjct: 928 TPLCNGHTTAMDVLWAGCPMVTLPKETLASRVASSQLACLGCPELIAENSQQYEDIAVRL 987
Query: 196 GTDRD 200
GTD D
Sbjct: 988 GTDMD 992
>gi|256079484|ref|XP_002576017.1| o-linked n-acetylglucosamine transferase ogt [Schistosoma mansoni]
Length = 1063
Score = 283 bits (725), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 134/202 (66%), Positives = 162/202 (80%), Gaps = 2/202 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI + ++NG+ + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFN
Sbjct: 803 MIRTNSPSCTINGVTIHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFN 862
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVA 118
QLYK+DPST++MWV +LK VPNS+LWLL+FPA GEA A A +GL Q RILFSNVA
Sbjct: 863 QLYKVDPSTMRMWVEILKGVPNSVLWLLRFPAAGEAGALAAASEMGLQQVNRRILFSNVA 922
Query: 119 AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCP 178
KEEHVRRGQ+ADVCLDTPLCNGHTT MDVLW G PVVTLP ETLASRVAASQL TLGCP
Sbjct: 923 PKEEHVRRGQVADVCLDTPLCNGHTTGMDVLWAGCPVVTLPLETLASRVAASQLHTLGCP 982
Query: 179 ELIARTHKEYQDIAIRLGTDRD 200
EL+A + ++Y IA +LG +R+
Sbjct: 983 ELVANSQEDYVRIATKLGNNRE 1004
>gi|353231229|emb|CCD77647.1| putative o-linked n-acetylglucosamine transferase, ogt [Schistosoma
mansoni]
Length = 1063
Score = 283 bits (725), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 134/202 (66%), Positives = 162/202 (80%), Gaps = 2/202 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI + ++NG+ + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFN
Sbjct: 803 MIRTNSPSCTINGVTIHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFN 862
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVA 118
QLYK+DPST++MWV +LK VPNS+LWLL+FPA GEA A A +GL Q RILFSNVA
Sbjct: 863 QLYKVDPSTMRMWVEILKGVPNSVLWLLRFPAAGEAGALAAASEMGLQQVNRRILFSNVA 922
Query: 119 AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCP 178
KEEHVRRGQ+ADVCLDTPLCNGHTT MDVLW G PVVTLP ETLASRVAASQL TLGCP
Sbjct: 923 PKEEHVRRGQVADVCLDTPLCNGHTTGMDVLWAGCPVVTLPLETLASRVAASQLHTLGCP 982
Query: 179 ELIARTHKEYQDIAIRLGTDRD 200
EL+A + ++Y IA +LG +R+
Sbjct: 983 ELVANSQEDYVRIATKLGNNRE 1004
>gi|226486966|emb|CAX75348.1| O-glycosyltransferase [Schistosoma japonicum]
Length = 1063
Score = 283 bits (725), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 133/202 (65%), Positives = 162/202 (80%), Gaps = 2/202 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI + ++NG+ + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFN
Sbjct: 803 MIRTNSPSCTINGVTIHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFN 862
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVA 118
QLYK+DPST++MWV +LK VPNS+LWLL+FPA GEA A A +GL Q RILFSNVA
Sbjct: 863 QLYKVDPSTMRMWVEILKGVPNSVLWLLRFPAAGEAGALAAASEMGLQQVNRRILFSNVA 922
Query: 119 AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCP 178
KEEHVRRGQ+AD+CLDTPLCNGHTT MDVLW G PVVTLP ETLASRVAASQL TLGCP
Sbjct: 923 PKEEHVRRGQVADICLDTPLCNGHTTGMDVLWAGCPVVTLPLETLASRVAASQLHTLGCP 982
Query: 179 ELIARTHKEYQDIAIRLGTDRD 200
EL+A + ++Y IA +LG +R+
Sbjct: 983 ELVANSQEDYVRIATKLGNNRE 1004
>gi|226486968|emb|CAX75349.1| O-glycosyltransferase [Schistosoma japonicum]
Length = 1045
Score = 283 bits (724), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 133/202 (65%), Positives = 162/202 (80%), Gaps = 2/202 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI + ++NG+ + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFN
Sbjct: 785 MIRTNSPSCTINGVTIHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFN 844
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVA 118
QLYK+DPST++MWV +LK VPNS+LWLL+FPA GEA A A +GL Q RILFSNVA
Sbjct: 845 QLYKVDPSTMRMWVEILKGVPNSVLWLLRFPAAGEAGALAAASEMGLQQVNRRILFSNVA 904
Query: 119 AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCP 178
KEEHVRRGQ+AD+CLDTPLCNGHTT MDVLW G PVVTLP ETLASRVAASQL TLGCP
Sbjct: 905 PKEEHVRRGQVADICLDTPLCNGHTTGMDVLWAGCPVVTLPLETLASRVAASQLHTLGCP 964
Query: 179 ELIARTHKEYQDIAIRLGTDRD 200
EL+A + ++Y IA +LG +R+
Sbjct: 965 ELVANSQEDYVRIATKLGNNRE 986
>gi|226486964|emb|CAX75347.1| O-glycosyltransferase [Schistosoma japonicum]
Length = 1045
Score = 283 bits (724), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 133/202 (65%), Positives = 162/202 (80%), Gaps = 2/202 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI + ++NG+ + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFN
Sbjct: 785 MIRTNSPSCTINGVTIHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFN 844
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVA 118
QLYK+DPST++MWV +LK VPNS+LWLL+FPA GEA A A +GL Q RILFSNVA
Sbjct: 845 QLYKVDPSTMRMWVEILKGVPNSVLWLLRFPAAGEAGALAAASEMGLQQVNRRILFSNVA 904
Query: 119 AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCP 178
KEEHVRRGQ+AD+CLDTPLCNGHTT MDVLW G PVVTLP ETLASRVAASQL TLGCP
Sbjct: 905 PKEEHVRRGQVADICLDTPLCNGHTTGMDVLWAGCPVVTLPLETLASRVAASQLHTLGCP 964
Query: 179 ELIARTHKEYQDIAIRLGTDRD 200
EL+A + ++Y IA +LG +R+
Sbjct: 965 ELVANSQEDYVRIATKLGNNRE 986
>gi|341897224|gb|EGT53159.1| CBN-OGT-1 protein [Caenorhabditis brenneri]
Length = 1171
Score = 278 bits (711), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 129/201 (64%), Positives = 160/201 (79%), Gaps = 1/201 (0%)
Query: 1 MIASGQVQTSV-NGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
MI +GQ+ +V + +QNGL +Q + K ATGEE+P S+++TSR QY LP+DAIV+CNF
Sbjct: 910 MIMTGQMTMNVMEDMNVQNGLGQSQMHHKAATGEEIPNSVLLTSRAQYQLPDDAIVFCNF 969
Query: 60 NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
NQLYKIDP+TL MW+ +L+ VP SILWLL+FP GE +I+ G++Q RI+FSNVAA
Sbjct: 970 NQLYKIDPATLDMWIKILENVPKSILWLLRFPYQGEEHIRKYCVERGIEQSRIVFSNVAA 1029
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP+VT+P ++LASRVA SQL LG PE
Sbjct: 1030 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMVTMPLDSLASRVATSQLYALGVPE 1089
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
L+A+T +EY IA RLG D D
Sbjct: 1090 LVAKTRQEYIRIATRLGNDAD 1110
>gi|357607539|gb|EHJ65577.1| hypothetical protein KGM_15173 [Danaus plexippus]
Length = 873
Score = 278 bits (710), Expect = 1e-72, Method: Composition-based stats.
Identities = 123/199 (61%), Positives = 156/199 (78%)
Query: 2 IASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQ 61
I+ Q Q VN I + NGL+ + K A+GEE +I++T+R+QY LPEDA+V+CNFNQ
Sbjct: 623 ISCKQEQVCVNNIFIDNGLSLRLSKKKIASGEERYDNIILTTRRQYNLPEDAVVFCNFNQ 682
Query: 62 LYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKE 121
LYK DP L+MW+N+L VPNS+LWLL FPA GE+N++ AQ GL RI+FS +A KE
Sbjct: 683 LYKTDPKALEMWINILNNVPNSVLWLLAFPAAGESNLRHFAQIRGLSPDRIIFSKIAPKE 742
Query: 122 EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELI 181
EHVRRGQ++DVCLDTPLCNGHTT+MD+LWTGTPVVTLPG+TLASRVA+SQL L C ELI
Sbjct: 743 EHVRRGQISDVCLDTPLCNGHTTTMDILWTGTPVVTLPGKTLASRVASSQLTALKCTELI 802
Query: 182 ARTHKEYQDIAIRLGTDRD 200
A++ K Y++IA +LG D +
Sbjct: 803 AKSEKNYEEIATKLGMDAE 821
>gi|268573864|ref|XP_002641909.1| C. briggsae CBR-OGT-1 protein [Caenorhabditis briggsae]
Length = 1148
Score = 275 bits (702), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 127/201 (63%), Positives = 157/201 (78%), Gaps = 1/201 (0%)
Query: 1 MIASGQVQTSV-NGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
MI +GQ+ +V + +QNGL +Q + K ATGEE+P S+++TSR QY LP+DAIV+CNF
Sbjct: 890 MIMTGQMTMNVMEDVNVQNGLGQSQMHHKAATGEEIPNSVLLTSRAQYQLPDDAIVFCNF 949
Query: 60 NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
NQLYKIDP TL MW+ +L+ VP S+LWLL+FP GE +I+ G++ RI+FSNVAA
Sbjct: 950 NQLYKIDPPTLDMWIKILENVPKSVLWLLRFPYQGEEHIRKYCVERGIEPSRIVFSNVAA 1009
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP+VT+P E+LASRVA SQL LG PE
Sbjct: 1010 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMVTMPLESLASRVATSQLYALGVPE 1069
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
L+A+T EY IA RLG D +
Sbjct: 1070 LVAKTRNEYVQIATRLGNDAE 1090
>gi|76154911|gb|AAX26306.2| SJCHGC00985 protein [Schistosoma japonicum]
Length = 246
Score = 272 bits (695), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 129/187 (68%), Positives = 154/187 (82%), Gaps = 2/187 (1%)
Query: 16 LQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVN 75
+ NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFNQLYK+DPST++MWV
Sbjct: 1 IHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFNQLYKVDPSTMRMWVE 60
Query: 76 VLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVAAKEEHVRRGQLADVC 133
+LK VPNS+LWLL+FPA GEA A A +GL Q RILFSNVA KEEHVRRGQ+AD+C
Sbjct: 61 ILKGVPNSVLWLLRFPAAGEAGALAAASEMGLQQVNRRILFSNVAPKEEHVRRGQVADIC 120
Query: 134 LDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAI 193
LDTPLCNGHTT MDVLW G PVVTLP ETLASRVAASQL TLGCPEL+A + ++Y IA
Sbjct: 121 LDTPLCNGHTTGMDVLWAGCPVVTLPLETLASRVAASQLHTLGCPELVANSQEDYVRIAT 180
Query: 194 RLGTDRD 200
+LG +R+
Sbjct: 181 KLGNNRE 187
>gi|308501583|ref|XP_003112976.1| CRE-OGT-1 protein [Caenorhabditis remanei]
gi|308265277|gb|EFP09230.1| CRE-OGT-1 protein [Caenorhabditis remanei]
Length = 1255
Score = 271 bits (692), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 126/186 (67%), Positives = 148/186 (79%)
Query: 15 VLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWV 74
V Q L +Q + K ATGEE+P S+++TSR QY LP+DAIV+CNFNQLYKIDPSTL MW+
Sbjct: 1012 VQQLNLFQSQMHHKAATGEEIPNSVLLTSRAQYQLPDDAIVFCNFNQLYKIDPSTLDMWI 1071
Query: 75 NVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCL 134
+L+ VP SILWLL+FP GE +I+ GL+ RI+FSNVAAKEEHVRRGQLADVCL
Sbjct: 1072 KILENVPKSILWLLRFPYQGEEHIRKYCVERGLEPSRIVFSNVAAKEEHVRRGQLADVCL 1131
Query: 135 DTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIR 194
DTPLCNGHTT MD+LWTGTP+VT+P E+LASRVA SQL LG PEL+A+T EY DIA R
Sbjct: 1132 DTPLCNGHTTGMDILWTGTPMVTMPLESLASRVATSQLYALGVPELVAKTKAEYVDIATR 1191
Query: 195 LGTDRD 200
LG D D
Sbjct: 1192 LGNDAD 1197
>gi|156401113|ref|XP_001639136.1| predicted protein [Nematostella vectensis]
gi|156226262|gb|EDO47073.1| predicted protein [Nematostella vectensis]
Length = 993
Score = 265 bits (678), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 118/169 (69%), Positives = 141/169 (83%)
Query: 31 TGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF 90
+GE +P ++ +T R QYGLPEDA VYCNFNQLYKIDP TL+ W N+L+ VPNS+LWLLKF
Sbjct: 770 SGERIPLTMPLTCRSQYGLPEDAFVYCNFNQLYKIDPKTLKTWANILRRVPNSVLWLLKF 829
Query: 91 PAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLW 150
PAVGE+NI+A ++GL Q R++FS VA KEEHVRRGQLADVCLDTPLCNGHTT MDVLW
Sbjct: 830 PAVGESNIKAEVSSMGLSQDRVIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTCMDVLW 889
Query: 151 TGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G PVVTLP +TLASRVA+SQL LGC ELIA + +Y++IA+RLG D+
Sbjct: 890 AGCPVVTLPLDTLASRVASSQLNALGCSELIADSRGDYEEIAVRLGNDK 938
>gi|340376830|ref|XP_003386934.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
[Amphimedon queenslandica]
Length = 1029
Score = 255 bits (651), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 122/194 (62%), Positives = 150/194 (77%), Gaps = 5/194 (2%)
Query: 6 QVQTSVNGIVLQNGLA-TNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYK 64
++Q S+N + Q +A T Q A +P + +T+R QYGLPE A+VYCNFNQLYK
Sbjct: 789 KIQASLNQLSQQTAIASTPQPEHPLA----MPTDLPLTTRAQYGLPEGAVVYCNFNQLYK 844
Query: 65 IDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHV 124
IDP+TL+ W N+LK VPNSI+WLL+FPAVGE N+ AT Q LG+ RI+FS VA KEEHV
Sbjct: 845 IDPATLESWANILKRVPNSIMWLLRFPAVGEPNVVATTQKLGIQPSRIIFSPVAPKEEHV 904
Query: 125 RRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIART 184
RRG+LAD+CLDTPLCNGHTT MDVLW GTPV+TLP ETLASRVAASQL +LG PEL+A +
Sbjct: 905 RRGRLADMCLDTPLCNGHTTGMDVLWAGTPVLTLPLETLASRVAASQLISLGFPELVASS 964
Query: 185 HKEYQDIAIRLGTD 198
+EY+DIA+ G +
Sbjct: 965 RQEYEDIAVTYGNN 978
>gi|196006385|ref|XP_002113059.1| hypothetical protein TRIADDRAFT_56833 [Trichoplax adhaerens]
gi|190585100|gb|EDV25169.1| hypothetical protein TRIADDRAFT_56833 [Trichoplax adhaerens]
Length = 996
Score = 247 bits (631), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 117/207 (56%), Positives = 147/207 (71%), Gaps = 12/207 (5%)
Query: 4 SGQVQTSVNGI---------VLQNG---LATNQTNTKTATGEEVPQSIVITSRQQYGLPE 51
S + TS N I V+ NG + N + G VP +V+T+R QYGLP
Sbjct: 738 SSHINTSTNLISNGTLGSNAVIPNGGPIINGNGDSVPNGPGVNVPSGMVLTTRAQYGLPS 797
Query: 52 DAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHR 111
AI+YCNFNQLYKIDP L+ WV +LK VPNSILWLL+FPA GEA ++ T +GL R
Sbjct: 798 KAIIYCNFNQLYKIDPKILETWVRILKRVPNSILWLLRFPAAGEAMLKNTILKMGLPADR 857
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
++F+NVA KEEHVRRGQLAD+CLDTP+CNGHTT MDVLW G P++T+P ETLASRVA+SQ
Sbjct: 858 VVFTNVAPKEEHVRRGQLADICLDTPMCNGHTTGMDVLWAGVPMITMPKETLASRVASSQ 917
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
L +LG ELIA+ EY+D+A++ G+D
Sbjct: 918 LTSLGVSELIAKDLTEYEDVAVKYGSD 944
>gi|313240003|emb|CBY32363.1| unnamed protein product [Oikopleura dioica]
Length = 1054
Score = 246 bits (629), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 116/201 (57%), Positives = 147/201 (73%), Gaps = 1/201 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
M+ GQ + I + NGL + + GE Q+ V TSR+QYGLP+DAIVYCNFN
Sbjct: 798 MVIKGQQCLQIGDIKIINGLCLPAFEAQASQGEAKIQNAVYTSRRQYGLPDDAIVYCNFN 857
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA-QALGLDQHRILFSNVAA 119
QLYK+DP+T+ W N+LK VP++++WLL+FPA+GE ++ + + + RI+FS VAA
Sbjct: 858 QLYKLDPNTMSAWCNILKKVPDAVIWLLRFPALGERHVHDWCWRHHNIPKERIIFSPVAA 917
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTT MDVLW G P+VTLP E+ ASRVA+SQ+ TLG E
Sbjct: 918 KEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGCPMVTLPLESFASRVASSQMKTLGLDE 977
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
LIA ++ Y+ IAIRLG D D
Sbjct: 978 LIADDYESYESIAIRLGRDVD 998
>gi|313226628|emb|CBY21773.1| unnamed protein product [Oikopleura dioica]
Length = 1054
Score = 246 bits (627), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 115/201 (57%), Positives = 147/201 (73%), Gaps = 1/201 (0%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
M+ GQ + I + NGL + + GE Q+ V TSR+QYGLP+D+IVYCNFN
Sbjct: 798 MVIKGQQCLQIGDIKIINGLCLPAFEAQASQGEAKIQNAVYTSRRQYGLPDDSIVYCNFN 857
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA-QALGLDQHRILFSNVAA 119
QLYK+DP+T+ W N+LK VP++++WLL+FPA+GE ++ + + + RI+FS VAA
Sbjct: 858 QLYKLDPNTMSAWCNILKKVPDAVIWLLRFPALGERHVHDWCWRHHNIPKERIIFSPVAA 917
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTT MDVLW G P+VTLP E+ ASRVA+SQ+ TLG E
Sbjct: 918 KEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGCPMVTLPLESFASRVASSQMKTLGLDE 977
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
LIA ++ Y+ IAIRLG D D
Sbjct: 978 LIADDYESYESIAIRLGRDVD 998
>gi|172087340|ref|XP_001913212.1| UDP-N-acetylglucosaminyltransferase [Oikopleura dioica]
gi|42601339|gb|AAS21365.1| UDP-N-acetylglucosaminyltransferase [Oikopleura dioica]
Length = 1070
Score = 242 bits (617), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 116/217 (53%), Positives = 147/217 (67%), Gaps = 17/217 (7%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
M+ GQ + I + NGL + + GE Q+ V TSR+QYGLP+DAIVYCNFN
Sbjct: 798 MVIKGQQCLQIGDIKIINGLCLPAFEAQASQGEAKIQNAVYTSRRQYGLPDDAIVYCNFN 857
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ-----------------ATAQ 103
QLYK+DP+T+ W N+LK VP++++WLL+FPA+GE ++ +
Sbjct: 858 QLYKLDPNTMSAWCNILKKVPDAVIWLLRFPALGERHVHDWCWRLVSIIFTKPLTNYSCS 917
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
+ + RI+FS VAAKEEHVRRGQLADVCLDTPLCNGHTT MDVLW G P+VTLP E+
Sbjct: 918 HHNIPKERIIFSPVAAKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGCPMVTLPLESF 977
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
ASRVA+SQ+ TLG ELIA ++ Y+ IAIRLG D D
Sbjct: 978 ASRVASSQMKTLGLDELIADDYESYESIAIRLGRDVD 1014
>gi|312071553|ref|XP_003138661.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase [Loa
loa]
Length = 1205
Score = 239 bits (609), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 119/201 (59%), Positives = 142/201 (70%), Gaps = 26/201 (12%)
Query: 1 MIASGQVQTSVNGIVLQNGLAT-NQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
MI SG + +V G+ +QNGL T NQT+ K ATGEEVP +I++TSRQQYGLP+DAIV
Sbjct: 968 MIGSGMIAGNVEGVPVQNGLTTLNQTHVKAATGEEVPHTILVTSRQQYGLPDDAIV---- 1023
Query: 60 NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
PNSILWLL+FP GE N+ +D RI+FSNVAA
Sbjct: 1024 ---------------------PNSILWLLRFPYHGEPNVMRFCAEHNIDTRRIVFSNVAA 1062
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP++T+P ETLASRVA+SQL LG PE
Sbjct: 1063 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMITMPLETLASRVASSQLYALGVPE 1122
Query: 180 LIARTHKEYQDIAIRLGTDRD 200
L+A+ ++Y IA RLGTDR+
Sbjct: 1123 LVAKDREDYIKIAKRLGTDRE 1143
>gi|167527386|ref|XP_001748025.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773443|gb|EDQ87082.1| predicted protein [Monosiga brevicollis MX1]
Length = 1173
Score = 223 bits (568), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 97/167 (58%), Positives = 129/167 (77%), Gaps = 2/167 (1%)
Query: 32 GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
G+E Q ++ R+ YGLPED +++CNFNQLYKIDP W+N+LK VPNS+LW+L+FP
Sbjct: 880 GQEATQPML--HRRFYGLPEDKVIFCNFNQLYKIDPLMFDTWLNILKRVPNSVLWILRFP 937
Query: 92 AVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
GE N+Q GL R++FS VA K EHVRRG LAD+CLDT +CNGHTT MD+LW
Sbjct: 938 PAGEQNLQKRIIEAGLTLDRVIFSPVAGKIEHVRRGALADICLDTHVCNGHTTGMDILWA 997
Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GTP+++LPG+TLASRVA+S L TLGCPELIAR+H++Y ++A++ ++
Sbjct: 998 GTPMISLPGDTLASRVASSLLKTLGCPELIARSHEDYVNLAVQYASN 1044
>gi|326432574|gb|EGD78144.1| O-linked N-acetylglucosamine transferase [Salpingoeca sp. ATCC 50818]
Length = 1496
Score = 214 bits (545), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 93/153 (60%), Positives = 120/153 (78%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R YGLP+D I+ CNFNQLYKIDP T +W+++LKA N++LW+L+FP GE N++
Sbjct: 1033 TRAYYGLPQDKIILCNFNQLYKIDPMTFSLWLDILKATTNTVLWILRFPPAGEQNLRMHV 1092
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA GLDQ R++FS VA K EHVRRG LAD+CLDT CNGHTT MD+LW GTP++TLPG+T
Sbjct: 1093 QAAGLDQSRVIFSPVAGKVEHVRRGALADLCLDTHACNGHTTGMDILWAGTPMLTLPGDT 1152
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
LASRVA+S + LGCPELIA + ++Y + A+ L
Sbjct: 1153 LASRVASSLVTALGCPELIASSRQDYFNKAVDL 1185
>gi|301094298|ref|XP_002896255.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase,
putative [Phytophthora infestans T30-4]
gi|262109650|gb|EEY67702.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase,
putative [Phytophthora infestans T30-4]
Length = 977
Score = 199 bits (506), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 89/176 (50%), Positives = 126/176 (71%), Gaps = 1/176 (0%)
Query: 26 NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
N + +V + +R YG+PED V+CNFNQ+YKIDP T W+N+LK VPNS+L
Sbjct: 719 NDHKQSARDVLDTEKCPTRADYGVPEDKFVFCNFNQVYKIDPVTFTTWMNILKRVPNSVL 778
Query: 86 WLLKFPAVGEANIQATAQALGL-DQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTT 144
WLL+FP + EANI+A A+A G+ DQ R++F++VA K+EH++RG LAD+ LDTP CN HTT
Sbjct: 779 WLLRFPPIAEANIRAEARARGVKDQTRLIFTDVAPKDEHLKRGYLADLFLDTPECNAHTT 838
Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
D+LW GTP+VT+P + +A+RVA+S L ELI + +EY+++A+ L +D D
Sbjct: 839 GCDILWGGTPMVTMPKDRMATRVASSLLRAANMSELITNSLEEYEELAVALASDMD 894
>gi|348687602|gb|EGZ27416.1| hypothetical protein PHYSODRAFT_553979 [Phytophthora sojae]
Length = 986
Score = 196 bits (499), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 88/176 (50%), Positives = 125/176 (71%), Gaps = 1/176 (0%)
Query: 26 NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
N + +V + +R YG+P+D V+CNFNQ+YKIDP T W+N+LK VPNS+L
Sbjct: 719 NDHKQSARDVLDTEKCPTRADYGVPDDKFVFCNFNQVYKIDPVTFTTWMNILKRVPNSVL 778
Query: 86 WLLKFPAVGEANIQATAQALGL-DQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTT 144
WLL+FP + EANI+A A+A G+ DQ R++F++VA K EH++RG LAD+ LDTP CN HTT
Sbjct: 779 WLLRFPPIAEANIRAEARARGVKDQTRLIFTDVAPKNEHLKRGYLADLFLDTPECNAHTT 838
Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
D+LW GTP+VT+P + +A+RVA+S L ELI + +EY+++A+ L +D D
Sbjct: 839 GCDILWGGTPMVTMPKDRMATRVASSLLRAANMSELITNSLEEYEELAVALASDMD 894
>gi|325187990|emb|CCA22532.1| predicted protein putative [Albugo laibachii Nc14]
Length = 968
Score = 194 bits (494), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 88/173 (50%), Positives = 119/173 (68%)
Query: 26 NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
N + EV R+QYG+P+ V+CNFNQ+YKIDP T W+NVLK VPNSIL
Sbjct: 721 NDHKQSAREVLDVSKCPKREQYGVPQGKFVFCNFNQVYKIDPDTFTTWMNVLKRVPNSIL 780
Query: 86 WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
WLL+FP V EANI+A A+ G+ R+ F++VA KEEH++RG LAD+ LDTP CN HTT
Sbjct: 781 WLLRFPPVAEANIRAQARERGVPDDRLQFTDVAKKEEHLKRGHLADLFLDTPECNAHTTG 840
Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
D+LW GTP++TL + +A+RVA+S L E+I + +EY+++A+ L TD
Sbjct: 841 CDILWGGTPMITLAKDRMATRVASSLLRAANLDEMITNSLEEYEELAVTLATD 893
>gi|298710892|emb|CBJ26401.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
family GT41 [Ectocarpus siliculosus]
Length = 1080
Score = 183 bits (465), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 83/175 (47%), Positives = 118/175 (67%), Gaps = 2/175 (1%)
Query: 26 NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
N + V ++ +R YG+PED V CNFNQLYK+DP+ W++VLK VPN++L
Sbjct: 721 NDHKQSARYVLDRTLLPTRATYGVPEDRFVLCNFNQLYKMDPAIFSTWMSVLKRVPNAVL 780
Query: 86 WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
WLL+FP GEANI+ A+ G+ + ++ F++VA KEEH++RG LAD+ LDTP CN HTT
Sbjct: 781 WLLRFPPAGEANIRMEARKRGVREEQLHFTDVATKEEHIKRGYLADLFLDTPSCNAHTTG 840
Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCP--ELIARTHKEYQDIAIRLGTD 198
D+LW+GTP++T+ G +A+RVA S L G LI + +EY++ A+ L TD
Sbjct: 841 CDILWSGTPMLTMAGSKMATRVAPSLLKAAGAEGTGLIVESLEEYEERAVSLATD 895
>gi|328699992|ref|XP_003241117.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 2 [Acyrthosiphon pisum]
gi|328699994|ref|XP_003241118.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 3 [Acyrthosiphon pisum]
gi|328699996|ref|XP_003241119.1| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 4 [Acyrthosiphon pisum]
gi|328699998|ref|XP_001952129.2| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
isoform 1 [Acyrthosiphon pisum]
Length = 1108
Score = 182 bits (463), Expect = 5e-44, Method: Composition-based stats.
Identities = 80/166 (48%), Positives = 113/166 (68%)
Query: 34 EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV 93
E+ +S+ R + LPE+ +V+CNF++LYKIDP T +MW+N+L VPNS+LWLL +
Sbjct: 904 ELEESVKSYYRSTFNLPENVVVFCNFSKLYKIDPFTFRMWLNILNNVPNSVLWLLHLSDI 963
Query: 94 GEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGT 153
E N++ A L D RI+F++ K +H+ R QLAD+ LDT LCNGH +D +W G
Sbjct: 964 AENNLRTFADHLNFDTSRIIFADFIPKYQHLNRIQLADIYLDTHLCNGHIACLDAIWAGV 1023
Query: 154 PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
PV+TLPG+T SRV ASQL TLG + IA+ + Y +IAI+LG ++
Sbjct: 1024 PVITLPGDTYESRVTASQLTTLGIADTIAQNEENYIEIAIQLGLNK 1069
>gi|147857906|emb|CAN80393.1| hypothetical protein VITISV_001595 [Vitis vinifera]
Length = 566
Score = 182 bits (461), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 85/157 (54%), Positives = 108/157 (68%), Gaps = 1/157 (0%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R YGLPED ++ FNQLYK+DP W N+LK VPNS LWLL+FPA GE +++ A
Sbjct: 348 RSDYGLPEDKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRSYAV 407
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A GL RI+F++VA K EH+RR LAD+ LDTPLCN HTT D+LW G P+VTLP E +
Sbjct: 408 AQGLQPDRIIFTDVAMKHEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMVTLPLEKM 467
Query: 164 ASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
A+RVA S LAT E+I + KEY++ A+ L +R
Sbjct: 468 ATRVAGSLCLATGLGEEMIVSSMKEYEEKAVSLAMNR 504
>gi|225430656|ref|XP_002270163.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC [Vitis vinifera]
gi|296085167|emb|CBI28662.3| unnamed protein product [Vitis vinifera]
Length = 986
Score = 181 bits (460), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 85/157 (54%), Positives = 108/157 (68%), Gaps = 1/157 (0%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R YGLPED ++ FNQLYK+DP W N+LK VPNS LWLL+FPA GE +++ A
Sbjct: 768 RSDYGLPEDKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRSYAV 827
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A GL RI+F++VA K EH+RR LAD+ LDTPLCN HTT D+LW G P+VTLP E +
Sbjct: 828 AQGLQPDRIIFTDVAMKHEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMVTLPLEKM 887
Query: 164 ASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
A+RVA S LAT E+I + KEY++ A+ L +R
Sbjct: 888 ATRVAGSLCLATGLGEEMIVSSMKEYEEKAVSLAMNR 924
>gi|384254151|gb|EIE27625.1| TPR-like protein [Coccomyxa subellipsoidea C-169]
Length = 945
Score = 181 bits (460), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 88/159 (55%), Positives = 109/159 (68%), Gaps = 1/159 (0%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ SR + GLPED IVY NQLYK DP T W N+L+ VPNS+LWLL+FP GE I+A
Sbjct: 734 LPSRTEVGLPEDRIVYSCSNQLYKYDPETFTTWCNILRRVPNSVLWLLRFPPYGEPRIRA 793
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A A G+D RI+F++VAAK H+RR +ADV LDTPLCN HTT DVLW G P+VTLP
Sbjct: 794 EAAARGVDPARIIFTDVAAKPLHIRRSGIADVFLDTPLCNAHTTGCDVLWGGCPMVTLPL 853
Query: 161 ETLASRVAASQLATLGC-PELIARTHKEYQDIAIRLGTD 198
E +ASRVAAS G E++ + EY+D A+ LG +
Sbjct: 854 ERMASRVAASLCYATGLGHEMVVSSQHEYEDRAVELGLN 892
>gi|326498109|dbj|BAJ94917.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 986
Score = 181 bits (459), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 82/157 (52%), Positives = 110/157 (70%), Gaps = 1/157 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLP+D ++ FNQLYK+DP W N++K VPNS+LWLL+FPA GE ++A A
Sbjct: 767 KRSDYGLPKDKFIFACFNQLYKMDPEIFDTWCNIVKRVPNSVLWLLRFPATGEMRVKAHA 826
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ +I+F++VA K EH+RR +LAD+ LDTPLCN HTT D+LW G P++TLP E
Sbjct: 827 AARGVSPDQIIFTDVAMKHEHIRRSELADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 886
Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTD 198
+A+RVA S LAT E+I + KEY+D A+ L T+
Sbjct: 887 MATRVAGSLCLATGLGDEMIVSSTKEYEDRAVELATN 923
>gi|326523469|dbj|BAJ92905.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 181 bits (458), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 82/157 (52%), Positives = 110/157 (70%), Gaps = 1/157 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLP+D ++ FNQLYK+DP W N++K VPNS+LWLL+FPA GE ++A A
Sbjct: 149 KRSDYGLPKDKFIFACFNQLYKMDPEIFDTWCNIVKRVPNSVLWLLRFPATGEMRVKAHA 208
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ +I+F++VA K EH+RR +LAD+ LDTPLCN HTT D+LW G P++TLP E
Sbjct: 209 AARGVSPDQIIFTDVAMKHEHIRRSELADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 268
Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTD 198
+A+RVA S LAT E+I + KEY+D A+ L T+
Sbjct: 269 MATRVAGSLCLATGLGDEMIVSSTKEYEDRAVELATN 305
>gi|357441255|ref|XP_003590905.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
gi|355479953|gb|AES61156.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
Length = 986
Score = 180 bits (457), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 84/158 (53%), Positives = 107/158 (67%), Gaps = 1/158 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP W N+LK VPNS LWLLKFPA GE ++A A
Sbjct: 767 KRSDYGLPEDKFLFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLKFPAAGEMRLRAYA 826
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ +I+F++VA K EH+RR LAD+ LDTPLCN HTT D+LW G P+VTLP E
Sbjct: 827 AAQGVQPDQIIFTDVAMKGEHIRRSSLADLFLDTPLCNAHTTGTDILWAGLPMVTLPLEK 886
Query: 163 LASRVAASQLATLGC-PELIARTHKEYQDIAIRLGTDR 199
+A+RVA S + G E+I + KEY+D A+ L +R
Sbjct: 887 MATRVAGSLCISTGLGEEMIVSSMKEYEDRAVSLALNR 924
>gi|357441257|ref|XP_003590906.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
gi|355479954|gb|AES61157.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
Length = 744
Score = 180 bits (457), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/158 (53%), Positives = 107/158 (67%), Gaps = 1/158 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP W N+LK VPNS LWLLKFPA GE ++A A
Sbjct: 525 KRSDYGLPEDKFLFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLKFPAAGEMRLRAYA 584
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ +I+F++VA K EH+RR LAD+ LDTPLCN HTT D+LW G P+VTLP E
Sbjct: 585 AAQGVQPDQIIFTDVAMKGEHIRRSSLADLFLDTPLCNAHTTGTDILWAGLPMVTLPLEK 644
Query: 163 LASRVAASQLATLGC-PELIARTHKEYQDIAIRLGTDR 199
+A+RVA S + G E+I + KEY+D A+ L +R
Sbjct: 645 MATRVAGSLCISTGLGEEMIVSSMKEYEDRAVSLALNR 682
>gi|356535232|ref|XP_003536152.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC-like [Glycine max]
Length = 988
Score = 180 bits (456), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 84/158 (53%), Positives = 108/158 (68%), Gaps = 1/158 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP W N+LK VPNS LWLL+FPA GE ++A A
Sbjct: 769 KRSDYGLPEDKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRAYA 828
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ +I+F++VA K EH+RR LAD+ LD+PLCN HTT D+LW G P+VTLP E
Sbjct: 829 AAQGVQPDQIIFTDVATKNEHIRRSSLADLFLDSPLCNAHTTGTDILWAGLPMVTLPLEK 888
Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
+A+RVA S LAT E+I + KEY+D A+ L +R
Sbjct: 889 MATRVAGSLCLATGLGDEMIVSSMKEYEDRAVSLALNR 926
>gi|357162940|ref|XP_003579571.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC-like [Brachypodium
distachyon]
Length = 983
Score = 179 bits (455), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 84/174 (48%), Positives = 113/174 (64%), Gaps = 1/174 (0%)
Query: 26 NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
N +V + R YGLPED ++ FNQLYK+DP W N++K VPNS+L
Sbjct: 747 NDYKQKNRDVFSPLCPHKRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNIVKRVPNSVL 806
Query: 86 WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
WLL+FPA GE ++A A A G+ +I+F++VA K EH+RR +LAD+ LDTPLCN HTT
Sbjct: 807 WLLRFPATGEMRVKAHAAARGVSPDQIIFTDVAVKHEHIRRSELADLFLDTPLCNAHTTG 866
Query: 146 MDVLWTGTPVVTLPGETLASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTD 198
D+LW G P++TL E +A+RVA S LAT E+I + KEY+D A+ L T+
Sbjct: 867 TDILWAGLPMITLALEKMATRVAGSLCLATGLGDEMIVSSTKEYEDRAVELATN 920
>gi|428175412|gb|EKX44302.1| hypothetical protein GUITHDRAFT_95096, partial [Guillardia theta
CCMP2712]
Length = 286
Score = 178 bits (452), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 79/139 (56%), Positives = 101/139 (72%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR QYG+PED IV CNFNQLYKIDP L +W +L +P ++LWLL+FP GEA I+ A
Sbjct: 143 SRAQYGIPEDKIVLCNFNQLYKIDPLILDVWCRILHKLPETVLWLLRFPHAGEAGIRREA 202
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G+ RI+F++VA K EH+RRG AD+ LD+ CN HTT D+LW+GTP+VT P E
Sbjct: 203 RARGIADDRIIFTDVANKSEHIRRGVCADLFLDSYQCNAHTTGCDILWSGTPIVTTPQEK 262
Query: 163 LASRVAASQLATLGCPELI 181
++ RVAAS + LGC ELI
Sbjct: 263 MSCRVAASLVNALGCKELI 281
>gi|224123642|ref|XP_002319130.1| predicted protein [Populus trichocarpa]
gi|222857506|gb|EEE95053.1| predicted protein [Populus trichocarpa]
Length = 923
Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 83/158 (52%), Positives = 109/158 (68%), Gaps = 1/158 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP W N+LK VPNS LWLL+FPA GE ++A A
Sbjct: 707 KRSDYGLPEDKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRAYA 766
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ +I+F++VA K+EH+RR LAD+ LDTPLCN HTT D+LW G P+VT+P E
Sbjct: 767 VAQGVQPDQIIFTDVAMKQEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMVTMPLEK 826
Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
+A+RVA S LAT E+I + KEY++ A+ L +R
Sbjct: 827 MATRVAGSLCLATGLGDEMIVSSMKEYEERAVSLALNR 864
>gi|452824446|gb|EME31449.1| polypeptide N-acetylglucosaminyltransferase [Galdieria sulphuraria]
Length = 1024
Score = 178 bits (451), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 87/168 (51%), Positives = 111/168 (66%), Gaps = 7/168 (4%)
Query: 40 VITSRQQYGLPEDA-------IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA 92
+I +R+ YGLP+ +++ NFNQLYK+DPSTL +W N+LK VP S +WLL+FP
Sbjct: 776 IILTRESYGLPKSVYEGGDCTVLFANFNQLYKLDPSTLNIWCNILKRVPGSKIWLLRFPP 835
Query: 93 VGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTG 152
EA + A + RI+F++VA KEEH+ R LAD+ LDTP+CN HTT+ D LW+G
Sbjct: 836 AAEARLLAQVSKRHISTDRIIFTDVAPKEEHIARCGLADIFLDTPVCNAHTTATDALWSG 895
Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
PVVT P LASRVAAS L G ELIAR EYQDIAI L + +
Sbjct: 896 LPVVTGPTVRLASRVAASVLHAAGLDELIARDMNEYQDIAIYLAYNYE 943
>gi|302765547|ref|XP_002966194.1| peptide-N-acetylglucosaminyltransferase [Selaginella
moellendorffii]
gi|300165614|gb|EFJ32221.1| peptide-N-acetylglucosaminyltransferase [Selaginella
moellendorffii]
Length = 985
Score = 177 bits (450), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 84/174 (48%), Positives = 111/174 (63%), Gaps = 1/174 (0%)
Query: 26 NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
N +V S + R YGLPED ++ FNQLYK+DP W N+LK VPNS L
Sbjct: 750 NDYKQKNRDVLDSTNLPKRSDYGLPEDKFIFACFNQLYKMDPEIFTTWCNILKRVPNSAL 809
Query: 86 WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
WLL+FPA GE ++A A A G+ +I+F++VAAK EH+RR LAD+ LDTPLCN HTT
Sbjct: 810 WLLRFPAAGEDRLKAFAVAQGVGPGQIIFTDVAAKNEHIRRSALADLFLDTPLCNAHTTG 869
Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGC-PELIARTHKEYQDIAIRLGTD 198
DVLW G P++TLP E +A+RVA S G E++ + +EY+D A+ L +
Sbjct: 870 TDVLWAGLPMITLPLEKMATRVAMSLCYAAGFGDEMVVKKMQEYEDRAVMLAEN 923
>gi|255562045|ref|XP_002522031.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
communis]
gi|223538835|gb|EEF40435.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
communis]
Length = 979
Score = 177 bits (449), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 82/158 (51%), Positives = 109/158 (68%), Gaps = 1/158 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP W N+LK VPNS LWLL+FPA GE +++ A
Sbjct: 760 KRSDYGLPEDKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRSYA 819
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+ +I+F++VA K+EH+RR LAD+ LDTPLCN HTT D+LW G P+VTLP E
Sbjct: 820 VSQGVQPEQIIFTDVAMKQEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMVTLPLEK 879
Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
+A+RVA S LAT E+I + KEY++ A+ L +R
Sbjct: 880 MATRVAGSLCLATGLGDEMIVSSMKEYEEKAVSLALNR 917
>gi|413936918|gb|AFW71469.1| hypothetical protein ZEAMMB73_864118 [Zea mays]
Length = 706
Score = 177 bits (449), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 80/155 (51%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP W N+LK VPNS LWLL+FPA GE ++A A
Sbjct: 487 KRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNILKRVPNSALWLLRFPAAGETRVRAYA 546
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ +I+F++VA K EH+RR LAD+ LDTPLCN HTT D+LW G P++TLP E
Sbjct: 547 AARGVRSDQIVFTDVAMKNEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 606
Query: 163 LASRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
+A+RVA S G E+I + K+Y+D A+ L
Sbjct: 607 MATRVAGSLCVATGLGEEMIVSSMKKYEDRAVELA 641
>gi|242065108|ref|XP_002453843.1| hypothetical protein SORBIDRAFT_04g019560 [Sorghum bicolor]
gi|241933674|gb|EES06819.1| hypothetical protein SORBIDRAFT_04g019560 [Sorghum bicolor]
Length = 1011
Score = 177 bits (448), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 81/155 (52%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP W N+LK VPNS LWLL+FPA GE ++A A
Sbjct: 792 KRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNILKRVPNSALWLLRFPAAGETRVRAHA 851
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ +I+F++VA K EH+RR LAD+ LDTPLCN HTT D+LW G P++TLP E
Sbjct: 852 VARGVRSDQIIFTDVAMKNEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 911
Query: 163 LASRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
+A+RVA S G E+I + KEY+D A+ L
Sbjct: 912 MATRVAGSLCVATGLGEEMIVSSMKEYEDRAVDLA 946
>gi|297833104|ref|XP_002884434.1| hypothetical protein ARALYDRAFT_896453 [Arabidopsis lyrata subsp.
lyrata]
gi|297330274|gb|EFH60693.1| hypothetical protein ARALYDRAFT_896453 [Arabidopsis lyrata subsp.
lyrata]
Length = 977
Score = 177 bits (448), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 83/158 (52%), Positives = 108/158 (68%), Gaps = 1/158 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP + W N+LK VPNS LWLL+FPA GE + A
Sbjct: 758 KRSDYGLPEDKFIFACFNQLYKMDPEIVNTWCNILKRVPNSALWLLRFPAAGEMRFRTYA 817
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ +I+F++VA K EH+RR LADV LDTPLCNGHTT DVLW G P++TLP E
Sbjct: 818 AAQGVQSDQIIFTDVAMKSEHIRRSVLADVILDTPLCNGHTTGTDVLWAGVPMITLPLEK 877
Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
+A+RVA S LAT E+I + +EY++ A+ L ++
Sbjct: 878 MATRVAGSLCLATGLGHEMIVNSLEEYEEKAVSLALNK 915
>gi|224144666|ref|XP_002325368.1| predicted protein [Populus trichocarpa]
gi|222862243|gb|EEE99749.1| predicted protein [Populus trichocarpa]
Length = 873
Score = 177 bits (448), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 82/158 (51%), Positives = 110/158 (69%), Gaps = 1/158 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP W N+LK VPNS LWLL+FPA GE ++A A
Sbjct: 654 KRLDYGLPEDKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRAYA 713
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G++ +I+F++VA K+EH+RR LAD+ LDTPLCN HTT D+LW G P+VT+P E
Sbjct: 714 VAQGVNPDQIIFTDVAMKQEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMVTMPLEK 773
Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
+A+RVA S LAT E+I + KEY++ A+ L ++
Sbjct: 774 MATRVAGSLCLATGLGDEMIVSSMKEYEERAVSLALNK 811
>gi|293334567|ref|NP_001167956.1| uncharacterized protein LOC100381672 [Zea mays]
gi|223945123|gb|ACN26645.1| unknown [Zea mays]
Length = 1009
Score = 176 bits (447), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 80/155 (51%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP W N+LK VPNS LWLL+FPA GE ++A A
Sbjct: 790 KRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNILKRVPNSALWLLRFPAAGETRVRAYA 849
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ +I+F++VA K EH+RR LAD+ LDTPLCN HTT D+LW G P++TLP E
Sbjct: 850 AARGVRSDQIVFTDVAMKNEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 909
Query: 163 LASRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
+A+RVA S G E+I + K+Y+D A+ L
Sbjct: 910 MATRVAGSLCVATGLGEEMIVSSMKKYEDRAVELA 944
>gi|328714601|ref|XP_001947755.2| PREDICTED: UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110 kDa subunit-like
[Acyrthosiphon pisum]
Length = 1076
Score = 176 bits (447), Expect = 4e-42, Method: Composition-based stats.
Identities = 79/156 (50%), Positives = 105/156 (67%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R + LPE+ +V+CNF++LYKIDP TL+MW+ +L VP S+LWLL V E N++ A
Sbjct: 876 RSTFNLPENVVVFCNFSKLYKIDPFTLRMWITILNNVPKSVLWLLHLNDVAENNLRTFAD 935
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
L D RI+F++ K +H+ R QLAD+ LDT L NGH +D LW G PV+TLPG+T
Sbjct: 936 VLNFDTSRIIFADFIPKYQHLNRIQLADIYLDTHLYNGHIACLDALWAGVPVITLPGDTY 995
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASRV SQL TLG + IA+ + Y DIAI LG ++
Sbjct: 996 ASRVTTSQLTTLGITDTIAQNEENYIDIAIELGLNK 1031
>gi|323452442|gb|EGB08316.1| putative O-linked GlcNAc transferase [Aureococcus anophagefferens]
Length = 1151
Score = 176 bits (446), Expect = 5e-42, Method: Composition-based stats.
Identities = 83/176 (47%), Positives = 114/176 (64%), Gaps = 1/176 (0%)
Query: 26 NTKTATGEEVPQSIVITSRQQYGLPEDA-IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSI 84
N + + + + R YG+PE A V+CNFNQ+YKIDP L +W +L V SI
Sbjct: 770 NDHAQSASYIFDAAALPKRADYGVPERASFVFCNFNQIYKIDPGILDVWCRILTRVDGSI 829
Query: 85 LWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTT 144
LWLL+FP GEANI+A A+ G+ RI F+ V+AK+EH++RG LAD+ LDTP CN HTT
Sbjct: 830 LWLLRFPPTGEANIRAEARKRGVSDDRIHFTAVSAKDEHIKRGVLADLFLDTPQCNAHTT 889
Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
D+LW G P+VT G +A+RVAAS L G +L+A Y+++A+ L TD++
Sbjct: 890 GCDILWGGCPMVTCLGGKMATRVAASLLGAAGLADLVAPDLGGYEELAVALATDKE 945
>gi|60599458|gb|AAX26274.1| unknown [Schistosoma japonicum]
Length = 180
Score = 176 bits (446), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 84/139 (60%), Positives = 106/139 (76%), Gaps = 2/139 (1%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI + ++NG+ + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFN
Sbjct: 34 MIRTNSPSCTINGVTIHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFN 93
Query: 61 QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVA 118
QLYK+DPST++MWV +LK VPNS+LWLL+FPA GEA A A +GL Q RILFSNVA
Sbjct: 94 QLYKVDPSTMRMWVEILKGVPNSVLWLLRFPAAGEAGALAAASEMGLQQVNRRILFSNVA 153
Query: 119 AKEEHVRRGQLADVCLDTP 137
KEEHVRRGQ++ L TP
Sbjct: 154 PKEEHVRRGQVSSYFLITP 172
>gi|357142312|ref|XP_003572530.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC-like [Brachypodium
distachyon]
Length = 1001
Score = 175 bits (444), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 81/157 (51%), Positives = 107/157 (68%), Gaps = 1/157 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP W N+LK VPNS LW+L+FPA GE ++A A
Sbjct: 782 KRSDYGLPEDKFIFACFNQLYKMDPEIFDRWCNILKRVPNSALWILRFPAAGETRVRAHA 841
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ +I+F++VA K EH+RR LAD+ LDTPLCN HTT D+LW G P++TLP E
Sbjct: 842 AARGVRSDQIIFTDVAMKTEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 901
Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTD 198
+A+RVA S LAT E+I + +EY+D A+ L +
Sbjct: 902 MATRVAGSLCLATGLGEEMIVSSMQEYEDRAVALAQN 938
>gi|89242353|gb|ABD64595.1| O-linked N-acetylglucosamine transferase, partial [Sus scrofa]
gi|89242355|gb|ABD64596.1| O-linked N-acetylglucosamine transferase, partial [Sus scrofa]
Length = 103
Score = 175 bits (444), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 83/103 (80%), Positives = 90/103 (87%)
Query: 77 LKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDT 136
LK VPNS+LWLL+FPAVGE NIQ AQ +GL Q+RI+FS VA KEEHVRRGQLADVCLDT
Sbjct: 1 LKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPKEEHVRRGQLADVCLDT 60
Query: 137 PLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
PLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL LGC E
Sbjct: 61 PLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLE 103
>gi|218190772|gb|EEC73199.1| hypothetical protein OsI_07263 [Oryza sativa Indica Group]
Length = 1004
Score = 174 bits (441), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 84/157 (53%), Positives = 108/157 (68%), Gaps = 3/157 (1%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP W N+LK VPNS LWLL+FPA GE ++A A
Sbjct: 783 KRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNILKRVPNSALWLLRFPAAGETRVRAHA 842
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ +I+F++VA K EH+RR LAD+ LDTPLCN HTT D+LW G P++TLP E
Sbjct: 843 AARGVRPDQIIFTDVAMKNEHIRRSSLADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 902
Query: 163 LASRVAASQ-LAT-LGCPELIAR-THKEYQDIAIRLG 196
+A+RVA S LAT LG +++R KEY+D A+ L
Sbjct: 903 MATRVAGSLCLATGLGEEMIVSRQVMKEYEDRAVDLA 939
>gi|222622881|gb|EEE57013.1| hypothetical protein OsJ_06783 [Oryza sativa Japonica Group]
Length = 1004
Score = 174 bits (441), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 84/157 (53%), Positives = 108/157 (68%), Gaps = 3/157 (1%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP W N+LK VPNS LWLL+FPA GE ++A A
Sbjct: 783 KRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNILKRVPNSALWLLRFPAAGETRVRAHA 842
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ +I+F++VA K EH+RR LAD+ LDTPLCN HTT D+LW G P++TLP E
Sbjct: 843 AARGVRPDQIIFTDVAMKNEHIRRSSLADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 902
Query: 163 LASRVAASQ-LAT-LGCPELIAR-THKEYQDIAIRLG 196
+A+RVA S LAT LG +++R KEY+D A+ L
Sbjct: 903 MATRVAGSLCLATGLGEEMIVSRQVMKEYEDRAVDLA 939
>gi|403344986|gb|EJY71848.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Oxytricha trifallax]
Length = 1211
Score = 173 bits (439), Expect = 3e-41, Method: Composition-based stats.
Identities = 75/160 (46%), Positives = 111/160 (69%), Gaps = 1/160 (0%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
+ R+QYGLP D ++ NFNQLYK+DP+T +W+ +L VPNSILW++++PA + N+
Sbjct: 915 SCREQYGLPNDKFIFANFNQLYKLDPNTFTVWMKILGRVPNSILWIVEYPAEAKENLLKE 974
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP-G 160
A A G+D RI+ + A K EH+ R LAD+ LD P+ NGHTT+ D+LW+G P++T P
Sbjct: 975 AVARGIDPSRIIMTTKAPKHEHIERCHLADLSLDNPITNGHTTTCDLLWSGLPIITYPFS 1034
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
E + SRVAAS L CPE+I ++++Y++ A++L T D
Sbjct: 1035 ENMPSRVAASICQALECPEMIVTSYQDYEEYAVKLATPTD 1074
>gi|15229253|ref|NP_187074.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC [Arabidopsis
thaliana]
gi|75336082|sp|Q9M8Y0.1|SEC_ARATH RecName: Full=Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC; AltName:
Full=Protein SECRET AGENT
gi|6721161|gb|AAF26789.1|AC016829_13 putative O-linked GlcNAc transferase [Arabidopsis thaliana]
gi|18139887|gb|AAL60196.1|AF441079_1 O-linked N-acetyl glucosamine transferase [Arabidopsis thaliana]
gi|20259324|gb|AAM13988.1| putative O-linked GlcNAc transferase [Arabidopsis thaliana]
gi|21436429|gb|AAM51415.1| putative O-linked GlcNAc transferase [Arabidopsis thaliana]
gi|110742062|dbj|BAE98963.1| O-linked GlcNAc transferase like protein [Arabidopsis thaliana]
gi|332640535|gb|AEE74056.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC [Arabidopsis
thaliana]
Length = 977
Score = 173 bits (439), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 82/158 (51%), Positives = 107/158 (67%), Gaps = 1/158 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP + W N+LK VPNS LWLL+FPA GE + A
Sbjct: 758 KRSDYGLPEDKFIFACFNQLYKMDPEIVNTWCNILKRVPNSALWLLRFPAAGEMRFRTYA 817
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ +I+F++VA K EH+RR LADV LDTPLCNGHTT DVLW G P++TLP E
Sbjct: 818 AAQGVQPDQIIFTDVAMKSEHIRRSVLADVILDTPLCNGHTTGTDVLWAGVPMITLPLEK 877
Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
+A+RVA S LAT +I + +EY++ A+ L ++
Sbjct: 878 MATRVAGSLCLATGLGHGMIVNSLEEYEEKAVSLALNK 915
>gi|357131539|ref|XP_003567394.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC-like [Brachypodium
distachyon]
Length = 980
Score = 172 bits (437), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 80/174 (45%), Positives = 110/174 (63%), Gaps = 1/174 (0%)
Query: 26 NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
N ++ + R YGLPED ++ FNQLYK+DP W N++K VPNS L
Sbjct: 744 NDYKKKNQDALDQVCPHKRADYGLPEDKFIFACFNQLYKMDPDIFGTWCNIVKRVPNSAL 803
Query: 86 WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
WLL+FP GE ++A A + G+ +I+F++VAAK EH+RR LAD+ LDTPLCNGHTT
Sbjct: 804 WLLRFPPAGEMRLRAYAISKGVRADQIIFTDVAAKSEHIRRSALADLFLDTPLCNGHTTG 863
Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGC-PELIARTHKEYQDIAIRLGTD 198
D+LW G P++TLP E +A+RVA S G E+I + KEY++ A+ L +
Sbjct: 864 TDILWAGLPMITLPLEKMATRVAGSLCRATGIGEEMIVSSLKEYEERAVFLAKN 917
>gi|168054571|ref|XP_001779704.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668902|gb|EDQ55500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 922
Score = 172 bits (436), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 80/155 (51%), Positives = 105/155 (67%), Gaps = 1/155 (0%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R YGLPED ++ FNQLYK+DP W +LK VPNS LWLL+FPA GE ++A A
Sbjct: 704 RSDYGLPEDKFLFACFNQLYKMDPEIFSTWCRILKRVPNSALWLLRFPAAGETRLKAFAI 763
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A G+ +I+F++VAAK EH+RR LAD+ LD+PLCN HTT DVLW G P+VTLP E +
Sbjct: 764 AQGVRPDQIIFTDVAAKNEHIRRSGLADLFLDSPLCNAHTTGTDVLWAGLPMVTLPLEKM 823
Query: 164 ASRVAASQLATLGC-PELIARTHKEYQDIAIRLGT 197
A+RVA S G E++ + +EY++ A+ L T
Sbjct: 824 ATRVAMSLCYAAGFGEEMVVSSMQEYEERAVMLAT 858
>gi|168044085|ref|XP_001774513.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674225|gb|EDQ60737.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 932
Score = 170 bits (430), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 78/160 (48%), Positives = 105/160 (65%), Gaps = 1/160 (0%)
Query: 40 VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ 99
+ R Y LPED ++ FNQLYK+DP W +LK VPNS LWLL+FPA GE ++
Sbjct: 710 ISMKRSDYSLPEDKFLFACFNQLYKMDPEIFSTWCRILKRVPNSALWLLRFPAAGETRLK 769
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A A A G+ + +F++VAAK EH+RR LAD+ LD+PLCN HTT DVLW G P+VTLP
Sbjct: 770 AFAIAQGVRPEQFIFTDVAAKNEHIRRSALADLFLDSPLCNAHTTGTDVLWAGLPMVTLP 829
Query: 160 GETLASRVAASQLATLGC-PELIARTHKEYQDIAIRLGTD 198
E +A+RVA S G E++ + +EY++ A+ L T+
Sbjct: 830 LEKMATRVAMSLCYAAGFGEEMVVSSMQEYEERAVMLATN 869
>gi|449449593|ref|XP_004142549.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC-like [Cucumis
sativus]
Length = 975
Score = 167 bits (422), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 78/152 (51%), Positives = 104/152 (68%), Gaps = 1/152 (0%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+ ++ FNQLYK+DP W N+LK VPNS LWLL+FPA GE ++A A A G+
Sbjct: 762 LPQGKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRAYAVAQGVQ 821
Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
+I+F++VA K EH+RR LAD+ LDTPLCN HTT D+LW G P++TLP E +A+RVA
Sbjct: 822 PEQIIFTDVAMKNEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMITLPLEKMATRVA 881
Query: 169 ASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
S LAT E+I + KEY++ A+ L +R
Sbjct: 882 GSLCLATGLGDEMIVSSMKEYEEKAVTLALNR 913
>gi|323449294|gb|EGB05183.1| hypothetical protein AURANDRAFT_54803 [Aureococcus anophagefferens]
Length = 897
Score = 166 bits (419), Expect = 7e-39, Method: Composition-based stats.
Identities = 73/157 (46%), Positives = 105/157 (66%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R +YGLPEDA ++ F QL K+DP +W+++LK VPNSIL L+K P +++ A+
Sbjct: 687 RSKYGLPEDAFIFAYFGQLNKVDPIAYGVWMSILKRVPNSILLLIKSPESAVVHLKKAAR 746
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
L + + R++F + A K EH+ RG LAD+ LDTP CN +MD LWTGTPV+TL G ++
Sbjct: 747 DLQVHEQRLIFMDSAPKREHILRGVLADLLLDTPACNAFDATMDALWTGTPVITLLGNSI 806
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A+R +AS +GCP+L+ T EY+D+A+ L D D
Sbjct: 807 ATRASASLCTAVGCPDLVTSTLDEYEDLAVSLALDAD 843
>gi|349605516|gb|AEQ00726.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit-like protein, partial [Equus caballus]
Length = 158
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 76/97 (78%), Positives = 86/97 (88%)
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
AQ +GL Q+RI+FS VA KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGE
Sbjct: 2 AQNMGLPQNRIIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGE 61
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
TLASRVAASQL LGC ELIA+ +EY+DIA++LGTD
Sbjct: 62 TLASRVAASQLTCLGCLELIAKNRQEYEDIAVKLGTD 98
>gi|449532483|ref|XP_004173210.1| PREDICTED: LOW QUALITY PROTEIN: probable
UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC-like, partial
[Cucumis sativus]
Length = 434
Score = 163 bits (413), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 71/127 (55%), Positives = 90/127 (70%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R YGLPE ++ FNQLYK+DP W N+LK VPNS LWLL+FPA GE ++A A
Sbjct: 294 RSDYGLPEGKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRAYAV 353
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A G+ +I+F++VA K EH+RR LAD+ LDTPLCN HTT D+LW G P++TLP E +
Sbjct: 354 AQGVQPEQIIFTDVAMKNEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMITLPLEKM 413
Query: 164 ASRVAAS 170
A+RVA S
Sbjct: 414 ATRVAGS 420
>gi|425442300|ref|ZP_18822552.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
9717]
gi|389716770|emb|CCH99039.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
9717]
Length = 1039
Score = 162 bits (411), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 74/158 (46%), Positives = 99/158 (62%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R YGLPE V+C FN YKI+P +W+ +L VP S+LWL A EAN++ A
Sbjct: 830 TRSHYGLPESGFVFCCFNHTYKIEPQIFTVWMEILANVPGSVLWLFSRVAEAEANLRREA 889
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G+D R++F+++ K EH+ R QLAD+ LDT N HTT D LW G P++T PG T
Sbjct: 890 QARGIDGDRLIFAHLQPKSEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIITCPGAT 949
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRV AS L +G PELI + +EY+++AI L D
Sbjct: 950 FPSRVGASLLTAMGLPELITKNLEEYKNLAINLAKSPD 987
>gi|400754587|ref|YP_006562955.1| hypothetical protein PGA2_c17130 [Phaeobacter gallaeciensis 2.10]
gi|398653740|gb|AFO87710.1| hypothetical protein PGA2_c17130 [Phaeobacter gallaeciensis 2.10]
Length = 585
Score = 162 bits (410), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 83/194 (42%), Positives = 117/194 (60%), Gaps = 8/194 (4%)
Query: 4 SGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLY 63
G+ S + + N N + A EE P +R GLPEDA+V+C+FN
Sbjct: 342 EGRRHFSEKILYMPNCYQVN--DNSRAHPEEKP------TRADMGLPEDAVVFCSFNNHN 393
Query: 64 KIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEH 123
K+ P+ +W+++LK V NS+LW L + ANI A+A G+ RI+F+ + +H
Sbjct: 394 KVTPAEFDIWMDLLKDVDNSVLWFLAAADIVRANILKEAEARGVPADRIVFAGRCSTPDH 453
Query: 124 VRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIAR 183
V R LAD+ LDT CN HTT+ ++LW+G PVVT PGE A+RVAAS + +GCPELIA
Sbjct: 454 VARLPLADIFLDTFACNAHTTASEMLWSGVPVVTKPGEQFAARVAASIVTAVGCPELIAE 513
Query: 184 THKEYQDIAIRLGT 197
T +EY+ +A+RL T
Sbjct: 514 TDEEYRALALRLAT 527
>gi|26338580|dbj|BAC32961.1| unnamed protein product [Mus musculus]
Length = 153
Score = 161 bits (407), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 74/93 (79%), Positives = 84/93 (90%)
Query: 106 GLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLAS 165
GL Q+RI+FS VA KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLAS
Sbjct: 1 GLPQNRIIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLAS 60
Query: 166 RVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
RVAASQL LGC ELIA++ +EY+DIA++LGTD
Sbjct: 61 RVAASQLTCLGCLELIAKSRQEYEDIAVKLGTD 93
>gi|166368783|ref|YP_001661056.1| hypothetical protein MAE_60420 [Microcystis aeruginosa NIES-843]
gi|166091156|dbj|BAG05864.1| TPR repeat protein [Microcystis aeruginosa NIES-843]
Length = 631
Score = 161 bits (407), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 73/158 (46%), Positives = 99/158 (62%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R YGLPE V+C FN YKI+P +W+ +L VP S+LWL A EAN++ A
Sbjct: 422 TRSHYGLPESGFVFCCFNHTYKIEPQIFTVWMQILANVPGSVLWLFSRVAEAEANLRREA 481
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G++ R++F+++ K EH+ R QLAD+ LDT N HTT D LW G P++T PG T
Sbjct: 482 QARGIEGDRLIFAHLQPKSEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIITCPGAT 541
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRV AS L +G PELI + +EY+++AI L D
Sbjct: 542 FPSRVGASLLTAIGLPELITKNLEEYKNLAINLAKSPD 579
>gi|422303914|ref|ZP_16391264.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC (fragment)
[Microcystis aeruginosa PCC 9806]
gi|389791071|emb|CCI13110.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC (fragment)
[Microcystis aeruginosa PCC 9806]
Length = 288
Score = 160 bits (406), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 73/158 (46%), Positives = 99/158 (62%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R YGLPE V+C FN YKI+P +W+ +L VP S+LWL A EAN++ A
Sbjct: 79 TRSHYGLPESGFVFCCFNHTYKIEPQIFTVWMEILDNVPGSVLWLFSRVAEAEANLRREA 138
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G++ R++F+++ K EH+ R QLAD+ LDT N HTT D LW G P++T PG T
Sbjct: 139 QARGIEGDRLIFAHLQPKPEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIITCPGTT 198
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRV AS L +G PELI + +EY+++AI L D
Sbjct: 199 FPSRVGASLLTAIGLPELITKNLEEYKNLAINLAKSPD 236
>gi|399992912|ref|YP_006573152.1| hypothetical protein PGA1_c17330 [Phaeobacter gallaeciensis DSM
17395 = CIP 105210]
gi|398657467|gb|AFO91433.1| hypothetical protein PGA1_c17330 [Phaeobacter gallaeciensis DSM
17395 = CIP 105210]
Length = 585
Score = 160 bits (406), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 83/194 (42%), Positives = 117/194 (60%), Gaps = 8/194 (4%)
Query: 4 SGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLY 63
G+ S + + N N + A EE P +R GLPE+A+V+C+FN
Sbjct: 342 EGRRHFSEKILYMPNCYQVN--DNSRAHPEEKP------TRADMGLPENAVVFCSFNNHN 393
Query: 64 KIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEH 123
K+ P+ +W+++LK V NS+LW L V ANI A+A G+ RI+F+ + +H
Sbjct: 394 KVTPAEFDIWMDLLKDVDNSVLWFLAAADVVRANILKEAEARGVPADRIVFAGRCSTPDH 453
Query: 124 VRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIAR 183
V R LAD+ LDT CN HTT+ ++LW+G PVVT PGE A+RVAAS + +GCPELIA
Sbjct: 454 VARLPLADIFLDTFACNAHTTASEMLWSGVPVVTKPGEQFAARVAASIVTAVGCPELIAE 513
Query: 184 THKEYQDIAIRLGT 197
T +EY+ +A+RL T
Sbjct: 514 TDEEYRALALRLAT 527
>gi|425449527|ref|ZP_18829366.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
7941]
gi|389763752|emb|CCI09797.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
7941]
Length = 1038
Score = 160 bits (404), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 72/158 (45%), Positives = 99/158 (62%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R YGLPE V+C FN YKI+P +W+ +L VP S+LWL A EAN++ A
Sbjct: 829 TRSHYGLPESGFVFCCFNHTYKIEPQVFTVWMQILANVPGSVLWLFSRVAEAEANLRREA 888
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G++ R++F+++ K EH+ R QLAD+ LDT N HTT D LW G P++T PG T
Sbjct: 889 KARGIEGDRLIFAHLEPKSEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIITCPGAT 948
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRV AS L +G PELI + +EY+++AI L D
Sbjct: 949 FPSRVGASLLTAIGLPELITKNLEEYKNLAINLAKSPD 986
>gi|440755140|ref|ZP_20934342.1| TPR domain protein [Microcystis aeruginosa TAIHU98]
gi|440175346|gb|ELP54715.1| TPR domain protein [Microcystis aeruginosa TAIHU98]
Length = 388
Score = 159 bits (403), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 72/158 (45%), Positives = 99/158 (62%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R YGLPE V+C FN YKI+P +W+ +L VP S+LWL A EAN++ A
Sbjct: 179 TRSHYGLPESGFVFCCFNHTYKIEPQIFTVWMEILANVPGSVLWLFSRVAEAEANLRREA 238
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G++ R++F+++ K EH+ R QLAD+ LDT N HTT D LW G P++T PG T
Sbjct: 239 KARGIEGDRLIFAHLEPKSEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIITCPGAT 298
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRV AS L +G PELI + +EY+++AI L D
Sbjct: 299 FPSRVGASLLTAIGLPELITKNLEEYKNLAINLAKSPD 336
>gi|390440632|ref|ZP_10228858.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC (fragment)
[Microcystis sp. T1-4]
gi|389836094|emb|CCI32984.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC (fragment)
[Microcystis sp. T1-4]
Length = 450
Score = 159 bits (403), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 73/158 (46%), Positives = 99/158 (62%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R YGLPE V+C FN YKI+P +W+ +L VP S+LWL A EAN++ A
Sbjct: 241 TRSHYGLPESGFVFCCFNHTYKIEPQIFTVWMEILANVPGSVLWLFSRVAEAEANLRREA 300
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G++ R++F+++ K EH+ R QLAD+ LDT N HTT D LW G P++T PG T
Sbjct: 301 QARGIEGDRLIFAHLQPKLEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIITCPGAT 360
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRV AS L +G PELI + +EY+++AI L D
Sbjct: 361 FPSRVGASLLTAIGLPELITKNLEEYKNLAINLAKSPD 398
>gi|425471484|ref|ZP_18850344.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC (fragment)
[Microcystis aeruginosa PCC 9701]
gi|389882649|emb|CCI36920.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SEC (fragment)
[Microcystis aeruginosa PCC 9701]
Length = 404
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 73/158 (46%), Positives = 99/158 (62%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R QYGLPE V+C FN YKI+P +W+ +L VP S+LWL A EAN++ A
Sbjct: 195 TRSQYGLPESGFVFCCFNHTYKIEPQIFTVWMQILANVPGSVLWLFSRVAEAEANLRREA 254
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G++ R++F+++ K EH+ R QLAD+ LDT N HTT D LW G P++T G T
Sbjct: 255 QARGIEGDRLIFAHLQPKPEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIITCRGAT 314
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRV AS L +G PELI + +EY+++AI L D
Sbjct: 315 FPSRVGASLLTAIGLPELITKNLEEYKNLAINLAKSPD 352
>gi|425437323|ref|ZP_18817742.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
9432]
gi|389677704|emb|CCH93372.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
9432]
Length = 1059
Score = 156 bits (394), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 72/158 (45%), Positives = 99/158 (62%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R QYGLPE V+C FN YKI+P +W+ +L VP S+LWL A EAN++ A
Sbjct: 850 TRSQYGLPESGFVFCCFNHTYKIEPQIFTVWMEILANVPGSVLWLFSRVAEAEANLRREA 909
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G++ R++F+++ K EH+ R QLAD+ LDT N HTT D LW G P++T ET
Sbjct: 910 KARGIEGDRLIFAHLEPKSEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIMTCLEET 969
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRV AS L +G PELI + +EY+++AI L D
Sbjct: 970 FPSRVGASLLTAIGLPELITKNLEEYKNLAINLAKSPD 1007
>gi|425448104|ref|ZP_18828084.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
9443]
gi|389731183|emb|CCI04721.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
9443]
Length = 1040
Score = 156 bits (394), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 71/158 (44%), Positives = 99/158 (62%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R QYGLP+ V+C FN YKI+P +W+ +L VP S+LWL A EAN++ A
Sbjct: 831 TRSQYGLPKSGFVFCCFNHTYKIEPQIFTVWMEILANVPGSVLWLFSRVAEAEANLRREA 890
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G++ R++F+++ K EH+ R QLAD+ +DT N HTT D LW G P++T GET
Sbjct: 891 KARGIEGDRLIFAHLEPKSEHLARHQLADLFIDTLYYNAHTTGSDALWAGLPIITCLGET 950
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRV AS L G PELI + +EY+++AI L D
Sbjct: 951 FPSRVGASLLTATGLPELITKNLEEYKNLAINLAKSPD 988
>gi|425453499|ref|ZP_18833256.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
9807]
gi|389802856|emb|CCI18142.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
9807]
Length = 1039
Score = 155 bits (393), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 70/158 (44%), Positives = 99/158 (62%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R QYGLP+ V+C FN YKI+P +W+ +L VP S+LWL A EAN++ A
Sbjct: 830 TRSQYGLPKSGFVFCCFNHTYKIEPQIFTVWMEILANVPGSVLWLFSRVAEAEANLRREA 889
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G++ R++F+++ K EH+ R QLAD+ +DT N HTT D LW G P++T GET
Sbjct: 890 KARGIEGDRLIFAHLEPKSEHLARHQLADLFIDTLYYNAHTTGSDALWAGLPIITCLGET 949
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRV S L +G PELI + +EY+++AI L D
Sbjct: 950 FPSRVGGSLLTAIGLPELITKNLEEYKNLAINLAKSPD 987
>gi|320033304|gb|EFW15252.1| UDP-N-acetylglucosaminyltransferase [Coccidioides posadasii str.
Silveira]
Length = 279
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 74/163 (45%), Positives = 109/163 (66%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D I++ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ +A A +
Sbjct: 55 LPDDTIIFGNFNQLYKIEPTTFRSWLRILARIPNAILWLLRFPDLGEQNLRQSAVAWAGE 114
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTTS D+LW+GTP++TLP +
Sbjct: 115 GTASRIIFTDVAPKHAHISRARICDLFLDTPECNAHTTSADILWSGTPLITLPRYKYKMC 174
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
SR+A+S L + ELI + +EY++ AIRLG D
Sbjct: 175 SRMASSILTSALPQNAAGREAAAELITSSEEEYENNAIRLGLD 217
>gi|119189521|ref|XP_001245367.1| hypothetical protein CIMG_04808 [Coccidioides immitis RS]
Length = 1610
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 74/163 (45%), Positives = 109/163 (66%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D I++ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ +A A +
Sbjct: 1386 LPDDTIIFGNFNQLYKIEPTTFRSWLRILARIPNAILWLLRFPDLGEQNLRQSAVAWAGE 1445
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTTS D+LW+GTP++TLP +
Sbjct: 1446 GTASRIIFTDVAPKHAHISRARICDLFLDTPECNAHTTSADILWSGTPLITLPRYKYKMC 1505
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
SR+A+S L + ELI + +EY++ AIRLG D
Sbjct: 1506 SRMASSILTSALPQNAAGREAAAELITSSEEEYENNAIRLGLD 1548
>gi|254475400|ref|ZP_05088786.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110kDa subunit, putative [Ruegeria sp. R11]
gi|214029643|gb|EEB70478.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110kDa subunit, putative [Ruegeria sp. R11]
Length = 581
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 69/161 (42%), Positives = 106/161 (65%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
P+ +R GLP+D +V+C FN K+ P+ +W+++LK V S+LW L
Sbjct: 366 PRPEQTPTRADMGLPQDGVVFCAFNNHNKVSPAEFDIWMDLLKQVDGSVLWFLAGAEPLR 425
Query: 96 ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
ANI+ A+A G+ RI+F++ + +H+ R LAD+ LDT CN HTT+ +++W+G PV
Sbjct: 426 ANIRKEAEARGVSAERIVFADRCSTPDHIARLPLADIFLDTFACNAHTTASELMWSGVPV 485
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
VT+PG+ A+RVAAS ++ + CPELIA++ EY+DIA+RL
Sbjct: 486 VTMPGQQFAARVAASIVSAVNCPELIAQSVDEYRDIALRLA 526
>gi|303323021|ref|XP_003071502.1| TPR Domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240111204|gb|EER29357.1| TPR Domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
Length = 1676
Score = 154 bits (388), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 74/163 (45%), Positives = 109/163 (66%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D I++ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ +A A +
Sbjct: 1452 LPDDTIIFGNFNQLYKIEPTTFRSWLRILARIPNAILWLLRFPDLGEQNLRQSAVAWAGE 1511
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTTS D+LW+GTP++TLP +
Sbjct: 1512 GTASRIIFTDVAPKHAHISRARICDLFLDTPECNAHTTSADILWSGTPLITLPRYKYKMC 1571
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
SR+A+S L + ELI + +EY++ AIRLG D
Sbjct: 1572 SRMASSILTSALPQNAAGREAAAELITSSEEEYENNAIRLGLD 1614
>gi|392868271|gb|EAS34034.2| UDP-N-acetylglucosaminyltransferase [Coccidioides immitis RS]
Length = 1676
Score = 154 bits (388), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 74/163 (45%), Positives = 109/163 (66%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D I++ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ +A A +
Sbjct: 1452 LPDDTIIFGNFNQLYKIEPTTFRSWLRILARIPNAILWLLRFPDLGEQNLRQSAVAWAGE 1511
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTTS D+LW+GTP++TLP +
Sbjct: 1512 GTASRIIFTDVAPKHAHISRARICDLFLDTPECNAHTTSADILWSGTPLITLPRYKYKMC 1571
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
SR+A+S L + ELI + +EY++ AIRLG D
Sbjct: 1572 SRMASSILTSALPQNAAGREAAAELITSSEEEYENNAIRLGLD 1614
>gi|434406348|ref|YP_007149233.1| TIGR03032 family protein [Cylindrospermum stagnale PCC 7417]
gi|428260603|gb|AFZ26553.1| TIGR03032 family protein [Cylindrospermum stagnale PCC 7417]
Length = 1083
Score = 154 bits (388), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 100/155 (64%), Gaps = 2/155 (1%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLL-KFPAVGEANIQAT 101
+R QYGLPE V+C FN YKI+P W+ +L VP +LWLL +FP V + N++
Sbjct: 871 TRSQYGLPESGFVFCCFNNNYKIEPLIFDAWMRILADVPEGVLWLLPRFP-VAKENLRRE 929
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A+ G+ + R++F++V K EH+ R +LAD+ LDT N HT++ D LW G PV+T PG
Sbjct: 930 AEMRGISEERLIFADVEPKPEHLARHRLADLFLDTLYYNAHTSATDALWAGLPVITCPGT 989
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
T ASRVAAS L +G PEL+ +EY+ +AI L
Sbjct: 990 TFASRVAASLLTAVGLPELVTANLQEYEQLAINLA 1024
>gi|169605775|ref|XP_001796308.1| hypothetical protein SNOG_05914 [Phaeosphaeria nodorum SN15]
gi|160706839|gb|EAT86978.2| hypothetical protein SNOG_05914 [Phaeosphaeria nodorum SN15]
Length = 1609
Score = 152 bits (384), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 74/163 (45%), Positives = 111/163 (68%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP++AI+ NFNQLYKI+P+T + W+ +L+ +PN+ILWLL+FP +GE N++ TA
Sbjct: 1377 LPDNAIILGNFNQLYKIEPTTFRTWLRILERLPNAILWLLRFPDLGETNLKQTAYMWAGA 1436
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ +RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1437 EVANRIIFTDVAQKSQHISRARVCDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1496
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGTD 198
SR+AAS L ELIA ++Y++ A+RLG+D
Sbjct: 1497 SRMAASILKGALPKSSEGVQAAKELIAANDEDYEEKAVRLGSD 1539
>gi|440474923|gb|ELQ43638.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
[Magnaporthe oryzae Y34]
gi|440489796|gb|ELQ69415.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
[Magnaporthe oryzae P131]
Length = 1524
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/160 (45%), Positives = 108/160 (67%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+DAI+ NFNQLYKIDP+T + W+ +L VP ++LWLL+FP +GE N++ TA+A +
Sbjct: 1297 LPDDAIILGNFNQLYKIDPTTFRTWLRILSQVPKAVLWLLRFPELGECNLRNTARAWAGE 1356
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ RI+F++VA K++H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP +
Sbjct: 1357 KVASRIIFTDVAPKQQHISRARVCDLFLDTPECNAHTTAADILWSSTPLLTLPRYEYKMC 1416
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L ELIA K+Y++ A+RL
Sbjct: 1417 SRMAASILKGALPKSEEGARAATELIASDEKDYEESAVRL 1456
>gi|389623941|ref|XP_003709624.1| UDP-N-acetylglucosaminyltransferase [Magnaporthe oryzae 70-15]
gi|351649153|gb|EHA57012.1| UDP-N-acetylglucosaminyltransferase [Magnaporthe oryzae 70-15]
Length = 1541
Score = 151 bits (382), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/160 (45%), Positives = 108/160 (67%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+DAI+ NFNQLYKIDP+T + W+ +L VP ++LWLL+FP +GE N++ TA+A +
Sbjct: 1314 LPDDAIILGNFNQLYKIDPTTFRTWLRILSQVPKAVLWLLRFPELGECNLRNTARAWAGE 1373
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ RI+F++VA K++H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP +
Sbjct: 1374 KVASRIIFTDVAPKQQHISRARVCDLFLDTPECNAHTTAADILWSSTPLLTLPRYEYKMC 1433
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L ELIA K+Y++ A+RL
Sbjct: 1434 SRMAASILKGALPKSEEGARAATELIASDEKDYEESAVRL 1473
>gi|367036987|ref|XP_003648874.1| glycosyltransferase family 41 protein [Thielavia terrestris NRRL
8126]
gi|346996135|gb|AEO62538.1| glycosyltransferase family 41 protein [Thielavia terrestris NRRL
8126]
Length = 1601
Score = 151 bits (381), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 74/160 (46%), Positives = 106/160 (66%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+DAI+ NFNQLYKIDP+T + W+ +L A P + LWLL+FP +GE +++ TA+ +
Sbjct: 1354 LPDDAIILGNFNQLYKIDPTTFRTWLRILAAAPKAYLWLLRFPELGETHLRRTARDWSGE 1413
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
RI+F++VA K++H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP +
Sbjct: 1414 GVARRIIFTDVAPKQQHISRARVCDLFLDTPECNAHTTAADILWSSTPLLTLPRYEYKMC 1473
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L ELIAR KEY++ AIRL
Sbjct: 1474 SRMAASILKGALPKGEEGEQAARELIARDEKEYEEFAIRL 1513
>gi|429864035|gb|ELA38419.1| udp-n-acetylglucosaminyltransferase [Colletotrichum gloeosporioides
Nara gc5]
Length = 1093
Score = 150 bits (380), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/160 (47%), Positives = 109/160 (68%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+DAI+ NFNQLYKIDP+T + W+ +L VP +ILWLL+FP +GE N++ TA+A D
Sbjct: 867 LPDDAIILGNFNQLYKIDPTTFRTWLRILSQVPKAILWLLRFPELGETNLRRTAKAWAGD 926
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ +P++TLP +
Sbjct: 927 EAPSRIIFTDVAPKNQHISRARVCDLFLDTPECNAHTTAADVLWSSSPLLTLPRYPYKMC 986
Query: 165 SRVAASQL--ATLGCP-------ELIARTHKEYQDIAIRL 195
SR+AAS L A P +LIA + +EY+ AI+L
Sbjct: 987 SRMAASILKGALPRSPSGLEAANDLIASSEEEYEAFAIKL 1026
>gi|384487344|gb|EIE79524.1| hypothetical protein RO3G_04229 [Rhizopus delemar RA 99-880]
Length = 456
Score = 150 bits (380), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 69/154 (44%), Positives = 104/154 (67%), Gaps = 5/154 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D I++ NFNQLYK++P T ++W+ +L+ +PNSILWLL+FP GE ++++ A
Sbjct: 211 LPDDVIIFANFNQLYKLEPGTFRLWLRILERIPNSILWLLRFPPAGEHHLKSCAAEWAGP 270
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
Q R++F++VA K H+ RG++AD+ LDTP CN HTT+ D+LW+GTP+VT P +
Sbjct: 271 QVAQRVVFTDVAPKHIHIHRGRIADIFLDTPECNAHTTAADILWSGTPIVTWPKYLHKMC 330
Query: 165 SRVAAS-QLATLGCPELIARTHKEYQDIAIRLGT 197
SRV AS +AT E+I ++Y+ + L T
Sbjct: 331 SRVGASIAMATGFGKEMIVMNEQQYETRTVELAT 364
>gi|396488362|ref|XP_003842860.1| hypothetical protein LEMA_P086200.1 [Leptosphaeria maculans JN3]
gi|312219437|emb|CBX99381.1| hypothetical protein LEMA_P086200.1 [Leptosphaeria maculans JN3]
Length = 1999
Score = 150 bits (380), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/163 (46%), Positives = 108/163 (66%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA--QALG 106
LPEDAI+ NFNQLYKI+P+T + W+ +L+ +P +ILWLL+FP +GE N++ TA A
Sbjct: 1766 LPEDAIILANFNQLYKIEPTTFRTWLRILERLPRAILWLLRFPDLGETNLKNTALKWAGP 1825
Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1826 AVASRIMFTDVAQKSQHISRARVCDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1885
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGTD 198
SR+AAS L ELIA + ++Y+D A+RL D
Sbjct: 1886 SRMAASILKGALPKNDAGVRAAMELIASSDEDYEDKAVRLARD 1928
>gi|451849276|gb|EMD62580.1| glycosyltransferase family 41 protein [Cochliobolus sativus ND90Pr]
Length = 1644
Score = 150 bits (379), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 73/163 (44%), Positives = 109/163 (66%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L ++AI+ NFNQLYKI+P+T + W+ +L+ +PN++LWLL+FP +GE N++ TA
Sbjct: 1421 LSDNAIILANFNQLYKIEPTTFRTWLRILEKIPNAVLWLLRFPDLGETNLKQTALMWAGA 1480
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1481 EVASRIMFTDVAQKHQHISRARVCDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKHKMC 1540
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGTD 198
SR+AAS L ELIA + +EY+D+A+RL D
Sbjct: 1541 SRMAASILKGALPKTPAGAQAAKELIASSDEEYEDMAVRLTRD 1583
>gi|403161889|ref|XP_003322196.2| hypothetical protein PGTG_03733 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375171971|gb|EFP77777.2| hypothetical protein PGTG_03733 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1596
Score = 150 bits (379), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 74/155 (47%), Positives = 108/155 (69%), Gaps = 7/155 (4%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L E +++ NFNQLYKIDP + W+++L A+PNSILWLL+FPA GE +++ TA+
Sbjct: 1312 LDEQTVIFANFNQLYKIDPLIFKYWLDILAAIPNSILWLLRFPAPGEPHLKITAERWAGR 1371
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET---- 162
+ RI F++VA K H++RG++AD+ LD+ CN HTT+ D+LW+GTP++TLP T
Sbjct: 1372 EVAGRIRFTDVAPKPIHIKRGRIADLFLDSTECNAHTTAADILWSGTPILTLPRPTHAHK 1431
Query: 163 LASRVAAS-QLATLGCPELIARTHKEYQDIAIRLG 196
+ SRVAAS LAT ++IA + +Y++ AIR G
Sbjct: 1432 MCSRVAASIALATGYGSQMIATSMDDYKERAIRFG 1466
>gi|156037528|ref|XP_001586491.1| hypothetical protein SS1G_12478 [Sclerotinia sclerotiorum 1980]
gi|154697886|gb|EDN97624.1| hypothetical protein SS1G_12478 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1584
Score = 150 bits (378), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 74/160 (46%), Positives = 106/160 (66%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L VP +ILWLL+FP +GE+N++ATA+ +
Sbjct: 1358 LSDDAIILGNFNQLYKIEPTTFRTWLRILDKVPRAILWLLRFPDLGESNLKATARDWAGE 1417
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1418 SVASRIWFTDVAPKHQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYKYKMC 1477
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L ELIAR ++Y++ AIRL
Sbjct: 1478 SRMAASILKGALPRSEEGKRAASELIARDEEQYEEFAIRL 1517
>gi|452001265|gb|EMD93725.1| glycosyltransferase family 41 protein [Cochliobolus heterostrophus
C5]
Length = 1544
Score = 149 bits (377), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 73/163 (44%), Positives = 109/163 (66%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L ++AI+ NFNQLYKI+P+T + W+ +L+ +PN++LWLL+FP +GE N++ TA
Sbjct: 1321 LSDNAIILANFNQLYKIEPTTFRTWLRILEKIPNAVLWLLRFPDLGETNLKQTALMWAGA 1380
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1381 EVASRIMFTDVAQKHQHISRARVCDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1440
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGTD 198
SR+AAS L ELIA + +EY+D+A+RL D
Sbjct: 1441 SRMAASILKGALPKTPAGAQAAKELIASSDEEYEDMAVRLTRD 1483
>gi|154245494|ref|YP_001416452.1| hypothetical protein Xaut_1547 [Xanthobacter autotrophicus Py2]
gi|154159579|gb|ABS66795.1| conserved hypothetical protein [Xanthobacter autotrophicus Py2]
Length = 459
Score = 149 bits (377), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 72/158 (45%), Positives = 96/158 (60%), Gaps = 1/158 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR++ GLP D V+C FNQ YKI P +W +L VP+S+LWLL P N+++ A
Sbjct: 251 SRREAGLPADGFVFCCFNQPYKISPQMFDIWCRLLACVPDSVLWLLDTPHAA-GNLRSEA 309
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+D R++F + EH+RR QLAD+ LDT N HTT+ D LW G P VT PG+T
Sbjct: 310 IARGIDARRLVFGANMGQAEHLRRLQLADLMLDTSPFNAHTTASDALWVGLPFVTCPGDT 369
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRVA S L +G P+L+ + Y D+A L TD +
Sbjct: 370 FPSRVAGSILHAIGLPDLVTDSPDAYFDLAYTLATDEE 407
>gi|254464640|ref|ZP_05078051.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110kDa subunit [Rhodobacterales bacterium Y4I]
gi|206685548|gb|EDZ46030.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110kDa subunit [Rhodobacterales bacterium Y4I]
Length = 287
Score = 149 bits (376), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 77/177 (43%), Positives = 102/177 (57%), Gaps = 6/177 (3%)
Query: 24 QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
Q N T VP SR + GLPEDA V+C+FN YK+ P +W+ +LK VP+S
Sbjct: 66 QPNDNKRTASRVP------SRAEAGLPEDAFVFCSFNSPYKVSPEEFDIWMKLLKLVPDS 119
Query: 84 ILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
+LW A +ANI A+ G+ RI+F+ A +E+H+ R QLADV LDT N HT
Sbjct: 120 VLWFYAPRAETKANILKEAKKRGVAADRIVFAGFAGQEDHLARLQLADVFLDTFAVNAHT 179
Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
T+ D LW G PVVT G+ A+R A S L G EL A T + YQ +A++L + D
Sbjct: 180 TASDALWAGVPVVTKTGKQFAARFATSILNAAGLRELAASTPQRYQALALKLARNPD 236
>gi|310790631|gb|EFQ26164.1| tetratricopeptide [Glomerella graminicola M1.001]
Length = 1607
Score = 149 bits (376), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 76/160 (47%), Positives = 108/160 (67%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D+I+ NFNQLYKIDP+T + W+ +L V N+ILWLL+FP +GE N++ TA+A
Sbjct: 1381 LPDDSIIMGNFNQLYKIDPTTFRTWLRILSKVDNAILWLLRFPELGETNLRRTAEAWAGQ 1440
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1441 EVAKRIIFTDVAPKNQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1500
Query: 165 SRVAASQL--ATLGCPE-------LIARTHKEYQDIAIRL 195
SR+AAS L A PE LIA + EY++ A+RL
Sbjct: 1501 SRMAASILRGALPHGPEGVEAAEDLIAGSEDEYEEFAVRL 1540
>gi|258565995|ref|XP_002583742.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237907443|gb|EEP81844.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1554
Score = 149 bits (375), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 72/163 (44%), Positives = 107/163 (65%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP++ I+ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ +A A +
Sbjct: 1330 LPDNTIILGNFNQLYKIEPTTFRSWLRILARIPNAILWLLRFPDLGEQNLRQSAVAWAGE 1389
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTTS D+LW+GTP++T P +
Sbjct: 1390 ATASRIIFTDVAPKHAHISRARVCDLFLDTPECNAHTTSADILWSGTPLITFPRYKYKMC 1449
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
SR+A+S L + ELI + +EY++ AIRLG D
Sbjct: 1450 SRMASSILTSALPQNAAGREAASELITSSEEEYENNAIRLGLD 1492
>gi|407921016|gb|EKG14185.1| Tetratricopeptide TPR-1 [Macrophomina phaseolina MS6]
Length = 1605
Score = 149 bits (375), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 73/162 (45%), Positives = 108/162 (66%), Gaps = 13/162 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D I+ NFNQLYKI+P+T + W+ +L +P +ILWLL+FP VGE +++ TA A
Sbjct: 1362 LPDDVIILGNFNQLYKIEPTTFRTWLRILANLPKAILWLLRFPDVGETHLKQTALAWAGP 1421
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ R++F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1422 EVASRVIFTDVAPKHQHISRARVCDLFLDTPECNAHTTAADVLWSGTPLLTLPRYPYKMC 1481
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGT 197
SR+AAS L ELIA+ +EY+ +AI+LG+
Sbjct: 1482 SRMAASILKGALPDNDAGQQAAKELIAQNEEEYERLAIKLGS 1523
>gi|343428307|emb|CBQ71837.1| related to UDP-N-acetylglucosaminyltransferase [Sporisorium reilianum
SRZ2]
Length = 2070
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 70/152 (46%), Positives = 105/152 (69%), Gaps = 5/152 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D +++ +FNQLYK DPS ++W+ +L VP SILWLL+FPA GE ++ A+A D
Sbjct: 1700 LPDDYVIFADFNQLYKCDPSLFRLWLRILARVPRSILWLLRFPAAGEHHLLREARAYAGD 1759
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
R++F++VA K H+ RG++AD+ LDT CN HTT++D+LW+ TPV+T P +
Sbjct: 1760 AVAARVIFTDVAPKHIHIHRGRIADLFLDTTECNAHTTAVDILWSATPVLTWPRHMHKMC 1819
Query: 165 SRVAASQLATLGC-PELIARTHKEYQDIAIRL 195
SRVAAS ++ G E+ R+ +EY++ A+RL
Sbjct: 1820 SRVAASIVSATGFGDEMTVRSAEEYEERAVRL 1851
>gi|288963186|ref|YP_003453465.1| hypothetical protein AZL_f01610 [Azospirillum sp. B510]
gi|288915438|dbj|BAI76921.1| hypothetical protein AZL_f01610 [Azospirillum sp. B510]
Length = 669
Score = 148 bits (373), Expect = 1e-33, Method: Composition-based stats.
Identities = 72/155 (46%), Positives = 98/155 (63%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R YGLPE++ V+C+FN L K+ P+ +W+ +L VP S+LWL GE N++ A
Sbjct: 456 RAAYGLPENSFVFCSFNALQKLSPAIFALWMRLLARVPGSVLWLYGGAPDGERNLRREAA 515
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A G+ R+LF++ +H+ R ++AD+CLDT GHTT+ D LW G PVVT G T
Sbjct: 516 ARGVAPDRLLFASKLPLADHLARYRVADLCLDTLPYTGHTTTSDALWMGCPVVTCLGGTF 575
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRVAAS L G PELI T +EY+ +A+RL D
Sbjct: 576 ASRVAASLLTAAGLPELITHTLEEYETLAVRLAED 610
>gi|380485005|emb|CCF39637.1| UDP-N-acetylglucosamine-peptideN-acetylglucosaminyltransferase
[Colletotrichum higginsianum]
Length = 401
Score = 148 bits (373), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 72/161 (44%), Positives = 105/161 (65%), Gaps = 13/161 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D I+ NFNQLYKIDP+T + W+ +L V SILWLL+FP +GE N++ TA+A +
Sbjct: 175 LPDDTIIMGNFNQLYKIDPTTFRTWLRILSKVDKSILWLLRFPELGETNLRRTAKAWAGE 234
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP +
Sbjct: 235 EVARRIIFTDVAPKNQHISRARVCDLFLDTPECNAHTTAADILWSSTPLLTLPRYPYKMC 294
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLG 196
SR+AAS L +LIA + EY++ A+RL
Sbjct: 295 SRMAASILKGALPNDAEGLRAAKDLIAGSEDEYEEFAVRLA 335
>gi|402081136|gb|EJT76281.1| UDP-N-acetylglucosaminyltransferase [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 1576
Score = 148 bits (373), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 108/160 (67%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D I+ NFNQLYKIDP+T + W+ +L AVP +ILWLL+FP +GE N++ TA+ +
Sbjct: 1349 LPDDTIILGNFNQLYKIDPTTFRAWLRILAAVPKAILWLLRFPDLGEVNLKRTAKEWAGE 1408
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
Q +RILF++VA K++H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP +
Sbjct: 1409 QVANRILFTDVAPKQQHISRARVCDLFLDTPECNAHTTAADILWSSTPLLTLPRYEYKMC 1468
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L +L+A +Y+++A++L
Sbjct: 1469 SRMAASILKGALPNSEEGLRAASDLVAEDDVDYENLAVKL 1508
>gi|239609400|gb|EEQ86387.1| UDP-N-acetylglucosaminyltransferase [Ajellomyces dermatitidis ER-3]
Length = 1687
Score = 148 bits (373), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 107/163 (65%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ TA A +
Sbjct: 1463 LRDDAIILGNFNQLYKIEPTTFRTWLRILAHIPNAILWLLRFPELGEHNLKQTAVAWAGE 1522
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1523 ATASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1582
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
SR+A+S L + ELIA + +EY+ AI LG D
Sbjct: 1583 SRMASSILTSALPQTEAGRQAAHELIASSDEEYEKRAISLGLD 1625
>gi|302404586|ref|XP_003000130.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
[Verticillium albo-atrum VaMs.102]
gi|261360787|gb|EEY23215.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
[Verticillium albo-atrum VaMs.102]
Length = 234
Score = 148 bits (373), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 72/163 (44%), Positives = 105/163 (64%), Gaps = 13/163 (7%)
Query: 48 GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGL 107
G +D ++ NFNQLYKIDPST + W+ +L VP ++LWLL+FP +GE N++ TA+A
Sbjct: 8 GFSDDTVILGNFNQLYKIDPSTFRSWLRILSRVPKAVLWLLRFPELGETNLKRTAKAWAG 67
Query: 108 DQ--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET--L 163
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++T P + +
Sbjct: 68 AEVASRIIFTDVAPKHLHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTFPRYSFKM 127
Query: 164 ASRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGT 197
SR+AAS L ELIA + +EY+ +AIRL +
Sbjct: 128 CSRMAASILKGALPKGPEGQRAAQELIASSDEEYESLAIRLAS 170
>gi|327355846|gb|EGE84703.1| UDP-N-acetylglucosaminyltransferase [Ajellomyces dermatitidis ATCC
18188]
Length = 1687
Score = 148 bits (373), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 107/163 (65%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ TA A +
Sbjct: 1463 LRDDAIILGNFNQLYKIEPTTFRTWLRILAHIPNAILWLLRFPELGEHNLKQTAVAWAGE 1522
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1523 ATASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1582
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
SR+A+S L + ELIA + +EY+ AI LG D
Sbjct: 1583 SRMASSILTSALPQTEAGRQAAHELIASSDEEYEKRAISLGLD 1625
>gi|261196356|ref|XP_002624581.1| UDP-N-acetylglucosaminyltransferase [Ajellomyces dermatitidis
SLH14081]
gi|239595826|gb|EEQ78407.1| UDP-N-acetylglucosaminyltransferase [Ajellomyces dermatitidis
SLH14081]
Length = 1659
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 107/163 (65%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ TA A +
Sbjct: 1435 LRDDAIILGNFNQLYKIEPTTFRTWLRILAHIPNAILWLLRFPELGEHNLKQTAVAWAGE 1494
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1495 ATASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1554
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
SR+A+S L + ELIA + +EY+ AI LG D
Sbjct: 1555 SRMASSILTSALPQTEAGRQAAHELIASSDEEYEKRAISLGLD 1597
>gi|367024217|ref|XP_003661393.1| glycosyltransferase family 41 protein [Myceliophthora thermophila
ATCC 42464]
gi|347008661|gb|AEO56148.1| glycosyltransferase family 41 protein [Myceliophthora thermophila
ATCC 42464]
Length = 1574
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 73/160 (45%), Positives = 105/160 (65%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+DAI+ NFNQLYKIDP+T + W+ +L A P + LWLL+FP +GE +++ TA+ G +
Sbjct: 1348 LPDDAILLGNFNQLYKIDPTTFRSWLRILAAAPKAYLWLLRFPELGETHLRRTARDWGGE 1407
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP +
Sbjct: 1408 GVASRIIFTDVAPKLQHISRARVCDLFLDTPECNAHTTAADILWSNTPLLTLPRYEYKMC 1467
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L ELIAR EY++ A+RL
Sbjct: 1468 SRMAASILKGALPKGEAGEQAARELIARDEAEYEEFAVRL 1507
>gi|375104105|ref|ZP_09750366.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Burkholderiales bacterium JOSHI_001]
gi|374664836|gb|EHR69621.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Burkholderiales bacterium JOSHI_001]
Length = 630
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 73/153 (47%), Positives = 98/153 (64%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R YGLPEDA VYC+FN +K P W+ +LKAVPNS+LWLL EAN++A A+
Sbjct: 418 RNTYGLPEDAFVYCSFNNNFKFTPEVFGAWMRMLKAVPNSVLWLLADNRWAEANMRAFAK 477
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+D R++F+ + E+V R QLAD+ LDT N TT+ DVLW GTP++TL G +
Sbjct: 478 ENGVDAARLIFAPRVSPAEYVARFQLADLVLDTFPYNAGTTANDVLWAGTPILTLSGRSY 537
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
SR+ S L +G PELI + EY+ AI++G
Sbjct: 538 ISRMCGSLLTAVGLPELITTSLAEYEQRAIQIG 570
>gi|342321253|gb|EGU13187.1| TPR Domain containing protein [Rhodotorula glutinis ATCC 204091]
Length = 2015
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 69/153 (45%), Positives = 104/153 (67%), Gaps = 5/153 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D +++ N+NQLYKIDP ++W+ +LK PNS+LWLL+FPA GEA+++ TA
Sbjct: 1754 LRDDTVIFANWNQLYKIDPFIFRIWLEILKKHPNSVLWLLRFPAPGEAHLKETALRWAGK 1813
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ R++F++VA K +H+ RG++AD+ LDT CN HTT+ D+LW+GTP++T P +
Sbjct: 1814 EVADRVIFTDVANKNDHIHRGRIADLFLDTTECNAHTTAADILWSGTPILTFPRHAHKMC 1873
Query: 165 SRVAAS-QLATLGCPELIARTHKEYQDIAIRLG 196
SRVAAS +AT P++I EY+ A+ L
Sbjct: 1874 SRVAASIAMATGFGPQMIVNNEHEYERRALELA 1906
>gi|428179162|gb|EKX48034.1| hypothetical protein GUITHDRAFT_68991 [Guillardia theta CCMP2712]
Length = 493
Score = 147 bits (371), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 74/178 (41%), Positives = 106/178 (59%), Gaps = 1/178 (0%)
Query: 24 QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
QTN+ E+ S + SR+ +GLPE A+V+C FN+L +I W+ +LK VPNS
Sbjct: 249 QTNSFQDLYPEILSSSCLPSRKDHGLPEHALVFCTFNRLGRITEHIFHAWIRILKRVPNS 308
Query: 84 ILWLLKFPAVGEANIQATAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNGH 142
+LWL K P + + A+ G+D R +F+ + K EH++R LADV LDT + NGH
Sbjct: 309 VLWLYKHPTMAVLRLLRAAREQGIDATRFVFAGPMMPKIEHLKRLTLADVYLDTHIYNGH 368
Query: 143 TTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
TT D LW G P+VTL G+T SRV AS +G E+IA Y++ A+ LG +++
Sbjct: 369 TTGSDFLWAGVPMVTLQGDTFPSRVGASLARAVGMQEMIATDLAGYEEKAVELGNNKE 426
>gi|240279775|gb|EER43280.1| UDP-N-acetylglucosaminyltransferase [Ajellomyces capsulatus H143]
gi|325092907|gb|EGC46217.1| UDP-N-acetylglucosaminyltransferase [Ajellomyces capsulatus H88]
Length = 1696
Score = 147 bits (371), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 74/163 (45%), Positives = 108/163 (66%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ TA A +
Sbjct: 1472 LRDDAIILGNFNQLYKIEPTTFRTWLRILARIPNAILWLLRFPDLGEQNLKQTAVAWAGE 1531
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1532 ATASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1591
Query: 165 SRVAASQLAT---------LGCPELIARTHKEYQDIAIRLGTD 198
SR+A+S L + L +LI+ + EY++ AI LG D
Sbjct: 1592 SRMASSILTSALPQTEAGRLAARDLISSSDDEYENRAISLGLD 1634
>gi|225562952|gb|EEH11231.1| UDP-N-acetylglucosaminyltransferase [Ajellomyces capsulatus G186AR]
Length = 1696
Score = 147 bits (371), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 74/163 (45%), Positives = 108/163 (66%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ TA A +
Sbjct: 1472 LRDDAIILGNFNQLYKIEPTTFRTWLRILARIPNAILWLLRFPDLGEQNLKQTAVAWAGE 1531
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1532 ATASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1591
Query: 165 SRVAASQLAT---------LGCPELIARTHKEYQDIAIRLGTD 198
SR+A+S L + L +LI+ + EY++ AI LG D
Sbjct: 1592 SRMASSILTSALPQTEAGRLAARDLISSSDDEYENRAISLGLD 1634
>gi|428179456|gb|EKX48327.1| hypothetical protein GUITHDRAFT_105934 [Guillardia theta CCMP2712]
Length = 813
Score = 147 bits (371), Expect = 2e-33, Method: Composition-based stats.
Identities = 73/163 (44%), Positives = 103/163 (63%), Gaps = 3/163 (1%)
Query: 34 EVPQSIVITSRQQYG-LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA 92
++P S V SR+ + LP I+ FNQLYKIDP Q W+ +LK P+++LWLL+FP
Sbjct: 606 DIPSSRV--SRKNFAELPAHGILLACFNQLYKIDPLVFQTWMKILKQHPHALLWLLEFPP 663
Query: 93 VGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTG 152
V NI+ A A G+ R++F K E++ RG LAD+ LDTPL N HT++ D+LW G
Sbjct: 664 VAVENIRKEAIARGVSSDRLVFGKTLPKREYLERGALADLFLDTPLVNAHTSATDILWAG 723
Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
P++TL E++ SRVAAS G PE++A +EY+ +A L
Sbjct: 724 VPMLTLSRESMISRVAASLCLAAGRPEMVAINLEEYEQVASAL 766
>gi|194335759|ref|YP_002017553.1| hypothetical protein Ppha_0637 [Pelodictyon phaeoclathratiforme BU-1]
gi|194308236|gb|ACF42936.1| TPR repeat-containing protein [Pelodictyon phaeoclathratiforme BU-1]
Length = 4489
Score = 147 bits (371), Expect = 3e-33, Method: Composition-based stats.
Identities = 70/158 (44%), Positives = 100/158 (63%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R++ GLPE V+C FN YKI P T + W+ +LK V S+LWLL+ AN++
Sbjct: 2654 VFTRKELGLPEIGFVFCCFNNNYKITPVTFEGWMRILKRVEGSVLWLLEDNPKAAANLRK 2713
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A G+D R++F+ EH+ R ++AD+ LDT CN HTT+ D LW G P++TL G
Sbjct: 2714 EATERGVDAERLIFAKRMPLAEHLARHRIADLFLDTFPCNAHTTASDALWAGLPLLTLAG 2773
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ET A+RVAAS L + PELI T ++Y+ +AI L T+
Sbjct: 2774 ETFAARVAASLLNAIQLPELITFTQEKYEALAIELATN 2811
Score = 144 bits (364), Expect = 1e-32, Method: Composition-based stats.
Identities = 69/158 (43%), Positives = 97/158 (61%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R++ GLPE V+C FN YKI P T W+ +LK V S+LWLL N++
Sbjct: 3416 VFTRKECGLPEKGFVFCCFNNNYKITPVTFDGWMRILKRVEGSVLWLLGGNPKAVENLRK 3475
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
+ G+D R++F+ EH+ R +LAD+ +DT CN HTT+ D LW G P++TL G
Sbjct: 3476 ESTERGVDAERLIFAKRMPLAEHLARHRLADLFIDTFPCNAHTTASDALWAGLPLLTLAG 3535
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ET A+RVAAS L + PELI T +EY+ +AI L T+
Sbjct: 3536 ETFAARVAASLLNAIQLPELITSTQEEYEALAIELATN 3573
Score = 142 bits (358), Expect = 7e-32, Method: Composition-based stats.
Identities = 67/158 (42%), Positives = 95/158 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R++ GLPE V+C FN YKI P W+ +L+ V S+ W+ + N++ A
Sbjct: 545 TRKECGLPEKGFVFCCFNNNYKITPFVFDSWMRILRQVEGSVFWIFEDNPKAAENLRKEA 604
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+D +R++FS EH+ R +LAD+ LDT CN HTT+ D LW G P++TL GET
Sbjct: 605 TRRGVDSNRLIFSKRIPLAEHLARHRLADLFLDTHPCNAHTTASDALWAGLPLLTLAGET 664
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+RVAAS L + PELI T +EY+ +AI L T+ D
Sbjct: 665 FVARVAASLLNAIQLPELITCTQEEYEALAIELATNPD 702
Score = 142 bits (357), Expect = 1e-31, Method: Composition-based stats.
Identities = 67/158 (42%), Positives = 96/158 (60%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R++ GLPE V+C FN YKI P T W+ +LK V S+LWLL N++
Sbjct: 2029 VFTRKECGLPEKGFVFCCFNNNYKITPVTFDGWMRILKRVEGSVLWLLGGNPKAVENLRK 2088
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
+ G+D R++F+ EH+ R ++AD+ +DT CN HTT+ D LW G P++TL G
Sbjct: 2089 ESTERGVDAERLIFAKRMPLAEHLARHRIADLFIDTFPCNAHTTASDALWAGLPLLTLAG 2148
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ET +RVAAS L + PELI T +EY+ +AI L T+
Sbjct: 2149 ETFVARVAASLLNAIQLPELITSTQEEYEALAIELATN 2186
Score = 140 bits (354), Expect = 2e-31, Method: Composition-based stats.
Identities = 68/158 (43%), Positives = 96/158 (60%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R++ GLPE ++C FN YKI P T W+ +L+ V S+LWLL+ N++
Sbjct: 4279 VFTRKECGLPEKGFIFCCFNNNYKITPVTFDGWMRLLRQVEGSVLWLLEDNVTAADNLRK 4338
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A G+D RI+F+ EH+ R +L + LDT CN HTT+ D LW G PV+TL G
Sbjct: 4339 EAIKRGIDAARIVFAQRMPLAEHLARQRLGYLFLDTFPCNAHTTASDALWVGLPVLTLAG 4398
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
E+ ASRVAAS L + PELI T +EY+ +AI + T+
Sbjct: 4399 ESFASRVAASLLNAMQLPELITSTQEEYEALAIDVATN 4436
Score = 138 bits (348), Expect = 1e-30, Method: Composition-based stats.
Identities = 66/156 (42%), Positives = 95/156 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R++ GLPE V+C FN YKI P W+ +L+ V S+LWL + N++ A
Sbjct: 1337 TRKELGLPEIGFVFCCFNNNYKITPLIFDGWMRILRQVEGSVLWLFEDTQRAAQNLRREA 1396
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+D R++F+ EH+ R ++AD+ LDT CN HTT+ D LW G P++TL GET
Sbjct: 1397 IEKGVDAERLIFAKRMPLAEHLARHRIADLFLDTFPCNAHTTASDALWAGLPLLTLAGET 1456
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVAAS L + PELI T ++Y+ +AI L T+
Sbjct: 1457 FAARVAASLLNAIQLPELITFTQEKYEALAIELATN 1492
>gi|154280266|ref|XP_001540946.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150412889|gb|EDN08276.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 1165
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 74/163 (45%), Positives = 108/163 (66%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ TA A +
Sbjct: 941 LRDDAIILGNFNQLYKIEPTTFRTWLRILARIPNAILWLLRFPDLGEQNLKQTAVAWAGE 1000
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1001 ATASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1060
Query: 165 SRVAASQLAT---------LGCPELIARTHKEYQDIAIRLGTD 198
SR+A+S L + L +LI+ + EY++ AI LG D
Sbjct: 1061 SRMASSILTSALPQTEAGRLAARDLISSSDDEYENRAISLGLD 1103
>gi|189197921|ref|XP_001935298.1| UDP-N-acetylglucosaminyltransferase [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187981246|gb|EDU47872.1| UDP-N-acetylglucosaminyltransferase [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 1529
Score = 147 bits (370), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 107/163 (65%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L ++AI+ NFNQLYKI+P+T + W+ +L+ +PN++LWLL+FP +GE N++ TA
Sbjct: 1300 LSDNAIILANFNQLYKIEPTTFRTWLRILEKIPNAVLWLLRFPDLGETNLKQTAHLWAGP 1359
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1360 EVASRIMFTDVAQKHQHISRARVCDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1419
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGTD 198
SR+AAS L ELIA EY+++A+RL +
Sbjct: 1420 SRMAASILKGALPKTPAGAQAAKELIASNDNEYEEMAVRLARN 1462
>gi|330927740|ref|XP_003301979.1| hypothetical protein PTT_13645 [Pyrenophora teres f. teres 0-1]
gi|311322904|gb|EFQ89927.1| hypothetical protein PTT_13645 [Pyrenophora teres f. teres 0-1]
Length = 1669
Score = 147 bits (370), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 107/163 (65%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L ++AI+ NFNQLYKI+P+T + W+ +L+ +PN++LWLL+FP +GE N++ TA
Sbjct: 1440 LSDNAIILANFNQLYKIEPTTFRTWLRILEKIPNAVLWLLRFPDLGETNLKQTAHLWAGA 1499
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1500 EVASRIMFTDVAQKHQHISRARVCDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1559
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGTD 198
SR+AAS L ELIA EY+++A+RL +
Sbjct: 1560 SRMAASILKGALPKTPAGAQAAKELIASNDDEYEEMAVRLARN 1602
>gi|407772277|ref|ZP_11119579.1| hypothetical protein TH2_00220 [Thalassospira profundimaris WP0211]
gi|407284230|gb|EKF09746.1| hypothetical protein TH2_00220 [Thalassospira profundimaris WP0211]
Length = 680
Score = 146 bits (368), Expect = 5e-33, Method: Composition-based stats.
Identities = 71/165 (43%), Positives = 102/165 (61%), Gaps = 2/165 (1%)
Query: 32 GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
G E P++ +R + GLPED V+C+FN YKI P +W+ +LK V S+LWL +
Sbjct: 461 GREFPENT--QTRAELGLPEDGFVFCSFNANYKISPREFDIWMRLLKKVDGSVLWLFEGS 518
Query: 92 AVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
N++ A G+D R++F+ +E+H+ R + AD+ LDT N HTT+ D LWT
Sbjct: 519 EDAMLNLRKEAGNRGVDPARLVFAGFLPEEQHLARHKHADLFLDTFNVNAHTTASDALWT 578
Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
G P+VTLPG+ A+RVAAS L PELIA++ +EY+ IA+ L
Sbjct: 579 GLPLVTLPGKQFAARVAASVLKAANLPELIAKSEEEYEAIALDLA 623
>gi|295663513|ref|XP_002792309.1| UDP-N-acetylglucosaminyltransferase [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226278979|gb|EEH34545.1| UDP-N-acetylglucosaminyltransferase [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 1577
Score = 145 bits (367), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 75/162 (46%), Positives = 107/162 (66%), Gaps = 15/162 (9%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ TA A +
Sbjct: 1353 LRDDAIILGNFNQLYKIEPTTFRTWLRILSRIPNAILWLLRFPDLGEHNLKQTAIAWAGE 1412
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1413 AIASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1472
Query: 165 SRVAASQLAT----------LGCPELIARTHKEYQDIAIRLG 196
SR+A+S L + C ELIA + +EY+ AI LG
Sbjct: 1473 SRMASSILTSALPQTEAGRQAAC-ELIASSEEEYESRAISLG 1513
>gi|86137175|ref|ZP_01055753.1| TPR repeat protein [Roseobacter sp. MED193]
gi|85826499|gb|EAQ46696.1| TPR repeat protein [Roseobacter sp. MED193]
Length = 741
Score = 145 bits (367), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 100/158 (63%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+ GLP++ V+C+FN YKI P +W+ +L V +S+LWLL + N++ A
Sbjct: 507 TRKDCGLPDNGFVFCSFNSSYKITPVEFDIWMRLLDQVEDSVLWLLDCSETSKTNLRKEA 566
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G D R++F+ A+EEH+ R + AD+ LDT + N HTT+ D LW G PV+TLPG
Sbjct: 567 KRRGQDPDRLIFAPRIAQEEHLARHRAADLFLDTFVVNAHTTASDALWAGLPVLTLPGRQ 626
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
ASRV AS ++ +G PE+IA++ +Y+ A+ L D D
Sbjct: 627 FASRVGASLVSAMGLPEMIAKSAADYEARALELANDLD 664
>gi|395334322|gb|EJF66698.1| TPR-like protein [Dichomitus squalens LYAD-421 SS1]
Length = 1243
Score = 145 bits (367), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 71/153 (46%), Positives = 101/153 (66%), Gaps = 5/153 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
+P+D +++ NFNQLYK+DPS +WV +L+ VP SILWLL+FP+ GE ++ +A+ G D
Sbjct: 991 VPQDVVIFANFNQLYKVDPSIFLVWVRILRKVPRSILWLLRFPSAGEEHLLRSARLWGGD 1050
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
RI F++VA K+ HV R + AD+ LDT CN HT + DVLWTGTP++T P +
Sbjct: 1051 DVASRIHFTDVAKKDWHVYRARAADIFLDTAECNAHTIAADVLWTGTPILTWPKHRHKMC 1110
Query: 165 SRVAASQLATLGCPE-LIARTHKEYQDIAIRLG 196
SRVAAS L G + +I + +EY+ A+ L
Sbjct: 1111 SRVAASMLQATGFADHMIVHSAEEYEARAVALA 1143
>gi|83765645|dbj|BAE55788.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1612
Score = 145 bits (366), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 72/160 (45%), Positives = 109/160 (68%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L +PN++LWLL+FP +GE N++ TA A +
Sbjct: 1388 LRDDAIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLKETAVAWAGE 1447
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1448 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1507
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
SR+A+S L++ ELIA + ++Y++ AIRL
Sbjct: 1508 SRMASSILSSALPDSDDGQKAREELIATSDEDYEEKAIRL 1547
>gi|238483389|ref|XP_002372933.1| UDP-N-acetylglucosaminyltransferase [Aspergillus flavus NRRL3357]
gi|220700983|gb|EED57321.1| UDP-N-acetylglucosaminyltransferase [Aspergillus flavus NRRL3357]
gi|391864779|gb|EIT74073.1| O-linked N-acetylglucosamine transferase OGT [Aspergillus oryzae
3.042]
Length = 1668
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 72/160 (45%), Positives = 109/160 (68%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L +PN++LWLL+FP +GE N++ TA A +
Sbjct: 1444 LRDDAIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLKETAVAWAGE 1503
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1504 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1563
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
SR+A+S L++ ELIA + ++Y++ AIRL
Sbjct: 1564 SRMASSILSSALPDSDDGQKAREELIATSDEDYEEKAIRL 1603
>gi|317139826|ref|XP_001817790.2| UDP-N-acetylglucosaminyltransferase [Aspergillus oryzae RIB40]
Length = 1457
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 72/160 (45%), Positives = 109/160 (68%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L +PN++LWLL+FP +GE N++ TA A +
Sbjct: 1233 LRDDAIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLKETAVAWAGE 1292
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1293 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1352
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
SR+A+S L++ ELIA + ++Y++ AIRL
Sbjct: 1353 SRMASSILSSALPDSDDGQKAREELIATSDEDYEEKAIRL 1392
>gi|225677798|gb|EEH16082.1| UDP-N-acetylglucosaminyltransferase [Paracoccidioides brasiliensis
Pb03]
Length = 1671
Score = 145 bits (365), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 74/162 (45%), Positives = 107/162 (66%), Gaps = 15/162 (9%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ TA A +
Sbjct: 1447 LRDDAIILGNFNQLYKIEPTTFRTWLRILSRIPNAILWLLRFPDLGEHNLKQTAIAWAGE 1506
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1507 AIASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1566
Query: 165 SRVAASQLAT----------LGCPELIARTHKEYQDIAIRLG 196
SR+A+S L + C ELIA + +EY+ A+ LG
Sbjct: 1567 SRMASSILTSALPQTEAGRQAAC-ELIASSEEEYESRAVSLG 1607
>gi|346979671|gb|EGY23123.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
[Verticillium dahliae VdLs.17]
Length = 1546
Score = 145 bits (365), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 104/160 (65%), Gaps = 13/160 (8%)
Query: 51 EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
+D ++ NFNQLYKIDPST + W+ +L VP ++LWLL+FP +GE N++ TA+A +
Sbjct: 1323 DDTVILGNFNQLYKIDPSTFRSWLRILSRVPKAVLWLLRFPELGETNLKRTAKAWAGAEV 1382
Query: 110 -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET--LASR 166
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++T P + + SR
Sbjct: 1383 ASRIIFTDVAPKHLHIARARVCDLFLDTPECNAHTTAADVLWSSTPLLTFPRYSFKMCSR 1442
Query: 167 VAASQL---------ATLGCPELIARTHKEYQDIAIRLGT 197
+AAS L ELIA + +EY+ +AIRL +
Sbjct: 1443 MAASILKGALPKGPEGQRAAQELIASSDEEYESLAIRLAS 1482
>gi|226287468|gb|EEH42981.1| UDP-N-acetylglucosaminyltransferase [Paracoccidioides brasiliensis
Pb18]
Length = 1701
Score = 145 bits (365), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 74/162 (45%), Positives = 107/162 (66%), Gaps = 15/162 (9%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ TA A +
Sbjct: 1477 LRDDAIILGNFNQLYKIEPTTFRTWLRILSRIPNAILWLLRFPDLGEHNLKQTAIAWAGE 1536
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1537 AIASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1596
Query: 165 SRVAASQLAT----------LGCPELIARTHKEYQDIAIRLG 196
SR+A+S L + C ELIA + +EY+ A+ LG
Sbjct: 1597 SRMASSILTSALPQTEAGRQAAC-ELIASSEEEYESRAVSLG 1637
>gi|374999539|ref|YP_004975627.1| hypothetical protein AZOLI_p60044 [Azospirillum lipoferum 4B]
gi|357428510|emb|CBS91467.1| conserved protein of unknown function; putative TPR domain
[Azospirillum lipoferum 4B]
Length = 679
Score = 145 bits (365), Expect = 1e-32, Method: Composition-based stats.
Identities = 69/158 (43%), Positives = 99/158 (62%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ SR + GLP D +V+ FN YKI P+ +W+ +LK VP S+LWL + + AN+ A
Sbjct: 467 VPSRAECGLPADGVVFAAFNTAYKISPAMFALWMRILKRVPGSVLWLFEANPLAAANLTA 526
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
+AQA G+D R++F+ +H+ R ++AD+ LDT GHTT+ D LW G PVVT G
Sbjct: 527 SAQAQGVDPARLVFAPPRPLPDHIARYRVADLALDTLPYTGHTTTSDALWAGCPVVTCLG 586
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
T ASRVAAS L G P+ + R+ EY+++A+ L D
Sbjct: 587 GTFASRVAASLLNAAGLPDTVTRSLAEYEEMAVALAGD 624
>gi|332526962|ref|ZP_08403051.1| TPR repeat-containing protein [Rubrivivax benzoatilyticus JA2]
gi|332111401|gb|EGJ11384.1| TPR repeat-containing protein [Rubrivivax benzoatilyticus JA2]
Length = 622
Score = 145 bits (365), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/157 (44%), Positives = 99/157 (63%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR++ GLPE A V+C FNQ YK++P+ L +W +L VP+S+LWLL++ N+ A
Sbjct: 408 SRREEGLPEGATVFCCFNQSYKLNPAMLDLWARILAQVPDSVLWLLEWNGQAPRNLLAEL 467
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ + R++F + E+H+ R Q AD+ LDT CN HTT+ + LW G PV+T+PG+T
Sbjct: 468 AARGIARERVVFGKRVSIEDHLARLQCADLFLDTWPCNAHTTASEALWAGVPVLTVPGQT 527
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASRVAAS L EL+ Y D+A+ L DR
Sbjct: 528 FASRVAASLLHACELDELVCGDGDAYVDLAVGLARDR 564
>gi|154316161|ref|XP_001557402.1| hypothetical protein BC1G_03666 [Botryotinia fuckeliana B05.10]
gi|347836428|emb|CCD51000.1| glycosyltransferase family 41 protein [Botryotinia fuckeliana]
Length = 1576
Score = 144 bits (364), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 72/160 (45%), Positives = 104/160 (65%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L VP +ILWLL+FP +GE+N++ATA+ +
Sbjct: 1350 LSDDAIILGNFNQLYKIEPTTFRTWLRILDKVPKAILWLLRFPDLGESNLKATAREWAGE 1409
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1410 SVASRIWFTDVAPKHLHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYKYKMC 1469
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L ELI++ +Y++ AIRL
Sbjct: 1470 SRMAASILKGALPRSEEGDRAASELISKDEAQYEEFAIRL 1509
>gi|449551087|gb|EMD42051.1| glycosyltransferase family 41 protein [Ceriporiopsis subvermispora B]
Length = 1365
Score = 144 bits (364), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 69/150 (46%), Positives = 100/150 (66%), Gaps = 5/150 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
+P+D +++ NFNQLYKIDP +W+ +L VP S+LWLL+FPA GE +I+ TA+ +
Sbjct: 1114 MPQDVVIFANFNQLYKIDPGIFAVWLRILVQVPRSVLWLLRFPAAGEEHIKCTARMWANE 1173
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ RILF++V KEEHV R ++AD+ LDT CN HT + DVLWTGTP++T P +
Sbjct: 1174 EVASRILFTDVTRKEEHVYRTRVADLFLDTVECNAHTIAADVLWTGTPIITFPKHRHKMC 1233
Query: 165 SRVAASQLATLGC-PELIARTHKEYQDIAI 193
SRVAAS G +++A + +Y+ A+
Sbjct: 1234 SRVAASMAHATGFGDQMVASSMDDYETRAV 1263
>gi|329895776|ref|ZP_08271152.1| TPR repeat-containing protein [gamma proteobacterium IMCC3088]
gi|328922138|gb|EGG29495.1| TPR repeat-containing protein [gamma proteobacterium IMCC3088]
Length = 807
Score = 144 bits (364), Expect = 1e-32, Method: Composition-based stats.
Identities = 68/159 (42%), Positives = 98/159 (61%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
TSR +GLPE+A V C FN +KI P +W+ +L + ++LWLLK E N++
Sbjct: 591 TSRADFGLPENAFVLCCFNNNFKISPREFDIWMRILNRLEKAVLWLLKTNKWTEHNLRKE 650
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A+ G+D RI+F+ + EH+ R + AD+ +DT CN HTT+ D LW G PVVT G
Sbjct: 651 AKQRGIDPSRIVFAERLPQSEHLARHKHADLFIDTFNCNAHTTASDALWAGLPVVTKQGS 710
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A+RV+AS L ++G PELI T +EY+ + + L TD +
Sbjct: 711 QFAARVSASLLNSIGLPELIVNTQEEYEALILELATDTN 749
>gi|381166068|ref|ZP_09875286.1| conserved hypothetical protein [Phaeospirillum molischianum DSM
120]
gi|380684800|emb|CCG40098.1| conserved hypothetical protein [Phaeospirillum molischianum DSM
120]
Length = 671
Score = 144 bits (364), Expect = 2e-32, Method: Composition-based stats.
Identities = 73/163 (44%), Positives = 96/163 (58%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
P S R YGLP+ A V+C FNQ YKI+P W+ +L+ VP+S+LWLL E
Sbjct: 455 PISPDSAERAAYGLPDGAFVFCCFNQAYKIEPIMFGRWMRILERVPDSVLWLLGDSVAME 514
Query: 96 ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
N++ AQA G+D R++F+ K H+ R + A + LDT N HTT+ D LW G P+
Sbjct: 515 TNLRREAQARGIDPARLVFAAREPKPRHLARHRHAGLGLDTLFYNAHTTASDALWAGLPI 574
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+T PGE ASRV AS L LG PELI Y++ A+ L TD
Sbjct: 575 LTTPGEAFASRVGASLLGALGLPELICPDLDAYEEKAVALATD 617
>gi|358366822|dbj|GAA83442.1| UDP-N-acetylglucosaminyltransferase [Aspergillus kawachii IFO 4308]
Length = 1670
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 108/160 (67%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I+ NFNQLYKI+P+T + W+ +L +PN++LWLL+FP +GE N++ TA A +
Sbjct: 1446 LSDDTIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLRETAVAWAGE 1505
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1506 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1565
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
SR+A+S L++ +LIA + ++Y+D AIRL
Sbjct: 1566 SRMASSILSSALPKSDAGREAQSDLIAMSDEDYEDKAIRL 1605
>gi|406867525|gb|EKD20563.1| UDP-N-acetylglucosaminyltransferase [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 1710
Score = 144 bits (363), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 105/162 (64%), Gaps = 13/162 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L VP +ILWLL+FP +GE+N++ TAQ +
Sbjct: 1410 LSDDAIILGNFNQLYKIEPTTFRTWLRILDRVPKAILWLLRFPDLGESNLKRTAQKWAGE 1469
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1470 NVASRIWFTDVAPKHQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYKYKMC 1529
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGT 197
SR+AAS L ELIA +Y+++AI+L +
Sbjct: 1530 SRMAASILKGALPRGEEGRRAAEELIAEDDTQYEELAIKLAS 1571
>gi|296811472|ref|XP_002846074.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
[Arthroderma otae CBS 113480]
gi|238843462|gb|EEQ33124.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
[Arthroderma otae CBS 113480]
Length = 1698
Score = 144 bits (363), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 70/161 (43%), Positives = 104/161 (64%), Gaps = 13/161 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D ++ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP VGE N++ TA+A +
Sbjct: 1473 LKDDTVILGNFNQLYKIEPTTFRTWLRILAGIPNAILWLLRFPDVGEQNLRQTAKAWAGE 1532
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P +
Sbjct: 1533 ATASRIIFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTFPRYKYKMC 1592
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLG 196
SR+A+S L + +LI+ + EY+ AI LG
Sbjct: 1593 SRMASSILTSALPQTEDGKQAARQLISSSEAEYEQTAIELG 1633
>gi|320588655|gb|EFX01123.1| udp-n-acetylglucosaminyltransferase [Grosmannia clavigera kw1407]
Length = 1733
Score = 144 bits (363), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/161 (45%), Positives = 100/161 (62%), Gaps = 13/161 (8%)
Query: 51 EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
+D ++ NFNQLYKIDP+T + W+ +L VPN+ILWLL+FP GEAN++ TA +
Sbjct: 1514 DDTVILGNFNQLYKIDPTTFRSWLRILAGVPNAILWLLRFPEAGEANLRRTALTWAGESV 1573
Query: 110 -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLASR 166
RI+F++VA K H+ R + D+ LDTP CN HTT+ DVLW+GTP++TLP + SR
Sbjct: 1574 ACRIIFTDVAPKHLHIARASVCDIFLDTPECNAHTTAADVLWSGTPLLTLPRYNYKMCSR 1633
Query: 167 VAASQL---------ATLGCPELIARTHKEYQDIAIRLGTD 198
+AAS L ELIA +Y+ AIRL +
Sbjct: 1634 IAASILRGALPKDAEGKQAAAELIAVDESDYEKSAIRLANN 1674
>gi|388852878|emb|CCF53563.1| related to UDP-N-acetylglucosaminyltransferase [Ustilago hordei]
Length = 2087
Score = 144 bits (362), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 100/153 (65%), Gaps = 5/153 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D +++ +FNQLYK DP ++W+ +LK VP SILWLL+FPA GE ++ A+ D
Sbjct: 1690 LPDDYVIFADFNQLYKCDPMLFKLWLRILKRVPKSILWLLRFPAAGEHHLLRQARRYAGD 1749
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ R++F++VA K H+ RG++AD+ LDT CN HTT+ D+LW+ TPV+T P +
Sbjct: 1750 EVASRVIFTDVAPKHIHIHRGRIADLFLDTTECNAHTTAADILWSATPVLTWPRHMHKMC 1809
Query: 165 SRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
SRVAAS + G E+ + +EY+D A+
Sbjct: 1810 SRVAASIVHATGFGEEMTVNSEQEYEDRAVEFA 1842
>gi|407772275|ref|ZP_11119577.1| hypothetical protein TH2_00210 [Thalassospira profundimaris WP0211]
gi|407284228|gb|EKF09744.1| hypothetical protein TH2_00210 [Thalassospira profundimaris WP0211]
Length = 628
Score = 144 bits (362), Expect = 3e-32, Method: Composition-based stats.
Identities = 68/155 (43%), Positives = 94/155 (60%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
T+R +GLPE V+C+FN YKI P +W+ +L V S+LWL K N++
Sbjct: 416 TTRADFGLPETGFVFCSFNNTYKITPREFDIWMRLLSKVEGSVLWLFKGNDYAAENLRKE 475
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
AQ G+D R++F++ + EH+ R + AD+ LDT N HTT+ D LW G P+VTLPGE
Sbjct: 476 AQKRGIDPGRLVFADKMPEPEHLARHKHADLLLDTFNVNAHTTASDALWAGLPLVTLPGE 535
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
A+RVAAS L PELIA+ +Y+ IA+ L
Sbjct: 536 QFAARVAASILTAANLPELIAKDEADYEAIALDLA 570
>gi|313201112|ref|YP_004039770.1| hypothetical protein MPQ_1373 [Methylovorus sp. MP688]
gi|312440428|gb|ADQ84534.1| Tetratricopeptide TPR_2 repeat protein [Methylovorus sp. MP688]
Length = 697
Score = 143 bits (361), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 97/156 (62%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R YGLPE+A VYC FNQ +KI P+ W+ +L+ VP+S+LWLL+ A AN+Q A
Sbjct: 500 ARVDYGLPEEAFVYCCFNQSFKITPTVFACWMRILQQVPDSVLWLLESNATATANLQQAA 559
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+D R++F+ A +H+ R LAD+ LDT N HTT+ D LW PV+T G+T
Sbjct: 560 KTAGIDPARLVFAPRVAMADHLARHALADLFLDTLPYNAHTTASDALWMCLPVLTCSGDT 619
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRVA S L + PELIA Y+ +AI L D
Sbjct: 620 FASRVAGSLLHAVNLPELIAPDMAAYEAMAISLRHD 655
>gi|398390604|ref|XP_003848762.1| hypothetical protein MYCGRDRAFT_101342 [Zymoseptoria tritici IPO323]
gi|339468638|gb|EGP83738.1| UDP-N-ACETYLGLUCOSAMINE--peptide N-ACETYLGLUCOSAMINYLtransferase
[Zymoseptoria tritici IPO323]
Length = 1526
Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 72/161 (44%), Positives = 105/161 (65%), Gaps = 13/161 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L ++ I++ NFNQLYKIDP+T + W+ +L VPNSILWLL+FP +GE+++ ATA+
Sbjct: 1295 LSDETIIFGNFNQLYKIDPTTFRTWLRILARVPNSILWLLRFPDLGESHLLATARLWAGS 1354
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ R++F++VA K H+ R ++ D+ LDT CN HTT+ DVLW+GTP++TLP +
Sbjct: 1355 EVASRVIFTDVAPKHLHISRARVCDLVLDTAECNAHTTAADVLWSGTPLLTLPRYAYKMC 1414
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLG 196
SR+AAS L ELIA ++Y+D A+ LG
Sbjct: 1415 SRMAASILRGALPKNEEGDQAAVELIAHNEEDYEDKAVALG 1455
>gi|119357636|ref|YP_912280.1| hypothetical protein Cpha266_1840 [Chlorobium phaeobacteroides DSM
266]
gi|119354985|gb|ABL65856.1| TPR repeat-containing protein [Chlorobium phaeobacteroides DSM 266]
Length = 3560
Score = 143 bits (360), Expect = 4e-32, Method: Composition-based stats.
Identities = 69/158 (43%), Positives = 93/158 (58%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R + GLPE V+C FN YKI P+T W+ +L VP S+LWL + N+
Sbjct: 562 VFTRAECGLPESGFVFCCFNNNYKITPATFDGWMRILGQVPGSVLWLFEDNPAVTGNLGR 621
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A G+ R++F+ EH+ R +L D+ LDT CN HTT+ D LW G PV+T G
Sbjct: 622 EAATRGIAGDRLVFAKRMPLPEHLARHRLGDLFLDTLPCNAHTTASDALWAGLPVLTCMG 681
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
E+ ASRVAAS L +G PELI T +EY+ +AI L D
Sbjct: 682 ESFASRVAASVLTAIGLPELITTTQEEYEALAIELALD 719
Score = 138 bits (348), Expect = 1e-30, Method: Composition-based stats.
Identities = 67/158 (42%), Positives = 95/158 (60%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R + GLPE V+C FN YKI P+T W+ +L VP S+LWL + A N++
Sbjct: 3347 VFTRAECGLPESGFVFCCFNNTYKITPATFDGWMRILGQVPGSVLWLFEENAKAAENLRR 3406
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A + G+D R++F E++ R ++AD+ LDT N TT+ D LW G PV+TL G
Sbjct: 3407 EAASRGVDAGRLIFGKRLPVAEYLARYRVADLFLDTLPYNAGTTASDALWAGLPVLTLRG 3466
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
E+ ASR+AAS L +G PELI +EY+ +AI L D
Sbjct: 3467 ESFASRMAASLLTAIGLPELITSGQEEYESLAIELALD 3504
Score = 138 bits (348), Expect = 1e-30, Method: Composition-based stats.
Identities = 74/182 (40%), Positives = 104/182 (57%), Gaps = 7/182 (3%)
Query: 20 LATNQTN-TKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLK 78
L++ Q N TK E V +R + GLPE V+C FN YKI P+T W+ +L
Sbjct: 1973 LSSYQVNDTKRRIAERV------FTRAECGLPESGFVFCCFNNTYKITPATFDGWMRILG 2026
Query: 79 AVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPL 138
VP S+LWL + A N++ A + G+D R++F E++ R ++AD+ LDT
Sbjct: 2027 QVPGSVLWLYEENAKAAENLRREAASRGVDAGRLIFGKRLPVAEYLARYRVADLFLDTLP 2086
Query: 139 CNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
N TT+ D LW G PV+TL GE+ ASR+AAS L +G PELI +EY+ +AI L D
Sbjct: 2087 YNAGTTASDALWAGLPVLTLRGESFASRMAASLLTAIGLPELITSGQEEYESLAIELALD 2146
Query: 199 RD 200
+
Sbjct: 2147 SE 2148
Score = 136 bits (342), Expect = 6e-30, Method: Composition-based stats.
Identities = 67/158 (42%), Positives = 94/158 (59%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R + GLPE V+C FN YKI P+T W+ +L VP S+LWL + A N++
Sbjct: 1241 VFTRAECGLPESGFVFCCFNNTYKITPATFDGWMRILGQVPGSVLWLFEENAKAAENLRR 1300
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A + G+D R++F E++ R ++AD+ LDT N TT+ D LW G PV+TL G
Sbjct: 1301 EAASRGVDAGRLIFGKRLPVAEYLARYRVADLFLDTLPYNAGTTASDALWAGLPVLTLRG 1360
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
E+ ASR+AAS L +G PELI +EY+ AI L D
Sbjct: 1361 ESFASRMAASLLTAIGLPELITSGQEEYEARAIELALD 1398
Score = 131 bits (329), Expect = 2e-28, Method: Composition-based stats.
Identities = 71/180 (39%), Positives = 102/180 (56%), Gaps = 7/180 (3%)
Query: 20 LATNQTN-TKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLK 78
L++ Q N TK E V +R + GLPE V+C FN YKI P+T W+ +L
Sbjct: 2652 LSSYQVNDTKRRIAERV------FTRAECGLPESGFVFCCFNNNYKITPATFDGWMRILG 2705
Query: 79 AVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPL 138
V S+L+L +N++ A++ G+ + R++F EH+ R ++AD+ LDT
Sbjct: 2706 QVEGSVLFLYTDNEAAASNLKKEAESRGVKRDRLIFGKRLPLAEHLARYRVADLFLDTNP 2765
Query: 139 CNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
N TT+ D LW G PV+TL GE+ ASR+AAS L +G PELI +EY+ +AI L D
Sbjct: 2766 YNAGTTASDALWAGLPVLTLRGESFASRMAASLLTAIGLPELITSGQEEYESLAIELALD 2825
>gi|336472604|gb|EGO60764.1| hypothetical protein NEUTE1DRAFT_57508 [Neurospora tetrasperma FGSC
2508]
gi|350294161|gb|EGZ75246.1| hypothetical protein NEUTE2DRAFT_104796 [Neurospora tetrasperma FGSC
2509]
Length = 1655
Score = 143 bits (360), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 72/160 (45%), Positives = 105/160 (65%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D I+ NFNQLYKIDP+T + W+ +L VP +ILWLL+FP +GE N++ TA+ +
Sbjct: 1406 LPDDRIILGNFNQLYKIDPTTFRTWLRILAHVPKAILWLLRFPELGENNLRRTAKLWAGE 1465
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ RI+F++VA K++H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1466 EVASRIIFTDVAPKQQHIARARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYEYKMC 1525
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L +LIA EY++ AI+L
Sbjct: 1526 SRMAASILKGALPKSEAGRQAAEDLIAGDDYEYENRAIKL 1565
>gi|116206628|ref|XP_001229123.1| hypothetical protein CHGG_02607 [Chaetomium globosum CBS 148.51]
gi|88183204|gb|EAQ90672.1| hypothetical protein CHGG_02607 [Chaetomium globosum CBS 148.51]
Length = 1434
Score = 143 bits (360), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 104/160 (65%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+DAI+ NFNQLYKIDP+T + W+ +L A P + LWLL+FP +GE +++ TA+ +
Sbjct: 1209 LPDDAILLGNFNQLYKIDPTTFRSWLRILAAAPKAYLWLLRFPELGETHLRRTARDWSGE 1268
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP +
Sbjct: 1269 GVASRIIFTDVAPKLQHISRARVCDLFLDTPECNAHTTAADILWSNTPLLTLPRYEYKMC 1328
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L ELIAR EY++ A++L
Sbjct: 1329 SRMAASILKGALPKGPEGERAAQELIARDEAEYEEFAVQL 1368
>gi|315044201|ref|XP_003171476.1| hypothetical protein MGYG_06022 [Arthroderma gypseum CBS 118893]
gi|311343819|gb|EFR03022.1| hypothetical protein MGYG_06022 [Arthroderma gypseum CBS 118893]
Length = 1698
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 68/161 (42%), Positives = 104/161 (64%), Gaps = 13/161 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D ++ NFNQLYKI+P+T + W+ +L +PN++LWLL+FP VGE N++ TA+A +
Sbjct: 1473 LKDDTVILGNFNQLYKIEPTTFRTWLRILAGIPNAVLWLLRFPDVGEQNLRQTAKAWAGE 1532
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P +
Sbjct: 1533 ATASRIIFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTFPRYKYKMC 1592
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLG 196
SR+A+S L + +L++ + EY+ AI LG
Sbjct: 1593 SRMASSILTSALPQTEDGKDAARQLVSSSEAEYEKTAIELG 1633
>gi|71006212|ref|XP_757772.1| hypothetical protein UM01625.1 [Ustilago maydis 521]
gi|46097017|gb|EAK82250.1| hypothetical protein UM01625.1 [Ustilago maydis 521]
Length = 2239
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 101/153 (66%), Gaps = 5/153 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D +++ +FNQLYK DP ++W+ +L+ VP SILWLL+FPA GE ++ A+ D
Sbjct: 1862 LPDDYVIFADFNQLYKCDPMLFRLWLRILERVPKSILWLLRFPAAGEEHLMHEARRYAGD 1921
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ R++F++VA K H+ RG++AD+ LDT CN HTT+ D+LW+ TP++T P +
Sbjct: 1922 RVACRVIFTDVAPKHIHIHRGRIADLFLDTIECNAHTTAADILWSATPMLTWPRHVHKMC 1981
Query: 165 SRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
SRVAAS + G E+ + KEY+D A+ L
Sbjct: 1982 SRVAASIVHATGFGDEMTVHSEKEYEDRAVELA 2014
>gi|300022675|ref|YP_003755286.1| hypothetical protein Hden_1151 [Hyphomicrobium denitrificans ATCC
51888]
gi|299524496|gb|ADJ22965.1| TPR repeat-containing protein [Hyphomicrobium denitrificans ATCC
51888]
Length = 818
Score = 143 bits (360), Expect = 5e-32, Method: Composition-based stats.
Identities = 70/176 (39%), Positives = 105/176 (59%), Gaps = 5/176 (2%)
Query: 24 QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
Q N + T E+P +R GLPEDA V+C+FN YK++ + +W+++L+ V S
Sbjct: 595 QPNDRKRTISELP-----VTRADAGLPEDAFVFCSFNNSYKLNATMFDVWMSLLRNVDGS 649
Query: 84 ILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
+LWLL A N++ A G+D R++F++ EH+ R +LAD+ LD CN HT
Sbjct: 650 VLWLLVPTATCRENLRREAAQRGVDPDRLVFASRKPIAEHLARHRLADLFLDALPCNAHT 709
Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
T+ D LW G PV+T GET + RVAAS L +G PEL+ + +Y ++A+ L D+
Sbjct: 710 TASDALWAGLPVITATGETFSGRVAASLLTAVGLPELVTKNLDDYAELALALARDK 765
>gi|452845777|gb|EME47710.1| glycosyltransferase family 41 protein [Dothistroma septosporum NZE10]
Length = 1653
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 71/159 (44%), Positives = 105/159 (66%), Gaps = 13/159 (8%)
Query: 51 EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
+D I++ NFNQLYKI+P+T + W+ +L VPN+ILWLL+FP +GE+N+ ATA+ G
Sbjct: 1425 DDTIIFGNFNQLYKIEPTTFRTWLRILARVPNAILWLLRFPDLGESNLLATARMWGGPDV 1484
Query: 110 -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLASR 166
R+ F++VA K H+ R ++ D+ LDT CN HTT+ DVLW+GTP++TLP + SR
Sbjct: 1485 ASRVTFTDVAPKHLHISRARVCDLVLDTAECNAHTTAADVLWSGTPLLTLPRYEYKMCSR 1544
Query: 167 VAASQL---------ATLGCPELIARTHKEYQDIAIRLG 196
+AAS L EL+A + ++Y++ AI+LG
Sbjct: 1545 MAASILKGALPKGDEGDRAAEELVATSEEDYEEKAIKLG 1583
>gi|346322980|gb|EGX92578.1| UDP-N-acetylglucosaminyltransferase [Cordyceps militaris CM01]
Length = 1592
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 73/161 (45%), Positives = 103/161 (63%), Gaps = 13/161 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L ED ++ NFNQLYKIDPST + W+ +L VP ++LWLL+FP GEAN++ TA+
Sbjct: 1283 LQEDRVILANFNQLYKIDPSTFRSWLRILAQVPKAVLWLLRFPEAGEANLRRTAELWAGP 1342
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ R++F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++T P +
Sbjct: 1343 EVASRLIFTDVAQKSQHISRARICDLFLDTPECNAHTTAADVLWSSTPLLTFPRYPYKMC 1402
Query: 165 SRVAASQL--ATLGCP-------ELIARTHKEYQDIAIRLG 196
SR+AAS L A P ELIA + K Y+ +AI L
Sbjct: 1403 SRMAASILRGALPKSPEGRHAAGELIATSEKRYEKVAIDLA 1443
>gi|443899251|dbj|GAC76582.1| animal-type fatty acid synthase and related proteins [Pseudozyma
antarctica T-34]
Length = 2043
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 101/153 (66%), Gaps = 5/153 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D +++ +FNQLYK DP ++W+ +LK VP SILWLL+FPA GE ++ A+ D
Sbjct: 1723 LPDDYVIFADFNQLYKCDPMLFKLWLRILKRVPKSILWLLRFPAAGEHHLLREARQYAGD 1782
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
R++F++VA K H+ RG++AD+ LDT CN HTT+ D+LW+ TPV+T P +
Sbjct: 1783 DVAARVIFTDVAPKHIHIHRGRIADLFLDTTECNAHTTAADILWSATPVLTWPRHMHKMC 1842
Query: 165 SRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
SRVAAS + G E+ + +EY++ A++L
Sbjct: 1843 SRVAASIVHATGFGDEMTVHSEREYEERAVQLA 1875
>gi|326476337|gb|EGE00347.1| UDP-N-acetylglucosaminyltransferase [Trichophyton tonsurans CBS
112818]
Length = 1527
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 70/161 (43%), Positives = 103/161 (63%), Gaps = 13/161 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D ++ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP VGE N++ TA+A +
Sbjct: 1302 LKDDTVILGNFNQLYKIEPTTFRTWLRILAGIPNAILWLLRFPDVGEYNLRQTAKAWAGE 1361
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P +
Sbjct: 1362 ATASRIVFTDVAPKHTHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTFPRYKYKMC 1421
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLG 196
SR+A+S L + +LI+ EY+ AI LG
Sbjct: 1422 SRMASSILTSALPQTEDGKQAARQLISSDEVEYEKTAIELG 1462
>gi|336266788|ref|XP_003348161.1| hypothetical protein SMAC_04006 [Sordaria macrospora k-hell]
Length = 1443
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 105/160 (65%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D I+ NFNQLYKIDP+T + W+ +L+ VP ++LWLL+FP +GE N++ TA+ +
Sbjct: 1197 LPDDRIILGNFNQLYKIDPTTFRSWLRILQDVPKAVLWLLRFPELGENNLRRTAKQWAGE 1256
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ RI+F++VA K++H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1257 EVASRIIFTDVAPKQQHIARARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYEYKMC 1316
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L +LIA EY++ AI L
Sbjct: 1317 SRMAASILKGALPKSEAGRRAAEDLIADDDYEYENKAIEL 1356
>gi|380091097|emb|CCC11303.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1514
Score = 142 bits (359), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 105/160 (65%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D I+ NFNQLYKIDP+T + W+ +L+ VP ++LWLL+FP +GE N++ TA+ +
Sbjct: 1268 LPDDRIILGNFNQLYKIDPTTFRSWLRILQDVPKAVLWLLRFPELGENNLRRTAKQWAGE 1327
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ RI+F++VA K++H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1328 EVASRIIFTDVAPKQQHIARARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYEYKMC 1387
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L +LIA EY++ AI L
Sbjct: 1388 SRMAASILKGALPKSEAGRRAAEDLIADDDYEYENKAIEL 1427
>gi|255935459|ref|XP_002558756.1| Pc13g03180 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211583376|emb|CAP91387.1| Pc13g03180 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1515
Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 70/163 (42%), Positives = 108/163 (66%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I+ NFNQLYK++P+T + W+ +L +PN++LWLL+FP GE N++ A+A +
Sbjct: 1292 LSDDTIILGNFNQLYKVEPTTFRTWLRILARIPNAVLWLLRFPDTGEQNLRDIAKAWAGE 1351
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1352 ETASRIIFTDVAPKNTHIARAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1411
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
SR+A+S L++ ELIA + ++Y++ AIRL D
Sbjct: 1412 SRMASSILSSALPSSEAGQRAREELIASSDEDYENKAIRLCLD 1454
>gi|302661258|ref|XP_003022298.1| hypothetical protein TRV_03509 [Trichophyton verrucosum HKI 0517]
gi|291186238|gb|EFE41680.1| hypothetical protein TRV_03509 [Trichophyton verrucosum HKI 0517]
Length = 1697
Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 70/161 (43%), Positives = 103/161 (63%), Gaps = 13/161 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D ++ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP VGE N++ TA+A +
Sbjct: 1472 LKDDTVILGNFNQLYKIEPTTFRTWLRILAGIPNAILWLLRFPDVGEHNLRQTAKAWAGE 1531
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P +
Sbjct: 1532 ATASRIVFTDVAPKHTHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTFPRYKYKMC 1591
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLG 196
SR+A+S L + +LI+ EY+ AI LG
Sbjct: 1592 SRMASSILTSALPQTEDGKQAARQLISSDEVEYEKTAIELG 1632
>gi|326481193|gb|EGE05203.1| UDP-N-acetylglucosaminyltransferase [Trichophyton equinum CBS 127.97]
Length = 1564
Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 70/161 (43%), Positives = 103/161 (63%), Gaps = 13/161 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D ++ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP VGE N++ TA+A +
Sbjct: 1339 LKDDTVILGNFNQLYKIEPTTFRTWLRILAGIPNAILWLLRFPDVGEYNLRQTAKAWAGE 1398
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P +
Sbjct: 1399 ATASRIVFTDVAPKHTHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTFPRYKYKMC 1458
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLG 196
SR+A+S L + +LI+ EY+ AI LG
Sbjct: 1459 SRMASSILTSALPQTEDGKQAARQLISSDEVEYEKTAIELG 1499
>gi|317025352|ref|XP_001388903.2| UDP-N-acetylglucosaminyltransferase [Aspergillus niger CBS 513.88]
Length = 1670
Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 108/160 (67%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I+ NFNQLYKI+P+T + W+ +L +PN++LWLL+FP +GE N++ TA A +
Sbjct: 1446 LSDDTIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLRETAVAWAGE 1505
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1506 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1565
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
SR+A+S L++ +LIA + ++Y++ AIRL
Sbjct: 1566 SRMASSILSSALPKSDAGREAQSDLIAMSDEDYENKAIRL 1605
>gi|134055003|emb|CAK37011.1| unnamed protein product [Aspergillus niger]
Length = 1546
Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 108/160 (67%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I+ NFNQLYKI+P+T + W+ +L +PN++LWLL+FP +GE N++ TA A +
Sbjct: 1322 LSDDTIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLRETAVAWAGE 1381
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1382 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1441
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
SR+A+S L++ +LIA + ++Y++ AIRL
Sbjct: 1442 SRMASSILSSALPKSDAGREAQSDLIAMSDEDYENKAIRL 1481
>gi|342879983|gb|EGU81213.1| hypothetical protein FOXB_08246 [Fusarium oxysporum Fo5176]
Length = 1491
Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 106/162 (65%), Gaps = 13/162 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I+ NFNQLYKI+P+T + W+ +L VP +++WLL+FP +GEAN++ TA+A +
Sbjct: 1265 LSDDTIIMGNFNQLYKIEPTTFRTWLRILAQVPKAVIWLLRFPELGEANLRRTAKAWAGE 1324
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ R++F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1325 EVASRLIFTDVAPKSQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1384
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGT 197
SR+AAS L ELIA + +EY+ A+ L T
Sbjct: 1385 SRMAASILKGALPKSNEGQEAAAELIAASEEEYEQRAVELAT 1426
>gi|425766660|gb|EKV05262.1| UDP-N-acetylglucosaminyltransferase [Penicillium digitatum Pd1]
gi|425775265|gb|EKV13543.1| UDP-N-acetylglucosaminyltransferase [Penicillium digitatum PHI26]
Length = 1642
Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 67/163 (41%), Positives = 106/163 (65%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I+ NFNQLYK++P+T + W+ +L +PN++LWLL+FP GE N++ A+A +
Sbjct: 1419 LSDDTIILGNFNQLYKVEPTTFRTWLRILARIPNAVLWLLRFPDTGEKNLRDIAKAWAGE 1478
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP------- 159
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP
Sbjct: 1479 ETASRIIFTDVAPKNTHIARAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1538
Query: 160 ----GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
L+S + +S++ +LIA + ++Y++ AIRL D
Sbjct: 1539 SRMASSILSSALPSSEVGQRAREDLIASSDEDYENKAIRLCLD 1581
>gi|302897315|ref|XP_003047536.1| hypothetical protein NECHADRAFT_105493 [Nectria haematococca mpVI
77-13-4]
gi|256728467|gb|EEU41823.1| hypothetical protein NECHADRAFT_105493 [Nectria haematococca mpVI
77-13-4]
Length = 1538
Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 106/160 (66%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L VP ++LWLL+FP +GEAN++ TA+A
Sbjct: 1312 LADDAIIMGNFNQLYKIEPTTFRSWLRILAQVPKAVLWLLRFPELGEANLRRTAKAWAGT 1371
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ R++F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1372 EVASRLIFTDVAPKSQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1431
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L ELIA + +EY+ A++L
Sbjct: 1432 SRMAASILKGALPKSEDGRKAAAELIASSEEEYEQRAVQL 1471
>gi|164426318|ref|XP_961401.2| hypothetical protein NCU01315 [Neurospora crassa OR74A]
gi|157071287|gb|EAA32165.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 1655
Score = 142 bits (358), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 105/160 (65%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D I+ NFNQLYKIDP+T + W+ +L VP ++LWLL+FP +GE N++ TA+ +
Sbjct: 1406 LPDDRIILGNFNQLYKIDPTTFRTWLRILAHVPKAVLWLLRFPELGENNLRRTAKLWAGE 1465
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ RI+F++VA K++H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1466 EVASRIIFTDVAPKQQHIARARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYEYKMC 1525
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L +LIA EY++ AI+L
Sbjct: 1526 SRMAASILKGALPKSEAGRQAAEDLIAGDDYEYENRAIKL 1565
>gi|449300658|gb|EMC96670.1| glycosyltransferase family 41 protein [Baudoinia compniacensis UAMH
10762]
Length = 1439
Score = 142 bits (358), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 73/159 (45%), Positives = 104/159 (65%), Gaps = 13/159 (8%)
Query: 51 EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
++ I++ NFNQLYKIDP+T + W+ VL AVPNSILWLL+FP +GE+++ ATA+ +
Sbjct: 1210 DETIIFGNFNQLYKIDPTTFRTWLRVLAAVPNSILWLLRFPDLGESHLMATARLWATTEV 1269
Query: 110 -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET--LASR 166
R++F++VA K H+ R ++ D+ LDT CN HTT+ DVLW+GTP++TLP + + SR
Sbjct: 1270 ASRLIFTDVAPKHLHISRARICDLFLDTAECNAHTTAADVLWSGTPLLTLPRYSYKMCSR 1329
Query: 167 VAASQL---------ATLGCPELIARTHKEYQDIAIRLG 196
+AAS L ELIA +Y+ AI LG
Sbjct: 1330 MAASILRGALPKTGEGVQAARELIATDEDDYESKAIALG 1368
>gi|327296958|ref|XP_003233173.1| UDP-N-acetylglucosaminyltransferase [Trichophyton rubrum CBS 118892]
gi|326464479|gb|EGD89932.1| UDP-N-acetylglucosaminyltransferase [Trichophyton rubrum CBS 118892]
Length = 1678
Score = 142 bits (358), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 70/161 (43%), Positives = 103/161 (63%), Gaps = 13/161 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D ++ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP VGE N++ TA+A +
Sbjct: 1453 LKDDTVILGNFNQLYKIEPTTFRTWLRILAGIPNAILWLLRFPDVGEHNLRQTAKAWAGE 1512
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P +
Sbjct: 1513 ATASRIVFTDVAPKHTHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTFPRYKYKMC 1572
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLG 196
SR+A+S L + +LI+ EY+ AI LG
Sbjct: 1573 SRMASSILTSALPQTEDGKQAARQLISSDEVEYEKTAIELG 1613
>gi|350638065|gb|EHA26421.1| hypothetical protein ASPNIDRAFT_172247 [Aspergillus niger ATCC 1015]
Length = 1603
Score = 142 bits (358), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 108/160 (67%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I+ NFNQLYKI+P+T + W+ +L +PN++LWLL+FP +GE N++ TA A +
Sbjct: 1379 LSDDTIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLRETAVAWAGE 1438
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1439 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1498
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
SR+A+S L++ +LIA + ++Y++ AIRL
Sbjct: 1499 SRMASSILSSALPKSDAGREAQSDLIAMSDEDYENKAIRL 1538
>gi|440636477|gb|ELR06396.1| hypothetical protein GMDG_02113 [Geomyces destructans 20631-21]
Length = 1643
Score = 142 bits (358), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 69/162 (42%), Positives = 106/162 (65%), Gaps = 13/162 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D I+ NFNQLYKI+P+T + W+ +L VP ++LWLL+FP +GE+N++ TA+ +
Sbjct: 1421 LPDDVIILGNFNQLYKIEPTTFRTWLRILANVPKAVLWLLRFPDLGESNLRRTARLWAGE 1480
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ RILF++VA K H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP +
Sbjct: 1481 EVASRILFTDVAPKNLHISRARICDLFLDTPECNAHTTAADILWSSTPLLTLPRYEYKMC 1540
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGT 197
SR+AAS L ELIA ++Y++ A++L +
Sbjct: 1541 SRMAASILKGALPRSDEGFDAASELIATDEEQYEEFAVKLAS 1582
>gi|328766887|gb|EGF76939.1| hypothetical protein BATDEDRAFT_33767 [Batrachochytrium
dendrobatidis JAM81]
Length = 452
Score = 142 bits (358), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 64/126 (50%), Positives = 88/126 (69%), Gaps = 4/126 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L ED +++ NFNQLYK+DP W+ +L VPNSILWLL+FP GEA+++ A L +
Sbjct: 100 LAEDTVIFANFNQLYKVDPHIFATWMRILSRVPNSILWLLRFPPAGEAHLRRKAVELVGE 159
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE--TLA 164
R++F++VA K H+ RG++ADV LDTP CN HTT+ D+LW+GTP++T P +
Sbjct: 160 SVSRRLIFTDVAPKHLHIHRGRIADVFLDTPECNAHTTAADILWSGTPIITYPKYDFKMC 219
Query: 165 SRVAAS 170
SRVAAS
Sbjct: 220 SRVAAS 225
>gi|338739161|ref|YP_004676123.1| hypothetical protein HYPMC_2335 [Hyphomicrobium sp. MC1]
gi|337759724|emb|CCB65555.1| TPR repeat-containing protein [Hyphomicrobium sp. MC1]
Length = 835
Score = 142 bits (357), Expect = 9e-32, Method: Composition-based stats.
Identities = 66/157 (42%), Positives = 97/157 (61%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR +GLPEDA V+C+FN YK++ + +W+ +L+ VP S+LWLL N++ A
Sbjct: 610 SRADFGLPEDAFVFCSFNNSYKLNATMFDVWMPLLQKVPGSVLWLLVPNTTCAENLRREA 669
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G+D R++F+ EH+ R + AD+ LD CN HTT+ D LW G PV+T G+T
Sbjct: 670 EARGVDPSRLVFAKRMPIAEHLARHRFADLFLDALPCNAHTTTTDALWAGLPVLTCLGDT 729
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
A RVAAS L+ + PELI +Y D+A+ L ++
Sbjct: 730 FAGRVAASLLSAIELPELITTNLADYSDLALELAQNK 766
>gi|453087624|gb|EMF15665.1| glycosyltransferase family 41 protein [Mycosphaerella populorum
SO2202]
Length = 1685
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 107/162 (66%), Gaps = 13/162 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I++ NFNQLYKI+P+T + W+ +L VPNSILWLL+FP +GE+++ ATA+
Sbjct: 1452 LSDDTIIFGNFNQLYKIEPTTFRTWLRILARVPNSILWLLRFPDLGESHLLATARLWAGR 1511
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ R++F++VA K H+ R ++ D+ LDT CN HTT+ DVLW+GTP++TLP +
Sbjct: 1512 EVASRVIFTDVAPKHLHISRARVCDLVLDTAECNAHTTAADVLWSGTPLLTLPRYQYKMC 1571
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGT 197
SR+AAS L ELIA+ ++Y++ A+ LGT
Sbjct: 1572 SRMAASILKGALPKNEEGRRAETELIAKHEEDYEEKAVALGT 1613
>gi|67515967|ref|XP_657869.1| hypothetical protein AN0265.2 [Aspergillus nidulans FGSC A4]
gi|40746982|gb|EAA66138.1| hypothetical protein AN0265.2 [Aspergillus nidulans FGSC A4]
gi|259489517|tpe|CBF89854.1| TPA: UDP-N-acetylglucosaminyltransferase (AFU_orthologue;
AFUA_1G03380) [Aspergillus nidulans FGSC A4]
Length = 1596
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 107/160 (66%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I+ NFNQLYKI+P+T + W+ +L ++PN++LWLL+FP +GE N++ TA A +
Sbjct: 1372 LSDDTIILGNFNQLYKIEPTTFRTWLRILASIPNAVLWLLRFPDIGEQNLRETAVAWAGE 1431
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP +
Sbjct: 1432 ETASRIIFTDVAPKNTHIARAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1491
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
SR+A+S L++ ELIA + ++Y+ AI L
Sbjct: 1492 SRMASSILSSALPKTESGQQAREELIALSDEDYEKKAIHL 1531
>gi|115388427|ref|XP_001211719.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114195803|gb|EAU37503.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 1573
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 107/160 (66%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +DAI+ NFNQLYKI+P+T + W+ +L +PN++LWLL+FP +GE N++ TA A +
Sbjct: 1349 LRDDAIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLRDTAVAWAGE 1408
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P +
Sbjct: 1409 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTYPRYKYKMC 1468
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
SR+A+S L++ ELIA + +Y++ AIRL
Sbjct: 1469 SRMASSILSSALPDSDAGNQAREELIAVSDDDYENKAIRL 1508
>gi|340904950|gb|EGS17318.1| transferase-like protein [Chaetomium thermophilum var. thermophilum
DSM 1495]
Length = 1621
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 102/160 (63%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+DA++ NFNQLYKIDP+T + W+ +L A P + LWLL+FP +GE N+ TA+A +
Sbjct: 1386 LPDDAVILGNFNQLYKIDPTTFRTWLRILAACPKAYLWLLRFPDLGERNLMETARAWSGN 1445
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RI F++VA K++H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1446 AVASRIRFTDVAPKQDHISRAKVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1505
Query: 165 SRVAAS---------QLATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L EL+ +EY++ A+ L
Sbjct: 1506 SRMAASILRGALPRGPLGDQAARELVCADEREYEERAVAL 1545
>gi|358057401|dbj|GAA96750.1| hypothetical protein E5Q_03421 [Mixia osmundae IAM 14324]
Length = 1930
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/153 (45%), Positives = 103/153 (67%), Gaps = 5/153 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L + +++ NFNQLYKIDPST +W+ +L+AVPNS+LWLL+FPA GE N+ TA+ G +
Sbjct: 1690 LAPETVIFANFNQLYKIDPSTFCIWLKILQAVPNSVLWLLRFPAAGEPNLLDTAKRWGGE 1749
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
RILF++VA+K+EH+ RG++AD+ +DT HTT +DVLW+G+P+V P E +
Sbjct: 1750 AIAKRILFTSVASKDEHLNRGRVADLFIDTLELGSHTTCVDVLWSGSPIVVSPRKREKMG 1809
Query: 165 SRVAASQLATLGCPE-LIARTHKEYQDIAIRLG 196
SRV+ S + G + L+ +Y+ A LG
Sbjct: 1810 SRVSTSIVTASGFADALVVEDEAQYERRAAELG 1842
>gi|46110068|ref|XP_382092.1| hypothetical protein FG01916.1 [Gibberella zeae PH-1]
Length = 1596
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 104/160 (65%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I+ NFNQLYKI+P+T + W+ +L VP +++WLL+FP +GEAN++ TA+ +
Sbjct: 1370 LADDTIIMGNFNQLYKIEPTTFRTWLRILAQVPKAVIWLLRFPELGEANLRRTAKEWAGE 1429
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ R+LF++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1430 EVASRLLFTDVAPKSQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1489
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L ELIA KEY+ A++L
Sbjct: 1490 SRMAASILKGALPKSDEGRQAAAELIAGDEKEYEQRAVKL 1529
>gi|253999006|ref|YP_003051069.1| hypothetical protein Msip34_1296 [Methylovorus glucosetrophus
SIP3-4]
gi|253985685|gb|ACT50542.1| Tetratricopeptide TPR_2 repeat protein [Methylovorus glucosetrophus
SIP3-4]
Length = 697
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 96/156 (61%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R YGLPE+A VYC FNQ +KI P+ W+ +L+ VP+ +LWLL+ A N+Q A
Sbjct: 500 ARVDYGLPEEAFVYCCFNQSFKITPAVFACWMRILQQVPDGVLWLLESNATATLNLQQAA 559
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G+D R++F+ A +H+ R LAD+ LDT N HTT+ D LW PV+T G+T
Sbjct: 560 KAAGVDPARLVFAPRVAMADHLARHALADLFLDTLPYNAHTTASDALWMCLPVLTCSGDT 619
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRVA S L + PELIA Y+ +AI L D
Sbjct: 620 FASRVAGSLLHAVNLPELIAPDMAAYEAMAINLKHD 655
>gi|452986315|gb|EME86071.1| glycosyltransferase family 41 protein, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 1463
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/161 (44%), Positives = 106/161 (65%), Gaps = 13/161 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I++ NFNQLYKI+P+T + W+ +L VPNSILWLL+FP +GE+++ ATA+
Sbjct: 1237 LADDTIIFGNFNQLYKIEPTTFRTWLRILARVPNSILWLLRFPDLGESHLLATARMWAGA 1296
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ R++F++VA K H+ R ++ D+ +DT CN HTT+ DVLW+GTP++TLP +
Sbjct: 1297 EVASRVIFTDVAPKHLHISRARICDLVVDTAECNAHTTAADVLWSGTPLLTLPRYPYKMC 1356
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLG 196
SR+AAS L ELIA + +EY++ A+ LG
Sbjct: 1357 SRMAASILKGALPKNAEGERAARELIAASEEEYEEKAVALG 1397
>gi|84503473|ref|ZP_01001528.1| TPR repeat protein [Oceanicola batsensis HTCC2597]
gi|84388151|gb|EAQ01104.1| TPR repeat protein [Oceanicola batsensis HTCC2597]
Length = 630
Score = 141 bits (355), Expect = 2e-31, Method: Composition-based stats.
Identities = 68/164 (41%), Positives = 99/164 (60%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
P+ + +R +GLPE+ V C FN KI P +W+ +L+AVP S+LWLL+
Sbjct: 410 PEPRITGTRADHGLPEEGPVLCCFNAGRKIGPQEFDLWMRLLRAVPASVLWLLRSNPRMV 469
Query: 96 ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
N++ A G+D R++F++ E H+ R AD+ LDT NGHTT+ D LW G PV
Sbjct: 470 GNLRGEAARRGVDPDRLVFADPCPHESHIARYVHADIVLDTFRYNGHTTTSDALWAGVPV 529
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
VT+ G A+RVAAS L+ G P+L+ T +EY+ +++ L TDR
Sbjct: 530 VTMAGRQFAARVAASLLSASGLPDLVTETVQEYEALSLALATDR 573
>gi|171692141|ref|XP_001910995.1| hypothetical protein [Podospora anserina S mat+]
gi|170946019|emb|CAP72820.1| unnamed protein product [Podospora anserina S mat+]
Length = 1635
Score = 140 bits (354), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 69/160 (43%), Positives = 103/160 (64%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+DAI+ NFNQLYKIDP+T + W+ +L + P + LWLL+FP +GE +++ TA+ +
Sbjct: 1405 LPDDAIILGNFNQLYKIDPTTFRTWLRILASCPKAHLWLLRFPELGETHLRRTAKDWAGE 1464
Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
RI F++VA K++H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP +
Sbjct: 1465 AVASRIHFTDVAPKQQHISRARVCDLFLDTPECNAHTTAADILWSSTPLLTLPRYEYKMC 1524
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L ELIA K+Y++ A+ L
Sbjct: 1525 SRMAASILKGALPKSEEGERAAKELIAEDEKQYEEFAVGL 1564
>gi|296422902|ref|XP_002840997.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637225|emb|CAZ85188.1| unnamed protein product [Tuber melanosporum]
Length = 1503
Score = 140 bits (354), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 115/201 (57%), Gaps = 24/201 (11%)
Query: 8 QTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDP 67
Q++ + Q G QT E PQ L D ++ NFNQLYKI+P
Sbjct: 1243 QSAPDSKSRQLGWEEEQTRRWAKRKELFPQ-----------LKPDNVILGNFNQLYKIEP 1291
Query: 68 STLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVAAKEEHVR 125
+T + W+ +L VP ++LWLL+FP +GE+N++A A A ++ RI+F++VA K +H+
Sbjct: 1292 TTFRTWLRILSRVPKAVLWLLRFPDLGESNLKALAVAWAGEEIASRIIFTDVAPKLQHIS 1351
Query: 126 RGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLASRVAASQL--ATLGCPE-- 179
R Q+ D+ LDTP CN HTT+ DVLW+GTP++T P + SR+AAS L A PE
Sbjct: 1352 RAQVCDIFLDTPECNAHTTAADVLWSGTPLLTFPRHKHKMCSRIAASILRAAVPQTPEGK 1411
Query: 180 -----LIARTHKEYQDIAIRL 195
LI + +EY+D A L
Sbjct: 1412 AMANSLIVDSEEEYEDRAAAL 1432
>gi|121703103|ref|XP_001269816.1| UDP-N-acetylglucosaminyltransferase [Aspergillus clavatus NRRL 1]
gi|119397959|gb|EAW08390.1| UDP-N-acetylglucosaminyltransferase [Aspergillus clavatus NRRL 1]
Length = 1669
Score = 140 bits (354), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 106/160 (66%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I+ NFNQLYKI+P+T + W+ +L +PN+ILWLL+FP +GE N++ TA A
Sbjct: 1445 LSDDTIILGNFNQLYKIEPTTFRTWLRILARIPNAILWLLRFPELGEQNLRETAIAWAGQ 1504
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P +
Sbjct: 1505 ETASRIIFTDVAPKNTHIARAKILDLFLDTPECNAHTTATDVLWSGTPLLTFPRYKYKMC 1564
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
SR+A+S L++ EL+A + ++Y++ AIRL
Sbjct: 1565 SRMASSILSSALPNSEAGHKARTELMAVSDEDYEEKAIRL 1604
>gi|452963349|gb|EME68422.1| O-linked N-acetylglucosamine transferase [Magnetospirillum sp.
SO-1]
Length = 602
Score = 140 bits (354), Expect = 2e-31, Method: Composition-based stats.
Identities = 68/165 (41%), Positives = 100/165 (60%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
P++ + ++GLP DA V FN +KI+ +W++VL+A P+++LW ++F
Sbjct: 388 PRAAAREPKAKWGLPGDAFVVAVFNNSFKINAEAFAVWISVLQAQPDAVLWFVEFHPAAT 447
Query: 96 ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
A+++A AQA+G+D R++F+ ++ EH+ R AD+ LDT C GHTT+ D LW G P+
Sbjct: 448 ASLRAMAQAVGIDPARLVFAPRLSQAEHLARLSAADLFLDTWPCGGHTTASDALWAGVPL 507
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
V G T ASRVA S L LG ELIA + Y +A L DRD
Sbjct: 508 VAWAGRTFASRVAGSLLHALGFDELIAESQGAYYALAQHLAKDRD 552
>gi|83311456|ref|YP_421720.1| SPY protein [Magnetospirillum magneticum AMB-1]
gi|82946297|dbj|BAE51161.1| SPY protein [Magnetospirillum magneticum AMB-1]
Length = 798
Score = 140 bits (353), Expect = 3e-31, Method: Composition-based stats.
Identities = 70/166 (42%), Positives = 95/166 (57%)
Query: 32 GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
G+ P + R YGLPEDA VYC FN +K +W ++L+AVP +LWL +
Sbjct: 580 GDLAPVGEPVQPRSAYGLPEDAFVYCGFNNPFKFRAEVFDLWADILRAVPQGVLWLREDN 639
Query: 92 AVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
N+ A G+D R++F+ EH+ R +LAD+ LD N HTT+ D LW
Sbjct: 640 DYSRNNLGREIAARGIDPARLIFAQRTDFAEHMARHRLADLFLDCLPYNAHTTASDALWA 699
Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
G PV+T GET ASRVA S L+ LG PELI + +EY++ AI L +
Sbjct: 700 GLPVLTRVGETFASRVAGSLLSGLGLPELITESAEEYRERAIALAS 745
>gi|383757802|ref|YP_005436787.1| TPR repeat protein [Rubrivivax gelatinosus IL144]
gi|381378471|dbj|BAL95288.1| TPR repeat protein [Rubrivivax gelatinosus IL144]
Length = 639
Score = 140 bits (353), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 69/153 (45%), Positives = 96/153 (62%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R + GLPEDA+V C FNQ YKI P L +W VL +P ++LWLL + A G+ + A +
Sbjct: 432 RAELGLPEDALVLCCFNQSYKITPEVLALWARVLHELPQAVLWLLAWNADGQRRLLAALE 491
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
ALG+ R++F+ ++H+ R + AD+ LDT N HTT+ + LW PV+T+PGET
Sbjct: 492 ALGIGAERVVFAERLPVDQHIARLRAADLFLDTWPYNAHTTASESLWAAVPVLTVPGETF 551
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
ASRVAAS +A G PEL ++Y A+ LG
Sbjct: 552 ASRVAASLVAACGLPELACADAEDYVRTAVELG 584
>gi|148258992|ref|YP_001243577.1| hypothetical protein BBta_7846 [Bradyrhizobium sp. BTAi1]
gi|146411165|gb|ABQ39671.1| hypothetical protein BBta_7846 [Bradyrhizobium sp. BTAi1]
Length = 452
Score = 140 bits (353), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 95/160 (59%), Gaps = 1/160 (0%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ SR + GLPE V+C FNQ YK PS +W +L A P+S+LWLL + N++
Sbjct: 233 LPSRAEAGLPETGFVFCCFNQAYKFTPSVFDLWCRLLAATPDSVLWLLA-SDQAQGNLRG 291
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A G+ R++F+ + EH+RR QLAD+ LDT HTT+ D LW G P+VT G
Sbjct: 292 EALRRGVSPDRLVFAPQIGQSEHLRRLQLADLVLDTAPYGAHTTASDALWAGVPIVTCAG 351
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+TLASRVA S L +G PELIA +Y +A+ L + D
Sbjct: 352 DTLASRVAGSLLHAVGLPELIASDEADYVAVALTLAAEPD 391
>gi|70990478|ref|XP_750088.1| UDP-N-acetylglucosaminyltransferase [Aspergillus fumigatus Af293]
gi|66847720|gb|EAL88050.1| UDP-N-acetylglucosaminyltransferase [Aspergillus fumigatus Af293]
gi|159130569|gb|EDP55682.1| UDP-N-acetylglucosaminyltransferase [Aspergillus fumigatus A1163]
Length = 1634
Score = 140 bits (353), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 105/160 (65%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I+ NFNQLYKI+P+T + W+ +L +PN++LWLL+FP +GE N++ TA A
Sbjct: 1410 LSDDTIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLRETAIAWAGR 1469
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P +
Sbjct: 1470 ETASRIIFTDVAPKNAHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTYPRYKYKMC 1529
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
SR+A+S L++ EL A + ++Y+D AIRL
Sbjct: 1530 SRMASSILSSALPDSDAGRKARDELTAVSDEDYEDKAIRL 1569
>gi|291614668|ref|YP_003524825.1| hypothetical protein Slit_2210 [Sideroxydans lithotrophicus ES-1]
gi|291584780|gb|ADE12438.1| TPR repeat-containing protein [Sideroxydans lithotrophicus ES-1]
Length = 736
Score = 140 bits (352), Expect = 3e-31, Method: Composition-based stats.
Identities = 68/162 (41%), Positives = 103/162 (63%), Gaps = 3/162 (1%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
PQ + +R++ GLP V+C N YKI P L +W+ +LK VP S+LWL K A E
Sbjct: 505 PQQL---TREELGLPAHGFVFCCHNSSYKITPVDLDIWMRLLKRVPGSVLWLYKSSAEVE 561
Query: 96 ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
AN++ A++ G++ HR++F+N A ++ R ++AD+ LDT N TT+ + LW G PV
Sbjct: 562 ANLRREAESRGVEPHRLVFANQAPHATYLARYRMADLFLDTAFYNAQTTAAEALWAGLPV 621
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
+T PG T+ASRVA+ L +G E+IA + ++Y++ A RL T
Sbjct: 622 LTCPGVTMASRVASGLLHAIGLEEMIAGSPQQYEECAYRLAT 663
>gi|407772276|ref|ZP_11119578.1| hypothetical protein TH2_00215 [Thalassospira profundimaris WP0211]
gi|407284229|gb|EKF09745.1| hypothetical protein TH2_00215 [Thalassospira profundimaris WP0211]
Length = 629
Score = 140 bits (352), Expect = 4e-31, Method: Composition-based stats.
Identities = 66/155 (42%), Positives = 92/155 (59%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
TSR YGLP+ V+C+FN YKI P +W+ +L V S+LWL K N++
Sbjct: 416 TSRADYGLPDSGFVFCSFNNTYKISPREFDIWMRLLNQVEGSVLWLFKGNDYAAQNLRKE 475
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
AQ +D R++ + ++ EH+ R + D+ LDT N HTT+ D LW G P+VTLPGE
Sbjct: 476 AQKRDVDPDRLICAEKLSQSEHLARQKHGDLLLDTFNVNAHTTASDALWAGLPLVTLPGE 535
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
A+RVAAS L+ PELIA+ +Y+ IA+ L
Sbjct: 536 QFAARVAASILSAANLPELIAKDEADYEAIALDLA 570
>gi|428164760|gb|EKX33775.1| hypothetical protein GUITHDRAFT_81153 [Guillardia theta CCMP2712]
Length = 851
Score = 140 bits (352), Expect = 4e-31, Method: Composition-based stats.
Identities = 63/155 (40%), Positives = 96/155 (61%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+RQ+Y LP I++ NF QLYK++PS +WV ++K N+ LWLLKFP + A
Sbjct: 599 TRQEYKLPSSGIIFGNFGQLYKVEPSLFDVWVRIIKRTSNTSLWLLKFPKEAVKRLLKEA 658
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
GL + R++ +++ + H+ +ADV LDT + NGHTT D LW+G P+V+L GE
Sbjct: 659 DKRGLPRDRLVLTSLLPIDSHLAIKAVADVALDTNMFNGHTTGADTLWSGLPLVSLSGEQ 718
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
+ SR AS LG +AR+ ++Y++IA+RL +
Sbjct: 719 MRSRAGASMAYALGVTRWLARSLEDYEEIAVRLAS 753
>gi|296448837|ref|ZP_06890677.1| conserved hypothetical protein [Methylosinus trichosporium OB3b]
gi|296253657|gb|EFH00844.1| conserved hypothetical protein [Methylosinus trichosporium OB3b]
Length = 452
Score = 140 bits (352), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 69/155 (44%), Positives = 92/155 (59%), Gaps = 1/155 (0%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R GLPE +V+C FNQ YK+ P +W +L A P S+LWLL P + E N++ A
Sbjct: 246 RASVGLPEQGVVFCCFNQAYKLTPEIFDVWCRLLDAAPGSVLWLLAAP-MAEGNLRNEAW 304
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G++ +R++F+ + +H+ R QLAD+ LDT N HTT+ D LW G P+VT G T
Sbjct: 305 KRGVNGNRLVFAPDMGQGDHLARLQLADLVLDTAPYNAHTTASDALWAGVPIVTCSGSTF 364
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRVA S L +G PELI Y D+A RL D
Sbjct: 365 ASRVAGSVLRAVGMPELITEDLDGYFDLASRLAND 399
>gi|322695669|gb|EFY87473.1| UDP-N-acetylglucosaminyltransferase [Metarhizium acridum CQMa 102]
Length = 1746
Score = 140 bits (352), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 73/160 (45%), Positives = 100/160 (62%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L DAI+ NFNQLYKIDP+T + W+ +L VP ++LWLL+FP GE+N++ATA+A
Sbjct: 1520 LGNDAIILGNFNQLYKIDPTTFRAWLRILARVPRAVLWLLRFPEPGESNLRATAKAWAGP 1579
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K +H+ R + D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1580 EVAERIIFTDVAPKSQHISRATVCDMFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1639
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L ELI T EY+ A L
Sbjct: 1640 SRMAASILRGALPRSLEGQQAAEELITYTEAEYEQRAADL 1679
>gi|242803242|ref|XP_002484134.1| UDP-N-acetylglucosaminyltransferase [Talaromyces stipitatus ATCC
10500]
gi|218717479|gb|EED16900.1| UDP-N-acetylglucosaminyltransferase [Talaromyces stipitatus ATCC
10500]
Length = 1666
Score = 139 bits (351), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 69/163 (42%), Positives = 106/163 (65%), Gaps = 13/163 (7%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+DAI+ NFNQLYKI+P+T + W+ +L +P ++LWLL+FP +GE N++ A +
Sbjct: 1438 LPDDAIILGNFNQLYKIEPTTFRTWLRILSDIPKAVLWLLRFPELGEQNLKDCAVKWANE 1497
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K+ H+ R Q+ D+ LDTP CN HTT+ D+LW+GTP++T P +
Sbjct: 1498 EIASRIIFTDVAPKQAHIARAQVVDLFLDTPECNAHTTAADILWSGTPMLTYPRYKYKMC 1557
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
SR+A+S L++ ELIA + ++Y+ A RL D
Sbjct: 1558 SRMASSILSSALPDTEAGHQARKELIAISDEDYRAKASRLCRD 1600
>gi|298704865|emb|CBJ28382.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
family GT41 [Ectocarpus siliculosus]
Length = 916
Score = 139 bits (351), Expect = 5e-31, Method: Composition-based stats.
Identities = 62/147 (42%), Positives = 93/147 (63%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
+V CNFN+L+K+DP T W+ VL+ VP ++LWL+ N+ A+ G+D+ R++
Sbjct: 701 VVLCNFNRLHKLDPHTFGAWMEVLRTVPGTVLWLIDGGETARTNLLRQARLAGVDEWRVV 760
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F+ + KEEH++R +LAD+ +DTPL N HT D LW G P+VTL G +ASRV AS +
Sbjct: 761 FAPLVGKEEHLQRLRLADLFVDTPLYNAHTVGCDALWAGVPMVTLRGAKMASRVGASLVE 820
Query: 174 TLGCPELIARTHKEYQDIAIRLGTDRD 200
G PEL+ + +EY + + L D +
Sbjct: 821 AAGMPELVTDSLEEYTQLVLALARDNE 847
>gi|323447882|gb|EGB03789.1| putative O-linked GlcNAc transferase [Aureococcus anophagefferens]
Length = 971
Score = 139 bits (350), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 96/158 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+ YGLP+DA V+ F +IDP +W+++LK VPNS+LWL ++ A
Sbjct: 760 ARETYGLPQDAFVFAYFGDSDRIDPIIFSVWMSILKRVPNSLLWLHMNSETVIDRLKKEA 819
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ + + R++FSN + EHV LAD+ LDTP CN ++D LW GTPV+ L G T
Sbjct: 820 RGHQVIEQRLIFSNSVPRREHVFHAMLADIVLDTPACNALDATLDALWAGTPVIALLGNT 879
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+A+R +AS + +GC ELI+ +Y+D+AI L D D
Sbjct: 880 IATRTSASLCSAVGCHELISANLGDYEDLAISLAIDSD 917
>gi|212539952|ref|XP_002150131.1| UDP-N-acetylglucosaminyltransferase [Talaromyces marneffei ATCC
18224]
gi|210067430|gb|EEA21522.1| UDP-N-acetylglucosaminyltransferase [Talaromyces marneffei ATCC
18224]
Length = 1667
Score = 139 bits (350), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 69/164 (42%), Positives = 107/164 (65%), Gaps = 15/164 (9%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+DAI+ NFNQLYKI+P+T + W+ +L +P ++LWLL+FP +GE N++ A +
Sbjct: 1434 LPDDAIILGNFNQLYKIEPTTFRTWLRILSDIPKAVLWLLRFPELGEQNLKDCAVQWANE 1493
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K+ H+ R Q+ D+ LDTP CN HTT+ D+LW+GTP++T P +
Sbjct: 1494 EIASRIIFTDVAPKQAHIARAQVVDLFLDTPECNAHTTAADILWSGTPMLTFPRYKYKMC 1553
Query: 165 SRVAASQLATLGCP----------ELIARTHKEYQDIAIRLGTD 198
SR+A+S L++ P EL+A + ++Y+ A RL D
Sbjct: 1554 SRMASSILSS-ALPDTEAGRQARNELVAISDEDYRAKASRLCRD 1596
>gi|392571498|gb|EIW64670.1| TPR-like protein [Trametes versicolor FP-101664 SS1]
Length = 1237
Score = 139 bits (350), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 99/153 (64%), Gaps = 5/153 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP+D I++ NFNQLYKIDP +W+ +L+ VP S+LWLL+FP GE ++ TA+ +
Sbjct: 984 LPQDVIIFANFNQLYKIDPGIFLVWLRILRQVPRSVLWLLRFPGAGEEQLRRTAKMWAGE 1043
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ RI F++VA K+ HV R ++AD+ LDT CN HT + DVLWTGTP++T P +
Sbjct: 1044 EVAARIRFTDVAKKDWHVFRARVADLFLDTAECNAHTIAADVLWTGTPILTWPKHRHKMC 1103
Query: 165 SRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
SRVAAS + G ++ + +EY+ A+ L
Sbjct: 1104 SRVAASMVNATGFGAQMTVHSAEEYERRAVALA 1136
>gi|418056785|ref|ZP_12694836.1| Tetratricopeptide TPR_2 repeat-containing protein [Hyphomicrobium
denitrificans 1NES1]
gi|353207557|gb|EHB72964.1| Tetratricopeptide TPR_2 repeat-containing protein [Hyphomicrobium
denitrificans 1NES1]
Length = 818
Score = 139 bits (349), Expect = 9e-31, Method: Composition-based stats.
Identities = 69/176 (39%), Positives = 103/176 (58%), Gaps = 5/176 (2%)
Query: 24 QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
Q N + E P +R +GLPEDA V+C+FN YK++ + +W+++L+ V S
Sbjct: 595 QPNDRKRKISETP-----VTRADFGLPEDAFVFCSFNNSYKLNATMFDVWMSLLRRVAGS 649
Query: 84 ILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
+LWLL + N++ A A G+D R++F++ EH+ R +LAD+ LD CN HT
Sbjct: 650 VLWLLVPTSACRENLRREAAARGVDPDRLVFASRLPIAEHLARHRLADLFLDALPCNAHT 709
Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
T+ D LW G PV+T GET + RVAAS L G PEL+ + +Y + A+ L D+
Sbjct: 710 TASDALWAGLPVLTAMGETFSGRVAASLLTAAGLPELVTGSLADYAEAALALAQDK 765
>gi|443326782|ref|ZP_21055424.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Xenococcus sp. PCC 7305]
gi|442793575|gb|ELS03020.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Xenococcus sp. PCC 7305]
Length = 1023
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 64/157 (40%), Positives = 96/157 (61%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ SR++ LP ++C FN YKI P +W+ +LKAVP+S+LWLL N++
Sbjct: 525 VPSREECNLPPQGFIFCGFNNSYKITPEFFNIWMRLLKAVPHSVLWLLSKNQDMTENLRQ 584
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
AQA G+D R++F+ +H+ R QLAD+ LD CN HTT+ D LW G PV+T G
Sbjct: 585 EAQARGVDPDRLVFAPKKDLPDHLARQQLADLFLDNLPCNAHTTTSDALWVGLPVLTCAG 644
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
++ A+RVA S L +G PEL+ +Y+ +A+++ T
Sbjct: 645 QSFAARVAGSLLHAIGLPELVTYNLPDYEALALKIAT 681
>gi|83312909|ref|YP_423173.1| O-linked N-acetylglucosamine transferase [Magnetospirillum
magneticum AMB-1]
gi|82947750|dbj|BAE52614.1| Predicted O-linked N-acetylglucosamine transferase
[Magnetospirillum magneticum AMB-1]
Length = 591
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 66/164 (40%), Positives = 101/164 (61%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
P+++ + ++GLPE+++V FN +KI+ TL +W++VL+A P+++LW ++F
Sbjct: 377 PRALTREPKAKWGLPEESLVVAVFNNTFKINAETLAVWISVLQAQPDAVLWFVEFHPAAT 436
Query: 96 ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
A+++A A+G+D R++F+ ++ EH+ R AD+ LDT C GHTT+ D LW G PV
Sbjct: 437 ASLRAMISAVGIDPARLIFAPRLSQAEHMARLSAADLFLDTWPCAGHTTASDALWAGVPV 496
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
V G T ASRVA S L LG ELI + Y +A L DR
Sbjct: 497 VAWVGRTFASRVAGSLLHALGLDELITESQGAYHALAQHLAKDR 540
>gi|145588309|ref|YP_001154906.1| hypothetical protein Pnuc_0121 [Polynucleobacter necessarius subsp.
asymbioticus QLW-P1DMWA-1]
gi|145046715|gb|ABP33342.1| TPR repeat-containing protein [Polynucleobacter necessarius subsp.
asymbioticus QLW-P1DMWA-1]
Length = 732
Score = 138 bits (348), Expect = 1e-30, Method: Composition-based stats.
Identities = 69/160 (43%), Positives = 96/160 (60%)
Query: 38 SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
S +I SR++ GLPE V+C FN YKI P+ L WV +L AV S+LWL + + AN
Sbjct: 515 SPLIKSRKELGLPESGFVFCCFNNNYKITPAVLDGWVKILLAVEGSVLWLYEDNPIAVAN 574
Query: 98 IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
++ A GLD R +F+ +H+ R + A++ LDT CN HTT+ D LW G PV+T
Sbjct: 575 LKQEALTRGLDAGRFIFAGRMDSADHLARYKNANLFLDTTPCNAHTTASDALWAGLPVLT 634
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
L GE+ +RVAAS +G L T +EY+ +AI+L T
Sbjct: 635 LAGESFGARVAASLNNAVGLSGLTVETQEEYEALAIQLAT 674
>gi|119357637|ref|YP_912281.1| hypothetical protein Cpha266_1841 [Chlorobium phaeobacteroides DSM
266]
gi|119354986|gb|ABL65857.1| TPR repeat-containing protein [Chlorobium phaeobacteroides DSM 266]
Length = 3035
Score = 138 bits (347), Expect = 1e-30, Method: Composition-based stats.
Identities = 67/158 (42%), Positives = 95/158 (60%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R + GLPE V+C FN YKI P+T W+ +L VP S+LWL + A N++
Sbjct: 2816 VFTRAECGLPESGFVFCCFNNTYKITPATFDGWMRILGQVPGSVLWLYEENAKAAENLRR 2875
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A + G+D R++F E++ R ++AD+ LDT N TT+ D LW G PV+TL G
Sbjct: 2876 EAASRGVDAGRLIFGKRLPVAEYLARYRVADLFLDTLPYNAGTTASDALWAGLPVLTLRG 2935
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
E+ ASR+AAS L +G PELI +EY+ +AI L D
Sbjct: 2936 ESFASRMAASLLTAIGLPELITSGQEEYESLAIELALD 2973
Score = 137 bits (346), Expect = 2e-30, Method: Composition-based stats.
Identities = 67/158 (42%), Positives = 95/158 (60%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R + GLPE V+C FN YKI P+T W+ +L VP S+LWL + A N++
Sbjct: 1390 VFTRAECGLPESGFVFCCFNNNYKITPATFDGWMRILGQVPGSVLWLFEENAKAAENLRR 1449
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A + G+D R++F E++ R ++AD+ LDT N TT+ D LW G PV+TL G
Sbjct: 1450 EAASRGVDAGRLIFGKRLPVAEYLARYRVADLFLDTLPYNAGTTASDALWAGLPVLTLRG 1509
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
E+ ASR+AAS L +G PELI +EY+ +AI L D
Sbjct: 1510 ESFASRMAASLLTAIGLPELITSGQEEYESLAIELALD 1547
Score = 133 bits (335), Expect = 4e-29, Method: Composition-based stats.
Identities = 64/158 (40%), Positives = 95/158 (60%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R + GLPE V+C FN YKI P+T W+ +L VP S+L+L +N++
Sbjct: 2035 VFTRAECGLPESGFVFCCFNNNYKITPATFDGWMRILGQVPGSVLFLYTDNEAAASNLKK 2094
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A++ G+ + R++F EH+ R ++AD+ LDT N TT+ D LW G PV+TL G
Sbjct: 2095 EAESRGVKRDRLIFGKRLPLAEHLARYRVADLFLDTNPYNAGTTASDALWAGLPVLTLRG 2154
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
E+ ASR+AAS L +G PELI +EY+ +A+ L D
Sbjct: 2155 ESFASRMAASLLTAIGLPELITSGQEEYEALAVELALD 2192
Score = 130 bits (328), Expect = 2e-28, Method: Composition-based stats.
Identities = 65/158 (41%), Positives = 94/158 (59%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R + GLPE V+C FN YKI P+T W+ +L V S+L+L +N++
Sbjct: 575 VFTRAECGLPESGFVFCCFNNNYKITPATFDGWMRILGQVEGSVLFLYTDNEAAASNLKK 634
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A++ G+ + R++F EH+ R ++AD+ LDT N TT+ D LW G PV+TL G
Sbjct: 635 EAESRGVKRDRLIFGKRLPLAEHLARYRVADLFLDTLPYNAGTTASDALWAGLPVLTLRG 694
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
E+ ASR+AAS L +G PELI T +EY+ AI L D
Sbjct: 695 ESFASRMAASLLTAIGLPELITTTQEEYEARAIELALD 732
>gi|390604347|gb|EIN13738.1| hypothetical protein PUNSTDRAFT_140213 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1253
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/150 (45%), Positives = 99/150 (66%), Gaps = 5/150 (3%)
Query: 52 DAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ-- 109
D +++ NF+QLYKI P+T MW+ +L+ VP SILWLL+FPA G+ ++ TA A D+
Sbjct: 1006 DFVIFANFSQLYKICPATFSMWLKILERVPRSILWLLRFPAAGQEHLLRTASAWANDEVA 1065
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLASRV 167
RI F++V K +HV RG++AD+ LDT CN HT + D LW+GTP++T P +ASRV
Sbjct: 1066 ARIRFTDVTDKHQHVIRGRVADLFLDTIECNAHTVAADALWSGTPLITWPRYSHKMASRV 1125
Query: 168 AASQLATLGCPE-LIARTHKEYQDIAIRLG 196
AS + G + ++ + +EYQD A+ L
Sbjct: 1126 GASIVNATGFGDRMVVHSEEEYQDRAVFLA 1155
>gi|393240238|gb|EJD47765.1| protein prenylyltransferase [Auricularia delicata TFB-10046 SS5]
Length = 1522
Score = 137 bits (346), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 71/153 (46%), Positives = 101/153 (66%), Gaps = 5/153 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I++ NFNQLYKIDP W+ +L+ VPNSILWLL+FPA GE N+++ A+ +
Sbjct: 1285 LHDDTIIFANFNQLYKIDPVVFATWLRILRQVPNSILWLLRFPAAGENNLRSFARLWAGE 1344
Query: 109 QH--RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE--TLA 164
+ RI F++V AKE H++R ++AD+ LDT CN HT + DVLW+GTP++T P +
Sbjct: 1345 EFSARIRFTDVCAKERHIQRSRVADLFLDTIECNAHTIATDVLWSGTPILTWPKYKFKMC 1404
Query: 165 SRVAAS-QLATLGCPELIARTHKEYQDIAIRLG 196
SRVAAS AT ++ + +EY+ AI L
Sbjct: 1405 SRVAASIARATGFGQQMTVSSLEEYEARAIALA 1437
>gi|400596152|gb|EJP63936.1| tetratricopeptide-like protein [Beauveria bassiana ARSEF 2860]
Length = 811
Score = 137 bits (345), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/160 (45%), Positives = 102/160 (63%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D ++ NFNQLYKIDPST + W+ +L VP +ILWLL+FP GEAN++ TA+
Sbjct: 585 LQDDTVILANFNQLYKIDPSTFRSWLRILAQVPKAILWLLRFPEAGEANLRRTAELWAGP 644
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ R++F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++T P +
Sbjct: 645 EVASRLVFTDVAQKSQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTYPRYPYKMC 704
Query: 165 SRVAASQL--ATLGCP-------ELIARTHKEYQDIAIRL 195
+R+AAS L A P ELIA + K Y+ AI L
Sbjct: 705 ARMAASILKGALPRTPEGQRVAGELIATSEKMYEKSAIEL 744
>gi|119497099|ref|XP_001265316.1| UDP-N-acetylglucosaminyltransferase [Neosartorya fischeri NRRL 181]
gi|119413478|gb|EAW23419.1| UDP-N-acetylglucosaminyltransferase [Neosartorya fischeri NRRL 181]
Length = 1661
Score = 137 bits (344), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 69/160 (43%), Positives = 104/160 (65%), Gaps = 13/160 (8%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I+ NFNQLYKI+P+T + W+ +L +PN++LWLL+FP +GE N++ TA A
Sbjct: 1437 LSDDTIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLRETAIAWAGR 1496
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P +
Sbjct: 1497 ETASRIIFTDVAPKNAHIARAKILDLFLDTPECNAHTTATDVLWSGTPLLTYPRYKYKMC 1556
Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
SR+A+S L++ EL A + ++Y+ AIRL
Sbjct: 1557 SRMASSILSSALPDSDAGRKARDELTAVSDEDYEGKAIRL 1596
>gi|300693287|ref|YP_003749260.1| hypothetical protein RPSI07_mp0265 [Ralstonia solanacearum PSI07]
gi|299075324|emb|CBJ34618.1| conserved hypothethical protein with TPR repeat domain [Ralstonia
solanacearum PSI07]
Length = 675
Score = 137 bits (344), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 95/155 (61%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R GLP D V+C FNQ YKI + W +L P S+LWLL+ A +A + +A+
Sbjct: 471 RADVGLPADGFVFCCFNQAYKITEARANTWFTILSRTPGSVLWLLEPDASAKAALLESAR 530
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G++ HR++F+ A+ EH+ R QLAD+ LDT HTT+ D+LW G P++T G+T+
Sbjct: 531 RHGVESHRLVFAPQVAQREHIARLQLADLALDTFPYTSHTTASDLLWAGVPLLTRMGDTM 590
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRVAAS L G +L+ T ++Y D A+RL D
Sbjct: 591 ASRVAASILQAAGLHDLVVTTEEDYVDAAVRLAGD 625
>gi|344171339|emb|CCA83825.1| conserved hypothethical protein with TPR repeat domain [blood
disease bacterium R229]
Length = 646
Score = 137 bits (344), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 95/155 (61%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R GLP D V+C FNQ YKI + W +L P S+LWLL+ A +A + +A+
Sbjct: 442 RADVGLPADGFVFCCFNQAYKITEARANTWFTILSRTPGSVLWLLEPDASAKAALLESAR 501
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G++ HR++F+ A+ EH+ R QLAD+ LDT HTT+ D+LW G P++T G+T+
Sbjct: 502 RHGVESHRLVFAPQVAQREHIARLQLADLALDTFPYTSHTTASDLLWAGVPLLTRMGDTM 561
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRVAAS L G +L+ T ++Y D A+RL D
Sbjct: 562 ASRVAASILQAAGLHDLVVTTEEDYVDAAVRLAGD 596
>gi|145588676|ref|YP_001155273.1| hypothetical protein Pnuc_0489 [Polynucleobacter necessarius subsp.
asymbioticus QLW-P1DMWA-1]
gi|145047082|gb|ABP33709.1| TPR repeat-containing protein [Polynucleobacter necessarius subsp.
asymbioticus QLW-P1DMWA-1]
Length = 618
Score = 137 bits (344), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 69/156 (44%), Positives = 91/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R + GLPE VYC FN YKI P T W+ +LKAV S+LWL++ E N++A A
Sbjct: 412 TRAELGLPESGFVYCCFNNNYKITPETFDSWMRILKAVEGSVLWLIQDNVPAEENLKAEA 471
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+ RI+F+ EH+ R ++AD+ LDT N HTT+ D LW G PV+TL G T
Sbjct: 472 LKRGISSDRIIFAQRLPLPEHLARHKIADLFLDTLPYNAHTTASDALWAGVPVLTLLGNT 531
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
RVAAS L +G EL+ T + Y+ AI L D
Sbjct: 532 FPGRVAASLLNAVGLAELVTHTPQAYEQRAIELARD 567
>gi|398825494|ref|ZP_10583785.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Bradyrhizobium sp. YR681]
gi|398223362|gb|EJN09707.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Bradyrhizobium sp. YR681]
Length = 757
Score = 137 bits (344), Expect = 3e-30, Method: Composition-based stats.
Identities = 64/163 (39%), Positives = 98/163 (60%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
P + + +R++ GLP ++C FN YKI P +W+ +L+ V S+LWL + E
Sbjct: 539 PDAAHMPTREEAGLPCHGFIFCCFNSSYKITPEIFDIWMRLLRQVSGSVLWLYADRSSAE 598
Query: 96 ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
N++ A A G+D R++F+ A+ +H+ R +LAD+ LDT N HTT+ D LW G PV
Sbjct: 599 VNLRQAAAAHGIDPARLVFARRMAQGDHLARHRLADLFLDTLPYNAHTTAADALWAGLPV 658
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
VT PG++ A RVA+S L +G PEL+ Y+ +A+RL +
Sbjct: 659 VTCPGKSFAGRVASSLLRGIGMPELVTSDLGSYERLALRLALE 701
>gi|345564020|gb|EGX47001.1| hypothetical protein AOL_s00097g47 [Arthrobotrys oligospora ATCC
24927]
Length = 1822
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 60/126 (47%), Positives = 92/126 (73%), Gaps = 4/126 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D ++ NFNQLYKI+P+T + W+ +L VPN++LWLL+FP +GE+N+ A+ D
Sbjct: 1598 LSDDKVILGNFNQLYKIEPTTFRTWLRILMQVPNAVLWLLRFPDLGESNLLRFARLWAGD 1657
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE--TLA 164
+ RI+F++VAAK++H+ R ++ D+ LDTP CN HTT+ DVLW+G+P++T P +
Sbjct: 1658 EVASRIIFTDVAAKDQHISRARVCDLFLDTPECNAHTTAADVLWSGSPLLTFPRHEYKMC 1717
Query: 165 SRVAAS 170
SR+AAS
Sbjct: 1718 SRIAAS 1723
>gi|398379826|ref|ZP_10537946.1| hypothetical protein PMI03_03578 [Rhizobium sp. AP16]
gi|397722458|gb|EJK83002.1| hypothetical protein PMI03_03578 [Rhizobium sp. AP16]
Length = 647
Score = 136 bits (343), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 98/155 (63%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
T+R Q+GLPEDA V+ +FN KI + +W N+LK NS+LWLL EAN+ A
Sbjct: 416 TTRAQHGLPEDAFVFASFNGNRKITSQMVDVWCNILKRTKNSVLWLLCNGPRAEANLWAR 475
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
+A G+++ R++F+ E+H+ R QLAD+ LDT NGHTT+ + LW G PV+T+ G
Sbjct: 476 LEARGVNRKRVVFTTRIRYEDHIDRQQLADLGLDTFPVNGHTTTSEQLWGGLPVLTMKGT 535
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
ASRV+ S L +G PEL+A + Y+++A+
Sbjct: 536 NFASRVSESLLNAIGVPELVAADVQAYENMAVEFA 570
>gi|392378325|ref|YP_004985485.1| conserved protein of unknown function; putative TPR domain
[Azospirillum brasilense Sp245]
gi|356879807|emb|CCD00735.1| conserved protein of unknown function; putative TPR domain
[Azospirillum brasilense Sp245]
Length = 827
Score = 136 bits (343), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 69/156 (44%), Positives = 93/156 (59%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR GLPED V+C FN YK+ P+ +W +L+AVP S+LWL + N++ A
Sbjct: 624 SRADCGLPEDGFVFCCFNNPYKLTPAVFDVWARLLRAVPGSVLWLYAGNPLVAGNLRGEA 683
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ R++F+ EH+ R +LAD+ LDT N HTT+ D LWTG PVVT G T
Sbjct: 684 AARGVAPDRLVFAPPRPLVEHLARHRLADLFLDTLPYNAHTTASDALWTGLPVVTCRGAT 743
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A RVAAS L T+G PEL+ + Y+ +A+ L D
Sbjct: 744 FAGRVAASLLDTVGLPELVTDSPAAYEALALGLARD 779
>gi|222084872|ref|YP_002543401.1| glycosyltransferase domain-containing protein [Agrobacterium
radiobacter K84]
gi|221722320|gb|ACM25476.1| glycosyltransferase TPR domain protein [Agrobacterium radiobacter
K84]
Length = 673
Score = 136 bits (343), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 98/155 (63%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
T+R Q+GLPEDA V+ +FN KI + +W N+LK NS+LWLL EAN+ A
Sbjct: 442 TTRAQHGLPEDAFVFASFNGNRKITSQMVDVWCNILKRTKNSVLWLLCNGPRAEANLWAR 501
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
+A G+++ R++F+ E+H+ R QLAD+ LDT NGHTT+ + LW G PV+T+ G
Sbjct: 502 LEARGVNRKRVVFTTRIRYEDHIDRQQLADLGLDTFPVNGHTTTSEQLWGGLPVLTMKGT 561
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
ASRV+ S L +G PEL+A + Y+++A+
Sbjct: 562 NFASRVSESLLNAIGVPELVAADVQAYENMAVEFA 596
>gi|86138806|ref|ZP_01057378.1| Glycosyl transferase group 1:TPR repeat [Roseobacter sp. MED193]
gi|85824453|gb|EAQ44656.1| Glycosyl transferase group 1:TPR repeat [Roseobacter sp. MED193]
Length = 629
Score = 136 bits (343), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 64/158 (40%), Positives = 95/158 (60%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
I +R + GLPED V+C+ N YK+ P +W+ +L AVP+S+LWLL + N+
Sbjct: 406 IPTRTELGLPEDKFVFCSLNNSYKVTPVEYDIWMRLLHAVPDSVLWLLAANDYVQKNLID 465
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A A G+ R+ F+ + H+ R AD+ LDT C HTT+ + LW+G P++T PG
Sbjct: 466 EAAARGIGPERLFFAGRVSTTAHLARLPQADLFLDTFNCCAHTTASETLWSGVPLITKPG 525
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ ASRVAAS L +GC +LI + + Y D+A++L D
Sbjct: 526 DQFASRVAASILTAIGCEDLITDSAENYYDLALKLAQD 563
>gi|126737136|ref|ZP_01752871.1| TPR repeat protein [Roseobacter sp. SK209-2-6]
gi|126721721|gb|EBA18424.1| TPR repeat protein [Roseobacter sp. SK209-2-6]
Length = 738
Score = 136 bits (343), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 62/156 (39%), Positives = 92/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R GLP+D V+C+FN YKI P +W+ +L S+LWLL+ N + A
Sbjct: 503 TRADCGLPDDGFVFCSFNNSYKITPREFDIWMRLLAQTDGSVLWLLQTSQSSTENFRQEA 562
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+D R++F+ + EH+ R Q+AD+ LD+ N HTT D LW G P++TLPG+
Sbjct: 563 EKRGIDASRLIFAPPLPQAEHLARQQVADLFLDSFTVNAHTTGSDALWAGVPILTLPGKQ 622
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RV AS L +G PE+IA + +Y+ A+ L D
Sbjct: 623 FAARVGASLLHAVGLPEMIATSEADYEKRALELAHD 658
>gi|424880058|ref|ZP_18303690.1| hypothetical protein Rleg8DRAFT_1588 [Rhizobium leguminosarum bv.
trifolii WU95]
gi|392516421|gb|EIW41153.1| hypothetical protein Rleg8DRAFT_1588 [Rhizobium leguminosarum bv.
trifolii WU95]
Length = 638
Score = 136 bits (342), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 95/156 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+RQQ GLP+DA ++ +FN KI P T+ W +LK PNS+LWL+ +AN+
Sbjct: 418 TRQQLGLPDDAFIFASFNGNRKITPETIGSWCRILKRAPNSVLWLMANTPRNQANLLKQF 477
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G+ RI+F A EEH+ R Q AD+ +DT NGHTT+ + LW G PV+T+ G
Sbjct: 478 QAAGISAKRIIFCPRAPYEEHIDRQQAADIGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 537
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRV+ S L + P+L+A + Y+D+A+ L +
Sbjct: 538 FASRVSESLLRAIDLPDLVADDLQAYEDMAVELAEN 573
>gi|23016713|ref|ZP_00056466.1| COG3914: Predicted O-linked N-acetylglucosamine transferase,
SPINDLY family [Magnetospirillum magnetotacticum MS-1]
Length = 602
Score = 135 bits (341), Expect = 7e-30, Method: Composition-based stats.
Identities = 67/157 (42%), Positives = 95/157 (60%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
+ ++GLPEDA V FN +K + TL +WV VL+A P+++LW ++F A+++A
Sbjct: 396 KAKWGLPEDAFVIAVFNNSFKFNAETLAVWVAVLQAQPDAVLWFVEFHPAATASLRAMMG 455
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A+G+D R++F+ + EH+ R AD+ LDT C GHTT+ D LW G P+V G T
Sbjct: 456 AVGIDASRLIFAPRLPQAEHMLRLSAADLFLDTWPCAGHTTASDALWAGVPLVAWAGRTF 515
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
ASRVA S L LG ELIA + Y +A L DR+
Sbjct: 516 ASRVAGSLLHALGLDELIAESQGGYHALAQHLAKDRE 552
>gi|456358776|dbj|BAM93221.1| hypothetical protein S58_72570 [Agromonas oligotrophica S58]
Length = 453
Score = 135 bits (341), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 71/177 (40%), Positives = 100/177 (56%), Gaps = 8/177 (4%)
Query: 22 TNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVP 81
+ Q + +TA G +R + GLPE V+C FNQ YK+ P+ +W +L A P
Sbjct: 230 SYQPHGRTALGPP-------PTRAEVGLPEAGFVFCCFNQAYKLTPTVFDLWCRLLDATP 282
Query: 82 NSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNG 141
+S+LWLL E N++ A G+ R++F+ + EH+RR QLAD+ LDT
Sbjct: 283 DSVLWLLA-SDQAEGNLRGEALRRGVAPGRLVFAPEMKQSEHLRRLQLADLVLDTAPYGA 341
Query: 142 HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
HTT+ D LW G P+VT G+T ASRVA S L +G PELIA +Y +A+ L +
Sbjct: 342 HTTASDALWAGVPIVTCAGDTFASRVAGSLLHAVGLPELIAADEADYVAVALTLAAE 398
>gi|158425090|ref|YP_001526382.1| hypothetical protein AZC_3466 [Azorhizobium caulinodans ORS 571]
gi|158331979|dbj|BAF89464.1| TPR repeat protein [Azorhizobium caulinodans ORS 571]
Length = 779
Score = 135 bits (341), Expect = 7e-30, Method: Composition-based stats.
Identities = 68/161 (42%), Positives = 94/161 (58%)
Query: 38 SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
S + SR GLPE V+C FN YKI P+ +W+ +L VP+S+LWL + A N
Sbjct: 567 SETLPSRAACGLPESGFVFCCFNNTYKITPAVFSVWMRLLDEVPDSVLWLYEANAAARDN 626
Query: 98 IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ A G++ RI+F+ A +H+ R AD+ LDT N HTT+ D LW G P+VT
Sbjct: 627 LAYEAAKFGIEPDRIIFAPNALLADHLARHAHADLFLDTLPYNAHTTASDALWAGVPIVT 686
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GET A+RVAAS L +G PELI + +Y+ +A+ L D
Sbjct: 687 CAGETFAARVAASLLDAVGMPELITTSLADYEALALALARD 727
>gi|402771865|ref|YP_006591402.1| hypothetical protein BN69_1300 [Methylocystis sp. SC2]
gi|401773885|emb|CCJ06751.1| TPR repeat protein [Methylocystis sp. SC2]
Length = 451
Score = 135 bits (341), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 91/156 (58%), Gaps = 1/156 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR GLP + VYC FNQ YK+ P +W +L+A P ++LWL + E N++
Sbjct: 244 SRAAAGLPAEGFVYCCFNQAYKLTPFIFDLWARLLEATPGAVLWL-SAAMLAEGNLRNEM 302
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+D R++F+ + EH+ R QLAD+ LDT HTT+ D LW G P+VT G+T
Sbjct: 303 RRRGIDAARMIFAPHLPQAEHLARLQLADLALDTAPFGSHTTASDALWAGVPIVTCAGDT 362
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
SRVA S L +G PELIA EY +IA+ L D
Sbjct: 363 FPSRVAGSLLHAIGMPELIAADFDEYLEIALVLAGD 398
>gi|387131239|ref|YP_006294129.1| TPR domain-containing protein [Methylophaga sp. JAM7]
gi|386272528|gb|AFJ03442.1| TPR domain protein, putative component of TonB system [Methylophaga
sp. JAM7]
Length = 646
Score = 135 bits (341), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 63/155 (40%), Positives = 98/155 (63%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
+ R+ +GLP++AIV C+FNQLYK+ +T Q W+++L +P ++LW+L G +Q +
Sbjct: 415 SCREDHGLPQEAIVLCSFNQLYKVTEATWQSWLSILVRIPETVLWVLAPSQTGRVVLQQS 474
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A G+D R++F++ + +++ R Q AD+ LDT + N HTT++D LW G P+V + G
Sbjct: 475 LVAHGIDSQRLIFADRTGQAKNLERMQHADLMLDTEIYNAHTTAVDALWAGVPIVAVKGG 534
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
ASRV+AS L G PELI E D+A+ L
Sbjct: 535 HFASRVSASLLQACGLPELITHNLDEMADLAVALS 569
>gi|384263551|ref|YP_005418740.1| Predicted O-linked N-acetylglucosamine transferase [Rhodospirillum
photometricum DSM 122]
gi|378404654|emb|CCG09770.1| Predicted O-linked N-acetylglucosamine transferase [Rhodospirillum
photometricum DSM 122]
Length = 682
Score = 135 bits (340), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 68/158 (43%), Positives = 93/158 (58%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ SR + GLPE+A+VYC FN YKI T + W++VL AVP+ +LWLL +QA
Sbjct: 431 VPSRAEAGLPEEAVVYCCFNAAYKITRFTFERWISVLAAVPDGVLWLLDPGRTTRERLQA 490
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
AQA G+D R++F+ H+ R LAD+ LDT HTT+ D LW G PV+T G
Sbjct: 491 EAQARGIDPARLVFAPRVPNPVHLARYALADLFLDTAPYGAHTTASDALWRGVPVLTWSG 550
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRV S + G P+L+ ++ ++Y AI LG D
Sbjct: 551 RAFASRVCGSLVRAAGLPDLVCKSPEDYTARAIMLGQD 588
>gi|319761242|ref|YP_004125179.1| tetratricopeptide tpr_1 repeat-containing protein [Alicycliphilus
denitrificans BC]
gi|317115803|gb|ADU98291.1| Tetratricopeptide TPR_1 repeat-containing protein [Alicycliphilus
denitrificans BC]
Length = 734
Score = 135 bits (340), Expect = 9e-30, Method: Composition-based stats.
Identities = 72/157 (45%), Positives = 95/157 (60%), Gaps = 2/157 (1%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK-FPAVGEANIQAT 101
+R+ GLP V+C FN YKI P L W +L AV +S+LWLL+ PAV N+
Sbjct: 527 TREALGLPATGFVFCCFNNNYKILPQMLDGWGRILHAVEDSVLWLLEDNPAVSR-NLLRE 585
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
AQA G+ R++F+ +EH+ R +LAD+ LDT CN HTT+ D LW G PV+T G+
Sbjct: 586 AQARGIAPQRLVFAQRMPLDEHLARHRLADLFLDTLPCNAHTTASDALWAGLPVLTCAGQ 645
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ ASRVAAS L +G PEL+ T Y+ AI L D
Sbjct: 646 SFASRVAASLLHAVGLPELVTETQGAYEARAIALARD 682
>gi|330823109|ref|YP_004386412.1| hypothetical protein Alide2_0477 [Alicycliphilus denitrificans
K601]
gi|329308481|gb|AEB82896.1| Tetratricopeptide TPR_1 repeat-containing protein [Alicycliphilus
denitrificans K601]
Length = 734
Score = 135 bits (340), Expect = 9e-30, Method: Composition-based stats.
Identities = 72/157 (45%), Positives = 95/157 (60%), Gaps = 2/157 (1%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK-FPAVGEANIQAT 101
+R+ GLP V+C FN YKI P L W +L AV +S+LWLL+ PAV N+
Sbjct: 527 TREALGLPATGFVFCCFNNNYKILPQMLDGWGRILHAVEDSVLWLLEDNPAVSR-NLLRE 585
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
AQA G+ R++F+ +EH+ R +LAD+ LDT CN HTT+ D LW G PV+T G+
Sbjct: 586 AQARGIAPQRLVFAQRMPLDEHLARHRLADLFLDTLPCNAHTTASDALWAGLPVLTCAGQ 645
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ ASRVAAS L +G PEL+ T Y+ AI L D
Sbjct: 646 SFASRVAASLLHAVGLPELVTETQGAYEARAIALARD 682
>gi|440225535|ref|YP_007332626.1| TPR repeat-containing protein [Rhizobium tropici CIAT 899]
gi|440037046|gb|AGB70080.1| TPR repeat-containing protein [Rhizobium tropici CIAT 899]
Length = 647
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 98/155 (63%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
T+R Q+ LPEDA V+ +FN KI + +W N+LK NS+LWLL EAN+ A
Sbjct: 416 TTRAQHKLPEDAFVFASFNGNRKITSQMVDVWCNILKRSKNSVLWLLCNGPRAEANLWAR 475
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
+A G+ + R++F+ E+H+ R QLAD+ LDT NGHTT+ + LW G PV+T+ G
Sbjct: 476 LEARGISRKRVVFTTRIRYEDHIDRQQLADLGLDTFPVNGHTTTSEQLWGGLPVLTVKGT 535
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
ASRV+ S L +G PEL+A + Y+D+A++L
Sbjct: 536 NFASRVSESLLNAIGLPELVAADIQTYEDMAVQLA 570
>gi|319791963|ref|YP_004153603.1| hypothetical protein Varpa_1276 [Variovorax paradoxus EPS]
gi|315594426|gb|ADU35492.1| Tetratricopeptide repeat [Variovorax paradoxus EPS]
Length = 833
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 65/159 (40%), Positives = 97/159 (61%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
+R +GLPEDA V CNF+Q +KI P T WV +++ +PN++LWL + P E N++A
Sbjct: 617 ATRADHGLPEDAFVLCNFSQSFKIQPETFAAWVRIVRRIPNAVLWLAQGPNGFETNLRAQ 676
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
+A GL R++ S + H+ R LAD+ +DT + T+ DVLW G P++ L G
Sbjct: 677 WEAAGLPAERLIVSPRMPIDRHLARIGLADLFIDTFPYSSGATANDVLWAGVPLLALTGT 736
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
T+ SR+A S L+ +G PEL+A T EY + A+ T R+
Sbjct: 737 TMVSRMAGSLLSAIGLPELVATTRDEYVEKAVHYATHRN 775
>gi|217978051|ref|YP_002362198.1| hypothetical protein Msil_1891 [Methylocella silvestris BL2]
gi|217503427|gb|ACK50836.1| conserved hypothetical protein [Methylocella silvestris BL2]
Length = 492
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 69/158 (43%), Positives = 92/158 (58%), Gaps = 1/158 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R Q GLPE V+C FNQ YKI P+ +W +L P S+LWLL+ E N++ A
Sbjct: 280 TRAQAGLPEQGFVFCCFNQAYKITPAVFDLWRRLLDEFPGSVLWLLR-AEQAEGNLRGEA 338
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+ R++F+ + +H+ R QLAD+ LDT HTT+ D LW G P+VT GET
Sbjct: 339 LRRGVSPDRLIFAPDMNQVDHLGRLQLADLVLDTSPYGAHTTASDALWAGVPIVTFAGET 398
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SRVA S L +G PELIA +EY IA L ++ D
Sbjct: 399 FPSRVAGSLLHAVGMPELIAADAEEYLAIASVLASEPD 436
>gi|182677437|ref|YP_001831583.1| hypothetical protein Bind_0441 [Beijerinckia indica subsp. indica
ATCC 9039]
gi|182633320|gb|ACB94094.1| TPR repeat-containing protein [Beijerinckia indica subsp. indica
ATCC 9039]
Length = 1085
Score = 135 bits (339), Expect = 1e-29, Method: Composition-based stats.
Identities = 65/156 (41%), Positives = 94/156 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R GLPED V+C FN YK+ P +W+ +L+AVP S+ WLL+ + N+Q A
Sbjct: 559 TRADCGLPEDGFVFCCFNNSYKLTPVFFDLWMRLLQAVPGSVFWLLESGPLVRENLQHEA 618
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+D +R++F+ A +H+ R +LAD+ LD N HTT+ D LW G PV+T GET
Sbjct: 619 EKRGIDPNRLVFAPRIAIPDHLARHRLADLFLDCLPYNAHTTTSDALWAGLPVLTCAGET 678
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A RVA S L +G PELI + +Y+ A+ L +
Sbjct: 679 FAGRVAGSLLQAVGLPELITTSLADYEARALHLARN 714
>gi|322709482|gb|EFZ01058.1| UDP-N-acetylglucosaminyltransferase [Metarhizium anisopliae ARSEF 23]
Length = 1605
Score = 135 bits (339), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 64/128 (50%), Positives = 90/128 (70%), Gaps = 4/128 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L DAI+ NFNQLYKIDP+ + W+ +L VP ++LWLL+FP GE+N++ATA+A
Sbjct: 1379 LGNDAIILGNFNQLYKIDPTIFRAWLRILARVPRAVLWLLRFPEPGESNLRATAKAWAGP 1438
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ RI+F++VA K +H+ R + D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1439 EVADRIIFTDVAPKSQHISRATVCDMFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1498
Query: 165 SRVAASQL 172
SR+AAS L
Sbjct: 1499 SRMAASIL 1506
>gi|148555054|ref|YP_001262636.1| hypothetical protein Swit_2139 [Sphingomonas wittichii RW1]
gi|148500244|gb|ABQ68498.1| TPR repeat-containing protein [Sphingomonas wittichii RW1]
Length = 654
Score = 135 bits (339), Expect = 1e-29, Method: Composition-based stats.
Identities = 69/170 (40%), Positives = 97/170 (57%)
Query: 30 ATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK 89
AT + P + R GLPE V+ +FN YKI P +W+ +L V S+LWLL
Sbjct: 432 ATDDRRPIADRPPRRADVGLPEQGFVFASFNASYKITPDLFAIWMRLLDRVEGSVLWLLG 491
Query: 90 FPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVL 149
A AN++ A+ G+D R+LF+ EH+ R +LAD+ LD+ +CN HTT+ D L
Sbjct: 492 DDADAAANLRGHARRHGIDPARLLFAAKLPVAEHLARQKLADLFLDSFVCNAHTTASDAL 551
Query: 150 WTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
W G P+VT G A+RV AS L +G PEL+ + +Y+ +A L TDR
Sbjct: 552 WAGLPIVTRIGAGFAARVCASLLRAVGLPELVTDSAADYERLAFALATDR 601
>gi|421587794|ref|ZP_16033148.1| hypothetical protein RCCGEPOP_04026 [Rhizobium sp. Pop5]
gi|403707632|gb|EJZ22587.1| hypothetical protein RCCGEPOP_04026 [Rhizobium sp. Pop5]
Length = 638
Score = 135 bits (339), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 95/156 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R++ GLPEDA ++ +FN KI P T+ W +LK PNS+LWL+ +AN+
Sbjct: 418 TREKLGLPEDAFIFASFNGNRKITPETIDSWCRILKRAPNSVLWLMANSPRNQANLLKRF 477
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q+ G+ RI+F A EEH+ R Q AD+ +DT NGHTT+ + LW G PV+T+ G
Sbjct: 478 QSSGISSKRIIFCPRAPYEEHIDRQQAADIGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 537
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRV+ S L + P+L+A + Y+D+A+ L +
Sbjct: 538 FASRVSESLLRAIDLPDLVADDLQAYEDMAVELAEN 573
>gi|372488421|ref|YP_005027986.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Dechlorosoma suillum PS]
gi|359354974|gb|AEV26145.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Dechlorosoma suillum PS]
Length = 625
Score = 135 bits (339), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 68/175 (38%), Positives = 102/175 (58%), Gaps = 7/175 (4%)
Query: 25 TNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSI 84
T++ E P SR + GLPE +V+C+F+ YK+ P +W+N+L+ VP S+
Sbjct: 404 TDSGVKISERTP------SRSECGLPEQGVVFCSFSHDYKVSPPVFDIWMNLLRQVPGSV 457
Query: 85 LWLLKFPAVGEANIQATAQALGLDQHRILFS-NVAAKEEHVRRGQLADVCLDTPLCNGHT 143
LWL+ V + N++ AQA G+D R++F+ V E+H+ R + AD+ LDT N HT
Sbjct: 458 LWLVSRNEVSQGNLRKEAQARGVDPSRLVFAGRVPLVEDHLARYRQADIFLDTHPYNAHT 517
Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
T+ D L G PVVT G +RVA S + +G PEL ++ EY+ +A+ L TD
Sbjct: 518 TAADALMAGLPVVTYMGNAFPARVAGSLVHAVGLPELATKSLAEYEALALALATD 572
>gi|392378891|ref|YP_004986051.1| protein of unknown function; putative TPR domain [Azospirillum
brasilense Sp245]
gi|356880373|emb|CCD01322.1| protein of unknown function; putative TPR domain [Azospirillum
brasilense Sp245]
Length = 670
Score = 135 bits (339), Expect = 1e-29, Method: Composition-based stats.
Identities = 67/158 (42%), Positives = 93/158 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR GLPE +V+C FN YKI P+ +W +L+AVP+S+LWLL AN++ A
Sbjct: 459 SRAACGLPERGVVFCCFNAAYKITPALFDLWCRLLRAVPDSVLWLLDSHPEASANLRREA 518
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+ R++F+ EH+ R +LAD+ LDT HTT+ D LW G PV+T+ G+
Sbjct: 519 VRRGVAADRLVFAPRRPPAEHLARYRLADLFLDTTPVGAHTTASDALWAGLPVLTVAGKG 578
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
ASRV S L +G PEL R+ +Y+ A+RL D D
Sbjct: 579 FASRVGMSLLRAVGLPELAVRSPADYEAAAVRLAGDPD 616
>gi|384083989|ref|ZP_09995164.1| Tetratricopeptide TPR_4 [Acidithiobacillus thiooxidans ATCC 19377]
Length = 708
Score = 135 bits (339), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 64/158 (40%), Positives = 96/158 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+RQ+ GLPE +V+C N YK+DP +W +L+ +P S LWLL+ P N++ A
Sbjct: 507 TRQEAGLPEQGLVFCCMNNSYKLDPFVFSIWCRLLEQIPGSHLWLLQGPQEMVTNLREAA 566
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ R++F+ ++++H+ R Q AD+ LDT N HTT+ D LW G PV+T+ GET
Sbjct: 567 SAQGIQSERLIFAPRVSRQQHLTRLQCADMFLDTRFYNAHTTATDALWAGVPVLTVAGET 626
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
++RVAAS + L PEL+ Y+ A+RL + D
Sbjct: 627 FSARVAASLVHALEMPELVQPDWAGYEAEALRLAQNPD 664
>gi|85710265|ref|ZP_01041330.1| TPR repeat protein [Erythrobacter sp. NAP1]
gi|85688975|gb|EAQ28979.1| TPR repeat protein [Erythrobacter sp. NAP1]
Length = 694
Score = 134 bits (338), Expect = 2e-29, Method: Composition-based stats.
Identities = 68/166 (40%), Positives = 97/166 (58%), Gaps = 2/166 (1%)
Query: 35 VPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
VPQ +R +GLP DA V+C FN YKI +W+ L AV S+LWL + AV
Sbjct: 479 VPQPS--DTRADHGLPGDAFVFCCFNASYKITRDRFALWMRALGAVEGSVLWLYRSDAVA 536
Query: 95 EANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
EAN++ A+ G+D HR++F+ + +H+ R + A++ LDT HTT+ D LW G P
Sbjct: 537 EANLRMEAKRCGIDPHRLVFAYHLPRTQHLARHRHANLALDTSAYGAHTTASDCLWAGLP 596
Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+VT G+ A+RVAAS L G EL+ + EY+ + + L TD D
Sbjct: 597 IVTRAGDQFAARVAASVLHAAGLDELVTHSDAEYETLIVALATDAD 642
>gi|302511405|ref|XP_003017654.1| hypothetical protein ARB_04536 [Arthroderma benhamiae CBS 112371]
gi|291181225|gb|EFE37009.1| hypothetical protein ARB_04536 [Arthroderma benhamiae CBS 112371]
Length = 1707
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 103/171 (60%), Gaps = 23/171 (13%)
Query: 49 LPEDAIVYCNFNQLYK----------IDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI 98
L +D ++ NFNQLYK I+P+T + W+ +L +PN+ILWLL+FP VGE N+
Sbjct: 1472 LKDDTVILGNFNQLYKNMNKLTMSCKIEPTTFRTWLRILAGIPNAILWLLRFPDVGEHNL 1531
Query: 99 QATAQALGLDQ--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVV 156
+ TA+A + RI+F++VA K H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++
Sbjct: 1532 RQTAKAWAGEATASRIVFTDVAPKHTHISRARICDLFLDTPECNAHTTAADVLWSGTPLL 1591
Query: 157 TLP--GETLASRVAASQLATL---------GCPELIARTHKEYQDIAIRLG 196
T P + SR+A+S L + +LI+ EY+ AI LG
Sbjct: 1592 TFPRYKYKMCSRMASSILTSALPQTEDGKQAARQLISSDEVEYEKTAIELG 1642
>gi|358377457|gb|EHK15141.1| hypothetical protein TRIVIDRAFT_196588 [Trichoderma virens Gv29-8]
Length = 1566
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 67/158 (42%), Positives = 100/158 (63%), Gaps = 13/158 (8%)
Query: 51 EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
+D I+ NFNQLYKIDP+T + W+ +L P +ILWLL+FP +GE N++ TA++ +
Sbjct: 1342 DDVIILGNFNQLYKIDPTTFRSWLRILARTPKAILWLLRFPELGETNLRQTAESWAGAEV 1401
Query: 110 -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET--LASR 166
R++F++VA K +H+ R ++ D+ LDT CN HTT+ DVLW+ TP++TLP + + SR
Sbjct: 1402 ASRLIFTDVAPKSQHITRARVCDLFLDTAECNAHTTAADVLWSSTPLLTLPRYSYKMCSR 1461
Query: 167 VAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
+AAS ELIA + EY+D A +L
Sbjct: 1462 MAASIFRGALPKTAEGRQAAEELIADSETEYEDSATQL 1499
>gi|241203113|ref|YP_002974209.1| hypothetical protein Rleg_0359 [Rhizobium leguminosarum bv.
trifolii WSM1325]
gi|240857003|gb|ACS54670.1| conserved hypothetical protein [Rhizobium leguminosarum bv.
trifolii WSM1325]
Length = 657
Score = 134 bits (337), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 94/156 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+Q GLP+DA ++ +FN KI P T+ W +LK PNS+LWL+ +AN+
Sbjct: 437 TREQLGLPDDAFIFASFNGNRKITPETIDSWCRILKRAPNSVLWLMANTPRNQANLLKQF 496
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+ RI+F A E+H+ R Q AD+ +DT NGHTT+ + LW G PV+T+ G
Sbjct: 497 QTAGISAKRIIFCPRAPYEQHIDRQQAADIGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 556
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRV+ S L + PEL+A + Y+D+A+ L +
Sbjct: 557 FASRVSESLLRAIDLPELVAPDLRAYEDMAVELAEN 592
>gi|86356319|ref|YP_468211.1| glycosyltransferase domain-containing protein [Rhizobium etli CFN
42]
gi|86280421|gb|ABC89484.1| putative glycosyltransferase TPR domain protein [Rhizobium etli CFN
42]
Length = 544
Score = 133 bits (335), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 93/156 (59%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+Q GLPEDA ++ +FN KI P + W +LK PNS+LWL+ +AN+
Sbjct: 324 TREQLGLPEDAFIFASFNGNRKITPEVVNSWCRILKRAPNSVLWLMANSPRNQANLSKHF 383
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+ RI+F A E+H+ R Q AD+ +DT NGHTT+ + LW G PV+T+ G
Sbjct: 384 QTAGISPKRIIFCPRAPYEDHISRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 443
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRV+ S L + PEL+A + Y+D+A+ L +
Sbjct: 444 FASRVSESLLKAIDLPELVASDLQAYEDLAVELAQN 479
>gi|428181597|gb|EKX50460.1| hypothetical protein GUITHDRAFT_103694 [Guillardia theta CCMP2712]
Length = 807
Score = 133 bits (335), Expect = 4e-29, Method: Composition-based stats.
Identities = 67/174 (38%), Positives = 98/174 (56%), Gaps = 1/174 (0%)
Query: 24 QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
Q N A+ P+ + R++ LPE A V+CNFN LYKI+P ++WV ++ VP+S
Sbjct: 484 QVNDHRASYPIRPEKTISEIRRKERLPEHAFVFCNFNGLYKIEPQVFEIWVRIMHRVPDS 543
Query: 84 ILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
+LWL+ ANI A+ G+ RI+ + EH+ R + AD+ LD N HT
Sbjct: 544 VLWLVNENNESNANILRQAEVRGISSSRIILAEREDFAEHIFRVRAADMFLDCLTYNAHT 603
Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLG-CPELIARTHKEYQDIAIRLG 196
T++D LW G PV+TLPG + R+A L++ G E RT EY++IA+ L
Sbjct: 604 TAVDSLWGGLPVMTLPGWNMNMRLATGALSSFGPASETCVRTESEYEEIAVALA 657
>gi|414168065|ref|ZP_11424269.1| hypothetical protein HMPREF9696_02124 [Afipia clevelandensis ATCC
49720]
gi|410888108|gb|EKS35912.1| hypothetical protein HMPREF9696_02124 [Afipia clevelandensis ATCC
49720]
Length = 661
Score = 133 bits (335), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 63/155 (40%), Positives = 93/155 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R + GLP+D V+C FN YKI P T W+ +L+ VP S+LWL++ + AN++ A
Sbjct: 451 TRTELGLPQDGFVFCCFNNNYKITPQTFAGWMRILERVPGSVLWLIEDNPLASANLRKEA 510
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+D R++F+ E+H+ R + AD+ LDT N HTT+ D LW PV+T G T
Sbjct: 511 GVHGVDPSRLVFAERVPPEDHLARHRCADLFLDTLPYNAHTTASDALWARLPVLTCTGGT 570
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
A RVAAS L + PEL+ + +Y+ +A+ L T
Sbjct: 571 FAGRVAASLLLNVALPELVTSSQDDYERLAVELAT 605
>gi|338975698|ref|ZP_08631047.1| TPR domain-containing protein [Bradyrhizobiaceae bacterium SG-6C]
gi|338231007|gb|EGP06148.1| TPR domain-containing protein [Bradyrhizobiaceae bacterium SG-6C]
Length = 661
Score = 133 bits (335), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 63/155 (40%), Positives = 93/155 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R + GLP+D V+C FN YKI P T W+ +L+ VP S+LWL++ + AN++ A
Sbjct: 451 TRTELGLPQDGFVFCCFNNNYKITPQTFAGWMRILERVPGSVLWLIEDNPLASANLRKEA 510
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+D R++F+ E+H+ R + AD+ LDT N HTT+ D LW PV+T G T
Sbjct: 511 GVHGVDPSRLVFAERVPPEDHLARHRCADLFLDTLPYNAHTTASDALWARLPVLTCTGGT 570
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
A RVAAS L + PEL+ + +Y+ +A+ L T
Sbjct: 571 FAGRVAASLLLNVALPELVTSSQDDYERLAVELAT 605
>gi|409051565|gb|EKM61041.1| glycosyltransferase family 41 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 1339
Score = 133 bits (334), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 95/153 (62%), Gaps = 5/153 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP +++ NFNQLYKIDP W+ +L VPNSILWLL+FPA GE ++ AQ +
Sbjct: 1088 LPHTVVIFANFNQLYKIDPGVFAAWLRILVRVPNSILWLLRFPAQGEEHLLRQAQMWAGN 1147
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
+ R+ F++VA K HV RG++AD+ LDT CN HT + D+LW GTP++T P +
Sbjct: 1148 EVASRVKFTDVARKHVHVHRGRVADLFLDTMECNAHTIAGDILWPGTPLITFPKHPHKMC 1207
Query: 165 SRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
SRVAAS G E++ + + Y++ A+ L
Sbjct: 1208 SRVAASMANATGFGDEMVVDSLEAYENRAVALA 1240
>gi|344174927|emb|CCA87561.1| conserved hypothethical protein with TPR repeat domain [Ralstonia
syzygii R24]
Length = 675
Score = 133 bits (334), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 93/155 (60%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R GLP D V C FNQ YKI + W +L P S+LWLL+ A +A + A+
Sbjct: 471 RAGVGLPADGFVLCCFNQAYKITEARANTWFTILSRTPGSVLWLLEPDASAKAALLERAR 530
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G++ HR++F+ A+ EH+ R QLAD+ LDT HTT+ D+LW G P++T G+T+
Sbjct: 531 RHGVESHRLVFAPQVAQREHIARLQLADLALDTFPYTSHTTASDLLWAGVPLLTRMGDTM 590
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRVAAS L G +L+ T ++Y D A+RL D
Sbjct: 591 ASRVAASILQAAGLHDLVVTTEEDYVDAAVRLAGD 625
>gi|299135312|ref|ZP_07028503.1| Tetratricopeptide TPR_2 repeat protein [Afipia sp. 1NLS2]
gi|298590289|gb|EFI50493.1| Tetratricopeptide TPR_2 repeat protein [Afipia sp. 1NLS2]
Length = 700
Score = 132 bits (333), Expect = 5e-29, Method: Composition-based stats.
Identities = 75/186 (40%), Positives = 107/186 (57%), Gaps = 8/186 (4%)
Query: 16 LQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVN 75
L N N T T+ +P V R+ +GLP+ V+C+FN KI+P T W+
Sbjct: 470 LPNCYQVNDTTTRV-----IPNDPVF--RRDHGLPDAGFVFCSFNNNNKINPGTFTDWMT 522
Query: 76 VLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLD 135
+L+ VP S+LWLLK V N++ A A G++ +R++F+ A H+ R +LAD+ LD
Sbjct: 523 ILRRVPGSVLWLLKNSDVVAQNLRREASARGINPNRLVFAPRADLPRHLARHRLADLFLD 582
Query: 136 TPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE-LIARTHKEYQDIAIR 194
T N HTT+ D L G PV+TL GE A RVAAS L T+G + L+ + +EY + A+
Sbjct: 583 TLPYNAHTTTSDALLAGLPVLTLQGEAFAGRVAASILTTVGLQDGLVTFSREEYVERAVA 642
Query: 195 LGTDRD 200
L TD D
Sbjct: 643 LATDAD 648
>gi|428179024|gb|EKX47897.1| hypothetical protein GUITHDRAFT_69287, partial [Guillardia theta
CCMP2712]
Length = 437
Score = 132 bits (333), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 104/177 (58%), Gaps = 2/177 (1%)
Query: 24 QTNTKTATGEEVPQSIVITSRQQYGLPED-AIVYCNFNQLYKIDPSTLQMWVNVLKAVPN 82
QTN+ E+ + +R + LP+ +V+CNF +L +I Q+W+N+L+ VPN
Sbjct: 220 QTNSFRHLYPEILDQKNLPTRDDHQLPDHPTVVFCNFCRLGRITADLFQIWMNILRRVPN 279
Query: 83 SILWLLKFPAVGEANIQATAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNG 141
S++WL K P +Q A+ G+ R++F + K EH++R LAD+CLDT + NG
Sbjct: 280 SVIWLYKHPKAAVPRLQLQAKRSGIPPERLVFGPPCSPKIEHLKRVTLADLCLDTIVYNG 339
Query: 142 HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
HTT+ D+LW G P VT+ G+ S VA ++G PE+ EY+++A++LG D
Sbjct: 340 HTTASDMLWAGVPFVTVSGDNWPSLVATCIAKSVGMPEMAVSNLAEYEELAVKLGND 396
>gi|222147822|ref|YP_002548779.1| hypothetical protein Avi_1086 [Agrobacterium vitis S4]
gi|221734810|gb|ACM35773.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length = 632
Score = 132 bits (333), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 64/155 (41%), Positives = 91/155 (58%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R +GLPE +V+C+FNQ KI + W+ +LK+V S+LWL+ V N++A A
Sbjct: 434 RSLHGLPEQGVVFCSFNQAVKIRYQVFKTWMEILKSVDGSVLWLIDMLPVTRDNLRAAAV 493
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
LG+ R++F+ EH+RR AD+ LDT CNGHTT+ D LW G PV+T G
Sbjct: 494 RLGVAPERLIFAPKKPLSEHLRRLPYADIALDTGPCNGHTTTADALWAGVPVLTWKGTNF 553
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A RV+ S L+ +G EL+A E+ +A+ L D
Sbjct: 554 AGRVSESLLSAVGLTELVADDLTEFGRLAVELAQD 588
>gi|393218462|gb|EJD03950.1| TPR-like protein [Fomitiporia mediterranea MF3/22]
Length = 1383
Score = 132 bits (333), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 94/151 (62%), Gaps = 5/151 (3%)
Query: 51 EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
+D I++ NFNQ+YK+DPST W+ +LK VPNSILWLL+FP GE ++ TA+ +
Sbjct: 1143 DDTIIFANFNQMYKLDPSTYATWLRILKRVPNSILWLLRFPPAGEEHLLRTAEQWASAEI 1202
Query: 110 -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET--LASR 166
R+ F+ V K+EH+ R +AD+ LDT CN HT + DVLW GTP++T P T + SR
Sbjct: 1203 ASRVRFTGVVPKDEHIERVGVADIFLDTVECNAHTIAADVLWGGTPIITWPKYTFKMCSR 1262
Query: 167 VAAS-QLATLGCPELIARTHKEYQDIAIRLG 196
V AS AT ++ + Y++ A+ L
Sbjct: 1263 VGASIAFATGFGNRMVVDSLAAYEERAVSLA 1293
>gi|116250489|ref|YP_766327.1| hypothetical protein RL0717 [Rhizobium leguminosarum bv. viciae
3841]
gi|115255137|emb|CAK06211.1| putative conserved hypothetical TPR repeat protein [Rhizobium
leguminosarum bv. viciae 3841]
Length = 657
Score = 132 bits (331), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 94/156 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+Q GLPEDA ++ +FN KI P T+ W +LK PNS+LWL+ +AN+
Sbjct: 437 TREQLGLPEDAFIFASFNGNRKITPETIGSWCRILKRAPNSVLWLMANTPRNQANLLKQF 496
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+ RI+F A E+H+ R Q AD+ +DT NGHTT+ + LW G PV+T+ G
Sbjct: 497 QGAGISPKRIIFCPRAPYEQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 556
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRV+ S L + P+L+A + Y+D+A+ L +
Sbjct: 557 FASRVSESLLRAIDLPDLVAPDLQAYEDMAVELAEN 592
>gi|424873732|ref|ZP_18297394.1| hypothetical protein Rleg5DRAFT_5272 [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393169433|gb|EJC69480.1| hypothetical protein Rleg5DRAFT_5272 [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 657
Score = 132 bits (331), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 94/156 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+Q GLPEDA ++ +FN KI P T+ W +LK PNS+LWL+ +AN+
Sbjct: 437 TREQLGLPEDAFIFASFNGNRKITPETIGSWCRILKRAPNSVLWLMANTPRNQANLLKQF 496
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+ RI+F A E+H+ R Q AD+ +DT NGHTT+ + LW G PV+T+ G
Sbjct: 497 QGAGISPKRIIFCPRAPYEQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 556
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRV+ S L + P+L+A + Y+D+A+ L +
Sbjct: 557 FASRVSESLLRAIDLPDLVAPDLQAYEDMAVELAEN 592
>gi|194335763|ref|YP_002017557.1| hypothetical protein Ppha_0641 [Pelodictyon phaeoclathratiforme
BU-1]
gi|194308240|gb|ACF42940.1| TPR repeat-containing protein [Pelodictyon phaeoclathratiforme
BU-1]
Length = 1094
Score = 132 bits (331), Expect = 1e-28, Method: Composition-based stats.
Identities = 64/156 (41%), Positives = 92/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R++ GLPE V+ FN KI P+T W+ +L V S+LWLL+ N++ A
Sbjct: 607 TRKECGLPEQGFVFACFNNNNKITPATFDGWMRILGQVEGSVLWLLEDNVAAADNLRNEA 666
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+D R++F+ EH+ R +L D+ LDT CN HTT+ D LW G P++TL GE+
Sbjct: 667 MQRGVDAARLVFAKRMPLAEHLARQRLGDLFLDTFPCNAHTTASDALWAGLPLLTLLGES 726
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A RVAAS L + PELI +EY+ +AI L ++
Sbjct: 727 FAGRVAASLLNAMQLPELITSKQEEYEALAIELASN 762
>gi|255616588|ref|XP_002539762.1| hypothetical protein RCOM_1987210 [Ricinus communis]
gi|223502601|gb|EEF22621.1| hypothetical protein RCOM_1987210 [Ricinus communis]
Length = 268
Score = 131 bits (330), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 68/157 (43%), Positives = 94/157 (59%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
I +R GL V+C FN YKI P W+ +L V S+LWLL+ A AN++
Sbjct: 57 IFTRGDVGLAPAGFVFCCFNNNYKITPDMFDRWMRILDRVEGSMLWLLEASAESAANLRK 116
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A A G+ R++F+ A+ +H+ R LAD+ LD+ N HTT+ D LW G PV+T+ G
Sbjct: 117 EAIARGVAAERLVFAQRASLPDHLARHCLADLFLDSTPYNAHTTASDALWAGLPVLTVLG 176
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
ET ASRVAAS L +G PELIA + + Y+ +A+ L T
Sbjct: 177 ETFASRVAASLLHAVGLPELIAESPEAYEQMAVDLAT 213
>gi|417097828|ref|ZP_11959414.1| hypothetical protein RHECNPAF_1760042 [Rhizobium etli CNPAF512]
gi|327193044|gb|EGE59956.1| hypothetical protein RHECNPAF_1760042 [Rhizobium etli CNPAF512]
Length = 638
Score = 131 bits (330), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 92/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+Q GLPEDA ++ +FN KI P + W +LK PNS+LWL+ +AN+
Sbjct: 418 TREQLGLPEDAFIFASFNGNRKITPEVVNSWCRILKRAPNSVLWLMANSPRNQANLSKHF 477
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+ RI+F A ++H+ R Q AD+ +DT NGHTT+ + LW G PV+T G
Sbjct: 478 QTAGISPKRIIFCPRAPYDQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTFKGTN 537
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRV+ S L + PEL+A + Y+D+A+ L +
Sbjct: 538 FASRVSESLLQAIDLPELVAGDLQAYEDLAVELAQN 573
>gi|399040605|ref|ZP_10735943.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Rhizobium sp. CF122]
gi|398061392|gb|EJL53188.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Rhizobium sp. CF122]
Length = 641
Score = 131 bits (330), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 66/159 (41%), Positives = 95/159 (59%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
T+R + GLPE A V+ +FN KI+ + L +W N+LK PNS+LWL+ + N+
Sbjct: 419 TTRAKVGLPEGAFVFASFNANRKINTAILDVWCNILKRAPNSVLWLMLSSPRTQTNLLNY 478
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
GL+ R++F + EEH+ R Q+AD+ +DT NGHTT+ + LW G PV+T+ G
Sbjct: 479 MNKRGLESDRVIFCPRVSYEEHIDRQQVADLGVDTFPVNGHTTTSEQLWGGLPVLTVKGT 538
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
ASRV+ S L +G PEL+A K Y+D A+ L D
Sbjct: 539 NFASRVSESLLNAIGLPELVASDIKAYEDKAVELANAPD 577
>gi|222147876|ref|YP_002548833.1| hypothetical protein Avi_1161 [Agrobacterium vitis S4]
gi|221734864|gb|ACM35827.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length = 637
Score = 131 bits (329), Expect = 2e-28, Method: Composition-based stats.
Identities = 65/159 (40%), Positives = 95/159 (59%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
+ R +G+PE A V+ +FN ++KI P T+ +W VL+A P+S+LW+L A+ N++A
Sbjct: 424 SRRADHGVPEHAFVFASFNGVHKITPQTMSLWARVLRAAPDSLLWMLCPDAIARTNLEAA 483
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A G+D RILF+ +HV R LAD+ LDT CNGHTT+ D+LW G PV+T G
Sbjct: 484 FVAEGIDPARILFAAKQDYGDHVHRLPLADLALDTFPCNGHTTTSDMLWGGLPVLTKRGH 543
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A RV+ S L +G +L+A + + +A L D
Sbjct: 544 CFAGRVSESLLKAVGLDQLVADDEEAFVSLAAELARQPD 582
>gi|126737924|ref|ZP_01753654.1| TPR repeat protein [Roseobacter sp. SK209-2-6]
gi|126721317|gb|EBA18021.1| TPR repeat protein [Roseobacter sp. SK209-2-6]
Length = 603
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 61/164 (37%), Positives = 97/164 (59%)
Query: 37 QSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA 96
Q V+ SR + GLPE ++C+ N K+ P+ +W+ +L VP S+LW+L +
Sbjct: 383 QPDVVPSRTELGLPEGKFIFCSLNNPNKVTPAEFDVWMKLLCNVPESVLWILAPNEEIKN 442
Query: 97 NIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVV 156
N+ A A G+ R++F+ + ++H+ R + AD+ LD CN HTT+ + +W G P+V
Sbjct: 443 NLTREANARGVGGERLIFAERVSMDDHLARMRQADLFLDAFNCNAHTTASEAVWAGVPLV 502
Query: 157 TLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
T G A+RVAAS + +GCP+L+ T EY D+A +L T+ D
Sbjct: 503 TKAGNQFAARVAASVVTAIGCPDLVTETIDEYYDLAYKLATEAD 546
>gi|407774610|ref|ZP_11121908.1| hypothetical protein TH2_11919 [Thalassospira profundimaris WP0211]
gi|407282652|gb|EKF08210.1| hypothetical protein TH2_11919 [Thalassospira profundimaris WP0211]
Length = 734
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 63/158 (39%), Positives = 97/158 (61%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R + GLPED V+C N YKI P+ +W+ V+ V S+LWL V + N++A A
Sbjct: 525 TRAELGLPEDGFVFCCLNNNYKIMPTEFAIWMRVMAKVEGSVLWLWCNNDVAKENLRAAA 584
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+ R++F+ + EH+ R + AD+ +DT N HTT+ D LW G PVVTL G+
Sbjct: 585 EKQGISGDRLIFAGYMPQSEHLARLRHADLFIDTFHVNAHTTASDALWAGLPVVTLAGKQ 644
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A+RVAAS L+ +G ELI+ T + Y+++ ++L + D
Sbjct: 645 FAARVAASLLSAVGLRELISETPEAYEELILKLAQNPD 682
>gi|257092967|ref|YP_003166608.1| TPR repeat-containing protein [Candidatus Accumulibacter phosphatis
clade IIA str. UW-1]
gi|257045491|gb|ACV34679.1| TPR repeat-containing protein [Candidatus Accumulibacter phosphatis
clade IIA str. UW-1]
Length = 608
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 64/157 (40%), Positives = 97/157 (61%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+ GLP+DA V+C+FN YK P W+N+L+ V S+LWLL A EAN++ A
Sbjct: 405 TRESCGLPQDAFVFCSFNNNYKYTPEMFTTWMNILRRVAGSVLWLLADNAWAEANLRQEA 464
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+D R++F++ A E ++ R ++AD+ LD+ N TT+ D LW G PV+T G T
Sbjct: 465 ITRGIDPARLIFASRVAPENYLARYRVADLFLDSFPFNAGTTANDALWMGLPVLTRSGRT 524
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASR+A + L EL+ T ++Y++ A+ LG+DR
Sbjct: 525 FASRMAGALLTACRLSELVTYTLQDYEEKAVSLGSDR 561
>gi|295688661|ref|YP_003592354.1| hypothetical protein Cseg_1236 [Caulobacter segnis ATCC 21756]
gi|295430564|gb|ADG09736.1| TPR repeat-containing protein [Caulobacter segnis ATCC 21756]
Length = 676
Score = 130 bits (328), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 63/158 (39%), Positives = 92/158 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR GLP+DA V+C FN + KI P W+ +LK P S+LWLL A ++ +A
Sbjct: 411 SRADAGLPDDAFVFCCFNGVQKITPHMFDRWIQILKRTPGSVLWLLDSNPEANARLRDSA 470
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G+D+ RI+F+ H+ R +LAD+ LDT HTT+ D LW PV+T G +
Sbjct: 471 EAKGVDRTRIVFAPKLQNAYHLARYRLADLFLDTTPYGAHTTASDALWMAVPVLTWSGRS 530
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
ASRV S + + G PEL+ + + Y + A+ +G DR+
Sbjct: 531 FASRVCGSLVRSAGLPELVVDSGEAYVEKAVEIGADRE 568
>gi|424915500|ref|ZP_18338864.1| hypothetical protein Rleg9DRAFT_3046 [Rhizobium leguminosarum bv.
trifolii WSM597]
gi|392851676|gb|EJB04197.1| hypothetical protein Rleg9DRAFT_3046 [Rhizobium leguminosarum bv.
trifolii WSM597]
Length = 657
Score = 130 bits (328), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 91/154 (59%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+Q GLPE+A ++ +FN KI P + W +LK PNS+LWL+ +AN+
Sbjct: 437 TREQLGLPEEAFIFASFNGNRKITPEVVNSWCRILKRAPNSVLWLMANSPRNQANLLKHF 496
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+ RI+F A EEH+ R Q AD+ +DT NGHTT+ + LW G PV+T G
Sbjct: 497 QTAGISPKRIIFCPRAPYEEHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTFKGTN 556
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
ASRV+ S L + P+L+A + Y+D+A+ L
Sbjct: 557 FASRVSESLLRAIDLPDLVASDLQAYEDMAVELA 590
>gi|378734069|gb|EHY60528.1| polypeptide N-acetylglucosaminyltransferase [Exophiala dermatitidis
NIH/UT8656]
Length = 1641
Score = 130 bits (327), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 102/163 (62%), Gaps = 17/163 (10%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L + A++ NFNQLYKIDP+T M++ +LK VPN+ILWLL+FP +GE + A+
Sbjct: 1400 LSDTAVILGNFNQLYKIDPATFDMYLQILKMVPNAILWLLRFPDLGEQKLLQYARDWASP 1459
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE---TL 163
+ RI+F++VAAK H+ R + D+ LDTP CN HTT+ DV+W+GTP++T G+ +
Sbjct: 1460 EVASRIVFTDVAAKGTHIMRASVVDLFLDTPECNAHTTAADVIWSGTPILTW-GKWKYKM 1518
Query: 164 ASRVAASQLATLGCPE----------LIARTHKEYQDIAIRLG 196
SR+A S +A+ PE L+ + KEY D AI L
Sbjct: 1519 CSRMAGSIVAS-ALPEGREGDEARRDLLVNSEKEYIDAAIELA 1560
>gi|336383400|gb|EGO24549.1| glycosyltransferase family 41 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 939
Score = 130 bits (327), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 96/155 (61%), Gaps = 7/155 (4%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP D +++ NFNQLYKIDP W+ +L VP SILWLL+FPA GEA++ TAQ
Sbjct: 697 LPLDVVIFANFNQLYKIDPVIFATWLRILAQVPRSILWLLRFPAAGEAHLLRTAQMWAGQ 756
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE----T 162
+ RI F++VA KE+H+ R ++ D+ LDT C+ HT + DVLW+GTP++
Sbjct: 757 EVASRIHFTDVARKEQHIYRCRVVDLFLDTVECSAHTVAADVLWSGTPIIACAWSEHRYK 816
Query: 163 LASRVAASQLATLGCPE-LIARTHKEYQDIAIRLG 196
+ SRVAAS G + ++ + +EY++ AI L
Sbjct: 817 MCSRVAASVANATGFGQHMVVSSREEYEERAISLA 851
>gi|402490310|ref|ZP_10837099.1| hypothetical protein RCCGE510_21284 [Rhizobium sp. CCGE 510]
gi|401810336|gb|EJT02709.1| hypothetical protein RCCGE510_21284 [Rhizobium sp. CCGE 510]
Length = 623
Score = 130 bits (327), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 66/161 (40%), Positives = 95/161 (59%), Gaps = 1/161 (0%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
P+ +T R+Q GLPEDA ++ +FN KI P T+ W +LK NS+LWL+ +
Sbjct: 397 PKPRAVT-REQLGLPEDAFIFASFNGNRKITPETIDSWCRILKRAQNSVLWLMANTPRNQ 455
Query: 96 ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
AN+ Q G+ RI+F A EEH+ R Q AD+ +DT NGHTT+ + LW G PV
Sbjct: 456 ANLLKQFQTAGISPKRIIFCPRAPYEEHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPV 515
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
+T+ G ASRV+ S L + P+L+A + Y+D+A+ L
Sbjct: 516 LTVKGTNFASRVSESLLRAIDLPDLVADDLQAYEDMAVELA 556
>gi|315497291|ref|YP_004086095.1| hypothetical protein Astex_0244 [Asticcacaulis excentricus CB 48]
gi|315415303|gb|ADU11944.1| hypothetical protein Astex_0244 [Asticcacaulis excentricus CB 48]
Length = 675
Score = 130 bits (327), Expect = 3e-28, Method: Composition-based stats.
Identities = 66/158 (41%), Positives = 89/158 (56%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R + GLP+ A V+C FN +KI T W+ VL V S+LWLL ++A A
Sbjct: 415 TRAEAGLPDKAFVFCCFNGTHKITRRTFDRWLAVLDRVAGSVLWLLSGSNSSHLRLKAYA 474
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G+D R++F+ A H+ R LAD+ LDT HTT D LW G PV+TL G +
Sbjct: 475 QARGIDPSRLIFAEKLANPAHLARYPLADLFLDTSPYGAHTTCSDALWMGVPVLTLSGRS 534
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
ASRV S L G PEL+ +++ D+A+ LG D D
Sbjct: 535 FASRVCGSLLRAAGLPELVTTRPEDFVDMAVSLGRDSD 572
>gi|209547935|ref|YP_002279852.1| hypothetical protein Rleg2_0327 [Rhizobium leguminosarum bv.
trifolii WSM2304]
gi|209533691|gb|ACI53626.1| conserved hypothetical protein [Rhizobium leguminosarum bv.
trifolii WSM2304]
Length = 657
Score = 130 bits (327), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 93/156 (59%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+Q GLPE+A ++ +FN KI P + W +LK PNS+LWL+ +AN+
Sbjct: 437 TREQLGLPEEAFIFASFNGNRKITPEVVNSWCRILKRAPNSVLWLMANSPRNQANLLKHF 496
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+ RI+F A E+H+ R Q AD+ +DT NGHTT+ + LW G PV+T+ G
Sbjct: 497 QTAGISSKRIIFCPRAPYEQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 556
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRV+ S L + PEL+A + Y+D+A+ L +
Sbjct: 557 FASRVSESLLRAIDLPELVAGDLQAYEDMAVELAQN 592
>gi|345869369|ref|ZP_08821327.1| Tetratricopeptide TPR_2 repeat-containing protein [Thiorhodococcus
drewsii AZ1]
gi|343923292|gb|EGV33984.1| Tetratricopeptide TPR_2 repeat-containing protein [Thiorhodococcus
drewsii AZ1]
Length = 844
Score = 130 bits (326), Expect = 3e-28, Method: Composition-based stats.
Identities = 68/156 (43%), Positives = 88/156 (56%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPE V+C FN YKI + W +L AVP S+LWL V N++ A
Sbjct: 630 SRTEAGLPEQGFVFCCFNDPYKITSEIFERWCAILNAVPGSVLWLYAKTRVVADNLRREA 689
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ GLD R+ F++ ++ EH+ R LAD+ LDT N HTT+ D LW G PV+T GET
Sbjct: 690 ERRGLDSERLYFASKVSQPEHLARLALADLVLDTLPYNAHTTASDALWVGVPVLTCQGET 749
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
SRVAAS L G PELI Y+ A+ L +
Sbjct: 750 FPSRVAASLLRAAGLPELITGDLAGYEARAVALAKE 785
>gi|239815273|ref|YP_002944183.1| hypothetical protein Vapar_2288 [Variovorax paradoxus S110]
gi|239801850|gb|ACS18917.1| TPR repeat-containing protein [Variovorax paradoxus S110]
Length = 740
Score = 130 bits (326), Expect = 4e-28, Method: Composition-based stats.
Identities = 65/159 (40%), Positives = 92/159 (57%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
++R GLP V+C FN YKI P+ W+ +L+AVP+S+LWL + + +Q
Sbjct: 530 SARMAAGLPASGFVFCCFNNTYKITPTVFATWMRILRAVPDSVLWLYETDSRVGPRLQEE 589
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A G++ R++F+ EH+ R +LAD+ LDT CN HTT D LW G P++T G
Sbjct: 590 AVKNGIEPGRLVFAPRQPLGEHLCRHELADLFLDTFPCNAHTTGSDALWAGLPLLTCLGT 649
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
T ASRVAAS L +G PE++ T Y AI L ++ D
Sbjct: 650 TFASRVAASLLHAVGLPEMVMPTLDAYATRAIELASEPD 688
>gi|344338856|ref|ZP_08769787.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiocapsa marina
5811]
gi|343801438|gb|EGV19381.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiocapsa marina
5811]
Length = 788
Score = 130 bits (326), Expect = 4e-28, Method: Composition-based stats.
Identities = 66/154 (42%), Positives = 89/154 (57%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R GLPE+ V+C FN KI P + W +L AVP S+LWL N++ A+
Sbjct: 574 RAAAGLPEEGFVFCCFNNPRKITPEVFERWCRLLHAVPGSLLWLFASQDAVIDNLKQEAE 633
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+D R++ + ++EH+ R LAD+ LDT N HTT+ D LW G PV+T G+T
Sbjct: 634 RRGIDPRRLVLAPRVPQQEHLARLALADLMLDTLPYNAHTTASDALWMGVPVLTCVGDTF 693
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
ASRVAAS L G PELI + +Y+ A+RL T
Sbjct: 694 ASRVAASLLGAAGLPELITSSLDDYEAEALRLAT 727
>gi|375104380|ref|ZP_09750641.1| hypothetical protein BurJ1DRAFT_1023 [Burkholderiales bacterium
JOSHI_001]
gi|374665111|gb|EHR69896.1| hypothetical protein BurJ1DRAFT_1023 [Burkholderiales bacterium
JOSHI_001]
Length = 464
Score = 130 bits (326), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 64/147 (43%), Positives = 89/147 (60%), Gaps = 1/147 (0%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
T+RQ GLPE A+V CNFNQ YK P +W +L+ P S+LWLL + N++
Sbjct: 255 TTRQDAGLPEGALVLCNFNQAYKFTPQVFGLWCELLERAPGSVLWLLA-DEQAQGNLRQE 313
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A A G++ HR++F+ + +H+ R LAD+ LDT N HTT+ D LW G PV+T G+
Sbjct: 314 ALARGINPHRLVFAPPLPQAQHLARLALADLVLDTLPYNAHTTASDALWAGVPVLTCAGD 373
Query: 162 TLASRVAASQLATLGCPELIARTHKEY 188
T A+RVA S L +G PEL+ +Y
Sbjct: 374 TFAARVAGSLLHAVGLPELVTHRLDDY 400
>gi|351730791|ref|ZP_08948482.1| tetratricopeptide tpr_1 repeat-containing protein [Acidovorax
radicis N35]
Length = 809
Score = 129 bits (325), Expect = 5e-28, Method: Composition-based stats.
Identities = 65/158 (41%), Positives = 96/158 (60%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R++ LP V+C FN KI P W+ +L+AVP+S+LWLL E +++
Sbjct: 600 VFTREEARLPATGRVFCCFNNNQKILPQVFDSWMRILQAVPDSVLWLLADNPAVEGSLRR 659
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
AQA G+ R++F+ ++H+ R +LAD+ LDT N HTT+ D LW G PV+T G
Sbjct: 660 EAQARGVAPERLVFAQRLPLDQHLARHRLADLFLDTLPYNAHTTASDALWAGLPVLTQLG 719
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
E+ A+RVAAS L +G PEL+ + EY+ +AI L D
Sbjct: 720 ESFAARVAASLLHAVGLPELVTHSAAEYEALAISLARD 757
>gi|259419160|ref|ZP_05743077.1| TPR repeat-containing protein [Silicibacter sp. TrichCH4B]
gi|259345382|gb|EEW57236.1| TPR repeat-containing protein [Silicibacter sp. TrichCH4B]
Length = 616
Score = 129 bits (325), Expect = 5e-28, Method: Composition-based stats.
Identities = 64/158 (40%), Positives = 93/158 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+ + LPE++ V+ FN YK+ P +W+N+LK VP+S+LW A ++ A
Sbjct: 406 SRESHDLPEESFVFSTFNNPYKVTPREFDIWMNLLKEVPDSVLWYYVSNADIIDRLRKEA 465
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G+D RI+ + E H+ R + AD+ LDT N HTT+ D LW G P+VT GE
Sbjct: 466 EARGVDGARIIPTGRMQPEYHLARLKHADLFLDTFNVNAHTTASDALWAGLPLVTKTGEQ 525
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A+RVA S L G +L+ T K+Y D+A+R+ D D
Sbjct: 526 FAARVAGSILTAAGLEDLVTTTEKKYHDVALRIAQDPD 563
>gi|288957552|ref|YP_003447893.1| hypothetical protein AZL_007110 [Azospirillum sp. B510]
gi|288909860|dbj|BAI71349.1| hypothetical protein AZL_007110 [Azospirillum sp. B510]
Length = 888
Score = 129 bits (325), Expect = 5e-28, Method: Composition-based stats.
Identities = 67/154 (43%), Positives = 87/154 (56%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR GLPED V+C FN +K+ P+ L W +L AVP+S+LWL N++
Sbjct: 660 SRAACGLPEDGFVFCCFNSTHKLTPALLDGWARILAAVPDSLLWLYAGNPQAADNLRREG 719
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G D R++F+ EH+ R +LAD+ LDT N HTT+ D LW G PV+T G T
Sbjct: 720 EARGNDPRRLVFAAPLPHAEHLARHRLADLFLDTLPYNAHTTASDALWAGLPVLTRRGTT 779
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
A RVAAS L G PELI + Y+ AI L
Sbjct: 780 FAGRVAASLLRAAGLPELIVEDQQAYEAAAISLA 813
>gi|424888611|ref|ZP_18312214.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Rhizobium leguminosarum bv. trifolii WSM2012]
gi|393174160|gb|EJC74204.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Rhizobium leguminosarum bv. trifolii WSM2012]
Length = 677
Score = 129 bits (325), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 62/156 (39%), Positives = 93/156 (59%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+Q GLPE+ ++ +FN KI P T+ W +LK PNS+LWL+ +AN+
Sbjct: 457 TREQLGLPEETFIFASFNGNRKITPETIDSWCRILKRAPNSVLWLMANTPRNQANLLKQF 516
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+ RI+F A E+H+ R Q AD+ +DT NGHTT+ + LW G PV+T+ G
Sbjct: 517 QTAGISPKRIIFCPRAPYEQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 576
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRV+ S L + P+L+A + Y+D+A+ L +
Sbjct: 577 FASRVSESLLRAIDLPDLVASDLQAYEDMAVELAEN 612
>gi|300696849|ref|YP_003747510.1| conserved protein of unknown function with TPR repeat domain
[Ralstonia solanacearum CFBP2957]
gi|299073573|emb|CBJ53093.1| conserved protein of unknown function with TPR repeat domain
[Ralstonia solanacearum CFBP2957]
Length = 672
Score = 129 bits (325), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 92/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R + GLP D +V+C FNQ YKI + Q W +L P S+LWLL+ A +Q A
Sbjct: 467 NRAEVGLPADGLVFCCFNQAYKITEARAQTWFAILSRTPGSVLWLLEPDASARTALQEEA 526
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+ R++F+ A+ H+ R QLAD+ LDT HTT+ D+LW G P++T G+T
Sbjct: 527 GRHGIASERLVFAPQVAQRAHIARLQLADLALDTFPYTSHTTASDLLWAGVPLLTRAGDT 586
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ASRVAAS L G +L+ T +Y + A+RL D
Sbjct: 587 MASRVAASILQAAGLHDLVVTTEDDYVNAAVRLAGD 622
>gi|226941622|ref|YP_002796696.1| TPR repeat protein [Laribacter hongkongensis HLHK9]
gi|226716549|gb|ACO75687.1| TPR repeat protein [Laribacter hongkongensis HLHK9]
Length = 839
Score = 129 bits (325), Expect = 5e-28, Method: Composition-based stats.
Identities = 62/157 (39%), Positives = 95/157 (60%)
Query: 40 VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ 99
+I SR++ GLPE+A V+C+FN L K+ P +W +L++V S+LWL+ + NI
Sbjct: 350 LIPSRKEMGLPENAFVFCSFNTLVKLTPEMFDVWCRLLQSVSGSVLWLIATDKKAQENIV 409
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A G+D R +F++ A+ +H+ R LAD+ LD+ HTT+ D LW G P++T+
Sbjct: 410 REAMERGVDASRFVFASQVAQSQHLARLTLADLFLDSFPVVAHTTASDSLWAGLPILTMA 469
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
GE+ SRVA S L T+G ELI ++ +Y A+ L
Sbjct: 470 GESFVSRVAGSILKTIGVDELIVYSYDQYFQKALFLA 506
>gi|12718407|emb|CAC28786.1| related to UDP-N-ACETYLGLUCOSAMINE--PEPTIDE
N-ACETYLGLUCOSAMINYLTRANSFERASE [Neurospora crassa]
Length = 1519
Score = 129 bits (324), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 71/183 (38%), Positives = 105/183 (57%), Gaps = 36/183 (19%)
Query: 49 LPEDAIVYCNFNQLYK-----------------------IDPSTLQMWVNVLKAVPNSIL 85
LP+D I+ NFNQLYK IDP+T + W+ +L VP ++L
Sbjct: 1282 LPDDRIILGNFNQLYKVRFLNFFQGIACAAPLTNVLVQQIDPTTFRTWLRILAHVPKAVL 1341
Query: 86 WLLKFPAVGEANIQATAQALGLDQ--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
WLL+FP +GE N++ TA+ ++ RI+F++VA K++H+ R ++ D+ LDTP CN HT
Sbjct: 1342 WLLRFPELGENNLRRTAKLWAGEEVASRIIFTDVAPKQQHIARARVCDLFLDTPECNAHT 1401
Query: 144 TSMDVLWTGTPVVTLPG--ETLASRVAASQL---------ATLGCPELIARTHKEYQDIA 192
T+ DVLW+ TP++TLP + SR+AAS L +LIA EY++ A
Sbjct: 1402 TAADVLWSSTPLLTLPRYEYKMCSRMAASILKGALPKSEAGRQAAEDLIAGDDYEYENRA 1461
Query: 193 IRL 195
I+L
Sbjct: 1462 IKL 1464
>gi|409436081|ref|ZP_11263278.1| putative glycosyltransferase TPR domain protein (SPINDLY-related)
[Rhizobium mesoamericanum STM3625]
gi|408752217|emb|CCM74427.1| putative glycosyltransferase TPR domain protein (SPINDLY-related)
[Rhizobium mesoamericanum STM3625]
Length = 641
Score = 129 bits (323), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 65/159 (40%), Positives = 94/159 (59%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
T+R Q GLP A V+ +FN KI+ + L +W N+LK PNS+LWL+ + N+
Sbjct: 419 TTRAQVGLPVGAFVFASFNANRKINAAILDVWCNILKRAPNSVLWLMLSSPRTQNNLLNY 478
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
G++ R++F EEH+ R Q+AD+ +DT NGHTT+ + LW G PV+T+ G
Sbjct: 479 MNKKGVESDRVIFCPRVPYEEHIDRQQMADLGIDTFPVNGHTTTSEQLWGGLPVLTVKGT 538
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
ASRV+ S L +G PEL+A K Y+D+A+ L D
Sbjct: 539 NFASRVSESLLNAIGLPELVASDIKVYEDMAVELANTPD 577
>gi|171057695|ref|YP_001790044.1| hypothetical protein Lcho_1008 [Leptothrix cholodnii SP-6]
gi|170775140|gb|ACB33279.1| TPR repeat-containing protein [Leptothrix cholodnii SP-6]
Length = 672
Score = 129 bits (323), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 69/156 (44%), Positives = 90/156 (57%)
Query: 40 VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ 99
V SRQ+ GLP+DA+V +FNQ YK P W +L A P ++LW+L A +A ++
Sbjct: 456 VAWSRQRCGLPDDALVLASFNQSYKTTPEVFAAWCRILAAQPRALLWMLVPDADTQARLR 515
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A G+D R++F+ E H R AD+ LDT C+GHTT+ D LW G PV+TL
Sbjct: 516 EVAAGHGVDPQRVVFAPFVDIESHRARLPQADLILDTFPCSGHTTTSDALWAGVPVLTLT 575
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
G A+RVAAS + TLG ELI Y D AI L
Sbjct: 576 GRNFAARVAASLVHTLGLDELICDDLATYVDRAIEL 611
>gi|424898309|ref|ZP_18321883.1| hypothetical protein Rleg4DRAFT_4290 [Rhizobium leguminosarum bv.
trifolii WSM2297]
gi|393182536|gb|EJC82575.1| hypothetical protein Rleg4DRAFT_4290 [Rhizobium leguminosarum bv.
trifolii WSM2297]
Length = 657
Score = 129 bits (323), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 62/156 (39%), Positives = 93/156 (59%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+Q GLPE+A ++ +FN KI P T+ W +LK P S+LWL+ +AN+
Sbjct: 437 TREQLGLPEEAFIFASFNGNRKITPETIDSWCRILKRAPTSVLWLMANTPRNQANLLKQF 496
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+ RI+F A E+H+ R Q AD+ +DT NGHTT+ + LW G PV+T+ G
Sbjct: 497 QTAGISPKRIIFCPRAPYEQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 556
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRV+ S L + P+L+A + Y+D+A+ L +
Sbjct: 557 FASRVSESLLRAIDLPDLVASDLQAYEDMAVELAEN 592
>gi|452964763|gb|EME69797.1| SPY protein [Magnetospirillum sp. SO-1]
Length = 794
Score = 129 bits (323), Expect = 1e-27, Method: Composition-based stats.
Identities = 65/158 (41%), Positives = 92/158 (58%), Gaps = 2/158 (1%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R GLPED ++ C F+ YKI+P +W+ VL VP S+LWLL A AN+ A
Sbjct: 589 ARGACGLPEDGVILCAFHNAYKINPPLFDVWMRVLAQVPGSVLWLLDGSA--RANLCREA 646
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G+D R++F+ ++ R +LAD+ LDT N H ++ D LW G PVVT G T
Sbjct: 647 KACGIDPGRLVFAPRVGIGAYLARHRLADLYLDTLPYNAHGSAADALWMGVPVVTCLGRT 706
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
RVAAS L G PEL+ RT +Y+++ + L D +
Sbjct: 707 FPGRVAASVLKAAGLPELVTRTPAQYEELVLALARDSE 744
>gi|171058438|ref|YP_001790787.1| hypothetical protein Lcho_1755 [Leptothrix cholodnii SP-6]
gi|170775883|gb|ACB34022.1| Tetratricopeptide TPR_2 repeat protein [Leptothrix cholodnii SP-6]
Length = 647
Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats.
Identities = 63/155 (40%), Positives = 92/155 (59%), Gaps = 1/155 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPE +V+C+FN YKI P W+ +L A P S+LWL+ A + N++A A
Sbjct: 439 SRTECGLPESGLVFCSFNHDYKISPHIFAAWMRILAATPGSVLWLMSRGAASQRNLRAAA 498
Query: 103 QALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
QA G+ R++F+ V E+H+ R + AD+ LDT N HTT+ D L G PV+T G
Sbjct: 499 QAQGVAPERLVFAQRVPRVEDHLARYRQADLFLDTHPYNAHTTAADALLAGLPVLTYSGN 558
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
+RVA S L G P+L+ + +Y+ +A+RL
Sbjct: 559 AFPARVAGSLLHAAGLPDLVTHSLSDYEALAVRLA 593
>gi|398955966|ref|ZP_10676713.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Pseudomonas sp. GM33]
gi|398150432|gb|EJM39025.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Pseudomonas sp. GM33]
Length = 611
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 94/160 (58%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R + GLP++ V+C+FN YK + W+ +L+ P+S+LWLL + N+ A
Sbjct: 412 LPTRAECGLPQERFVFCSFNNNYKFNEEVFDCWMRILQRAPDSVLWLLADNQWAQENLCA 471
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
AQA G+D R+LF+ A E+++ R +AD+ LD N TT+ D LW G PV+T G
Sbjct: 472 RAQAHGVDPARLLFAPRVAPEQYLARYSVADLFLDAYPFNAGTTANDALWMGLPVLTRSG 531
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
T ASR+A S L L PELI T EY++ A+ L T D
Sbjct: 532 RTFASRMAGSLLTALDLPELITTTLAEYEERAVELATRAD 571
>gi|218512972|ref|ZP_03509812.1| hypothetical protein Retl8_04370 [Rhizobium etli 8C-3]
Length = 173
Score = 128 bits (321), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 62/156 (39%), Positives = 91/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+Q GLPEDA ++ +FN KI + W +LK PNS+LWL+ +AN+
Sbjct: 6 TREQLGLPEDAFIFASFNGNRKITHEVVNSWCRILKRAPNSVLWLMANSPRNQANLSKHF 65
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+ RI+F A ++H+ R Q AD+ +DT NGHTT+ + LW G PV+T G
Sbjct: 66 QTAGISPKRIIFCPRAPYDQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTFKGTN 125
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRV+ S L + PEL+A + Y+D+A+ L +
Sbjct: 126 FASRVSESLLQAIDLPELVAGDLQAYEDLAVELAQN 161
>gi|288958280|ref|YP_003448621.1| hypothetical protein AZL_014390 [Azospirillum sp. B510]
gi|288910588|dbj|BAI72077.1| hypothetical protein AZL_014390 [Azospirillum sp. B510]
Length = 635
Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats.
Identities = 64/156 (41%), Positives = 88/156 (56%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ R GLP D +V+C FN YKI P W +L+ VP S+LWLL+ PA N++
Sbjct: 422 VPERADCGLPADGVVFCAFNASYKIGPELFGRWCRILEGVPGSMLWLLEGPAEVALNLRR 481
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A A G+ R++F+ +H+ R +LAD+ LD+ HTT+ D LW G PV+T+PG
Sbjct: 482 AASARGVAPERLVFAPRLPGPDHLARHRLADLFLDSSPVGAHTTASDALWAGLPVLTVPG 541
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
T A RVAAS L + PEL+ Y+ A RL
Sbjct: 542 RTFAGRVAASLLRAVDLPELVLPDWDAYEATARRLA 577
>gi|452963367|gb|EME68440.1| O-linked N-acetylglucosamine transferase [Magnetospirillum sp.
SO-1]
Length = 673
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 64/157 (40%), Positives = 89/157 (56%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPED V+C FN KI + W+ +L +VP S+LWLL+ + A
Sbjct: 409 SRSEVGLPEDGAVFCCFNSQQKISRLMFERWMRILHSVPGSVLWLLEANDEVHNRLWEKA 468
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G+ ++R++F +H+ R LAD+ LDT HTT+ D LW P++TL G +
Sbjct: 469 EACGIGRNRVIFGRRLPSPDHLARMTLADLFLDTFPYGAHTTASDALWMSVPILTLSGRS 528
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASRV S L + G PEL+ T EY + AI LG DR
Sbjct: 529 FASRVGGSLLRSAGLPELVCSTPDEYVETAIALGNDR 565
>gi|392377850|ref|YP_004985009.1| conserved protein of unknown function [Azospirillum brasilense
Sp245]
gi|356879331|emb|CCD00243.1| conserved protein of unknown function [Azospirillum brasilense
Sp245]
Length = 645
Score = 127 bits (320), Expect = 2e-27, Method: Composition-based stats.
Identities = 67/163 (41%), Positives = 91/163 (55%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
P + R GLP D V+C FN +KI PS +W+ VL VP S+LWL + G
Sbjct: 428 PLDAPVPDRAACGLPPDGFVFCAFNAPFKITPSLFGLWMRVLARVPGSVLWLQQPGRDGT 487
Query: 96 ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
N++ A G+D R++F+ + EH+ R +LAD+ LD+ GHTT D LW G PV
Sbjct: 488 DNLRREAARRGVDPGRLVFAPHRPQAEHLARYRLADLFLDSFPYTGHTTVSDALWMGLPV 547
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
VT G+T ASRVAA L G PE + + Y+ +A+RL D
Sbjct: 548 VTRMGDTFASRVAAGLLNAAGLPETVTTSFDGYEALAVRLAGD 590
>gi|222147877|ref|YP_002548834.1| hypothetical protein Avi_1162 [Agrobacterium vitis S4]
gi|221734865|gb|ACM35828.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length = 633
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 61/155 (39%), Positives = 94/155 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R +GLPE+ V+ +FN KI P ++ +W +++ AVP+S+LW+L A +AN
Sbjct: 416 KRADHGLPEEVFVFASFNSPAKISPQSITLWASIMNAVPDSLLWILCSGAQLQANFAEEF 475
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
LG+ + RI+F+ +H+ R LAD+ LDT NGHTT+ D+LW G PV+T G +
Sbjct: 476 ARLGIGRDRIVFAQGVDYPDHLSRVGLADLALDTFPYNGHTTTSDLLWGGLPVLTKKGRS 535
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
A+RV+ S L +G PEL+AR +E+ + A+
Sbjct: 536 FAARVSESLLTAIGLPELVARDGEEFVERAVEFAA 570
>gi|190890370|ref|YP_001976912.1| hypothetical protein RHECIAT_CH0000745 [Rhizobium etli CIAT 652]
gi|190695649|gb|ACE89734.1| hypothetical conserved protein [Rhizobium etli CIAT 652]
Length = 638
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 90/154 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+Q GLPEDA ++ +FN KI + W +LK PNS+LWL+ +AN+
Sbjct: 418 TREQLGLPEDAFIFASFNGNRKITHEVVNSWCRILKRAPNSVLWLMANSPRNQANLSKHF 477
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+ RI+F A ++H+ R Q AD+ +DT NGHTT+ + LW G PV+T G
Sbjct: 478 QTAGISPKRIIFCPRAPYDQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTFKGTN 537
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
ASRV+ S L + PEL+A + Y+D+A+ L
Sbjct: 538 FASRVSESLLQAIDLPELVAGDLQAYEDLAVELA 571
>gi|83312890|ref|YP_423154.1| O-linked N-acetylglucosamine transferase [Magnetospirillum
magneticum AMB-1]
gi|82947731|dbj|BAE52595.1| Predicted O-linked N-acetylglucosamine transferase
[Magnetospirillum magneticum AMB-1]
Length = 636
Score = 127 bits (320), Expect = 2e-27, Method: Composition-based stats.
Identities = 63/157 (40%), Positives = 91/157 (57%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPED +V+C FN KI P W+++L +VP S+LWLL+ + + A
Sbjct: 372 SRAEVGLPEDKMVFCCFNSQQKISPLMFDRWMHILHSVPESVLWLLECGEEIKNRLWEHA 431
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+ + R++F +H+ R LAD+ LDT HTT+ D LW P++TL G +
Sbjct: 432 ERGGIARERVIFGKRLPSPDHLARMTLADLFLDTFPYGAHTTASDALWMSVPILTLSGHS 491
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASRV S + G PEL+ T +EY ++AI LG DR
Sbjct: 492 FASRVGGSLSRSAGLPELVCSTPEEYVEMAIALGNDR 528
>gi|374291680|ref|YP_005038715.1| hypothetical protein AZOLI_1155 [Azospirillum lipoferum 4B]
gi|357423619|emb|CBS86479.1| protein of unknown function; putative TPR domain [Azospirillum
lipoferum 4B]
Length = 673
Score = 127 bits (320), Expect = 2e-27, Method: Composition-based stats.
Identities = 64/156 (41%), Positives = 87/156 (55%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ SR GLP DA+V+C FN YKI P W +L+ VP S+LWLL+ PA N++
Sbjct: 460 VPSRADCGLPADAVVFCAFNAPYKIGPELFGRWCRILQRVPGSVLWLLEGPAEVAVNLRR 519
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A G+ R++F+ H+ R +LAD+ LD+ HTT+ D LW G PV+T+PG
Sbjct: 520 AAAERGIAAERLVFAPRLPGPAHLARHRLADLFLDSSPVGAHTTASDALWAGLPVLTVPG 579
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
+ A RVAAS L +G PE Y+ A RL
Sbjct: 580 HSFAGRVAASLLHAVGLPESAVPDWDAYEATAQRLA 615
>gi|46200779|ref|ZP_00056483.2| COG3914: Predicted O-linked N-acetylglucosamine transferase,
SPINDLY family [Magnetospirillum magnetotacticum MS-1]
Length = 673
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 67/174 (38%), Positives = 96/174 (55%), Gaps = 6/174 (3%)
Query: 26 NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
+ K A E P SR GLPEDA+V+C FN KI + W+++L +VP S+L
Sbjct: 398 DRKRAVASEPP------SRTDLGLPEDAMVFCCFNSQQKISKMMFERWMHILNSVPGSVL 451
Query: 86 WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
WLL+ + + A+ G+ + R++F A +H+ R AD+ LDT HTT+
Sbjct: 452 WLLESGEEIQGRLWDHAERCGIARDRLIFGKRLASPDHLARMTQADLFLDTFPYGAHTTA 511
Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
D LW P++TL G + ASRV S + G PEL+ T +EY ++AI LG DR
Sbjct: 512 SDALWMSVPILTLSGRSFASRVGGSLSRSAGLPELVCSTPEEYVEMAIALGNDR 565
>gi|40063717|gb|AAR38498.1| TPR repeat protein [uncultured marine bacterium 583]
Length = 733
Score = 127 bits (319), Expect = 2e-27, Method: Composition-based stats.
Identities = 63/154 (40%), Positives = 91/154 (59%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R + GLP ++C FN YKI PST W+ +LKAV +S+LWL + N++ A
Sbjct: 525 RYELGLPNTGFIFCCFNNSYKITPSTFTGWMRILKAVEDSVLWLFENNNNTAKNLKKEAI 584
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+++ R++F+ E+H+ R + AD+ +DT N HTT+ D L G PV+T G +
Sbjct: 585 KFGINEDRLVFAKYMPVEDHLNRIKQADLFIDTLPYNAHTTASDALRMGIPVLTCIGSSF 644
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
ASRVAAS L + PELI T ++Y+ AI L T
Sbjct: 645 ASRVAASLLNAVNLPELITTTQEQYESFAIELAT 678
>gi|40063714|gb|AAR38495.1| TPR repeat protein [uncultured marine bacterium 583]
Length = 697
Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats.
Identities = 63/154 (40%), Positives = 91/154 (59%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R + GLP ++C FN YKI PST W+ +LKAV +S+LWL + N++ A
Sbjct: 489 RYELGLPNTGFIFCCFNNNYKITPSTFTGWMRILKAVEDSVLWLFENNNNTAKNLKKEAI 548
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+++ R++F+ E+H+ R + AD+ +DT N HTT+ D L G PV+T G +
Sbjct: 549 KFGINEDRLVFATHMPVEDHLNRIKQADLFIDTLPYNAHTTASDALRMGIPVLTCIGSSF 608
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
ASRVAAS L + PELI T ++Y+ AI L T
Sbjct: 609 ASRVAASLLNAVNLPELITTTQEQYESFAIELAT 642
>gi|374291195|ref|YP_005038230.1| hypothetical protein AZOLI_0612 [Azospirillum lipoferum 4B]
gi|357423134|emb|CBS85979.1| conserved protein of unknown function; putative TPR domain
[Azospirillum lipoferum 4B]
Length = 877
Score = 127 bits (318), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 86/156 (55%), Gaps = 6/156 (3%)
Query: 26 NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
+ + A E P R + GLP D V+C FN YK+ P+ W +L AVP S+L
Sbjct: 640 DRRRAIAEATPM------RAECGLPADGFVFCCFNSAYKLTPALFDGWARILAAVPGSVL 693
Query: 86 WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
WL AN++ A+A G+D R++F+ EH+ R +LAD+ LDT N HTT+
Sbjct: 694 WLYAGNPQVAANLRREAEARGVDPRRLVFAKPLPHAEHLARHRLADLFLDTLPYNAHTTA 753
Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELI 181
D LW G PV+T G T A RVAAS L G PELI
Sbjct: 754 SDALWAGLPVLTRCGTTFAGRVAASLLRAAGLPELI 789
>gi|443326054|ref|ZP_21054721.1| TIGR03032 family protein [Xenococcus sp. PCC 7305]
gi|442794327|gb|ELS03747.1| TIGR03032 family protein [Xenococcus sp. PCC 7305]
Length = 1299
Score = 126 bits (317), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 62/157 (39%), Positives = 95/157 (60%), Gaps = 2/157 (1%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R+++GLP+DA V+C+FN YKI+P W+ +L+ VP S+LWL G N++A
Sbjct: 1097 VMTREEFGLPDDAFVFCSFNAHYKINPELFDAWLRILEQVPQSVLWLAS--GSGRDNLRA 1154
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A+ GL R++F+ EE++ R LAD+ LDT + N +T+ VLW+G P++T PG
Sbjct: 1155 EAKKRGLAADRLIFAEKIPHEEYLARYALADLYLDTLIYNAGSTAAAVLWSGLPMLTCPG 1214
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
T ASR+ AS G I + +EY+ A+ L T
Sbjct: 1215 NTNASRMGASICLAGGLETAICNSLEEYEQKAVYLAT 1251
>gi|445499828|ref|ZP_21466683.1| putative UDP-N-acetylglucosamine-peptide
N-acetylglucosaminyltransferase [Janthinobacterium sp.
HH01]
gi|444789823|gb|ELX11371.1| putative UDP-N-acetylglucosamine-peptide
N-acetylglucosaminyltransferase [Janthinobacterium sp.
HH01]
Length = 1067
Score = 126 bits (316), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 62/157 (39%), Positives = 96/157 (61%), Gaps = 2/157 (1%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLL-KFPAVGEANIQAT 101
+R LPEDA V+ +FN +K P +W+N+L+ VPNS+LWL+ +P V E N+
Sbjct: 417 TRASVNLPEDAFVFVSFNNNFKFTPDLFTVWMNILRRVPNSVLWLVADYPEVRE-NLYRY 475
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A+ G+ + R++F++ A E++ R QLAD+ LDT N TT+ D LW G P++T G+
Sbjct: 476 AEEAGIARERLIFNSRAVPAEYLARYQLADLFLDTFPFNAGTTASDALWAGLPLLTCAGQ 535
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
T +SR+A S L + P+LI +Y++ A+ L D
Sbjct: 536 TFSSRMAGSLLRAVDLPQLITYNFADYEEKAVALAND 572
>gi|421900323|ref|ZP_16330686.1| tetratricopeptide-like reapts harboring protein [Ralstonia
solanacearum MolK2]
gi|206591529|emb|CAQ57141.1| tetratricopeptide-like reapts harboring protein [Ralstonia
solanacearum MolK2]
Length = 672
Score = 125 bits (315), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 64/155 (41%), Positives = 90/155 (58%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R GLP D +V+C FNQ YKI + Q W +L P ++LWLL+ A A + A
Sbjct: 468 RAAVGLPADGLVFCCFNQAYKITEARAQTWFAILGRTPGAVLWLLEPDASARAALLEKAG 527
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+ R++F+ A+ H+ R QLAD+ LDT HTT+ D+LW G P++T G+T+
Sbjct: 528 RHGIASERLVFAPQVAQRAHIARLQLADLALDTFPYTSHTTASDLLWAGVPLLTRAGDTM 587
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRVAAS L G +L+ T +Y + A+RL D
Sbjct: 588 ASRVAASILQAAGLHDLVVTTEDDYVNAAVRLAGD 622
>gi|344257396|gb|EGW13500.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit [Cricetulus griseus]
Length = 1444
Score = 125 bits (314), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 54/75 (72%), Positives = 63/75 (84%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 733 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 792
Query: 61 QLYKIDPSTLQMWVN 75
QLYKIDPSTLQMW N
Sbjct: 793 QLYKIDPSTLQMWAN 807
>gi|83745815|ref|ZP_00942872.1| Predicted O-linked N-acetylglucosamine transferase [Ralstonia
solanacearum UW551]
gi|207739799|ref|YP_002258192.1| tetratricopeptide-like reapts harboring protein [Ralstonia
solanacearum IPO1609]
gi|83727505|gb|EAP74626.1| Predicted O-linked N-acetylglucosamine transferase [Ralstonia
solanacearum UW551]
gi|206593181|emb|CAQ60087.1| tetratricopeptide-like reapts harboring protein [Ralstonia
solanacearum IPO1609]
Length = 672
Score = 125 bits (314), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 64/155 (41%), Positives = 90/155 (58%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R GLP D +V+C FNQ YKI + Q W +L P ++LWLL+ A A + A
Sbjct: 468 RAAVGLPADGLVFCCFNQAYKITEARAQTWFAILGRTPGAVLWLLEPDASARAALLEKAG 527
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+ R++F+ A+ H+ R QLAD+ LDT HTT+ D+LW G P++T G+T+
Sbjct: 528 RHGIASERLVFAPQVAQRAHIARLQLADLALDTFPYTSHTTASDLLWAGVPLLTRAGDTM 587
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRVAAS L G +L+ T +Y + A+RL D
Sbjct: 588 ASRVAASILQAAGLHDLVVTTEDDYVNAAVRLAGD 622
>gi|188992878|ref|YP_001904888.1| hypothetical protein xccb100_3483 [Xanthomonas campestris pv.
campestris str. B100]
gi|167734638|emb|CAP52848.1| Conserved hypothetical protein [Xanthomonas campestris pv.
campestris]
Length = 568
Score = 125 bits (314), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 92/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR Q GLPE +V C FN YK++P ++ + VL+ VP+S+LWLL P +A ++A A
Sbjct: 367 SRTQCGLPEQGVVLCCFNNSYKLNPQSMARMLAVLREVPDSVLWLLSGPGEADARLRAFA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+D R++F +++ R + AD+ LDT N HTT+ D LWTG PV+T PGET
Sbjct: 427 HAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWTGCPVLTTPGET 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDAAFVAKAVALASD 522
>gi|340788763|ref|YP_004754228.1| TPR repeat protein [Collimonas fungivorans Ter331]
gi|340554030|gb|AEK63405.1| TPR repeat protein [Collimonas fungivorans Ter331]
Length = 730
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 89/155 (57%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
RQ GLP DA V+C+FNQ +K+ PS W +L AVP+S+LWL A +N++ AQ
Sbjct: 517 RQSEGLPNDAFVFCSFNQCFKLTPSLWDDWCQILNAVPHSVLWLAPMNAAACSNLRREAQ 576
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+ R++F+ +H R LAD+ LDT N TT+ D L G P+VT G T
Sbjct: 577 QRGVAPERLVFAARRPLAQHQSRLALADLALDTMPYNSGTTASDALRAGVPLVTAAGSTF 636
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
SR+AAS L LG LIA + Y D+AI L +D
Sbjct: 637 VSRMAASLLHNLGMDGLIAADRRSYVDLAIALASD 671
>gi|21230340|ref|NP_636257.1| hypothetical protein XCC0866 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66769666|ref|YP_244428.1| hypothetical protein XC_3364 [Xanthomonas campestris pv. campestris
str. 8004]
gi|213424033|pdb|2VSY|A Chain A, Xanthomonas Campestris Putative Ogt (Xcc0866),
Apostructure
gi|213424034|pdb|2VSY|B Chain B, Xanthomonas Campestris Putative Ogt (Xcc0866),
Apostructure
gi|215261005|pdb|2JLB|A Chain A, Xanthomonas Campestris Putative Ogt (Xcc0866), Complex
With Udp-Glcnac Phosphonate Analogue
gi|215261006|pdb|2JLB|B Chain B, Xanthomonas Campestris Putative Ogt (Xcc0866), Complex
With Udp-Glcnac Phosphonate Analogue
gi|304445671|pdb|2XGM|A Chain A, Substrate And Product Analogues As Human O-Glcnac
Transferase Inhibitors.
gi|304445672|pdb|2XGM|B Chain B, Substrate And Product Analogues As Human O-Glcnac
Transferase Inhibitors.
gi|304445673|pdb|2XGO|A Chain A, Xcogt In Complex With Udp-S-Glcnac
gi|304445674|pdb|2XGO|B Chain B, Xcogt In Complex With Udp-S-Glcnac
gi|304445675|pdb|2XGS|A Chain A, Xcogt In Complex With C-Udp
gi|304445676|pdb|2XGS|B Chain B, Xcogt In Complex With C-Udp
gi|21111893|gb|AAM40181.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
gi|66574998|gb|AAY50408.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 568
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 92/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR Q GLPE +V C FN YK++P ++ + VL+ VP+S+LWLL P +A ++A A
Sbjct: 367 SRTQCGLPEQGVVLCCFNNSYKLNPQSMARMLAVLREVPDSVLWLLSGPGEADARLRAFA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+D R++F +++ R + AD+ LDT N HTT+ D LWTG PV+T PGET
Sbjct: 427 HAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWTGCPVLTTPGET 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDAAFVAKAVALASD 522
>gi|190016248|pdb|2VSN|A Chain A, Structure And Topological Arrangement Of An O-Glcnac
Transferase Homolog: Insight Into Molecular Control Of
Intracellular Glycosylation
gi|190016249|pdb|2VSN|B Chain B, Structure And Topological Arrangement Of An O-Glcnac
Transferase Homolog: Insight Into Molecular Control Of
Intracellular Glycosylation
Length = 568
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 92/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR Q GLPE +V C FN YK++P ++ + VL+ VP+S+LWLL P +A ++A A
Sbjct: 367 SRTQCGLPEQGVVLCCFNNSYKLNPQSMARMLAVLREVPDSVLWLLSGPGEADARLRAFA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+D R++F +++ R + AD+ LDT N HTT+ D LWTG PV+T PGET
Sbjct: 427 HAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWTGCPVLTTPGET 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDAAFVAKAVALASD 522
>gi|399066550|ref|ZP_10748490.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Novosphingobium sp. AP12]
gi|398027979|gb|EJL21504.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Novosphingobium sp. AP12]
Length = 741
Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats.
Identities = 61/137 (44%), Positives = 86/137 (62%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R ++GLP+ V+ +FNQ YKI P+ +W+ +L+ VP S+LWLL A +AN++ A+
Sbjct: 534 RTRWGLPDTGFVFASFNQSYKISPAEWDIWMGLLREVPGSVLWLLDTGAQVQANLRREAE 593
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G++ R++FS A EH+ R AD+ LDT N HTT+ D LW G PV+TL G
Sbjct: 594 VRGVEPERLVFSKPIAHAEHLGRLIHADLFLDTFAVNAHTTASDALWAGLPVLTLAGRQF 653
Query: 164 ASRVAASQLATLGCPEL 180
A+RVAAS + G PEL
Sbjct: 654 AARVAASLVHAAGLPEL 670
>gi|194364498|ref|YP_002027108.1| hypothetical protein Smal_0720 [Stenotrophomonas maltophilia
R551-3]
gi|194347302|gb|ACF50425.1| Tetratricopeptide TPR_2 repeat protein [Stenotrophomonas
maltophilia R551-3]
Length = 570
Score = 125 bits (313), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 90/156 (57%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R GLPE +V+C FN YK++P ++ VL+AVP S+LWLL P +A ++A A
Sbjct: 364 TRADCGLPEHGVVFCCFNNSYKLNPRSMGRAFAVLQAVPGSVLWLLSGPGQADARLRAAA 423
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q+ GLD R++F +++ R QLAD+ LDT N HTT+ D LW G PV+T PG T
Sbjct: 424 QSAGLDPMRLVFMPKLPHPQYLARYQLADLFLDTNPYNAHTTASDALWAGCPVLTCPGAT 483
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG + A + A LG D
Sbjct: 484 FAARVAGSLNHHLGLARMNAADDAAFIATASVLGND 519
>gi|222148903|ref|YP_002549860.1| hypothetical protein Avi_2579 [Agrobacterium vitis S4]
gi|221735889|gb|ACM36852.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length = 588
Score = 125 bits (313), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 89/158 (56%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
TSR LPEDA V +FN + KI P T ++W +L A+P+SILW+L + A
Sbjct: 365 TSRSLLDLPEDAFVLASFNMVRKISPQTARLWARMLDAIPSSILWVLCAGREQRDRLTAF 424
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
G+++ R+ F+ + H+ R Q AD+ LDT NGHTT+ D LW G PV+T G
Sbjct: 425 MTGCGIERSRLYFTGAESYAPHIARMQAADLGLDTYPYNGHTTTSDKLWAGLPVITFKGS 484
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASRV+ S L LG P+L+A T + +A L DR
Sbjct: 485 NFASRVSESLLTALGVPQLVAETPDDMVRLAADLAQDR 522
>gi|402564953|ref|YP_006614298.1| tpr repeat-containing protein [Burkholderia cepacia GG4]
gi|402246150|gb|AFQ46604.1| tpr repeat-containing protein [Burkholderia cepacia GG4]
Length = 604
Score = 125 bits (313), Expect = 1e-26, Method: Composition-based stats.
Identities = 61/155 (39%), Positives = 91/155 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR GLPED V+C+FN YK P W+ +L+ VP S+LWLL EAN++ A
Sbjct: 407 SRASCGLPEDGFVFCSFNNNYKFTPEVFGAWMRILQRVPASVLWLLADNPWAEANLRKEA 466
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+D R++F++ + E ++ R +AD+ LD+ N TT+ D LW G PV+TL G++
Sbjct: 467 ARYGIDGGRLVFASRVSPENYLARFAVADLFLDSFPFNAGTTANDALWMGLPVLTLSGKS 526
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
ASR+A + L G ELI + Y++ A+ L +
Sbjct: 527 FASRMAGAMLTAAGIEELITYDLQAYEEKAVALAS 561
>gi|386335303|ref|YP_006031473.1| tetratricopeptide-like reapts harboring protein [Ralstonia
solanacearum Po82]
gi|334197753|gb|AEG70937.1| tetratricopeptide-like reapts harboring protein [Ralstonia
solanacearum Po82]
Length = 672
Score = 125 bits (313), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 63/155 (40%), Positives = 90/155 (58%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R + GLP D +V+C FNQ YKI + Q W +L P ++LWLL+ A A + A
Sbjct: 468 RAELGLPADGLVFCCFNQAYKITEARAQTWFAILGRTPGAVLWLLEPDASARAALLEKAG 527
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+ R++F+ + H+ R QLAD+ LDT HTT+ D+LW G P++T G+T+
Sbjct: 528 RHGIASERLVFAPQVVQRAHIARLQLADLALDTFPYTSHTTASDLLWAGVPLLTRAGDTM 587
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRVAAS L G +L+ T +Y + A+RL D
Sbjct: 588 ASRVAASILQAAGLHDLVVTTEDDYVNAAVRLAGD 622
>gi|296448170|ref|ZP_06890068.1| conserved hypothetical protein [Methylosinus trichosporium OB3b]
gi|296254301|gb|EFH01430.1| conserved hypothetical protein [Methylosinus trichosporium OB3b]
Length = 676
Score = 124 bits (312), Expect = 2e-26, Method: Composition-based stats.
Identities = 63/156 (40%), Positives = 87/156 (55%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SRQ+ GLPE A V+C+ N + K+ T W+ +L AVP+S+LWLL I+A A
Sbjct: 417 SRQEAGLPESAFVFCSLNGMQKLTALTFDRWMTILAAVPDSVLWLLTGTQATNERIRAAA 476
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ RI+F+ H+ R LAD+ LD HTT+ D LW G P++T PG +
Sbjct: 477 SARGVAPERIVFAQKMPNPRHLARYPLADLFLDNFPYGAHTTAADSLWMGVPILTFPGRS 536
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRV AS + G EL+ ++Y AI LG +
Sbjct: 537 FASRVCASLVRAAGVEELVCADAEDYVARAIELGRN 572
>gi|259415576|ref|ZP_05739497.1| tetratricopeptide TPR_2 repeat protein [Silicibacter sp. TrichCH4B]
gi|259348806|gb|EEW60568.1| tetratricopeptide TPR_2 repeat protein [Silicibacter sp. TrichCH4B]
Length = 582
Score = 124 bits (311), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 90/155 (58%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R ++GLPED ++ NFN +K+ PS W+++L+ VPNS+L A + + A+
Sbjct: 371 RSKFGLPEDKFIFANFNHPHKVGPSEFATWMDILRQVPNSVLLFYAGKNDLSAELASRAE 430
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A G+D RIL +++EH+ R D+CLD N HTT+ D LW G P++TL G
Sbjct: 431 AHGIDAARILPCGSLSQQEHLERIAQVDLCLDCFAYNAHTTASDALWAGVPLLTLCGRQF 490
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S ++ G PEL + + Y A+R+ T+
Sbjct: 491 AARVATSIMSAAGVPELSTDSKEAYVAAAVRIATN 525
>gi|91775742|ref|YP_545498.1| TPR repeat-containing protein [Methylobacillus flagellatus KT]
gi|91709729|gb|ABE49657.1| TPR repeat [Methylobacillus flagellatus KT]
Length = 700
Score = 124 bits (311), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 91/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR G+PEDA VYC FNQ +KI P W+ +L+A P S+LWLL+ + N++ A
Sbjct: 502 SRASCGIPEDAFVYCCFNQSFKITPEVFSCWMRLLQATPGSVLWLLEGHSNACNNLRQAA 561
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+ R++F+ ++H+ R AD+ LDT N HTT+ D LW G P++T G+
Sbjct: 562 EHHGVAASRLVFAPRVPMDQHLARHAHADLFLDTLPYNAHTTASDALWMGVPLITCSGDA 621
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRVAAS L +G EL+ Y+ +A +L D
Sbjct: 622 FASRVAASLLQAVGLKELVTTDLFAYETLARQLAHD 657
>gi|344206149|ref|YP_004791290.1| hypothetical protein [Stenotrophomonas maltophilia JV3]
gi|343777511|gb|AEM50064.1| Tetratricopeptide TPR_1 repeat-containing protein [Stenotrophomonas
maltophilia JV3]
Length = 568
Score = 124 bits (311), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 89/156 (57%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPE +V+C FN YK++P ++ VL+AVP S+LWLL P + ++ A
Sbjct: 364 SRTECGLPEHGVVFCCFNNSYKLNPRSMGRAFAVLQAVPGSVLWLLSGPGQADTRLRTAA 423
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA GLD R++F +++ R QLAD+ LDT N HTT+ D LW G PV+T PG+T
Sbjct: 424 QAAGLDPARLVFMAKLPHPQYLARYQLADLFLDTHPYNAHTTASDALWAGCPVLTCPGDT 483
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG + + A LG D
Sbjct: 484 FAARVAGSLNHHLGLARMNVADDAAFIATASALGND 519
>gi|325921456|ref|ZP_08183311.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Xanthomonas gardneri ATCC 19865]
gi|325548003|gb|EGD19002.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Xanthomonas gardneri ATCC 19865]
Length = 568
Score = 124 bits (311), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 91/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR Q GLPE +V C FN YK++P ++ + VL+AVP S+LWLL P +A ++A A
Sbjct: 367 SRTQCGLPEHGVVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGEADARLRAVA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA +D R++F +++ R + AD+ LDT N HTT+ D LW G PV+T PGET
Sbjct: 427 QAQDVDAQRLVFMPKLPHAQYLARYRHADLFLDTHPYNAHTTASDALWVGCPVLTTPGET 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLAEMNVADDAAFVAKAVALASD 522
>gi|304312282|ref|YP_003811880.1| hypothetical protein HDN1F_26540 [gamma proteobacterium HdN1]
gi|301798015|emb|CBL46237.1| Conserved hypothetical protein [gamma proteobacterium HdN1]
Length = 730
Score = 124 bits (311), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 58/157 (36%), Positives = 96/157 (61%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R++YGLP V C+ N +KI P +W+N+L+ VPNS+LWLL + N+ A
Sbjct: 402 TRKEYGLPARKTVLCSLNNNHKITPEVFDVWMNILRRVPNSVLWLLADNEWAKENLIKEA 461
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G+ +++F+ + + +++ R +AD+ LDT N TT+ D LW G PV+T+ G
Sbjct: 462 KARGIPAKQLVFAERSGQGDYLARYAVADLFLDTFPFNAGTTANDALWMGLPVLTMSGRA 521
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASR+A + L G PELI + Y+++A++L +D+
Sbjct: 522 FASRMAGALLTAAGLPELITHDLQGYEELAVQLASDK 558
>gi|299135117|ref|ZP_07028308.1| Tetratricopeptide TPR_2 repeat protein [Afipia sp. 1NLS2]
gi|298590094|gb|EFI50298.1| Tetratricopeptide TPR_2 repeat protein [Afipia sp. 1NLS2]
Length = 732
Score = 124 bits (310), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 61/156 (39%), Positives = 93/156 (59%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+ GLP +V+C FN +KI P +W+ +L V S+LWL + A +AN+ A
Sbjct: 518 SREDVGLPASGLVFCCFNNTFKITPEVFDIWMRLLGQVEGSVLWLFEGNATAKANLIKEA 577
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ R++F+ +A ++ R QLAD+ LD NGH+T+ D L G PV+T G +
Sbjct: 578 VARGISPERLVFAPLADYATYLARTQLADLFLDNNYWNGHSTASDALRCGVPVITCQGTS 637
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRVA+S L + PELI + ++Y+ +A++L D
Sbjct: 638 FASRVASSLLRAIEMPELITTSLEDYEALALKLACD 673
>gi|99080945|ref|YP_613099.1| hypothetical protein TM1040_1104 [Ruegeria sp. TM1040]
gi|99037225|gb|ABF63837.1| TPR repeat [Ruegeria sp. TM1040]
Length = 616
Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats.
Identities = 61/161 (37%), Positives = 94/161 (58%)
Query: 40 VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ 99
V +R + LPE+ V+ +FN YK+ P +W+++LK VP+S+LW A ++
Sbjct: 403 VADTRASHDLPEEGFVFSSFNNPYKVTPREFGIWMDLLKEVPDSVLWFYVSKAEIIDRLR 462
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A++ G+D RI+ + E H+ R + AD+ LDT N HTT+ D LW G PVVT
Sbjct: 463 KEAESRGVDGARIIPTGRMQPEYHLARLKHADLFLDTFNVNAHTTASDALWAGLPVVTKT 522
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
GE A+RVA S L+ G +L+ + K+Y ++A+R+ D D
Sbjct: 523 GEQFAARVAGSILSAAGLEDLVTHSEKKYYEVALRIAQDPD 563
>gi|410462143|ref|ZP_11315745.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Desulfovibrio magneticus str. Maddingley MBC34]
gi|409984752|gb|EKO41039.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Desulfovibrio magneticus str. Maddingley MBC34]
Length = 672
Score = 124 bits (310), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 61/157 (38%), Positives = 90/157 (57%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPE+ +V+C FN +KI +T W+ +L P S+LWLL + A
Sbjct: 409 SRAEAGLPEEGMVFCCFNGAHKIHRATFDRWLEILARTPGSVLWLLGGDEGVSKRLGDYA 468
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G+ + R++F+ A H+ R LAD+ LDT HTT+ D LW+G PV+T+ G +
Sbjct: 469 EAKGIARERLVFAKKTANAAHLARYPLADLFLDTTPYGAHTTASDALWSGVPVLTVSGRS 528
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASRV S + G PEL+ T ++Y + A+ G DR
Sbjct: 529 FASRVCGSLVRAAGLPELVCATTQDYVERAVAYGNDR 565
>gi|114769247|ref|ZP_01446873.1| TPR repeat [Rhodobacterales bacterium HTCC2255]
gi|114550164|gb|EAU53045.1| TPR repeat [Rhodobacterales bacterium HTCC2255]
Length = 688
Score = 123 bits (309), Expect = 3e-26, Method: Composition-based stats.
Identities = 61/156 (39%), Positives = 92/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR ++GLPE+ IV+C FN YKI +W+ +L S+LWL K + NIQ A
Sbjct: 473 SRSEFGLPENEIVFCCFNNNYKITADEFNIWMRILIKNKKSVLWLKKSNKWSKENIQLAA 532
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G++ RI+F+ E+H+ +LAD+ +DT N HTT+ + LW G PV+T G+
Sbjct: 533 KERGVNPKRIIFAEKLPIEKHLASYKLADIFIDTFSYNAHTTATEALWAGLPVITKIGKG 592
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S L+ L ELI + EY+++ +L +D
Sbjct: 593 FAARVAGSLLSALDLKELITASEIEYENLIKKLASD 628
>gi|16125359|ref|NP_419923.1| hypothetical protein CC_1107 [Caulobacter crescentus CB15]
gi|221234101|ref|YP_002516537.1| O-linked GLCNAC transferase [Caulobacter crescentus NA1000]
gi|13422417|gb|AAK23091.1| TPR domain protein [Caulobacter crescentus CB15]
gi|220963273|gb|ACL94629.1| O-linked GLCNAC transferase [Caulobacter crescentus NA1000]
Length = 672
Score = 123 bits (309), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 89/159 (55%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R + GLPED V+C FN KI P W+ +LK S+LWLL A ++
Sbjct: 409 VPTRTEAGLPEDGFVFCCFNGTQKITPHVFDRWMEILKRTSGSVLWLLDSNPETNARLRD 468
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A+A G+D+ R++F+ H+ R +LAD+ LDT HTT+ D LW PV+T G
Sbjct: 469 AAEARGVDRMRLVFAPKMPNAFHLARYRLADLFLDTTPYGAHTTASDALWMAVPVLTWSG 528
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+ ASRV S + + G PEL+ + Y + A+ +G+DR
Sbjct: 529 RSFASRVCGSLVRSAGLPELVVDSAAAYVEKAVEIGSDR 567
>gi|386717186|ref|YP_006183512.1| glycosyl transferases group 1:TPR repeat [Stenotrophomonas
maltophilia D457]
gi|384076748|emb|CCH11333.1| Glycosyl transferases group 1:TPR repeat [Stenotrophomonas
maltophilia D457]
Length = 568
Score = 123 bits (309), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 70/175 (40%), Positives = 94/175 (53%), Gaps = 6/175 (3%)
Query: 24 QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
Q + T T E P SR + GLPE +V+C FN YK++P ++ VL+AVP S
Sbjct: 351 QPSDNTRTLEPSP------SRAECGLPEHGVVFCCFNNSYKLNPRSMGRAFAVLQAVPGS 404
Query: 84 ILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
+LWLL P +A ++ AQA GLD R++F ++ R LAD+ LDT N HT
Sbjct: 405 LLWLLSGPGQADARLRTAAQAAGLDPARLVFMAKLPHPRYLARYPLADLFLDTHPYNAHT 464
Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
T+ D LW G PV+T PG+T A+RVA S LG + + A LG D
Sbjct: 465 TASDALWAGCPVLTCPGDTFAARVAGSLNHHLGLARMNVADDAAFIATASALGND 519
>gi|445499833|ref|ZP_21466688.1| putative UDP-N-acetylglucosamine-peptide
N-acetylglucosaminyltransferase [Janthinobacterium sp.
HH01]
gi|444789828|gb|ELX11376.1| putative UDP-N-acetylglucosamine-peptide
N-acetylglucosaminyltransferase [Janthinobacterium sp.
HH01]
Length = 1058
Score = 123 bits (309), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 60/156 (38%), Positives = 89/156 (57%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R LPE A V+C+FN YK P W+N+L+ VP S+LWL+ + N++ A
Sbjct: 413 TRASVKLPEHAFVFCSFNNSYKFKPELFAHWMNILRRVPGSVLWLVADTPLVADNLKRHA 472
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+D R++F+ A +++ R QLAD+ LDT N TT+ D LW G P++T G T
Sbjct: 473 AEAGIDSDRLIFAERALPADYLARYQLADLFLDTYPFNAGTTASDALWAGLPLLTCAGPT 532
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASR+A S L + P+LI +Y++ A+ L D
Sbjct: 533 FASRMAGSLLRAVDLPQLITYNFADYEEQAVALAND 568
>gi|253996225|ref|YP_003048289.1| hypothetical protein Mmol_0852 [Methylotenera mobilis JLW8]
gi|253982904|gb|ACT47762.1| Tetratricopeptide TPR_2 repeat protein [Methylotenera mobilis JLW8]
Length = 701
Score = 123 bits (308), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 90/154 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+RQ LP++A V+C+FNQ +KI +W+ +L +PNS+LWLL + N+ A
Sbjct: 504 TRQACNLPDNAFVFCSFNQSFKITADVFSIWLRLLNKLPNSVLWLLDCNTWAKQNLWLAA 563
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+ + RI+F+ +H+ R AD+ LDT N HTT D LW G PV+T G+T
Sbjct: 564 QRSGIAKERIVFAPRVNIADHLARHIHADLFLDTQPYNAHTTCSDALWMGLPVLTCAGDT 623
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
+SRVA S L G P+LI + EY+ A++L
Sbjct: 624 FSSRVAGSLLQAAGLPQLITHSLVEYEARALQLA 657
>gi|398872865|ref|ZP_10628142.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Pseudomonas sp. GM74]
gi|398201465|gb|EJM88343.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Pseudomonas sp. GM74]
Length = 611
Score = 123 bits (308), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 63/160 (39%), Positives = 92/160 (57%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ +R + GLP++ V+C+FN YK + W+ +L+ P+S+LWLL + N+ A
Sbjct: 412 LPTRAECGLPQERFVFCSFNNNYKFNEEVFDCWMRILQRAPDSVLWLLADNPWAQENLCA 471
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
AQA G++ R+LF+ A +++ R AD+ LD N TT+ D LW G PV+T G
Sbjct: 472 RAQAHGVNPARLLFAPRVAPAQYLARYSAADLFLDAYPFNAGTTANDALWMGLPVLTRSG 531
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
T ASR+A S L L PELI T EY++ A+ L T D
Sbjct: 532 RTFASRMAGSLLTALDLPELITTTLAEYEERAVELATQAD 571
>gi|16127948|ref|NP_422512.1| hypothetical protein CC_3718 [Caulobacter crescentus CB15]
gi|221236770|ref|YP_002519207.1| hypothetical protein CCNA_03834 [Caulobacter crescentus NA1000]
gi|13425486|gb|AAK25680.1| TPR domain protein [Caulobacter crescentus CB15]
gi|220965943|gb|ACL97299.1| tetratricopeptide repeat family protein [Caulobacter crescentus
NA1000]
Length = 596
Score = 123 bits (308), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 88/160 (55%)
Query: 39 IVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI 98
I SR GLPE V+C FN KI P W+ +L+A P+S+LWL N+
Sbjct: 401 IPAPSRADMGLPERVRVFCCFNNPAKITPEVFATWMAILRAAPDSVLWLYAGAPGAADNL 460
Query: 99 QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
+ A+A G+ R++F+ A E H+ R LAD+ LDT HTT+ D L G PV+TL
Sbjct: 461 RGHARAAGVAPERLVFAEPAPHEAHLARHVLADLVLDTWPYGAHTTASDALRMGVPVLTL 520
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
PGE+ ASRV AS +G ELIA + +Y A+ + D
Sbjct: 521 PGESFASRVGASLATAIGMTELIATSRADYVAKALAMAAD 560
>gi|303246263|ref|ZP_07332543.1| conserved hypothetical protein [Desulfovibrio fructosovorans JJ]
gi|302492326|gb|EFL52198.1| conserved hypothetical protein [Desulfovibrio fructosovorans JJ]
Length = 676
Score = 123 bits (308), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 60/156 (38%), Positives = 87/156 (55%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R + GLPE+ +VYC FN L+K+ T W+++L VP S+LWLL E ++ A
Sbjct: 408 TRAEAGLPEEGMVYCCFNGLHKVHRFTFDRWLSILAGVPGSVLWLLGSSETVEKRLRDYA 467
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+ RI+F+ H+ R LAD+ LDT HTT+ D LW G PV+TL G +
Sbjct: 468 AGRGIAGERIVFAPKVKNPNHLARYPLADLFLDTTPYGAHTTASDALWMGVPVLTLSGRS 527
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASRV S + G PE++ T + + + A+ G D
Sbjct: 528 FASRVCGSLVRAAGLPEMVCETPEAFVERAVAFGKD 563
>gi|118594932|ref|ZP_01552279.1| SPY protein [Methylophilales bacterium HTCC2181]
gi|118440710|gb|EAV47337.1| SPY protein [Methylophilales bacterium HTCC2181]
Length = 621
Score = 123 bits (308), Expect = 5e-26, Method: Composition-based stats.
Identities = 64/160 (40%), Positives = 93/160 (58%), Gaps = 1/160 (0%)
Query: 33 EEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA 92
++ P++ I + YG+ +DA V+ +F Q KI S ++W+ +LK SILWLL+ A
Sbjct: 416 DDRPKTQAI-KKSVYGIDDDAFVFASFGQSIKITESMFRLWIKLLKQKDGSILWLLESNA 474
Query: 93 VGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTG 152
E NI A+ G+D RI F+ +H+ R + D+ LDT N HT+S D +W G
Sbjct: 475 QAENNIHKYAKKEGVDSVRIKFAPKVNFNDHINRHSIIDLFLDTYPYNAHTSSSDAIWAG 534
Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIA 192
PV+TL G+T ASRVA S L + C LI +T +EY +A
Sbjct: 535 CPVLTLSGKTFASRVAGSILKEISCEMLITKTEEEYFKVA 574
>gi|348687110|gb|EGZ26924.1| hypothetical protein PHYSODRAFT_308484 [Phytophthora sojae]
Length = 1367
Score = 123 bits (308), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 61/159 (38%), Positives = 98/159 (61%), Gaps = 5/159 (3%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA-- 100
+R Q+GLP DA+V+CNFN + K++ + +W+++LK VP S+LWLL V + ++
Sbjct: 528 NRSQHGLPSDAVVFCNFNTINKMESESFSVWMSILKQVPKSVLWLLAPSGVDASRVKELL 587
Query: 101 --TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A A G+ R++F+ K H+ R +AD+ LD+ + N H+T+ D LW P+VT+
Sbjct: 588 HDQAMAHGVLPSRVIFAPRVDKFSHLARVTVADLFLDSFIYNAHSTAADALWANVPIVTI 647
Query: 159 PGETLASRVAASQLA-TLGCPELIARTHKEYQDIAIRLG 196
G+T SRVAAS + + PELI+ + K+Y+ A+ L
Sbjct: 648 WGDTFPSRVAASLIQNAIPFPELISHSVKDYERTAVYLA 686
>gi|83311603|ref|YP_421867.1| O-linked N-acetylglucosamine transferase [Magnetospirillum
magneticum AMB-1]
gi|82946444|dbj|BAE51308.1| Predicted O-linked N-acetylglucosamine transferase
[Magnetospirillum magneticum AMB-1]
Length = 658
Score = 122 bits (307), Expect = 6e-26, Method: Composition-based stats.
Identities = 60/160 (37%), Positives = 92/160 (57%)
Query: 40 VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ 99
V SR GLP+D +V C+FN K+D +T +W+ +L +P ++LWLL P + ++
Sbjct: 448 VAPSRASLGLPDDRLVLCSFNGHRKLDRATFTLWLEILAELPQAVLWLLSPPDLARQRLE 507
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A A G+D R++++ + +H+ R AD+ +D +C HTT+ D L G P++T
Sbjct: 508 AAAAKAGIDSARLIWAPSLPRPDHLARLPAADLFVDALVCGAHTTAADSLRMGVPLITAA 567
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G L SRVAAS L LG P+L + + +AI LG DR
Sbjct: 568 GNRLGSRVAASLLHALGLPDLAVESPSALKALAIELGRDR 607
>gi|239908057|ref|YP_002954798.1| hypothetical protein DMR_34210 [Desulfovibrio magneticus RS-1]
gi|239797923|dbj|BAH76912.1| hypothetical protein [Desulfovibrio magneticus RS-1]
Length = 672
Score = 122 bits (307), Expect = 6e-26, Method: Composition-based stats.
Identities = 62/157 (39%), Positives = 89/157 (56%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPE A+V+C FN +KI +T W+ +L P S+LWLL + A
Sbjct: 409 SRAEAGLPEGAMVFCCFNGAHKIHRATFDRWLEILARAPGSVLWLLGGDEGVSKRLSDYA 468
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ + R++F+ A H+ R LAD+ LDT HTT+ D LW+G PV+T+ G +
Sbjct: 469 AAKGVARERLVFAKKTANAAHLARYPLADLFLDTTPYGAHTTASDALWSGVPVLTVSGRS 528
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASRV S + G PEL+ T ++Y + A+ G DR
Sbjct: 529 FASRVCGSLVRAAGLPELVCETTQDYVERAVAYGNDR 565
>gi|333981959|ref|YP_004511169.1| hypothetical protein [Methylomonas methanica MC09]
gi|333806000|gb|AEF98669.1| Tetratricopeptide TPR_2 repeat-containing protein [Methylomonas
methanica MC09]
Length = 604
Score = 122 bits (307), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 60/158 (37%), Positives = 92/158 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+ LP + V+C+FN YK P + W+N+L+ VP+S LWLL E N++ A
Sbjct: 407 SRESCNLPAEGFVFCSFNNNYKYTPELFETWMNILRRVPDSTLWLLADNPWSEINLRNHA 466
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+D +R++F+ A E+++ R +AD+ LD+ N TT+ D LW G PV+TL G +
Sbjct: 467 KNFGIDPNRLIFAGRVAPEDYLARYAVADLFLDSFPFNAGTTANDALWMGLPVLTLTGRS 526
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
ASR+A + L PELI +Y++ A+ L D
Sbjct: 527 FASRMAGALLTAAELPELITYNLLDYENKAVELAQAPD 564
>gi|356960833|ref|ZP_09063815.1| TPR repeat-containing protein, partial [gamma proteobacterium SCGC
AAA001-B15]
Length = 262
Score = 122 bits (307), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 93/156 (59%), Gaps = 1/156 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R + GLP + V+C FN YKI P T W+ +L V S+LWL + + N++ A
Sbjct: 52 TRAEIGLPINGFVFCCFNNHYKITPDTFIGWMRILSQVDGSVLWLSEANSTAVNNLKKEA 111
Query: 103 QALGLDQHRILFS-NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
+ G+D+ R++F+ + KE+H+ R + AD+ +DT N HTT+ D L G PV+T G
Sbjct: 112 KKNGVDEDRLIFAPRLTFKEDHLNRIKQADLFIDTLPYNAHTTASDALRMGLPVLTCIGS 171
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
+ ASRVAAS L + PELI T ++Y+ AI L T
Sbjct: 172 SFASRVAASLLNAVNLPELITTTQEQYESFAIELAT 207
>gi|384426734|ref|YP_005636091.1| hypothetical protein XCR_1056 [Xanthomonas campestris pv. raphani
756C]
gi|341935834|gb|AEL05973.1| tetratricopeptide repeat domain protein [Xanthomonas campestris pv.
raphani 756C]
Length = 568
Score = 122 bits (307), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 91/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR Q GLP +V C FN YK++P ++ + VL+AVP+S+LWLL P +A ++A A
Sbjct: 367 SRTQCGLPAQGVVLCCFNNSYKLNPRSMARMLAVLRAVPDSVLWLLSGPGEADARLRAFA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+D R++F +++ R + AD+ LDT N HTT+ D LW G PV+T PGET
Sbjct: 427 HAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWVGCPVLTTPGET 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVVDDAAFVAKAVALASD 522
>gi|50543546|ref|XP_499939.1| YALI0A10296p [Yarrowia lipolytica]
gi|49645804|emb|CAG83866.1| YALI0A10296p [Yarrowia lipolytica CLIB122]
Length = 1330
Score = 122 bits (307), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 67/158 (42%), Positives = 95/158 (60%), Gaps = 8/158 (5%)
Query: 44 RQQY--GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
RQQ +P++A++ +FNQLYKI P TL MW+ +L+ PN+ LWLL+FP GE+ I+
Sbjct: 1103 RQQIFPDIPKEAVLLGSFNQLYKITPETLFMWLLILQRQPNAYLWLLQFPPAGESKIREM 1162
Query: 102 AQAL--GLDQ--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
A G D RILF+ V K V R ++ D+ LDTP CN HTT+ DV+W GTP++T
Sbjct: 1163 ASKWYQGTDDINKRILFTPVTDKNVFVTRSRVCDLFLDTPECNAHTTAADVIWNGTPILT 1222
Query: 158 LP--GETLASRVAASQLATLGCPELIARTHKEYQDIAI 193
P ++SRVA+S +A A Y+D+ +
Sbjct: 1223 FPKHDHKMSSRVASSIIAASIPHPPSAEAEAMYRDLVV 1260
>gi|405378056|ref|ZP_11031985.1| hypothetical protein PMI11_01952 [Rhizobium sp. CF142]
gi|397325406|gb|EJJ29742.1| hypothetical protein PMI11_01952 [Rhizobium sp. CF142]
Length = 639
Score = 122 bits (307), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 62/161 (38%), Positives = 93/161 (57%), Gaps = 3/161 (1%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
PQ + +R+Q GLPE+ ++ +FN KI + W +LK P+S+LWL+ +
Sbjct: 414 PQPV---TREQLGLPENTFIFASFNGNRKITSQVVDSWSRILKRAPDSVLWLMANTTRNK 470
Query: 96 ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
N+ QA G+ RI+F A E+H+ R Q AD+ +DT NGHTT+ + LW G PV
Sbjct: 471 ENLLERFQANGISAKRIIFCPRAPYEQHISRQQAADLGIDTFPVNGHTTTSEQLWGGLPV 530
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
+T+ G ASRV+ S L + PEL+A + Y+D+A+ L
Sbjct: 531 LTVKGTNFASRVSESLLRAIDLPELVADDLQAYEDLAVELA 571
>gi|254522827|ref|ZP_05134882.1| TPR repeat protein [Stenotrophomonas sp. SKA14]
gi|219720418|gb|EED38943.1| TPR repeat protein [Stenotrophomonas sp. SKA14]
Length = 568
Score = 122 bits (306), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 89/156 (57%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R + GLPE +V+C FN YK++P ++ VL+AVP S+LWLL P +A ++A A
Sbjct: 364 TRAECGLPERGVVFCCFNNSYKLNPRSMGRAFAVLQAVPGSVLWLLSGPGQADARLRAAA 423
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q GLD R++F +++ R LAD+ LDT N HTT+ D LW G PV+T PG+T
Sbjct: 424 QKAGLDPARLVFMAKLPHPQYLARYPLADLFLDTHPYNAHTTASDALWAGCPVLTCPGDT 483
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG + + A LG D
Sbjct: 484 FAARVAGSLNHHLGLARMNVADDAAFIATASALGND 519
>gi|428175509|gb|EKX44399.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
[Guillardia theta CCMP2712]
Length = 1433
Score = 122 bits (306), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 63/159 (39%), Positives = 95/159 (59%), Gaps = 3/159 (1%)
Query: 44 RQQYG-LPED-AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV-GEANIQA 100
R YG LP D ++ FNQL+KI+P W+ ++ P + LWL+K + E N++A
Sbjct: 1211 RLAYGSLPPDIPFLFGAFNQLHKIEPDIWDTWMEAMRNAPWAHLWLIKIHSRDAELNLKA 1270
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A+ LG+++ R+ +EEH+ AD+ LDTP N H+T DVLW+GTP++T+PG
Sbjct: 1271 RARQLGVEEERVHVVQGYNEEEHLYVKAAADLFLDTPSYNAHSTGCDVLWSGTPLLTIPG 1330
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+ + SRVAAS +GC E RT EY ++A+ L +
Sbjct: 1331 DKMGSRVAASLNRAIGCEETTVRTEDEYAELAVALAKGK 1369
>gi|424667172|ref|ZP_18104197.1| hypothetical protein A1OC_00730 [Stenotrophomonas maltophilia
Ab55555]
gi|401069841|gb|EJP78362.1| hypothetical protein A1OC_00730 [Stenotrophomonas maltophilia
Ab55555]
Length = 570
Score = 122 bits (306), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 88/156 (56%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R GLP +V+C FN YK++P ++ VL+AVP S+LWLL P +A ++ A
Sbjct: 364 TRANCGLPAQGVVFCCFNNSYKLNPRSMGRAFAVLQAVPGSVLWLLSGPGQADARLRTAA 423
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA GLD R++F +++ R QLAD+ LDT N HTT+ D LW G PV+T PG+T
Sbjct: 424 QAAGLDPARLVFMAKLPHPQYLARYQLADLFLDTHPYNAHTTASDALWAGCPVLTCPGDT 483
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG + + A LG D
Sbjct: 484 FAARVAGSLNHHLGMARMNVADDAAFITTASALGND 519
>gi|301123575|ref|XP_002909514.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase 110
kDa subunit, putative [Phytophthora infestans T30-4]
gi|262100276|gb|EEY58328.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase 110
kDa subunit, putative [Phytophthora infestans T30-4]
Length = 1353
Score = 122 bits (306), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 101/160 (63%), Gaps = 7/160 (4%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-----N 97
+R Q+GLP D +V+CNFN + K++ + +W+++L+ +PNS+LWLL P+ +A
Sbjct: 527 NRSQHGLPTDGLVFCNFNTINKMESESFAVWMSILRQIPNSVLWLLA-PSGPDALRVMEK 585
Query: 98 IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ A A G+ R++F+ K H+ R +AD+ LD+ + N H+T+ D LW P+VT
Sbjct: 586 LHDQAMAHGVLPSRVIFAPRVDKYSHLARITVADIFLDSFVYNAHSTAADALWANVPIVT 645
Query: 158 LPGETLASRVAASQLAT-LGCPELIARTHKEYQDIAIRLG 196
L G+T SRVAAS + + PEL++++ K+Y+ +A+ L
Sbjct: 646 LWGDTFPSRVAASLIRNAIPYPELVSQSVKDYERMAVYLA 685
>gi|333902093|ref|YP_004475966.1| hypothetical protein Psefu_3916 [Pseudomonas fulva 12-X]
gi|333117358|gb|AEF23872.1| Tetratricopeptide repeat-containing protein [Pseudomonas fulva
12-X]
Length = 611
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/158 (39%), Positives = 89/158 (56%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ SR + GLPED V+C FN YK + W+ +L AVP+S+ WLL +AN+ A
Sbjct: 411 LPSRSECGLPEDRFVFCCFNNNYKFNEQMFSCWMRILAAVPDSLFWLLADNQWSQANLIA 470
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A+ G+ R++F+ A + ++ R +AD+ LD NG TT+ D LW G PV+T G
Sbjct: 471 CAERHGVSADRLVFAPRVAPDLYLARYTVADLFLDAYPFNGGTTANDALWMGLPVLTRSG 530
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
T ASR+A + L L PELI +Y+ AI L D
Sbjct: 531 RTFASRMAGALLTALELPELITENLADYEQRAIELARD 568
>gi|145590007|ref|YP_001156604.1| hypothetical protein Pnuc_1827 [Polynucleobacter necessarius subsp.
asymbioticus QLW-P1DMWA-1]
gi|145048413|gb|ABP35040.1| TPR repeat-containing protein [Polynucleobacter necessarius subsp.
asymbioticus QLW-P1DMWA-1]
Length = 761
Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats.
Identities = 62/158 (39%), Positives = 92/158 (58%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
I SR+ GLP+D V+ FN YKI P+T W+N+LKA P S+L+L + N+
Sbjct: 551 IFSRESLGLPKDEFVFACFNNNYKILPATFNSWMNILKATPKSVLYLYADNPWSKDNLMK 610
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A+A G+ R++F ++++ R + D+ LDT N TT+ D LW G PV+TL G
Sbjct: 611 EAEARGVKADRLIFGGRIDADQYLARYRACDLFLDTAPYNAGTTASDALWAGLPVLTLIG 670
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
++ SRVA+S L +G PEL+ + EY+ AI L +
Sbjct: 671 QSFPSRVASSLLNAVGLPELVTSSAAEYEIRAIELAMN 708
>gi|428185342|gb|EKX54195.1| hypothetical protein GUITHDRAFT_84189 [Guillardia theta CCMP2712]
Length = 367
Score = 121 bits (304), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 59/150 (39%), Positives = 85/150 (56%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP + V CNFNQ +K+DP T W ++L V N++LW+L+ V E N++ QA G+
Sbjct: 138 LPATSFVLCNFNQHFKVDPVTFSSWTSLLGNVSNAVLWMLEGTPVSERNLRRELQAAGVS 197
Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R++F+ AA ++H+RR L D+ LDTP N T D L+ G P++ LP R+
Sbjct: 198 PDRLIFARRAAVKDHLRRAALCDLSLDTPNYNSGATGADTLFAGVPILHLPSTKPIGRMM 257
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+S L L ELI R H EY +A R +
Sbjct: 258 SSMLRALRLQELIVRDHDEYFKLAHRFARE 287
>gi|73540476|ref|YP_294996.1| hypothetical protein Reut_A0772 [Ralstonia eutropha JMP134]
gi|72117889|gb|AAZ60152.1| TPR repeat protein [Ralstonia eutropha JMP134]
Length = 611
Score = 121 bits (304), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 61/154 (39%), Positives = 88/154 (57%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+ LP D V+C+FN YK P+ W+ +L VP S+LWLL EAN++ A
Sbjct: 414 SRESCNLPADGFVFCSFNNNYKFTPAVFGTWMRILSRVPGSVLWLLADNEWAEANLRKEA 473
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ LG+D R++F+ A E ++ R AD+ LDT N TT+ D LWTG PV+T G +
Sbjct: 474 EKLGVDGGRLIFAPRVAPENYLARFAAADLFLDTFPFNAGTTANDALWTGLPVLTRSGRS 533
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
ASR+A + L ELI + +Y++ A+ L
Sbjct: 534 FASRMAGALLTAAELDELITYSPNDYEETAVALA 567
>gi|297538669|ref|YP_003674438.1| hypothetical protein M301_1481 [Methylotenera versatilis 301]
gi|297258016|gb|ADI29861.1| TPR repeat-containing protein [Methylotenera versatilis 301]
Length = 629
Score = 121 bits (304), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/156 (39%), Positives = 91/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+ LPE A V+C FNQ +KI P ++W+ +L AVPNS+LWLL+ + N+ A
Sbjct: 421 SRESCNLPEGAFVFCCFNQSFKITPEFFKVWMRLLNAVPNSVLWLLESNTWAKQNLINQA 480
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ R++F+ + +H+ R AD+ LDT N HTT D LW G PV+T G+T
Sbjct: 481 AKNNITAERLIFAPRMSIADHLARHIHADLFLDTSPYNAHTTCSDGLWMGLPVLTCVGDT 540
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S L PELI + ++Y++ A+ L +
Sbjct: 541 FAARVAGSLLTAADMPELITYSLQDYENKALYLANN 576
>gi|46202065|ref|ZP_00053827.2| COG3914: Predicted O-linked N-acetylglucosamine transferase,
SPINDLY family [Magnetospirillum magnetotacticum MS-1]
Length = 722
Score = 121 bits (304), Expect = 1e-25, Method: Composition-based stats.
Identities = 66/170 (38%), Positives = 90/170 (52%), Gaps = 6/170 (3%)
Query: 30 ATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK 89
+ E VP+ R +GLPE V+C FN K P +W+ +LKAVP+S+LWLL
Sbjct: 506 SMAESVPK------RSDFGLPEAGFVFCCFNNAGKFTPEVFAVWMRLLKAVPDSVLWLLD 559
Query: 90 FPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVL 149
+ NI A A G+D+ R++ + H+ R QLAD+ LDT N H T+ D L
Sbjct: 560 RNGTVKDNILAQVDAHGVDRGRVVLAPRVPLPLHLARQQLADLFLDTLPYNAHVTASDAL 619
Query: 150 WTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
W G P++T G A RVA S L L ELI ++Y+ A+ L DR
Sbjct: 620 WAGLPLLTCLGHAFAGRVAGSLLKVLELDELITTDLEQYEARALELARDR 669
>gi|410692351|ref|YP_003622972.1| putative UDP-N-ACETYLGLUCOSAMINE--PEPTIDE
N-ACETYLGLUCOSAMINYLTRANSFERASE SUBUNIT [Thiomonas sp.
3As]
gi|294338775|emb|CAZ87109.1| putative UDP-N-ACETYLGLUCOSAMINE--PEPTIDE
N-ACETYLGLUCOSAMINYLTRANSFERASE SUBUNIT [Thiomonas sp.
3As]
Length = 753
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 89/153 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+Q GLP V+C FN KI W+ +L+ P S+LWL + + + N++ A
Sbjct: 540 AREQAGLPPQGFVFCCFNNTNKISAEVFARWMRILQRTPGSVLWLYRTHDLVDENLRRAA 599
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
ALG++ R++F+ +E H+ R QLAD+ LDT HTT+ D L G PV+T+PG+T
Sbjct: 600 AALGVEPQRLVFAPHLPREWHMARLQLADLFLDTTPYGAHTTTGDALRAGVPVLTVPGQT 659
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
ASRVAAS L PE I Y++ A+RL
Sbjct: 660 FASRVAASMLDAARLPECIQPDWPAYEEEAVRL 692
>gi|296134823|ref|YP_003642065.1| Tetratricopeptide TPR_4 [Thiomonas intermedia K12]
gi|295794945|gb|ADG29735.1| Tetratricopeptide TPR_4 [Thiomonas intermedia K12]
Length = 733
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 89/153 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+Q GLP V+C FN KI W+ +L+ P S+LWL + + + N++ A
Sbjct: 520 AREQAGLPPQGFVFCCFNNTNKISAEVFARWMRILQRTPGSVLWLYRTHDLVDENLRRAA 579
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
ALG++ R++F+ +E H+ R QLAD+ LDT HTT+ D L G PV+T+PG+T
Sbjct: 580 AALGVEPQRLVFAPHLPREWHMARLQLADLFLDTTPYGAHTTTGDALRAGVPVLTVPGQT 639
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
ASRVAAS L PE I Y++ A+RL
Sbjct: 640 FASRVAASMLDAARLPECIQPDWPAYEEEAVRL 672
>gi|171463940|ref|YP_001798053.1| hypothetical protein Pnec_1319 [Polynucleobacter necessarius subsp.
necessarius STIR1]
gi|171193478|gb|ACB44439.1| TPR repeat-containing protein [Polynucleobacter necessarius subsp.
necessarius STIR1]
Length = 295
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/144 (43%), Positives = 82/144 (56%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPE VYC FN YKI P W +L AV S+LWL++ A E N++A
Sbjct: 144 SRTELGLPEAGFVYCCFNNNYKITPEIFDRWTQILLAVEGSVLWLIQDNAPAEKNLKAEV 203
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+ RI+F+ + EH+ R ++AD+ LDT N HTT+ D LW G PV+TL G T
Sbjct: 204 LKRGVAPERIIFAKRLSLPEHLARHKMADLFLDTLPYNAHTTASDSLWAGVPVLTLLGST 263
Query: 163 LASRVAASQLATLGCPELIARTHK 186
RV+AS L L EL+ + K
Sbjct: 264 FPGRVSASLLNALDLQELLLKHQK 287
>gi|408822221|ref|ZP_11207111.1| hypothetical protein PgenN_03833 [Pseudomonas geniculata N1]
Length = 568
Score = 120 bits (302), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 88/156 (56%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR GLPE +V+C FN YK++P ++ +L+AVP S+LWL+ P +A ++A A
Sbjct: 364 SRADCGLPEQGVVFCCFNNSYKLNPRSMGRAFAILQAVPGSVLWLMSGPGQADARLRAAA 423
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q GLD R++F +++ R LAD+ LDT N HTT+ D LW G PV+T PG+T
Sbjct: 424 QTAGLDPARLVFMAKLPHPQYLARYLLADLFLDTYPYNAHTTASDALWAGCPVLTCPGDT 483
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG + + A LG D
Sbjct: 484 FAARVAGSLNHHLGLARMNVADDAGFIATASALGND 519
>gi|347736202|ref|ZP_08868901.1| glycosyl transferase group 1:TPR repeat-containing protein
[Azospirillum amazonense Y2]
gi|346920380|gb|EGY01508.1| glycosyl transferase group 1:TPR repeat-containing protein
[Azospirillum amazonense Y2]
Length = 619
Score = 120 bits (302), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 67/173 (38%), Positives = 96/173 (55%), Gaps = 6/173 (3%)
Query: 24 QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
Q N A ++ P +R+ GLP D VY F+ YKI W+ +L V NS
Sbjct: 405 QVNDSKALVDQSP------TRESVGLPADKFVYTCFSNTYKITEEVFDAWMRILGGVDNS 458
Query: 84 ILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
+LWLL N+Q A+A G+D R+LF+ A+ +++ R LADV LDT N T
Sbjct: 459 VLWLLARTPWARQNMQERARARGIDPGRLLFAGPASPAQYLARLPLADVFLDTYPYNSGT 518
Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
T+ D L G P+VTL G+T ASR+A S L T+G P+ + + +Y +A++LG
Sbjct: 519 TASDALRMGLPIVTLSGKTFASRMAGSLLRTVGLPQGVTTSVDDYVKLAVKLG 571
>gi|383757550|ref|YP_005436535.1| TPR repeat protein [Rubrivivax gelatinosus IL144]
gi|381378219|dbj|BAL95036.1| TPR repeat protein [Rubrivivax gelatinosus IL144]
Length = 670
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 66/160 (41%), Positives = 92/160 (57%), Gaps = 8/160 (5%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
+SR + GLPE A V+ FNQ YKI W +L+ VP S+LWLL V + IQA
Sbjct: 456 SSRAEAGLPEHAFVFACFNQSYKITEPVFTRWCRILERVPGSVLWLL----VPQVEIQAA 511
Query: 102 AQAL----GLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+A G++ R++F+ EH+ R AD+ LDT HTT D LW G PV+T
Sbjct: 512 LRARAAERGIEPERLIFAPFVTPSEHLARLPQADLFLDTFPYGAHTTCSDALWMGLPVLT 571
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
G + ++RVAAS LA +G PEL + + Y+++A+RL T
Sbjct: 572 QIGRSFSARVAASLLAAVGLPELAVESEEAYEELAVRLAT 611
>gi|381166882|ref|ZP_09876095.1| Predicted O-linked N-acetylglucosamine transferase [Phaeospirillum
molischianum DSM 120]
gi|380683934|emb|CCG40907.1| Predicted O-linked N-acetylglucosamine transferase [Phaeospirillum
molischianum DSM 120]
Length = 658
Score = 120 bits (301), Expect = 3e-25, Method: Composition-based stats.
Identities = 58/138 (42%), Positives = 86/138 (62%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R + GLP+D +V+C+FN K+D +T +W+ VL AVP S+LW L P + ++ A
Sbjct: 452 RSELGLPDDGVVFCSFNGHRKLDRATFSLWMRVLAAVPGSVLWQLAPPPIARTRLEQAAI 511
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A G+ R++++ +EH++R AD+ LD +C HTT+ D L G P++T G L
Sbjct: 512 AAGIAPERLIWAPRLPWDEHLKRLPAADLFLDALVCGAHTTAADALRMGVPLLTCAGPRL 571
Query: 164 ASRVAASQLATLGCPELI 181
ASRVAAS L ++G PELI
Sbjct: 572 ASRVAASLLESIGLPELI 589
>gi|393765846|ref|ZP_10354406.1| tetratricopeptide tpr 2 repeat protein [Methylobacterium sp. GXF4]
gi|392728738|gb|EIZ86043.1| tetratricopeptide tpr 2 repeat protein [Methylobacterium sp. GXF4]
Length = 618
Score = 120 bits (301), Expect = 3e-25, Method: Composition-based stats.
Identities = 61/163 (37%), Positives = 92/163 (56%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
P + +R GLPE V+C+F+ YKI P+ +W+ +L+A+P S+LWLL
Sbjct: 417 PPRSALPARSTEGLPEAGFVFCSFSNSYKIVPTIFAIWMRLLQAMPGSVLWLLADNEPAR 476
Query: 96 ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
N++ A G+D R++F+ + E ++ R LAD+ LDT N TT+ D LW G P+
Sbjct: 477 GNLRREAGRHGVDPQRLVFAQRVSYEAYLARLPLADLFLDTFPFNAGTTASDALWMGLPL 536
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+T G ASR+A S L LG PEL+ + Y+ A++L D
Sbjct: 537 MTCSGRAYASRMAGSLLTALGLPELVTDSLDAYERQALQLARD 579
>gi|119898993|ref|YP_934206.1| hypothetical protein azo2703 [Azoarcus sp. BH72]
gi|119671406|emb|CAL95319.1| conserved hypothetical protein [Azoarcus sp. BH72]
Length = 774
Score = 120 bits (301), Expect = 3e-25, Method: Composition-based stats.
Identities = 61/156 (39%), Positives = 91/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLP+DA+V F+Q YKI ++ +W+ +L+ +P+++LWLL+ A + A
Sbjct: 566 SRAEAGLPDDALVLAAFHQHYKITRASFALWMRLLRGLPDALLWLLEGAPSAMARLSQEA 625
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G+D R+ ++ H+ R +AD+ LD N HTT+ D LW G P V G+T
Sbjct: 626 RAHGVDPARLCWAPRVPVVHHLNRLAVADIALDAFPVNAHTTASDALWAGVPQVARRGDT 685
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
SRV+AS L G PELIA Y+ + + LGTD
Sbjct: 686 FVSRVSASILDAAGLPELIAADDASYEALVLALGTD 721
>gi|456738066|gb|EMF62743.1| TPR domain protein, putative component of TonB system
[Stenotrophomonas maltophilia EPM1]
Length = 570
Score = 120 bits (300), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 87/156 (55%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R GLP +V+C FN YK++P ++ VL+AVP S+LWLL P +A ++ A
Sbjct: 364 TRADCGLPAQGVVFCCFNNSYKLNPRSMGRAFAVLQAVPGSVLWLLSGPGQADARLRTAA 423
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA GLD R++F +++ R LAD+ LDT N HTT+ D LW G PV+T PG+T
Sbjct: 424 QAAGLDPARLVFMAKLPHPQYLARYPLADLFLDTHPYNAHTTASDALWAGCPVLTYPGDT 483
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG + + A LG D
Sbjct: 484 FAARVAGSLNHHLGMARMNVADDAAFITTASALGND 519
>gi|255606103|ref|XP_002538503.1| hypothetical protein RCOM_1858570 [Ricinus communis]
gi|223511823|gb|EEF23880.1| hypothetical protein RCOM_1858570 [Ricinus communis]
Length = 182
Score = 120 bits (300), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 85/134 (63%)
Query: 63 YKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEE 122
+KI P+ +W+ +L+ PNS+LWLL+ + E N++A AQA G+ R++F+ +
Sbjct: 2 FKILPAVFDVWMRLLRETPNSVLWLLQSNPLAERNLKAEAQARGVAAERLIFAPRLPLAQ 61
Query: 123 HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIA 182
H+ R Q AD+ LDT N HTT+ D LWTG P+VT G+T A RVAAS L +G PELI
Sbjct: 62 HLARQQWADLLLDTLPYNAHTTASDALWTGVPMVTCLGDTFAGRVAASLLHAVGLPELIT 121
Query: 183 RTHKEYQDIAIRLG 196
+ ++Y A+RL
Sbjct: 122 HSLEDYAAWAVRLA 135
>gi|190572912|ref|YP_001970757.1| TPR domain-containing protein [Stenotrophomonas maltophilia K279a]
gi|190010834|emb|CAQ44443.1| putative TPR domain protein [Stenotrophomonas maltophilia K279a]
Length = 570
Score = 119 bits (299), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 87/156 (55%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R GLP +V+C FN YK++P ++ VL+AVP S+LWLL P +A ++ A
Sbjct: 364 TRADCGLPAQGVVFCCFNNSYKLNPRSMGRAFAVLQAVPGSVLWLLSGPGQADARLRTAA 423
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA GLD R++F +++ R LAD+ LDT N HTT+ D LW G PV+T PG+T
Sbjct: 424 QAAGLDPARLVFMAKLPHPQYLARYPLADLFLDTHPYNAHTTASDALWAGCPVLTCPGDT 483
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG + + A LG D
Sbjct: 484 FAARVAGSLNHHLGLARMNVADDAAFITTASALGND 519
>gi|23013306|ref|ZP_00053217.1| COG3914: Predicted O-linked N-acetylglucosamine transferase,
SPINDLY family [Magnetospirillum magnetotacticum MS-1]
Length = 251
Score = 119 bits (299), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 57/156 (36%), Positives = 93/156 (59%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+ GLPED +V C+FN K+D ++ +W+ +L +P ++LW L P + + ++ A
Sbjct: 44 SREALGLPEDRLVLCSFNGHRKLDRASFALWLEILADLPQAVLWQLSPPDMAKRRLEDAA 103
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+D R++++ + EH+ R AD+ +D +C HTT+ D L G P+VT+ GE
Sbjct: 104 AKAGIDPARLIWAPSLPRAEHLARLPAADLFVDALVCGAHTTAADSLRMGVPLVTVAGER 163
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
L SRVAAS L +G PEL ++ +D+ + LG +
Sbjct: 164 LGSRVAASILNAVGLPELAVQSAAAMRDLVVALGRN 199
>gi|428180553|gb|EKX49420.1| hypothetical protein GUITHDRAFT_67997 [Guillardia theta CCMP2712]
Length = 256
Score = 119 bits (298), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 56/148 (37%), Positives = 88/148 (59%), Gaps = 1/148 (0%)
Query: 51 EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQH 110
+ + ++ FN+ KI P ++W+ VLKAVP S+LWL +F + N+ A+ LG+
Sbjct: 97 QTSFIFACFNRNLKIKPDVFEVWMKVLKAVPRSVLWLRRFHIRVQENLLRHAERLGVSNA 156
Query: 111 RILFS-NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
I+F+ + + +H+ R +LAD+ LDTP GH+T DV W G P + +PGE +SR+ A
Sbjct: 157 SIIFAGRIPDRSKHLARHRLADLFLDTPRFGGHSTLADVFWAGVPSLVIPGEGASSRIGA 216
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGT 197
S + L IAR +YQ +AI+L +
Sbjct: 217 SMASALNGSVFIARDLHDYQQLAIQLAS 244
>gi|350561806|ref|ZP_08930644.1| Tetratricopeptide TPR_1 repeat-containing protein [Thioalkalivibrio
thiocyanoxidans ARh 4]
gi|349780838|gb|EGZ35156.1| Tetratricopeptide TPR_1 repeat-containing protein [Thioalkalivibrio
thiocyanoxidans ARh 4]
Length = 765
Score = 119 bits (298), Expect = 7e-25, Method: Composition-based stats.
Identities = 65/160 (40%), Positives = 92/160 (57%), Gaps = 2/160 (1%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK-FPAVGEANIQA 100
T+R + GL E V C F+ YK+ P+ W +L+ P+++LWLL+ PAV AN++
Sbjct: 558 TTRAEQGLSEAGPVLCGFHNPYKVTPTVFDAWCRLLRDSPDAVLWLLEPVPAVA-ANLRR 616
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A+ G+D R++F+ EH+ R LAD+ LDT N HTT+ D LW G PV+T G
Sbjct: 617 EAEQRGVDPGRLVFAPRLRNPEHLARLALADLFLDTLPVNAHTTASDALWVGVPVLTCTG 676
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
ET A RV AS LA G EL+ + + Y+ A L + D
Sbjct: 677 ETFAGRVGASLLAAAGLSELVTDSLETYEAKAQELLENPD 716
>gi|222149271|ref|YP_002550228.1| hypothetical protein Avi_3091 [Agrobacterium vitis S4]
gi|221736255|gb|ACM37218.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length = 680
Score = 119 bits (297), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 62/157 (39%), Positives = 90/157 (57%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R + GLPEDA V+ FN + KI S W+ +L+A P S+LWLL ++ TA
Sbjct: 419 TRAEVGLPEDAFVFACFNGMQKITQSGFTRWMTILQATPGSVLWLLSGNEEVNQRLRDTA 478
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+D R+LF+ A +H+ R +AD+ LDT H+T+ D + G PV+T PG++
Sbjct: 479 TRCGVDPARLLFAAKAGNAQHLARIAVADLFLDTFPYGAHSTAADAINMGLPVLTFPGKS 538
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASR AS +A+ G PELI ++ +Y AI DR
Sbjct: 539 FASRFCASVVASAGAPELICQSPDDYVRRAIAFAQDR 575
>gi|428184992|gb|EKX53846.1| hypothetical protein GUITHDRAFT_160947 [Guillardia theta CCMP2712]
Length = 1178
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 59/136 (43%), Positives = 86/136 (63%), Gaps = 3/136 (2%)
Query: 65 IDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHV 124
+ P L+ W +L VPNS++WLL+ P V E + + G + RI+F+ AAKEEHV
Sbjct: 985 LRPRILKRWKRILDRVPNSVIWLLRAPQVAEEKVMQWWREAGDYEERIIFTGTAAKEEHV 1044
Query: 125 RRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL--ASRVAASQLATLGCPELIA 182
+R LAD+ LDT + H+T D LW G P+++ P L ASRV+AS L+T+G E+I
Sbjct: 1045 KRAGLADIFLDTDIYGAHSTGTDALWAGLPLLS-PTHLLGMASRVSASLLSTIGLEEMIG 1103
Query: 183 RTHKEYQDIAIRLGTD 198
RT EY+++A+ LG +
Sbjct: 1104 RTLDEYENLAVALGRN 1119
>gi|392384038|ref|YP_005033234.1| putative TPR repeat protein [Azospirillum brasilense Sp245]
gi|356880753|emb|CCD01717.1| putative TPR repeat protein [Azospirillum brasilense Sp245]
Length = 640
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 91/169 (53%), Gaps = 4/169 (2%)
Query: 32 GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
G VP + R GLP D V FN YKI P +W+ +L +P+++LWL +
Sbjct: 422 GHGVPPA---PPRSACGLPADGFVLACFNNAYKITPEIWAVWMALLHKIPDAVLWLARTT 478
Query: 92 AVGEANIQATAQALGLDQHRILFSNVAAK-EEHVRRGQLADVCLDTPLCNGHTTSMDVLW 150
A E N+ AQ+ G+ RI+F++ A +H+ R Q AD+ LDT HTT+ D LW
Sbjct: 479 AEAEQNLLRAAQSAGIITDRIVFASWAPTLGDHLSRLQNADLMLDTLHYGAHTTASDALW 538
Query: 151 TGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G P+VT PG TL SRVAAS L T G P+ + + YQ + +R
Sbjct: 539 VGLPLVTCPGHTLQSRVAASLLHTAGLPDFVTESLAHYQSAVLHWANNR 587
>gi|158334416|ref|YP_001515588.1| hypothetical protein AM1_1237 [Acaryochloris marina MBIC11017]
gi|158304657|gb|ABW26274.1| TPR domain protein [Acaryochloris marina MBIC11017]
Length = 1865
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 94/155 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R ++G+PE V+C FN YKI P +W+ +L+ VP+S+LWL++ N++ A
Sbjct: 1652 TRAEFGVPESGFVFCCFNSHYKITPDLFDVWMRILQQVPDSVLWLIEGAETVMGNLRTEA 1711
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G++ R++F+ A E+++ R LAD+ LDT + N +T++ W+G P++T PG T
Sbjct: 1712 KQRGVNPDRLVFAPKIAHEDYLARYGLADLYLDTFVYNAGSTAVATCWSGLPMLTCPGPT 1771
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
ASR+ AS A G LI ++ +Y+ AI L T
Sbjct: 1772 NASRMGASICAAAGLESLICKSVADYEQRAIHLAT 1806
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 82/164 (50%), Gaps = 9/164 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA------- 96
R+ + +DA+V+ + +K+ P W +L AVPNS+L L F A
Sbjct: 763 RRDLNIADDAVVFVSGANFFKLIPELRITWTQILAAVPNSVLLLYPFSAWSHHYPVSAFR 822
Query: 97 -NIQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLCNGHTTSMDVLWTGTP 154
++ A G+D R++ + E V++ Q+ADV LD G T+ +D L G P
Sbjct: 823 NHMAALFAGDGIDPSRLIIQDPIPNREQVKQLLQIADVYLDAYPFTGVTSLLDPLEVGLP 882
Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
VVT G +L SR + L ++ +LIA + Y ++AI+LG D
Sbjct: 883 VVTQAGHSLRSRQGPALLESIDLTDLIATDEQGYVELAIQLGQD 926
>gi|72383070|ref|YP_292425.1| TPR repeat-containing protein [Prochlorococcus marinus str. NATL2A]
gi|72002920|gb|AAZ58722.1| TPR repeat [Prochlorococcus marinus str. NATL2A]
Length = 750
Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats.
Identities = 57/158 (36%), Positives = 89/158 (56%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+ + LP+ ++ FN YKI +W+N+L+ + S+LWL K N+ A
Sbjct: 555 SRKDFNLPDQGFIFTCFNNNYKITEKEFNIWMNLLRKIEGSVLWLYKANQWSMNNLYKEA 614
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+D+ RI+FS +H+ R L D+ LDT NG TS + LWTG PV+T G+
Sbjct: 615 SKRKVDRDRIIFSEKLPMSKHLARHSLGDLALDTFNYNGGVTSSNALWTGLPVLTKIGQN 674
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+R++AS L +LG PELI + EY+D A+ + ++ +
Sbjct: 675 FTARMSASLLTSLGLPELITYSESEYEDKALYIASNSE 712
>gi|253745103|gb|EET01207.1| O-linked GlcNAc transferase [Giardia intestinalis ATCC 50581]
Length = 1480
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/157 (39%), Positives = 89/157 (56%), Gaps = 2/157 (1%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R YG+P + ++C FNQ+YK D TL + +L++VPN+ LLKFP + +I+A +
Sbjct: 1252 RALYGIPSNCFLFCTFNQVYKFDMGTLGIIAALLRSVPNAYYALLKFPPASQQHIEAFFR 1311
Query: 104 ALGLD-QHRILFSNV-AAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
D RI+F N+ K EH+RR DV +DT CNG T +D LW+G PVV GE
Sbjct: 1312 HKAPDILDRIIFLNMLPMKVEHIRRYLAVDVFVDTLKCNGSTIVLDALWSGVPVVGFVGE 1371
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ SR S L+ L C +LI + ++ + RL D
Sbjct: 1372 YIISRKTVSFLSVLECTDLICASQEDAVALCTRLAID 1408
>gi|47848089|dbj|BAD21873.1| putative O-linked N-acetyl glucosamine transferase [Oryza sativa
Japonica Group]
gi|47848095|dbj|BAD21878.1| putative O-linked N-acetyl glucosamine transferase [Oryza sativa
Japonica Group]
Length = 953
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 51/94 (54%), Positives = 65/94 (69%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
R YGLPED ++ FNQLYK+DP W N+LK VPNS LWLL+FPA GE ++A A
Sbjct: 783 KRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNILKRVPNSALWLLRFPAAGETRVRAHA 842
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDT 136
A G+ +I+F++VA K EH+RR LAD+ LDT
Sbjct: 843 AARGVRPDQIIFTDVAMKNEHIRRSSLADLFLDT 876
>gi|78188708|ref|YP_379046.1| TPR repeat-containing protein [Chlorobium chlorochromatii CaD3]
gi|78170907|gb|ABB28003.1| TPR repeat [Chlorobium chlorochromatii CaD3]
Length = 706
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 56/156 (35%), Positives = 94/156 (60%), Gaps = 1/156 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE-ANIQAT 101
+R ++GLPE +VYC FN +YK++P +W+ +L VPNS+LWL+ G ++A
Sbjct: 500 TRAEFGLPEKGMVYCCFNGVYKLEPHVFSLWMEILSKVPNSVLWLIDGEESGSNERLRAV 559
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
AQ G+ R++F+ + EE++ +LAD+ LDT N T++ G P++T GE
Sbjct: 560 AQEAGIAPERLVFAKKRSHEEYLALYRLADLFLDTLSYNAGATAVGAFSAGLPLLTCQGE 619
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
A+R+ ++ +G PEL+A T +Y + A++LG+
Sbjct: 620 HYATRMGSALCYAVGLPELVAPTPADYVEFAVQLGS 655
>gi|188575464|ref|YP_001912393.1| hypothetical protein PXO_04549 [Xanthomonas oryzae pv. oryzae
PXO99A]
gi|188519916|gb|ACD57861.1| tetratricopeptide repeat domain protein [Xanthomonas oryzae pv.
oryzae PXO99A]
Length = 568
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 87/156 (55%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPE V C FN YK++P ++ + VL+AVP +LWLL P A ++A A
Sbjct: 367 SRAECGLPELGAVLCCFNNSYKLNPRSMARMLAVLRAVPGCVLWLLSGPGEANARLRAFA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+D R++F ++ R + AD+ LDT N HTT+ D LW G PV+T PGET
Sbjct: 427 QTQGVDPQRLVFMPKLPHPRYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGET 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + AI L +D
Sbjct: 487 FAARVAGSLNQHLGLDEMNVVDDAAFVAKAIALASD 522
>gi|408391244|gb|EKJ70624.1| hypothetical protein FPSE_09134 [Fusarium pseudograminearum CS3096]
Length = 1564
Score = 117 bits (293), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 94/160 (58%), Gaps = 28/160 (17%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D I+ NFNQLYK VP +++WLL+FP +GEAN++ TA+ +
Sbjct: 1353 LADDTIIMGNFNQLYK---------------VPKAVIWLLRFPELGEANLRRTAKEWAGE 1397
Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
+ R+LF++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP +
Sbjct: 1398 EVASRLLFTDVAPKSQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1457
Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
SR+AAS L ELIA KEY+ A++L
Sbjct: 1458 SRMAASILKGALPKSDEGRQAAAELIAGDEKEYEQRAVKL 1497
>gi|72383071|ref|YP_292426.1| TPR repeat-containing protein [Prochlorococcus marinus str. NATL2A]
gi|72002921|gb|AAZ58723.1| TPR repeat [Prochlorococcus marinus str. NATL2A]
Length = 909
Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats.
Identities = 55/158 (34%), Positives = 90/158 (56%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+ + LP+ ++ FN YKI +W+N+L+ V S+LWL K + N+ A
Sbjct: 714 SRKDFNLPDQGFIFTCFNNNYKITKKEFNIWMNLLRKVEGSVLWLYKSNQLSMNNLYKEA 773
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+D+ RI+F+ +H+ R L D+ LDT CNG T+ D L G P++T G++
Sbjct: 774 SKRKIDRDRIIFAEKLPMSKHLARHSLGDLALDTFNCNGGKTTCDALLAGLPLLTKIGQS 833
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+R++AS L +LG PELI + EY+D A+ + ++ +
Sbjct: 834 FTARMSASLLTSLGLPELITYSESEYEDKALYIASNSE 871
>gi|325914251|ref|ZP_08176601.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Xanthomonas vesicatoria ATCC 35937]
gi|325539506|gb|EGD11152.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Xanthomonas vesicatoria ATCC 35937]
Length = 568
Score = 117 bits (292), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 60/138 (43%), Positives = 83/138 (60%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLP V C FN YK++P ++ + VL+AVP S+LWLL P +A ++A A
Sbjct: 367 SRAECGLPTQGAVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGQVDARLRAAA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G+D R++F +++ R + AD+ LDT N HTT+ D LW G PV+T PGE
Sbjct: 427 QAHGVDGQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGEP 486
Query: 163 LASRVAASQLATLGCPEL 180
A+RVA S LG E+
Sbjct: 487 FAARVAGSLNQHLGLEEM 504
>gi|384263553|ref|YP_005418742.1| Predicted O-linked N-acetylglucosamine transferase [Rhodospirillum
photometricum DSM 122]
gi|378404656|emb|CCG09772.1| Predicted O-linked N-acetylglucosamine transferase [Rhodospirillum
photometricum DSM 122]
Length = 652
Score = 116 bits (291), Expect = 4e-24, Method: Composition-based stats.
Identities = 60/154 (38%), Positives = 92/154 (59%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R GLPED +YC F+ YKI + W+++L+ V NS+LW+L +A+++ A
Sbjct: 456 TRADVGLPEDRFIYCCFSNTYKITEEIFEAWMSILRRVENSVLWVLARNPWAQASMRRHA 515
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G+D R++F+ +++ R +LAD+ LDT N TT+ D L G P+VTL G T
Sbjct: 516 EARGVDPARLIFAAPTGPAQYLSRLRLADLFLDTYPYNSGTTASDALRMGLPMVTLAGRT 575
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
+SR+A S L +G +A T +Y +IA RLG
Sbjct: 576 FSSRMAGSLLHQVGLGACVAETLDDYINIAARLG 609
>gi|289667595|ref|ZP_06488670.1| tetratricopeptide repeat domain protein [Xanthomonas campestris pv.
musacearum NCPPB 4381]
Length = 568
Score = 116 bits (291), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 90/156 (57%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPE +V C FN YK++ ++ + VL+AVP+S+LWLL P +A ++A A
Sbjct: 367 SRAKCGLPEQGVVLCCFNNSYKLNARSMLRMLAVLRAVPDSVLWLLSGPGQADARLRAFA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G+ R++F +++ R + A + LDT N HTT+ D LW G PV+T PGET
Sbjct: 427 QAQGVAPQRLIFMPKLPHPQYLARYRHAHLFLDTHPYNAHTTASDALWAGCPVLTTPGET 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + AI L +D
Sbjct: 487 FAARVAGSLNQHLGLDEMNVGDDAAFVAKAIALASD 522
>gi|384418009|ref|YP_005627369.1| hypothetical protein XOC_0996 [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353460923|gb|AEQ95202.1| tetratricopeptide repeat domain protein [Xanthomonas oryzae pv.
oryzicola BLS256]
Length = 568
Score = 116 bits (291), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 88/156 (56%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPE V C FN YK++P ++ + VL+AVP +LWLL P +A ++A A
Sbjct: 367 SRAECGLPELGAVLCCFNNSYKLNPRSMARMLAVLRAVPGCVLWLLSGPGEADARLRAFA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+D R++F ++ R + A++ LDT N HTT+ D LW G PV+T PGET
Sbjct: 427 QTQGVDPQRLVFMPKLPHPRYLARYRHANLFLDTHPYNAHTTASDALWAGCPVLTTPGET 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + AI L +D
Sbjct: 487 FAARVAGSLNQHLGLDEMNVVDDAAFVAKAIALASD 522
>gi|289663474|ref|ZP_06485055.1| tetratricopeptide repeat domain protein [Xanthomonas campestris pv.
vasculorum NCPPB 702]
Length = 568
Score = 116 bits (291), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 90/156 (57%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPE +V C FN YK++ ++ + VL+AVP+S+LWLL P +A ++A A
Sbjct: 367 SRAKCGLPEQGVVLCCFNNSYKLNARSMLRMLAVLRAVPDSVLWLLSGPGQADARLRAFA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G+ R++F +++ R + A + LDT N HTT+ D LW G PV+T PGET
Sbjct: 427 QAQGVAPQRLIFMPKLPHPQYLARYRHAHLFLDTHPYNAHTTASDALWAGCPVLTTPGET 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + AI L +D
Sbjct: 487 FAARVAGSLNQHLGLDEMNVGDDAAFVAKAIALASD 522
>gi|222147824|ref|YP_002548781.1| hypothetical protein Avi_1088 [Agrobacterium vitis S4]
gi|221734812|gb|ACM35775.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length = 566
Score = 116 bits (291), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 61/154 (39%), Positives = 90/154 (58%), Gaps = 6/154 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R GLP+D V +FN + K+ P T++ W+ ++ +P+SILW+L + + N++
Sbjct: 354 RAALGLPQDRFVLASFNSIKKLSPQTVEAWLKIMADLPDSILWILCRSDIAKDNLRTLFA 413
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDT-PLCNGHTTSMDVLWTGTPVVTLPGET 162
GL R++F+ A H+ R AD+ LD+ P C GHTT+ D LW G PV+ L G+
Sbjct: 414 RHGLSAERLIFTPPLAYPYHLARLSAADLVLDSFPYC-GHTTTSDCLWAGVPVLALKGQN 472
Query: 163 LASRVAASQLATLGCPELIARTHKEY----QDIA 192
ASRV+ S L LG PEL+A T +Y QD+A
Sbjct: 473 FASRVSESLLTALGVPELVAATVDDYIAQAQDLA 506
>gi|159114158|ref|XP_001707304.1| O-linked GlcNAc transferase [Giardia lamblia ATCC 50803]
gi|384950704|sp|A8BFN4.1|OGT1_GIAIC RecName: Full=UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase; Short=GlOGT; AltName:
Full=O-linked GlcNAc transferase; AltName: Full=O-linked
N-acetylglucosaminyltransferase
gi|157435408|gb|EDO79630.1| O-linked GlcNAc transferase [Giardia lamblia ATCC 50803]
Length = 1480
Score = 116 bits (290), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 62/157 (39%), Positives = 88/157 (56%), Gaps = 2/157 (1%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R YG+P + ++C FNQ+YK D TL + +L++VPN+ LLKFP + +I+A +
Sbjct: 1252 RALYGIPANCFLFCTFNQVYKFDMGTLGIIAALLRSVPNAYYALLKFPPASQLHIEAFFR 1311
Query: 104 ALGLD-QHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
D R++F S + K EH+RR DV +DT CNG T +D LW+G PVV GE
Sbjct: 1312 HKAPDILDRVIFLSMLPMKVEHIRRYLAVDVFVDTLKCNGSTIVLDALWSGVPVVGFVGE 1371
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ SR S L+ L C +LI + E + RL D
Sbjct: 1372 YILSRKTLSFLSVLECKDLICASQGEAVLLCTRLAVD 1408
>gi|428179062|gb|EKX47935.1| hypothetical protein GUITHDRAFT_54742, partial [Guillardia theta
CCMP2712]
Length = 377
Score = 116 bits (290), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 102/177 (57%), Gaps = 2/177 (1%)
Query: 24 QTNTKTATGEEVPQSIVITSRQQYGLPED-AIVYCNFNQLYKIDPSTLQMWVNVLKAVPN 82
Q N+ ++V +S + R+ + LP+ A+V CNF +L +I W+ +++ V
Sbjct: 156 QVNSFNDLYQDVLRSETLPKRRDHFLPDKPAVVLCNFCRLGRISQELFDAWLRIMRRVEG 215
Query: 83 SILWLLKFPAVGEANIQATAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNG 141
SILWL P ++ A+ RI+F+ + K +H++R LAD+ LDT + NG
Sbjct: 216 SILWLYSHPRAAANRLRDYARRQDFPTDRIIFAPPCSPKLQHMKRLTLADLYLDTLVYNG 275
Query: 142 HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
HTT+ DVLW G P+VT+ G S VA S ++G PE++A + +EY+++A+ L ++
Sbjct: 276 HTTASDVLWAGVPIVTVKGSNWPSLVATSLARSVGLPEMVATSLEEYEEMAVDLASN 332
>gi|222148768|ref|YP_002549725.1| hypothetical protein Avi_2410 [Agrobacterium vitis S4]
gi|221735754|gb|ACM36717.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length = 624
Score = 116 bits (290), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 58/151 (38%), Positives = 86/151 (56%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
+SR GLP D +V FN K P TL++W +LKA P ++LW++ + A
Sbjct: 402 SSRMALGLPADRVVIGAFNSQSKNTPETLRLWARILKANPTALLWMMVDGHSARQSTAAF 461
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
++LG+ Q ++LF+ E H+ R Q AD +D+ NGHTT+ D+LW G PV+T G
Sbjct: 462 FKSLGVKQSQLLFAPKMLYENHIARAQAADFAIDSFPYNGHTTTSDMLWAGLPVITKRGT 521
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIA 192
ASRV+ S L +G EL+AR + ++A
Sbjct: 522 NFASRVSESLLKAIGLDELVARDEDNFVELA 552
>gi|58583232|ref|YP_202248.1| hypothetical protein XOO3609 [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84625068|ref|YP_452440.1| hypothetical protein XOO_3411 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|58427826|gb|AAW76863.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84369008|dbj|BAE70166.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 568
Score = 115 bits (289), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 87/156 (55%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPE V C FN YK++P ++ + VL+AVP +LWLL P A ++A A
Sbjct: 367 SRAECGLPELGAVLCCFNNSYKLNPRSMARMLAVLRAVPGCVLWLLSGPGEANARLRAFA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G++ R++F ++ R + AD+ LDT N HTT+ D LW G PV+T PGET
Sbjct: 427 QTQGVEPQRLVFMPKLPHPRYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGET 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + AI L +D
Sbjct: 487 FAARVAGSLNQHLGLDEMNVVDDAAFVAKAIALASD 522
>gi|326386010|ref|ZP_08207634.1| TPR repeat-containing protein [Novosphingobium nitrogenifigens DSM
19370]
gi|326209235|gb|EGD60028.1| TPR repeat-containing protein [Novosphingobium nitrogenifigens DSM
19370]
Length = 882
Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats.
Identities = 61/156 (39%), Positives = 85/156 (54%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR +GLPE A V+C FN+ YKI W+ +L AV NS+LWL+ E ++ A
Sbjct: 671 SRADWGLPEGAFVFCCFNKGYKIARDDFAGWMRILAAVENSVLWLISLSPETEQALRKAA 730
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+D R++F+ A + H+ R AD+ LDT N HTT+ D LW G PVVT G
Sbjct: 731 ADAGVDPARLIFAPRAPHDVHLARHAHADLGLDTLRYNAHTTASDALWAGLPVVTRAGRC 790
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RV AS +G +L+ ++ +AI L D
Sbjct: 791 FAARVGASLNHAVGLSDLVTHDAAGFESLAIALARD 826
>gi|40063060|gb|AAR37916.1| TPR domain protein [uncultured marine bacterium 560]
Length = 764
Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats.
Identities = 68/178 (38%), Positives = 93/178 (52%), Gaps = 5/178 (2%)
Query: 20 LATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKA 79
L + Q N T + P I T RQ GLPE V+C FN +KI P+T W +L+
Sbjct: 535 LPSYQVNDSTQS----PPEIFFT-RQDLGLPETGFVFCCFNNTFKITPTTFDSWGRILEH 589
Query: 80 VPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLC 139
V SIL + + N+ G+D R++F E++ R ++AD+ LDT
Sbjct: 590 VEGSILLIYVDNESAKLNLTKEIVLRGIDPSRLIFGEKLPIPEYLARYRVADLFLDTHPY 649
Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
N TTS D L G PV+TL G + SR+AAS + + PELI T KEY+ +AI L T
Sbjct: 650 NAGTTSSDALRMGLPVLTLKGNSFNSRMAASVINAVNLPELITSTQKEYESLAIELAT 707
>gi|380510752|ref|ZP_09854159.1| hypothetical protein XsacN4_06033 [Xanthomonas sacchari NCPPB 4393]
Length = 570
Score = 115 bits (287), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 89/156 (57%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+ GLPE V+C FN YK+ P ++ + VL+ VP+S+LWLL P + ++A A
Sbjct: 366 TRRDCGLPERGTVFCCFNNSYKLGPRSMARMLEVLRQVPDSVLWLLSGPGQADDRLRAAA 425
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+D R+ F A +++ R + AD+ LDT N HTT+ D LW G PV+T PG T
Sbjct: 426 ATAGVDPTRLRFMPKLAHPDYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTCPGST 485
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ A + A+RLG D
Sbjct: 486 FAARVAGSLNHHLGLDEMNAADDAAFVATAVRLGRD 521
>gi|294666024|ref|ZP_06731286.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292604216|gb|EFF47605.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 568
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 93/156 (59%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR Q GLP+ A+V C FN YK++P ++ + VL+AVP S+LWLL P +A ++A A
Sbjct: 367 SRAQCGLPDHAVVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGQADARMRAAA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G+D R++F +++ R + AD+ LDT N HTT+ D LW G PV+T PG+T
Sbjct: 427 QAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWVGCPVLTTPGQT 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDAAFVATAVALASD 522
>gi|154244408|ref|YP_001415366.1| hypothetical protein Xaut_0451 [Xanthobacter autotrophicus Py2]
gi|154158493|gb|ABS65709.1| Tetratricopeptide TPR_2 repeat protein [Xanthobacter autotrophicus
Py2]
Length = 574
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 56/139 (40%), Positives = 83/139 (59%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R GLP+DA V+C FN +K+DP + +LKAVP S+LW+L+ N++ A
Sbjct: 361 TRADCGLPDDAFVFCAFNSAWKLDPEVFSAYTRILKAVPGSVLWVLESRENSADNLRREA 420
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A GLD R++F+ + ++H R LAD+ LDT HTT+ D+L G P+VT G +
Sbjct: 421 EARGLDPARLVFAPMVPLKDHFARLPLADLFLDTFPYTAHTTASDMLRMGLPMVTRTGRS 480
Query: 163 LASRVAASQLATLGCPELI 181
ASRVAAS + + + I
Sbjct: 481 FASRVAASIMTQMDLADFI 499
>gi|157413635|ref|YP_001484501.1| O-linked N-acetylglucosamine transferase [Prochlorococcus marinus
str. MIT 9215]
gi|157388210|gb|ABV50915.1| Predicted O-linked N-acetylglucosamine transferase, SPINDLY family
[Prochlorococcus marinus str. MIT 9215]
Length = 632
Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats.
Identities = 57/156 (36%), Positives = 89/156 (57%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
+ +GL E V+ FN KI +W+ +LK V NSILWL K N++ A+
Sbjct: 433 KSDFGLSEHNFVFACFNANNKITLVEFNIWMRLLKKVKNSILWLYKSNNYSMMNLKKEAE 492
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+ RI+F++ E+H+ R + AD+ LDT N HTT+ D LW PVVT G++
Sbjct: 493 KQGVQSKRIIFADKMLNEDHLSRIKFADLFLDTFHYNAHTTASDALWAEVPVVTKQGQSF 552
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
++RV +S L L ELI + + EY+++A ++ +D+
Sbjct: 553 SARVCSSLLTALNLEELITKENFEYENLAYKIASDK 588
>gi|452965780|gb|EME70798.1| O-linked N-acetylglucosamine transferase [Magnetospirillum sp.
SO-1]
Length = 658
Score = 114 bits (285), Expect = 2e-23, Method: Composition-based stats.
Identities = 56/156 (35%), Positives = 89/156 (57%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
+ SR GLPED +V C+FN K+D ++ +W+ +L +P+++LW L P ++
Sbjct: 449 LPSRAALGLPEDRLVLCSFNGHRKLDRASFTLWLEILAELPDAVLWQLAPPDTARKRLEE 508
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A+ G+D R++++ + EH+ R AD+ +D +C HTT+ D L G P+VT G
Sbjct: 509 AARNAGIDPARLIWAPALPRPEHLARLPAADLFVDALVCGAHTTAADSLRMGVPLVTAAG 568
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
L SRVAAS L +G PEL +++A+ LG
Sbjct: 569 PRLGSRVAASLLHAVGLPELAVTDAAALKELAVGLG 604
>gi|308161318|gb|EFO63770.1| O-linked GlcNAc transferase [Giardia lamblia P15]
Length = 1480
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 61/157 (38%), Positives = 88/157 (56%), Gaps = 2/157 (1%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R YG+P + ++C FNQ+YK D TL + +L++VPN+ LLKFP + +I+A +
Sbjct: 1252 RALYGIPANCFLFCTFNQVYKFDMGTLGIIAALLRSVPNAYYALLKFPPASQLHIEAFFR 1311
Query: 104 ALGLD-QHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
D R++F S + K EH+RR DV +DT CNG T +D LW+G PVV GE
Sbjct: 1312 HKAPDILDRVIFLSMLPMKVEHIRRYLAVDVFVDTLKCNGSTIVLDALWSGVPVVGFVGE 1371
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ SR S L+ L C +LI + + + RL D
Sbjct: 1372 YILSRKTLSFLSVLECNDLICASQGDAVLLCTRLAVD 1408
>gi|347739128|ref|ZP_08870464.1| TPR repeat-containing protein [Azospirillum amazonense Y2]
gi|346917662|gb|EGX99948.1| TPR repeat-containing protein [Azospirillum amazonense Y2]
Length = 355
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 61/160 (38%), Positives = 86/160 (53%), Gaps = 4/160 (2%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
T+R +GLP D +V+ FN YK+ P + W +L VP S+LWLL P +QA
Sbjct: 143 TTRADWGLPADGVVFACFNATYKLTPEVVACWARLLAKVPGSVLWLLD-PGPALPTLQAM 201
Query: 102 AQALGLDQHRILFSNVAAKE---EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
G+D R++ + K +H+ R AD+ LDT N HTT+ D L+ G PV+T
Sbjct: 202 LSTEGVDPSRLVVAPALPKARMADHLARHACADLFLDTHPVNAHTTAADALYAGLPVLTR 261
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GE SRVAAS L LG P+L+ + + Y+ A+ D
Sbjct: 262 AGEPFISRVAASLLTALGLPDLVTYSLEAYEATALDFARD 301
>gi|348679898|gb|EGZ19714.1| hypothetical protein PHYSODRAFT_558418 [Phytophthora sojae]
Length = 749
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 58/168 (34%), Positives = 98/168 (58%), Gaps = 7/168 (4%)
Query: 34 EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV 93
E P +T R+ E+A V+ F+ K+DPS W+ +L VP+S++W +++
Sbjct: 316 ERPTLFALTGRR-----ENAFVFATFSNWQKMDPSVFSAWMEILSRVPSSVMWFIEYFGR 370
Query: 94 --GEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
AN++A A+A G++ R++FS + +H R + AD+ LDT L NGHTT +D L
Sbjct: 371 EGAIANLRAEAKAHGINGDRLIFSPMDPWIDHTYRKRAADLILDTSLKNGHTTILDALCA 430
Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G PV++L G+ +++R +S L +L +L + KEY ++A+ L T +
Sbjct: 431 GVPVISLEGDRMSNRATSSALNSLDLHDLTVNSLKEYVEVAVYLATHK 478
>gi|428179506|gb|EKX48377.1| hypothetical protein GUITHDRAFT_162478 [Guillardia theta CCMP2712]
Length = 494
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 55/143 (38%), Positives = 82/143 (57%), Gaps = 1/143 (0%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
I+ +FNQLYKI+P +W+ V++ VP S LWLLKF + A A G+ + R++
Sbjct: 293 IILASFNQLYKIEPKIFNVWMEVMRQVPKSKLWLLKFSKAASKYLLKEAAAAGVKEDRLV 352
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F + K+ + AD+ LDTPL N HTT DVLW G P+++LPGE A RVA +
Sbjct: 353 FHSKFPKDIELLAKSHADLFLDTPLFNAHTTGGDVLWAGIPMLSLPGENFAQRVAGGLIR 412
Query: 174 TLGCPEL-IARTHKEYQDIAIRL 195
++ + + R +Y +A +L
Sbjct: 413 SVRMSNVSLVRNLDDYAHVATKL 435
>gi|23016729|ref|ZP_00056482.1| COG3914: Predicted O-linked N-acetylglucosamine transferase,
SPINDLY family [Magnetospirillum magnetotacticum MS-1]
Length = 686
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 60/157 (38%), Positives = 85/157 (54%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+ GLP +A V+C FN L KI ++++L VPN +LWLL ++ A
Sbjct: 410 TREAAGLPANATVFCGFNGLQKITAPMWARFMDILSRVPNGVLWLLDGGERINERLRQEA 469
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+ RI+F+ EH+ R LAD+ LDT C HTT+ D LW G PV+T+ G
Sbjct: 470 IKHGVTPDRIIFAPKLINAEHLSRYPLADLFLDTSPCGAHTTASDSLWMGVPVLTVAGRG 529
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASRV S G +L+ T +EY + A+ LG D+
Sbjct: 530 FASRVCGSLAVAAGLSDLVCTTFEEYVEKAVELGNDK 566
>gi|124023730|ref|YP_001018037.1| hypothetical protein P9303_20301 [Prochlorococcus marinus str. MIT
9303]
gi|123964016|gb|ABM78772.1| Hypothetical protein P9303_20301 [Prochlorococcus marinus str. MIT
9303]
Length = 395
Score = 113 bits (283), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 63/155 (40%), Positives = 93/155 (60%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R +GLPE + V+ F YKI PS +W+ +LK V +S+LWL + E N++ A+
Sbjct: 199 RSDFGLPESSFVFVCFCANYKITPSVFNVWMRLLKQVDDSVLWLYRSNKWAEINLRRQAE 258
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
+ +D R++F+ +H+ R LAD+ LDT N HTT+ D L G P++TL G++
Sbjct: 259 SRDIDPERLIFAGRLPLNKHLARHSLADLFLDTFNVNAHTTASDALLAGLPLLTLAGKSF 318
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
SRVAAS L TL PELI T K+Y++ A+ + D
Sbjct: 319 TSRVAASLLVTLNLPELITYTIKDYEEKALMIALD 353
>gi|325929456|ref|ZP_08190584.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Xanthomonas perforans 91-118]
gi|325540231|gb|EGD11845.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Xanthomonas perforans 91-118]
Length = 568
Score = 113 bits (282), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 92/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR Q GLPE A+V C FN YK++P ++ + VL+AVP S+LWLL P +A ++A A
Sbjct: 367 SRAQCGLPEHAVVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGEADARLRAAA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G+ R++F +++ R + AD+ LDT N HTT+ D LW G PV+T PG+T
Sbjct: 427 QAQGVHAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGQT 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG EL + A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDELNVADDAAFVATAVALASD 522
>gi|428171863|gb|EKX40776.1| hypothetical protein GUITHDRAFT_75246 [Guillardia theta CCMP2712]
Length = 717
Score = 113 bits (282), Expect = 5e-23, Method: Composition-based stats.
Identities = 58/173 (33%), Positives = 94/173 (54%), Gaps = 3/173 (1%)
Query: 28 KTATGEEVPQSIVITSRQQYGLPEDA--IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
K + E++ + LP + +V NFN L K+ P T +W+ +++ +P+S+L
Sbjct: 423 KQSQAEQILGDFTAANASNLSLPHRSQQLVLANFNHLQKLGPHTFHLWLRIMQELPDSVL 482
Query: 86 WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
WLL+FP E N++ + + R+ S+ + H+ + A + LDT N H +
Sbjct: 483 WLLRFPPEAEPNLRREFEEREIGT-RLHLSDKFPSKIHLNVKRAASLLLDTLEYNAHVSG 541
Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+D LW G P+VTL G +A R AS L T+ P+L+ART +EY IA +LG +
Sbjct: 542 LDALWAGLPLVTLAGSNMARRCGASFLRTMKVPQLLARTSEEYVSIATQLGEN 594
>gi|345869929|ref|ZP_08821884.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiorhodococcus
drewsii AZ1]
gi|343922316|gb|EGV33018.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiorhodococcus
drewsii AZ1]
Length = 693
Score = 113 bits (282), Expect = 5e-23, Method: Composition-based stats.
Identities = 59/156 (37%), Positives = 91/156 (58%), Gaps = 1/156 (0%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE-ANIQATA 102
R GLPE A+VYC N +YKI+ +W+ +L+ VP S+LWLL G A ++ A
Sbjct: 484 RAACGLPESAMVYCCLNGIYKIESGVFALWMRILERVPESVLWLLDGGESGSNARLREAA 543
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+D R++F+ A E++ +LAD+ LDT + N T++ L G PV+T PG
Sbjct: 544 RTAGIDPDRLIFAPKRAHAEYLAHYRLADLFLDTLVYNAGATAVGALAAGLPVLTCPGGH 603
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+R+ AS +G PEL+ + ++Y + A+ LG D
Sbjct: 604 YAARMGASLSHAVGLPELVCASPEDYVEQAVALGRD 639
>gi|428184442|gb|EKX53297.1| hypothetical protein GUITHDRAFT_101001 [Guillardia theta CCMP2712]
Length = 738
Score = 112 bits (281), Expect = 6e-23, Method: Composition-based stats.
Identities = 58/167 (34%), Positives = 93/167 (55%), Gaps = 19/167 (11%)
Query: 48 GLPEDAIVYCNFNQLYKIDPSTLQMWVNVL----------------KAVPNSILWLLKFP 91
G +D ++ C++N L+K DP TLQ W+ +L + VP + LW+++F
Sbjct: 496 GFEDDDVIMCSWNTLFKCDPDTLQTWITLLVSFIPLPLLVLSQVLKRKVPKAKLWIMQFD 555
Query: 92 AVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
N+ A G+D RI +S ++ + LADV +DTPL N HTT +D LW
Sbjct: 556 PTPPPNVALLAADNGVDVTRIAWSTFFPRQLEFKIKSLADVFVDTPLFNAHTTGVDALWC 615
Query: 152 GTPVVTLPGETLASRVAASQLATLGCPEL---IARTHKEYQDIAIRL 195
GTP+VT+P E ++RVAA +A++ +L +AR+ ++Y+ + RL
Sbjct: 616 GTPLVTVPLEDFSARVAAGLVASVSRMQLGTTVARSMEDYEQVLTRL 662
>gi|78046530|ref|YP_362705.1| hypothetical protein XCV0974 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78034960|emb|CAJ22605.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 568
Score = 112 bits (281), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 93/156 (59%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR Q GLPE A+V C FN YK++P ++ + VL+AVP S+LWLL P +A ++A A
Sbjct: 367 SRAQCGLPEHAVVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGEADARLRAAA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QALG+ R++F +++ R + AD+ LDT N HTT+ D LW G PV+T PG+T
Sbjct: 427 QALGVHAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGQT 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDAAFVATAVALASD 522
>gi|285017296|ref|YP_003375007.1| hypothetical protein XALc_0485 [Xanthomonas albilineans GPE PC73]
gi|283472514|emb|CBA15019.1| hypothetical protein XALC_0485 [Xanthomonas albilineans GPE PC73]
Length = 569
Score = 112 bits (281), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 61/156 (39%), Positives = 85/156 (54%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR GLPE V+C FN YK+ P ++ + VL+ VP+S+LWLL P +A ++ A
Sbjct: 367 SRHDCGLPEAGTVFCCFNNSYKLSPRSMARMLAVLRQVPDSVLWLLSGPGQADARLREAA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
L+ R+ F A E++ R + AD+ LDT N HTT+ D LW G PV+T PG T
Sbjct: 427 CKADLEPTRLRFMPKLAHPEYLSRYRHADLFLDTHPYNAHTTASDALWAGCPVLTCPGNT 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S L ++ + A+RLG D
Sbjct: 487 FAARVAGSLNHHLNMDDMNVVDDDAFIATAVRLGRD 522
>gi|375104670|ref|ZP_09750931.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Burkholderiales bacterium JOSHI_001]
gi|374665401|gb|EHR70186.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Burkholderiales bacterium JOSHI_001]
Length = 644
Score = 112 bits (281), Expect = 7e-23, Method: Composition-based stats.
Identities = 60/161 (37%), Positives = 85/161 (52%), Gaps = 4/161 (2%)
Query: 43 SRQQYGLPEDA----IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI 98
+R + GLP+D +V C FNQ +KI W +L +P ++LWLL++ NI
Sbjct: 432 TRSEAGLPDDGQGGPLVLCGFNQPFKISAPVFDTWCRLLHRLPQAVLWLLEWNGQVRKNI 491
Query: 99 QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
+ A G+ R+ ++ + H+ R QLAD+ +DT CN HTT+ D LW PVVT
Sbjct: 492 ELEAAKRGIAPERLAWAPRRHPKHHMARLQLADIFIDTWPCNAHTTASDALWAAVPVVTF 551
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G T ASRVAAS +G LI Y++ + L DR
Sbjct: 552 AGRTFASRVAASLNHAVGLDGLICDDVAHYEERIVALAHDR 592
>gi|418300833|ref|ZP_12912644.1| hypothetical protein ATCR1_24890 [Agrobacterium tumefaciens
CCNWGS0286]
gi|355532841|gb|EHH02194.1| hypothetical protein ATCR1_24890 [Agrobacterium tumefaciens
CCNWGS0286]
Length = 653
Score = 112 bits (280), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 57/158 (36%), Positives = 90/158 (56%), Gaps = 2/158 (1%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPEDA VY +FN + KI S W+ +L P S+LWLL ++Q A
Sbjct: 392 SRAEVGLPEDAFVYASFNGMQKITESCFARWMTILSETPGSLLWLLAGDEDVNEHLQGIA 451
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+ R++F+ +H+ R +AD+ LDT H+T+ D + +G PV+T+PG+T
Sbjct: 452 EKSGIAPERLVFAPKVPNPQHIARIGVADLFLDTFPYGAHSTAADAITSGLPVLTMPGKT 511
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A+R S + G PE+I + ++Y +A +G +RD
Sbjct: 512 FAARFCGSIVTAAGVPEMICSSPEDY--VARAIGFERD 547
>gi|409441295|ref|ZP_11268287.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY [Rhizobium
mesoamericanum STM3625]
gi|408747107|emb|CCM79485.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY [Rhizobium
mesoamericanum STM3625]
Length = 685
Score = 112 bits (280), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 58/157 (36%), Positives = 88/157 (56%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R + GLPEDA ++ FN + KI + + W+ +L A PNSILWLL + ++ +A+
Sbjct: 422 RAEVGLPEDAFIFACFNGMQKITAACFERWMAILSATPNSILWLLSGGEDVDQRLRQSAE 481
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+ R++F+ A +H+ R +AD+ LDT H+T+ D + G PV+T PG
Sbjct: 482 QHGVAPERLIFAPKAPNPKHLARIGVADLFLDTFPYGAHSTAGDAVTMGLPVLTFPGNGF 541
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
ASR AS +A G PELI +Y A+ +DR+
Sbjct: 542 ASRFCASIVAAAGVPELICDGPDDYVGKAVAFASDRE 578
>gi|196232460|ref|ZP_03131313.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
gi|196223532|gb|EDY18049.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
Length = 792
Score = 112 bits (280), Expect = 9e-23, Method: Composition-based stats.
Identities = 58/145 (40%), Positives = 80/145 (55%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
+ + + N KI+ + Q W VL+ S L LL +P + A + G++++R+
Sbjct: 599 VTFASLNDFGKINRAVFQRWAAVLRKATGSRLLLLTYPGEHRRDTIALLREEGIEENRVE 658
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F + E++R AD LD NGHTTS+D LW G PVV+L GE + SR SQL
Sbjct: 659 FLEPRPRAEYLRYYHRADAMLDPFPYNGHTTSLDALWMGVPVVSLAGERMVSRAGLSQLT 718
Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
LG PEL+ART EY +IA RL D
Sbjct: 719 NLGLPELVARTEDEYVEIATRLAGD 743
>gi|99078404|ref|YP_611662.1| tetratricopeptide TPR_2 [Ruegeria sp. TM1040]
gi|99035542|gb|ABF62400.1| Tetratricopeptide TPR_2 [Ruegeria sp. TM1040]
Length = 560
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 57/154 (37%), Positives = 84/154 (54%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R +GLP+D V+ +FN +K+ PS W+ +L+ VP ++L A A + AQ
Sbjct: 351 RAAFGLPDDRFVFASFNHPHKVGPSEFATWMEILREVPQAVLLFYSGKADLGAALAERAQ 410
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A G++ R+L + H+ R D+CLD N HTT+ D +W G P++TL G
Sbjct: 411 AHGVEPSRVLACGPLPQTAHLERIAQVDLCLDCFAYNAHTTASDAVWAGVPLLTLAGRQF 470
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
A+RVA S L+T G PEL + K+Y A+ L T
Sbjct: 471 AARVATSILSTAGVPELSTVSTKDYVAKAVHLAT 504
>gi|452963366|gb|EME68439.1| O-linked N-acetylglucosamine transferase [Magnetospirillum sp.
SO-1]
Length = 690
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 59/157 (37%), Positives = 84/157 (53%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+ GLP +A V+C FN L K ++++L VPN +LWLL ++ A
Sbjct: 410 TREAAGLPANATVFCGFNGLQKFTAPMWARFMDILSRVPNGVLWLLDGGERINERLRQEA 469
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+ RI+F+ EH+ R LAD+ LDT C HTT+ D LW G PV+T+ G
Sbjct: 470 IKHGVTPDRIIFAPKLINAEHLSRYPLADLFLDTSPCGAHTTASDALWMGVPVLTVAGRG 529
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASRV S G EL+ T ++Y + A+ LG D+
Sbjct: 530 FASRVCGSLAVAAGLGELVCTTFEDYVEKAVDLGNDK 566
>gi|83312891|ref|YP_423155.1| O-linked N-acetylglucosamine transferase [Magnetospirillum
magneticum AMB-1]
gi|82947732|dbj|BAE52596.1| Predicted O-linked N-acetylglucosamine transferase
[Magnetospirillum magneticum AMB-1]
Length = 691
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 58/157 (36%), Positives = 85/157 (54%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+ GLP +A V+C FN + KI + ++++L VPN +LWLL ++ A
Sbjct: 410 TREAAGLPANATVFCGFNGVQKITAPMWERFMDILSRVPNGVLWLLDGGERINERLRQEA 469
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+ RI+F+ EH+ R LAD+ LDT C HTT+ D LW G PV+T+ G
Sbjct: 470 IRHGVTPDRIIFAPKLINAEHLSRYPLADLFLDTSPCGAHTTASDALWMGVPVLTVAGRG 529
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASRV S G +++ T EY + A+ LG D+
Sbjct: 530 FASRVCGSLAVAAGLGDMVCTTFAEYVEKAVELGNDK 566
>gi|381171822|ref|ZP_09880962.1| putative uncharacterized protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380687782|emb|CCG37449.1| putative uncharacterized protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 568
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 92/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR Q GLP+ A+V C FN YK++P ++ + VL+AVP S+LWLL P +A ++A A
Sbjct: 367 SRAQCGLPDHAVVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGEADARLRAAA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G+ R++F +++ R + AD+ LDT N HTT+ D LW G PV+T PG+T
Sbjct: 427 QAQGVHAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGQT 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + AI L +D
Sbjct: 487 FAARVAGSLNHRLGLDEMNVADDATFVAKAIELASD 522
>gi|21241713|ref|NP_641295.1| hypothetical protein XAC0943 [Xanthomonas axonopodis pv. citri str.
306]
gi|21107081|gb|AAM35831.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 568
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 92/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR Q GLP+ A+V C FN YK++P ++ + VL+AVP S+LWLL P +A ++A A
Sbjct: 367 SRAQCGLPDHAVVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGEADARLRAAA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G+ R++F +++ R + AD+ LDT N HTT+ D LW G PV+T PG+T
Sbjct: 427 QAQGVHAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGQT 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + AI L +D
Sbjct: 487 FAARVAGSLNHRLGLDEMNVADDATFVAKAIELASD 522
>gi|117926636|ref|YP_867253.1| hypothetical protein Mmc1_3361 [Magnetococcus marinus MC-1]
gi|117610392|gb|ABK45847.1| TPR repeat-containing protein [Magnetococcus marinus MC-1]
Length = 750
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 94/177 (53%), Gaps = 9/177 (5%)
Query: 24 QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
Q N +T + P SRQ+ GLP +V+ +FNQ YKI P + +W +L AV +
Sbjct: 518 QPNPRTLACDPTP------SRQEAGLPPQGVVFGSFNQSYKITPQIVALWSRLLHAVEHG 571
Query: 84 ILWLLK--FPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNG 141
+LWLL PA E ++QA A G+ + R++F+ + H+ R L DV LDT
Sbjct: 572 VLWLLDPDLPAAVE-HLQAHFAAHGIAKQRVIFAPRLSMAAHMGRLPLVDVALDTFPYTS 630
Query: 142 HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
HTT+ D LW+G P+V GET ASRV+ +G EL+ + Y +A L D
Sbjct: 631 HTTASDALWSGVPLVAWRGETFASRVSTCLAINMGMQELVVEDGEHYLALAQALAHD 687
>gi|428181344|gb|EKX50208.1| hypothetical protein GUITHDRAFT_161893 [Guillardia theta CCMP2712]
Length = 799
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/177 (36%), Positives = 102/177 (57%), Gaps = 7/177 (3%)
Query: 20 LATNQTNTKTATGEEVPQSIVITSRQQYGLP-EDAIVYCNFNQLYKIDPSTLQMWVNVLK 78
L T + +++ + P + R++ GLP + A+V+ FN YKIDP +MW+ +L+
Sbjct: 569 LQTYRRDSRAGIAVDRPHQL----REELGLPAQGALVFGAFNDHYKIDPQLFRMWMRLLR 624
Query: 79 AVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPL 138
+ S++W ++ GEA +Q A G+D R++ + K EH+ +DV LDT L
Sbjct: 625 SSNGSVIWTIRHG--GEAILQKEAVKEGIDPARLVIAEPFPKVEHLAIKGYSDVFLDTHL 682
Query: 139 CNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
NGH+T+ D+LW PVVTLPGE +A+R AA +A IAR H +Y +++ RL
Sbjct: 683 YNGHSTTSDMLWAQVPVVTLPGEKMAARTAAGIMAAANAAIGIARNHDDYFELSKRL 739
>gi|418518286|ref|ZP_13084435.1| hypothetical protein MOU_15947 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|418523120|ref|ZP_13089144.1| hypothetical protein WS7_19138 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410700276|gb|EKQ58838.1| hypothetical protein WS7_19138 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410704363|gb|EKQ62847.1| hypothetical protein MOU_15947 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 568
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 92/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR Q GLP+ A+V C FN YK++P ++ + VL+AVP S+LWLL P +A ++A A
Sbjct: 367 SRAQCGLPDHAVVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGEADARLRAAA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G+ R++F +++ R + AD+ LDT N HTT+ D LW G PV+T PG+T
Sbjct: 427 QAQGVHAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGQT 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + AI L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDATFVAKAIELASD 522
>gi|30250032|ref|NP_842102.1| TPR repeat-containing glycosyl transferase [Nitrosomonas europaea
ATCC 19718]
gi|30139139|emb|CAD86003.1| Glycosyl transferases group 1:TPR repeat [Nitrosomonas europaea
ATCC 19718]
Length = 1189
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 55/147 (37%), Positives = 83/147 (56%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+ GLP V+C+FN +K W+N+L+ VPNS+LWLL N+Q A
Sbjct: 437 TRKDCGLPARKFVFCSFNNNHKYTLEVFTTWMNILRRVPNSVLWLLADNPWARENLQKQA 496
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G+D R++F+ +++ R +AD+ LDT N TT+ D LW G PV+T+ G +
Sbjct: 497 KAQGIDPKRLVFAERTMPADYLARYLVADLFLDTFPFNAGTTANDALWMGLPVLTMSGRS 556
Query: 163 LASRVAASQLATLGCPELIARTHKEYQ 189
ASR+A + L PELI + Y+
Sbjct: 557 FASRMAGALLTAADLPELITHDLQTYE 583
>gi|428176488|gb|EKX45372.1| hypothetical protein GUITHDRAFT_163259 [Guillardia theta CCMP2712]
Length = 826
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 95/172 (55%), Gaps = 3/172 (1%)
Query: 24 QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
+ T G + V R+ GLP + V NFNQ+YK++P T ++W ++++ S
Sbjct: 524 EQETSGPAGSPSREEYVERQRRLLGLPANKTVVSNFNQMYKVNPETFKLWADIVREEETS 583
Query: 84 ILWLLKFPAVGEA---NIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCN 140
+ W + GE + A A + ++ ++ E +RR L+D+ LDT N
Sbjct: 584 LFWQILEGPQGERISRRYELHASAHRIAAEKLFWTYKTEIVEFIRRCGLSDIVLDTFPVN 643
Query: 141 GHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIA 192
HT +MDVLW GTP+++ PG ASRVA+S L++LG +LIAR +++ +++
Sbjct: 644 AHTVAMDVLWMGTPLLSRPGMLFASRVASSILSSLGQTQLIARNEEDFVELS 695
>gi|445499835|ref|ZP_21466690.1| putative UDP-N-acetylglucosamine-peptide
N-acetylglucosaminyltransferase [Janthinobacterium sp.
HH01]
gi|444789830|gb|ELX11378.1| putative UDP-N-acetylglucosamine-peptide
N-acetylglucosaminyltransferase [Janthinobacterium sp.
HH01]
Length = 957
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/156 (39%), Positives = 87/156 (55%), Gaps = 6/156 (3%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQ 99
+R YGLPE VYC F+ +I P W+ +L VP+S+LWL A GE N++
Sbjct: 424 ARAAYGLPEPGFVYCCFSAPEQITPERYATWMRILHQVPDSVLWLQ---AAGEQVRDNLR 480
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A G+ R+ F++ AA E+ + R Q AD+ LDT + DVLW G P++T
Sbjct: 481 VAAMQRGIPSERLCFADPAAPEQQLARYQAADLFLDTAPYAAGAMAGDVLWAGLPLLTST 540
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
G+ ASR +AS L G PEL+ + K+Y+ A+RL
Sbjct: 541 GDGSASRTSASLLHAAGLPELVTASEKKYEARAVRL 576
>gi|346723854|ref|YP_004850523.1| O-linked N-acetylglucosamine transferase [Xanthomonas axonopodis
pv. citrumelo F1]
gi|346648601|gb|AEO41225.1| O-linked N-acetylglucosamine transferase [Xanthomonas axonopodis
pv. citrumelo F1]
Length = 568
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 91/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR Q GLPE A+V C FN YK++P ++ + VL+ VP S+LWLL P +A ++A A
Sbjct: 367 SRAQCGLPEHAVVLCCFNNSYKLNPRSMARMLAVLRVVPGSVLWLLSGPGEADARLRAAA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G+ R++F +++ R + AD+ LDT N HTT+ D LW G PV+T PG+T
Sbjct: 427 QAQGVHAQRLVFMPKLPHAQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGQT 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDAAFVATAVALASD 522
>gi|408377703|ref|ZP_11175304.1| hypothetical protein QWE_08941 [Agrobacterium albertimagni AOL15]
gi|407748694|gb|EKF60209.1| hypothetical protein QWE_08941 [Agrobacterium albertimagni AOL15]
Length = 679
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/157 (36%), Positives = 87/157 (55%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R + GLPEDA V+ FN + KI W+ +L A P SILWLL ++ A
Sbjct: 422 TRSEVGLPEDAFVFACFNGMQKITGPMFARWMAILTATPGSILWLLGGSDAVNQRLRDLA 481
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
++ G+D R+ F++ H+ R +AD+ LDT H+T+ D L G PV+TLPG++
Sbjct: 482 KSAGVDPARLYFASKVPNPNHLARIAIADLFLDTFPYGAHSTASDALTMGLPVLTLPGKS 541
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASR +S + + G PELI + +Y A+ L D+
Sbjct: 542 FASRFCSSIVKSAGIPELICASPDDYVRKAVALANDK 578
>gi|40063716|gb|AAR38497.1| TPR repeat protein [uncultured marine bacterium 583]
Length = 733
Score = 110 bits (276), Expect = 2e-22, Method: Composition-based stats.
Identities = 59/157 (37%), Positives = 87/157 (55%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
I +R+ GLPE V+C FN YKI P+T W +L+ V S+L + + E N+
Sbjct: 519 IFTRKDLGLPETGFVFCCFNNTYKITPTTFDGWGRILEQVDGSVLLIYADNELAEINLTK 578
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
G++ R++F K E++ R ++AD+ LDT N TT+ D L G PV+T G
Sbjct: 579 EIVLRGINPSRLIFGKRLPKPEYLARYRVADLFLDTLPYNAGTTASDALRMGLPVLTCMG 638
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
++ ASR AAS + PELI T ++Y+ +AI L T
Sbjct: 639 KSFASREAASVINAFNLPELITTTQEQYESLAIELAT 675
>gi|294626108|ref|ZP_06704715.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292599557|gb|EFF43687.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 568
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 92/156 (58%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR Q GLP+ A+V C FN YK++ ++ + VL+AVP S+LWLL P +A ++A A
Sbjct: 367 SRAQCGLPDHAVVLCCFNNSYKLNTRSMARMLAVLRAVPGSVLWLLSGPGQADARMRAAA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
QA G+D R++F +++ R + AD+ LDT N HTT+ D LW G PV+T PG+T
Sbjct: 427 QAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGQT 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG E+ + A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDAAFVATAVALASD 522
>gi|254464315|ref|ZP_05077726.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110kDa subunit, putative [Rhodobacterales bacterium Y4I]
gi|206685223|gb|EDZ45705.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110kDa subunit, putative [Rhodobacterales bacterium Y4I]
Length = 714
Score = 110 bits (276), Expect = 2e-22, Method: Composition-based stats.
Identities = 58/155 (37%), Positives = 88/155 (56%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R GLP+ V+C+F +I P +W+ +L + P+S+LWLL A +N++ A
Sbjct: 500 RADCGLPDAGFVFCSFAAASQITPREFDIWMRLLSSTPDSVLWLLDHGAHAVSNLRRAAA 559
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A G+D R++F+ A ++ H+ R LA +CLDT N + D L G PV+TLPG
Sbjct: 560 ARGVDPDRLIFALPAERKAHLARLTLAGLCLDTFTVNASAAARDALAAGVPVLTLPGRQF 619
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+R AS L G P+LIA + ++Y++ A L D
Sbjct: 620 AARTGASLLRAAGLPDLIADSAEDYEEKAAELAAD 654
>gi|409989551|ref|ZP_11273106.1| hypothetical protein APPUASWS_02083, partial [Arthrospira platensis
str. Paraca]
gi|409939587|gb|EKN80696.1| hypothetical protein APPUASWS_02083, partial [Arthrospira platensis
str. Paraca]
Length = 632
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 61/158 (38%), Positives = 97/158 (61%), Gaps = 4/158 (2%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE--ANIQA 100
+R +GLPE V+ FN+ YKI P Q+W+ +L+ V S+LWL AV E ++++
Sbjct: 439 NRSDFGLPETGFVFACFNRHYKISPDLFQVWMQLLQEVEGSVLWLSL--AVEETLSHLRQ 496
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
TA+A G+D R++F+ +++ R +LAD+ LDT NG +T++ L+ G PV+T PG
Sbjct: 497 TAKAAGVDPQRLIFAPKIPHNQYLARMELADLGLDTWNYNGGSTTIAALYGGLPVLTKPG 556
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
T ASR+ AS A+ G E+I + +YQ+ A+ T+
Sbjct: 557 VTNASRMGASICASAGVTEMICNSAVDYQEKALHWATN 594
>gi|291567291|dbj|BAI89563.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 1020
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 61/158 (38%), Positives = 97/158 (61%), Gaps = 4/158 (2%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE--ANIQA 100
+R +GLPE V+ FN+ YKI P Q+W+ +L+ V S+LWL AV E ++++
Sbjct: 827 NRSDFGLPETGFVFACFNRHYKISPDLFQVWMQLLQEVEGSVLWLSL--AVEETLSHLRQ 884
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
TA+A G+D R++F+ +++ R +LAD+ LDT NG +T++ L+ G PV+T PG
Sbjct: 885 TAKAAGVDPQRLIFAPKIPHNQYLARMELADLGLDTWNYNGGSTTIAALYGGLPVLTKPG 944
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
T ASR+ AS A+ G E+I + +YQ+ A+ T+
Sbjct: 945 VTNASRMGASICASAGVTEMICNSAVDYQEKALHWATN 982
>gi|418939709|ref|ZP_13493098.1| hypothetical protein PDO_2859 [Rhizobium sp. PDO1-076]
gi|375053594|gb|EHS49984.1| hypothetical protein PDO_2859 [Rhizobium sp. PDO1-076]
Length = 681
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 58/156 (37%), Positives = 87/156 (55%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R + GLPEDA V+ FN + KI + W+ +L A P S+LWLL ++ +A
Sbjct: 420 TRTEAGLPEDAFVFACFNGMQKITANGFARWMKILAATPGSVLWLLTGSDAVNQRLRQSA 479
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+ RI+F+ A +H+ R +AD+ LDT H+T+ D + G PV+TLPG++
Sbjct: 480 TVCGVAADRIIFAPKAPNAQHLARIGVADLFLDTFPYGAHSTASDAITMGLPVLTLPGKS 539
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
ASR AS ++ G PELI T +Y + A+ D
Sbjct: 540 FASRFCASIVSAAGIPELICSTPDDYVNKAVAYARD 575
>gi|254476420|ref|ZP_05089806.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110kDa subunit, putative [Ruegeria sp. R11]
gi|214030663|gb|EEB71498.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110kDa subunit, putative [Ruegeria sp. R11]
Length = 547
Score = 110 bits (274), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 55/155 (35%), Positives = 83/155 (53%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R ++G+P+D V+ FN +YKI P W+ +LKA PNS+L + + A +
Sbjct: 350 RDRFGIPQDRFVFATFNNIYKIGPREFATWMEILKAAPNSVLLFYLSNLDLKDRLIAKVK 409
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A G+D R++ + +++H+ R D+CLD N HTT+ D +W G P++TL GE
Sbjct: 410 AAGVDPDRVILTGPLPQKDHLDRISQVDLCLDCFSYNAHTTASDAIWCGVPILTLCGEQF 469
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S L EL + EY +A L D
Sbjct: 470 AARVATSILHAANLAELSVTSVAEYVQLAAALAKD 504
>gi|154244409|ref|YP_001415367.1| hypothetical protein Xaut_0452 [Xanthobacter autotrophicus Py2]
gi|154158494|gb|ABS65710.1| Tetratricopeptide TPR_2 repeat protein [Xanthobacter autotrophicus
Py2]
Length = 569
Score = 110 bits (274), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 60/158 (37%), Positives = 88/158 (55%), Gaps = 7/158 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE---ANIQA 100
R + GLP+ V+C FN K+DP+T +W+ +L VP S+LWLL G+ AN++
Sbjct: 354 RAECGLPDTGFVFCAFNNTRKLDPATFALWLRLLMEVPGSVLWLL----AGDDARANLRR 409
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A+A G+ R++F+ EH+ R LAD+ LDT HTT+ D L G P+VT G
Sbjct: 410 EAEAAGVAPERLVFAPHRPLPEHLARMALADLFLDTFPYTAHTTASDALRVGLPLVTRTG 469
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ AS+VA L G +L+ T Y+ +A+ L D
Sbjct: 470 RSFASKVAGRLLQLSGVGDLVTDTPAAYEALALSLARD 507
>gi|298708309|emb|CBJ48372.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
family GT41 [Ectocarpus siliculosus]
Length = 662
Score = 109 bits (273), Expect = 5e-22, Method: Composition-based stats.
Identities = 65/190 (34%), Positives = 108/190 (56%), Gaps = 17/190 (8%)
Query: 16 LQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPED--AIVYCNFNQLYKIDPSTLQMW 73
L GLA T +++G + ++++ R+ GLP D V+ NFN++ K+DP + +W
Sbjct: 367 LYEGLAP----TSSSSGRKHDRAVL---RRGQGLPGDLSRAVFANFNKVDKLDPESFSLW 419
Query: 74 VNVLKAVPNSILWLLKFPAVG------EANIQATAQALGLDQHRILFSNVAAKEEHVRRG 127
+ +L+ VP S+LWLL+ AV ++ A++ G+D RI+++ +K EH+ R
Sbjct: 420 MQILRRVPGSVLWLLEASAVDSERAAIRTRLRQEAESQGVDGGRIVWAKWVSKSEHLSRH 479
Query: 128 QLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE--LIARTH 185
LAD+ LDT H+T+ D L G P++TL G + ASRV S L G + L+
Sbjct: 480 ALADLFLDTLTYGAHSTATDALAGGLPLLTLAGASFASRVGISLLRNAGPSQSALLVAGQ 539
Query: 186 KEYQDIAIRL 195
+E++D+A+ L
Sbjct: 540 REFEDLAVDL 549
>gi|349605893|gb|AEQ00976.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
110 kDa subunit-like protein, partial [Equus caballus]
Length = 418
Score = 109 bits (272), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 47/67 (70%), Positives = 56/67 (83%)
Query: 1 MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
MI GQ+Q ++NG + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 352 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 411
Query: 61 QLYKIDP 67
QLYKIDP
Sbjct: 412 QLYKIDP 418
>gi|145588310|ref|YP_001154907.1| hypothetical protein Pnuc_0122 [Polynucleobacter necessarius subsp.
asymbioticus QLW-P1DMWA-1]
gi|145046716|gb|ABP33343.1| Tetratricopeptide TPR_2 repeat protein [Polynucleobacter
necessarius subsp. asymbioticus QLW-P1DMWA-1]
Length = 660
Score = 108 bits (271), Expect = 9e-22, Method: Composition-based stats.
Identities = 54/155 (34%), Positives = 84/155 (54%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
++ +G+ ED V+CN N +KI + W+ +LK PNS++ L A N+ A
Sbjct: 452 AKSDFGIQEDQFVFCNLNNAFKITQAIFDSWLRILKMTPNSVMLLYAENAEATKNLLEYA 511
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+ R +F+ + +++ R ++ D+ LDT N TT+ D LW PV+TL GE+
Sbjct: 512 SKNGVAPQRFVFAERLTRADYLARYKITDLFLDTTPYNAGTTASDALWMDIPVITLQGES 571
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
SRVA+S L L PELI ++ Y+ A+ L T
Sbjct: 572 FPSRVASSLLTHLDLPELIHQSMAGYETQAVELAT 606
>gi|398350156|ref|YP_006395620.1| hypothetical protein USDA257_c02650 [Sinorhizobium fredii USDA 257]
gi|390125482|gb|AFL48863.1| hypothetical protein USDA257_c02650 [Sinorhizobium fredii USDA 257]
Length = 684
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 58/157 (36%), Positives = 84/157 (53%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPEDA ++ FN + KI + W+ +L A P S LWLL + ++ A
Sbjct: 420 SRAEVGLPEDAFIFACFNGMQKITAACFDRWMAILSATPGSFLWLLGGGEDVDQRLKQAA 479
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+ R++F++ A +H+ R LAD+ LDT H+T+ D L G PV+T PG
Sbjct: 480 EQHGVAPERLIFASKAPNPKHLARIGLADLFLDTFPYGAHSTAGDALTMGLPVLTFPGNG 539
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASR +S +A G PELI ++ AI DR
Sbjct: 540 FASRFCSSIVAAAGVPELICDGPDDFVRKAIGFARDR 576
>gi|372487918|ref|YP_005027483.1| hypothetical protein Dsui_1242 [Dechlorosoma suillum PS]
gi|359354471|gb|AEV25642.1| hypothetical protein Dsui_1242 [Dechlorosoma suillum PS]
Length = 724
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/166 (37%), Positives = 91/166 (54%), Gaps = 3/166 (1%)
Query: 34 EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL-LKFPA 92
E P ++ + + G P +V+ FNQ +K+D + +W VL AVP ++LWL + PA
Sbjct: 496 EAP-AVPLLEAAELGAPAAPVVFACFNQTWKVDAQAMALWQAVLAAVPEAVLWLHAETPA 554
Query: 93 VGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTG 152
V A + A+A G+ R+ FS + ++ R + LDTPL +GH T D L G
Sbjct: 555 VAAALTRRAAEA-GIGATRLHFSPGTTRPRYLARYRHVHAVLDTPLYHGHGTGADALGMG 613
Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
TPV++L GE +RV S + LG P+L A + + Y AIRLG D
Sbjct: 614 TPVLSLQGEGFQARVGGSLVRALGLPQLAAESAEAYVATAIRLGRD 659
>gi|170739041|ref|YP_001767696.1| hypothetical protein M446_0702 [Methylobacterium sp. 4-46]
gi|168193315|gb|ACA15262.1| Tetratricopeptide TPR_2 repeat protein [Methylobacterium sp. 4-46]
Length = 667
Score = 107 bits (268), Expect = 2e-21, Method: Composition-based stats.
Identities = 56/150 (37%), Positives = 86/150 (57%)
Query: 47 YGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALG 106
+GLP + +V+ F+ YK+ P W+ +L+AVP S+LWL+ A N++ A G
Sbjct: 453 HGLPAEGLVFGAFSASYKLTPDLFAAWMRLLRAVPGSVLWLVADHAPTRDNLRREAGRHG 512
Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
+D R++ + A E ++ R + D+ LDT N TT+ D LW G P++TL G ASR
Sbjct: 513 VDPDRLVLAGRAPYETYLDRLRHIDLFLDTFPYNAGTTASDALWMGVPLLTLAGRAYASR 572
Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLG 196
+A S L LG P+L+A + Y+ A+RL
Sbjct: 573 MAGSLLTALGLPDLVAASLDAYERQALRLA 602
>gi|424795604|ref|ZP_18221433.1| hypothetical protein XTG29_03097 [Xanthomonas translucens pv.
graminis ART-Xtg29]
gi|422795393|gb|EKU24099.1| hypothetical protein XTG29_03097 [Xanthomonas translucens pv.
graminis ART-Xtg29]
Length = 523
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/156 (36%), Positives = 85/156 (54%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+ GLPE V+C FN YK+ P ++ + VL+ VP+S+LWLL P + ++ A
Sbjct: 321 SRRDCGLPEHGTVFCCFNNSYKLTPRSMTRMLAVLRQVPDSVLWLLSGPGQADTRLRDAA 380
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ +D R+ F A +++ R + AD+ LDT N HTT+ D LW G P++ PG T
Sbjct: 381 RRADVDPARLRFMPKLAHADYLARYRHADLFLDTHPYNAHTTASDALWAGCPLLICPGTT 440
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG + + A++LG D
Sbjct: 441 FAARVAGSLNHHLGMDAMNVADDDAFIAKAVQLGRD 476
>gi|428168147|gb|EKX37095.1| hypothetical protein GUITHDRAFT_145209 [Guillardia theta CCMP2712]
Length = 585
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 78/130 (60%), Gaps = 3/130 (2%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA--VGEANIQA 100
+RQ++ LP ++ FN +KI P L +W+++L +VP S L +K+ NI
Sbjct: 384 TRQEFSLPLSGVLLACFNAFFKITPDILDLWLHILLSVPQSFLLFVKYQYHQAARRNILQ 443
Query: 101 TAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
+ + RI+F N A+ +H+RR L+DV LDT L N HT ++DVLW+G P+V LP
Sbjct: 444 VVKQKNISSARIIFMNRTASLRDHMRRTSLSDVYLDTRLYNAHTIAVDVLWSGVPIVALP 503
Query: 160 GETLASRVAA 169
G +LASRV A
Sbjct: 504 GTSLASRVTA 513
>gi|254468836|ref|ZP_05082242.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
putative [beta proteobacterium KB13]
gi|207087646|gb|EDZ64929.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
putative [beta proteobacterium KB13]
Length = 624
Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats.
Identities = 56/155 (36%), Positives = 86/155 (55%), Gaps = 1/155 (0%)
Query: 40 VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ 99
+ T R Y LP++ ++ +FNQ KI +W+N+L+ P S LW+L + + NI+
Sbjct: 420 ITTPRDTYRLPDNHFIFASFNQNLKITKDIFDVWINILQECPKSSLWILLQNDISKRNIK 479
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A+ GLD +RI+ + A +EH+ R + AD+ LD N HTT D ++ P +T
Sbjct: 480 EYARKNGLDDNRIIIAEYAPIDEHIGRIEHADLFLDCYPYNAHTTIADAIYQNKPTITHI 539
Query: 160 GETLASRVAASQLATLGCP-ELIARTHKEYQDIAI 193
GE + SRV AS L + C EL+ ++YQ AI
Sbjct: 540 GEVMQSRVGASLLHAIDCSEELVFDNLQDYQKQAI 574
>gi|406706967|ref|YP_006757320.1| hypothetical protein HIMB5_00014060 [alpha proteobacterium HIMB5]
gi|406652743|gb|AFS48143.1| tetratricopeptide repeat protein [alpha proteobacterium HIMB5]
Length = 657
Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats.
Identities = 63/161 (39%), Positives = 91/161 (56%)
Query: 38 SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
S+ S+Q GL E + V+C FN KI+P +W+ +LK S+LWLLK E N
Sbjct: 445 SVNSISKQSLGLTEGSFVFCCFNSHQKINPIIFDVWLEILKKTEKSVLWLLKDNKFSEKN 504
Query: 98 IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
I+ + +D R++F++ E+H++R Q AD+ LDT N HTT D L PV+T
Sbjct: 505 IKLYCEKNKVDPDRLIFADRVELEDHLQRIQYADLFLDTFPYNAHTTCSDALRMNIPVLT 564
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
L G + ASRVA+S L +L ELI Y+D A+++ D
Sbjct: 565 LKGNSFASRVASSLLNSLDLSELITHDLNNYKDAALKIYKD 605
>gi|400755802|ref|YP_006564170.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
[Phaeobacter gallaeciensis 2.10]
gi|398654955|gb|AFO88925.1| putative UDP-N-acetylglucosamine-peptide
N-acetylglucosaminyltransferase [Phaeobacter
gallaeciensis 2.10]
Length = 730
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 55/158 (34%), Positives = 84/158 (53%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+ GLP++A V+C+F + I P +W+ +L + S+LWL P +A ++ A
Sbjct: 496 SRRDCGLPDEAFVFCSFTASHAITPREFDIWMRLLTKIDGSVLWLADHPEEAQAALRQAA 555
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+D R++F+ EE V R +AD+ LD+ N + D L G PV+T+ G
Sbjct: 556 GDRRVDPDRVIFAAPCTGEERVARSMVADLFLDSFTLNAGPAARDALVAGLPVLTMAGRQ 615
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A+R AS LA G EL T + Y+ A+ L +DRD
Sbjct: 616 FAARTTASLLAAAGMEELQTTTPQAYEARALELASDRD 653
>gi|428176588|gb|EKX45472.1| hypothetical protein GUITHDRAFT_108736 [Guillardia theta CCMP2712]
Length = 710
Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats.
Identities = 48/128 (37%), Positives = 81/128 (63%)
Query: 45 QQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA 104
++ G+ Y +FN LYK+ PS W+ +L V +SILWLLKFP+ E ++ A+A
Sbjct: 520 KELGIGPSYFRYASFNSLYKLTPSVFGAWMKILSQVDSSILWLLKFPSSAEKRLRKEAKA 579
Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
G+D R++F+++ + EH+++ A + LDT + N H+++ D L +G PV+++ G +
Sbjct: 580 RGVDDKRLVFTDLYPRAEHLQKKSGASLFLDTLVYNAHSSAADSLASGVPVLSMAGSGIV 639
Query: 165 SRVAASQL 172
+RVAAS L
Sbjct: 640 NRVAASLL 647
>gi|227820814|ref|YP_002824784.1| hypothetical protein NGR_c02330 [Sinorhizobium fredii NGR234]
gi|227339813|gb|ACP24031.1| conserved hypothetical protein contains tetratricopeptide protein
domain [Sinorhizobium fredii NGR234]
Length = 685
Score = 107 bits (266), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 58/157 (36%), Positives = 82/157 (52%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPED ++ FN KI + W+ +L A P S LWLL + ++ A
Sbjct: 420 SRAEAGLPEDGFIFACFNGAQKITAACFDRWMAILAATPGSCLWLLGGGEDVDQRLKQAA 479
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
Q G+ R++F++ A +H+ R LAD+ LDT H+T+ D L G PV+T PG
Sbjct: 480 QQRGVAPERLIFASKAPNPKHLARIGLADLFLDTFPYGAHSTAGDALTMGLPVLTFPGNG 539
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASR +S +A G PELI ++ AI DR
Sbjct: 540 FASRFCSSIVAAAGVPELICEGPDDFVRKAIGFAKDR 576
>gi|301106148|ref|XP_002902157.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
[Phytophthora infestans T30-4]
gi|262098777|gb|EEY56829.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
[Phytophthora infestans T30-4]
Length = 744
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 90/155 (58%), Gaps = 2/155 (1%)
Query: 47 YGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA--NIQATAQA 104
+ L + V+ F+ K+DPS W+ +L VP+S++W ++ A N++ A+A
Sbjct: 321 FDLDDYTFVFATFSNWQKMDPSVFSAWMEILARVPSSVMWFQEYFGHEGAITNLRNEAEA 380
Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
G++ R++FS + +H R ++AD+ LDT + NGHTT +D L G PV+TL G+ ++
Sbjct: 381 HGINGKRLIFSPLDPWIDHTYRKRIADLVLDTSMKNGHTTVLDALCAGVPVITLEGDQMS 440
Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+R +S L L ++ + KEY ++A+ L T +
Sbjct: 441 NRATSSALNALDLHDMTVHSIKEYVEVAVYLATHK 475
>gi|196232461|ref|ZP_03131314.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
gi|196223533|gb|EDY18050.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
Length = 760
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 59/145 (40%), Positives = 76/145 (52%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
+ Y +FN K++P L++W VL A S L LL P + G+ +RI
Sbjct: 567 VTYGSFNNFCKVNPDVLRLWTRVLAATEGSRLILLSHPGSHRQRTVDIFEREGIAANRIE 626
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F E++R Q D+ LD NGHTTS+D LW G PVV+L G+ ASR SQL
Sbjct: 627 FVEPRPFAEYLRLYQRLDLVLDPFPYNGHTTSLDALWMGVPVVSLAGQLAASRAGLSQLT 686
Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
LG EL+ART EY I+ RL D
Sbjct: 687 NLGLTELVARTEDEYVAISTRLAGD 711
>gi|399992733|ref|YP_006572973.1| hypothetical protein PGA1_c15440 [Phaeobacter gallaeciensis DSM
17395 = CIP 105210]
gi|398657288|gb|AFO91254.1| hypothetical protein PGA1_c15440 [Phaeobacter gallaeciensis DSM
17395 = CIP 105210]
Length = 552
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 56/167 (33%), Positives = 86/167 (51%), Gaps = 2/167 (1%)
Query: 34 EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV 93
VP+ V +R Y LP+D V+ N N YK+ P W+ +LK P+S+L
Sbjct: 343 HVPE--VRNTRSDYNLPQDKFVFANLNNTYKVGPREFATWMKILKRTPDSVLLFYMGKDD 400
Query: 94 GEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGT 153
I +A G+D RI+ + +H+ R D+CLD N HTT+ D LW G
Sbjct: 401 LSDVIAQKTEAHGVDPDRIIPCGALPQADHLDRIAQVDLCLDCFSYNAHTTASDALWAGV 460
Query: 154 PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
P++TL G+ A+RVA+S L+ P+L ++ + + D A+ L + D
Sbjct: 461 PILTLAGKQFAARVASSILSAAHLPDLSVKSEELFIDKAVSLAKNPD 507
>gi|224010822|ref|XP_002294368.1| o-linked GlcNAc transferase [Thalassiosira pseudonana CCMP1335]
gi|220969863|gb|EED88202.1| o-linked GlcNAc transferase [Thalassiosira pseudonana CCMP1335]
Length = 413
Score = 106 bits (264), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 62/175 (35%), Positives = 94/175 (53%), Gaps = 16/175 (9%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAV---------PNSILWLLKFP 91
++ RQ+YG+ A VYC ++ KIDPST + W+ ++ V P+ +LWLL+
Sbjct: 184 VSLRQKYGIDPAAFVYCCHSRPDKIDPSTFRSWLRAIRRVISEQSGTDAPSPVLWLLRSG 243
Query: 92 AVGEANI-QATAQALGLD-QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVL 149
E N+ Q G D Q ++F+ VA + EH++R AD+ LDTP N HT D L
Sbjct: 244 EEMEHNLRQLVVCEFGEDMQDCLVFAEVADRREHLKRLGCADLFLDTPAYNAHTLGCDAL 303
Query: 150 WTGTPVVTL-----PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+ G P+V+L LASRV AS L + C LI EY+D+ ++ +++
Sbjct: 304 YVGVPMVSLLRNYDGNSKLASRVGASMLISASCEALIVSDMNEYEDMMVKCVSEK 358
>gi|162455198|ref|YP_001617565.1| hypothetical protein sce6916 [Sorangium cellulosum So ce56]
gi|161165780|emb|CAN97085.1| TPR domain protein [Sorangium cellulosum So ce56]
Length = 686
Score = 106 bits (264), Expect = 5e-21, Method: Composition-based stats.
Identities = 57/151 (37%), Positives = 79/151 (52%)
Query: 48 GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGL 107
L E + + N K++ L +W VL AVP S L L+ +++ + G+
Sbjct: 491 ALAEGRVNFGCLNHFRKVNEGVLHVWTRVLAAVPRSRLTLMAPQGRARDRVRSIFEEGGV 550
Query: 108 DQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
RI F + + +++RR + D+CLDT NGHTTS+D LW G PVVTL GET+ R
Sbjct: 551 QLDRIEFVDRCGRLDYLRRYREIDICLDTFPSNGHTTSLDALWMGVPVVTLAGETVVGRA 610
Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTD 198
Q LG PELIA T + Y +A L D
Sbjct: 611 GVCQAMNLGLPELIATTPEHYVRVASSLAED 641
>gi|427717067|ref|YP_007065061.1| hypothetical protein Cal7507_1771 [Calothrix sp. PCC 7507]
gi|427349503|gb|AFY32227.1| hypothetical protein Cal7507_1771 [Calothrix sp. PCC 7507]
Length = 1269
Score = 106 bits (264), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 90/170 (52%), Gaps = 4/170 (2%)
Query: 35 VPQSIVIT----SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF 90
V SI IT +R +GLP DA V+C FN+ K DP W+ +L+ VPNS+LWL++
Sbjct: 1058 VASSITITDQPLTRADFGLPTDAFVFCCFNRSDKFDPEVFASWMRILQQVPNSVLWLIET 1117
Query: 91 PAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLW 150
++ AQ G+ R++F+ E++ +LAD+ LDT + N T + L+
Sbjct: 1118 TPDVSYTLRDMAQQQGIAPTRLVFTPRLPLAEYLAAYRLADLFLDTFVYNAGATGIHALF 1177
Query: 151 TGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
G P++T PG+ +R+ AS A G LI + Y+ A+ L T D
Sbjct: 1178 AGLPLITRPGKAFVARMGASICAAAGLDLLICDSSAAYEQKAVHLATHPD 1227
>gi|440733676|ref|ZP_20913367.1| hypothetical protein A989_18498 [Xanthomonas translucens DAR61454]
gi|440359550|gb|ELP96855.1| hypothetical protein A989_18498 [Xanthomonas translucens DAR61454]
Length = 569
Score = 106 bits (264), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 57/156 (36%), Positives = 84/156 (53%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+ GLPE V+C FN YK+ P ++ + VL+ V S+LWLL P +A + A
Sbjct: 367 ARRDCGLPEHGTVFCCFNNSYKLTPRSMTRMLAVLRQVQGSVLWLLSGPGQADARLHDAA 426
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ +D R+ F A +++ R + AD+ LDT N HTT+ D LW G P++T PG T
Sbjct: 427 RRADVDPARLRFMPKLAHADYLARYRHADLFLDTHPYNAHTTASDALWAGCPLLTCPGTT 486
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG + + A++LG D
Sbjct: 487 FAARVAGSLNHHLGMDAMNVADDDAFVAKAVQLGRD 522
>gi|400754413|ref|YP_006562781.1| hypothetical protein PGA2_c15330 [Phaeobacter gallaeciensis 2.10]
gi|398653566|gb|AFO87536.1| hypothetical protein PGA2_c15330 [Phaeobacter gallaeciensis 2.10]
Length = 552
Score = 105 bits (263), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 89/170 (52%), Gaps = 10/170 (5%)
Query: 35 VPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
VP+ V +R Y LP+D V+ N N YK+ P W+ +LK P S+L +G
Sbjct: 344 VPE--VRNTRSDYNLPQDKFVFANLNNTYKVGPREFATWMKILKRTPESVLLFY----MG 397
Query: 95 EAN----IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLW 150
+ + I +A G+D RI+ + +H+ R D+CLD N HTT+ D LW
Sbjct: 398 KDDLCDVIAQKTEAHGVDPDRIIPCGALPQADHLDRIAQVDLCLDCFSYNAHTTASDALW 457
Query: 151 TGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
G P++TL G+ A+RVA+S L+ P+L ++ + + D A+ L + D
Sbjct: 458 AGVPILTLAGKQFAARVASSILSAAHLPDLSVKSEELFIDKAVSLAKNPD 507
>gi|335035927|ref|ZP_08529257.1| hypothetical protein AGRO_3257 [Agrobacterium sp. ATCC 31749]
gi|333792491|gb|EGL63858.1| hypothetical protein AGRO_3257 [Agrobacterium sp. ATCC 31749]
Length = 681
Score = 105 bits (263), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 88/158 (55%), Gaps = 2/158 (1%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR GLPEDA VY +FN + KI + W+ +L P S+LWLL ++ A
Sbjct: 420 SRADVGLPEDAFVYASFNGMQKITENCFARWMTILSETPGSLLWLLTGDDDVNQRLRDLA 479
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+ R++F+ A +H+ R LAD+ LDT H+T+ D + +G PV+T+ G+T
Sbjct: 480 EKSGVASERLVFAPKAQNPQHIARIGLADLFLDTFPYGAHSTASDAITSGLPVLTMSGKT 539
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A+R S + G PE+I + ++Y +A +G +RD
Sbjct: 540 FAARFCGSIVTAAGVPEMICSSPEDY--VARAIGFERD 575
>gi|40063048|gb|AAR37904.1| TPR domain protein [uncultured marine bacterium 560]
Length = 732
Score = 105 bits (262), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 89/178 (50%), Gaps = 5/178 (2%)
Query: 20 LATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKA 79
L + Q N T + P I T RQ GLPE V+C FN +KI P+T W +L+
Sbjct: 503 LPSYQANDSTQS----PPEIFFT-RQDLGLPETGFVFCCFNNTFKITPTTFDSWGRILEH 557
Query: 80 VPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLC 139
V S+L + + N+ G+D R++F E++ R ++ D+ LDT
Sbjct: 558 VEGSVLLIYANNESAKLNLTKEIVLRGIDPSRLIFGEKLPMSEYLARYRVVDLFLDTHPY 617
Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
N T + L G PV+T G + ASR+ AS + + PELI T KEY+ +AI L T
Sbjct: 618 NAGTIASGALRMGLPVLTCIGNSFASRMGASVINAVNLPELITSTQKEYESLAIELAT 675
>gi|46202694|ref|ZP_00052759.2| COG3914: Predicted O-linked N-acetylglucosamine transferase,
SPINDLY family [Magnetospirillum magnetotacticum MS-1]
Length = 256
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/167 (37%), Positives = 90/167 (53%)
Query: 32 GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
G+ P R +GLPEDA V+C N +K P T +W +L VP +LWLL
Sbjct: 37 GDHPPVGEPSQPRAAHGLPEDAFVFCCINNSFKFRPQTFDVWAEILHQVPRGVLWLLDRG 96
Query: 92 AVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
+ N+ G+D R++F+ EH+ R +LAD+ LD+ N TT+ D LW+
Sbjct: 97 PLSRDNMNREMAGRGIDPARLVFAQRVDLAEHMARHRLADLFLDSFPFNAITTASDALWS 156
Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G PV+T GE+ SR A+S L LG PEL+ + + Y+ AIRL +
Sbjct: 157 GLPVLTWMGESFVSRGASSLLLALGLPELVTASPEAYKAEAIRLARE 203
>gi|159185979|ref|NP_356665.2| hypothetical protein Atu3976 [Agrobacterium fabrum str. C58]
gi|159141113|gb|AAK89450.2| conserved hypothetical protein [Agrobacterium fabrum str. C58]
Length = 681
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 88/158 (55%), Gaps = 2/158 (1%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR GLPEDA VY +FN + KI + W+ +L P S+LWLL ++ A
Sbjct: 420 SRADVGLPEDAFVYASFNGMQKITENCFARWMTILSETPGSLLWLLTGDDDVNQRLRDLA 479
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+ R++F+ A +H+ R LAD+ LDT H+T+ D + +G PV+T+ G+T
Sbjct: 480 EKSGVASERLVFAPKAQNPQHIARIGLADLFLDTFPYGAHSTASDAITSGLPVLTMYGKT 539
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A+R S + G PE+I + ++Y +A +G +RD
Sbjct: 540 FAARFCGSIVTAAGVPEMICSSPEDY--VARAIGFERD 575
>gi|378824868|ref|YP_005187600.1| hypothetical protein SFHH103_00273 [Sinorhizobium fredii HH103]
gi|365177920|emb|CCE94775.1| conserved hypothetical protein [Sinorhizobium fredii HH103]
Length = 685
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 56/156 (35%), Positives = 81/156 (51%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R + GLPED ++ FN KI + W+ +L A P S LWLL + ++ AQ
Sbjct: 421 RAEAGLPEDGFIFACFNGAQKITAACFDRWMAILAATPGSFLWLLGGGEDVDQRLKQAAQ 480
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+ R++F++ A +H+ R LAD+ LDT H+T+ D L G PV+T PG
Sbjct: 481 QRGVAPERLIFASKAPNPKHLARIGLADLFLDTFPYGAHSTAGDALTMGLPVLTFPGNGF 540
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASR +S +A G PELI ++ A+ DR
Sbjct: 541 ASRFCSSIVAAAGVPELICDGPDDFVRKAVGFAKDR 576
>gi|222148769|ref|YP_002549726.1| hypothetical protein Avi_2411 [Agrobacterium vitis S4]
gi|221735755|gb|ACM36718.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length = 582
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/159 (40%), Positives = 91/159 (57%), Gaps = 6/159 (3%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR GLP D +V FN K T+++W VLKA P ++LWL+ G QATA
Sbjct: 366 SRMALGLPADRVVIGAFNSQRKNSLLTMRLWAEVLKANPKALLWLM---IDGHGARQATA 422
Query: 103 ---QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
+ LG+ Q ++LF+ A H+ R Q AD +DT CNGHTT+ D+LW G PV+T+
Sbjct: 423 AHFKTLGVKQSQLLFAPKMAYAAHLARAQAADFAIDTFPCNGHTTTSDMLWAGLPVMTMQ 482
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G ASRV+ S L +G P+L+A +++ +A L D
Sbjct: 483 GRNFASRVSESLLNAIGLPQLVASNEQDFVALATALIND 521
>gi|224010090|ref|XP_002294003.1| UDP-N-acetylglucosamine: polypeptide-n-acetylglucosaminyl
transferase [Thalassiosira pseudonana CCMP1335]
gi|220970675|gb|EED89012.1| UDP-N-acetylglucosamine: polypeptide-n-acetylglucosaminyl
transferase [Thalassiosira pseudonana CCMP1335]
Length = 420
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 92/156 (58%), Gaps = 17/156 (10%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ-------ATAQAL 105
+ VYCNFN+ K +P T++ W+ +L+ VPNSIL LL+ P G N++ A+ Q+
Sbjct: 218 SFVYCNFNKFLKHNPQTMRSWIRILQEVPNSILCLLENPIEGIPNLRKFVSEATASPQSD 277
Query: 106 GLDQHRILFSNVAAKEEHVRRGQ-LADVCLDTPLCNGHTTSMDVLWTGTPVVTLP-GETL 163
G++ ++ F +H RR L +V LD+ NGHTT+ D L+ G P+VT G+ +
Sbjct: 278 GIEGNQNPF-------DHQRRTHSLCNVMLDSHPYNGHTTAQDALYAGVPIVTRSDGDDM 330
Query: 164 ASRVAASQLATLGCPELIARTH-KEYQDIAIRLGTD 198
+SRV+ S LG EL A EY+DIAIRLG D
Sbjct: 331 SSRVSTSANVVLGLEELNAYNGINEYEDIAIRLGND 366
>gi|428186291|gb|EKX55141.1| hypothetical protein GUITHDRAFT_131413 [Guillardia theta CCMP2712]
Length = 839
Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats.
Identities = 52/151 (34%), Positives = 91/151 (60%), Gaps = 5/151 (3%)
Query: 52 DAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQATAQALGLD 108
+ V+C+F + YK+ P + WV++L PNSILWLL +P + ++ A+ +GLD
Sbjct: 625 ETFVFCSFAKSYKLHPRLFKTWVSILLRSPNSILWLLSYPDGSDLARPRLREHAERMGLD 684
Query: 109 QHRILFSNVAAKEEH-VRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
R++ + + ++ H + +G A + LD+ NGH T+ + L G PV+ LP ASR+
Sbjct: 685 PARLVVTGLLERKRHLLLKGTFAHLLLDSFPYNGHLTTAEALAAGIPVLALPQTRPASRL 744
Query: 168 AASQLATLGC-PELIARTHKEYQDIAIRLGT 197
++S LA L P L+AR ++Y+++A+R+ +
Sbjct: 745 SSSMLAALQLHPRLVARDLEDYEELAVRIAS 775
>gi|254475750|ref|ZP_05089136.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
putative [Ruegeria sp. R11]
gi|214029993|gb|EEB70828.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
putative [Ruegeria sp. R11]
Length = 730
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 83/158 (52%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+ GLP+D V+C + I P +W+ +L V +S+LW+ +P + ++ A
Sbjct: 496 SRRDCGLPDDGFVFCTLAPSHAITPREYDIWMRLLTKVGDSVLWMPDYPTKAQVALRKEA 555
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A G+ R++F E + R ++AD+ LDT N T+ + L G PV+++ G
Sbjct: 556 EARGVSPERLIFGPACDGSERMARAKVADLFLDTFTLNAGPTAREALIAGLPVLSMAGRQ 615
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A+R AS L G EL+ + K Y+ A+ L DRD
Sbjct: 616 FAARTTASLLTAAGLGELLTTSEKTYEARALELAEDRD 653
>gi|399991359|ref|YP_006571599.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
[Phaeobacter gallaeciensis DSM 17395 = CIP 105210]
gi|398655914|gb|AFO89880.1| putative UDP-N-acetylglucosamine-peptide
N-acetylglucosaminyltransferase [Phaeobacter
gallaeciensis DSM 17395 = CIP 105210]
Length = 730
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 54/158 (34%), Positives = 83/158 (52%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+ GLP++A V+C+F + I P +W+ +L + S+LWL P +A ++ A
Sbjct: 496 SRRDCGLPDEAFVFCSFTAGHAITPREFDIWMRLLTKIDGSVLWLADHPEEAQAALRQAA 555
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+D R++F+ EE + R +AD+ LD+ N + D L G PV+T+ G
Sbjct: 556 GDHRVDPDRVIFAAPCTGEERMARSMVADLFLDSFTLNAGPAARDALVAGLPVLTMAGRQ 615
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A+R AS LA G EL T + Y+ A+ L DRD
Sbjct: 616 FAARTTASLLAAAGMEELQTTTPQAYEARALELAGDRD 653
>gi|417861444|ref|ZP_12506499.1| hypothetical protein Agau_L101243 [Agrobacterium tumefaciens F2]
gi|338821848|gb|EGP55817.1| hypothetical protein Agau_L101243 [Agrobacterium tumefaciens F2]
Length = 685
Score = 103 bits (256), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 88/158 (55%), Gaps = 2/158 (1%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR + GLPEDA VY +FN + KI + W+ +L P S+LWLL ++ A
Sbjct: 424 SRAEVGLPEDAFVYASFNGMQKITENCFARWMTILSQTPGSLLWLLTGDDDVNQRLRDVA 483
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+ R++F+ +H+ R +AD+ LDT H+T+ D + +G PV+T+ G+T
Sbjct: 484 EKSGVAPERLVFAPKVQNPQHIARIGVADLFLDTFPYGAHSTAADAITSGLPVLTMSGKT 543
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A+R S + G PE++ + ++Y +A +G +RD
Sbjct: 544 FAARFCGSIVTAAGVPEMLCSSPEDY--VARAIGFERD 579
>gi|222147875|ref|YP_002548832.1| hypothetical protein Avi_1160 [Agrobacterium vitis S4]
gi|221734863|gb|ACM35826.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length = 630
Score = 102 bits (255), Expect = 7e-20, Method: Composition-based stats.
Identities = 55/158 (34%), Positives = 83/158 (52%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+ GLP V+ +FN KID +++W +LK VP +IL + N++
Sbjct: 402 SRESVGLPSGRFVFGSFNGSQKIDRQAIRIWAQILKRVPEAILSISCARVAVADNLRVAF 461
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+D R++F + E + R D+ LDT + NGHTT+ D LW G PV+T G+
Sbjct: 462 AQQGIDAGRLIFFDNCPSAEFLARMSATDLVLDTFIYNGHTTTSDALWAGVPVLTKKGKA 521
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A RV+ S L +G PEL+A+ ++ A+ L D
Sbjct: 522 FAGRVSESLLKAVGLPELVAQDADDFIARAVDLAEHPD 559
>gi|300120051|emb|CBK19605.2| unnamed protein product [Blastocystis hominis]
Length = 318
Score = 102 bits (254), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 80/158 (50%), Gaps = 1/158 (0%)
Query: 14 IVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMW 73
+ L A N N S + R + +P++ +YC NQLYKI L W
Sbjct: 150 VYLPYYYACNHYNKVPDLHPYQSDSATLYLRNTFKVPQEGTLYCFPNQLYKISSDLLDTW 209
Query: 74 VNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA-KEEHVRRGQLADV 132
N+L P+S LWLL+ P GE +I+ A G+ + R+LFS+ K ++ R + D+
Sbjct: 210 ANILHRHPSSYLWLLRHPLEGEESIKLELMARGVHKDRVLFSDFEDNKALYMVRTSICDL 269
Query: 133 CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS 170
LD+P+ + T +D W+G PVV LPG+ R+ S
Sbjct: 270 ILDSPVWSAGATGLDAYWSGVPVVNLPGDRTVERLGIS 307
>gi|221634584|ref|YP_002523272.1| O-linked acetylglucosamine transferase [Rhodobacter sphaeroides
KD131]
gi|221163457|gb|ACM04419.1| O-linked acetylglucosamine transferase [Rhodobacter sphaeroides
KD131]
Length = 598
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 82/155 (52%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R GLPE A V+ + YKI W ++ AVP S+LWL++ G+A + Q
Sbjct: 403 RPGEGLPEAAFVFACASHFYKITEPLFAAWCRIVAAVPGSVLWLVEDTPEGQAALAGRWQ 462
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A GLD HR++F+ + R LAD+ LDT N T + D L G PV+TL G T
Sbjct: 463 AAGLDPHRLIFAPRVDPARYRARLALADLFLDTMPYNAGTIASDALRMGLPVLTLAGRTF 522
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ R+AAS LA +G + IA + Y A+ + TD
Sbjct: 523 SGRMAASLLAAVGLEDCIAPDLEAYVARAVAIATD 557
>gi|397644930|gb|EJK76613.1| hypothetical protein THAOC_01614, partial [Thalassiosira oceanica]
Length = 354
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/155 (39%), Positives = 86/155 (55%), Gaps = 7/155 (4%)
Query: 51 EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ-----ATAQAL 105
E + VYCNFN+ K +P T++ W+ +L+ VPNSIL LL+FPA N++ +
Sbjct: 114 EVSFVYCNFNKFLKNNPKTMRSWLRILEEVPNSILCLLEFPAEAIPNLREFILDSVQNKS 173
Query: 106 GLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP-GETLA 164
D+ + + R L +V LD+ NGHTT+ D L+ G P+VT G+ +A
Sbjct: 174 TSDRIHFISWEKNPFDHQARSSSLCNVMLDSHPYNGHTTAQDALYAGVPIVTRSDGKDMA 233
Query: 165 SRVAASQLATLGCPELIA-RTHKEYQDIAIRLGTD 198
SRV+ S LG L A EY++IAIRLGTD
Sbjct: 234 SRVSTSANVVLGLEALNAVGGTSEYEEIAIRLGTD 268
>gi|428177567|gb|EKX46446.1| hypothetical protein GUITHDRAFT_70511, partial [Guillardia theta
CCMP2712]
Length = 518
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 88/177 (49%), Gaps = 23/177 (12%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVL-------------------KAVPNSI 84
R+Q GL EDA+V NFN LYKI+P + +W + + + P +
Sbjct: 323 RRQEGLEEDALVVANFNGLYKIEPESFHIWRDGILRCKERGRGGGRGGGGGASERTPCKV 382
Query: 85 -LWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
LWL+ NI+ A L I+F+ + H+RR ADV LD+ N HT
Sbjct: 383 QLWLVSEGEESNENIRKEAGPLL--SPLIVFAQRREFDAHIRRAAAADVFLDSFTYNAHT 440
Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLG-CPELIARTHKEYQDIAIRLGTDR 199
T++D LW G VVTLPGET+ SRVA L G E + RT EY ++ RL DR
Sbjct: 441 TAVDALWGGIAVVTLPGETIVSRVATGVLGGFGPGSETVTRTTGEYAELIGRLLEDR 497
>gi|428178657|gb|EKX47531.1| hypothetical protein GUITHDRAFT_69463, partial [Guillardia theta
CCMP2712]
Length = 676
Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats.
Identities = 53/148 (35%), Positives = 85/148 (57%), Gaps = 4/148 (2%)
Query: 49 LPEDAI---VYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL 105
P DAI + CNFNQ YKI P Q+W+++L VP+S LWLL + G+ ++ A+
Sbjct: 493 FPADAIDSFILCNFNQFYKITPKVWQLWMHLLGQVPDSKLWLLSWNEAGKQSLLRYAKQS 552
Query: 106 GLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLAS 165
G+ +++F+ ++ HV D+ +DT H T+ D L+ G P+++ PGE++AS
Sbjct: 553 GIPSDKLIFTTFFEEKWHVASKANCDLFVDTRPYAAHGTAADALFAGLPIISAPGESMAS 612
Query: 166 RVAASQLATLGCPEL-IARTHKEYQDIA 192
RV+ S G +L IAR ++Y +A
Sbjct: 613 RVSLSLSLASGIGDLFIARNDQDYVRLA 640
>gi|88809832|ref|ZP_01125338.1| TPR domain protein [Synechococcus sp. WH 7805]
gi|88786216|gb|EAR17377.1| TPR domain protein [Synechococcus sp. WH 7805]
Length = 890
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/154 (36%), Positives = 85/154 (55%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R GLPEDA+V+ F++ KI P W+ +L+ VP+++LW++ + E ++ A
Sbjct: 696 TRNDIGLPEDAVVFACFHRAEKITPMRFHCWLEILQQVPDALLWIINDQPLVEERLRKKA 755
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+A GL R++FS + LAD+ LDT + T++ L G P+++ PGE
Sbjct: 756 RAAGLGPQRLVFSPKLESALFSQACSLADLLLDTSPYSSGATAVTALAAGLPLLSCPGEN 815
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
ASR+ AS A G ELI T + YQ AI LG
Sbjct: 816 FASRMGASLCAATGLNELICSTPEAYQQKAIALG 849
>gi|196230691|ref|ZP_03129552.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
gi|196225032|gb|EDY19541.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
Length = 684
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/145 (39%), Positives = 78/145 (53%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
I + N L K++ L+ W +L AV S L LL I Q LG++ HR+
Sbjct: 493 ITFGALNNLTKVNEPLLESWARILGAVKGSRLLLLGSRGSHRQRITDFLQRLGVESHRVE 552
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F +EE++R Q D+ LD+ NGHTTS+D LW G PVV+L G T SR S L+
Sbjct: 553 FLLPCRREEYLRYHQRIDLILDSFPYNGHTTSLDALWMGVPVVSLAGNTSVSRGGLSILS 612
Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
L EL+AR+ +Y +A+ L TD
Sbjct: 613 NLALAELVARSADDYVRVAVDLATD 637
>gi|398376282|ref|ZP_10534464.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Rhizobium sp. AP16]
gi|397727476|gb|EJK87900.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Rhizobium sp. AP16]
Length = 691
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 93/180 (51%), Gaps = 3/180 (1%)
Query: 17 QNGLATNQTNTKTATGEEVPQSIVIT---SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMW 73
+N + ++ + A + + + VI SR + GLPED VY FN + KI T W
Sbjct: 391 ENEIYFSEKVLRIACNQPIDRKRVIAERPSRAEAGLPEDVFVYACFNGMQKITQETFAHW 450
Query: 74 VNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVC 133
+ +L S+LWLL + + A+ G+ R++F+ A +H+ R +AD+
Sbjct: 451 MKILSGTTGSVLWLLTGGDDVDKRLLQLAENAGVAPERLIFAPKAPNAKHLARIAVADLF 510
Query: 134 LDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAI 193
LDT H+T+ D L G PV+T PG++ A+R S +A G PELI ++Y + AI
Sbjct: 511 LDTFPYGAHSTAGDALTMGLPVLTFPGKSFAARFCHSIVAAAGVPELICSNPEDYIERAI 570
>gi|222080835|ref|YP_002540198.1| glycosyltransferase domain-containing protein [Agrobacterium
radiobacter K84]
gi|221725514|gb|ACM28603.1| glycosyltransferase TPR domain protein [Agrobacterium radiobacter
K84]
Length = 704
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 93/180 (51%), Gaps = 3/180 (1%)
Query: 17 QNGLATNQTNTKTATGEEVPQSIVIT---SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMW 73
+N + ++ + A + + + VI SR + GLPED VY FN + KI T W
Sbjct: 404 ENEIYFSEKVLRIACNQPIDRKRVIAERPSRAEAGLPEDVFVYACFNGMQKITQETFAHW 463
Query: 74 VNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVC 133
+ +L S+LWLL + + A+ G+ R++F+ A +H+ R +AD+
Sbjct: 464 MKILAGTTGSVLWLLTGGDDVDKRLLQLAENAGVAPERLIFAPKAPNAKHLARIAVADLF 523
Query: 134 LDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAI 193
LDT H+T+ D L G PV+T PG++ A+R S +A G PELI ++Y + AI
Sbjct: 524 LDTFPYGAHSTAGDALTMGLPVLTFPGKSFAARFCHSIVAAAGVPELICSNPEDYIERAI 583
>gi|428178004|gb|EKX46881.1| hypothetical protein GUITHDRAFT_107237 [Guillardia theta CCMP2712]
Length = 902
Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats.
Identities = 56/190 (29%), Positives = 93/190 (48%), Gaps = 34/190 (17%)
Query: 24 QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
Q+ K G+ P+ + + + +++ NFN L K+ P T +W+ +L VP+S
Sbjct: 651 QSQMKQVEGDFTPEDAAARNLPHH---QHQVIFANFNHLQKLGPGTFALWMKILGQVPSS 707
Query: 84 ILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
+LW+L+FP EA+++ A A ++ RIL ++ + LDT N H
Sbjct: 708 LLWMLRFPREAEAHLKREAAASNINASRILMTD--------------KILLDTLEYNAHV 753
Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIART-----------------HK 186
+ +D LW G P++TL G +A R AS L + G L+ART +
Sbjct: 754 SGIDALWIGLPLITLAGSNMARRCGASFLQSQGLDLLLARTQATSHVLHPRTCDGSAEQE 813
Query: 187 EYQDIAIRLG 196
EY ++A++LG
Sbjct: 814 EYVEMAVKLG 823
>gi|302143604|emb|CBI22357.3| unnamed protein product [Vitis vinifera]
Length = 351
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/98 (48%), Positives = 62/98 (63%)
Query: 82 NSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNG 141
+S+LWLL+FPAVGE +++ A GL RI+F A K EH++R LAD+ LDTPLCN
Sbjct: 123 SSVLWLLRFPAVGEMRLRSYTVAQGLQPDRIIFIVGAMKHEHIKRSALADLFLDTPLCNA 182
Query: 142 HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
T D+LW P+VTL E +A+RVA S G E
Sbjct: 183 QTIDTDILWASLPMVTLSFEKMATRVAGSLCLATGLDE 220
>gi|196229049|ref|ZP_03127915.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
gi|196227330|gb|EDY21834.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
Length = 725
Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats.
Identities = 56/145 (38%), Positives = 77/145 (53%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
+ + + N KI+ L++W VL AV +S L LL P + +A G+ + RI
Sbjct: 533 VTFASLNNYTKINEPLLKLWARVLAAVADSRLLLLSPPGSHRERVLEIMRAEGVREQRIE 592
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F + E++ R DV LD NGHTTS+D LW G PVV+ GE + SR SQL+
Sbjct: 593 FVAEGPRREYLERYHRVDVMLDPFPYNGHTTSLDALWMGVPVVSRAGEAVVSRGGLSQLS 652
Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
LG EL+A + +EY IA L D
Sbjct: 653 NLGLRELVAFSEEEYVKIAAGLAGD 677
>gi|72382664|ref|YP_292019.1| TPR repeat-containing protein [Prochlorococcus marinus str. NATL2A]
gi|72002514|gb|AAZ58316.1| TPR repeat [Prochlorococcus marinus str. NATL2A]
Length = 739
Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats.
Identities = 52/157 (33%), Positives = 84/157 (53%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR++ GLP D+ V+ FN+ KI +W+ +LK + NS+LWL+K N+ +
Sbjct: 545 SREELGLPLDSFVFTCFNRTEKITRKEFDIWMRLLKKIDNSVLWLIKPHKAAIENLYSEL 604
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+++ RI+F+ +H+ R + D+ LDT N TT LW G P++TL G+T
Sbjct: 605 NKQSMNKERIVFAEFMNLNDHLSRHEYGDLFLDTFNYNAGTTGALSLWAGLPIITLAGKT 664
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+SR +A L L LI +Y+ +A L T++
Sbjct: 665 NSSRASAGFLNALDLNGLITYNEYDYESLAYELATNK 701
>gi|433679021|ref|ZP_20510807.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY [Xanthomonas
translucens pv. translucens DSM 18974]
gi|430815863|emb|CCP41351.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY [Xanthomonas
translucens pv. translucens DSM 18974]
Length = 504
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/156 (37%), Positives = 87/156 (55%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R+ GLPE V+C FN YK+ P ++ + VL+ VP+S+LWLL P +A ++ A
Sbjct: 302 ARRDCGLPEHGTVFCCFNNSYKLTPRSMTRMLAVLRQVPDSMLWLLSGPGQADARLREAA 361
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ +D R+ F A +++ R + AD+ LDT N HTT+ D LW G P++T PG T
Sbjct: 362 RRADVDPARLRFMPKLAHADYLARYRHADLFLDTHPYNAHTTASDALWAGCPLLTCPGTT 421
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+RVA S LG + + A++LG D
Sbjct: 422 FAARVAGSLNHHLGMDAMNVADDAAFAAKAVQLGRD 457
>gi|428184949|gb|EKX53803.1| hypothetical protein GUITHDRAFT_100772 [Guillardia theta CCMP2712]
Length = 653
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 88/157 (56%), Gaps = 2/157 (1%)
Query: 37 QSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA 96
Q + S +Q ++A+V+ + N LYK+D W+ V++ +P+S+L LL+ A
Sbjct: 426 QKLSSASVKQDKTSKEAVVFASLNNLYKLDERLFATWMQVMRKIPSSMLVLLQGREGMRA 485
Query: 97 N--IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
N + A+ G+ + R++ + KEEH+ R + LDT + H+T++D LW G P
Sbjct: 486 NARLAQYAERFGISKDRMITALPVKKEEHIARHSQFTLFLDTWKVSSHSTAVDSLWAGLP 545
Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDI 191
VV LP E + +RV+A+ L LG E +AR +Y DI
Sbjct: 546 VVVLPHEKMQARVSAALLYVLGMVEFLARDVGDYMDI 582
>gi|254526504|ref|ZP_05138556.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110kDa subunit
[Prochlorococcus marinus str. MIT 9202]
gi|221537928|gb|EEE40381.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase 110kDa subunit
[Prochlorococcus marinus str. MIT 9202]
Length = 701
Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats.
Identities = 60/155 (38%), Positives = 92/155 (59%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R+ + LPED ++Y FN +KI +W+ +LK + S LWLL + N++ A+
Sbjct: 503 RKDFDLPEDGVIYTCFNNGFKITEKEFNLWMRLLKEIKRSHLWLLSTNLLMIKNLKDEAK 562
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
+D +RI F+ E+H+ R L D+ LDT N HTT+ D LW G PV+TL G++
Sbjct: 563 KRNVDPNRISFAKKLPLEQHLARHSLGDIFLDTFNYNAHTTASDALWAGLPVLTLKGKSF 622
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+SRV++S L +LG ELI + +EY + A+ LG +
Sbjct: 623 SSRVSSSLLNSLGLNELICLSEEEYFEKALDLGKN 657
>gi|428180751|gb|EKX49617.1| hypothetical protein GUITHDRAFT_135804 [Guillardia theta CCMP2712]
Length = 782
Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats.
Identities = 55/164 (33%), Positives = 91/164 (55%), Gaps = 18/164 (10%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF-------PAVGEA---------- 96
+V NFN K+D TL++W V+ VPNSILWL+ AVG
Sbjct: 555 VVIANFNNWKKLDMRTLRVWAKVMGEVPNSILWLMSMLPYEGPSGAVGGGQEELDRCQTD 614
Query: 97 NIQATAQALGLDQHRILFSNVAAKEEH-VRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
++++ Q+LG+D R++ + + E H + + Q DV LD+ N H+T+++++W PV
Sbjct: 615 SLKSILQSLGVDASRVVITGLLPSETHLIAKQQATDVFLDSLAYNAHSTAVELIWANVPV 674
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+T P + +RVA+S L G L+AR ++ +D+A+RL +R
Sbjct: 675 LTCPELKMTARVASSLLLAHGMSALVAREARDMEDVAVRLLRER 718
>gi|78779585|ref|YP_397697.1| TPR repeat-containing protein [Prochlorococcus marinus str. MIT
9312]
gi|78713084|gb|ABB50261.1| TPR repeat [Prochlorococcus marinus str. MIT 9312]
Length = 632
Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats.
Identities = 54/141 (38%), Positives = 80/141 (56%), Gaps = 1/141 (0%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LP +IV F+ +K+ + W+ VL N+ LWL + + N+ + Q+ +D
Sbjct: 438 LPPKSIVLAAFHMAFKLSEEVVNSWIKVLNQTENTYLWLKISNKLAKKNLISHFQSKNVD 497
Query: 109 QHRILFS-NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
RILF+ V EH+ R AD+ LDT NGH+T ++ +W+ P +TL GE+ ASRV
Sbjct: 498 LKRILFAEKVDLYTEHISRYSKADLFLDTFNFNGHSTLVECIWSELPFITLAGESFASRV 557
Query: 168 AASQLATLGCPELIARTHKEY 188
AS L +LG ELIA++ EY
Sbjct: 558 GASILHSLGLSELIAKSTDEY 578
>gi|390575516|ref|ZP_10255611.1| TPR repeat-containing protein [Burkholderia terrae BS001]
gi|389932565|gb|EIM94598.1| TPR repeat-containing protein [Burkholderia terrae BS001]
Length = 495
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/157 (38%), Positives = 88/157 (56%), Gaps = 11/157 (7%)
Query: 49 LPED---AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS--ILWLLKFPAVGEAN--IQAT 101
LP D + + N K+ + +W NV+ VP++ ILW+ A G+A + A
Sbjct: 298 LPADEAGTVTFGCLNNPAKLTDRAIALWANVIANVPDARMILWV----AEGDARERVSAK 353
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
+A G+D+ R+ F ++ A+E+++R DV LDT CNGHTTS+D LW G PVVT G
Sbjct: 354 FEANGVDRSRLQFVDMQAREDYLRTYHRIDVALDTFPCNGHTTSLDALWMGVPVVTRVGS 413
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
T SR + LA LG PEL A + + + IA +L D
Sbjct: 414 TPISRGGYALLANLGLPELAAESDEAFVKIATQLALD 450
>gi|196232944|ref|ZP_03131793.1| TPR domain protein [Chthoniobacter flavus Ellin428]
gi|196222922|gb|EDY17443.1| TPR domain protein [Chthoniobacter flavus Ellin428]
Length = 178
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 54/129 (41%), Positives = 73/129 (56%)
Query: 70 LQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQL 129
L++WV VL+ VP S L LL+ + + G+D R+ F V A+ E+++
Sbjct: 2 LKIWVRVLQNVPGSRLLLLRGGGNYAERTRQFFERYGIDAGRVEFLGVRARGEYLKNYHR 61
Query: 130 ADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQ 189
D+ LDT NGHTT++D LW G PVV+L GET SR SQL LG EL+A + Y
Sbjct: 62 VDMVLDTFPYNGHTTNLDALWMGVPVVSLAGETAVSRGGLSQLNNLGLTELVAHSEDAYL 121
Query: 190 DIAIRLGTD 198
+IA RL D
Sbjct: 122 EIATRLAGD 130
>gi|126464807|ref|YP_001041783.1| TPR repeat-containing protein [Rhodobacter sphaeroides ATCC 17029]
gi|126106622|gb|ABN79147.1| Tetratricopeptide TPR_2 repeat protein [Rhodobacter sphaeroides
ATCC 17029]
Length = 590
Score = 99.8 bits (247), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 80/155 (51%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R GLPE A V+ + YKI W ++ AVP S+LWL+ G+A + Q
Sbjct: 395 RPGEGLPEAAFVFACASHFYKITEPLFAAWCRIVAAVPGSVLWLVADTPEGQAALAGRWQ 454
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A GLD HR++F+ + R LAD+ LDT N T + D L G PV+TL G T
Sbjct: 455 AAGLDPHRLIFAPRVDPARYRARLALADLFLDTMPYNAGTIASDALRMGLPVLTLAGRTF 514
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ R+AAS L +G + IA + Y A+ + TD
Sbjct: 515 SGRMAASLLTAVGLEDCIAPDLEAYVARAVAIATD 549
>gi|428186635|gb|EKX55485.1| hypothetical protein GUITHDRAFT_99261 [Guillardia theta CCMP2712]
Length = 656
Score = 99.4 bits (246), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 88/150 (58%), Gaps = 1/150 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI-QAT 101
+R+ GLP D+ + NFN L K++P ++++ +L+ + LWL ++P I Q
Sbjct: 415 ARKSIGLPADSFLLANFNNLEKLEPRLWRLFLLLLRRHRGAHLWLQQYPPAKANRIRQMF 474
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
+ G+++ R++ + ++H+ LAD+ +D+P N H+T+MD LW P+ TLP E
Sbjct: 475 KEEEGMEEARLVTTKFFKSDDHIGFKALADIFVDSPSYNAHSTAMDTLWGALPLATLPEE 534
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDI 191
+A+RV AS L + C IAR+ ++ Q++
Sbjct: 535 KMAARVGASLLLAINCSSTIARSMEDLQEL 564
>gi|425472708|ref|ZP_18851549.1| hypothetical protein MICAK_3940006 [Microcystis aeruginosa PCC
9701]
gi|389881186|emb|CCI38246.1| hypothetical protein MICAK_3940006 [Microcystis aeruginosa PCC
9701]
Length = 268
Score = 99.4 bits (246), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 58/173 (33%), Positives = 88/173 (50%), Gaps = 9/173 (5%)
Query: 34 EVPQSIVIT---------SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSI 84
E+P V T SR + GLP++ VY FN+ K P W+ +L+AVPN++
Sbjct: 49 ELPHVFVTTPLSFSEPPPSRSELGLPDEGFVYACFNRTDKWSPELFACWLEILQAVPNAV 108
Query: 85 LWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTT 144
LWL + ++A A+ G+D R++F + E + + AD+ LDT L NG T
Sbjct: 109 LWLAESSEDISQTLRAKAKTAGVDPERLVFLVQRSPWEFISVCRQADLFLDTFLYNGGAT 168
Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
S+ + G P++T PG+T ASR+ S G + + + YQ AI GT
Sbjct: 169 SVCAIQAGVPLLTCPGDTFASRMGMSICHAAGLESFVCESRESYQVQAIYWGT 221
>gi|332561307|ref|ZP_08415622.1| O-linked acetylglucosamine transferase [Rhodobacter sphaeroides
WS8N]
gi|332274106|gb|EGJ19423.1| O-linked acetylglucosamine transferase [Rhodobacter sphaeroides
WS8N]
Length = 590
Score = 99.4 bits (246), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 80/155 (51%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R GLPE A V+ + YKI W ++ AVP S+LWL+ G+A + Q
Sbjct: 395 RPGEGLPEAAFVFACASHFYKITEPLFAAWCRIVAAVPGSVLWLVADTPEGQAALAGRWQ 454
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A GLD HR++F+ + R LAD+ LDT N T + D L G PV+TL G T
Sbjct: 455 AAGLDPHRLIFAPRVDPARYRARLALADLFLDTMPYNAGTIASDALRMGLPVLTLAGRTF 514
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ R+AAS L +G + IA + Y A+ + TD
Sbjct: 515 SGRMAASLLTAVGLEDCIAPDLEAYVARAVAIATD 549
>gi|294677605|ref|YP_003578220.1| hypothetical protein [Rhodobacter capsulatus SB 1003]
gi|3128311|gb|AAC16163.1| hypothetical protein [Rhodobacter capsulatus SB 1003]
gi|294476425|gb|ADE85813.1| TPR repeat domain protein [Rhodobacter capsulatus SB 1003]
Length = 617
Score = 99.4 bits (246), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 86/164 (52%), Gaps = 1/164 (0%)
Query: 35 VPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
+PQ +T R + GLPEDA+V+ + YK+ + W ++ VP S+LW++
Sbjct: 414 MPQLPAVT-RAEEGLPEDAVVFTCVSHHYKLTEAVWGAWCRIVARVPGSVLWIIDDNPES 472
Query: 95 EANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
A + A A GL R++F+ + R LAD+ LDT N T + D L G P
Sbjct: 473 RAALTARWGAAGLAPERLIFAARTDPARYRARLALADLFLDTTPYNAGTIASDALRMGLP 532
Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
++T G A+R+ AS L +G P+ IA Y+D+A+ +GTD
Sbjct: 533 LITTRGRAFAARMGASLLTAIGLPDCIAEDLAGYEDLAVAIGTD 576
>gi|420241336|ref|ZP_14745477.1| hypothetical protein PMI07_03260 [Rhizobium sp. CF080]
gi|398071470|gb|EJL62724.1| hypothetical protein PMI07_03260 [Rhizobium sp. CF080]
Length = 646
Score = 99.0 bits (245), Expect = 9e-19, Method: Composition-based stats.
Identities = 59/174 (33%), Positives = 94/174 (54%), Gaps = 7/174 (4%)
Query: 30 ATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK 89
A +PQ I R +P+DA ++ F+ +K P+T+ +WV VLK P S L+L+
Sbjct: 412 AVHRPMPQPI---DRSLCDMPQDAFIFSCFHSHWKFTPATIDLWVRVLKETPESYLFLIC 468
Query: 90 FPAVGEAN--IQATAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNGHTTSM 146
G I+A +A G+ + R++F + V ++ R L D+ LDT NGHTT+
Sbjct: 469 RERFGARKNLIKAFVEA-GIPESRLIFGDRVNNYAAYINRIGLTDLGLDTYPYNGHTTTS 527
Query: 147 DVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+ LW G P++T G+ ASRV+ S L LG PE++A ++ A+ +R+
Sbjct: 528 EKLWAGLPMLTHKGKNFASRVSESLLNALGLPEMVAEDQDDFVRRAVHYYNNRE 581
>gi|196232370|ref|ZP_03131223.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
gi|196223442|gb|EDY17959.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
Length = 779
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 75/145 (51%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
I + + N K++ L++W VL AV NS L +L G++ RI
Sbjct: 588 ITFGSLNNFCKVNEPLLKLWARVLTAVNNSRLIILAHEGSHRQRTSEFFTQEGVEPERIE 647
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F ++E++ D+ LDT NGHTTS+D LW G PVV+L GE SR SQL
Sbjct: 648 FVTQRPRKEYLELYHRLDIVLDTFPYNGHTTSLDALWMGVPVVSLCGERPVSRAGLSQLN 707
Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
LG PEL+A T +Y +IA +L D
Sbjct: 708 NLGLPELVAFTEDQYVEIATKLAND 732
>gi|397576620|gb|EJK50333.1| hypothetical protein THAOC_30710 [Thalassiosira oceanica]
Length = 747
Score = 99.0 bits (245), Expect = 1e-18, Method: Composition-based stats.
Identities = 69/213 (32%), Positives = 101/213 (47%), Gaps = 41/213 (19%)
Query: 28 KTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVL---KAVPNS- 83
+TA + V + R +YG+ + A VYC ++ KIDP T + W+ L K NS
Sbjct: 465 RTAVAQNVSDDDRRSIRLKYGISQTAFVYCCHSRPDKIDPVTFRSWIRALIRAKDEGNSE 524
Query: 84 -ILWLLKFPAVGEANIQATAQALGLDQHR---ILFSNVAAKEEHVRRGQLADVCLDTPLC 139
+LWLL+ EAN++ T QH ++F+++A +EEH+RR AD+ LDTP
Sbjct: 525 PVLWLLRSGKEMEANLR-TFLTREFGQHLDSLLVFADIADREEHLRRLACADLFLDTPAY 583
Query: 140 NGHTTSMDVLWTGTPVVTL--------------------------------PGETLASRV 167
HT D L++ P+VTL + LASRV
Sbjct: 584 GAHTVGCDALYSNVPMVTLLRSANDPTKHSFADEDYEPMHSSGSNPERRSITTDKLASRV 643
Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
AS L +G P LIA ++Y+D+ +R D D
Sbjct: 644 GASLLRAVGLPMLIAEDMRQYEDLMVRCAHDID 676
>gi|196232942|ref|ZP_03131791.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
gi|196222920|gb|EDY17441.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
Length = 743
Score = 98.6 bits (244), Expect = 1e-18, Method: Composition-based stats.
Identities = 56/143 (39%), Positives = 74/143 (51%)
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFS 115
+ + N K++ TL +W VL VPN+ L LL G+ RI F+
Sbjct: 553 FGSLNNFCKVNVETLVLWARVLNHVPNARLVLLCEEGSQRQRTWDLFARHGISAERISFA 612
Query: 116 NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATL 175
A+ ++++ D+ LDT NGHTTS+D LW G PVV+L GE SR SQL L
Sbjct: 613 EPRARTDYLKLYYDVDIVLDTFPYNGHTTSLDALWMGVPVVSLVGEVAVSRAGLSQLNNL 672
Query: 176 GCPELIARTHKEYQDIAIRLGTD 198
G PE +A T EY +IA RL D
Sbjct: 673 GLPEWVAHTEDEYVEIATRLAGD 695
>gi|124024786|ref|YP_001013902.1| hypothetical protein NATL1_00731 [Prochlorococcus marinus str.
NATL1A]
gi|123959854|gb|ABM74637.1| Hypothetical protein NATL1_00731 [Prochlorococcus marinus str.
NATL1A]
Length = 816
Score = 98.2 bits (243), Expect = 2e-18, Method: Composition-based stats.
Identities = 49/155 (31%), Positives = 85/155 (54%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R+ + LP+ ++ FN+ YKI +W+N+L + S+LWL K N+ A+
Sbjct: 623 RKDFNLPDQGFIFTCFNENYKITKKEFNIWMNLLIKIEGSVLWLYKSNQCSMNNLYKEAR 682
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
++ R++F+ A +H+ R L D+ LDT NG T+ L G PV+T G++
Sbjct: 683 KRKVNPDRLIFAERLAMNKHLPRHSLGDLALDTFNYNGGATTSCALLAGLPVLTKIGQSF 742
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+RV+AS L+++G ELI + EY++ A+ + +
Sbjct: 743 MARVSASLLSSIGLSELITYSESEYEEKALYIANN 777
>gi|403336494|gb|EJY67446.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
family GT41 [Oxytricha trifallax]
Length = 559
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 90/160 (56%), Gaps = 6/160 (3%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+Q G+P +A V+ FNQ ++ID +L+ W + + VP SIL + G+ I A A
Sbjct: 361 SRRQMGIPRNAFVFGIFNQYFQIDSLSLETWFKIKEQVPGSILLFILVNKEGQQAIVAKA 420
Query: 103 QALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
+ L +D + + F + K + +V R AD+ LDT N + D+LW+G P++TL +
Sbjct: 421 RQLEIDLNEVYFLSQDIKSQVNVNRMFTADLALDTLQINARIVNTDLLWSGVPLLTLKSD 480
Query: 162 TLASRVAASQLATLGCPE-----LIARTHKEYQDIAIRLG 196
+ A+RV AS L+ LG E L+A T ++Y A+ L
Sbjct: 481 SWANRVGASMLSGLGLSEQQYDSLVATTVQQYIQKAVDLA 520
>gi|428184152|gb|EKX53008.1| hypothetical protein GUITHDRAFT_84548 [Guillardia theta CCMP2712]
Length = 346
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/147 (36%), Positives = 81/147 (55%), Gaps = 3/147 (2%)
Query: 52 DAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHR 111
D++V N N YKIDP + + W+++L V S+LW+ + +GE N+ A +G D R
Sbjct: 143 DSLVIGNLNAFYKIDPESFRAWLDILARVNQSLLWIQSWNDLGEQNLVMNADIMGGDPSR 202
Query: 112 ILFSNVAAKEEHVRRGQ---LADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
+ FS + Q +ADV LDT L N TT+ D+ W G P V+LP E L SR+
Sbjct: 203 MCFSKPPPALYNRLWNQHSVIADVHLDTFLYNTVTTATDLFWLGVPSVSLPSERLCSRIG 262
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRL 195
AS G L AR ++Y+ +A+++
Sbjct: 263 ASFAHAAGSECLTARNLEDYKQVAVQI 289
>gi|403362723|gb|EJY81094.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
family GT41 [Oxytricha trifallax]
Length = 598
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 90/160 (56%), Gaps = 6/160 (3%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+Q G+P +A V+ FNQ ++ID +L+ W + + VP SIL + G+ I A A
Sbjct: 359 SRRQMGIPRNAFVFGIFNQYFQIDSLSLETWFKIKEQVPGSILLFILVNKEGQQAIVAKA 418
Query: 103 QALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
+ L +D + + F + K + +V R AD+ LDT N + D+LW+G P++TL +
Sbjct: 419 RQLEIDLNEVYFLSQDIKSQVNVNRMFTADLALDTLQINARIVNTDLLWSGVPLLTLKSD 478
Query: 162 TLASRVAASQLATLGCPE-----LIARTHKEYQDIAIRLG 196
+ A+RV AS L+ LG E L+A T ++Y A+ L
Sbjct: 479 SWANRVGASMLSGLGLSEQQYDSLVATTVQQYIQKAVDLA 518
>gi|125654622|ref|YP_001033816.1| O-linked acetylglucosamine transferase [Rhodobacter sphaeroides
2.4.1]
gi|77386282|gb|ABA81711.1| predicted O-linked acetylglucosamine transferase [Rhodobacter
sphaeroides 2.4.1]
Length = 590
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 80/155 (51%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R GLPE A V+ + YKI W ++ AVP S+LWL+ G+A + Q
Sbjct: 395 RPGEGLPEAAFVFACASHFYKITEPLFAAWCRIVAAVPGSVLWLVADTPEGQAALAGRWQ 454
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A GLD HR++F+ + R LAD+ LDT N T + D L G P++TL G T
Sbjct: 455 AAGLDPHRLIFAPRVDPARYRARLALADLFLDTMPYNAGTIASDALRMGLPLLTLAGRTF 514
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ R+AAS L +G + IA + Y A+ + TD
Sbjct: 515 SGRMAASLLTAVGLEDCIAPDLEAYVARAVAIATD 549
>gi|403342452|gb|EJY70546.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
family GT41 [Oxytricha trifallax]
Length = 598
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 90/160 (56%), Gaps = 6/160 (3%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+Q G+P +A V+ FNQ ++ID +L+ W + + VP SIL + G+ I A A
Sbjct: 359 SRRQMGIPRNAFVFGIFNQYFQIDSLSLETWFKIKEQVPGSILLFVLVNKEGQQAIVAKA 418
Query: 103 QALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
+ L +D + + F + K + +V R AD+ LDT N + D+LW+G P++TL +
Sbjct: 419 KQLEIDLNEVYFLSQDIKSQVNVNRMFTADLALDTLQVNARIVNTDLLWSGVPLLTLKSD 478
Query: 162 TLASRVAASQLATLGCPE-----LIARTHKEYQDIAIRLG 196
+ A+RV AS L+ LG E L+A T ++Y A+ L
Sbjct: 479 SWANRVGASMLSGLGLSEQQYDSLVATTVQQYIQKAVDLA 518
>gi|443475508|ref|ZP_21065455.1| hypothetical protein Pse7429DRAFT_1126 [Pseudanabaena biceps PCC
7429]
gi|443019627|gb|ELS33688.1| hypothetical protein Pse7429DRAFT_1126 [Pseudanabaena biceps PCC
7429]
Length = 736
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/161 (38%), Positives = 90/161 (55%), Gaps = 9/161 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
R Q +PEDA++Y + YK P +++ + +LK VPNS L L+K V +AN
Sbjct: 536 RSQLDIPEDAVIYFTGQRGYKRHPDNIRLQMKILKNVPNSYL-LIK--GVSDANSTEELF 592
Query: 99 QATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
A++ G+D R+ F + E EH +ADV LDT NG TT+++ LW G P+VT
Sbjct: 593 FRLAESEGVDCDRLKFLPLTMSELEHRANLTIADVVLDTYPYNGATTTLETLWMGIPIVT 652
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GE ASR + + + G E IA T +EY + ++LGTD
Sbjct: 653 RVGEQFASRNSYTMMLNAGLVEGIAWTDEEYVEWGVKLGTD 693
>gi|209526542|ref|ZP_03275068.1| TPR repeat-containing protein [Arthrospira maxima CS-328]
gi|209493048|gb|EDZ93377.1| TPR repeat-containing protein [Arthrospira maxima CS-328]
Length = 728
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 64/163 (39%), Positives = 88/163 (53%), Gaps = 9/163 (5%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI--- 98
T RQ Y + D +VY K +P ++ VN+LK VP+SIL A+G+ I
Sbjct: 526 TLRQAYRISRDQVVYLCVAPGRKFNPDLVKAQVNILKQVPDSIL---VHKALGDTQIIRE 582
Query: 99 --QATAQALGLDQHRIL-FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
A +A+G+ QHRI S A +EEH + LADV LD+ NG T S++ LW TP+
Sbjct: 583 TYAAACKAIGVGQHRIKQISRFATEEEHRKIYMLADVLLDSYPYNGGTHSLEALWFNTPL 642
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
VT G SR+ S L +G IA + +EYQ+ IRLG +
Sbjct: 643 VTRRGSQFLSRMGYSFLKGVGVEMGIADSWEEYQNCGIRLGIE 685
>gi|196233636|ref|ZP_03132477.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
gi|196222306|gb|EDY16835.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
Length = 752
Score = 97.8 bits (242), Expect = 2e-18, Method: Composition-based stats.
Identities = 58/157 (36%), Positives = 81/157 (51%), Gaps = 4/157 (2%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
+S +Q G+ + + + N K++ LQ+W VL+ V S L LL
Sbjct: 555 SSAKQNGI----VTFGSLNNFCKVNDPMLQLWARVLRGVKGSRLVLLSHAGTHRQRTLDI 610
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
G+D RI F+ + +++ + DV LD NGHTTS+D LW G PVVTL G+
Sbjct: 611 LTEAGIDPSRIEFATHRPRPKYLELYRHLDVVLDPFPYNGHTTSLDALWMGVPVVTLAGK 670
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
T SR SQL+ LG PEL+A T +Y AI+L D
Sbjct: 671 TAVSRAGLSQLSNLGLPELVAFTGDDYVAKAIQLAHD 707
>gi|376002124|ref|ZP_09779971.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375329510|emb|CCE15724.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 728
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 64/163 (39%), Positives = 88/163 (53%), Gaps = 9/163 (5%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI--- 98
T RQ Y + D +VY K +P ++ VN+LK VP+SIL A+G+ I
Sbjct: 526 TLRQAYRISRDQVVYLCVAPGRKFNPDLVKAQVNILKQVPDSIL---VHKALGDTQIIRE 582
Query: 99 --QATAQALGLDQHRIL-FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
A +A+G+ QHRI S A +EEH + LADV LD+ NG T S++ LW TP+
Sbjct: 583 TYAAACKAIGVGQHRIKQISRFATEEEHRKIYMLADVLLDSYPYNGGTHSLEALWFNTPL 642
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
VT G SR+ S L +G IA + +EYQ+ IRLG +
Sbjct: 643 VTRRGSQFLSRMGYSFLKGVGVEMGIADSWEEYQNWGIRLGIE 685
>gi|158521101|ref|YP_001528971.1| hypothetical protein Dole_1087 [Desulfococcus oleovorans Hxd3]
gi|158509927|gb|ABW66894.1| Tetratricopeptide TPR_2 repeat protein [Desulfococcus oleovorans
Hxd3]
Length = 808
Score = 97.4 bits (241), Expect = 3e-18, Method: Composition-based stats.
Identities = 55/154 (35%), Positives = 80/154 (51%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
+R GLPE V+C FN +YK++P T W+ +L+ N++LWL + A
Sbjct: 597 TRSDAGLPEAGFVFCCFNAVYKMEPETFGAWMEILREADNAVLWLSAASDAIRQRLTDRA 656
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+D R++F+ E++ R QLAD+ LDT L +T++ L G PV+T G T
Sbjct: 657 VAHGVDAGRLVFAEKLPHPEYMARYQLADLFLDTFLYTAGSTAVCALHGGVPVLTRTGPT 716
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
ASR+ AS A G E I + Y A+ L
Sbjct: 717 NASRMGASICAAAGMEETICPDTEAYIQRAVHLA 750
>gi|222150113|ref|YP_002551070.1| glycosyltransferase TPR domain-containing protein [Agrobacterium
vitis S4]
gi|221737095|gb|ACM38058.1| glycosyltransferase TPR domain-containing protein [Agrobacterium
vitis S4]
Length = 594
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 57/157 (36%), Positives = 80/157 (50%), Gaps = 1/157 (0%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SRQQ+ LPED +V+ F+ +I P T +WVN+L+ NS+LW+ N
Sbjct: 385 SRQQFDLPEDGVVFAAFHPTEQISPRTADLWVNILRQTENSLLWI-SCGNYARNNFSTWM 443
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
G+ ++R++F N A E V LAD+ LD+ N +++ L G PV T G
Sbjct: 444 NRQGIAENRLVFRNPARPMEQVGWMTLADIALDSFPYNDGFSTLAALQAGLPVPTFSGGN 503
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
ASRV AS L G +LIAR Y + I L D+
Sbjct: 504 FASRVTASLLQACGLEKLIARDADAYVALCIDLARDK 540
>gi|423066382|ref|ZP_17055172.1| TPR repeat-containing protein [Arthrospira platensis C1]
gi|406712134|gb|EKD07324.1| TPR repeat-containing protein [Arthrospira platensis C1]
Length = 728
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 64/163 (39%), Positives = 88/163 (53%), Gaps = 9/163 (5%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI--- 98
T RQ Y + D +VY K +P ++ VN+LK VP+SIL A+G+ I
Sbjct: 526 TLRQAYRISRDQVVYLCVAPGRKFNPDLVKAQVNILKQVPDSIL---VHKALGDTQIIRE 582
Query: 99 --QATAQALGLDQHRIL-FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
A +A+G+ QHRI S A +EEH + LADV LD+ NG T S++ LW TP+
Sbjct: 583 TYAAACKAIGVGQHRIKQISRFATEEEHRKIYMLADVLLDSYPYNGGTHSLEALWFNTPL 642
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
VT G SR+ S L +G IA + +EYQ+ IRLG +
Sbjct: 643 VTRRGSQFLSRMGYSFLKGVGVEMGIADSWEEYQNWGIRLGIE 685
>gi|429208817|ref|ZP_19200061.1| hypothetical protein D516_2259 [Rhodobacter sp. AKP1]
gi|428188242|gb|EKX56810.1| hypothetical protein D516_2259 [Rhodobacter sp. AKP1]
Length = 590
Score = 97.1 bits (240), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 80/155 (51%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R GLPE A V+ + YKI W ++ AVP S+LWL++ G+A + Q
Sbjct: 395 RPGEGLPEAAFVFACASHFYKITEPLFAAWCRIVAAVPGSVLWLVEDTPEGQAALAGRWQ 454
Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
A LD HR++F+ + R LAD+ LDT N T + D L G PV+TL G T
Sbjct: 455 AARLDPHRLIFAPRVDPARYRARLALADLFLDTMPYNAGTIASDALRMGLPVLTLAGRTF 514
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ R+AAS L +G + IA + Y A+ + TD
Sbjct: 515 SGRMAASLLTAVGLEDCIAPDLEAYVARAVAIATD 549
>gi|428186472|gb|EKX55322.1| hypothetical protein GUITHDRAFT_131550 [Guillardia theta CCMP2712]
Length = 864
Score = 96.7 bits (239), Expect = 4e-18, Method: Composition-based stats.
Identities = 57/145 (39%), Positives = 82/145 (56%), Gaps = 1/145 (0%)
Query: 51 EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT-AQALGLDQ 109
E +V NF Q+YKIDPS +W ++L+ ++ILWLL+FP V I+ L
Sbjct: 638 ERKMVLANFGQMYKIDPSLFLLWCDILREDADTILWLLEFPPVASKRIRRRFLSTCRLPP 697
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R++FS + HV LAD+ LDT L NGH+T +D LW G P+VT+PG+ + R A
Sbjct: 698 SRLVFSAMLPLHLHVPIKSLADLALDTLLFNGHSTGLDTLWAGVPIVTVPGDRMHRRAGA 757
Query: 170 SQLATLGCPELIARTHKEYQDIAIR 194
S G +AR ++Y ++A R
Sbjct: 758 SMAHAAGATVWLARNERDYLELARR 782
>gi|428775921|ref|YP_007167708.1| hypothetical protein PCC7418_1295 [Halothece sp. PCC 7418]
gi|428690200|gb|AFZ43494.1| TPR repeat-containing protein [Halothece sp. PCC 7418]
Length = 736
Score = 96.7 bits (239), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 61/163 (37%), Positives = 97/163 (59%), Gaps = 9/163 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA--- 100
R+Q +P++A+VY + K P T+++ + +L VPNS L+K A EA++QA
Sbjct: 537 REQLNIPQNAVVYFSAQAGKKRHPETVRLQMKILAQVPNSYF-LIKGLA-DEASVQAFFT 594
Query: 101 -TAQALGLDQHRILFSNVAAKEEHVRRGQLA--DVCLDTPLCNGHTTSMDVLWTGTPVVT 157
AQ G+D +R+ F AKEE + R LA DV LDT NG TT+++ LW P+VT
Sbjct: 595 HIAQEEGVDSNRLRFLPYVAKEE-IHRANLAIADVVLDTYPYNGATTTLETLWMAVPLVT 653
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
GE ++R + + + ++G E IA + +EY + ++LGTD++
Sbjct: 654 RVGEQFSARNSYTFMKSVGVSEGIAWSDEEYIEWGVKLGTDQE 696
>gi|409991408|ref|ZP_11274672.1| hypothetical protein APPUASWS_10241 [Arthrospira platensis str.
Paraca]
gi|291571117|dbj|BAI93389.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409937730|gb|EKN79130.1| hypothetical protein APPUASWS_10241 [Arthrospira platensis str.
Paraca]
Length = 730
Score = 96.3 bits (238), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 63/161 (39%), Positives = 88/161 (54%), Gaps = 9/161 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI----- 98
RQ Y + D +VY K +P ++ VN+LK VP+SIL A+G+ I
Sbjct: 528 RQAYRISRDQVVYLCVAPGRKFNPDLVKAQVNILKQVPDSIL---VHKALGDTQIIRETY 584
Query: 99 QATAQALGLDQHRIL-FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
A +A+G+ QHRI S A +EEH + LADV LD+ NG T S++ LW TP+VT
Sbjct: 585 AAACKAIGVGQHRIKQISRFATEEEHRKIYILADVLLDSYPYNGGTHSLEALWFNTPLVT 644
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G SR+ S L +G IA + ++YQ+ AIRLG +
Sbjct: 645 RRGSQFLSRMGYSFLKAVGVEMGIADSWEDYQNWAIRLGRE 685
>gi|428165389|gb|EKX34385.1| hypothetical protein GUITHDRAFT_147284 [Guillardia theta CCMP2712]
Length = 595
Score = 96.3 bits (238), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 52/160 (32%), Positives = 86/160 (53%), Gaps = 18/160 (11%)
Query: 58 NFNQLYKIDPSTLQMWVNVL----------------KAVPNSILWLLKF--PAVGEANIQ 99
N+NQ +KID L+ W+++L K + S+L L+KF E N++
Sbjct: 368 NYNQFFKIDKHILRAWIDILRRDEKVLKLFSSFWPLKGLSQSVLVLVKFLFHKQAERNLR 427
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A +A G++ R++ + +EH+ R QL + LDT L +GHTT++D LW+G PV+ P
Sbjct: 428 AALRANGINATRLILAEKLKNQEHILRSQLMHLHLDTRLQSGHTTTVDALWSGIPVLVWP 487
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
++ SR AA + G +IAR ++Y + A+ L +
Sbjct: 488 ARSMVSRAAAGIVHGAGLSWMIARNGQDYVEAAMVLSRSK 527
>gi|403346239|gb|EJY72509.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
family GT41 [Oxytricha trifallax]
Length = 641
Score = 96.3 bits (238), Expect = 6e-18, Method: Composition-based stats.
Identities = 56/160 (35%), Positives = 88/160 (55%), Gaps = 6/160 (3%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR+Q G+P +A V+ FNQ ++ID +L+ W + + VP SIL + G+ I A A
Sbjct: 402 SRRQMGIPRNAFVFGIFNQYFQIDSLSLETWFKIKEQVPGSILLFILVNKEGQQAIVAKA 461
Query: 103 QALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
+ L +D + + F + K + +V R AD+ LDT N + D+LW+G P++TL +
Sbjct: 462 RQLEIDLNEVYFLSQDIKSQVNVNRMFTADLALDTLQINARIVNTDLLWSGVPLLTLKSD 521
Query: 162 TLASRVAASQLATLGCP-----ELIARTHKEYQDIAIRLG 196
A+RV AS L+ LG L+A T ++Y A+ L
Sbjct: 522 NWANRVGASMLSGLGLSVQQYDSLVATTVQQYIQKAVDLA 561
>gi|428210828|ref|YP_007083972.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Oscillatoria acuminata PCC 6304]
gi|427999209|gb|AFY80052.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Oscillatoria acuminata PCC 6304]
Length = 744
Score = 96.3 bits (238), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 93/160 (58%), Gaps = 6/160 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI----Q 99
R + G+P+DA+VY + YK +P +++ + +++AVPNS L+K + ++ +
Sbjct: 539 RDELGIPDDAVVYFSGQTGYKRNPQNVRLQLQIIQAVPNSYF-LIKSSIADQESVKRFFE 597
Query: 100 ATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A+ G+ + R+ F VA++E H +ADV LDT NG TT+++ LW G P+VT
Sbjct: 598 QMAEEEGVKRDRLRFLPPVASEEIHRANLAIADVVLDTYPYNGATTTLETLWMGIPLVTR 657
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GE A+R + + + +G E IA T ++Y IRLGTD
Sbjct: 658 VGEQFAARNSYTMMMNVGVSEGIAWTDEDYVAWGIRLGTD 697
>gi|425445186|ref|ZP_18825222.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9443]
gi|389734920|emb|CCI01527.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9443]
Length = 717
Score = 96.3 bits (238), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 62/163 (38%), Positives = 95/163 (58%), Gaps = 7/163 (4%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ--- 99
+RQ +P DAI Y K +P+T+++ ++++KAVPNS +LL A E I+
Sbjct: 515 TRQNLNIPADAITYLTLQVGLKRNPATIRLQMHIIKAVPNS--YLLIKGAGSEKLIKDLF 572
Query: 100 -ATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ A+ G+D++R+ F S VA +E H Q+ADV LDT +G TT+++ LW G PVVT
Sbjct: 573 ISIAKEEGIDENRLRFLSGVATEEIHRANLQIADVVLDTYPYSGATTTLEALWMGIPVVT 632
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
G+ ASR + + + G E IA + +EY I+LG D +
Sbjct: 633 KVGQQWASRNSYTFITYAGISEGIAWSDEEYIQWGIKLGMDEE 675
>gi|119487761|ref|ZP_01621270.1| hypothetical protein L8106_29805 [Lyngbya sp. PCC 8106]
gi|119455594|gb|EAW36731.1| hypothetical protein L8106_29805 [Lyngbya sp. PCC 8106]
Length = 744
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 61/160 (38%), Positives = 87/160 (54%), Gaps = 5/160 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA---VGEANIQA 100
R +P DA++Y + YK P T +M +++LKAVPNS + F V E IQ
Sbjct: 541 RDDLNIPGDAVIYLSSQTGYKYHPQTAKMQIHILKAVPNSYFLIKGFKKQDIVKEFFIQ- 599
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A+ G++ R++F S V + H +ADV LDT NG TT+++ LW P+VT
Sbjct: 600 LAKDEGVNPDRLIFLSGVPSSIIHRANLAIADVLLDTFPYNGATTTLETLWMEVPMVTRV 659
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G+ ASR + + + G E IA T +EY + IRLGTD
Sbjct: 660 GQQFASRNSYTMMVNAGINEGIAWTDEEYVEWGIRLGTDE 699
>gi|196232459|ref|ZP_03131312.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
gi|196223531|gb|EDY18048.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
Length = 672
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 73/145 (50%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
+ + N K++ T + W +L+ V S L LL P + + G+D RI
Sbjct: 480 VTFGCLNNFCKLNEQTWKSWARILRQVSGSRLILLSAPGSHRQRVVDGLEREGIDPRRIE 539
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F A + ++ Q D+ LD+ NGHTTS+D LW G PVV+ G SR SQL+
Sbjct: 540 FVERAPRRGYLEYYQRLDIALDSFPYNGHTTSLDALWMGVPVVSRAGRGPLSRAGWSQLS 599
Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
LG PEL+A + +Y IA+ L D
Sbjct: 600 NLGLPELVAFSEDDYIRIAVELAGD 624
>gi|409991248|ref|ZP_11274526.1| hypothetical protein APPUASWS_09479 [Arthrospira platensis str.
Paraca]
gi|409937884|gb|EKN79270.1| hypothetical protein APPUASWS_09479 [Arthrospira platensis str.
Paraca]
Length = 744
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 86/164 (52%), Gaps = 3/164 (1%)
Query: 38 SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
S+ +R +P+DAIVY + K +P ++ + +L+ VPNS + F N
Sbjct: 534 SVPTITRADLAIPDDAIVYLGLQRGPKYNPHIAKLQLEILREVPNSYFLVKGFGQQDSLN 593
Query: 98 --IQATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
A G+ + RI F + V +EEH +ADV LDT NG TT+M+ LW G P
Sbjct: 594 QFFFDIANQQGITRERIKFIAPVKWEEEHRANLLIADVVLDTYPYNGATTTMETLWMGIP 653
Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+VT GE A+R + + + G E IA T +EY + +RLGTD
Sbjct: 654 LVTRVGEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTD 697
>gi|291566901|dbj|BAI89173.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 744
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 86/164 (52%), Gaps = 3/164 (1%)
Query: 38 SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
S+ +R +P+DAIVY + K +P ++ + +L+ VPNS + F N
Sbjct: 534 SVPTITRADLAIPDDAIVYLGLQRGPKYNPHIAKLQLEILREVPNSYFLVKGFGQQDSLN 593
Query: 98 --IQATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
A G+ + RI F + V +EEH +ADV LDT NG TT+M+ LW G P
Sbjct: 594 QFFFDIANQQGITRERIKFIAPVKWEEEHRANLLIADVVLDTYPYNGATTTMETLWMGIP 653
Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+VT GE A+R + + + G E IA T +EY + +RLGTD
Sbjct: 654 LVTRVGEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTD 697
>gi|428217092|ref|YP_007101557.1| hypothetical protein Pse7367_0829 [Pseudanabaena sp. PCC 7367]
gi|427988874|gb|AFY69129.1| hypothetical protein Pse7367_0829 [Pseudanabaena sp. PCC 7367]
Length = 774
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/158 (37%), Positives = 87/158 (55%), Gaps = 3/158 (1%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
RQ +G+P +A+VY K P TLQ + ++KAVPNS L + L A Q
Sbjct: 569 RQDFGIPAEAVVYLTAQVGLKRHPDTLQRQLEIIKAVPNSYLAIKGLADNASMTKLCQEM 628
Query: 102 AQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A A+ + R++F ++A E EH +ADV LDT NG TT+++ LW G P+VT G
Sbjct: 629 AAAMEISYDRLVFLPLSATEAEHRANLTIADVVLDTYPYNGATTTLETLWMGIPIVTRVG 688
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ ++R A S + G IA + +EY + IRLG+D
Sbjct: 689 KQFSARNAYSLMVNAGISAGIAWSDREYVEWGIRLGSD 726
>gi|376003416|ref|ZP_09781227.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira sp. PCC 8005]
gi|375328219|emb|CCE16980.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira sp. PCC 8005]
Length = 744
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 86/164 (52%), Gaps = 3/164 (1%)
Query: 38 SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
S+ +R +P+DAIVY + K +P ++ + +L+ VPNS + F N
Sbjct: 534 SVPTMTRADLEIPDDAIVYLGLQRGPKYNPHIAKLQLEILREVPNSYFLVKGFGQQDSLN 593
Query: 98 --IQATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
A G+ + RI F + V +EEH +ADV LDT NG TT+M+ LW G P
Sbjct: 594 QFFFDIANQQGITRERIKFITPVKLEEEHRANLLIADVVLDTYPYNGATTTMETLWMGIP 653
Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+VT GE A+R + + + G E IA T +EY + +RLGTD
Sbjct: 654 LVTRVGEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTD 697
>gi|361068591|gb|AEW08607.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165155|gb|AFG65434.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165157|gb|AFG65435.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165159|gb|AFG65436.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165161|gb|AFG65437.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165163|gb|AFG65438.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165165|gb|AFG65439.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165167|gb|AFG65440.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165169|gb|AFG65441.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165171|gb|AFG65442.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165173|gb|AFG65443.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165175|gb|AFG65444.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165177|gb|AFG65445.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165179|gb|AFG65446.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165181|gb|AFG65447.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165183|gb|AFG65448.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
gi|383165185|gb|AFG65449.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
Length = 74
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 40/60 (66%), Positives = 49/60 (81%)
Query: 111 RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS 170
+I+F++VAAK EH+RR LADVCLDTPLCN HTT DVLW G P++TLP E +A+RVA S
Sbjct: 4 QIIFTDVAAKSEHIRRSSLADVCLDTPLCNAHTTGTDVLWAGVPIITLPLEKMATRVAGS 63
>gi|428175987|gb|EKX44874.1| hypothetical protein GUITHDRAFT_109295 [Guillardia theta CCMP2712]
Length = 703
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 58/185 (31%), Positives = 93/185 (50%), Gaps = 22/185 (11%)
Query: 34 EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV 93
++P V R ++GLPE A + C+F + KI + WV +LK V +S+LW+ ++ +
Sbjct: 458 DLPSQHVPQMRSRFGLPEVAPLVCSFCEANKISQELVDSWVRILKRVEDSVLWIPRYNPL 517
Query: 94 GEANIQATAQALGLDQHRILFSN-------------------VAAKEEHVRRGQLADVCL 134
E N++ + GL+ R+LFS V + E++ ADV L
Sbjct: 518 AEENLRVEFKRRGLNVSRVLFSPPEVRCLGGKCTRHAGNGSLVLSDMEYLAAALCADVVL 577
Query: 135 DT-PLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLG--CPELIARTHKEYQDI 191
DT P + T +LW P +T+PG ++SR ASQ+ +LG L+ART E +D+
Sbjct: 578 DTGPDISSSATFGSMLWAAIPSLTMPGFNMSSRSGASQVLSLGPAAASLVARTTSELEDM 637
Query: 192 AIRLG 196
+ L
Sbjct: 638 TVLLA 642
>gi|428174094|gb|EKX42992.1| hypothetical protein GUITHDRAFT_163991 [Guillardia theta CCMP2712]
Length = 1166
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 81/151 (53%), Gaps = 1/151 (0%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
LPED IV+C + ++K+D +W+ +L VP S L L + + ++ A G+D
Sbjct: 620 LPEDEIVFCALHHVWKMDEDIFSVWMELLLHVPGSKLRLQECAEEAKVTMRQFASRKGVD 679
Query: 109 QHRILFS-NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
R++F+ + EHV + + LD P N HT++ D LW PV+T+P + + R
Sbjct: 680 PSRLVFNPKIGDYYEHVDKLSKCRLFLDAPKYNAHTSAGDALWANVPVITVPHQMMVHRG 739
Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTD 198
AA + + I R ++Y+ +A+RLG+D
Sbjct: 740 AACLVYSTNVTSTIVRNLEDYKVMALRLGSD 770
>gi|428313126|ref|YP_007124103.1| O-linked N-acetylglucosamine transferase, SPINDLY family
[Microcoleus sp. PCC 7113]
gi|428254738|gb|AFZ20697.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Microcoleus sp. PCC 7113]
Length = 742
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 59/159 (37%), Positives = 88/159 (55%), Gaps = 5/159 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA---VGEANIQA 100
R Q +P DA++Y + + +K P T ++ + +LK VPNS L+K A EA +
Sbjct: 535 RDQLDIPSDAVIYFSSQRGFKRHPDTARLQMQILKEVPNSYF-LIKGGADEEAIEAFFKE 593
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A+A G++ R+ F V ++ H +ADV LDT NG TT+++ LW G P+VT
Sbjct: 594 LAEAEGVESDRLRFLPGVFSEATHRANLGIADVVLDTYPYNGATTTLETLWMGVPLVTRV 653
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GE A+R + + + G E IA T +EY + IRLG D
Sbjct: 654 GEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGIRLGKD 692
>gi|119510218|ref|ZP_01629356.1| hypothetical protein N9414_10548 [Nodularia spumigena CCY9414]
gi|119465168|gb|EAW46067.1| hypothetical protein N9414_10548 [Nodularia spumigena CCY9414]
Length = 729
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 89/163 (54%), Gaps = 8/163 (4%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ--- 99
SR+ +PEDAIVY + K +P +++ + ++K VPNS LLK +++
Sbjct: 527 SRESLDIPEDAIVYFSSQTGLKRNPQNIRLQMQIIKQVPNSYF-LLKSHRTNHQDLKDFI 585
Query: 100 ---ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVV 156
A A+ + L+ R L S V EH +ADV LDT NG TT+++ LW G P+V
Sbjct: 586 TPIAEAEGVDLECFRFLPS-VPTDTEHRANLAIADVVLDTYPYNGATTTLETLWMGIPLV 644
Query: 157 TLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
T GE A+R + + + +G E +AR+ +EY + +RLG D
Sbjct: 645 TRVGEQFAARNSYTMMMNVGVTEGLARSDEEYVEWGVRLGKDE 687
>gi|209527310|ref|ZP_03275819.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
gi|209492236|gb|EDZ92582.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
Length = 742
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 84/160 (52%), Gaps = 5/160 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---- 99
RQ +PE+AIVY + K +P Q+ + +LK VPNS L + F N
Sbjct: 539 RQDIHIPENAIVYFTAQRGPKYNPELAQLQMQILKQVPNSYLVIKGFGEQESINRLFLDL 598
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A Q + DQ R + VA +E H +ADV LD+ NG TT+++ LW G P+VT
Sbjct: 599 AEQQEVSRDQFRFI-GAVALEETHRANLAIADVVLDSYPYNGATTTLETLWMGIPLVTRV 657
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
GE A+R + + + G E IA T +EY + +RLGTD
Sbjct: 658 GEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTDE 697
>gi|33865520|ref|NP_897079.1| TPR domain-containing protein [Synechococcus sp. WH 8102]
gi|33632689|emb|CAE07501.1| TPR domain protein [Synechococcus sp. WH 8102]
Length = 928
Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 58/165 (35%), Positives = 88/165 (53%), Gaps = 8/165 (4%)
Query: 37 QSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA 96
Q + SR +GLP++A+V+C FN+ KI W+ +L+AVP S+LWL P ++
Sbjct: 731 QPMAGISRSSFGLPDEAVVFCCFNRGDKITAGIFSSWMTILEAVPGSLLWLAVKPKALQS 790
Query: 97 NIQATAQALGLDQHRILFSNVAAKEEHVRRG----QLADVCLDTPLCNGHTTSMDVLWTG 152
+QA AQ G+D HR++ A ++ V+R AD+ LDT N + L G
Sbjct: 791 -LQAQAQKRGIDPHRLV---AAPYQKPVQRFIAAMACADLFLDTAGFNAGAIGVLALNAG 846
Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
P++T+ G +R+ AS + P+LI T + YQ AI L T
Sbjct: 847 LPLITIAGNRFCARMGASLCNAVHQPQLITTTPEAYQQKAIELAT 891
>gi|358635829|dbj|BAL23126.1| hypothetical protein AZKH_0787 [Azoarcus sp. KH32C]
Length = 586
Score = 92.8 bits (229), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 85/150 (56%), Gaps = 5/150 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALGLDQ 109
+++ +FN L K+ L +W VL+A+P++ L++ + A+G+ +Q + G+
Sbjct: 391 LIFGSFNNLSKVSGPVLDLWAEVLRAIPDAGLFM-QARALGDTGTRERVQGQLEDRGVAP 449
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R++ + ++ H+ DVCLDT NGH T++D LW G PV+TL G+ A R+ A
Sbjct: 450 ERVVLAGFSSLAAHLNLFNHIDVCLDTFPWNGHMTTLDSLWMGVPVLTLEGDRRAGRMGA 509
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+ +G + +ART +++ + AI L DR
Sbjct: 510 TIQRAIGLDDFVARTSQDFVERAIALDKDR 539
>gi|196234038|ref|ZP_03132873.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
gi|196221887|gb|EDY16422.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
Length = 733
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 73/145 (50%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
I + N K++ L++W LKAV NS L LL + + G+D I
Sbjct: 542 ITFGCLNNFCKVNDPLLKLWARGLKAVKNSRLILLSHEGSQTQRTRQFLEGEGVDPAHIE 601
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F +++ ++ D+ LD NGH+TS+D LW G PVV+L G+ SR SQL+
Sbjct: 602 FVAPRSRKPYLELYHRLDIALDPFPYNGHSTSLDALWMGVPVVSLCGDRPVSRAGLSQLS 661
Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
LG PEL+A T Y DIA L D
Sbjct: 662 NLGLPELVASTKDRYVDIATGLAND 686
>gi|196232458|ref|ZP_03131311.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
gi|196223530|gb|EDY18047.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
Length = 747
Score = 91.7 bits (226), Expect = 1e-16, Method: Composition-based stats.
Identities = 51/145 (35%), Positives = 73/145 (50%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
+ + N K++ TL+ W VL+ V +S L LL ++ G++ R+
Sbjct: 559 VTFGCLNHFCKVNEPTLRRWAKVLRTVRDSKLMLLSPEGSHRRKTWRILESEGVEARRVE 618
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F + ++ Q DV LDT NGHTTS+D LW G PVV+L G + SR SQL+
Sbjct: 619 FVPRCGRRAYLELYQQMDVALDTFPYNGHTTSLDALWMGVPVVSLAGRSAVSRAGLSQLS 678
Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
LG EL+A E+ +IA L D
Sbjct: 679 NLGLAELVASAEDEFVEIAAALAQD 703
>gi|428175656|gb|EKX44545.1| hypothetical protein GUITHDRAFT_109660 [Guillardia theta CCMP2712]
Length = 603
Score = 91.7 bits (226), Expect = 1e-16, Method: Composition-based stats.
Identities = 63/214 (29%), Positives = 96/214 (44%), Gaps = 36/214 (16%)
Query: 16 LQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVN 75
LQ G+ + + E + S++ SR++ GLP+ ++ FN LYKI P+ W+
Sbjct: 361 LQLGMLFSSSTHAINHPEMLSSSLI--SRKEAGLPDKMFIFACFNTLYKITPTLFLAWLK 418
Query: 76 VLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL---------------------- 113
+LK VP F AV + A+ G + ++
Sbjct: 419 ILKNVPKRFQIPSPF-AVDSSQCPLVARRTGRREGTLMQLCTRKRRLKEEALVVMVVVVV 477
Query: 114 -----------FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
S+ E+H++ AD+ LDTP+ N H+T+ D LW G PV+TL E
Sbjct: 478 VEEDEEEEEDEDSSKVPLEKHLQVKANADLFLDTPMYNAHSTAADALWAGVPVLTLAREK 537
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
+A+RV AS +L PELI R +EY +A L
Sbjct: 538 MAARVGASLALSLHMPELIVRNMEEYVSVATLLA 571
>gi|440681073|ref|YP_007155868.1| TPR repeat-containing protein [Anabaena cylindrica PCC 7122]
gi|428678192|gb|AFZ56958.1| TPR repeat-containing protein [Anabaena cylindrica PCC 7122]
Length = 737
Score = 91.7 bits (226), Expect = 2e-16, Method: Composition-based stats.
Identities = 54/162 (33%), Positives = 88/162 (54%), Gaps = 4/162 (2%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLL---KFPAVGEANIQ 99
+R+ +P+DA++Y + YK +P +++ + +LK VPNS + K V +
Sbjct: 534 TRKNLEIPDDAVIYFSAQAGYKRNPDNVRLQMKILKEVPNSYFLIKDIHKEVGVVRQFFE 593
Query: 100 ATAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A+ G+ + R+ F N V + H Q+ADV LDT NG TT+M+ LW G P+VT
Sbjct: 594 KIAEEEGVSKERLRFLNKVPSSAIHRANLQIADVVLDTYPYNGATTTMETLWVGIPLVTK 653
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
GE ++R + S + G E IA + +EY + ++LG + D
Sbjct: 654 VGEQFSARNSYSMMVNAGITEGIAWSDEEYIEWGVKLGKETD 695
>gi|434404762|ref|YP_007147647.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Cylindrospermum stagnale PCC 7417]
gi|428259017|gb|AFZ24967.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Cylindrospermum stagnale PCC 7417]
Length = 742
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/160 (38%), Positives = 85/160 (53%), Gaps = 7/160 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---A 100
RQ +P DA+VY + + YK P T + + ++K VPNS L+K A EA Q
Sbjct: 541 RQDLDIPSDAVVYLSSQRSYKRHPETTKWQMRIIKEVPNSYF-LIKGDADEEAIKQFFYK 599
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLA--DVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A+ G+D R+ F E V R LA DV LDT NG TT+++ LW G P+VT
Sbjct: 600 IAEEEGVDSSRLRFLP-QDPSEAVHRANLAIADVVLDTFPYNGATTTLETLWMGIPLVTR 658
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GE SR + + + +G E +A T +EY + +RLG D
Sbjct: 659 VGEQFVSRNSYTMMMNVGVTEGLAWTDEEYVEWGVRLGKD 698
>gi|428315860|ref|YP_007113742.1| TPR repeat-containing protein [Oscillatoria nigro-viridis PCC 7112]
gi|428239540|gb|AFZ05326.1| TPR repeat-containing protein [Oscillatoria nigro-viridis PCC 7112]
Length = 744
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/159 (37%), Positives = 87/159 (54%), Gaps = 5/159 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE---ANIQA 100
R + +P DA+VY + + +K +P+T ++ + +LK VPNS LLK A E +
Sbjct: 541 RDELNIPSDAVVYLSAQRGFKYNPNTARLQMKILKEVPNSYF-LLKGMANQELVKSFFAQ 599
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A+ G+D R+ F VA H +ADV LDT NG TT+++ LW G P+VT
Sbjct: 600 LAEEEGVDCDRLRFLPGVALSATHRANLGIADVVLDTYPYNGATTTLETLWMGIPIVTRV 659
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G+ +SR + + + G E IA T +EY + IRLG D
Sbjct: 660 GQQFSSRNSYTMMINAGITEGIAWTDEEYLEWGIRLGKD 698
>gi|398804002|ref|ZP_10563006.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Polaromonas sp. CF318]
gi|398095201|gb|EJL85547.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Polaromonas sp. CF318]
Length = 710
Score = 91.3 bits (225), Expect = 2e-16, Method: Composition-based stats.
Identities = 47/156 (30%), Positives = 86/156 (55%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
SR++ GLP DA+V C +D ++ W+ +L+++P+++LWL + AN+ A
Sbjct: 500 SREKAGLPADALVLCATVPSVMLDAASFAAWMKILRSLPDAVLWLPAYSLAIAANLVREA 559
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
A G+ + R++F++ + + + + AD+ LDT N + D L G P ++ GE+
Sbjct: 560 NAAGVHESRLIFTHRLTRAQSLACLKHADLFLDTLRFNANLGLEDALRMGVPALSCAGES 619
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ASR+ S + G P+ + ++ Y A+RLG D
Sbjct: 620 MASRLGGSIIRAAGLPQCVMQSRPAYIAEALRLGRD 655
>gi|298492469|ref|YP_003722646.1| hypothetical protein Aazo_4067 ['Nostoc azollae' 0708]
gi|298234387|gb|ADI65523.1| conserved hypothetical protein ['Nostoc azollae' 0708]
Length = 747
Score = 91.3 bits (225), Expect = 2e-16, Method: Composition-based stats.
Identities = 61/160 (38%), Positives = 87/160 (54%), Gaps = 7/160 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN---IQA 100
R+ +P DAI+Y + YK L++ + ++K VPNS L L+K A EA+ +
Sbjct: 546 REDLNIPRDAIIYFMTQKGYKRHRPHLRLQMKIIKEVPNSYL-LIKGDADPEASKVFFEE 604
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRG--QLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
AQ G+D +RI F A + E V R Q+ADV LDT NG TT+++ LW G P+VT
Sbjct: 605 IAQEEGVDFNRIKFLPYA-RSEAVHRANLQIADVVLDTYPYNGATTTLETLWMGVPLVTR 663
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GE ++R + + E IA T EY + +RLG D
Sbjct: 664 VGEQFSARNSYGMMINAAITEGIAWTEDEYVEWGVRLGKD 703
>gi|428186203|gb|EKX55054.1| hypothetical protein GUITHDRAFT_99692 [Guillardia theta CCMP2712]
Length = 741
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 84/156 (53%), Gaps = 4/156 (2%)
Query: 45 QQYG--LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
Q+G LP ++ F+ L K+ Q W +L+AV S LW++ P G N+ A
Sbjct: 503 HQHGVVLPSSGFIFGAFSNLGKVHVRLWQSWCRILQAVNQSHLWIIAEPPFGVDNLIEAA 562
Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
+ G+++ RI+ ++ +H A V LD+ NGH+T ++ LW G PV+TLP E
Sbjct: 563 RGFGVEE-RIVATSRIRVVDHASVKHAAGVYLDSRPFNGHSTVVEALWGGIPVLTLPVER 621
Query: 163 LASRVAASQLATLGC-PELIARTHKEYQDIAIRLGT 197
+A RVAAS L +G L R +Y+ +AI+L +
Sbjct: 622 MAGRVAASVLHAVGLGRHLTVRDGGDYEALAIKLAS 657
>gi|345871985|ref|ZP_08823926.1| Tetratricopeptide TPR_2 repeat-containing protein [Thiorhodococcus
drewsii AZ1]
gi|343919796|gb|EGV30539.1| Tetratricopeptide TPR_2 repeat-containing protein [Thiorhodococcus
drewsii AZ1]
Length = 2237
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/152 (36%), Positives = 85/152 (55%), Gaps = 6/152 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN----IQATAQALGLDQ 109
I + +FN + K+ P TL +W NVL+A+P+S L +LK +V + + G+D
Sbjct: 598 ITFGSFNNIAKLTPDTLDLWGNVLRALPDSRL-VLKSHSVRDTRFWDYMLGRLTERGVDS 656
Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
HR+ + + EH+ + + D+ LDT G TT+ + L+ G PVVTL G+ A RV
Sbjct: 657 HRVKILPRAPSYLEHLSQYREIDIALDTYPYQGTTTTCEALFMGVPVVTLAGDRHAGRVG 716
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S + +G P LIA + +Y IA+ L TDR+
Sbjct: 717 VSLMNQVGLPSLIANSADDYVRIAVSLATDRE 748
Score = 86.3 bits (212), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 53/152 (34%), Positives = 86/152 (56%), Gaps = 6/152 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLDQ 109
I + +FN L K+ +TL++W ++L AVPN+ L +LK P E ++ A G+D+
Sbjct: 2032 ITFGSFNNLAKVSSTTLRLWCDLLLAVPNARL-VLKTPTTTEGHVWGRVVAYFIEHGIDR 2090
Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ + +H+ ++ D+ LDT NG TT+ + L+ G PV+TL G+ +RV
Sbjct: 2091 ERLETLPRASGYLKHLASYRVIDIALDTYPYNGTTTTCEALFMGVPVITLLGDRHVARVG 2150
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
AS L +G +LIA + +EY I+ L D+D
Sbjct: 2151 ASLLTRVGLTDLIAGSEEEYIRISKDLSGDKD 2182
Score = 79.7 bits (195), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 90/177 (50%), Gaps = 20/177 (11%)
Query: 31 TGEEVPQSIVITSRQQYGLP----EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILW 86
+GE++P GLP + + + +FN L KI +TL +W V+KAVP++ L
Sbjct: 1271 SGEDLP----------VGLPPCLERNRVTFGSFNNLAKITNTTLDLWGGVMKAVPDARL- 1319
Query: 87 LLKFPAVGEANIQATAQALGLDQ----HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNG 141
LLK + ++ A DQ R+ + + EH+ + Q D+ LD +G
Sbjct: 1320 LLKTHTASDIDVWNRLVAYLADQGIATDRVEMLPRAPSYMEHLVQYQRLDIALDAFPYHG 1379
Query: 142 HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
TT+ + L+ G PVV+L G+ ASRV S L G PELIA + ++Y IA L D
Sbjct: 1380 TTTTCEALFMGVPVVSLMGDRHASRVGGSLLTHAGLPELIAESTEDYIRIARDLARD 1436
>gi|443321167|ref|ZP_21050230.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Gloeocapsa sp. PCC 73106]
gi|442789133|gb|ELR98803.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Gloeocapsa sp. PCC 73106]
Length = 726
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 83/163 (50%), Gaps = 13/163 (7%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R G+P DAI+Y N K P TL + + ++ AVPNS F G +++ +
Sbjct: 522 RDTLGIPSDAIIYLNIQNALKRHPHTLDLQMQIIGAVPNSY-----FLTKGSGDLEIIKR 576
Query: 104 AL-------GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
G+ R+ F EE H +ADV LDT NG TT+++VLW G P+
Sbjct: 577 VFTEAALRHGVSPDRLRFLPFTPDEETHRANLAIADVILDTYPYNGATTTLEVLWMGIPL 636
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
VT GE A+R + + + +G E IA T +EY + I+LGTD
Sbjct: 637 VTRVGEQFAARNSYAFMQQVGLNEGIAWTDEEYVEWGIKLGTD 679
>gi|443325693|ref|ZP_21054376.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Xenococcus sp. PCC 7305]
gi|442794696|gb|ELS04100.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Xenococcus sp. PCC 7305]
Length = 1020
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/169 (34%), Positives = 84/169 (49%), Gaps = 6/169 (3%)
Query: 35 VPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
+PQ + +Q L + + +FN KI P+T+++W VLKAVPNS L LLK A+
Sbjct: 805 LPQDFLPEPQQLPALKNGYLTFGSFNNSSKISPTTIRLWSQVLKAVPNSRL-LLKALALE 863
Query: 95 EAN----IQATAQALGLDQHRILFSNVAAK-EEHVRRGQLADVCLDTPLCNGHTTSMDVL 149
+ + A G+D R++ K E D+ LD NG TT+ L
Sbjct: 864 DKGTCEYFRGQFVAQGIDGTRVILEKPTLKIENFFESYHKIDIALDPFPYNGGTTTCQAL 923
Query: 150 WTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
W G PV++L G+ SR++ S L LG PEL T +Y IA+ L D
Sbjct: 924 WMGVPVISLRGQQFCSRMSHSFLTNLGLPELSVATEADYIAIAVELAQD 972
>gi|307152772|ref|YP_003888156.1| O-linked N-acetylglucosamine transferase [Cyanothece sp. PCC 7822]
gi|306983000|gb|ADN14881.1| O-linked N-acetylglucosamine transferase SPINDLY family-like
protein [Cyanothece sp. PCC 7822]
Length = 718
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/159 (38%), Positives = 89/159 (55%), Gaps = 5/159 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---A 100
R+ +P +AI+Y + K P T+++ + VLK VPNS L L+K E Q +
Sbjct: 518 REHLNIPSEAIIYLSVQTGLKRHPDTIRLQMKVLKEVPNSYL-LVKGAGQTEKIEQLFLS 576
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A G+ +RI F S VA++E H +ADV LDT NG TT+++VLW G P+VT
Sbjct: 577 IASEEGVSPNRIRFLSTVASEEIHRANLTIADVVLDTFPYNGATTTLEVLWMGIPLVTRV 636
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G+ A+R + + + G E IA T +EY I+LGTD
Sbjct: 637 GQQFAARNSYTFMIHAGLTEGIAWTDEEYIQWGIKLGTD 675
>gi|218248286|ref|YP_002373657.1| hypothetical protein PCC8801_3536 [Cyanothece sp. PCC 8801]
gi|218168764|gb|ACK67501.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
Length = 731
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/163 (36%), Positives = 89/163 (54%), Gaps = 9/163 (5%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG-EANIQAT 101
+R+Q +PEDAI+Y +K P + + +L+ VPNS L L+K +G + IQ
Sbjct: 526 TREQLNIPEDAIIYWTGQNGFKRHPDMINCQMQILQEVPNSYL-LIK--GIGNDEKIQQL 582
Query: 102 AQALGLDQH----RILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVV 156
+ + L R F + A E H Q+AD+ LDT NG TT+++VLW G P+V
Sbjct: 583 FKTIALKYQVSPDRFRFLPIVADELTHRANLQIADIVLDTYPYNGATTTLEVLWMGIPLV 642
Query: 157 TLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
T GE A+R + + + +G E IA T++EY D I+LG +
Sbjct: 643 TRVGEQFAARNSYTFMTNVGLTEGIAWTNEEYIDWGIQLGKNE 685
>gi|428173813|gb|EKX42713.1| hypothetical protein GUITHDRAFT_141110 [Guillardia theta CCMP2712]
Length = 870
Score = 90.5 bits (223), Expect = 3e-16, Method: Composition-based stats.
Identities = 51/141 (36%), Positives = 76/141 (53%), Gaps = 9/141 (6%)
Query: 57 CNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA--QALGLDQHRILF 114
CN+NQ +K+DP +WV +L VPNS L +LK+P E +++ A Q LGL R++F
Sbjct: 547 CNWNQHFKLDPEFFAVWVRILMQVPNSTLVMLKYPEESEPHLRYAARKQQLGLTGERLIF 606
Query: 115 SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLAT 174
A+ EH+ R D+ LD N TT D LW G P +++P A+ +QL
Sbjct: 607 LPKASLTEHLMRTSFCDLFLDNREYNSGTTCTDSLWAGVPTISMPRMKHAAVGGNAQL-- 664
Query: 175 LGCPELIARTHKEYQDIAIRL 195
L AR +EY+ +++ L
Sbjct: 665 -----LQARNLEEYESMSVLL 680
>gi|257060387|ref|YP_003138275.1| O-linked N-acetylglucosamine transferase [Cyanothece sp. PCC 8802]
gi|256590553|gb|ACV01440.1| O-linked N-acetylglucosamine transferase SPINDLY family-like
protein [Cyanothece sp. PCC 8802]
Length = 731
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/163 (36%), Positives = 90/163 (55%), Gaps = 9/163 (5%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG-EANIQAT 101
+R+Q +PEDAI+Y +K P + + +L+ VPNS L L+K +G + IQ
Sbjct: 526 TREQLNIPEDAIIYWTGQNGFKRHPDMINCQMQILQDVPNSYL-LIK--GIGNDEKIQQL 582
Query: 102 AQALGLDQH----RILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVV 156
+ + L R+ F + A E H Q+AD+ LDT NG TT+++VLW G P+V
Sbjct: 583 FKTIALKYQVSPDRLRFLPIVADELTHRANLQIADIVLDTYPYNGATTTLEVLWMGIPLV 642
Query: 157 TLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
T GE A+R + + + +G E IA T++EY D I+LG +
Sbjct: 643 TRVGEQFAARNSYTFMTNVGLTEGIAWTNEEYIDWGIQLGKNE 685
>gi|94272257|ref|ZP_01292095.1| Glycosyl transferase, family 2 [delta proteobacterium MLMS-1]
gi|93450186|gb|EAT01491.1| Glycosyl transferase, family 2 [delta proteobacterium MLMS-1]
Length = 743
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 84/164 (51%), Gaps = 1/164 (0%)
Query: 37 QSIVITSRQQYGLP-EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
Q V+ +R + GLP ED ++ FN++ K+ W+++LK P + LW+L
Sbjct: 156 QRPVVPARAELGLPGEDVLIVGMFNKINKVRGEVWAAWMSILKLAPKAWLWVLDPGLAAR 215
Query: 96 ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
AN+ A+ + + RILF+ + H+ R D+ LD GHT++ D L+ G PV
Sbjct: 216 ANLLKAAREHQVAEKRILFAPALPQAGHLSRLGACDLMLDPWPYGGHTSTTDALFAGVPV 275
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+TL G A RV+A L + P L+ + ++Y A L +R
Sbjct: 276 ITLEGRNFAGRVSAGLLKSANLPALVRGSVQDYIQTAAGLLNNR 319
>gi|257092158|ref|YP_003165799.1| TPR repeat-containing protein [Candidatus Accumulibacter phosphatis
clade IIA str. UW-1]
gi|257044682|gb|ACV33870.1| TPR repeat-containing protein [Candidatus Accumulibacter phosphatis
clade IIA str. UW-1]
Length = 597
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 82/162 (50%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
P + + R + LP + V+C F+ YK++P+ +W+ +L+ V S+LWLL E
Sbjct: 374 PAAGRVPERVELALPAEGFVFCCFSHSYKLNPAMFAVWMRLLRQVAGSVLWLLAAKPEAE 433
Query: 96 ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
++ A G+D R++F+ H+ R AD+ LDT N TT+ L G PV
Sbjct: 434 ERLRREADRCGVDPTRLVFAPRIDLAAHLARHAQADLFLDTLPFNAITTTSVALHAGLPV 493
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
++ G + A+R A S L +G L+ +Y+ A+ L T
Sbjct: 494 LSCSGRSFAARGAGSLLQAVGMSALVTGNLSDYEARALELAT 535
>gi|381160366|ref|ZP_09869598.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
gi|380878430|gb|EIC20522.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
Length = 731
Score = 89.7 bits (221), Expect = 6e-16, Method: Composition-based stats.
Identities = 51/166 (30%), Positives = 86/166 (51%), Gaps = 6/166 (3%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
P + R++ P + + + +FN K+ P+T+ +W ++L+ PN+ L LL++ ++
Sbjct: 521 PPEPAVPIRKRNANPRE-LTFGSFNNTIKLSPATIALWSHILRETPNTRL-LLRYASLRH 578
Query: 96 ANIQATA----QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
A I+ A G+ R+ A++ E + D+ LD G T+ + LW
Sbjct: 579 AEIRRQLLERFAAHGISAERLTLEGKASRTEMLETYNRVDIALDPTPYGGGITTAEALWM 638
Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
G PV+TL G R +AS L T+GCP L+A+ +EY +AI L T
Sbjct: 639 GVPVITLHGGAWPGRHSASILNTIGCPGLVAKNEEEYVALAISLAT 684
>gi|196232796|ref|ZP_03131647.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
gi|196223256|gb|EDY17775.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
Length = 591
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 74/145 (51%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
I + FN K + TL + VL +V NS L LL + ++ G+ RI
Sbjct: 400 ITFGCFNNFSKFNEETLARFSRVLHSVKNSHLVLLAPEGSAQRHLSERFAHYGIAGDRIE 459
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F+ ++ +++RR D+ LDT NG T+ + LW G PV++L GET R L+
Sbjct: 460 FAATGSRADYLRRYHRVDIALDTLPSNGMATTCETLWMGVPVISLVGETSVGRAGLGLLS 519
Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
T+G PEL+ RT E+ A +L TD
Sbjct: 520 TIGLPELVTRTPDEFVSTATQLATD 544
>gi|390569448|ref|ZP_10249733.1| TPR repeat-containing protein [Burkholderia terrae BS001]
gi|389938308|gb|EIN00152.1| TPR repeat-containing protein [Burkholderia terrae BS001]
Length = 709
Score = 89.4 bits (220), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 78/152 (51%), Gaps = 5/152 (3%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLD 108
AI + FN L K++ + + +W VL AVP S L L L PAV A ++ A A G+D
Sbjct: 489 AITFGCFNHLVKLNDAVIALWSRVLDAVPGSCLLLKTRQLDDPAVQRATLERFA-AHGID 547
Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+L + + E + Q D+ LD G TTS++ LW G PV+T GE S V
Sbjct: 548 GARLLLEGQSPRAELLAAYQRVDIALDPFPYAGGTTSVEALWMGVPVLTRRGERFLSHVG 607
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S + T G PE IA +Y A+R T RD
Sbjct: 608 ESIVNTTGLPEWIAADDADYLSKALRFSTQRD 639
>gi|90079719|dbj|BAE89539.1| unnamed protein product [Macaca fascicularis]
Length = 113
Score = 89.4 bits (220), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 40/53 (75%), Positives = 47/53 (88%)
Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
MDVLW GTP+VT+PGETLASRVAASQL LGC ELIA+ +EY+DIA++LGTD
Sbjct: 1 MDVLWAGTPMVTMPGETLASRVAASQLTCLGCLELIAKNRQEYEDIAVKLGTD 53
>gi|428217495|ref|YP_007101960.1| hypothetical protein Pse7367_1237 [Pseudanabaena sp. PCC 7367]
gi|427989277|gb|AFY69532.1| TPR repeat-containing protein [Pseudanabaena sp. PCC 7367]
Length = 798
Score = 89.4 bits (220), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 84/159 (52%), Gaps = 3/159 (1%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG--EANIQAT 101
R+ +P ++VY + YK P + + ++KAVPNS + F +
Sbjct: 596 RRDLNIPAASVVYLSSQIGYKRHPDHARSQMQIIKAVPNSYFLIKGFADQNTVQKFFYEL 655
Query: 102 AQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A G+++ R++F +N + + EH +ADV LDT NG TT+M+ LW G P+VT G
Sbjct: 656 ADQEGVERDRLIFLANTSLEIEHRANLAIADVVLDTFPYNGATTTMETLWAGVPIVTRVG 715
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+ A+R + S L G IA + +EY D IRLG+DR
Sbjct: 716 QQFAARNSYSMLVNAGIEAGIAWSDQEYIDWGIRLGSDR 754
>gi|75908610|ref|YP_322906.1| hypothetical protein Ava_2394 [Anabaena variabilis ATCC 29413]
gi|75702335|gb|ABA22011.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 740
Score = 89.0 bits (219), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 56/167 (33%), Positives = 89/167 (53%), Gaps = 9/167 (5%)
Query: 38 SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
S+ R+Q +P DAIVY + + +K P T ++ + ++K VPNS L+K V E
Sbjct: 532 SVPTIRREQLDIPRDAIVYFSGQRGFKRHPHTTRLQLKIIKEVPNSYF-LIK--GVSEEE 588
Query: 98 -----IQATAQALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
+ A+ G++ ++ F + + E H ++ D+ LDT NG TT+++ LW
Sbjct: 589 GIKKFFEQLAEEEGVELSKLRFLPIDSTEPIHRANLEIVDIVLDTYPYNGATTTLETLWM 648
Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
P+VT GE A+R + + + G PE IA T +EY D +RLG D
Sbjct: 649 CIPMVTRVGEQFAARNSYTMMMNAGIPEGIAWTDEEYVDWGVRLGKD 695
>gi|381160356|ref|ZP_09869588.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
gi|380878420|gb|EIC20512.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
Length = 655
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 81/146 (55%), Gaps = 6/146 (4%)
Query: 58 NFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA----TAQALGLDQHRIL 113
+FN L K+ TL +W V++A+P S+L +LK +A +Q T + G++ R+
Sbjct: 466 SFNNLAKVGEQTLALWSRVMRALPESML-ILKGRGAEDAAVQTRILETLRGHGIEGARLE 524
Query: 114 F-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
F ++ EH+ D+ LDT N TT+ D LW GTP+VT+ G LA R AAS L
Sbjct: 525 FWPYTPSRSEHLALYNQIDLALDTIPFNSATTACDALWMGTPLVTVLGGQLAGRQAASIL 584
Query: 173 ATLGCPELIARTHKEYQDIAIRLGTD 198
LG PE IAR + E+ D+ L +D
Sbjct: 585 TGLGRPEWIARNNDEFVDVVASLASD 610
>gi|414077428|ref|YP_006996746.1| hypothetical protein ANA_C12192 [Anabaena sp. 90]
gi|413970844|gb|AFW94933.1| TPR repeat-containing protein [Anabaena sp. 90]
Length = 733
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 85/162 (52%), Gaps = 6/162 (3%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ--- 99
SR+ +P D I+Y + K +P +++ + ++K VPNS LLK ++Q
Sbjct: 530 SRESLDIPNDGIIYFSSQSGLKRNPDNIRLQMQIIKQVPNSYF-LLKTSRSNHEDMQNFI 588
Query: 100 -ATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
A+A GLD F A E EH +AD+ LDT NG TT+++ LW G P+VT
Sbjct: 589 APLAEAQGLDLECFRFLPTAPTEMEHRANLAIADIVLDTYPYNGATTTLETLWMGVPIVT 648
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
GE A+R + + + +G E +A + +EY + +RLG D
Sbjct: 649 RVGEQFAARNSYTMMMNVGVTEGLAWSDEEYVEWGVRLGKDE 690
>gi|253702411|ref|YP_003023600.1| hypothetical protein GM21_3825 [Geobacter sp. M21]
gi|251777261|gb|ACT19842.1| Tetratricopeptide TPR_2 repeat protein [Geobacter sp. M21]
Length = 589
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/152 (37%), Positives = 78/152 (51%), Gaps = 6/152 (3%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLD 108
A+ + +FN L K+ P T+ +W VL AVP S L LLK ++ ++ +L G+
Sbjct: 396 AVTFGSFNNLAKVTPETIALWCRVLDAVPGSRL-LLKGRPFADSGVRERIASLFARGGIA 454
Query: 109 QHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
R+ L H+ + D+ LDT NG TT+ + LW G PVVTL G A+R
Sbjct: 455 GERVELHPGEPENSAHLAQYGRVDIALDTFPYNGTTTTCEALWMGVPVVTLAGTRHAART 514
Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
AS L G EL+A EY +IA RL DR
Sbjct: 515 GASILTNCGLDELVAEDEGEYLEIARRLAADR 546
>gi|300866601|ref|ZP_07111289.1| hypothetical protein OSCI_3020012 [Oscillatoria sp. PCC 6506]
gi|300335373|emb|CBN56449.1| hypothetical protein OSCI_3020012 [Oscillatoria sp. PCC 6506]
Length = 1294
Score = 89.0 bits (219), Expect = 1e-15, Method: Composition-based stats.
Identities = 61/175 (34%), Positives = 88/175 (50%), Gaps = 11/175 (6%)
Query: 35 VPQSI--VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA 92
VP I V +R +G E +V+ + + +KI P + W ++ AVPNSIL L F +
Sbjct: 1071 VPSEIAKVHPTRSSWGASEGTVVFMSGARAFKIIPELRETWAKIIAAVPNSILVLHPFRS 1130
Query: 93 VGEA--------NIQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLCNGHT 143
GE+ I+A G+D+ RI+ V R +LADV LD+ G T
Sbjct: 1131 DGESYPIIPFFNQIRAVFSEKGIDKKRIVVIKVLPNRADFRECLKLADVYLDSFPYAGAT 1190
Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ ++ L G P V G+T SR + + L L PELIA T Y +IA+ LGT+
Sbjct: 1191 SMVEPLLVGIPTVVKEGQTARSRQSGAILRELQLPELIANTEDAYINIAVALGTN 1245
>gi|307152771|ref|YP_003888155.1| O-linked N-acetylglucosamine transferase [Cyanothece sp. PCC 7822]
gi|306982999|gb|ADN14880.1| O-linked N-acetylglucosamine transferase SPINDLY family-like
protein [Cyanothece sp. PCC 7822]
Length = 724
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 87/165 (52%), Gaps = 13/165 (7%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
RQQ +P+ A VY + K P T+++ + +LKAVP+S L L+K G N +
Sbjct: 520 KRQQLDIPDGAFVYLSVQTGLKRHPDTIRLQMKILKAVPDSYL-LIK----GAGNFEKIQ 574
Query: 103 QAL-------GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
Q + +R+ F + A EE H +ADV LDT NG TT+++VLW G P
Sbjct: 575 QLFVSIASEEDVSPNRLRFLSQAPTEEIHRANLAIADVVLDTYPYNGATTTLEVLWMGIP 634
Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+VT GE A+R + + + G E IA T +EY ++LGTD
Sbjct: 635 LVTRVGEQFAARNSYTFMINAGLTEGIAWTDEEYIQWGVKLGTDE 679
>gi|75756032|gb|ABA27069.1| TO89-1 [Taraxacum officinale]
Length = 89
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/79 (54%), Positives = 55/79 (69%), Gaps = 1/79 (1%)
Query: 122 EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ-LATLGCPEL 180
E++RR LAD+CLDTPLCN HT DVLW G P+VTLP E +A+RVA S LAT E+
Sbjct: 1 EYIRRSSLADLCLDTPLCNAHTAGTDVLWAGLPMVTLPLEKMATRVAGSLCLATGVGEEM 60
Query: 181 IARTHKEYQDIAIRLGTDR 199
I + KEY++ A+ L +R
Sbjct: 61 IVNSMKEYEERAVYLALNR 79
>gi|376003415|ref|ZP_09781226.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira sp. PCC 8005]
gi|375328218|emb|CCE16979.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira sp. PCC 8005]
Length = 746
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/160 (36%), Positives = 84/160 (52%), Gaps = 5/160 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA----VGEANIQ 99
RQ +PE+AIVY + K +P Q+ + +LK VPNS L + F G
Sbjct: 543 RQDINIPENAIVYFTAQRGPKYNPELAQLQMQILKQVPNSYLIIKGFDKEQSIAGLFFEL 602
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A Q + +Q R L +V ++ H +ADV LD+ NG TT+++ LW G P+VT
Sbjct: 603 AEEQGVSREQLR-LTGSVLLEQTHRANLAIADVVLDSYPYNGATTTLETLWMGIPLVTRV 661
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
GE A+R + + + G E IA T +EY + +RLGTD
Sbjct: 662 GEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTDE 701
>gi|253996290|ref|YP_003048354.1| hypothetical protein Mmol_0918 [Methylotenera mobilis JLW8]
gi|253982969|gb|ACT47827.1| Tetratricopeptide TPR_2 repeat protein [Methylotenera mobilis JLW8]
Length = 693
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/144 (32%), Positives = 78/144 (54%), Gaps = 5/144 (3%)
Query: 48 GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN---IQATAQA 104
L D I + FN++ KI PS +++W +L AVPNS L L P G + + + A
Sbjct: 488 ALSNDFITFGCFNRVEKITPSVIKLWSGLLNAVPNSKLLLGAMPKDGSYDGVINELASHA 547
Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
+ D R++ + + + +++ L D+CLDT NG TT+ W G P + L G++L
Sbjct: 548 ISAD--RLIIHHRSTMDNYLKLHNLVDICLDTFPSNGVTTTCHAAWMGVPTLCLEGQSLM 605
Query: 165 SRVAASQLATLGCPELIARTHKEY 188
SR A + + LG PE + ++H ++
Sbjct: 606 SRGAMAVMLHLGLPEFVVKSHDDF 629
>gi|197120103|ref|YP_002140530.1| hypothetical protein [Geobacter bemidjiensis Bem]
gi|197089463|gb|ACH40734.1| TPR domain protein [Geobacter bemidjiensis Bem]
Length = 589
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/152 (36%), Positives = 75/152 (49%), Gaps = 4/152 (2%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL---GLDQ 109
A+ + +FN L K+ P + +W VL AVP S L L P + A G+
Sbjct: 396 AVTFGSFNNLAKVTPEVIALWCRVLHAVPGSRLLLKGRPFEDSGVRERIASQFSTGGIGG 455
Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
RI L H+ + D+ LDT NG TT+ + LW G PVVTL G A+R
Sbjct: 456 ERIELNPGEPESSAHLAQYGRVDIALDTFPYNGTTTTCEALWMGVPVVTLAGTRHAARTG 515
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
AS LA G EL+A +EY +IA R+ DR+
Sbjct: 516 ASILANCGLDELVAENEEEYLEIARRMAADRE 547
>gi|423066870|ref|ZP_17055660.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
gi|406711635|gb|EKD06835.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
Length = 746
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/160 (36%), Positives = 84/160 (52%), Gaps = 5/160 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA----VGEANIQ 99
RQ +PE+AIVY + K +P Q+ + +LK VPNS L + F G
Sbjct: 543 RQDINIPENAIVYFTAQRGPKYNPELAQLQMQILKQVPNSYLIIKGFDKEQSIAGLFFEL 602
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A Q + +Q R L +V ++ H +ADV LD+ NG TT+++ LW G P+VT
Sbjct: 603 AEEQGVSREQLR-LTGSVLLEQTHRANLAIADVVLDSYPYNGATTTLETLWMGIPLVTRV 661
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
GE A+R + + + G E IA T +EY + +RLGTD
Sbjct: 662 GEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTDE 701
>gi|428221015|ref|YP_007105185.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Synechococcus sp. PCC 7502]
gi|427994355|gb|AFY73050.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Synechococcus sp. PCC 7502]
Length = 604
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 86/165 (52%), Gaps = 13/165 (7%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN------ 97
R +PE AIVY YK P T+++ + ++KAVP+S L ++K G AN
Sbjct: 403 RSLLDIPEGAIVYFTSQSGYKRHPDTIKLQLQIIKAVPHSFL-IIK----GTANQETLKT 457
Query: 98 -IQATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
+ +GLD+ R+ F + EH +ADV LDT NG TT+M+ LW P+
Sbjct: 458 LFYEIGEEVGLDRDRLRFLPMTKGVLEHRANLDIADVVLDTYPYNGATTTMETLWMCIPM 517
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
VT G+ A+R + + + G E IA T KEY + IRLG D +
Sbjct: 518 VTKVGQQFAARNSYTMMMNAGITEGIAWTDKEYVEWGIRLGRDEN 562
>gi|309256371|gb|ADO61013.1| spindly [Helianthus annuus]
Length = 266
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 80/149 (53%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
+ + +FN L KI P LQ+W +L AVPNS L + P ++ Q +T + LGL+ H
Sbjct: 11 VTFGSFNNLAKITPKVLQVWAKILCAVPNSRLIVKCKPFCCDSVRQKFLSTLEQLGLESH 70
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 71 RVDLVPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMGGLVHAHNVGV 130
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G L+A+T EY +AI L +D
Sbjct: 131 SLLTAVGLERLVAKTEDEYIRLAIELASD 159
>gi|94264577|ref|ZP_01288362.1| Glycosyl transferase, family 2 [delta proteobacterium MLMS-1]
gi|93454995|gb|EAT05229.1| Glycosyl transferase, family 2 [delta proteobacterium MLMS-1]
Length = 994
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 51/164 (31%), Positives = 83/164 (50%), Gaps = 1/164 (0%)
Query: 37 QSIVITSRQQYGLP-EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
Q V+ +R + GLP ED ++ FN++ K+ W+++LK P + LW+L
Sbjct: 407 QRPVVPARAELGLPDEDVLIVGMFNKINKVRGEVWAAWMSILKLAPKAWLWVLDPGLAAR 466
Query: 96 ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
N+ A+ + + RILF+ + H+ R D+ LD GHT++ D L+ G PV
Sbjct: 467 DNLLKAAREHQVAEKRILFAPALPQAGHLSRLGACDLMLDPWPYGGHTSTTDALFAGVPV 526
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+TL G A RV+A L + P L+ + ++Y A L +R
Sbjct: 527 ITLAGRNFAGRVSAGLLKSANLPALVRGSVQDYIQTAAGLLNNR 570
>gi|209527309|ref|ZP_03275818.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
gi|209492235|gb|EDZ92581.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
Length = 758
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/164 (34%), Positives = 83/164 (50%), Gaps = 3/164 (1%)
Query: 38 SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
S+ +R +P+DAIVY + K +P ++ + +L+ VPNS + F +
Sbjct: 548 SVPTITRADLEIPDDAIVYLGLQKGPKYNPHIAKLQLEILREVPNSYFLVKGFGQQDSFS 607
Query: 98 --IQATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
A G+ RI F V +E H +ADV LDT NG TT+M+ LW G P
Sbjct: 608 KFFFDIANQQGITTDRIKFIPTVKLEENHRANLLIADVVLDTYPYNGATTTMETLWMGIP 667
Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+VT GE A+R + + + G E IA T +EY + +RLGTD
Sbjct: 668 LVTRVGEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTD 711
>gi|309256349|gb|ADO61002.1| spindly [Helianthus annuus]
Length = 266
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 80/149 (53%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
+ + +FN L KI P LQ+W +L AVPNS L + P ++ Q +T + LGL+ H
Sbjct: 11 VTFGSFNNLAKITPKVLQVWAKILCAVPNSRLIVKCKPFCCDSVRQKFLSTLEQLGLESH 70
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 71 RVDLVPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMGGLVHAHNVGV 130
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G L+A+T EY +AI L +D
Sbjct: 131 SLLTAVGLXRLVAKTEDEYIRLAIELASD 159
>gi|423066869|ref|ZP_17055659.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
gi|406711634|gb|EKD06834.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
Length = 745
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/164 (34%), Positives = 83/164 (50%), Gaps = 3/164 (1%)
Query: 38 SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
S+ +R +P+DAIVY + K +P ++ + +L+ VPNS + F +
Sbjct: 535 SVPTITRADLEIPDDAIVYLGLQKGPKYNPHIAKLQLEILREVPNSYFLVKGFGQQDSFS 594
Query: 98 --IQATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
A G+ RI F V +E H +ADV LDT NG TT+M+ LW G P
Sbjct: 595 KFFFDIANQQGITTDRIKFIPTVKLEENHRANLLIADVVLDTYPYNGATTTMETLWMGIP 654
Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+VT GE A+R + + + G E IA T +EY + +RLGTD
Sbjct: 655 LVTRVGEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTD 698
>gi|409991249|ref|ZP_11274527.1| hypothetical protein APPUASWS_09484 [Arthrospira platensis str.
Paraca]
gi|409937885|gb|EKN79271.1| hypothetical protein APPUASWS_09484 [Arthrospira platensis str.
Paraca]
Length = 742
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 85/160 (53%), Gaps = 5/160 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA----VGEANIQ 99
R+ +PE+AIVY + K +P Q+ + +LK VPNS L + F G
Sbjct: 539 REDINIPENAIVYFTAQRGPKYNPELAQLQMQILKQVPNSYLVIKGFDKEQSIAGLFFEL 598
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A Q + +Q R L +V+ ++ H +ADV LD+ NG TT+++ LW G P+VT
Sbjct: 599 AEEQGVSSEQLR-LTGSVSTEQTHRANLLIADVVLDSYPYNGATTTLETLWMGIPLVTRV 657
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
GE A+R + + + G E IA T +EY + +RLGTD
Sbjct: 658 GEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTDE 697
>gi|291566900|dbj|BAI89172.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 742
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 85/160 (53%), Gaps = 5/160 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA----VGEANIQ 99
R+ +PE+AIVY + K +P Q+ + +LK VPNS L + F G
Sbjct: 539 REDINIPENAIVYFTAQRGPKYNPELAQLQMQILKQVPNSYLVIKGFDKEQSIAGLFFEL 598
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A Q + +Q R L +V+ ++ H +ADV LD+ NG TT+++ LW G P+VT
Sbjct: 599 AEEQGVSSEQLR-LTGSVSTEQTHRANLLIADVVLDSYPYNGATTTLETLWMGIPLVTRV 657
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
GE A+R + + + G E IA T +EY + +RLGTD
Sbjct: 658 GEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTDE 697
>gi|428221016|ref|YP_007105186.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Synechococcus sp. PCC 7502]
gi|427994356|gb|AFY73051.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Synechococcus sp. PCC 7502]
Length = 734
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 59/167 (35%), Positives = 90/167 (53%), Gaps = 15/167 (8%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI---- 98
+R+Q +P + I+Y + + K P T+++ + +LK VPNS L+K + + N
Sbjct: 533 TREQLEVPANGIIYFSSQKSDKRHPETVRLQMQILKQVPNSYF-LIK--GLSDQNSIKDF 589
Query: 99 ---QATAQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGT 153
A + + DQ R L ++EE+ RG L ADV LDT NG TT+M+VLW+G
Sbjct: 590 FYEMAELEGVNRDQLRFL---PYSREENEHRGNLPIADVVLDTYPYNGATTTMEVLWSGI 646
Query: 154 PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
P+VT GE A+R + + + G E IA T +EY +R G + D
Sbjct: 647 PLVTRVGEQFAARNSYTMMMNAGISEGIAWTPEEYVAWGVRFGLEED 693
>gi|334118799|ref|ZP_08492887.1| hypothetical protein MicvaDRAFT_5391 [Microcoleus vaginatus FGP-2]
gi|333459029|gb|EGK87644.1| hypothetical protein MicvaDRAFT_5391 [Microcoleus vaginatus FGP-2]
Length = 746
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/156 (37%), Positives = 83/156 (53%), Gaps = 9/156 (5%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL---WLLKFPAVGEANIQ-ATAQA 104
+P DA++Y YK +P+T ++ + ++KAVPNS W + ++ IQ A ++
Sbjct: 548 IPSDAVIYYCSQMGYKYNPNTARLQMQIIKAVPNSYFVTKWFAEQESLKNFFIQIAESEG 607
Query: 105 LGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
L D+ R L A E V R L ADV LDT NG TT+M+ LW P+VT G+
Sbjct: 608 LSADRLRFL---PHAPSEAVHRANLGIADVVLDTYPYNGATTTMETLWMCIPLVTRVGQQ 664
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+SR + + G E IA T +EY + IRLG D
Sbjct: 665 FSSRNSYGMMMNAGILEGIAWTDEEYVEWGIRLGKD 700
>gi|443312171|ref|ZP_21041791.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Synechocystis sp. PCC 7509]
gi|442777849|gb|ELR88122.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Synechocystis sp. PCC 7509]
Length = 730
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 57/161 (35%), Positives = 92/161 (57%), Gaps = 9/161 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA--- 100
R+ +P DA++Y + YK +P T+++ + ++K VPNS L+K A E +++
Sbjct: 521 REHLEIPPDAVIYYSGQTGYKRNPETVRLQMQIIKQVPNSYF-LVKGKA-DETSVKQFFT 578
Query: 101 -TAQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
A+A G+D +R+ F + E V R L ADV LDT NG TT+++ LW G P+VT
Sbjct: 579 QVAEAEGVDPNRLKFIP-RDESELVHRANLGVADVVLDTYPYNGATTTLETLWMGVPLVT 637
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GE ++R + + + +G E IA T +EY + +RLG +
Sbjct: 638 RVGEQFSARNSYTFMMNVGVTEGIAWTDEEYVEWGVRLGKE 678
>gi|428175309|gb|EKX44200.1| hypothetical protein GUITHDRAFT_140027 [Guillardia theta CCMP2712]
Length = 806
Score = 87.0 bits (214), Expect = 3e-15, Method: Composition-based stats.
Identities = 45/118 (38%), Positives = 72/118 (61%), Gaps = 3/118 (2%)
Query: 82 NSILWLLKFP--AVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLC 139
+S +W+LK E ++A A+ G+D+ R+ F+ A ++HV AD+ LDT L
Sbjct: 634 SSQIWILKLNEGTSAEPQLKAEARRQGIDEGRLFFTGSADLKDHVAYKSAADLLLDTDLY 693
Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLG-CPELIARTHKEYQDIAIRLG 196
N H+T+ D L+ G P++TLPG ++SR+ AS LG ++AR K+Y++IA+RL
Sbjct: 694 NAHSTAADALFAGVPLITLPGTRMSSRIGASIANGLGHASVMVARDLKDYEEIAVRLA 751
>gi|403376314|gb|EJY88133.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Oxytricha trifallax]
Length = 236
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 52/155 (33%), Positives = 84/155 (54%), Gaps = 6/155 (3%)
Query: 48 GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGL 107
G+P +A V+ FNQ ++ID +L+ + + VP SIL + G+ +I A A+ L +
Sbjct: 2 GIPRNAFVFGIFNQYFQIDSLSLETGFKIKQQVPGSILLFVLVNKEGQQDIVAKARQLEV 61
Query: 108 DQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
D + + F + K + +V R AD+ LDT N + D+LW+G P++TL + A+R
Sbjct: 62 DLNEVYFLSQDIKSQVNVNRMFTADLALDTLQVNARIVNADLLWSGVPLLTLKSDNWANR 121
Query: 167 VAASQLATLGCP-----ELIARTHKEYQDIAIRLG 196
V AS L+ LG L+A T ++Y A+ L
Sbjct: 122 VGASMLSGLGLSVQQYDSLVASTVQQYIQKAVDLA 156
>gi|186681176|ref|YP_001864372.1| hypothetical protein Npun_F0677 [Nostoc punctiforme PCC 73102]
gi|186463628|gb|ACC79429.1| Tetratricopeptide TPR_2 repeat protein [Nostoc punctiforme PCC
73102]
Length = 736
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 86/161 (53%), Gaps = 7/161 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQA 100
R +P DA+VY + + +K P+T Q+ + ++K VPNS L+K A EA
Sbjct: 534 RDSLDIPSDAVVYLSAQRGFKRHPNTAQLQMRIIKEVPNSYF-LIKGLADSEAVQKFFMQ 592
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLA--DVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A+ G++ R+ F + E V R LA DV LDT NG TT+++ LW G P+VT
Sbjct: 593 LAEQEGVECSRLRFLQLDPSES-VHRANLAIADVILDTYPYNGATTTLETLWMGIPLVTR 651
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G+ A+R + + + G E IA T +EY + +RLG D
Sbjct: 652 VGQQFAARNSYTMMMNAGIAEGIAWTDEEYVEWGVRLGKDE 692
>gi|119510220|ref|ZP_01629358.1| TPR repeat protein [Nodularia spumigena CCY9414]
gi|119465170|gb|EAW46069.1| TPR repeat protein [Nodularia spumigena CCY9414]
Length = 743
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 93/167 (55%), Gaps = 17/167 (10%)
Query: 44 RQQYGLPEDAIVYCNFNQLYK--IDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
R+Q +PEDA+VY + YK +D + LQ + ++K VPNS L L+K GEA+ QA+
Sbjct: 539 REQLDIPEDAVVYFVTQKGYKRHLDHAKLQ--IAIIKEVPNSYL-LIK----GEADEQAS 591
Query: 102 -------AQALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGT 153
A++ +D +R F + + E H LADV LDT NG TT+++ LW
Sbjct: 592 EKFYLDLAKSQNVDFNRFRFLPLVSSEAIHRANLSLADVVLDTYPYNGATTTLETLWREI 651
Query: 154 PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
P+VT G+ A+R + + + G E IA + +EY + +RLG D +
Sbjct: 652 PLVTKVGQQFAARNSYTMMINAGITEGIAWSDEEYVEWGVRLGKDEN 698
>gi|381160369|ref|ZP_09869601.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
gi|380878433|gb|EIC20525.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
Length = 721
Score = 86.7 bits (213), Expect = 5e-15, Method: Composition-based stats.
Identities = 53/151 (35%), Positives = 78/151 (51%), Gaps = 5/151 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
I + +FN + KI ST+++W VL +S L +L+ + E +++ A + GL
Sbjct: 530 ITFGSFNNVAKISDSTIKLWARVLDESKDSRL-VLRDKSFTELSMRKRMLARFGSFGLAA 588
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ + E++ + DV LD G TT+ D LW G PVVTL G+T A R++
Sbjct: 589 ERLSLEPALGRREYLESYRHIDVALDPTPYGGGTTTADALWMGVPVVTLRGKTWAGRIST 648
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S L LG PE A+ EY IA L DR+
Sbjct: 649 SILNALGIPEFSAQDENEYVSIASSLARDRE 679
>gi|443475509|ref|ZP_21065456.1| TPR repeat-containing protein [Pseudanabaena biceps PCC 7429]
gi|443019628|gb|ELS33689.1| TPR repeat-containing protein [Pseudanabaena biceps PCC 7429]
Length = 741
Score = 86.3 bits (212), Expect = 6e-15, Method: Composition-based stats.
Identities = 57/167 (34%), Positives = 91/167 (54%), Gaps = 5/167 (2%)
Query: 38 SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
SI SR+Q +P+DA+VY YK P T+ + + ++ +V NS +L+K E
Sbjct: 532 SIPSISREQLNIPDDAVVYLCTQTSYKRFPDTVHLQMQIINSVSNS-YFLIKGWGEEEGT 590
Query: 98 IQ---ATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGT 153
+ A+ G+ + R+ F +A E EH ++ADV LDT NG TT+M+ LW
Sbjct: 591 KRFFIDIAKKEGVSEDRLRFLPLADSEAEHRANLRIADVVLDTYPYNGATTTMETLWMCI 650
Query: 154 PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
P+VT G+ ++R + + + +G E I+ T +EY + IRLG D +
Sbjct: 651 PLVTKVGQQFSARNSYTMMMNVGVTEGISWTDEEYVEWGIRLGKDEN 697
>gi|428318228|ref|YP_007116110.1| Tetratricopeptide TPR_1 repeat-containing protein [Oscillatoria
nigro-viridis PCC 7112]
gi|428241908|gb|AFZ07694.1| Tetratricopeptide TPR_1 repeat-containing protein [Oscillatoria
nigro-viridis PCC 7112]
Length = 729
Score = 86.3 bits (212), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 64/181 (35%), Positives = 90/181 (49%), Gaps = 15/181 (8%)
Query: 35 VPQSIV-ITSRQQYGLPE-----------DAIVYCNFNQLYKIDPSTLQMWVNVLKAVPN 82
+P S V ++ Q+YG+ D IVY K + ++ + +LK VPN
Sbjct: 504 MPDSFVAVSGFQRYGVDPVALRRSNRIGLDQIVYLCVAPGRKFNYELVKAQIAILKQVPN 563
Query: 83 SIL--WLLKFPAVGEANIQATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLC 139
SIL L AV EA +A + +HRI F A +EEH LADV LD+
Sbjct: 564 SILVHKALGDAAVFEAAYHQACEAEKVSKHRIKFLPRFATEEEHRIIYLLADVLLDSYPY 623
Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
NG T +++ LW PVVT GE SR+ S L TLG +A + EY ++ ++LG DR
Sbjct: 624 NGGTHTLEALWFNLPVVTRTGEQFLSRMGYSFLQTLGIETGVALSWSEYVEVGVKLGKDR 683
Query: 200 D 200
+
Sbjct: 684 E 684
>gi|374999547|ref|YP_004975635.1| hypothetical protein AZOLI_p60052 [Azospirillum lipoferum 4B]
gi|357428518|emb|CBS91475.1| conserved protein of unknown function; putative TPR domains
[Azospirillum lipoferum 4B]
Length = 583
Score = 86.3 bits (212), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 77/149 (51%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE---ANIQATAQALGLDQH 110
+ + +FN+L KI T+++W VL+ VP+S L L P A ++A A G+
Sbjct: 379 VTFGSFNKLSKITDHTVRLWAEVLRRVPDSRLLLKDRPLSDPGTAAGLRARFAAAGIAPD 438
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + H+ D+ LD NG T+ D LW G P+VTL G A+RV A
Sbjct: 439 RLDLVGFIKDAAGHLAAYNRIDIALDPHPYNGTITTCDTLWMGAPLVTLAGGRHAARVGA 498
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S +A++G PEL+A T Y IA L D
Sbjct: 499 SLMASIGLPELVAATPDRYAAIAAELAGD 527
>gi|347736536|ref|ZP_08869150.1| TPR repeat-containing protein [Azospirillum amazonense Y2]
gi|346919949|gb|EGY01259.1| TPR repeat-containing protein [Azospirillum amazonense Y2]
Length = 525
Score = 86.3 bits (212), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 55/149 (36%), Positives = 80/149 (53%), Gaps = 5/149 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLDQ 109
+ + FN K++ + W +L VP S L LLK + EA +QA +A G+D
Sbjct: 341 VTFGCFNIAAKLNAEVIACWAQILNRVPGSRL-LLKSRGLDEAPVQARILEQFRAGGIDA 399
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ F + EHV Q D+ LD G TT+++ LW G PVVTL G+T ++R +
Sbjct: 400 ARLDFLGATPQAEHVACMQRVDIALDPFPFAGSTTTLESLWMGLPVVTLTGQTFSARHST 459
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
L+ LG L+A T ++Y DIA RL T+
Sbjct: 460 CFLSVLGLTSLMATTVEQYVDIAARLATN 488
>gi|218670286|ref|ZP_03519957.1| hypothetical protein RetlG_00835 [Rhizobium etli GR56]
Length = 176
Score = 86.3 bits (212), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%)
Query: 95 EANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
+AN+ QA G+ RI+F A EEH+ R Q AD+ +DT NGHTT+ + LW G P
Sbjct: 8 QANLLKQFQAAGISPKRIIFCPRAPYEEHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLP 67
Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
V+T+ G ASRV+ S L + PEL+A + Y+D+A+ L +
Sbjct: 68 VLTVKGTNFASRVSESLLRAIDLPELVAGDLQAYEDLAVELAQN 111
>gi|377820855|ref|YP_004977226.1| hypothetical protein BYI23_A014110 [Burkholderia sp. YI23]
gi|357935690|gb|AET89249.1| TPR repeat-containing protein [Burkholderia sp. YI23]
Length = 798
Score = 85.9 bits (211), Expect = 7e-15, Method: Composition-based stats.
Identities = 57/151 (37%), Positives = 69/151 (45%), Gaps = 1/151 (0%)
Query: 48 GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGL 107
L I + N K TL MW VL A P + L LL P ALG+
Sbjct: 603 ALASGHITFGCLNNPCKASDRTLNMWAAVLAATPGARLVLLS-PHGARERAGERLGALGV 661
Query: 108 DQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
D RI F + ++ ++R D+ LDT NGHTTS+D LW G PV + GET SR
Sbjct: 662 DIARIEFLDYQPRDAYLRTYDRIDIGLDTFPYNGHTTSLDALWMGVPVPSRAGETAVSRA 721
Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L LG +L A Y DI RL D
Sbjct: 722 GLSFLMNLGLGDLAAHDDAAYVDIVSRLARD 752
>gi|356570865|ref|XP_003553604.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY-like [Glycine
max]
Length = 917
Score = 85.9 bits (211), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 81/149 (54%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
+ + +FN L KI P LQ+WV +L A+PNS L + P ++ Q +T + LGL+
Sbjct: 651 VTFGSFNNLAKITPKVLQVWVRILCAIPNSRLVVKCKPFCSDSVRQIFLSTLEQLGLEPL 710
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 711 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 770
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G LIAR EY ++A +L +D
Sbjct: 771 SLLSKVGLEHLIARNEDEYVELAQQLASD 799
>gi|224028913|gb|ACN33532.1| unknown [Zea mays]
Length = 144
Score = 85.9 bits (211), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 40/78 (51%), Positives = 52/78 (66%), Gaps = 1/78 (1%)
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGC-P 178
K EH+RR LAD+ LDTPLCN HTT D+LW G P++TLP E +A+RVA S G
Sbjct: 2 KNEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMITLPLEKMATRVAGSLCVATGLGE 61
Query: 179 ELIARTHKEYQDIAIRLG 196
E+I + K+Y+D A+ L
Sbjct: 62 EMIVSSMKKYEDRAVELA 79
>gi|427737301|ref|YP_007056845.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Rivularia sp. PCC 7116]
gi|427372342|gb|AFY56298.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Rivularia sp. PCC 7116]
Length = 744
Score = 85.9 bits (211), Expect = 9e-15, Method: Composition-based stats.
Identities = 58/160 (36%), Positives = 87/160 (54%), Gaps = 7/160 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---- 99
R+ +PEDAIVY + K P T+++ + +LKAVPNS +LL E IQ
Sbjct: 542 RENLNIPEDAIVYFSGQGGCKRHPDTVRLQMRILKAVPNS--YLLIKGLADEKAIQNFFN 599
Query: 100 ATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A A G++ + F V+++ H +ADV LDT NG TT+++ LW G P+VT
Sbjct: 600 EIAIAEGVEPKYLRFLPMVSSEATHRANLGIADVVLDTYPYNGATTTLETLWMGIPLVTK 659
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GE +R + + + +G E I+ + +EY + IRLG D
Sbjct: 660 VGEQFVARNSYTMMMNVGIKEGISWSDEEYLEWGIRLGKD 699
>gi|197120102|ref|YP_002140529.1| hypothetical protein [Geobacter bemidjiensis Bem]
gi|197089462|gb|ACH40733.1| TPR domain protein [Geobacter bemidjiensis Bem]
Length = 1106
Score = 85.9 bits (211), Expect = 9e-15, Method: Composition-based stats.
Identities = 52/151 (34%), Positives = 78/151 (51%), Gaps = 4/151 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQATAQALGLDQH 110
+ + +FN KI P T+ +W VL+AVP S L L + A ++ T G+++
Sbjct: 429 VTFGSFNNPAKITPETVLLWSGVLRAVPGSQLLLKGYSLACAETRLRLEETFAGHGIERE 488
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L N + +H+ D+ LD+ NG TT+ + LW G PVVTL G SRV A
Sbjct: 489 RLELLGNTPSYRDHLSLYDRVDIALDSYPYNGTTTTCEALWMGVPVVTLAGSAHRSRVGA 548
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S L LG L+A +++ +A L DR+
Sbjct: 549 SILQALGLEGLVAHEARKFVVLAQALAGDRE 579
>gi|440683084|ref|YP_007157879.1| methyltransferase FkbM family [Anabaena cylindrica PCC 7122]
gi|428680203|gb|AFZ58969.1| methyltransferase FkbM family [Anabaena cylindrica PCC 7122]
Length = 1200
Score = 85.9 bits (211), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 54/154 (35%), Positives = 80/154 (51%), Gaps = 8/154 (5%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL------LKFPAVGEANIQATAQALGL 107
+ +FN L K+ P T +W+ +LKAVPNS L L + P + E IQ G+
Sbjct: 384 FTFSSFNNLRKLTPETFALWIEILKAVPNSRLVLKCASSDVFSPLIAE-KIQTPFVEQGI 442
Query: 108 DQHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
D RI L+ AA E+H+ D+ LD+ G TT+ + LW G P +T+ G R
Sbjct: 443 DPKRIFLYGGYAADEDHLNLYNQVDLHLDSLPYTGCTTTCEALWMGVPTITVAGTRKMER 502
Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
++AS L ++G + IA + EY + AI L + D
Sbjct: 503 MSASILHSVGLDDCIAYSAVEYVEKAIALARNPD 536
>gi|17228058|ref|NP_484606.1| hypothetical protein all0562 [Nostoc sp. PCC 7120]
gi|17129907|dbj|BAB72520.1| all0562 [Nostoc sp. PCC 7120]
Length = 739
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/161 (34%), Positives = 86/161 (53%), Gaps = 7/161 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQA 100
R Q +P+DA+VY + +K P ++ + ++K VPNS L+K + E+ +
Sbjct: 538 RDQLDIPDDAVVYFCGQRGFKRHPDITRLQLQIIKEVPNSYF-LIKGISDEESIKNFFEE 596
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A+ G+D R+ F + A E V R L AD+ LDT NG TT+++ LW P+VT
Sbjct: 597 LAEEEGVDYSRLRFLPIVATES-VHRANLGIADIVLDTYPYNGATTTLETLWMCIPMVTK 655
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
GE A+R + + + G E IA T +EY + +RLG D
Sbjct: 656 VGEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGKDE 696
>gi|288963188|ref|YP_003453467.1| TPR repeat-containing protein [Azospirillum sp. B510]
gi|288915440|dbj|BAI76923.1| TPR repeat-containing protein [Azospirillum sp. B510]
Length = 654
Score = 85.5 bits (210), Expect = 1e-14, Method: Composition-based stats.
Identities = 54/155 (34%), Positives = 80/155 (51%), Gaps = 6/155 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ----A 104
L + + +FN K+ +++W +LK VPN+ L LLK +G+ +A +
Sbjct: 455 LDSGVVTFGSFNNTSKVTAEVVRVWAEILKRVPNARL-LLKSRQMGDDETRARYRNSFIT 513
Query: 105 LGLDQHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+D R+ L + + A + H+R D+ LD NG TT+ + LW G PVVTL G +
Sbjct: 514 HGIDPDRVELLARIPAADGHLRAYDRLDIALDPFPYNGTTTTCEALWMGVPVVTLAGRSH 573
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+RV AS L +G ELIA EY A+ L D
Sbjct: 574 VARVGASLLTNVGLEELIASDEAEYIAKAVALAGD 608
>gi|196232228|ref|ZP_03131082.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
gi|196223596|gb|EDY18112.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
Length = 660
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 54/154 (35%), Positives = 78/154 (50%), Gaps = 8/154 (5%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNS-ILWLLKFPAVGEANIQATAQALGLDQHRI 112
I + FN KI+ LQ+W +L+ VP+S ++ LLK A E ++A Q G+ R+
Sbjct: 464 ITFGCFNNFSKINERVLQLWARILQGVPDSRLILLLKTGAHREKTLEAI-QRHGIAAERL 522
Query: 113 LFSNVA------AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
+F + + ++R D+ LD NG TT+ D LW G PV+ L G SR
Sbjct: 523 VFHDYEPASLERSASHFLQRYAQVDIALDPFPYNGMTTTCDALWMGVPVIALRGTLGISR 582
Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+ S LA LG PE + EY + A+ L DRD
Sbjct: 583 ASFSLLANLGLPEYAVASEDEYCETAVALAGDRD 616
>gi|392380152|ref|YP_004987310.1| conserved protein of unknown function [Azospirillum brasilense
Sp245]
gi|356882519|emb|CCD03531.1| conserved protein of unknown function [Azospirillum brasilense
Sp245]
Length = 550
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 54/150 (36%), Positives = 77/150 (51%), Gaps = 7/150 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQ 109
+ +FN L K++ STL +W VL AVP + L L L V + A A A G+++
Sbjct: 350 FTFGSFNNLTKLNASTLALWGRVLAAVPEARLMLKWRGLSGGGVARRVLDAFA-AHGVEE 408
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+L A R L D LD NG TT+ D LW G PV+++ GE + SR A
Sbjct: 409 SRLLLRGTAPDPYEDYR--LLDCALDPVFANGGTTTCDALWMGVPVLSIAGEAMISRWGA 466
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+ L ++G L+ +Y +A+RL TDR
Sbjct: 467 TMLGSVGLGGLVVEREDDYVALAVRLATDR 496
>gi|381160360|ref|ZP_09869592.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
gi|380878424|gb|EIC20516.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
Length = 914
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 75/145 (51%), Gaps = 4/145 (2%)
Query: 58 NFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL---GLDQHRI-L 113
+FN L+K+ +L +W VL A+P + L L F + +AL G+D+ R+ L
Sbjct: 711 SFNNLFKVREESLALWARVLHALPQAHLALKDFKGANRVHQLRVLKALAAQGIDESRVAL 770
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
+ EH+ D+ LDT N TT+ D LW G P+VTL G+ +A AAS L+
Sbjct: 771 LRATPSWSEHMAYYNRLDIALDTLPFNSATTAFDALWMGCPLVTLAGDRMAGLQAASALS 830
Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
LG + +AR +Y IA+ L D
Sbjct: 831 GLGRTQWVARDEDDYVRIAVELARD 855
>gi|428210827|ref|YP_007083971.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Oscillatoria acuminata PCC 6304]
gi|427999208|gb|AFY80051.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Oscillatoria acuminata PCC 6304]
Length = 746
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 87/160 (54%), Gaps = 7/160 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
R + +P+DA+VY + YK P T ++ + ++KAVPNS + F ++ IQ
Sbjct: 543 RDRLNIPQDAVVYFSGQTGYKRHPDTARLQMQIIKAVPNSYFLIKGFAEQESIKRFFIQ- 601
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A+ G++ R+ F A E + R L ADV LDT NG TT+++ LW G PVVT
Sbjct: 602 LAEEEGVEGDRLRFLPPAPSEA-IHRANLGIADVVLDTYPYNGATTTLETLWMGIPVVTK 660
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G+ A+ + + + +G E I+ T +EY + IRLG D
Sbjct: 661 VGKQFAACNSYTMMMNVGVSEGISWTDQEYVEWGIRLGKD 700
>gi|218437263|ref|YP_002375592.1| hypothetical protein PCC7424_0256 [Cyanothece sp. PCC 7424]
gi|218169991|gb|ACK68724.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
Length = 719
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 82/160 (51%), Gaps = 6/160 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---- 99
R+ + DAI+Y + +K P T+++ + +LK VPNS + L E IQ
Sbjct: 520 REDLNIANDAIIYLSTQTGFKRHPDTIRLQMKILKEVPNS--YFLIQGLTDEEKIQQLFT 577
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
AQ G++ +R+ F E + +ADV LDT NG TT++D+LW G PVVT
Sbjct: 578 EIAQQEGVNPNRLRFLPYYPTETYRANLTIADVVLDTYPFNGGTTTLDILWQGIPVVTRV 637
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G+ ASR + L G E IA ++Y I+LGTD
Sbjct: 638 GQQWASRNGYTLLINAGITEGIAWNDEDYIKWGIQLGTDE 677
>gi|254431323|ref|ZP_05045026.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
putative [Cyanobium sp. PCC 7001]
gi|197625776|gb|EDY38335.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
putative [Cyanobium sp. PCC 7001]
Length = 747
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/148 (35%), Positives = 77/148 (52%), Gaps = 5/148 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ----ALGLDQ 109
+V+ FN + K+ PST+ +W +L+AVP S L LLK P++ + ++ + Q A G+
Sbjct: 539 VVFGCFNNVMKLSPSTIALWARILQAVPESSL-LLKAPSLRDGLVRHSFQERFAAHGIGP 597
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
HR+ ++ D+ LD NG TT++ LW G PVV L G SR+ A
Sbjct: 598 HRLRLEGPTELSVMMQTYGQIDIGLDPFPYNGGTTTLQALWMGVPVVVLEGGNFVSRMGA 657
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGT 197
S L +LG PE I Y IA+ L +
Sbjct: 658 SFLTSLGRPEWIGADPDSYARIAVALAS 685
>gi|256772634|emb|CAX46402.1| putative SPINDLY protein [Rosa lucieae]
Length = 916
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 81/149 (54%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQH 110
I + +FN L KI P LQ+W +L A+PNS L + P ++ +T + LGL+
Sbjct: 662 ITFGSFNNLAKITPKVLQVWARILSAIPNSRLVVKCKPFSCDSVRERFLSTLEQLGLEPL 721
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 722 RVDLLPLILLNYDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 781
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+T+G LIA+ +EY +A++L +D
Sbjct: 782 SILSTVGLGNLIAKNEEEYVQLAVQLASD 810
>gi|113476017|ref|YP_722078.1| hypothetical protein Tery_2387 [Trichodesmium erythraeum IMS101]
gi|110167065|gb|ABG51605.1| Tetratricopeptide TPR_2 [Trichodesmium erythraeum IMS101]
Length = 745
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/163 (30%), Positives = 88/163 (53%), Gaps = 9/163 (5%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT- 101
R+ G+P DA+VY + +K +P T+++ + +LK VP S F +G+++ +
Sbjct: 539 KREDLGIPSDAVVYFTAQRGHKYNPDTVRLQMKILKEVPQS---YFLFKNLGDSDTLSEL 595
Query: 102 ----AQALGLDQHRIL-FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVV 156
A++ G+ R++ V+ ++ H +AD+ LDT NG TT+++ LW P+V
Sbjct: 596 LIDIAESEGISSDRLIPIEEVSLEQTHRANLGIADIVLDTYPYNGATTTLETLWMCIPMV 655
Query: 157 TLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
T G+ A+R + + + G E IA + +EY + IRLG D
Sbjct: 656 TRVGQQFAARNSYTMMMNTGITEGIAWSDEEYVEWGIRLGKDE 698
>gi|428162823|gb|EKX31932.1| hypothetical protein GUITHDRAFT_121883 [Guillardia theta CCMP2712]
Length = 672
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 87/184 (47%), Gaps = 33/184 (17%)
Query: 18 NGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKI------DPSTLQ 71
N L+ N+ + + +E P +R++ G +D I+ FNQ+ D T
Sbjct: 444 NILSKNEKVRRITSDDEQP------TRRELGFDDDDIILACFNQVNAHAVVVCPDNRTQI 497
Query: 72 MWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLAD 131
M + + K P + G+D RI+ +E R LAD
Sbjct: 498 MPLQLYKIEP---------------------EKRGVDPSRIVSHEQFPREREFRVKGLAD 536
Query: 132 VCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDI 191
V LDTPL N HTT+ DVLW+G P++TLPGE+ RVA+ + + G ++ R +Y+++
Sbjct: 537 VFLDTPLFNAHTTAGDVLWSGVPILTLPGESFVQRVASGMVKSAGMQWMVTRNLLDYEEL 596
Query: 192 AIRL 195
A+ L
Sbjct: 597 AVAL 600
>gi|427707007|ref|YP_007049384.1| hypothetical protein Nos7107_1593 [Nostoc sp. PCC 7107]
gi|427359512|gb|AFY42234.1| Tetratricopeptide TPR_1 repeat-containing protein [Nostoc sp. PCC
7107]
Length = 739
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 85/162 (52%), Gaps = 7/162 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R + +P DA+VY + + +K P ++ + +LK VPNS L+K A E+ + AQ
Sbjct: 538 RDELDIPNDAVVYFSGQRGFKRHPDITRLQLKILKEVPNSYF-LIKGLADEESTHKFFAQ 596
Query: 104 ALG---LDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
+D R+ F A E V R L AD+ LDT NG TT+++ LW P+VT
Sbjct: 597 IAKEEEVDYARLRFL-ADAPTEGVHRANLDIADIVLDTYPYNGATTTLETLWMCIPMVTR 655
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
GE A+R + + + G E IA T +EY + IRLG D D
Sbjct: 656 VGEQFAARNSYTMMMNAGISEGIAWTDEEYVEWGIRLGKDAD 697
>gi|196232945|ref|ZP_03131794.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
gi|196222923|gb|EDY17444.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
Length = 808
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 49/142 (34%), Positives = 70/142 (49%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
+ + N K++ L++W VL AV ++ L L + G+ R+
Sbjct: 615 LTFGCLNNFCKVNEPLLRLWARVLVAVKDARLVLSSPRGSHRQRTLDFLERAGVAPRRVE 674
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F + ++ DV LDT NGHTTS+D LW G PVV+L G+ SR SQL+
Sbjct: 675 FVEPCPRRAYLELYHQLDVVLDTFPYNGHTTSLDALWMGVPVVSLAGKPAVSRAGLSQLS 734
Query: 174 TLGCPELIARTHKEYQDIAIRL 195
LG PEL+A + EY +A RL
Sbjct: 735 NLGLPELVAHSEDEYVAVAARL 756
>gi|242081821|ref|XP_002445679.1| hypothetical protein SORBIDRAFT_07g024110 [Sorghum bicolor]
gi|241942029|gb|EES15174.1| hypothetical protein SORBIDRAFT_07g024110 [Sorghum bicolor]
Length = 910
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/150 (36%), Positives = 81/150 (54%), Gaps = 6/150 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
+ + +FN L KI P LQ+W +L AVPNS L + P E NI+ +T + LGL+
Sbjct: 646 VTFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCE-NIRQKFLSTLEELGLES 704
Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 705 LRVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGAVHAHNVG 764
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G L+A+T EY +A+ L +D
Sbjct: 765 VSLLSKVGLGRLVAKTEDEYVSLALGLASD 794
>gi|212274891|ref|NP_001130413.1| uncharacterized protein LOC100191509 [Zea mays]
gi|194689056|gb|ACF78612.1| unknown [Zea mays]
Length = 567
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/150 (36%), Positives = 81/150 (54%), Gaps = 6/150 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
+ + +FN L KI P LQ+W +L +VPNS L + P E NI+ AT + LGL+
Sbjct: 302 VTFGSFNNLAKITPKVLQVWARILCSVPNSRLVVKCKPFCCE-NIRQKFLATLEELGLES 360
Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 361 LRVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGAVHAHNVG 420
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G L+A+T EY +A+ L +D
Sbjct: 421 VSLLSKVGLGRLVAKTEDEYVSLALDLASD 450
>gi|288963063|ref|YP_003453342.1| TPR repeat protein [Azospirillum sp. B510]
gi|288915315|dbj|BAI76798.1| TPR repeat protein [Azospirillum sp. B510]
Length = 1000
Score = 84.3 bits (207), Expect = 2e-14, Method: Composition-based stats.
Identities = 56/164 (34%), Positives = 85/164 (51%), Gaps = 10/164 (6%)
Query: 35 VPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL-LKFPAV 93
VPQ + SR G+PED V+ ++Q+ KI+ TL +W+ VL+ PNSIL L P +
Sbjct: 789 VPQPL---SRSALGIPEDCFVFLGYHQVMKINKKTLDLWMEVLRNCPNSILILNWTNPWI 845
Query: 94 GEANIQATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDT-PLCNGHTTSMDVLWT 151
I+ A++ G+D RI+F E EH+ R +AD+ LDT P + T D L+
Sbjct: 846 ----IRNLARSHGIDASRIMFFRFVRTEVEHLMRLGVADLFLDTNPFTSAALTGADALYM 901
Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
P +TL L SR + +G + + KE+ + A+ L
Sbjct: 902 KVPRITLVEANLYSRFGQIMMNGIGLNDWVCHDRKEFIERAVAL 945
>gi|334119237|ref|ZP_08493324.1| methyltransferase FkbM family [Microcoleus vaginatus FGP-2]
gi|333458708|gb|EGK87325.1| methyltransferase FkbM family [Microcoleus vaginatus FGP-2]
Length = 2742
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/169 (33%), Positives = 93/169 (55%), Gaps = 16/169 (9%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF-PAV-----GEA 96
+R G+ E+++V+ + LYK+ P + W ++ AVPNS+L+L+ F P+ G A
Sbjct: 2076 TRNSIGISEESVVFISSANLYKLVPELRETWAKIMAAVPNSVLFLMPFGPSWTNHYPGGA 2135
Query: 97 ---NIQATAQALGLDQHRIL----FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVL 149
N++A G++ +R+ F N A +E ++ L DV LD+ G T+ +D L
Sbjct: 2136 FVNNMKAVFAKYGIESNRLRVLKSFPNRADVKEVLK---LGDVYLDSYPYAGTTSLVDPL 2192
Query: 150 WTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G P V G+TL SR+ A+ + +L P+LIA + Y +A+ LGT+
Sbjct: 2193 EVGLPTVVRDGDTLRSRMGAAVVRSLSMPDLIANSEASYIQLAVTLGTN 2241
>gi|356505394|ref|XP_003521476.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY-like [Glycine
max]
Length = 919
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 80/149 (53%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
+ + +FN L KI P LQ+W +L A+PNS L + P ++ Q +T + LGL+
Sbjct: 652 VTFGSFNNLAKITPKVLQVWARILCAIPNSRLVVKCKPFCSDSVRQRFLSTLEQLGLEPL 711
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 712 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 771
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G LIAR EY ++A +L +D
Sbjct: 772 SLLSKVGLEHLIARNEDEYVELAQQLASD 800
>gi|186477743|ref|YP_001859213.1| TPR repeat-containing protein [Burkholderia phymatum STM815]
gi|184194202|gb|ACC72167.1| TPR repeat-containing protein [Burkholderia phymatum STM815]
Length = 708
Score = 84.3 bits (207), Expect = 3e-14, Method: Composition-based stats.
Identities = 54/152 (35%), Positives = 78/152 (51%), Gaps = 5/152 (3%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLD 108
AI + FN L K++ + + +W VL A+P S L L L PAV A + A A G+D
Sbjct: 488 AITFGCFNHLVKLNEAVVALWARVLDAMPGSRLLLKTRQLDDPAVQHATL-ARFAAHGID 546
Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R++ + + E + D+ LD G TTS++ LW G PV+T GE S V
Sbjct: 547 GARLMLEGQSPRAELLAAYNRVDIALDPFPYAGGTTSIEALWMGVPVLTRRGERFLSHVG 606
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S + T+G P+ IA +Y A+R+G RD
Sbjct: 607 ESIVDTVGLPDWIASDDADYIAKAVRVGVQRD 638
>gi|428227088|ref|YP_007111185.1| family 2 glycosyl transferase [Geitlerinema sp. PCC 7407]
gi|427986989|gb|AFY68133.1| glycosyl transferase family 2 [Geitlerinema sp. PCC 7407]
Length = 1249
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 90/181 (49%), Gaps = 12/181 (6%)
Query: 30 ATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK 89
+ G E+ S ++ +RQ G+ ++ +V+C+ +KI P+ LQ W+ +L+ VPNS+L L+
Sbjct: 891 SYGTEIASSSLVVTRQDLGIKDEQVVFCSGANFFKITPALLQSWLEILQNVPNSLLLLMP 950
Query: 90 F-----PAVGEANIQATAQ------ALGLDQHRILFSNVAAKEEHVRR-GQLADVCLDTP 137
F A + Q Q L DQ +L + V++ +LAD+ LD+
Sbjct: 951 FGPNWSNAYPKGYFQKYLQNAFVEAGLSKDQFLVLDPDPVPNRADVKQYFKLADIYLDSY 1010
Query: 138 LCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
G T+ ++ L P+VT GE + A L LG +++A EY AI LGT
Sbjct: 1011 PFAGTTSLIEPLEVNLPIVTRVGEYFRGAMGAGILRALGLVQMVAADEAEYVRKAIALGT 1070
Query: 198 D 198
D
Sbjct: 1071 D 1071
>gi|75318818|sp|O82039.1|SPY_PETHY RecName: Full=Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY; AltName:
Full=PhSPY
gi|3319682|emb|CAA76834.1| SPINDLY protein [Petunia x hybrida]
Length = 932
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 83/150 (55%), Gaps = 6/150 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
+ + +FN L KI P LQ+W +L AVP+S L ++K G +++ + + LGL+
Sbjct: 660 VTFGSFNNLAKITPKVLQVWARILCAVPHSRL-IVKCKPFGCDSVRQRFLSILEQLGLEP 718
Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 719 QRVDLVPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMGGSVHAHNVG 778
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L T+G +L+AR EY ++AI+L +D
Sbjct: 779 VSLLKTVGLRKLVARNEDEYVELAIQLASD 808
>gi|356500631|ref|XP_003519135.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY-like [Glycine
max]
gi|401721241|gb|AFP99901.1| putative UDP-N-acetylglucosamin [Glycine max]
Length = 928
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 80/149 (53%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
+ + +FN L KI P LQ+W +L A+PNS L + P ++ Q +T + LGL+
Sbjct: 658 VTFGSFNNLAKITPKVLQVWAKILCAIPNSRLVVKCKPFCCDSVRQRFLSTLEKLGLEPL 717
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 718 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 777
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G LIA+ EY +A++L +D
Sbjct: 778 SLLSKVGLGNLIAKNEDEYVKLAVKLASD 806
>gi|434386708|ref|YP_007097319.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Chamaesiphon minutus PCC 6605]
gi|428017698|gb|AFY93792.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Chamaesiphon minutus PCC 6605]
Length = 738
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 87/162 (53%), Gaps = 5/162 (3%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK---FPAVGEANIQ 99
R+ +P+DAI+Y + +K P T+++ + +LK+VPNS L+K P +
Sbjct: 524 KRRDLNIPDDAIIYWSGQVGHKRHPDTVRLQLRILKSVPNSYF-LIKGDTDPDIIREFFG 582
Query: 100 ATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A G++ R+ F NV + H +ADV LDT NG TT++++LW G P+VT
Sbjct: 583 KIAAEEGVEFDRLRFLGNVPDEYTHRANLGIADVALDTYPYNGATTTLEILWMGIPLVTR 642
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
G+ +R + + + G E IA +EY + I+LG DR+
Sbjct: 643 VGQQFIARNSYTFMLNAGIEEGIAWNAEEYVEWGIKLGLDRE 684
>gi|391232225|ref|ZP_10268431.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Opitutaceae bacterium TAV1]
gi|391221886|gb|EIQ00307.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Opitutaceae bacterium TAV1]
Length = 637
Score = 84.3 bits (207), Expect = 3e-14, Method: Composition-based stats.
Identities = 56/150 (37%), Positives = 77/150 (51%), Gaps = 6/150 (4%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLD 108
++ + FN K S +W +L AVP S L LLK + EA ++ A Q GL
Sbjct: 439 SVTFGCFNSPTKFTDSQFALWARILAAVPASRL-LLKGAGLEEAPVRELLLARLQRAGLP 497
Query: 109 QHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
R+ L H+ + D+ LDT NG TT+ + LW G PV+TL G A+RV
Sbjct: 498 ADRVELLPRTDDTASHLALYRRVDIALDTFPYNGTTTTCEALWMGRPVITLAGNRHAARV 557
Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGT 197
+AS LA +GCPE IA ++Y AI L +
Sbjct: 558 SASLLAAIGCPEWIASCQEDYVARAIGLAS 587
>gi|428306910|ref|YP_007143735.1| hypothetical protein Cri9333_3397 [Crinalium epipsammum PCC 9333]
gi|428248445|gb|AFZ14225.1| hypothetical protein Cri9333_3397 [Crinalium epipsammum PCC 9333]
Length = 719
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/161 (34%), Positives = 88/161 (54%), Gaps = 9/161 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---- 99
R G+P DA++Y + +K P T ++ + ++K VPNS L+K + EA I+
Sbjct: 517 RDDLGIPADAVIYFSAQHGHKRHPDTTRLQMQIIKQVPNSYF-LIKGRS-DEATIKQFFT 574
Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
A++ G+D R+++ E +V R L AD+ LDT NG TT+++ LW G P+VT
Sbjct: 575 KIAESEGVDPSRLIYLPRDINE-YVHRANLGIADIVLDTYPYNGATTTLETLWMGVPLVT 633
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G+ A+R + + L G E IA T EY + ++LG D
Sbjct: 634 RVGKQFAARNSYTFLMNAGVTEGIAWTDAEYVEWGVKLGLD 674
>gi|254413207|ref|ZP_05026978.1| hypothetical protein MC7420_694 [Coleofasciculus chthonoplastes PCC
7420]
gi|196179827|gb|EDX74820.1| hypothetical protein MC7420_694 [Coleofasciculus chthonoplastes PCC
7420]
Length = 745
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 86/160 (53%), Gaps = 7/160 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA----VGEANIQ 99
R +P DA++Y + +K P T ++ + +LK VPNS L+K A V + IQ
Sbjct: 542 RDHLDIPNDAVIYLSAQMGHKRHPDTARLQMQILKQVPNSYF-LIKGIANQESVKKFFIQ 600
Query: 100 ATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A+ G+D R+ F VA + H +ADV LDT NG TT+++ LW G P+VT
Sbjct: 601 -IAEEEGVDCQRLRFLPEVALEATHRANLGIADVVLDTYPYNGATTTLETLWMGIPLVTR 659
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G+ ++R + + + G E IA T +EY + +RLG D
Sbjct: 660 VGQQFSARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGKD 699
>gi|332712009|ref|ZP_08431939.1| hypothetical protein LYNGBM3L_68530 [Moorea producens 3L]
gi|332349337|gb|EGJ28947.1| hypothetical protein LYNGBM3L_68530 [Moorea producens 3L]
Length = 344
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 83/159 (52%), Gaps = 5/159 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQA 100
R +P DA +Y + + YK +P T ++ + ++KAVPNS L+K + ++
Sbjct: 147 RDHLDIPNDATIYLSSQKGYKRNPDTTRLQMKIIKAVPNSYF-LIKGKSDQDSITRFFNQ 205
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A+ G+D R+ F V + H +ADV LDT NG TT+++ LW G P+VT
Sbjct: 206 IAEEEGVDCDRLRFLPEVPTEAVHRANLGIADVVLDTYPYNGATTTLETLWMGIPLVTRV 265
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GE +R + + + G E IA T +Y + IRLG D
Sbjct: 266 GEQFVARNSYTMMMNAGITEGIAWTDDQYIEWGIRLGKD 304
>gi|356534448|ref|XP_003535766.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY-like [Glycine
max]
Length = 929
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 80/149 (53%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
+ + +FN L KI P LQ+W +L A+PNS L + P ++ Q +T + LGL+
Sbjct: 659 VTFGSFNNLAKITPKVLQVWAKILCAIPNSRLVVKCKPFCCDSVRQRFLSTLEKLGLEPL 718
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 719 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 778
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G LIA+ EY +A++L +D
Sbjct: 779 SLLSKVGLGNLIAKNEDEYVKLALKLASD 807
>gi|411117158|ref|ZP_11389645.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Oscillatoriales cyanobacterium JSC-12]
gi|410713261|gb|EKQ70762.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Oscillatoriales cyanobacterium JSC-12]
Length = 741
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 55/160 (34%), Positives = 84/160 (52%), Gaps = 7/160 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA--- 100
R+ +P DAIVY + + YK T ++ + ++K VPNS + F + +IQA
Sbjct: 537 REDLDIPSDAIVYLSTQKGYKRHRETAKLQMRIIKNVPNSYFLIKGF--ADQNSIQAFFY 594
Query: 101 -TAQALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A+ G++ ++ F A E H +ADV LDT NG TT+++ LW G P+VT
Sbjct: 595 QIAEEEGVNPEQLRFLPDAPSEAVHRANMMIADVVLDTFPYNGATTTLETLWMGVPLVTR 654
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GE A+R + + L +G IA T +EY + +R G D
Sbjct: 655 VGEQFAARNSYTMLKNVGVEAGIAWTDEEYVEWGVRFGKD 694
>gi|16330340|ref|NP_441068.1| hypothetical protein slr1816 [Synechocystis sp. PCC 6803]
gi|383322081|ref|YP_005382934.1| hypothetical protein SYNGTI_1172 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325250|ref|YP_005386103.1| hypothetical protein SYNPCCP_1171 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491134|ref|YP_005408810.1| hypothetical protein SYNPCCN_1171 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436401|ref|YP_005651125.1| hypothetical protein SYNGTS_1172 [Synechocystis sp. PCC 6803]
gi|451814498|ref|YP_007450950.1| hypothetical protein MYO_111820 [Synechocystis sp. PCC 6803]
gi|1652829|dbj|BAA17748.1| slr1816 [Synechocystis sp. PCC 6803]
gi|339273433|dbj|BAK49920.1| hypothetical protein SYNGTS_1172 [Synechocystis sp. PCC 6803]
gi|359271400|dbj|BAL28919.1| hypothetical protein SYNGTI_1172 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274570|dbj|BAL32088.1| hypothetical protein SYNPCCN_1171 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277740|dbj|BAL35257.1| hypothetical protein SYNPCCP_1171 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407958259|dbj|BAM51499.1| hypothetical protein BEST7613_2568 [Synechocystis sp. PCC 6803]
gi|451780467|gb|AGF51436.1| hypothetical protein MYO_111820 [Synechocystis sp. PCC 6803]
Length = 717
Score = 83.6 bits (205), Expect = 4e-14, Method: Composition-based stats.
Identities = 53/161 (32%), Positives = 87/161 (54%), Gaps = 5/161 (3%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE---ANIQ 99
SR+Q G+ + ++Y + K +P+ L++ + +L++VP+S+ +L+K E A ++
Sbjct: 516 SREQLGIKPNTVIYLSVQNKIKSNPNILKLQLQILQSVPDSV-FLIKGGGDNELIRAMVE 574
Query: 100 ATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A+ +GLD HRI F E H +ADV LDT NG TT+++ LW P+VT
Sbjct: 575 QLAKEMGLDLHRIRFLPKDGDEMTHRANLSIADVVLDTYPYNGATTTLETLWMEVPLVTR 634
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G+ ASR + + L L E +A +EY R G ++
Sbjct: 635 VGKQFASRNSYTFLTQLNIREGLAWNDQEYVYWGTRFGLEK 675
>gi|269839835|ref|YP_003324528.1| hypothetical protein Tter_2822 [Thermobaculum terrenum ATCC
BAA-798]
gi|269791565|gb|ACZ43705.1| Tetratricopeptide TPR_2 repeat protein [Thermobaculum terrenum ATCC
BAA-798]
Length = 529
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/148 (36%), Positives = 79/148 (53%), Gaps = 5/148 (3%)
Query: 55 VYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQH 110
V+ FN+L KI P L +W +L A+P + L L L+ P V + + + G+
Sbjct: 332 VFGCFNRLSKIGPEVLDLWAKILLALPRARLRLIATGLQDP-VTSSRLMRALEGRGVAGE 390
Query: 111 RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS 170
R+ + + + + DV LDT +G TTS++ LW G PVVTL G +A R +S
Sbjct: 391 RLELLSPMPRTDLLATYNDIDVALDTLPYSGCTTSLEALWMGVPVVTLEGADMAGRATSS 450
Query: 171 QLATLGCPELIARTHKEYQDIAIRLGTD 198
L G EL++RT +EY DIA+ LG D
Sbjct: 451 LLRWAGLQELVSRTQEEYIDIALGLGRD 478
>gi|357148919|ref|XP_003574938.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY-like
[Brachypodium distachyon]
Length = 959
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/150 (35%), Positives = 81/150 (54%), Gaps = 6/150 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
I + +FN L KI P LQ+W +L AVPNS L ++K G +I+ +T + LGL+
Sbjct: 651 ITFGSFNNLAKITPKVLQVWARILCAVPNSRL-VVKCKPFGCDSIRQKFLSTLEELGLES 709
Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 710 IRVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVG 769
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G L+A++ EY +A+ L +D
Sbjct: 770 VSLLTKVGLGRLVAKSEDEYVSLALDLASD 799
>gi|449526080|ref|XP_004170042.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY-like [Cucumis
sativus]
Length = 925
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 81/149 (54%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
I + +FN L KI P L++W +L A+PNS L + P ++ Q +T + LGL+
Sbjct: 660 ITFGSFNNLAKITPKVLEVWARILCAIPNSRLVVKCKPFCCDSVRQRFLSTLEQLGLESQ 719
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 720 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 779
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G L+A+ +EY +A++L +D
Sbjct: 780 SLLSKVGLGHLVAKNEEEYVKLALQLASD 808
>gi|375152092|gb|AFA36504.1| putative O-linked N-acetyl glucosamine transferase, partial [Lolium
perenne]
Length = 225
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 32/58 (55%), Positives = 41/58 (70%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
R YGLPED ++ FNQLYK+DP W N++K VPNS+LWLL+FPA GE ++A
Sbjct: 168 KRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNIVKRVPNSVLWLLRFPATGEMRVKA 225
>gi|75318847|sp|O82422.1|SPY_HORVU RecName: Full=Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY; AltName:
Full=HvSPY
gi|3617837|gb|AAC36055.1| gibberellin action negative regulator SPY [Hordeum vulgare subsp.
vulgare]
gi|326515544|dbj|BAK07018.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 944
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 79/149 (53%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
I + +FN L KI P +Q+W +L AVPNS L + P ++ Q +T + LGL+
Sbjct: 646 ITFGSFNNLAKITPKVMQVWARILCAVPNSRLVVKCKPFCCDSIRQKFLSTLEELGLESL 705
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 706 RVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 765
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G L+A+T EY +A+ L +D
Sbjct: 766 SLLTKVGLGRLVAKTEDEYVSLALDLASD 794
>gi|334120476|ref|ZP_08494556.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
vaginatus FGP-2]
gi|333456454|gb|EGK85086.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
vaginatus FGP-2]
Length = 726
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 62/181 (34%), Positives = 89/181 (49%), Gaps = 15/181 (8%)
Query: 35 VPQSIV-ITSRQQYGLPE-----------DAIVYCNFNQLYKIDPSTLQMWVNVLKAVPN 82
+P S V ++ Q+YG+ D IVY K + ++ + +LK VPN
Sbjct: 504 MPDSFVAVSGFQRYGVDPVALRRSNRIGLDQIVYLCVAPGRKFNHELVKAQIAILKQVPN 563
Query: 83 SILW--LLKFPAVGEANIQATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLC 139
SIL L AV EA +A + +HRI F A +EEH LADV LD+
Sbjct: 564 SILIHKALGDAAVFEAAYHQACEAEKVSKHRIKFLPRFATEEEHRTIYLLADVLLDSYPY 623
Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
NG T +++ LW PVVT GE SR+ S L LG +A + EY ++ ++LG +R
Sbjct: 624 NGGTHTLEALWCNLPVVTRTGEQFLSRMGYSFLQALGIETGVALSWSEYVEVGVKLGKER 683
Query: 200 D 200
+
Sbjct: 684 E 684
>gi|253702410|ref|YP_003023599.1| hypothetical protein GM21_3824 [Geobacter sp. M21]
gi|251777260|gb|ACT19841.1| Tetratricopeptide TPR_2 repeat protein [Geobacter sp. M21]
Length = 1106
Score = 83.6 bits (205), Expect = 4e-14, Method: Composition-based stats.
Identities = 51/149 (34%), Positives = 76/149 (51%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQATAQALGLDQH 110
+ + +FN KI P T+ +W VL+AVP S L L + A ++ G+++
Sbjct: 429 VTFGSFNNPAKITPETVLLWSGVLRAVPGSHLLLKGYSLACAETRLRLEEAFAGHGIERE 488
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L N + +H+ D+ LD+ NG TTS + LW G PVVTL G + SRV A
Sbjct: 489 RLELMGNTPSYRDHLALYDRVDIALDSYPYNGTTTSCEALWMGVPVVTLAGSSHRSRVGA 548
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L LG L+A +++ +A L D
Sbjct: 549 SLLQALGLEGLVAHEARKFVALAAALAGD 577
>gi|449451161|ref|XP_004143330.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY-like [Cucumis
sativus]
Length = 920
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 81/149 (54%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
I + +FN L KI P L++W +L A+PNS L + P ++ Q +T + LGL+
Sbjct: 655 ITFGSFNNLAKITPKVLEVWARILCAIPNSRLVVKCKPFCCDSVRQRFLSTLEQLGLESQ 714
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 715 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 774
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G L+A+ +EY +A++L +D
Sbjct: 775 SLLSKVGLGHLVAKNEEEYVKLALQLASD 803
>gi|326512858|dbj|BAK03336.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 644
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 79/149 (53%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
I + +FN L KI P +Q+W +L AVPNS L + P ++ Q +T + LGL+
Sbjct: 346 ITFGSFNNLAKITPKVMQVWARILCAVPNSRLVVKCKPFCCDSIRQKFLSTLEELGLESL 405
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 406 RVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 465
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G L+A+T EY +A+ L +D
Sbjct: 466 SLLTKVGLGRLVAKTEDEYVSLALDLASD 494
>gi|393770118|ref|ZP_10358624.1| tpr domain protein [Methylobacterium sp. GXF4]
gi|392724408|gb|EIZ81767.1| tpr domain protein [Methylobacterium sp. GXF4]
Length = 713
Score = 83.6 bits (205), Expect = 5e-14, Method: Composition-based stats.
Identities = 55/151 (36%), Positives = 78/151 (51%), Gaps = 6/151 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQ 109
I + FN+L K+ + L W ++L AVP S L++ + P V EA ++ A GL
Sbjct: 520 ITFGCFNRLTKVCDAVLTTWASILDAVPESRLFMVVGDIGTPEVREA-VETRLTAAGLPL 578
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R++F + H ++ D+ LD NG TTS D L G P V L G+ A+RV +
Sbjct: 579 DRVIFQPRVSSGYHELYHRV-DIALDPYPYNGGTTSFDTLSMGVPFVALSGDHAAARVGS 637
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S L +G PEL+A + Y IA L DRD
Sbjct: 638 SILRVIGLPELVADSQDAYVAIARDLALDRD 668
>gi|225440809|ref|XP_002281883.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY [Vitis vinifera]
gi|297740152|emb|CBI30334.3| unnamed protein product [Vitis vinifera]
Length = 914
Score = 83.2 bits (204), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 80/149 (53%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
I + +FN L KI P LQ+W +L AVPNS L + P ++ Q +T + LGL+
Sbjct: 658 ITFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSVRQRFLSTLEQLGLESL 717
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 718 RVDLLPLILLNHDHMQAYALMDISLDTFPYAGTTTTCESLFMGVPCVTMAGSVHAHNVGV 777
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G L+A+T EY +A++L +D
Sbjct: 778 SLLNKVGLGRLVAKTEDEYVQLALQLASD 806
>gi|195423963|gb|ACF96937.1| SPINDLY [Sinningia speciosa]
Length = 934
Score = 83.2 bits (204), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 79/149 (53%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQH 110
I + +FN L KI P LQ+W +L AVPNS L + P ++ +T + LGL+
Sbjct: 659 ITFGSFNNLAKITPRVLQVWARILCAVPNSRLIVKCKPFCSDSVRLQFLSTLEKLGLESL 718
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P +++ G A V
Sbjct: 719 RVDLLPLILLNRDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCISMGGHVHAHNVGV 778
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L T+G L+A+ EY +A++L +D
Sbjct: 779 SLLNTVGLSNLVAKNEDEYVQLALQLASD 807
>gi|196232229|ref|ZP_03131083.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
gi|196223597|gb|EDY18113.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
Length = 701
Score = 83.2 bits (204), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 5/149 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQ 109
+ + ++N + K T+++W +L+ +P + L L PA E ++ +D
Sbjct: 506 VTFASYNIVPKTTGQTIELWSRILRELPEARLTLGNRGFGTPAAVE-RVRKRFAERAIDP 564
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
RI+F + E + AD+ LDT NG T+ + +W G PVVT PGET ASR
Sbjct: 565 SRIIFQGWVPRAELLAAYNQADIALDTLPYNGGLTTCEAMWMGVPVVTCPGETFASRHGL 624
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ L G PE IAR +Y IA+ L D
Sbjct: 625 AHLTAAGVPETIARDPDDYVKIAVDLARD 653
>gi|425445185|ref|ZP_18825221.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
gi|389734919|emb|CCI01526.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
Length = 713
Score = 83.2 bits (204), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 86/172 (50%), Gaps = 30/172 (17%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R G+PEDAI Y K +P T+++ + VLKAVPNS L +IQ +
Sbjct: 513 RDDLGIPEDAINYLTVQTGVKRNPETIRLQLQVLKAVPNSYL-----------SIQGLSD 561
Query: 104 ALGLDQHRILFSNVAAKE----EHVR----------RGQL--ADVCLDTPLCNGHTTSMD 147
A L++ LF VA +E E ++ R L ADV LDT G T++D
Sbjct: 562 AKSLEK---LFFKVAEEEGINYERLKIFPLYPTGIYRANLRIADVVLDTYPFTGGMTTLD 618
Query: 148 VLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
VLW G P+VT G+ +SR + + + G E IA + +EY D I+LG D
Sbjct: 619 VLWMGIPLVTKVGQQWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDE 670
>gi|220906369|ref|YP_002481680.1| hypothetical protein Cyan7425_0933 [Cyanothece sp. PCC 7425]
gi|219862980|gb|ACL43319.1| TPR repeat-containing protein [Cyanothece sp. PCC 7425]
Length = 738
Score = 82.8 bits (203), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 90/161 (55%), Gaps = 7/161 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF----PAVGEANIQ 99
R+ G+P DA VY + + K +P +++ + +LKAVP S L+K AV E +Q
Sbjct: 534 REDLGIPADAPVYFSSQRGNKRNPHLIRLQMQILKAVPGSYF-LIKGQSDQAAVQEVFLQ 592
Query: 100 ATAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A+A G+ R+ F V EEH ++A++ LDT NG TT+++ LW PVVTL
Sbjct: 593 -MAEAEGISVDRLKFPPLVNTCEEHRANLRIANIILDTYPYNGATTTLEALWMEVPVVTL 651
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G+ ++R + + L+ G E IA + EY + +RLG D+
Sbjct: 652 VGQQFSARNSYTFLSNAGVTEGIAWSDAEYVEWGVRLGQDQ 692
>gi|381160368|ref|ZP_09869600.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
gi|380878432|gb|EIC20524.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
Length = 717
Score = 82.8 bits (203), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 77/151 (50%), Gaps = 5/151 (3%)
Query: 52 DAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGL 107
D++ + +FN + K+ T+++W VLK VP S L + L P V A + G
Sbjct: 515 DSVTFASFNNILKLSERTIRLWSRVLKEVPGSNLLIRDKVLADPGV-RAKVLGEFVKSGT 573
Query: 108 DQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
+++ + ++EE++R D+ L G TT+ + LW G PVV L G T A R+
Sbjct: 574 CPEQLVMESSVSREEYLREYCDVDISLSPTPFGGGTTTAEALWMGVPVVFLSGGTWAGRI 633
Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L T+G PEL+A+ + Y IA L D
Sbjct: 634 GESILRTVGLPELVAQDEERYVGIATALAGD 664
>gi|425464283|ref|ZP_18843605.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389833753|emb|CCI21475.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 716
Score = 82.4 bits (202), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 82/159 (51%), Gaps = 2/159 (1%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
R G+PEDAI Y K +P T+++ + +LKAVPNS L + L E
Sbjct: 516 RDDLGIPEDAINYLTAQTGVKRNPETIRLQLQILKAVPNSYLSIQGLSDAKSVEKLFFKV 575
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A+ G++ R+ + + ++ADV LDT G T++DVLW G P+VT G+
Sbjct: 576 AEEEGINYERLKIFPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 635
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+SR + + + G E IA + +EY D I+LG D +
Sbjct: 636 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDEN 674
>gi|354566941|ref|ZP_08986112.1| TPR repeat-containing protein [Fischerella sp. JSC-11]
gi|353544600|gb|EHC14054.1| TPR repeat-containing protein [Fischerella sp. JSC-11]
Length = 740
Score = 82.4 bits (202), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 86/167 (51%), Gaps = 7/167 (4%)
Query: 38 SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE-- 95
S+ +R+Q +P DA+VY + +K ++ + +LK VPNS L+K + E
Sbjct: 532 SVPTITREQLDIPHDAVVYLCAQRGFKRHLDITRLQLRILKEVPNSYF-LIKGISDEEST 590
Query: 96 -ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTG 152
A + A+ G+D + F + E V R L ADV LDT NG TT+++ LW
Sbjct: 591 KAFFEKLAEEEGVDTSKFRFLPIVVAES-VHRANLGIADVVLDTYPYNGATTTLETLWMC 649
Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
P+VT GE A+R + + + G E IA T +EY + +RLG D
Sbjct: 650 IPMVTRVGEQFAARNSYTMMINAGITEGIAWTDEEYVEWGVRLGKDE 696
>gi|350535410|ref|NP_001233937.1| probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY [Solanum
lycopersicum]
gi|75330646|sp|Q8RVB2.1|SPY_SOLLC RecName: Full=Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY; Short=LeSPY
gi|19913115|emb|CAC85168.1| SPY protein [Solanum lycopersicum]
gi|19913117|emb|CAC85169.1| SPY protein [Solanum lycopersicum]
Length = 931
Score = 82.4 bits (202), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 79/149 (53%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
+ + +FN L KI P L++W +L AVP+S L + P ++ Q + + LGL+
Sbjct: 660 VTFGSFNNLAKITPKVLKVWARILSAVPHSRLIVKCKPFCCDSVRQRFLSILEQLGLEPQ 719
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 720 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMGGSVHAHNVGV 779
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L T+G L+AR EY + AI+L +D
Sbjct: 780 SLLKTVGLENLVARNEDEYVESAIQLASD 808
>gi|428182321|gb|EKX51182.1| hypothetical protein GUITHDRAFT_134689 [Guillardia theta CCMP2712]
Length = 726
Score = 82.4 bits (202), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 76/127 (59%), Gaps = 5/127 (3%)
Query: 73 WVNVLKAVP--NSILWLLKFPAVGEANIQATAQA-LGLDQH-RILFSNVAAKEEHVRRGQ 128
W+++L+ ++ LWL FP V + + A+A LG + RI+++N+ + EH++
Sbjct: 533 WMDILRRTTEHHTTLWLPLFPDVAKPQLMRIARAKLGKNATTRIVWTNLFDESEHLQVKS 592
Query: 129 LADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCP-ELIARTHKE 187
LA + LDT L GHT+ D+LW G P VTLPG ++RV AS L L L+ART ++
Sbjct: 593 LATMQLDTFLYCGHTSGADILWAGVPTVTLPGIMQSARVGASLLKGLSLHGHLVARTAED 652
Query: 188 YQDIAIR 194
Y D+ +R
Sbjct: 653 YADVVVR 659
>gi|425468294|ref|ZP_18847324.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9701]
gi|389885054|emb|CCI34705.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9701]
Length = 716
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/158 (34%), Positives = 81/158 (51%), Gaps = 2/158 (1%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
R G+PEDAI Y K +P T+++ + VLKAVPNS L + L E
Sbjct: 516 RDDLGIPEDAINYLTVQTGVKRNPETIRLQLQVLKAVPNSYLSIQGLSDAKSVEKLFFKI 575
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A+ G++ R+ + + ++ADV LDT G T++DVLW G P+VT G+
Sbjct: 576 AEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 635
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+SR + + + G E IA + +EY D I+LG D
Sbjct: 636 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDE 673
>gi|428318600|ref|YP_007116482.1| glycosyl transferase family 2 [Oscillatoria nigro-viridis PCC 7112]
gi|428242280|gb|AFZ08066.1| glycosyl transferase family 2 [Oscillatoria nigro-viridis PCC 7112]
Length = 2237
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 86/179 (48%), Gaps = 12/179 (6%)
Query: 30 ATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL---- 85
AT EV + V +RQ G+ ++ V+ + YKI P + WV +L AVP+SIL
Sbjct: 1560 ATESEV--ATVKPTRQSLGISDNTTVFISGANFYKILPELRESWVKILAAVPDSILILYP 1617
Query: 86 ----WLLKFPAVGEAN-IQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLC 139
W +PA N + A G++ +R+ F V+ QLADV LD+
Sbjct: 1618 FGPAWTRTYPATPFVNNLNAVCAKYGVNNNRLRFVKQLPSRADVKEFLQLADVYLDSYPY 1677
Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G T+ +D L G P V + G L R ++ L L P+LIA Y +A+ LGT+
Sbjct: 1678 AGATSLIDPLQVGLPAVVVEGNALRFRQGSAMLRELQMPDLIAYDEASYIQLAVTLGTN 1736
>gi|358639499|dbj|BAL26796.1| hypothetical protein AZKH_4523 [Azoarcus sp. KH32C]
Length = 585
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 80/150 (53%), Gaps = 5/150 (3%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLD 108
A+ + +FN L K++ + + +W VL+A+P S L L++ A+ + ++ A G+
Sbjct: 390 ALTFGSFNALGKVNDAVIDLWAEVLRALPASRL-LMQARALSDPGSRSCVAARFGERGIA 448
Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ EH+ DVCLDT NGH T++D LW G PV+TL G+ A+R+
Sbjct: 449 SERVEMHGFMPLAEHLALFHRTDVCLDTFPWNGHMTTLDSLWMGVPVLTLKGDRRAARMG 508
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ ++ +G +A + +++ A+ L D
Sbjct: 509 TAIMSAIGLEGFVAESREDFVAKAVGLAGD 538
>gi|428166307|gb|EKX35285.1| hypothetical protein GUITHDRAFT_146593 [Guillardia theta CCMP2712]
Length = 757
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 66/123 (53%), Gaps = 6/123 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAV------PNSILWLLKFPAVGEAN 97
R++ GLP + V C FNQ +K+DP T ++W + ++ + + LWLL+ + N
Sbjct: 604 RKEEGLPPEGPVICYFNQHFKLDPPTFKIWTDAIQKLVSFNSSAHPTLWLLQGSPISHRN 663
Query: 98 IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
++ Q+ GL ++ + + H+RR L D+ +DTP+ N T D L+ P++
Sbjct: 664 LKREFQSAGLPGENLVIAKRVLVKTHIRRASLCDLAVDTPMYNSGATGADTLYAAVPLLH 723
Query: 158 LPG 160
+PG
Sbjct: 724 IPG 726
>gi|374293478|ref|YP_005040513.1| hypothetical protein AZOLI_3149 [Azospirillum lipoferum 4B]
gi|357425417|emb|CBS88304.1| protein of unknown function; putative TPR repeat domains
[Azospirillum lipoferum 4B]
Length = 896
Score = 82.0 bits (201), Expect = 1e-13, Method: Composition-based stats.
Identities = 53/158 (33%), Positives = 80/158 (50%), Gaps = 9/158 (5%)
Query: 48 GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQ 103
GL + + +FN K++ ST+ +W VL AVP + L +LK+ + +++ A
Sbjct: 689 GLADGTFTFGSFNNPTKLNASTIALWARVLHAVPRARL-MLKWRGLTSSSLGGRLLAQFA 747
Query: 104 ALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
ALG+ R++ S E RR D LD NG TT+ D LW G PV+++ G
Sbjct: 748 ALGIAGDRLILSGTTPDPYESYRR---IDCGLDPVFANGGTTTCDSLWMGVPVLSIAGTA 804
Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SR S L+ +G P+L+A Y +A+RL D D
Sbjct: 805 AISRWGISLLSAVGLPDLVADDDDSYVALAVRLAGDPD 842
>gi|220906370|ref|YP_002481681.1| hypothetical protein Cyan7425_0934 [Cyanothece sp. PCC 7425]
gi|219862981|gb|ACL43320.1| TPR repeat-containing protein [Cyanothece sp. PCC 7425]
Length = 739
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 85/159 (53%), Gaps = 5/159 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI---QA 100
R+ G+P DA++Y + + K +P Q+ + +L+ +PNS L + F + Q
Sbjct: 537 REDLGIPADAVIYLSSQRAAKRNPHLTQLQMQILQEMPNSYLLIKGFANQKSLQVWFEQI 596
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A+A G+ R+ F + + E+H ++ADV LDT NG TT+++ LW P+VT
Sbjct: 597 AAEA-GVSSDRLRFVPSCPSDEQHRANLRIADVVLDTYPYNGATTTLETLWMEIPLVTRV 655
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GE A+R + + L G E IA + EY + +RLG +
Sbjct: 656 GEQFAARNSYTFLVNAGVEEGIAWSDAEYVEWGVRLGKE 694
>gi|189425152|ref|YP_001952329.1| hypothetical protein Glov_2093 [Geobacter lovleyi SZ]
gi|189421411|gb|ACD95809.1| TPR repeat-containing protein [Geobacter lovleyi SZ]
Length = 1714
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 74/150 (49%), Gaps = 6/150 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL----WLLKFPAVGEANIQATAQALGLDQ 109
+ + +FN + K P ++ W +L V S++ W ++ + A A G+
Sbjct: 1521 VTFGSFNNISKTSPYVIETWATILNRVEGSVIMLKNWFFSDQSIQQKFYDGFA-AFGITS 1579
Query: 110 HRILFSNVAAK-EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
RI+ A +EH+ Q D+ LDT NG TT+ + LW G PVVT G+ A RV
Sbjct: 1580 DRIILLPAAPDIKEHLCMYQHVDIALDTFPYNGTTTTCEALWMGVPVVTFTGDRHAGRVG 1639
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
AS L +G EL+AR +Y D A+ L +
Sbjct: 1640 ASILNRIGLSELVARDRNDYIDRAVMLAEN 1669
Score = 76.3 bits (186), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 77/154 (50%), Gaps = 5/154 (3%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL--- 105
L + I + +FN KI + +++W +++K VP S L +LKF + +++ + L
Sbjct: 991 LDSEFITFGSFNNPLKISENVVEVWSDIMKRVPKSRL-VLKFSTFKDVSVRRRFRELFST 1049
Query: 106 -GLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
G+ RI F ++ + D+ LDT G TS++ LW G P++TL G T
Sbjct: 1050 YGVSPRRIEFRTFSSPYLMLMEHGDIDIMLDTFPFTGGMTSLNALWMGVPIITLAGTTPI 1109
Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
SR + L LG EL+ EY + A++L D
Sbjct: 1110 SRQTKTFLDLLGLYELVTTNKDEYIETAVKLSFD 1143
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 73/149 (48%), Gaps = 5/149 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALGLDQ 109
I + +FN ++K+ + W VL AV +S L LLK + + + + G+
Sbjct: 404 ITFGSFNSIHKLTAEVVDAWSRVLHAVSHSKL-LLKSKSFSDDVVVDDFIRRFKIHGISS 462
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
+RI+F +++ E ++ D+ LD NG ++ + LW G PVVTL T SR +
Sbjct: 463 NRIIFKKLSSHYEMLQEYGEIDIALDPFPYNGGASTCEALWMGVPVVTLEMGTPISRQSK 522
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ L + EL+A T +Y IA L D
Sbjct: 523 AFLYAIDHHELVASTLDDYVQIAQNLALD 551
>gi|302878329|ref|YP_003846893.1| hypothetical protein Galf_1101 [Gallionella capsiferriformans ES-2]
gi|302581118|gb|ADL55129.1| Tetratricopeptide TPR_1 repeat-containing protein [Gallionella
capsiferriformans ES-2]
Length = 1646
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 78/149 (52%), Gaps = 5/149 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA----LGLDQ 109
I + +FN+L KI T W +L+A+P+S L +LK A+ +A + G+D
Sbjct: 1444 ITFGSFNRLVKISEQTYLAWAKILRALPDSRL-ILKTHALDDAGTRERVSEHFIRAGIDP 1502
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
RI+ ++E + D+ LDT G T+++ L G PVVTL TL RV+A
Sbjct: 1503 ARIILLGKTSREAQLAAFNRVDIALDTFPHGGGMTTLEGLVMGVPVVTLRWPTLTGRVSA 1562
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L TLG P+ IA + EY +AI+ D
Sbjct: 1563 SILTTLGMPDWIAESQDEYVKLAIQKAAD 1591
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 81/149 (54%), Gaps = 5/149 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLDQ 109
+ + ++N+L K + + W +L+AVP+S + + K P + +A +Q +A G+
Sbjct: 759 VTFGSYNRLAKNSEAAYRAWAKILRAVPDSRM-IFKTPVLDDAAVQDQVRAYFTSAGVAA 817
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
RI+ + +EH+ D+ LD G ++++ L G PVVTL +LA R++A
Sbjct: 818 ERIILLGKSPHDEHMAAFNQIDISLDPFPHGGGVSALEGLLMGVPVVTLNWPSLAGRISA 877
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S + TL P+ IA + +EY +AI+ TD
Sbjct: 878 SIMTTLAMPDWIAGSEEEYVQLAIQKATD 906
>gi|340516226|gb|EGR46476.1| glycosyltransferase family 41 [Trichoderma reesei QM6a]
Length = 1738
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/161 (35%), Positives = 80/161 (49%), Gaps = 47/161 (29%)
Query: 51 EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
+DAI+ NFNQLYK E N++ TA+A +
Sbjct: 1542 DDAIILANFNQLYK----------------------------ARETNLRQTAEAWAGAEV 1573
Query: 110 -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET--LASR 166
R++F++VA K +H+ R ++ D+ LDT CN HTT+ DVLW+ TP++TLP + + SR
Sbjct: 1574 ASRLVFTDVAPKNQHINRARVCDLFLDTAECNAHTTAADVLWSSTPLLTLPRYSYKMCSR 1633
Query: 167 VAAS------------QLATLGCPELIARTHKEYQDIAIRL 195
+AAS Q A L ELIA EY+D A L
Sbjct: 1634 MAASILRGALPKSAEGQQAAL---ELIADGETEYEDQAAEL 1671
>gi|428207291|ref|YP_007091644.1| hypothetical protein Chro_2283 [Chroococcidiopsis thermalis PCC
7203]
gi|428009212|gb|AFY87775.1| hypothetical protein Chro_2283 [Chroococcidiopsis thermalis PCC
7203]
Length = 725
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 86/164 (52%), Gaps = 15/164 (9%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN------ 97
R+ +P +A+VY + +K P T+++ + +L VPNS F G+A+
Sbjct: 523 REHLEIPSNAVVYLSSQVGFKRHPDTVRLQMQILSQVPNSY-----FLVKGKADENTIRE 577
Query: 98 -IQATAQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTP 154
A+ G++ R+ F + E V R L ADV LDT NG TT+++ LW G P
Sbjct: 578 FFNKIAEDEGVNPDRLRFLS-RDPSEFVHRANLGIADVVLDTYPYNGATTTLETLWMGIP 636
Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+VT G+ ASR + + + +G E IA T +EY + ++LGTD
Sbjct: 637 LVTKVGQQFASRNSYTFMMNVGVTEGIAWTDEEYVEWGVKLGTD 680
>gi|253996282|ref|YP_003048346.1| hypothetical protein Mmol_0910 [Methylotenera mobilis JLW8]
gi|253982961|gb|ACT47819.1| TPR repeat-containing protein [Methylotenera mobilis JLW8]
Length = 1673
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 77/150 (51%), Gaps = 7/150 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL-WLLKFPAVGEANIQATAQA----LGLD 108
I + +F L K L +W V++A+P+S L W K + G+AN+ A A LG+D
Sbjct: 638 ITFASFQTLAKAGDDVLALWAEVMRALPDSRLRWQCK--SFGDANVVANMAAKFAKLGID 695
Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ ++E + D+ LDT G TT+ + LW G P +TL G++L +R
Sbjct: 696 VARLSLLGAVSRENYFASHHDVDMILDTFPYPGGTTTCEALWMGVPTLTLAGDSLIARQG 755
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
AS L G + +A T EY D A++ +D
Sbjct: 756 ASMLTAAGLADWVAETKAEYLDKALKHCSD 785
>gi|75906571|ref|YP_320867.1| hypothetical protein Ava_0346 [Anabaena variabilis ATCC 29413]
gi|75700296|gb|ABA19972.1| TPR repeat protein [Anabaena variabilis ATCC 29413]
Length = 739
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 84/166 (50%), Gaps = 7/166 (4%)
Query: 38 SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
S+ +R++ +P DA+VY + +K P ++ + ++K VPNS + L E +
Sbjct: 532 SVPTVTREELDIPHDAVVYLCGQRGFKRHPDITRLQLKIIKEVPNS--YFLIKGISDEDS 589
Query: 98 IQATAQAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTG 152
I+ L G+D R+ F + E H +AD+ LDT NG TT+++ LW
Sbjct: 590 IKLFFDGLADEEGVDTSRLRFLPIVMSESIHRANLDIADIVLDTYPYNGATTTLETLWMC 649
Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
P+VT GE A+R + + + G E IA T EY + +RLG D
Sbjct: 650 IPMVTRVGEQFAARNSYTMMMNAGITEGIAWTDDEYIEWGVRLGKD 695
>gi|312282561|dbj|BAJ34146.1| unnamed protein product [Thellungiella halophila]
Length = 762
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 78/149 (52%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
+ + +FN L KI P LQ+W +L AVPNS L + P ++ Q T + LGL+
Sbjct: 517 VTFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSIRQRFLTTLEQLGLESK 576
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 577 RVDLLPLILFNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 636
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G L+A+ EY +++ L +D
Sbjct: 637 SLLSKVGLGHLVAKNEDEYVQLSVDLASD 665
>gi|166363521|ref|YP_001655794.1| hypothetical protein MAE_07800 [Microcystis aeruginosa NIES-843]
gi|166085894|dbj|BAG00602.1| hypothetical protein MAE_07800 [Microcystis aeruginosa NIES-843]
Length = 713
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 82/159 (51%), Gaps = 2/159 (1%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
R G+PEDAI Y K +P T+++ + VLKAVP+S L + L E
Sbjct: 513 RDDLGIPEDAINYLTVQTGVKRNPETIRLQLQVLKAVPHSYLSIQGLSDAKSVEKLFFKV 572
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A+ G++ R+ + + ++ADV LDT G T++DVLW G P+VT G+
Sbjct: 573 AEEEGINYERLKIFPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 632
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+SR + + + G E IA + +EY D I+LG D +
Sbjct: 633 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDEN 671
>gi|427720511|ref|YP_007068505.1| hypothetical protein Cal7507_5336 [Calothrix sp. PCC 7507]
gi|427352947|gb|AFY35671.1| TPR repeat-containing protein [Calothrix sp. PCC 7507]
Length = 739
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 83/159 (52%), Gaps = 5/159 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQA 100
R+ +P +A+VY + + YK P T + + +LK VP+S L+K + EA
Sbjct: 537 REDLDIPTNAVVYLSAQRGYKRHPETTRWQMQILKQVPDSYF-LIKGLSEEEAIKRFFYQ 595
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A+ G+D R+ F V + H +ADV LDT NG TT+++ LW G P+VT
Sbjct: 596 IAEEEGVDCSRLRFLPQVYLESVHRANLGIADVVLDTFPYNGATTTLETLWMGIPLVTRV 655
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GE A+R + + + G E +A T EY + +RLG D
Sbjct: 656 GEQFAARNSYTMMINAGITEGVAWTDAEYIEWGVRLGKD 694
>gi|358400410|gb|EHK49741.1| glycosyltransferase family 41 protein [Trichoderma atroviride IMI
206040]
Length = 1702
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/160 (34%), Positives = 79/160 (49%), Gaps = 44/160 (27%)
Query: 51 EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
+DAI+ NFNQLYK + N++ TA+A +
Sbjct: 1507 DDAIILANFNQLYK-----------------------------AKVNLRRTAEAWAGAEV 1537
Query: 110 -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET--LASR 166
R++F++VA K +H+ R ++ D+ LDT CN HTT+ DVLW+ TP++TLP + + SR
Sbjct: 1538 ASRLIFTDVAPKSQHITRARVCDLFLDTAECNAHTTAADVLWSSTPLLTLPRYSYKMCSR 1597
Query: 167 VAASQLATLGCP----------ELIARTHKEYQDIAIRLG 196
+AAS L P ELIA EY+D A L
Sbjct: 1598 MAASILRG-ALPKTAEGEQVAVELIAENETEYEDSATELA 1636
>gi|320166199|gb|EFW43098.1| hypothetical protein CAOG_08230 [Capsaspora owczarzaki ATCC 30864]
Length = 875
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 79/159 (49%), Gaps = 1/159 (0%)
Query: 40 VITSRQQYGLPED-AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI 98
V+ + + GLP D A +C+ ++K + ST+ W+ +LK P + LW++ P AN
Sbjct: 609 VVAKKTELGLPNDGAFTFCSLGPIFKANKSTVHSWIRILKQAPEARLWVIAHPNDAAANF 668
Query: 99 QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
+ ++ IL + ++ + D+ LDT N HTT+ ++L G PV+T
Sbjct: 669 RRLVAEAQVEAQVILTPRLEREDFLEYAPSMCDLFLDTLPYNAHTTASEMLTGGLPVLTQ 728
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
G+T R AA L++ + IA + Y+ +A+ T
Sbjct: 729 AGQTRCGRSAAGMLSSASLDQFIATSAGLYEAMAVEFAT 767
>gi|428167107|gb|EKX36071.1| hypothetical protein GUITHDRAFT_165818 [Guillardia theta CCMP2712]
Length = 299
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 77/161 (47%), Gaps = 6/161 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
R +Y +P A VY ++K+ P + +L PNSI+ + P + Q +
Sbjct: 88 RSRYNIPPRATVYACLQTIFKVSPKMDFVAKRILNEDPNSIIMFKELPMTDQCGKQVLER 147
Query: 104 -----ALGLDQH-RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
L D+ RI F ++ + DV LD+ GHTTSM+ L G P+VT
Sbjct: 148 MRNKSKLTEDEMARIKFLPAMHDADYSDMFAIVDVILDSYPFGGHTTSMEALAVGRPIVT 207
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
LP + ++ R LA LG ELIA +++ IA++LG D
Sbjct: 208 LPTDFMSGRCTQGFLAFLGLQELIASDLEDFIRIAVKLGKD 248
>gi|297829706|ref|XP_002882735.1| hypothetical protein ARALYDRAFT_478494 [Arabidopsis lyrata subsp.
lyrata]
gi|297328575|gb|EFH58994.1| hypothetical protein ARALYDRAFT_478494 [Arabidopsis lyrata subsp.
lyrata]
Length = 897
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
+ + +FN L KI P LQ+W +L AVPNS L + P ++ Q T + LGL+
Sbjct: 638 VTFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSIRQRFLTTLEQLGLESK 697
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 698 RVDLLPLILFNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 757
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G L+A+ EY +++ L +D
Sbjct: 758 SLLTKVGLGHLVAKNEDEYVQLSVDLASD 786
>gi|115477811|ref|NP_001062501.1| Os08g0559300 [Oryza sativa Japonica Group]
gi|75325353|sp|Q6YZI0.1|SPY_ORYSJ RecName: Full=Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY
gi|45736106|dbj|BAD13137.1| putative gibberellin action negative regulator SPY [Oryza sativa
Japonica Group]
gi|113624470|dbj|BAF24415.1| Os08g0559300 [Oryza sativa Japonica Group]
gi|215686860|dbj|BAG89710.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 927
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 77/149 (51%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
I + +FN L KI P LQ+W +L AVPNS L + P ++ Q +T LGL+
Sbjct: 646 ITFGSFNNLAKITPKVLQVWAKILCAVPNSRLVVKCKPFCCDSIRQKFLSTLAELGLEPL 705
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 706 RVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 765
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G L+A++ EY +A+ L D
Sbjct: 766 SLLTKVGLGRLVAKSENEYVSLALDLAAD 794
>gi|218201606|gb|EEC84033.1| hypothetical protein OsI_30264 [Oryza sativa Indica Group]
Length = 947
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 77/149 (51%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
I + +FN L KI P LQ+W +L AVPNS L + P ++ Q +T LGL+
Sbjct: 666 ITFGSFNNLAKITPKVLQVWAKILCAVPNSRLVVKCKPFCCDSIRQKFLSTLAELGLEPL 725
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 726 RVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 785
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G L+A++ EY +A+ L D
Sbjct: 786 SLLTKVGLGRLVAKSENEYVSLALDLAAD 814
>gi|222641017|gb|EEE69149.1| hypothetical protein OsJ_28275 [Oryza sativa Japonica Group]
Length = 947
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 77/149 (51%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
I + +FN L KI P LQ+W +L AVPNS L + P ++ Q +T LGL+
Sbjct: 666 ITFGSFNNLAKITPKVLQVWAKILCAVPNSRLVVKCKPFCCDSIRQKFLSTLAELGLEPL 725
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 726 RVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 785
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G L+A++ EY +A+ L D
Sbjct: 786 SLLTKVGLGRLVAKSENEYVSLALDLAAD 814
>gi|42572381|ref|NP_974286.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY [Arabidopsis
thaliana]
gi|332641540|gb|AEE75061.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY [Arabidopsis
thaliana]
Length = 732
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
+ + +FN L KI P LQ+W +L AVPNS L + P ++ Q T + LGL+
Sbjct: 473 VTFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSIRQRFLTTLEQLGLESK 532
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 533 RVDLLPLILFNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 592
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G L+A+ EY +++ L +D
Sbjct: 593 SLLTKVGLGHLVAKNEDEYVQLSVDLASD 621
>gi|323451078|gb|EGB06956.1| hypothetical protein AURANDRAFT_65153 [Aureococcus anophagefferens]
Length = 1100
Score = 80.9 bits (198), Expect = 3e-13, Method: Composition-based stats.
Identities = 53/145 (36%), Positives = 70/145 (48%), Gaps = 23/145 (15%)
Query: 55 VYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG-------------------E 95
VY +FN+ K+ P ++ W +L P S+LWLL+FP G E
Sbjct: 494 VYASFNKFEKLSPGVVEAWATILLDAPGSVLWLLEFPRSGVDGARRALEGAADRVRQKRE 553
Query: 96 ANIQATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
+ A A A LD R++F A + E+ R L D CLDT HT+S+D L+ G P
Sbjct: 554 GSRDAPAGA--LDPARLIFGAFAPTDRENQARLPLIDACLDTWPYGAHTSSLDCLFAGVP 611
Query: 155 VVT-LPGETLASRVAASQLATLGCP 178
V+T G LA RVA S L L P
Sbjct: 612 VITRRDGGHLAGRVALSHLVALSDP 636
>gi|167620865|ref|ZP_02389496.1| TPR domain protein [Burkholderia thailandensis Bt4]
Length = 781
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 78/148 (52%), Gaps = 1/148 (0%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-IQATAQALGLDQHRI 112
+ Y +FN+L K+ + + +W VL A P++ + + G+ I + G+D HR+
Sbjct: 570 VTYGSFNRLNKLRQNVIALWSQVLLATPSARMVIGSIEQEGDRETIIEWFVSEGIDAHRL 629
Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
F AA ++++ DVCLD G TT+++ LW G P VTL G+++ SR +AS L
Sbjct: 630 TFQPRAAMPVYLQQHHHVDVCLDAFPYAGSTTTLNALWMGVPTVTLSGKSIPSRGSASWL 689
Query: 173 ATLGCPELIARTHKEYQDIAIRLGTDRD 200
+ +G E IA K + +I D D
Sbjct: 690 SHVGLAEYIAHDQKSFVRTSIAAAQDLD 717
>gi|443657012|ref|ZP_21131811.1| hypothetical protein C789_2351 [Microcystis aeruginosa DIANCHI905]
gi|159029888|emb|CAO90942.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443333289|gb|ELS47856.1| hypothetical protein C789_2351 [Microcystis aeruginosa DIANCHI905]
Length = 716
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 81/159 (50%), Gaps = 2/159 (1%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
R G+PEDAI Y K +P +++ + VLKAVPNS L + L E
Sbjct: 516 RDDLGIPEDAINYLTVQTGVKRNPEIIRLQLQVLKAVPNSYLSIQGLSDAKSVEKLFFKI 575
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A+ G++ R+ + + ++ADV LDT G T++DVLW G P+VT G+
Sbjct: 576 AEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 635
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+SR + + + G E IA + +EY D I+LG D +
Sbjct: 636 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDEN 674
>gi|83721418|ref|YP_443704.1| TPR domain-containing protein [Burkholderia thailandensis E264]
gi|83655243|gb|ABC39306.1| TPR domain protein [Burkholderia thailandensis E264]
Length = 781
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 78/148 (52%), Gaps = 1/148 (0%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-IQATAQALGLDQHRI 112
+ Y +FN+L K+ + + +W VL A P++ + + G+ I + G+D HR+
Sbjct: 570 VTYGSFNRLNKLRQNVIALWSQVLLATPSARMVIGSIEQEGDRETIIEWFVSEGIDAHRL 629
Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
F AA ++++ DVCLD G TT+++ LW G P VTL G+++ SR +AS L
Sbjct: 630 TFQPRAAMPVYLQQHHHVDVCLDAFPYAGSTTTLNALWMGVPTVTLSGKSIPSRGSASWL 689
Query: 173 ATLGCPELIARTHKEYQDIAIRLGTDRD 200
+ +G E IA K + +I D D
Sbjct: 690 SHVGLAEYIAHDQKSFVRTSIAAAQDLD 717
>gi|328875280|gb|EGG23645.1| glycosyltransferase [Dictyostelium fasciculatum]
Length = 687
Score = 80.9 bits (198), Expect = 3e-13, Method: Composition-based stats.
Identities = 56/170 (32%), Positives = 82/170 (48%), Gaps = 12/170 (7%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
P S+ ++ Y + + +N L K+ T + W +L +P + L ++K P E
Sbjct: 484 PVSVAPCAKNGY------VTFGCYNTLSKVQDPTWKCWKQILDRLPQARL-IIKAPLFIE 536
Query: 96 ----ANIQATAQALGLDQHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLW 150
+ + Q LG+D R+ L + + H DV LD NG TTSMD LW
Sbjct: 537 ESAAQHYRDRLQKLGVDTSRVSLRAYSMDTQNHYVSYDEMDVSLDPFPYNGTTTSMDSLW 596
Query: 151 TGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
G P VT G T RV AS L +G +L+ + +EY DIA++LG D D
Sbjct: 597 MGVPFVTYAGTTHVHRVGASILNNVGLGDLVGYSTQEYVDIAVKLGQDLD 646
>gi|425436659|ref|ZP_18817093.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9432]
gi|389678582|emb|CCH92567.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9432]
Length = 716
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 81/159 (50%), Gaps = 2/159 (1%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
R G+PEDAI Y K +P +++ + VLKAVPNS L + L E
Sbjct: 516 RDDLGIPEDAINYLTVQTGVKRNPEIIRLQLQVLKAVPNSYLSIQGLSDAKSVEKLFFKI 575
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A+ G++ R+ + + ++ADV LDT G T++DVLW G P+VT G+
Sbjct: 576 AEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 635
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+SR + + + G E IA + +EY D I+LG D +
Sbjct: 636 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDEN 674
>gi|334117819|ref|ZP_08491910.1| glycosyl transferase family 2 [Microcoleus vaginatus FGP-2]
gi|333460928|gb|EGK89536.1| glycosyl transferase family 2 [Microcoleus vaginatus FGP-2]
Length = 1283
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 93/179 (51%), Gaps = 10/179 (5%)
Query: 29 TATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLL 88
+ T E P + T R +G ED++V+ + Q + I P + W ++ AVPNSIL L
Sbjct: 1065 SYTVESAPAQVEPT-RSSWGATEDSVVFMSGAQAFTIIPELRETWAKIIAAVPNSILVLY 1123
Query: 89 KFPAVGE--------ANIQATAQALGLDQHR-ILFSNVAAKEEHVRRGQLADVCLDTPLC 139
F + + I++ G+D+ R ++ + ++ + ++ +LAD+ LD+
Sbjct: 1124 PFSSHNKNYPTVPFYKQIRSLFAEFGVDKKRLVVIKALPSRADCIKCLELADIYLDSYPY 1183
Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ + + L G P+V G+T SR +A+ L L PEL+A + K Y +++I LGT+
Sbjct: 1184 SSACYAAEPLLVGLPIVVRKGQTTRSRQSAAVLEELLIPELVADSEKSYLELSIALGTN 1242
>gi|75908605|ref|YP_322901.1| glycosyl transferase family protein [Anabaena variabilis ATCC
29413]
gi|75702330|gb|ABA22006.1| Glycosyl transferase, family 2 [Anabaena variabilis ATCC 29413]
Length = 1007
Score = 80.9 bits (198), Expect = 3e-13, Method: Composition-based stats.
Identities = 51/181 (28%), Positives = 92/181 (50%), Gaps = 12/181 (6%)
Query: 32 GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF- 90
G E + ++ R G+PE+A+V+ + +KI P + W N++ VPNS+L LL F
Sbjct: 645 GTEEGKLTILVKRNSLGIPENAVVFISGANYFKIVPELVATWANIISRVPNSVLVLLPFG 704
Query: 91 ----PAVGEAN----IQATAQALGLDQHRILFSNVAA---KEEHVRRGQLADVCLDTPLC 139
A +AN + + GL R++ ++ +E+ ++ADV LD+
Sbjct: 705 PNWSNAYPKANFIDHLNSIFSQHGLATERLIVLDIQPIPDREDMKEYYKIADVYLDSYPF 764
Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G T+ ++ L PV+ G S + A+ + TL P+L+A + + Y ++A+ LGT+
Sbjct: 765 AGTTSLIEPLQVNLPVIARQGNCFRSAMGAAIIQTLNIPDLVADSEESYIELAVALGTNS 824
Query: 200 D 200
+
Sbjct: 825 E 825
>gi|425458863|ref|ZP_18838349.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9808]
gi|389823717|emb|CCI27874.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9808]
Length = 716
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 81/159 (50%), Gaps = 2/159 (1%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
R G+PEDAI Y K +P +++ + VLKAVPNS L + L E
Sbjct: 516 RDDLGIPEDAINYLTVQTGVKRNPEIIRLQLQVLKAVPNSYLSIQGLSDAKSVEKLFFKI 575
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A+ G++ R+ + + ++ADV LDT G T++DVLW G P+VT G+
Sbjct: 576 AEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 635
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+SR + + + G E IA + +EY D I+LG D +
Sbjct: 636 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDEN 674
>gi|357407018|ref|YP_004918942.1| TPR domain/SEC-C motif domain protein (fragment), partial
[Methylomicrobium alcaliphilum 20Z]
gi|351719683|emb|CCE25359.1| TPR domain/SEC-C motif domain protein (fragment) [Methylomicrobium
alcaliphilum 20Z]
Length = 488
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 85/153 (55%), Gaps = 7/153 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK---FPAVGEAN-IQATAQALGLDQ 109
+ + FN++ KI T+ +WV +L+AVP+S ++LK F E N I+ + GL+Q
Sbjct: 293 VTFGTFNKISKITDRTIALWVQILQAVPDS-KFVLKGRGFEKKVEQNRIKKRFASAGLEQ 351
Query: 110 --HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
+L+ + EH+ D+ LDT NG TT+M ++ G PV+ L G++ +RV
Sbjct: 352 IDRLVLYGFSPKRSEHLLLYNELDIALDTFPYNGTTTTMQAIFMGVPVIALDGKSHVARV 411
Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+ S L+ +G EL+A ++Y A+ L D++
Sbjct: 412 SQSILSAIGANELVAMDEEDYFLKAVHLANDKE 444
>gi|60598530|gb|AAX25814.1| unknown [Schistosoma japonicum]
Length = 114
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/55 (67%), Positives = 44/55 (80%)
Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
MDVLW G PVVT P ETLASRVAASQL TLGCPEL+A + ++Y IA +LG +R+
Sbjct: 1 MDVLWAGCPVVTFPLETLASRVAASQLHTLGCPELVANSQEDYVRIATKLGNNRE 55
>gi|15229778|ref|NP_187761.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY [Arabidopsis
thaliana]
gi|75332921|sp|Q96301.1|SPY_ARATH RecName: Full=Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY
gi|12322895|gb|AAG51433.1|AC008153_6 spindly (gibberellin signal transduction protein); 75377-80082
[Arabidopsis thaliana]
gi|1589778|gb|AAC49446.1| SPINDLY [Arabidopsis thaliana]
gi|62319977|dbj|BAD94086.1| spindly [Arabidopsis thaliana]
gi|332641539|gb|AEE75060.1| putative UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY [Arabidopsis
thaliana]
Length = 914
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
+ + +FN L KI P LQ+W +L AVPNS L + P ++ Q T + LGL+
Sbjct: 655 VTFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSIRQRFLTTLEQLGLESK 714
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 715 RVDLLPLILFNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 774
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G L+A+ EY +++ L +D
Sbjct: 775 SLLTKVGLGHLVAKNEDEYVQLSVDLASD 803
>gi|75330266|sp|Q8LP10.1|SPY_EUSGR RecName: Full=Probable UDP-N-acetylglucosamine--peptide
N-acetylglucosaminyltransferase SPINDLY; Short=EgSPY
gi|22773823|dbj|BAC11808.1| SPINDLY [Eustoma grandiflorum]
Length = 918
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 80/149 (53%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
I + +FN L KI P LQ+W +L AV NS L + P E+ Q +T + LGL+
Sbjct: 629 ITFGSFNNLAKITPKVLQVWARILCAVSNSRLIVKCKPFCCESVRQTFLSTLEQLGLEST 688
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P +T+ G A V
Sbjct: 689 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCITMRGLVHAHNVGV 748
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+T+G L+A+ +Y +A++L +D
Sbjct: 749 SLLSTVGLGHLVAKNEDDYVRLAVQLASD 777
>gi|425454003|ref|ZP_18833752.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
gi|389799814|emb|CCI20674.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
Length = 713
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/158 (34%), Positives = 81/158 (51%), Gaps = 2/158 (1%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
R G+PEDAI Y K +P T+++ + VLKAVP+S L + L E
Sbjct: 513 RDDLGIPEDAINYLTVQTGVKRNPETIRLQLQVLKAVPHSYLSIQGLSDAKSVEKLFFKV 572
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A+ G++ R+ + + ++ADV LDT G T++DVLW G P+VT G+
Sbjct: 573 AEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 632
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+SR + + + G E IA + +EY D I+LG D
Sbjct: 633 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDE 670
>gi|170731575|ref|YP_001763522.1| hypothetical protein Bcenmc03_0220 [Burkholderia cenocepacia MC0-3]
gi|169814817|gb|ACA89400.1| TPR repeat-containing protein [Burkholderia cenocepacia MC0-3]
Length = 779
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 75/144 (52%), Gaps = 1/144 (0%)
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-IQATAQALGLDQHRILF 114
Y +FN+L K+ P +++W VL A P + + L + G+ + A G+D R+ F
Sbjct: 569 YASFNRLNKLSPHVIEVWARVLHADPTARMALGAIGSEGDKQTLTEWFAAAGIDASRLSF 628
Query: 115 SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLAT 174
+ ++++ D+CLD G TT+++ LW G P VT+PG T+A R +A L
Sbjct: 629 HRRSNIPVYMQQHHGVDLCLDAFPYTGSTTTLNALWMGVPTVTIPGATMAGRGSAGWLQH 688
Query: 175 LGCPELIARTHKEYQDIAIRLGTD 198
+G IA+ ++ A+ LG D
Sbjct: 689 VGLDAYIAKDEDDFVARALALGRD 712
>gi|434386709|ref|YP_007097320.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Chamaesiphon minutus PCC 6605]
gi|428017699|gb|AFY93793.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Chamaesiphon minutus PCC 6605]
Length = 735
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 88/160 (55%), Gaps = 7/160 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK---FPAVGEANIQA 100
RQ +P+DA++Y + YK +P+ ++ + ++K+VPNS L+K P + +
Sbjct: 522 RQDLDIPDDAVIYFSGQSGYKRNPNCIRAQMEIVKSVPNSYF-LIKGSSDPEIIKNLFGN 580
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLA--DVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A +G+ R+ F + ++E R LA D+ LDT NG TT+++ LW G P+VT
Sbjct: 581 LAAEVGISFDRLKFID-RVEDEPTHRANLAIADIVLDTFPYNGATTTLETLWMGIPMVTQ 639
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G+ ++R + + + G E IA + +EY + I+LG D
Sbjct: 640 VGKQFSARNSYTFMLNAGIEEGIAWSQQEYIEWGIKLGLD 679
>gi|427732274|ref|YP_007078511.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Nostoc sp. PCC 7524]
gi|427368193|gb|AFY50914.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Nostoc sp. PCC 7524]
Length = 739
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 85/179 (47%), Gaps = 31/179 (17%)
Query: 38 SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
S+ R++ +P DAIVY + +K P ++ + ++K VPNS F G ++
Sbjct: 532 SVPTVRREELDIPNDAIVYFCGQRGFKRHPDITRLQLQIIKEVPNSY-----FLIKGMSD 586
Query: 98 IQATAQALGLDQHRILFSNVAAKE---------------EHVRRGQL--ADVCLDTPLCN 140
++T R+ F +A KE E V R L AD+ LDT N
Sbjct: 587 EEST---------RVFFEELADKEGVEISQLRFLPGVISESVHRANLGIADIVLDTYPYN 637
Query: 141 GHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G TT+++ LW P+VT GE A+R + + + G E IA T +EY + +RLG D
Sbjct: 638 GATTTLETLWMCIPMVTKVGEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGKDE 696
>gi|425449355|ref|ZP_18829195.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 7941]
gi|389764012|emb|CCI09573.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 7941]
Length = 716
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/158 (34%), Positives = 80/158 (50%), Gaps = 2/158 (1%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
R G+PEDAI Y K +P +++ + VLKAVPNS L + L E
Sbjct: 516 RDDLGIPEDAINYLTVQTGVKRNPEIIRLQLQVLKAVPNSYLSIQGLSDAKSVEKLFFKI 575
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A+ G++ R+ + + ++ADV LDT G T++DVLW G P+VT G+
Sbjct: 576 AEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 635
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+SR + + + G E IA + +EY D I+LG D
Sbjct: 636 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDE 673
>gi|421866704|ref|ZP_16298367.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Burkholderia cenocepacia H111]
gi|358073189|emb|CCE49245.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Burkholderia cenocepacia H111]
Length = 782
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 76/147 (51%), Gaps = 7/147 (4%)
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT----AQALGLDQHR 111
Y +FN+L K+ + +W NVL A P + + L A+G Q T A G+D R
Sbjct: 572 YGSFNRLNKLSRHVISVWANVLHANPTARMAL---GAIGNDGDQHTLTEWFAAAGIDVER 628
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ D+CLDT G TT+++ LW G P VTLPG TLA R +A+
Sbjct: 629 LSFHRRSNIPVYMQQHHGVDLCLDTFPYTGSTTTLNALWMGVPTVTLPGTTLAGRGSATW 688
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
L +G IA ++ A+ LG D
Sbjct: 689 LQHVGLDNYIATNEDDFVAKALALGRD 715
>gi|196231119|ref|ZP_03129979.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
gi|196224949|gb|EDY19459.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
Ellin428]
Length = 600
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 74/151 (49%), Gaps = 5/151 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLDQ 109
I + +FN + K T+ +W +L+ +P S L ++K A +QA L +D
Sbjct: 397 ITFASFNIVSKTSGPTIALWSRILRRLPTSRL-IIKNKGFEGARLQADIHQLFAQESVDP 455
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
RI F + + E + AD+ LDT NG T+ + LW G PVV+ GET A R
Sbjct: 456 ARIEFRGPSPQAEFLASYGDADIALDTFPYNGGLTTCEALWMGLPVVSCAGETFAGRHGL 515
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+ L G PE +A Y D+A+ L +D D
Sbjct: 516 AHLTAAGLPEWVAADFDAYVDLAVALASDLD 546
>gi|119488020|ref|ZP_01621464.1| hypothetical protein L8106_11502 [Lyngbya sp. PCC 8106]
gi|119455309|gb|EAW36448.1| hypothetical protein L8106_11502 [Lyngbya sp. PCC 8106]
Length = 727
Score = 80.1 bits (196), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 82/159 (51%), Gaps = 5/159 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQA 100
R+ Y + D ++Y + K + ++ V +LK VPNSIL + K AV + +
Sbjct: 525 RKAYRIGLDQVIYLSVAPGRKFNRELVEAQVAILKQVPNSIL-IHKAAGDVAVFQGAYEQ 583
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
QA+G+ HR+ F +EEH + LADV LD+ NG T +++ LW PVVT
Sbjct: 584 ACQAVGVSLHRVKFIPRFPTEEEHRQVYALADVLLDSYPYNGGTHTLEALWFEVPVVTRK 643
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GE SR+ S L +G IA + ++Y + +R G D
Sbjct: 644 GEQFLSRMGYSFLQGVGVEAGIAESWEDYINWGVRFGED 682
>gi|374999506|ref|YP_004975594.1| hypothetical protein AZOLI_p60009 [Azospirillum lipoferum 4B]
gi|357428477|emb|CBS91434.1| conserved protein of unknown function; putative TPR domains
[Azospirillum lipoferum 4B]
Length = 652
Score = 80.1 bits (196), Expect = 4e-13, Method: Composition-based stats.
Identities = 52/152 (34%), Positives = 77/152 (50%), Gaps = 6/152 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA----TAQALGLDQ 109
+ + +FN K+ +++W +LK VPNS L ++K +G+ + T G+D
Sbjct: 456 VTFGSFNNTSKVTGEVVRVWSEILKRVPNSRL-IVKSAQMGDEETRRRYLNTFVKFGIDA 514
Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
RI L S + A + H+R D+ LD NG TT+ + LW G PVVT+ G +RV
Sbjct: 515 GRIELLSRIDAADGHLRAYDRIDIGLDPFPYNGTTTTCEALWMGVPVVTVAGANHVARVG 574
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
AS L G EL+A +Y A+ L D D
Sbjct: 575 ASLLTHCGLNELVASDEADYIATAVALAGDPD 606
>gi|323527755|ref|YP_004229908.1| tetratricopeptide repeat-containing protein [Burkholderia sp.
CCGE1001]
gi|323384757|gb|ADX56848.1| Tetratricopeptide repeat [Burkholderia sp. CCGE1001]
Length = 810
Score = 80.1 bits (196), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 80/160 (50%), Gaps = 10/160 (6%)
Query: 48 GLP---EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA-- 102
GLP + I Y +FN+L K+ P + +W +L+ P S + + A+G + T
Sbjct: 578 GLPALHKGYITYGSFNRLNKLSPEVIALWSTILREQPTSRMVI---GAIGSKLDEETYLE 634
Query: 103 --QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
+ G+D R+ F ++++ D+CLDT G TT+++ LW G P VT+PG
Sbjct: 635 WFRNEGIDTSRLTFCPRTTLPVYMQQHHHVDLCLDTFPYVGSTTTLNALWMGVPTVTMPG 694
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
++ SR A L +G E IAR E+ +I L D D
Sbjct: 695 ISMPSRCGAGWLEQVGLHEFIARDKDEFVKKSIELTRDLD 734
>gi|224088480|ref|XP_002308458.1| predicted protein [Populus trichocarpa]
gi|222854434|gb|EEE91981.1| predicted protein [Populus trichocarpa]
Length = 934
Score = 80.1 bits (196), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 81/150 (54%), Gaps = 6/150 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
I + +FN L KI P LQ+W +L AVPNS L ++K G +++ + LGL+
Sbjct: 660 ITFGSFNNLAKITPKVLQVWARILCAVPNSRL-VVKCKPFGCDSVRQRFLTVLEQLGLEP 718
Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ L + +H++ L D+ LDT G TT+ + L+ G P +T+ G A V
Sbjct: 719 LRVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCITMAGAVHAHNVG 778
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G L+A+ +EY +A++L +D
Sbjct: 779 VSLLSKVGLGHLVAKNEEEYVQLALQLASD 808
>gi|430747186|ref|YP_007206315.1| O-linked N-acetylglucosamine transferase, SPINDLY family
[Singulisphaera acidiphila DSM 18658]
gi|430018906|gb|AGA30620.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Singulisphaera acidiphila DSM 18658]
Length = 827
Score = 80.1 bits (196), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 88/171 (51%), Gaps = 8/171 (4%)
Query: 33 EEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA 92
EE P+ + + + G + + + ++L K++ L++W +L+++P + L L +
Sbjct: 617 EEAPEVNTLPALRAKG-----VTFGSLHKLPKLNARVLELWAELLRSIPKARLLLYRNNL 671
Query: 93 VGE--ANIQATAQALGLDQHRILFSNV-AAKEEHVRRGQLADVCLDTPLCNGHTTSMDVL 149
G+ I A +A G+ ++ N+ H + Q D+ LD +GHTT+ + L
Sbjct: 672 RGQRKEEILAFFKAQGVAADQLELRNIIEGGGSHYQVYQDIDISLDVFPWSGHTTACESL 731
Query: 150 WTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
W G P+VT+ G A R+ AS L LG + IA T ++Y DIA R TD D
Sbjct: 732 WMGVPIVTIRGNRHAGRMTASVLTCLGLSDWIAETPEQYLDIAQRFATDLD 782
>gi|422303123|ref|ZP_16390477.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|389791934|emb|CCI12280.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
Length = 713
Score = 80.1 bits (196), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 81/161 (50%), Gaps = 8/161 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
R G+PEDAI Y K +P T+++ + VLK+VPNS L + +A
Sbjct: 513 RDDLGIPEDAINYLTVQTGVKRNPETIRLQLQVLKSVPNS---YLSIQGLSDAKSVKKLF 569
Query: 99 QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A+ G++ R+ + + ++ADV LDT G T++DVLW G P+VT
Sbjct: 570 FKVAEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTK 629
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G+ +SR + + + G E IA + +EY D I+LG D
Sbjct: 630 VGQQWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDE 670
>gi|182412285|ref|YP_001817351.1| hypothetical protein Oter_0461 [Opitutus terrae PB90-1]
gi|177839499|gb|ACB73751.1| Tetratricopeptide TPR_2 repeat protein [Opitutus terrae PB90-1]
Length = 627
Score = 80.1 bits (196), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 59/168 (35%), Positives = 84/168 (50%), Gaps = 8/168 (4%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
P S+ +R+ + ++V+ +FN L KI S L +W +LK VPN L LLK + E
Sbjct: 413 PDSLAPNTREPAA--DGSVVFGSFNNLSKITDSMLAVWGRLLKQVPNGRL-LLKGVGLSE 469
Query: 96 ANIQ----ATAQALGLDQHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLW 150
+ A + GL R+ L H+ DV LDT +G TT+ + LW
Sbjct: 470 PESRGRYVARMETAGLPVDRVELLERTPDTAAHLALYHRVDVALDTFPYHGTTTTCEALW 529
Query: 151 TGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G PVVTL G+ +SRV S L+ +G E IA + +Y IA+ L D
Sbjct: 530 MGVPVVTLAGDRHSSRVGVSLLSAVGHSEWIAHSADDYVAIALALAAD 577
>gi|421480197|ref|ZP_15927838.1| bacterial transcriptional activator domain protein [Burkholderia
multivorans CF2]
gi|400221527|gb|EJO51976.1| bacterial transcriptional activator domain protein [Burkholderia
multivorans CF2]
Length = 779
Score = 80.1 bits (196), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 76/147 (51%), Gaps = 7/147 (4%)
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT----AQALGLDQHR 111
Y +FN+L K+ P + +W VL A P + + L A+G A Q T A G+D R
Sbjct: 569 YGSFNRLNKLSPQVIAVWARVLHADPTARMAL---GAIGNAGDQRTLTEWFAAAGIDASR 625
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ D+CLD G TT+++ LW G P VT+PG T+A R +A
Sbjct: 626 LTFHPRSNIPVYMQQHHGVDLCLDAFPYTGSTTTLNALWMGVPTVTIPGSTVAGRGSAGW 685
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
L + E IA+ ++ A+ LG D
Sbjct: 686 LQHVRLDEYIAKDEDDFVAKAVALGRD 712
>gi|428217091|ref|YP_007101556.1| hypothetical protein Pse7367_0828 [Pseudanabaena sp. PCC 7367]
gi|427988873|gb|AFY69128.1| hypothetical protein Pse7367_0828 [Pseudanabaena sp. PCC 7367]
Length = 809
Score = 79.7 bits (195), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 84/160 (52%), Gaps = 5/160 (3%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN---IQ 99
+R + +P DA+V+ K P T +M + ++ VPNS L L+K A E+ +
Sbjct: 608 NRDRLDIPADAVVFLTAQTGQKRHPDTTRMQLQIISQVPNSYL-LIKGIADQESMQNLFK 666
Query: 100 ATAQALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A G+ ++++ F + EE H +ADV LDT NG TT+M+ LW G P+VT
Sbjct: 667 NLAIEAGVSENQLKFLPLVKNEETHRANLAIADVVLDTYPYNGATTTMETLWMGIPIVTR 726
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
GE ++R + + G E IA + +EY + +RLG D
Sbjct: 727 VGEQFSARNSYGMMINAGINEGIAWSDQEYIEWGVRLGKD 766
>gi|221202121|ref|ZP_03575156.1| TPR repeat protein [Burkholderia multivorans CGD2M]
gi|221209071|ref|ZP_03582066.1| TPR repeat protein [Burkholderia multivorans CGD2]
gi|221171066|gb|EEE03518.1| TPR repeat protein [Burkholderia multivorans CGD2]
gi|221177915|gb|EEE10327.1| TPR repeat protein [Burkholderia multivorans CGD2M]
Length = 779
Score = 79.7 bits (195), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 76/147 (51%), Gaps = 7/147 (4%)
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT----AQALGLDQHR 111
Y +FN+L K+ P + +W VL A P + + L A+G A Q T A G+D R
Sbjct: 569 YGSFNRLNKLSPQVIAVWARVLHADPTARMAL---GAIGNAGDQRTLTEWFAAAGIDASR 625
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ D+CLD G TT+++ LW G P VT+PG T+A R +A
Sbjct: 626 LTFHPRSNIPVYMQQHHGVDLCLDAFPYTGSTTTLNALWMGVPTVTIPGSTVAGRGSAGW 685
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
L + E IA+ ++ A+ LG D
Sbjct: 686 LQHVRLDEYIAKDEDDFVAKAVALGRD 712
>gi|300869150|ref|ZP_07113747.1| tetratricopeptide TPR_2 [Oscillatoria sp. PCC 6506]
gi|300332849|emb|CBN58945.1| tetratricopeptide TPR_2 [Oscillatoria sp. PCC 6506]
Length = 746
Score = 79.7 bits (195), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 86/161 (53%), Gaps = 9/161 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT-- 101
R + +P +A++Y + YK T ++ V +LK VPNS L+K A + +I+
Sbjct: 545 RDRLNIPSNAVIYLSSQTGYKRHLDTARLQVKILKEVPNSYF-LIKGGA-DQNSIKKLFN 602
Query: 102 --AQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
A+ GL+ R+ F + EE V R L ADV LDT NG TT+++ LW P+VT
Sbjct: 603 QLAEEEGLEGDRLRFLPAVSSEE-VHRANLGIADVVLDTYPYNGATTTLETLWMCIPLVT 661
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G+ A+R + + + G E IA T +EY + IRLG D
Sbjct: 662 RVGQQFAARNSYTMMMNAGVTEGIAWTDEEYIEWGIRLGKD 702
>gi|421471789|ref|ZP_15920043.1| bacterial transcriptional activator domain protein [Burkholderia
multivorans ATCC BAA-247]
gi|400224616|gb|EJO54837.1| bacterial transcriptional activator domain protein [Burkholderia
multivorans ATCC BAA-247]
Length = 779
Score = 79.7 bits (195), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 76/147 (51%), Gaps = 7/147 (4%)
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT----AQALGLDQHR 111
Y +FN+L K+ P + +W VL A P + + L A+G A Q T A G+D R
Sbjct: 569 YGSFNRLNKLSPRVVAVWARVLHADPTARMAL---GAIGNAGDQRTLTEWFAAAGIDASR 625
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ D+CLD G TT+++ LW G P VT+PG T+A R +A
Sbjct: 626 LTFHPRSNIPVYMQQHHGVDLCLDAFPYTGSTTTLNALWMGVPTVTIPGSTVAGRGSAGW 685
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
L + E IA+ ++ A+ LG D
Sbjct: 686 LQHVRLDEYIAKDEDDFVAKAVALGRD 712
>gi|224140075|ref|XP_002323412.1| predicted protein [Populus trichocarpa]
gi|222868042|gb|EEF05173.1| predicted protein [Populus trichocarpa]
Length = 917
Score = 79.7 bits (195), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 78/149 (52%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQ- 109
I + +FN L KI P LQ+W +L AVPNS L + P ++ Q + LGL+
Sbjct: 664 ITFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSVRQRFLTMLEQLGLEPL 723
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
H L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V A
Sbjct: 724 HVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGAVHAHNVGA 783
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G L+A+ +EY A++L +D
Sbjct: 784 SLLSNVGLGHLVAKNEEEYVQSALQLASD 812
>gi|392409193|ref|YP_006445800.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Desulfomonile tiedjei DSM 6799]
gi|390622329|gb|AFM23536.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Desulfomonile tiedjei DSM 6799]
Length = 583
Score = 79.3 bits (194), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 78/151 (51%), Gaps = 5/151 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLDQ 109
+ +C+FNQL KI L+ W +L +P S L +++ A+ +A I+ + +LG+ +
Sbjct: 395 VTFCSFNQLAKISDKALETWAEILGQLPRSRL-VMRGKALNDATIRDSFAEKLSSLGISR 453
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ +++R D+ LDT G TT+ D LW G PVVT G+ R +A
Sbjct: 454 DRMDLLPKTTFHDYLRTYNDVDIALDTFPFAGGTTTCDSLWMGVPVVTFTGDRFCHRHSA 513
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S + G +L+A + Y AI+L +D D
Sbjct: 514 SHIINSGHEDLVAHDIEGYIHKAIQLASDLD 544
>gi|126728515|ref|ZP_01744331.1| TPR domain/SEC-C motif domain protein [Sagittula stellata E-37]
gi|126711480|gb|EBA10530.1| TPR domain/SEC-C motif domain protein [Sagittula stellata E-37]
Length = 555
Score = 79.3 bits (194), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 74/139 (53%), Gaps = 3/139 (2%)
Query: 63 YKIDPSTLQMWVNVLKAVPNSILWLL-KFPAVGEANIQATAQ--ALGLDQHRILFSNVAA 119
YK+ P+ + W +LK VP + L+L + GE + T + G+D R++F A
Sbjct: 369 YKLTPTVIDAWSEILKGVPGATLFLANRALESGENRVWLTERFAERGIDTDRLIFDGGAE 428
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
++++ D+ LD NG TT+++ +W G PV+T G+ ASR +AS + +
Sbjct: 429 HYDYLKNYDRMDIALDAFPYNGGTTTVEAIWQGVPVLTFDGDRWASRTSASLICYTHLKD 488
Query: 180 LIARTHKEYQDIAIRLGTD 198
+A + + D+AIR+G D
Sbjct: 489 FVAPSRQAMIDMAIRMGND 507
>gi|288963192|ref|YP_003453471.1| hypothetical protein AZL_f01670 [Azospirillum sp. B510]
gi|288915444|dbj|BAI76927.1| hypothetical protein AZL_f01670 [Azospirillum sp. B510]
Length = 1587
Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats.
Identities = 48/166 (28%), Positives = 87/166 (52%), Gaps = 11/166 (6%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---------PAVG 94
R + GLP D +V+ + + KI P ++W +L VP S+L L+ + A+
Sbjct: 1375 RMRLGLPADVVVFGSTANVNKIIPDLQEVWAAILARVPGSVLLLMPYGPHWNHSYPEALF 1434
Query: 95 EANIQATAQALGLDQHRILFSNVAAKEEH-VRRG-QLADVCLDTPLCNGHTTSMDVLWTG 152
+ + +A G+ + R++ + + H VRR AD+ LD+ +G T+ ++ + G
Sbjct: 1435 HDSFRQRLRAAGVAEDRLVILSERGWDRHDVRRALGAADIYLDSFPFSGSTSLIEPMEAG 1494
Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
P++ G T ++ A+ L LGCPEL+A + +Y + A+RL +D
Sbjct: 1495 LPIICRAGATFRGQMGAAMLTDLGCPELVAGSDTDYIEKAVRLASD 1540
>gi|414870070|tpg|DAA48627.1| TPA: hypothetical protein ZEAMMB73_519907, partial [Zea mays]
gi|414870071|tpg|DAA48628.1| TPA: hypothetical protein ZEAMMB73_519907, partial [Zea mays]
gi|414870072|tpg|DAA48629.1| TPA: hypothetical protein ZEAMMB73_519907, partial [Zea mays]
Length = 786
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 75/140 (53%), Gaps = 6/140 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
+ + +FN L KI P LQ+W +L +VPNS L + P E NI+ AT + LGL+
Sbjct: 646 VTFGSFNNLAKITPKVLQVWARILCSVPNSRLVVKCKPFCCE-NIRQKFLATLEELGLES 704
Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 705 LRVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGAVHAHNVG 764
Query: 169 ASQLATLGCPELIARTHKEY 188
S L+ +G L+A+T EY
Sbjct: 765 VSLLSKVGLGRLVAKTEDEY 784
>gi|354598043|ref|ZP_09016060.1| hypothetical protein BrE312_2631 [Brenneria sp. EniD312]
gi|353675978|gb|EHD22011.1| hypothetical protein BrE312_2631 [Brenneria sp. EniD312]
Length = 1133
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 78/157 (49%), Gaps = 6/157 (3%)
Query: 48 GLP---EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQAT 101
GLP + I + FN KI+ L W +L AVPNS L+L F ++ I AT
Sbjct: 416 GLPATSKGYITFGCFNNASKINDILLAQWAIILHAVPNSRLFLKSFNYKNQSLSERIYAT 475
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
+ G+ + R+ + +E + D+ LD +G T+ + + G PVVTLPG
Sbjct: 476 LEKHGIVRERVRIEGSSPHKELLASYNDVDIALDPWPYSGGLTTCEAMAMGVPVVTLPGP 535
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
T A R +AS L G PEL+A ++Y +I + + D
Sbjct: 536 TFAGRHSASHLVNAGMPELLANDWEQYINITVGIAND 572
>gi|443657013|ref|ZP_21131812.1| hypothetical protein C789_2352 [Microcystis aeruginosa DIANCHI905]
gi|159029887|emb|CAO90941.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443333290|gb|ELS47857.1| hypothetical protein C789_2352 [Microcystis aeruginosa DIANCHI905]
Length = 726
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
R +P+DAI+Y K + + + + +L+ VPNS L + F + E ++
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMIYLQLQILQQVPNSYLLIKGFADKETIRELFLK- 578
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
+A LG+ Q R+ F N +E H +AD+ LDT +G TT+++ LW G P+VT
Sbjct: 579 SADELGISQDRLRFLPNDLHEETHRANLGIADIILDTYPYSGATTTLETLWMGIPLVTRV 638
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
GE A+R + + + G + IA + +EY I+LG D++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSEEEYVQWGIKLGLDKN 679
>gi|322421642|ref|YP_004200865.1| hypothetical protein GM18_4175 [Geobacter sp. M18]
gi|320128029|gb|ADW15589.1| Tetratricopeptide TPR_1 repeat-containing protein [Geobacter sp.
M18]
Length = 1104
Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats.
Identities = 52/152 (34%), Positives = 76/152 (50%), Gaps = 6/152 (3%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL----WLLKFPAVGEANIQATAQALGLD 108
AI + +FN KI P T+ +W VL++VP S L + L P ++ + + G+
Sbjct: 428 AITFGSFNNPAKITPETVALWAGVLRSVPGSRLLVKGYSLACPG-SRKRLEESFASHGIG 486
Query: 109 QHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
R+ L N + +H+ D+ LDT NG TT+ + LW G PVVTL G SRV
Sbjct: 487 PERLELTGNTPSYRDHLALYGRVDIALDTFPYNGTTTTCEALWMGVPVVTLAGSAHRSRV 546
Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
AS L LG L+A + ++A L D+
Sbjct: 547 GASLLHALGLDALVAHDSGAFVELARALALDQ 578
>gi|167582750|ref|ZP_02375624.1| TPR domain protein [Burkholderia thailandensis TXDOH]
Length = 781
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 78/148 (52%), Gaps = 1/148 (0%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-IQATAQALGLDQHRI 112
+ Y +FN+L K+ + + +W VL A P++ + + G+ I + G+D HR+
Sbjct: 570 VTYGSFNRLNKLRQNVIALWSQVLLATPSARMVIGSIEQEGDRETIIEWFVSEGIDPHRL 629
Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
F AA ++++ DVCLD G TT+++ LW G P VTL G+++ SR +AS L
Sbjct: 630 TFQPRAAMPVYLQQHHHVDVCLDAFPYAGSTTTLNALWMGVPTVTLSGKSIPSRGSASWL 689
Query: 173 ATLGCPELIARTHKEYQDIAIRLGTDRD 200
+ +G E IA + + +I D D
Sbjct: 690 SHVGLTEYIAHDQESFVRTSIAAAQDLD 717
>gi|206558517|ref|YP_002229277.1| putative TPR repeat protein [Burkholderia cenocepacia J2315]
gi|444358767|ref|ZP_21160141.1| tetratricopeptide repeat protein [Burkholderia cenocepacia BC7]
gi|444367389|ref|ZP_21167339.1| tetratricopeptide repeat protein [Burkholderia cenocepacia
K56-2Valvano]
gi|198034554|emb|CAR50419.1| putative TPR repeat protein [Burkholderia cenocepacia J2315]
gi|443602945|gb|ELT70984.1| tetratricopeptide repeat protein [Burkholderia cenocepacia
K56-2Valvano]
gi|443603327|gb|ELT71348.1| tetratricopeptide repeat protein [Burkholderia cenocepacia BC7]
Length = 780
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 76/147 (51%), Gaps = 7/147 (4%)
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ----ALGLDQHR 111
Y +FN+L K+ + +W VL A P + + L A+G Q T A G+D R
Sbjct: 570 YGSFNRLNKLSSHVIAVWARVLHADPTARMAL---GAIGNEGDQQTLTEWFVAAGIDASR 626
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ D+CLDT G TT+++ LW G P +T+PG T+A R +AS
Sbjct: 627 LSFHRRSNIPVYMQQHHGVDLCLDTFPYTGSTTTLNGLWMGVPTLTIPGSTMAGRGSASW 686
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
L +GC IA ++ A+ L +D
Sbjct: 687 LQHVGCDAYIANDEDDFVAKALALSSD 713
>gi|92114109|ref|YP_574037.1| sulfotransferase [Chromohalobacter salexigens DSM 3043]
gi|91797199|gb|ABE59338.1| sulfotransferase [Chromohalobacter salexigens DSM 3043]
Length = 1415
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 72/143 (50%), Gaps = 3/143 (2%)
Query: 59 FNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQHRILFS 115
FN K++ TL+ W ++ +P+S L L P E+ + AT +A G+ + R++
Sbjct: 427 FNNPTKLNDVTLKQWAGIMHELPDSRLMLKGRPYTSESFCERLYATLEAAGIARERLIIE 486
Query: 116 NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATL 175
+ E + AD+ LD +G T+ + G PVVTLPG T A R +A+ L
Sbjct: 487 GPGSNYEMLDAYNRADIALDPWPYSGGLTTCEAFIMGVPVVTLPGPTFAGRHSATHLVHA 546
Query: 176 GCPELIARTHKEYQDIAIRLGTD 198
G PEL+ + EY+ I L +D
Sbjct: 547 GMPELVVNSWDEYRARVIELASD 569
>gi|428227087|ref|YP_007111184.1| hypothetical protein GEI7407_3665 [Geitlerinema sp. PCC 7407]
gi|427986988|gb|AFY68132.1| TPR repeat-containing protein [Geitlerinema sp. PCC 7407]
Length = 746
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 82/159 (51%), Gaps = 5/159 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQA 100
R+ G+ DA++Y + K +P T+++ + VL VPNS L L+K A A + +
Sbjct: 538 REDLGISTDAVIYLSAQTGIKRNPDTVRLQMKVLAQVPNSYL-LIKGWADESALQQSFKQ 596
Query: 101 TAQALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A G+D R+ F + E H +ADV LDT NG TT+M+ LW P+VT
Sbjct: 597 IASEEGVDSERLRFLPIVPSEATHRANMGIADVILDTYPYNGATTTMEALWMCVPIVTKV 656
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G+ ++R + S + G E IA + +EY + LG D
Sbjct: 657 GQQFSARNSYSMMMNAGIQEGIAWSDEEYIRWGVTLGRD 695
>gi|397628033|gb|EJK68713.1| hypothetical protein THAOC_10083, partial [Thalassiosira oceanica]
Length = 670
Score = 78.6 bits (192), Expect = 1e-12, Method: Composition-based stats.
Identities = 52/156 (33%), Positives = 77/156 (49%), Gaps = 3/156 (1%)
Query: 47 YGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL- 105
+ LP A +Y + +K P+ M V +A PN+IL L + GE N L
Sbjct: 453 HSLPGTATLYLCMQKPHKYSPAFDGMLAGVHRADPNAILVLHRVDQDGEHNHARFTDRLV 512
Query: 106 --GLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+D R+ F + LADV LD+ G TT+ + G VVTLP + L
Sbjct: 513 RAGVDLDRVAFLPTQPHSLLMALYSLADVVLDSYHAGGCTTTREAFEVGALVVTLPAQYL 572
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
SR + + + +G +L+A T +EY DIA+R+ TD+
Sbjct: 573 GSRWSLAYYSIMGVTDLVAETREEYVDIAVRMATDK 608
>gi|425445188|ref|ZP_18825224.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9443]
gi|389734922|emb|CCI01529.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9443]
Length = 717
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
R +P+DAI+Y K + + + + +L+ VPNS L + F + E ++
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMIYLQLQILQQVPNSYLLIKGFADKETIRELFLK- 578
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
+A LG+ Q R+ F N +E H +AD+ LDT +G TT+++ LW G P+VT
Sbjct: 579 SADELGISQDRLRFLPNDFNEETHRANLGIADIVLDTYPYSGATTTLETLWMGIPLVTRV 638
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
GE A+R + + + G + IA + +EY I+LG D++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSDEEYVQWGIKLGLDKN 679
>gi|357633166|ref|ZP_09131044.1| Tetratricopeptide TPR_1 repeat-containing protein [Desulfovibrio
sp. FW1012B]
gi|357581720|gb|EHJ47053.1| Tetratricopeptide TPR_1 repeat-containing protein [Desulfovibrio
sp. FW1012B]
Length = 741
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 80/151 (52%), Gaps = 7/151 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALGLDQ 109
I + +FN L KI +TL +W +L A+P L LLK + A + A + G+ +
Sbjct: 552 IWFGSFNHLSKIQDATLDLWSRILSALPEGKL-LLKSGQLDHAANRERMLAAFEGRGIAR 610
Query: 110 HR-ILFSNVAAKEEHVRRGQLADVCLD-TPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
R +L A H+ L DV LD +G TT+ D LW G PVVT+ G+ LA+R+
Sbjct: 611 DRLVLMGQTANWPAHMAAYDLVDVALDPVDAVSGVTTTCDALWMGVPVVTMAGDRLATRM 670
Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTD 198
AAS ++ LG + +A T + Y +A+ L D
Sbjct: 671 AASLVSGLGHGDWVAATPEAYAGLAVALARD 701
>gi|238927671|ref|ZP_04659431.1| possible TPR repeat-containing protein [Selenomonas flueggei ATCC
43531]
gi|238884387|gb|EEQ48025.1| possible TPR repeat-containing protein [Selenomonas flueggei ATCC
43531]
Length = 1077
Score = 78.2 bits (191), Expect = 2e-12, Method: Composition-based stats.
Identities = 52/150 (34%), Positives = 75/150 (50%), Gaps = 7/150 (4%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLD 108
+IV+ +FN KI+ L++W +L+ VP S L+L K E + +A A G+
Sbjct: 359 SIVFGSFNNFTKINDHVLRVWAEILRRVPESRLFL-KAAVFSEEDARAEMLTRIAAAGIS 417
Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ +++ D+ LD G T+ D L+ G PVVTL GETL SR
Sbjct: 418 LSRV--DTEGNSPDYLAAYNRMDIALDPFPYPGGGTTCDALYMGVPVVTLSGETLGSRFG 475
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
AS L +G LIA T +EY +A+ L D
Sbjct: 476 ASLLTNIGAEALIAHTEEEYIALAVSLAGD 505
Score = 65.1 bits (157), Expect = 2e-08, Method: Composition-based stats.
Identities = 54/177 (30%), Positives = 77/177 (43%), Gaps = 14/177 (7%)
Query: 35 VPQSIVITSRQQYGLP-------EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL 87
+P V R + +P AI++ FNQ K L +W +L+ VP S L L
Sbjct: 861 LPSQFVYVPRAELPVPTGTPARQRGAILFGVFNQYRKWTDEMLCVWREILERVPQSKL-L 919
Query: 88 LK----FPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
LK F A + LG D RIL A +++ R D+ LDT G
Sbjct: 920 LKSQVFFAAAMRRAAEERLARLGFDLDRILLE--PATTDYMERYLDVDIALDTYPWTGGG 977
Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
T+ D L+ G PVVT+ G ++R + + L +G EL + Y A+ L D D
Sbjct: 978 TTCDALYMGVPVVTMYGTRRSTRFSYAMLHLVGAEELAVASASAYISCAVSLAEDLD 1034
>gi|425458862|ref|ZP_18838348.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9808]
gi|389823716|emb|CCI27872.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9808]
Length = 721
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
R +P+DAI+Y K + + + + +L+ VPNS L + F + E ++
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMIYLQLQILQQVPNSYLLIKGFADKETIRELFLK- 578
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
+A LG+ Q R+ F N +E H +AD+ LDT +G TT+++ LW G P+VT
Sbjct: 579 SADELGISQDRLRFLPNDLHEETHRANLGIADIILDTYPYSGATTTLETLWMGIPLVTRV 638
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
GE A+R + + + G + IA + +EY I+LG D++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSEEEYVQWGIKLGLDQN 679
>gi|386392026|ref|ZP_10076807.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Desulfovibrio sp. U5L]
gi|385732904|gb|EIG53102.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Desulfovibrio sp. U5L]
Length = 741
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/151 (35%), Positives = 80/151 (52%), Gaps = 7/151 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF----PAVGEANIQATAQALGLDQ 109
I + +FN L KI +T+ +W +L A+P L LLK A + A + G+ +
Sbjct: 552 IWFGSFNHLSKIQNATVDLWSRILSALPEGKL-LLKSGQLDHAANRERMLAAFEGRGISR 610
Query: 110 HR-ILFSNVAAKEEHVRRGQLADVCLD-TPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
R +L A + H+ L DV LD +G TT+ D LW G PVVT+ G+ LA+R+
Sbjct: 611 DRLVLMGQTANWQAHMAAYDLIDVALDPVDAVSGVTTTCDALWMGVPVVTMAGDRLATRM 670
Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTD 198
AAS ++ LG + +A T + Y +A+ L D
Sbjct: 671 AASLISGLGHGDWVAATPQAYAGLAVALARD 701
>gi|440755492|ref|ZP_20934694.1| O-linked N-acetylglucosamine transferase SPINDLY family-like
protein [Microcystis aeruginosa TAIHU98]
gi|440175698|gb|ELP55067.1| O-linked N-acetylglucosamine transferase SPINDLY family-like
protein [Microcystis aeruginosa TAIHU98]
Length = 721
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
R +P+DAI+Y K + + + + +L+ VPNS L + F + E ++
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMIYLQLQILQQVPNSYLLIKGFADKETIRELFLK- 578
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
+A LG+ Q R+ F N +E H +AD+ LDT +G TT+++ LW G P+VT
Sbjct: 579 SADELGISQDRLRFLPNDLHEETHRANLGIADIILDTYPYSGATTTLETLWMGIPLVTRV 638
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
GE A+R + + + G + IA + +EY I+LG D++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSEEEYVQWGIKLGLDQN 679
>gi|425436658|ref|ZP_18817092.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9432]
gi|389678581|emb|CCH92566.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9432]
Length = 721
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
R +P+DAI+Y K + + + + +L+ VPNS L + F + E ++
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMIYLQLQILQQVPNSYLLIKGFADKETIRELFLK- 578
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
+A LG+ Q R+ F N +E H +AD+ LDT +G TT+++ LW G P+VT
Sbjct: 579 SADELGISQDRLRFLPNDLHEETHRANLGIADIILDTYPYSGATTTLETLWMGIPLVTRV 638
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
GE A+R + + + G + IA + +EY I+LG D++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSEEEYVQWGIKLGLDQN 679
>gi|425449354|ref|ZP_18829194.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 7941]
gi|389764011|emb|CCI09572.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 7941]
Length = 726
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
R +P+DAI+Y K + + + + +L+ VPNS L + F + E ++
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMIYLQLQILQQVPNSYLLIKGFADKETIRELFLK- 578
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
+A LG+ Q R+ F N +E H +AD+ LDT +G TT+++ LW G P+VT
Sbjct: 579 SADELGISQDRLRFLPNDLHEETHRANLGIADIILDTYPYSGATTTLETLWMGIPLVTRV 638
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
GE A+R + + + G + IA + +EY I+LG D++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSEEEYVQWGIKLGLDQN 679
>gi|381160353|ref|ZP_09869585.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
gi|380878417|gb|EIC20509.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
Length = 629
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 74/145 (51%), Gaps = 4/145 (2%)
Query: 58 NFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG---EANIQATAQALGLDQHRILF 114
+FN L K+ +L +W VL+A+P + L L A+ +A I + G+ R+ F
Sbjct: 431 SFNNLTKVRHESLVLWSQVLRALPQARLILKDKRALDHSVQARILGPLREQGIHPDRVEF 490
Query: 115 SNV-AAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
AA +H+ D+ LD+ N TT D LW GTP++TL G+ LA R AAS L+
Sbjct: 491 RGASAAWSDHMDAYNDTDIALDSLPFNSATTGFDALWMGTPLITLTGDRLAGRQAASLLS 550
Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
LG E IAR E+ I L D
Sbjct: 551 GLGRTEWIARDADEFVAIVTGLAQD 575
>gi|390440787|ref|ZP_10228990.1| conserved hypothetical protein [Microcystis sp. T1-4]
gi|389835918|emb|CCI33116.1| conserved hypothetical protein [Microcystis sp. T1-4]
Length = 256
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 82/162 (50%), Gaps = 9/162 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
R+ +P DA +Y + K P T+++ + ++ VPNS L+K VG+ IQ
Sbjct: 58 REDLEIPPDATIYFSVQSGMKRHPDTIKLQMKIIAQVPNSYF-LIK--GVGKTEKIQELF 114
Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ G++ R+ F E H Q+ADV LDT NG TT+++VLW G P+VT
Sbjct: 115 TEIAIREGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 174
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
L GE ++R + + + G I EY D ++LG DR
Sbjct: 175 LVGEQFSARNSYTFMINAGIEAGIGWNEAEYIDWGVKLGLDR 216
>gi|63095209|gb|AAY32335.1| SPY [Phyllostachys praecox]
Length = 781
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 76/149 (51%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
I + +FN L KI P LQ+W +L AVPNS L + P ++ Q +T + LG +
Sbjct: 502 ITFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSIRQKFLSTLEELGSESL 561
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G V
Sbjct: 562 RVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHGHNVGV 621
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G L+A+ EY +A+ L +D
Sbjct: 622 SLLTKVGLGRLVAKPEDEYISLALDLASD 650
>gi|295678098|ref|YP_003606622.1| hypothetical protein BC1002_3066 [Burkholderia sp. CCGE1002]
gi|295437941|gb|ADG17111.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia sp. CCGE1002]
Length = 1634
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 72/137 (52%), Gaps = 2/137 (1%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE-ANIQATAQALGLDQHRI 112
+ + +F+++ KI P + +W VL+ VP+S + + A + + + A G+D RI
Sbjct: 1161 MTFGSFSKMLKIGPEVIALWARVLRKVPHSRMVIGAIVAPEQISKVSALFANEGIDGSRI 1220
Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
F A ++ + DVCLD G TT+M LW G P VTLPG ++ASR + L
Sbjct: 1221 SFLPRAGMAAYLEQHHQIDVCLDAFPFAGSTTTMHALWMGVPTVTLPGVSMASRGSTGWL 1280
Query: 173 ATLGCPE-LIARTHKEY 188
+ LG E +AR ++
Sbjct: 1281 SQLGIDEAFVARDEDDF 1297
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 74/151 (49%), Gaps = 10/151 (6%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-IQATAQALGLDQHRI 112
+ + +F+++ KI P+ + +W VL+ VP++ L + E + + A G+ R+
Sbjct: 533 LTFGSFSRMSKIGPAVVALWARVLREVPDARLVIGSISGQDEMDKLTERFAAEGIGAARL 592
Query: 113 LF----SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
F + +E+H DVCLD G TT++ LW G P VTLPG+++ASR +
Sbjct: 593 GFLPRKGMLVYQEQH----HQIDVCLDAFPFGGSTTTLQALWMGVPTVTLPGDSMASRSS 648
Query: 169 ASQLATLGC-PELIARTHKEYQDIAIRLGTD 198
L+ LG +A +Y + L D
Sbjct: 649 TGWLSLLGLDAPFVAHDKDDYVRKCVTLAAD 679
>gi|425440157|ref|ZP_18820465.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389719465|emb|CCH96700.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 508
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 9/162 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
R+ +P DA +Y + K P T+++ + +L VPNS L+K VG+ IQ
Sbjct: 310 REDLEIPPDATIYFSVQSGMKRHPDTIKLQMKILAQVPNSYF-LIK--GVGKTEKIQELF 366
Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ G++ R+ F E H Q+ADV LDT NG TT+++VLW G P+VT
Sbjct: 367 TEIAIREGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 426
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
L GE ++R + + + G I EY D ++LG DR
Sbjct: 427 LVGEQFSARNSYTFMINAGIEAGIGWNEAEYIDWGVKLGLDR 468
>gi|427713486|ref|YP_007062110.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Synechococcus sp. PCC 6312]
gi|427377615|gb|AFY61567.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Synechococcus sp. PCC 6312]
Length = 1128
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 82/164 (50%), Gaps = 9/164 (5%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT- 101
R +P D I++ + K P T+ + + ++K VPNS + F + +IQ +
Sbjct: 915 KRHDLDIPNDVIIFLSSQGGQKRHPDTMSLQLQIIKNVPNSFFLIKGF--ADQVSIQESF 972
Query: 102 ---AQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVV 156
A +GL ++ F + + R L ADV LDT NG TT+++ LW G P+V
Sbjct: 973 YSIASEIGLSASQLRFLKPDSTS-MIHRANLGIADVILDTFPYNGATTTLEALWMGIPLV 1031
Query: 157 TLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
T G+ A+R + + L G E IA T +EY + I+ GTD +
Sbjct: 1032 TRVGQQFAARNSYTFLKNAGVEEGIAWTDEEYVEWGIKFGTDHE 1075
>gi|307731403|ref|YP_003908627.1| hypothetical protein [Burkholderia sp. CCGE1003]
gi|307585938|gb|ADN59336.1| Tetratricopeptide TPR_1 repeat-containing protein [Burkholderia sp.
CCGE1003]
Length = 820
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 80/160 (50%), Gaps = 10/160 (6%)
Query: 48 GLP---EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA-- 102
GLP + I Y +FN+L K+ + +W +L+A P S + + A+G + T
Sbjct: 580 GLPALHKGYITYGSFNRLNKLSTEVIALWSTILRAQPTSRMVI---GAIGSKLDEETYLE 636
Query: 103 --QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
+ G+D R+ F ++++ D+CLDT G TT+++ LW G P VT+PG
Sbjct: 637 WFRNEGIDTSRLTFCPRTTLPVYMQQHHHVDLCLDTFPYVGSTTTLNALWMGVPTVTMPG 696
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
++ SR A L +G + +AR E+ +I L D D
Sbjct: 697 TSMPSRCGAGWLEQVGLHDFVARDKDEFVKKSIELTRDLD 736
>gi|255579381|ref|XP_002530535.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
communis]
gi|223529939|gb|EEF31867.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
communis]
Length = 930
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
I + +FN L KI P LQ+W +L AVPNS L + P ++ Q + LGL+
Sbjct: 661 ITFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSVRQRFLTMLEELGLESL 720
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G A V
Sbjct: 721 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGAIHAHNVGV 780
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G L+A+ Y +A++L +D
Sbjct: 781 SLLSKVGLGHLVAQNEDNYVQLALQLASD 809
>gi|440754653|ref|ZP_20933855.1| hypothetical protein O53_3044 [Microcystis aeruginosa TAIHU98]
gi|440174859|gb|ELP54228.1| hypothetical protein O53_3044 [Microcystis aeruginosa TAIHU98]
Length = 705
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 79/158 (50%), Gaps = 2/158 (1%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
R G+P DAI Y K +P +++ + VLKAVPNS L + L E
Sbjct: 505 RDDLGIPGDAINYLTVQTGVKRNPEIIRLQLQVLKAVPNSYLSIQGLSDAKSVEKLFFKI 564
Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
A+ G++ R+ + + ++ADV LDT G T++DVLW G P+VT G+
Sbjct: 565 AEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 624
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+SR + + + G E IA + +EY D I+LG D
Sbjct: 625 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDE 662
>gi|332530455|ref|ZP_08406398.1| hypothetical protein HGR_11012 [Hylemonella gracilis ATCC 19624]
gi|332040115|gb|EGI76498.1| hypothetical protein HGR_11012 [Hylemonella gracilis ATCC 19624]
Length = 755
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/152 (34%), Positives = 83/152 (54%), Gaps = 8/152 (5%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA----LGLDQ 109
+ + + N L K+ L +W +L VPNS L L++ +G+ A+ +A LG+D
Sbjct: 414 VTFGSCNNLGKLTDDVLALWGRILATVPNSRL-LIEGKNLGQPEFSASYRARCERLGIDV 472
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDT-PLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ + ++++ ++ DV LD PL G TTS DV+W G P++T+ G+ L SR+
Sbjct: 473 SRLDLVPLDYAKQYLVYHRI-DVALDPFPLVGG-TTSNDVIWMGVPLITMNGDALKSRMG 530
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
LA +G + IAR EY IA RL +D D
Sbjct: 531 VGMLAHMGRHDWIARNPDEYVAIASRLASDVD 562
>gi|427406429|ref|ZP_18896634.1| hypothetical protein HMPREF9161_00994 [Selenomonas sp. F0473]
gi|425708248|gb|EKU71288.1| hypothetical protein HMPREF9161_00994 [Selenomonas sp. F0473]
Length = 559
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 77/149 (51%), Gaps = 7/149 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLDQ 109
IV+ + N K++ + L++W +L VP S L LLK A+ A A A G+
Sbjct: 364 IVFGSLNNFTKVNDTVLRVWAKILHRVPTSRL-LLKAEIFSVADGAAEAWRRIAAAGIPA 422
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ A +++ + D+ LDT G T+ + L+ G PVVTL GETL SR A
Sbjct: 423 ERV--ETEGASADYLTAYRRIDIALDTFPYPGGGTTCEALYMGVPVVTLAGETLGSRFGA 480
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G LIART +EY + A+ L D
Sbjct: 481 SLLQNIGAQALIARTTEEYIERAVLLAQD 509
>gi|166364311|ref|YP_001656584.1| hypothetical protein MAE_15700 [Microcystis aeruginosa NIES-843]
gi|166086684|dbj|BAG01392.1| hypothetical protein MAE_15700 [Microcystis aeruginosa NIES-843]
Length = 711
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 9/162 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
R+ +P DA +Y + K P T+++ + +L VPNS L+K VG+ IQ
Sbjct: 513 REDLEIPPDATIYFSVQSGMKRHPDTIKLQMKILAQVPNSYF-LIK--GVGKTEKIQELF 569
Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ G++ R+ F E H Q+ADV LDT NG TT+++VLW G P+VT
Sbjct: 570 TEIAILEGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 629
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
L GE ++R + + + G I EY D ++LG DR
Sbjct: 630 LVGEQFSARNSYTFMINAGIEAGIGWNEAEYIDWGVKLGLDR 671
>gi|425467182|ref|ZP_18846466.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389830109|emb|CCI28120.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 711
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 9/162 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
R+ +P DA +Y + K P T+++ + +L VPNS L+K VG+ IQ
Sbjct: 513 REDLEIPPDATIYFSVQSGMKRHPDTIKLQMKILAQVPNSYF-LIK--GVGKTEKIQELF 569
Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ G++ R+ F E H Q+ADV LDT NG TT+++VLW G P+VT
Sbjct: 570 TEIAILEGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 629
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
L GE ++R + + + G I EY D ++LG DR
Sbjct: 630 LVGEQFSARNSYTFMINAGIEAGIGWNEAEYIDWGVKLGLDR 671
>gi|330820193|ref|YP_004349055.1| TPR repeat-containing protein [Burkholderia gladioli BSR3]
gi|327372188|gb|AEA63543.1| TPR repeat-containing protein [Burkholderia gladioli BSR3]
Length = 786
Score = 77.4 bits (189), Expect = 3e-12, Method: Composition-based stats.
Identities = 47/154 (30%), Positives = 73/154 (47%), Gaps = 3/154 (1%)
Query: 48 GLPEDAIVYCNF---NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA 104
LP A Y F N K+ ++ ++W V++ + L L+ + +
Sbjct: 588 ALPALANGYPTFGCLNNPCKLGDASFRLWAEVMRGFDEARLILMAPEGSARRLLLDRLRG 647
Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
G+D R+ F+ + E++R D+ LDT NGHTTS+D W G PVVT +T+
Sbjct: 648 HGIDDERVAFTAFRPRAEYLRTYHEIDIGLDTLPYNGHTTSLDSYWMGVPVVTRITDTVV 707
Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
R SQ A LG E++A + + +IA L D
Sbjct: 708 GRAGLSQAANLGLSEIVADSDARFVEIATGLARD 741
>gi|425454289|ref|ZP_18834035.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
gi|389805097|emb|CCI15371.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
Length = 711
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 9/162 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
R+ +P DA +Y + K P T+++ + +L VPNS L+K VG+ IQ
Sbjct: 513 REDLEIPPDATIYFSVQSGMKRHPDTIKLQMKILAQVPNSYF-LIK--GVGKTEKIQELF 569
Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ G++ R+ F E H Q+ADV LDT NG TT+++VLW G P+VT
Sbjct: 570 TEIAILEGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 629
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
L GE ++R + + + G I EY D ++LG DR
Sbjct: 630 LVGEQFSARNSYTFMINAGIEAGIGWNEAEYIDWGVKLGLDR 671
>gi|418542280|ref|ZP_13107723.1| TPR domain-containing protein [Burkholderia pseudomallei 1258a]
gi|418548903|ref|ZP_13113999.1| TPR domain-containing protein [Burkholderia pseudomallei 1258b]
gi|385355974|gb|EIF62122.1| TPR domain-containing protein [Burkholderia pseudomallei 1258a]
gi|385356826|gb|EIF62911.1| TPR domain-containing protein [Burkholderia pseudomallei 1258b]
Length = 775
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 74/145 (51%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
+ + +FN+L K+ + +W +++AVP S + L P G A + + G+ + R+
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGAAMIEWFEQEGIARERLS 624
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR + ++
Sbjct: 625 FQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTWMS 684
Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
+G IA ++ I L +D
Sbjct: 685 HVGLESFIADDIDDFVAKGIALASD 709
>gi|328856695|gb|EGG05815.1| family 41 glycosyltransferase [Melampsora larici-populina 98AG31]
Length = 313
Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 32/71 (45%), Positives = 50/71 (70%), Gaps = 2/71 (2%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L +D +++ NFNQLYK+DP + W+ +L +VP SILWLL+FPA GE +++ TA+
Sbjct: 240 LSDDTVIFANFNQLYKVDPVIFKYWLEILASVPKSILWLLRFPAPGEPHLKLTAEKWAGP 299
Query: 109 Q--HRILFSNV 117
+ RI+F++V
Sbjct: 300 EVASRIMFTDV 310
>gi|217423849|ref|ZP_03455349.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 576]
gi|217392912|gb|EEC32934.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 576]
Length = 775
Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 74/145 (51%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
+ + +FN+L K+ + +W +++AVP S + L P G A + + G+ + R+
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGAAMIEWFEQEGIARERLS 624
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR + ++
Sbjct: 625 FQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTWMS 684
Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
+G IA ++ I L +D
Sbjct: 685 HVGLESFIADDIDDFVAKGIALASD 709
>gi|148238461|ref|YP_001223848.1| glycosyl transferase family protein [Synechococcus sp. WH 7803]
gi|147847000|emb|CAK22551.1| Glycosyltransferase of family GT41 [Synechococcus sp. WH 7803]
Length = 721
Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 79/147 (53%), Gaps = 4/147 (2%)
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL---GLDQHRI 112
+ +FN K++P T++++ VL+ VPNS L L V +A QAL G+ + RI
Sbjct: 531 FGSFNHSRKLNPGTIRLYAGVLRDVPNSTLVLKSVSFVEQAEKTRVMQALTAAGIGEDRI 590
Query: 113 LF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
L + A +EH+ + DV LD G T++ D L G PVV+L + R+++S
Sbjct: 591 LLLPSTANHKEHLALYREVDVVLDPFPYGGATSTCDALMMGVPVVSLASAGMVGRLSSSI 650
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
LA+ G + IAR+ +Y+ IA L +
Sbjct: 651 LASAGLEKWIARSQTDYRRIARELAAE 677
>gi|422302254|ref|ZP_16389617.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|389788584|emb|CCI15664.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
Length = 711
Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 9/162 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
R+ +P DA +Y + K P T+++ + +L VPNS L+K VG+ IQ
Sbjct: 513 REDLEIPPDATIYFSAQSGMKRHPDTIKLQMKILAQVPNSYF-LIK--GVGKTEKIQELF 569
Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ G++ R+ F E H Q+ADV LDT NG TT+++VLW G P+VT
Sbjct: 570 TEIAIREGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 629
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
L GE ++R + + + G I EY D ++LG DR
Sbjct: 630 LVGEEFSARNSYTFMINAGIEAGIGWNEAEYIDWGVKLGLDR 671
>gi|374293909|ref|YP_005040932.1| hypothetical protein AZOLI_p30436 [Azospirillum lipoferum 4B]
gi|357427312|emb|CBS90256.1| conserved protein of unknown function; putative TPR domains
[Azospirillum lipoferum 4B]
Length = 732
Score = 77.0 bits (188), Expect = 4e-12, Method: Composition-based stats.
Identities = 49/147 (33%), Positives = 75/147 (51%), Gaps = 5/147 (3%)
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQHR 111
+ +FN L K++ L W +L +P S L + L PA A + A G++ R
Sbjct: 545 FGSFNILTKLNDGVLAAWAAILGRMPGSRLLMKTKALSCPATA-ALWRDRLAAAGIEPDR 603
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ ++ +H+RR DV LD +G TT+++ LW G PVVTLPGET +SR + +
Sbjct: 604 VTMVGASSSLDHMRRCASVDVALDPFPFSGSTTTLETLWMGVPVVTLPGETFSSRHSLAF 663
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
L G IA +Y + A+ L +D
Sbjct: 664 LTVAGVEGCIATDPADYVERAVALASD 690
>gi|386390730|ref|ZP_10075512.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Desulfovibrio sp. U5L]
gi|385735611|gb|EIG55808.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Desulfovibrio sp. U5L]
Length = 609
Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 54/150 (36%), Positives = 74/150 (49%), Gaps = 9/150 (6%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATAQALGLDQHRI 112
I + +FN + KI P+++ +W VL +VP S L LLK A +A + A G +
Sbjct: 342 ITFGSFNNVAKISPASIALWREVLASVPGSRL-LLKGKAFADAPTVDRFRDAFGPQAPSV 400
Query: 113 LF----SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
F S+ + + RR D+ LDT NG T+ + LW G PV+TL G+ SRV
Sbjct: 401 EFLPWSSDAVSHLDVYRR---IDIALDTIPYNGTMTTCEALWMGVPVITLLGDRFTSRVG 457
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
AS L G E IA Y +A RL D
Sbjct: 458 ASLLTQAGLSEWIADDQAAYIRLAQRLAAD 487
>gi|428211055|ref|YP_007084199.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Oscillatoria acuminata PCC 6304]
gi|427999436|gb|AFY80279.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Oscillatoria acuminata PCC 6304]
Length = 2093
Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 83/173 (47%), Gaps = 10/173 (5%)
Query: 36 PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF-PAVG 94
P S V +R +G E +V+ + YKI P + W +L AVPNSIL L F PA
Sbjct: 1733 PASTVNPTRASWGATEQTVVFMSGANFYKIIPELTETWAKILAAVPNSILVLYPFGPAWN 1792
Query: 95 EA--------NIQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLCNGHTTS 145
+ + G+D+ R++ N ++ +LAD+ LD+ +G T+
Sbjct: 1793 SSYPAMPFWERLHRIFAQYGVDKGRLVLINTLPSPADIKECVKLADLYLDSYPYSGATSL 1852
Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+D L G P + G +L SR AA+ L + +LIA+ Y +A +L T+
Sbjct: 1853 LDPLQLGIPAIAWEGSSLRSRQAAALLREIHLADLIAQDESSYIQLATKLATN 1905
>gi|425444850|ref|ZP_18824891.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
gi|389735306|emb|CCI01162.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
Length = 712
Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 9/162 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
R+ +P DA +Y + K P T+++ + +L VPNS L+K VG+ IQ
Sbjct: 514 REDLEIPPDATIYFSAQSGMKRHPDTIKLQMKILARVPNSYF-LIK--GVGKTEKIQELF 570
Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ G++ R+ F E H Q+ADV LDT NG TT+++VLW G P+VT
Sbjct: 571 TEIAIREGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 630
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
L GE ++R + + + G I EY D ++LG DR
Sbjct: 631 LVGEQFSARNSYTFMINAGIEAGIGWNEAEYIDWGVKLGLDR 672
>gi|118581731|ref|YP_902981.1| hypothetical protein Ppro_3331 [Pelobacter propionicus DSM 2379]
gi|118504441|gb|ABL00924.1| Tetratricopeptide TPR_2 repeat protein [Pelobacter propionicus DSM
2379]
Length = 633
Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 80/159 (50%), Gaps = 9/159 (5%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
L A+ + +N K++ L +W VL+ +P S L LK A+ +++ + L L
Sbjct: 432 LERGAVTFVAYNNFAKLNCPLLDLWCRVLRMIPGSRL-ALKDRALAN-DVEFRMEMLDLF 489
Query: 109 QHR-------ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
R I+ H+ D+ LD+ +G TT+ + LW G PVVTL G
Sbjct: 490 AQRGITADRLIIRDRTETVHAHLDALAEGDIALDSHPYSGTTTTCESLWMGVPVVTLAGG 549
Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
T ASRV+AS L ++G PEL+A + ++ IA+ L D D
Sbjct: 550 THASRVSASLLTSVGVPELVAESDDDFVAIAVALAKDTD 588
>gi|254186382|ref|ZP_04892899.1| tetratricopeptide repeat protein [Burkholderia pseudomallei Pasteur
52237]
gi|157934067|gb|EDO89737.1| tetratricopeptide repeat protein [Burkholderia pseudomallei Pasteur
52237]
Length = 836
Score = 76.6 bits (187), Expect = 5e-12, Method: Composition-based stats.
Identities = 42/145 (28%), Positives = 74/145 (51%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
+ + +FN+L K+ + +W +++AVP S + L P G A + + G+ + R+
Sbjct: 626 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGAAMIEWFEQEGIARERLS 685
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR + ++
Sbjct: 686 FQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTWMS 745
Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
+G IA ++ I L +D
Sbjct: 746 HVGLESFIADDIDDFVAKGIALASD 770
>gi|301113846|ref|XP_002998693.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase,
putative [Phytophthora infestans T30-4]
gi|262111994|gb|EEY70046.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase,
putative [Phytophthora infestans T30-4]
Length = 482
Score = 76.6 bits (187), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 79/166 (47%), Gaps = 16/166 (9%)
Query: 14 IVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMW 73
++L N N + A+ E + + + ++ ++ V+ NFN++ K+ S W
Sbjct: 256 VILPNCYQVNYYDQILASSETISRRSDLWDIAKHEGGDEGFVFVNFNKIDKLKASLFSTW 315
Query: 74 VNVLKAVPNSILWLLKFPAVGE---------------ANIQATAQALGLDQHRILFSNVA 118
++ L+ VP S L LL P E N+ AQA G+ + R+ F +
Sbjct: 316 MSTLRRVPRSSLLLLD-PGRNELGDDVTESVTSREIKKNLWKEAQAQGISRGRVRFVSRI 374
Query: 119 AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
K EH+ R + + LDT + H+T+ D L+ G PV+TL G++ A
Sbjct: 375 PKAEHLLRHHVGGLFLDTFIYGAHSTATDALYAGLPVITLAGDSFA 420
>gi|425445187|ref|ZP_18825223.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
gi|389734921|emb|CCI01528.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
Length = 719
Score = 76.6 bits (187), Expect = 5e-12, Method: Composition-based stats.
Identities = 47/162 (29%), Positives = 83/162 (51%), Gaps = 8/162 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
R+ G+ DA++Y + K +P+ +++ + ++K+VPNS + V + N
Sbjct: 519 REDLGINNDAVIYLSSQTAIKRNPAMIRLQMQIIKSVPNSYFLI---QGVADDNSLWDLF 575
Query: 99 QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A +G++ +RI + E + +ADV LDT NG TT+++ LW G P+V
Sbjct: 576 CQIAAEVGVETNRIKMLPIYQTETYRANLAIADVVLDTYPFNGGTTTLETLWMGIPLVVK 635
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
G+ +SR + + G E IA + +EY I+LG D++
Sbjct: 636 VGQQWSSRNGYTLMMNAGITEGIAWSDEEYVQWGIKLGLDKN 677
>gi|238025842|ref|YP_002910073.1| TPR domain-containing protein [Burkholderia glumae BGR1]
gi|237875036|gb|ACR27369.1| TPR domain protein [Burkholderia glumae BGR1]
Length = 790
Score = 76.6 bits (187), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 75/146 (51%), Gaps = 1/146 (0%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT-AQALGLDQHRI 112
+ + +FN+L K+ + +W +L AVPNS + + P VG ++ A G+ R+
Sbjct: 570 VTFGSFNRLNKLRREVIAVWARILHAVPNSRMRVGSVPRVGGVDMLLEWFTAEGIAHQRL 629
Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
A ++++ D CLDT G TT+++ LW G P +T+ G TLASR A +
Sbjct: 630 DLQPRAPAAVYLQQHHHVDFCLDTFPYTGSTTALNALWMGVPTLTIRGGTLASRAGAVWM 689
Query: 173 ATLGCPELIARTHKEYQDIAIRLGTD 198
+++G + +A + ++ I L D
Sbjct: 690 SSVGLEQFVADSPDDFVARGIALAND 715
>gi|425464281|ref|ZP_18843603.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9809]
gi|389833751|emb|CCI21471.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9809]
Length = 721
Score = 76.6 bits (187), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
R +P+DAI+Y K + + + + +L+ VPNS L + F + E +++
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMIYLQLQILQQVPNSYLLIKGFADKETIRELFLKS 579
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
A LG+ Q R+ F N +E H +AD+ LDT +G TT+++ LW G P+VT
Sbjct: 580 -ADELGISQDRLRFLPNDFNEETHRANLGIADIVLDTYPYSGATTTLETLWMGIPLVTRV 638
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
GE A+R + + + G + IA + +EY I+LG +++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSDEEYVQWGIKLGLNKN 679
>gi|425468292|ref|ZP_18847322.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9701]
gi|389885052|emb|CCI34703.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9701]
Length = 721
Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
R +P+DAI+Y K + + + + +L+ VPNS L + F + E ++
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMICLQLQILQQVPNSYLLIKGFADKETIRELFLK- 578
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
+A LG+ Q R+ F N +E H +AD+ LDT +G TT+++ LW G P+VT
Sbjct: 579 SADELGISQDRLRFLPNDFNEETHRANLGIADIVLDTYPYSGATTTLETLWMGIPLVTRV 638
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
GE A+R + + + G + IA + +EY I+LG D++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSDEEYVQWGIKLGLDQN 679
>gi|322421643|ref|YP_004200866.1| hypothetical protein GM18_4176 [Geobacter sp. M18]
gi|320128030|gb|ADW15590.1| Tetratricopeptide TPR_1 repeat-containing protein [Geobacter sp.
M18]
Length = 598
Score = 76.3 bits (186), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 76/149 (51%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQH 110
I + +FN L K+ P + +W VL+AVP S+L L P + I A G+
Sbjct: 401 ITFGSFNNLAKVTPEVIALWSRVLQAVPGSLLLLKAKPLADQGVRRRILDLFAAGGVAGD 460
Query: 111 RILFS-NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
RI F EH+ + + DV LDT NG TT+ + LW G P++TL G+ ASR A
Sbjct: 461 RIEFDPGQPGIREHLAQYRRVDVALDTFPYNGTTTTCEALWMGVPIITLAGDRHASRTGA 520
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L G +L+ R+ +Y + A ++ D
Sbjct: 521 SLLTNCGLADLVTRSEAQYLETARQVAQD 549
>gi|107024411|ref|YP_622738.1| hypothetical protein Bcen_2869 [Burkholderia cenocepacia AU 1054]
gi|116688262|ref|YP_833885.1| TPR repeat-containing protein [Burkholderia cenocepacia HI2424]
gi|105894600|gb|ABF77765.1| Tetratricopeptide TPR_2 [Burkholderia cenocepacia AU 1054]
gi|116646351|gb|ABK06992.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia cenocepacia
HI2424]
Length = 780
Score = 76.3 bits (186), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 74/144 (51%), Gaps = 1/144 (0%)
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT-AQALGLDQHRILF 114
Y +FN+L K+ P + +W VL A P + + L G+ + A G+D R+ F
Sbjct: 570 YGSFNRLNKLSPHVIGVWARVLHADPTARMALGAIGNEGDQRVLIDWFAAAGIDASRLSF 629
Query: 115 SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLAT 174
+ ++++ D+CLDT G TT+++ LW G P +T+ G TLA R +A+ L
Sbjct: 630 HRRSNIPVYLQQHHGVDLCLDTFPYTGSTTTLNALWMGVPTLTIAGVTLAGRGSANWLMH 689
Query: 175 LGCPELIARTHKEYQDIAIRLGTD 198
+G IA+ E+ A+ LG D
Sbjct: 690 VGLDTYIAKNEDEFVAKALALGKD 713
>gi|53804390|ref|YP_113965.1| hypothetical protein MCA1509 [Methylococcus capsulatus str. Bath]
gi|53758151|gb|AAU92442.1| TPR domain protein [Methylococcus capsulatus str. Bath]
Length = 699
Score = 76.3 bits (186), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 67/149 (44%), Gaps = 5/149 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
I + N L KI+ L +W +L A P S L + N++ A GLD
Sbjct: 482 ITFGCLNNLAKINTEVLDLWAQILAASPTSRLLIQTVALCDRLNLERFRALCVERGLDPA 541
Query: 111 RILFSNVAAKEEHVRRGQLADVCLDT-PLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ + E+ + D+ LD P C G TS D LW G PV+TL + LA R
Sbjct: 542 RLELRPATSLEKFAQTYHQIDIALDPFPFCGG-FTSFDALWMGVPVITLEQQRLAGRQTL 600
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L LG P LIA +Y IA+ L D
Sbjct: 601 SMLMNLGLPHLIASNKTDYVSIALELSRD 629
>gi|254246744|ref|ZP_04940065.1| TPR repeat [Burkholderia cenocepacia PC184]
gi|124871520|gb|EAY63236.1| TPR repeat [Burkholderia cenocepacia PC184]
Length = 781
Score = 76.3 bits (186), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 76/146 (52%), Gaps = 5/146 (3%)
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ---ALGLDQHRI 112
Y +FN+L K+ P + +W VL + + L GE + Q + A G+D R+
Sbjct: 571 YGSFNRLNKLSPQVVAVWARVLHGAQTARMVLGAIG--GEEDRQTLTEWFSAAGIDAGRL 628
Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
F ++ ++++ D+CLDT G TT+++ LW GTP +TL G T+A R +++ L
Sbjct: 629 SFHRRSSIPVYMQQHHGVDLCLDTFPYTGSTTTLNALWMGTPTLTLSGATMAGRGSSTWL 688
Query: 173 ATLGCPELIARTHKEYQDIAIRLGTD 198
+G IA+ E+ A+ LG D
Sbjct: 689 QHVGLDAYIAKNEDEFVAKALALGKD 714
>gi|300867986|ref|ZP_07112625.1| hypothetical protein OSCI_3520023 [Oscillatoria sp. PCC 6506]
gi|300334007|emb|CBN57803.1| hypothetical protein OSCI_3520023 [Oscillatoria sp. PCC 6506]
Length = 2187
Score = 76.3 bits (186), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 79/166 (47%), Gaps = 10/166 (6%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL--------WLLKFPAVG 94
+R G+ E V+ + YKI P ++W +L AVPNS+L W +PA
Sbjct: 1512 TRNSLGISEQTNVFISGANFYKILPELREIWAKILAAVPNSMLILYPFGPAWTRSYPATP 1571
Query: 95 EAN-IQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLCNGHTTSMDVLWTG 152
N + A G++ +R+L V+ LADV LD+ G T+ +D L G
Sbjct: 1572 FVNNLYAVLAKYGVESNRLLIVKQLPSRGDVKEFLGLADVYLDSYPYAGATSLIDPLQVG 1631
Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
P V + G L R ++ L + PELIA + + Y +A+ LG++
Sbjct: 1632 LPTVVMEGNALRFRQGSAMLREIEIPELIADSQQSYIRLAVALGSN 1677
>gi|302878310|ref|YP_003846874.1| hypothetical protein Galf_1080 [Gallionella capsiferriformans ES-2]
gi|302581099|gb|ADL55110.1| Tetratricopeptide TPR_1 repeat-containing protein [Gallionella
capsiferriformans ES-2]
Length = 963
Score = 76.3 bits (186), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 77/145 (53%), Gaps = 5/145 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQ 109
I + +FN+L K +T W +L+A+P S + + L P E I +A G+
Sbjct: 771 ITFGSFNRLVKNSDATYAAWAKILRALPESRMIIKTEALDDPVTRERVIACFVRA-GIAA 829
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
RIL A+++H+ D+ LD G T+M+ L G PVVTL T+A R++A
Sbjct: 830 DRILLQGKTARDQHLAAFNQIDIALDPFPHGGGVTAMEGLMMGVPVVTLRWPTIAGRLSA 889
Query: 170 SQLATLGCPELIARTHKEYQDIAIR 194
S L TLG + IA T ++Y ++AI+
Sbjct: 890 SILTTLGLTDWIAETQQQYIELAIQ 914
>gi|222053752|ref|YP_002536114.1| hypothetical protein Geob_0648 [Geobacter daltonii FRC-32]
gi|221563041|gb|ACM19013.1| TPR repeat-containing protein [Geobacter daltonii FRC-32]
Length = 621
Score = 75.9 bits (185), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 72/144 (50%), Gaps = 5/144 (3%)
Query: 59 FNQLYKIDPSTLQMWVNVLKAVPNSILWLLK---FPAVGEAN-IQATAQALGLDQHRILF 114
FN + KI+ + +W +L + +S L LLK F G A Q G+D RI+
Sbjct: 433 FNNVAKINQRVITLWAKILHTLRDSRL-LLKNRLFDDPGCARTFQDRFTHAGIDPGRIML 491
Query: 115 SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLAT 174
++ E +R L D+ LDT G TT+ + LW G PVV + G T A R A S LA
Sbjct: 492 RGHSSHMETLREYSLVDIALDTFPFPGGTTTCEALWMGVPVVAMSGVTFAERQAISLLAN 551
Query: 175 LGCPELIARTHKEYQDIAIRLGTD 198
G E IA + +EY +I + L D
Sbjct: 552 AGHREWIATSEEEYMEIVLDLAAD 575
>gi|166363525|ref|YP_001655798.1| hypothetical protein MAE_07840 [Microcystis aeruginosa NIES-843]
gi|166085898|dbj|BAG00606.1| hypothetical protein MAE_07840 [Microcystis aeruginosa NIES-843]
Length = 264
Score = 75.9 bits (185), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 84/161 (52%), Gaps = 5/161 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
R +P+DAI+Y K + + + + +L+ VPNS + + F + E ++
Sbjct: 67 RTDLNIPDDAIIYLTVQSGLKRTLNMICLQLQILQQVPNSYILIKGFADKETIRELFLK- 125
Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
+A LG+ Q R+ F N +E H +AD+ LDT +G TT+++ LW G P+VT
Sbjct: 126 SADELGISQDRLRFLPNDFNEETHRANLGIADIVLDTYPYSGATTTLETLWMGIPLVTRV 185
Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
GE A+R + + + G + IA + +EY I+LG D +
Sbjct: 186 GEQFAARNSYTFMKNAGISQGIAWSDEEYVQWGIKLGLDEN 226
>gi|117924934|ref|YP_865551.1| hypothetical protein Mmc1_1636 [Magnetococcus marinus MC-1]
gi|117608690|gb|ABK44145.1| Tetratricopeptide TPR_2 repeat protein [Magnetococcus marinus MC-1]
Length = 968
Score = 75.9 bits (185), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 75/147 (51%), Gaps = 6/147 (4%)
Query: 57 CNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALGLDQHRI 112
+FN L K+ L++W +L +P + L LLK G + ++LG+D RI
Sbjct: 500 ASFNTLTKMGHEVLRLWAEILTRLPQATL-LLKNGGFGSQAVVDDYLQRFESLGIDSKRI 558
Query: 113 LFSNVAAK-EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
V EEH+ D+ LDT +G TT+ + LW G PVVTL G+ +RV S
Sbjct: 559 RCMGVTDTLEEHLNLYNQVDIALDTFPYHGTTTTCEALWMGVPVVTLQGDHHVNRVGGSL 618
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
L +G +L+A T K++ D+ + L D
Sbjct: 619 LHQVGLHKLVASTPKQFVDLVVGLAQD 645
>gi|39997695|ref|NP_953646.1| TPR domain/SEC-C motif domain-containing protein [Geobacter
sulfurreducens PCA]
gi|39984587|gb|AAR35973.1| TPR domain/SEC-C motif domain protein [Geobacter sulfurreducens
PCA]
Length = 585
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 69/141 (48%), Gaps = 6/141 (4%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALGLD 108
+ + +FN L K+ P + W +L VP S L +LKF + + A G+
Sbjct: 392 GVTFGSFNNLAKVSPQVIDAWSAILGRVPGSRL-MLKFKGFADETCRRRVSDAFGARGVS 450
Query: 109 QHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
R+ L S V +H+ D+ LDT NG T+ D LW G PV+T G +RV
Sbjct: 451 PDRLELVSFVENFNDHLALYNRLDLALDTFPYNGTATTCDALWMGVPVLTCCGTRHVARV 510
Query: 168 AASQLATLGCPELIARTHKEY 188
AS L +LG PELI + +EY
Sbjct: 511 GASILTSLGLPELITHSPEEY 531
>gi|409913047|ref|YP_006891512.1| TPR domain/SEC-C motif domain-containing protein [Geobacter
sulfurreducens KN400]
gi|307635063|gb|ADI85350.2| TPR domain/SEC-C motif domain protein [Geobacter sulfurreducens
KN400]
Length = 585
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 69/141 (48%), Gaps = 6/141 (4%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALGLD 108
+ + +FN L K+ P + W +L VP S L +LKF + + A G+
Sbjct: 392 GVTFGSFNNLAKVSPQVIDAWSAILGRVPGSRL-MLKFKGFADETCRRRVSDAFGARGVS 450
Query: 109 QHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
R+ L S V +H+ D+ LDT NG T+ D LW G PV+T G +RV
Sbjct: 451 PDRLELVSFVEDFNDHLALYNRLDLALDTFPYNGTATTCDALWMGVPVLTCCGTRHVARV 510
Query: 168 AASQLATLGCPELIARTHKEY 188
AS L +LG PELI + +EY
Sbjct: 511 GASILTSLGLPELITHSPEEY 531
>gi|428179442|gb|EKX48313.1| hypothetical protein GUITHDRAFT_105920 [Guillardia theta CCMP2712]
Length = 698
Score = 75.5 bits (184), Expect = 1e-11, Method: Composition-based stats.
Identities = 50/152 (32%), Positives = 79/152 (51%), Gaps = 15/152 (9%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
+ +CN + LYK+DP+ W+ +L+ V S L LL E+++ ++A + RI+
Sbjct: 479 LTFCNHDSLYKLDPTRFTSWMKLLEEVEGSTLLLLAQGNSTESSL--LSRAPPHLRSRIV 536
Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS--- 170
F +H+ R + D+ LDT +GHTTS + LW G P+V+LP A RV +S
Sbjct: 537 FQPFLDVRKHLERIRACDLALDTHGISGHTTSANYLWGGVPLVSLPSFMWAGRVTSSLLS 596
Query: 171 ------QLATLGCPELIARTHKEYQDIAIRLG 196
LA LG IAR+ ++ ++ RL
Sbjct: 597 ALSASCHLALLG----IARSEDDFVQLSKRLA 624
>gi|425464282|ref|ZP_18843604.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389833752|emb|CCI21473.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 719
Score = 75.1 bits (183), Expect = 1e-11, Method: Composition-based stats.
Identities = 47/162 (29%), Positives = 83/162 (51%), Gaps = 8/162 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
R+ G+ DA++Y + K +P+ +++ + ++K+VPNS + V + N
Sbjct: 519 REDLGINNDAVIYLSSQTAIKRNPAMIRLQMQIIKSVPNSYFLI---QGVADDNSLLDLF 575
Query: 99 QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A +G++ +RI + E + +ADV LDT NG TT+++ LW G P+V
Sbjct: 576 CQIAAEVGVETNRIKMLPLYQTETYRANLAIADVVLDTYPFNGGTTTLETLWMGIPLVVK 635
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
G+ +SR + + G E IA + +EY I+LG D++
Sbjct: 636 VGQQWSSRNGYTLMMNAGITEGIAWSDEEYVQWGIKLGLDKN 677
>gi|428181283|gb|EKX50147.1| hypothetical protein GUITHDRAFT_135331 [Guillardia theta CCMP2712]
Length = 511
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 84/150 (56%), Gaps = 11/150 (7%)
Query: 56 YCNFNQLYKID--PSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
Y ++N L K+ P + + L+ +P S F G + + +AL +++
Sbjct: 325 YMHYNILDKVQAPPEHAEFYSERLQLLPVS-----AFKGEGSSEEEVMTEALPAERNLKN 379
Query: 114 FSNVAA---KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG-ETLASRVAA 169
F+ ++E ++ ++AD+ LDTP N H T+ +VL +G PV+TLPG + +SRVAA
Sbjct: 380 FAKEFKNFLEQERIKGKRVADLFLDTPEYNAHGTATEVLSSGVPVLTLPGSKAHSSRVAA 439
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDR 199
S + G PE+IAR ++Y++IA +L + R
Sbjct: 440 SVVIAAGLPEIIARDLEDYEEIATKLISRR 469
>gi|71906424|ref|YP_284011.1| hypothetical protein Daro_0785 [Dechloromonas aromatica RCB]
gi|71846045|gb|AAZ45541.1| TPR repeat [Dechloromonas aromatica RCB]
Length = 796
Score = 75.1 bits (183), Expect = 1e-11, Method: Composition-based stats.
Identities = 46/144 (31%), Positives = 69/144 (47%), Gaps = 3/144 (2%)
Query: 60 NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA---QALGLDQHRILFSN 116
N+ K+ L +W ++L VPNS L L+ E ++A Q G+ R+
Sbjct: 607 NRFSKVSMRALDLWGDLLLKVPNSRLLLIGRGGSDEQTVRALRKRFQEKGVPAERLEILE 666
Query: 117 VAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLG 176
E++ D+CLD NG TT D +W G P VTL G++L SR ++ L +
Sbjct: 667 SKPPHEYLDTYNQVDLCLDPFPFNGGTTGFDSIWMGVPYVTLRGDSLHSRAGSNILKYVD 726
Query: 177 CPELIARTHKEYQDIAIRLGTDRD 200
+L+A + EY A+ L DRD
Sbjct: 727 LADLVADSESEYVAKAVALAADRD 750
>gi|302878313|ref|YP_003846877.1| hypothetical protein Galf_1083 [Gallionella capsiferriformans ES-2]
gi|302581102|gb|ADL55113.1| Tetratricopeptide TPR_1 repeat-containing protein [Gallionella
capsiferriformans ES-2]
Length = 750
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 78/146 (53%), Gaps = 5/146 (3%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN----IQATAQALGLD 108
+I + +FN+L KI T ++W +L VP S L +LK P + E + I + GL
Sbjct: 553 SITFGSFNRLAKISGQTYRIWAQILLQVPGSCL-ILKTPELNERSNRERILRYFENAGLT 611
Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
RI+ ++ EH+R Q D+ LD G T+M+ L G PV+TL L R++
Sbjct: 612 ADRIVMLGRSSWFEHMRAYQQVDIALDPFPHGGGMTAMEGLMMGVPVITLNWPILTGRLS 671
Query: 169 ASQLATLGCPELIARTHKEYQDIAIR 194
+S + TLG + I ++ +Y ++AI+
Sbjct: 672 SSLMTTLGLQDWIVQSEDQYLELAIK 697
>gi|428309702|ref|YP_007120679.1| O-linked N-acetylglucosamine transferase, SPINDLY family
[Microcoleus sp. PCC 7113]
gi|428251314|gb|AFZ17273.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Microcoleus sp. PCC 7113]
Length = 506
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 82/166 (49%), Gaps = 4/166 (2%)
Query: 34 EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV 93
EVP + +R +GL +DA+VY YK P ++V + K VP + ++ V
Sbjct: 301 EVPS--LTKTRSDFGLRDDAVVYLCCQAPYKYLPQHDFIFVEIAKRVPQAQFVFVR-ADV 357
Query: 94 GEANIQATAQALGLD-QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTG 152
+ ++AT A+G + +F + E ++R L+DV LDT G T++ L
Sbjct: 358 LKPRLKATFAAVGKSSEDYCVFLSFLNHENYLRLNLLSDVFLDTLGFTGGNTTLAALACN 417
Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
PVVT PGE + ++ L L E IA EY +IA+RLG +
Sbjct: 418 LPVVTCPGECMRGYMSTGMLKMLRLSETIAENEAEYIEIAVRLGLE 463
>gi|422343493|ref|ZP_16424421.1| hypothetical protein HMPREF9432_00481 [Selenomonas noxia F0398]
gi|355378800|gb|EHG25980.1| hypothetical protein HMPREF9432_00481 [Selenomonas noxia F0398]
Length = 1079
Score = 75.1 bits (183), Expect = 2e-11, Method: Composition-based stats.
Identities = 51/150 (34%), Positives = 76/150 (50%), Gaps = 7/150 (4%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLD 108
+IV+ +FN K++ L +W +L+ VP S L LLK A+ + A +A G+
Sbjct: 361 SIVFGSFNNFTKLNDRVLSVWAEILRRVPESRL-LLKTDVFSYADSRREALRRIEAAGIP 419
Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ A +++ D+ LD G T+ D L+ G PVVTL GE+L SR
Sbjct: 420 LVRV--DAEGASADYLAAYARVDIALDPFPYPGGGTTCDALYMGVPVVTLAGESLGSRFG 477
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
AS L +G EL A T ++Y +A+ L D
Sbjct: 478 ASLLENIGAKELAAHTEEDYIALAVSLAQD 507
Score = 63.5 bits (153), Expect = 5e-08, Method: Composition-based stats.
Identities = 52/177 (29%), Positives = 83/177 (46%), Gaps = 18/177 (10%)
Query: 35 VPQSIVITSRQQYGLPEDA---------IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
+P V R GLP A I + FNQ K + L +W +++ VP + L
Sbjct: 862 LPSQFVYVPRA--GLPAAAETPARSRGYITFGVFNQYRKYTDAMLTVWREIMERVPTARL 919
Query: 86 WLLKFPAVGEANIQATAQA----LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNG 141
L+K + + TA+A LG D R+ A ++++R D+ LDT G
Sbjct: 920 -LIKSQVFFSSAMTETARARMKRLGFDLRRVALE--PATTDYMQRYLNVDIALDTYPWPG 976
Query: 142 HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
T+ D L+ G PVVT+ G ++R + + LA +G +L +T +Y + A+ L D
Sbjct: 977 GGTTCDALYMGVPVVTMYGARRSTRFSYALLAHIGRTDLAVQTPADYIERAVSLAGD 1033
>gi|288963191|ref|YP_003453470.1| TPR repeat-containing protein [Azospirillum sp. B510]
gi|288915443|dbj|BAI76926.1| TPR repeat-containing protein [Azospirillum sp. B510]
Length = 1246
Score = 75.1 bits (183), Expect = 2e-11, Method: Composition-based stats.
Identities = 52/152 (34%), Positives = 74/152 (48%), Gaps = 5/152 (3%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLD 108
A+ + +FN + K++P +++W VL+AVP S L L L P V +A A G+
Sbjct: 424 AVTFGSFNLIAKLNPRVIRVWSRVLQAVPASRLLLKSINLSDPVVCHRLEEAFA-VHGIG 482
Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
+ R+ ++ D+ LD G TS + LW G P+VT+PGE SR
Sbjct: 483 RDRLELRGPSSHAAMFEEYGDVDIALDPFPFTGGITSCEALWMGLPIVTMPGERPVSRQT 542
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A L LG LIAR EY IA L +D +
Sbjct: 543 AGLLDILGLNGLIARDEDEYVRIAAALASDSE 574
>gi|381160367|ref|ZP_09869599.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
gi|380878431|gb|EIC20523.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
Length = 697
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 77/151 (50%), Gaps = 5/151 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT----AQALGLDQ 109
I + +FN + K+ T+ +W ++ VPNS L +++ A+G+A ++ G+ Q
Sbjct: 498 IKFASFNNIAKLSDQTIWLWAQIILRVPNSQL-VIRDKALGDATVRQRIIDRFAIQGVVQ 556
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ +EE++ D+ L G TT+ + LW G PVV L G T R++
Sbjct: 557 ERLDIKPSLRREEYLESYNDVDISLSPTPFGGGTTTAEALWMGVPVVCLRGGTWVGRISE 616
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S + T+G +L+A T +EY IA L T D
Sbjct: 617 SIIKTVGLRDLVAETEEEYIHIATSLATRAD 647
>gi|67640477|ref|ZP_00439282.1| TPR domain protein [Burkholderia mallei GB8 horse 4]
gi|121600690|ref|YP_994729.1| hypothetical protein BMASAVP1_A3451 [Burkholderia mallei SAVP1]
gi|124383342|ref|YP_001027648.1| hypothetical protein BMA10229_A1668 [Burkholderia mallei NCTC
10229]
gi|126449970|ref|YP_001082618.1| hypothetical protein BMA10247_3099 [Burkholderia mallei NCTC 10247]
gi|167000422|ref|ZP_02266236.1| TPR domain protein [Burkholderia mallei PRL-20]
gi|254174654|ref|ZP_04881315.1| TPR domain protein [Burkholderia mallei ATCC 10399]
gi|254201458|ref|ZP_04907822.1| TPR domain protein [Burkholderia mallei FMH]
gi|254206800|ref|ZP_04913151.1| TPR domain protein [Burkholderia mallei JHU]
gi|254357176|ref|ZP_04973450.1| TPR domain protein [Burkholderia mallei 2002721280]
gi|121229500|gb|ABM52018.1| TPR domain protein [Burkholderia mallei SAVP1]
gi|124291362|gb|ABN00631.1| TPR domain protein [Burkholderia mallei NCTC 10229]
gi|126242840|gb|ABO05933.1| TPR domain protein [Burkholderia mallei NCTC 10247]
gi|147747352|gb|EDK54428.1| TPR domain protein [Burkholderia mallei FMH]
gi|147752342|gb|EDK59408.1| TPR domain protein [Burkholderia mallei JHU]
gi|148026240|gb|EDK84325.1| TPR domain protein [Burkholderia mallei 2002721280]
gi|160695699|gb|EDP85669.1| TPR domain protein [Burkholderia mallei ATCC 10399]
gi|238521193|gb|EEP84646.1| TPR domain protein [Burkholderia mallei GB8 horse 4]
gi|243063614|gb|EES45800.1| TPR domain protein [Burkholderia mallei PRL-20]
Length = 776
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 76/147 (51%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT+++ LW G P +T+ GETL SR +
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTALNALWMGVPTLTMRGETLPSRAGLTW 683
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA ++ I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710
>gi|428315272|ref|YP_007113154.1| glycosyl transferase family 2 [Oscillatoria nigro-viridis PCC 7112]
gi|428238952|gb|AFZ04738.1| glycosyl transferase family 2 [Oscillatoria nigro-viridis PCC 7112]
Length = 1301
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 85/165 (51%), Gaps = 9/165 (5%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA------ 96
+R +G ++++V+ + Q I P + W ++ AVPNSIL L F + E
Sbjct: 1077 TRSSWGATDESVVFVSGAQALTITPELRETWAKIIAAVPNSILVLYPFSSRSENYPTVPF 1136
Query: 97 --NIQATAQALGLDQHR-ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGT 153
I+A G+D+ R ++ + ++ + ++ +LAD+ LD+ + + + L G
Sbjct: 1137 YNQIRALFAEYGIDKKRLVVIKALPSRADCIKCLELADIYLDSYPYSSACSLAEPLLVGL 1196
Query: 154 PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
PVV G+T SR +A+ L L PE++A + Y D++ LGT+
Sbjct: 1197 PVVARKGQTTRSRQSAAVLEELLLPEMVADSENSYLDLSTALGTN 1241
>gi|395650965|ref|ZP_10438815.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
protein [Pseudomonas extremaustralis 14-3 substr. 14-3b]
Length = 1104
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/151 (30%), Positives = 72/151 (47%), Gaps = 5/151 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG----EANIQATAQALGLDQ 109
+ Y FN K++ L W +++A P+S L LLK A G ++ A G+D+
Sbjct: 423 VTYGCFNNPTKVNDVLLAHWAELMRATPDSRL-LLKGGAFGTRALRDHVHGVMAAHGIDR 481
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R++ + + D+ LD +G T+ + L G PVVTLPG T A R +A
Sbjct: 482 ERVMIEGPVGHKHLLETYNRVDIALDPWPYSGGLTTCEALLMGVPVVTLPGPTFAGRHSA 541
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+ L G PEL+ + +Y+ + L D D
Sbjct: 542 THLVNAGLPELVVHSWAQYRQRVMELAGDLD 572
>gi|304438234|ref|ZP_07398176.1| TPR domain/SEC-C domain protein [Selenomonas sp. oral taxon 149
str. 67H29BP]
gi|304368841|gb|EFM22524.1| TPR domain/SEC-C domain protein [Selenomonas sp. oral taxon 149
str. 67H29BP]
Length = 1077
Score = 74.7 bits (182), Expect = 2e-11, Method: Composition-based stats.
Identities = 50/154 (32%), Positives = 74/154 (48%), Gaps = 15/154 (9%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRI 112
IV+ +FN K++ L++W +L+ VP S L+L +A+I + A RI
Sbjct: 359 GIVFGSFNNFTKVNDRVLRVWAEILRRVPESRLFL-------KADIFSEDDAHAEVLERI 411
Query: 113 LFSNVAAK--------EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
+ + +++ D+ LD G T+ D L+ G PVVTL GETL
Sbjct: 412 AAAGIPTSCVDTEGNSPDYLAAYNRMDIALDPFPYPGGGTTCDALYMGVPVVTLSGETLG 471
Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
SR AS L +G LIA T +EY +A+ L D
Sbjct: 472 SRFGASLLTNIGAEALIAHTEEEYISLAVSLAGD 505
Score = 66.6 bits (161), Expect = 5e-09, Method: Composition-based stats.
Identities = 54/171 (31%), Positives = 77/171 (45%), Gaps = 12/171 (7%)
Query: 34 EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK---- 89
E+P +RQ+ AI++ FNQ K L +W +L+ VP S L LLK
Sbjct: 872 ELPAPTGTPARQR-----GAILFGVFNQYRKWTDEMLCVWREILERVPQSKL-LLKSQVF 925
Query: 90 FPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVL 149
F A + LG D RIL A +++ R D+ LDT G T+ D L
Sbjct: 926 FAAAMRREAEERLARLGFDLDRILLE--PATTDYMERYLDVDIALDTYPWPGGGTTCDAL 983
Query: 150 WTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+ G PVVT+ G ++R + + L +G EL + Y A+ L D D
Sbjct: 984 YMGVPVVTMYGTRRSTRFSYAMLHLVGAEELAVTSASAYISCAVSLAEDPD 1034
>gi|425468293|ref|ZP_18847323.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
gi|389885053|emb|CCI34704.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
Length = 719
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 83/162 (51%), Gaps = 8/162 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
R+ G+ DA++Y + K +P+ +++ + +LK+VPNS + V + N
Sbjct: 519 REDLGINNDAVIYLSSQTAIKRNPAMIRLQMQILKSVPNSYFLI---QGVADDNSLLDLF 575
Query: 99 QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A +G++ +RI + E + +ADV LDT NG TT+++ LW G P+V
Sbjct: 576 CQIAAEVGVETNRIKMLPLYQTETYRANLAIADVVLDTYPFNGGTTTLETLWMGIPLVVK 635
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
G+ +SR + + G E IA + +EY I+LG D++
Sbjct: 636 VGQQWSSRNGYTLMMNAGITEGIAWSDEEYVQWGIKLGLDKN 677
>gi|401408423|ref|XP_003883660.1| hypothetical protein NCLIV_034150 [Neospora caninum Liverpool]
gi|325118077|emb|CBZ53628.1| hypothetical protein NCLIV_034150 [Neospora caninum Liverpool]
Length = 987
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 78/150 (52%), Gaps = 5/150 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQH 110
I + +FN L K+ +++W +LKAVP S L L P E +A +A G+
Sbjct: 792 ITFGSFNNLAKLGKQVIELWSRILKAVPTSRLLLKARPFANEEMQRKFKAKFEAHGVSGD 851
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL-ASRVA 168
RI + V A +H+ L D+ LD+ G TT+ + L G PVV+L + + A V
Sbjct: 852 RIDAMALVPACMDHLMVYSLVDIALDSFPYAGTTTTCEALVMGVPVVSLRRKNIHAHNVG 911
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A+ LA G PEL+A T ++Y +A+ L D
Sbjct: 912 ATLLAHYGLPELVADTPEQYVRLAVELAGD 941
>gi|357059079|ref|ZP_09119924.1| hypothetical protein HMPREF9334_01641 [Selenomonas infelix ATCC
43532]
gi|355372987|gb|EHG20325.1| hypothetical protein HMPREF9334_01641 [Selenomonas infelix ATCC
43532]
Length = 561
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/154 (33%), Positives = 73/154 (47%), Gaps = 17/154 (11%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQ 109
+V+ +FN K+ + L+ W +L+ VP S L L P E ++ A G+
Sbjct: 365 VVFGSFNNFTKLSDTVLRTWAEILRRVPESRLLLKTDVFSCPDAREEALRRI-MAAGIPS 423
Query: 110 HRILFSNVAAKEEHVRRGQLA-----DVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
R+ + E +G LA D+ LD G T+ D L+ G PVVTL GE+L
Sbjct: 424 ERV-------ETEGTSQGYLAAYNRVDIALDPFPYPGGGTTCDALYMGVPVVTLAGESLG 476
Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
SR AS L +G LIA T +EY +A L D
Sbjct: 477 SRFGASLLENIGAGALIAHTEEEYTALAAALAGD 510
>gi|425443193|ref|ZP_18823419.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389715556|emb|CCI00089.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 569
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 83/162 (51%), Gaps = 8/162 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
R+ G+ DA++Y + K +P+ +++ + +LK+VPNS + V + N
Sbjct: 369 REDLGINNDAVIYLSSQTAIKRNPAMIRLQMQILKSVPNSYFLI---QGVADDNSLWDLF 425
Query: 99 QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A +G++ +RI + E + +ADV LDT NG TT+++ LW G P+V
Sbjct: 426 CQIAAEVGVETNRIKMLPLYQTETYRANLAIADVVLDTYPFNGGTTTLETLWMGIPLVVK 485
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
G+ +SR + + G E IA + +EY I+LG D++
Sbjct: 486 VGQQWSSRNGYTLMMNAGITEGIAWSDEEYVQWGIKLGLDKN 527
>gi|94971753|ref|YP_593801.1| hypothetical protein Acid345_4728 [Candidatus Koribacter versatilis
Ellin345]
gi|94553803|gb|ABF43727.1| TPR repeat protein [Candidatus Koribacter versatilis Ellin345]
Length = 718
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 65/137 (47%), Gaps = 4/137 (2%)
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV---GEANIQATAQALGLDQHRI 112
+ + N+ KI +L +W V++AVP S L L A I A G+ RI
Sbjct: 525 FGSLNRYSKIGQESLDLWAKVMEAVPKSKLLLEDRTADDTDAHQRISAELATHGIGADRI 584
Query: 113 LFS-NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
F + E H+R DV LDT N TT+ D LW G P+V + G ASR+AA
Sbjct: 585 EFEPYIPGHERHMRLYDRVDVALDTIPLNSGTTACDALWMGVPLVAMEGNRTASRIAAGF 644
Query: 172 LATLGCPELIARTHKEY 188
L +G E IA + + Y
Sbjct: 645 LRAIGRTEWIADSEQNY 661
>gi|53724345|ref|YP_104386.1| hypothetical protein BMA2876 [Burkholderia mallei ATCC 23344]
gi|52427768|gb|AAU48361.1| TPR domain protein [Burkholderia mallei ATCC 23344]
Length = 821
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 44/147 (29%), Positives = 76/147 (51%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 610 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 668
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT+++ LW G P +T+ GETL SR +
Sbjct: 669 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTALNALWMGVPTLTMRGETLPSRAGLTW 728
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA ++ I L +D
Sbjct: 729 MSHVGLESFIADDIDDFVAKGIALASD 755
>gi|384253147|gb|EIE26622.1| O-linked N-acetylglucosamine transferase [Coccomyxa subellipsoidea
C-169]
Length = 937
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 74/150 (49%), Gaps = 5/150 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQATAQALGLDQH 110
+ + +FN L KI P + W +L A+P+S L L P A +++ QA G+D
Sbjct: 700 VTFGSFNNLAKITPRVMAAWARILTALPSSRLVLKNKPFACASARSHVLGQLQAAGVDAS 759
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L AA +H+ L D+ LD G TT+ + L+ G PV+TL G A V A
Sbjct: 760 RVDLLPLAAANSDHLATYALMDISLDPFPYAGTTTTCESLYMGVPVITLAGGCHAHNVGA 819
Query: 170 SQLATLGCPE-LIARTHKEYQDIAIRLGTD 198
S L +G E +A + EY +A+ D
Sbjct: 820 SLLDAMGMREGWVAHSEDEYVRLAVAAAAD 849
>gi|428180905|gb|EKX49771.1| hypothetical protein GUITHDRAFT_162101 [Guillardia theta CCMP2712]
Length = 758
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 47/166 (28%), Positives = 75/166 (45%), Gaps = 28/166 (16%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSI--LWLLKFPAVGEANIQ------------ 99
I++ NFN LYKIDP+ + +W + L + + LW+ P N++
Sbjct: 532 IIFANFNSLYKIDPALVNVWKSALFSSNDKSWQLWITNRPKKAAENLKNHLTDSKNELRQ 591
Query: 100 --------------ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
+ + IL +++ + ++H+R AD+ LD+ H+T+
Sbjct: 592 WSESEDESLSHQFFTSQEKEKFSDELILVTDLQSAKDHIRVKSCADIFLDSFRLGSHSTA 651
Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDI 191
D L G +VTLP SRVA+S LA G I RT +EY D+
Sbjct: 652 ADALAAGLVLVTLPHIGALSRVASSLLAASGASATIVRTSQEYHDL 697
>gi|428181273|gb|EKX50137.1| hypothetical protein GUITHDRAFT_103951 [Guillardia theta CCMP2712]
Length = 865
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 34/93 (36%), Positives = 56/93 (60%)
Query: 47 YGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALG 106
+ +P D+IV FN++ KID T + W+ +L+ ++LWL++FP + I A G
Sbjct: 618 HDVPLDSIVLACFNRIQKIDEVTFETWMRILEQESRTVLWLIEFPKEAQRRINQHASRRG 677
Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLC 139
+D R+++S +AA+EE + AD+ LDTP C
Sbjct: 678 VDPRRLIWSPMAAEEEFMHMAAAADLFLDTPRC 710
>gi|409992536|ref|ZP_11275720.1| type 11 methyltransferase [Arthrospira platensis str. Paraca]
gi|291569835|dbj|BAI92107.1| probable glycosyl transferase [Arthrospira platensis NIES-39]
gi|409936606|gb|EKN78086.1| type 11 methyltransferase [Arthrospira platensis str. Paraca]
Length = 2556
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/172 (30%), Positives = 84/172 (48%), Gaps = 11/172 (6%)
Query: 36 PQ-SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
PQ + V R Q+G + IV+ + YKI P + WV +L VPNSIL L F
Sbjct: 2023 PQPATVKPHRSQWGANDQTIVFMSGANFYKIIPEQGETWVKILAQVPNSILVLYPFAPSW 2082
Query: 95 EAN---------IQATAQALGLDQHR-ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTT 144
+N + AT G++ R +L ++ ++ + + D+ LD+ +G +
Sbjct: 2083 SSNYASSQFIRRMYATLSKYGVEDKRLVLIKSLPSRTDVKACLEQGDIYLDSYPHSGSNS 2142
Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
+D L G PVV G+TL S+ A+ + +LG P+LI + Y +A+ L
Sbjct: 2143 LVDPLEVGLPVVVRDGDTLRSKHGAAMMRSLGIPDLITDSESSYISLAVTLA 2194
>gi|425454004|ref|ZP_18833753.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
gi|389799815|emb|CCI20675.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
Length = 724
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 85/162 (52%), Gaps = 8/162 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
R+ G+ DA++Y + K +P+ +++ + ++K+VPNS L++ V + N
Sbjct: 519 REDLGINNDAVIYLSSQTAIKRNPAMIRLQMQIIKSVPNSYF-LIQ--GVADDNSLWDLF 575
Query: 99 QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A +G++ +RI + E + +ADV LDT NG TT+++ LW G P+V
Sbjct: 576 CQIAAEVGVETNRIKMLPIYQTETYRANLAIADVVLDTYPFNGGTTTLETLWMGIPLVVK 635
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
G+ +SR + + G E IA + +EY I+LG D++
Sbjct: 636 VGQQWSSRNGYTLMMNAGITEGIAWSDEEYVQWGIKLGLDKN 677
>gi|334126612|ref|ZP_08500561.1| TPR domain/SEC-C domain protein [Centipeda periodontii DSM 2778]
gi|333391283|gb|EGK62401.1| TPR domain/SEC-C domain protein [Centipeda periodontii DSM 2778]
Length = 548
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 76/155 (49%), Gaps = 15/155 (9%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
+V+ +FN K++ L+ W +L+ VP S L+L + ++ + A A HR+
Sbjct: 360 VVFGSFNNFTKLNDRVLRTWAEILRRVPESRLFL-------KTDVFSYADARAEVLHRMG 412
Query: 114 FSNV--------AAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLAS 165
+ + + ++++ D+ LD G T+ D L+ G PVVTL G +L S
Sbjct: 413 EAGIPLDRVDTEGSSKDYLAAYNRVDIALDPFPYPGGGTTCDALYMGVPVVTLAGASLGS 472
Query: 166 RVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
R AS L +G L+AR +EY +A+ L D D
Sbjct: 473 RFGASLLENVGAGALVARAEEEYIALAVSLAGDAD 507
>gi|166363522|ref|YP_001655795.1| hypothetical protein MAE_07810 [Microcystis aeruginosa NIES-843]
gi|166085895|dbj|BAG00603.1| hypothetical protein MAE_07810 [Microcystis aeruginosa NIES-843]
Length = 719
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 83/162 (51%), Gaps = 8/162 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
R+ G+ DA++Y + K +P+ +++ + ++K+VPNS + V + N
Sbjct: 519 REDLGINNDAVIYLSSQTAIKRNPAMIRLQMQIIKSVPNSYFLI---QGVADDNSLWDLF 575
Query: 99 QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A +G++ +RI + E + +ADV LDT NG TT+++ LW G P+V
Sbjct: 576 CQIAAEVGVETNRIKMLPLYQTETYRANLAIADVVLDTYPFNGGTTTLETLWMGIPLVVK 635
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
G+ +SR + + G E I+ + +EY I+LG DR+
Sbjct: 636 VGQQWSSRNGYTLMMNAGITEGISWSDEEYVQWGIKLGLDRN 677
>gi|357635838|ref|ZP_09133715.1| transcriptional activator domain protein [Desulfovibrio sp.
FW1012B]
gi|357580678|gb|EHJ46012.1| transcriptional activator domain protein [Desulfovibrio sp.
FW1012B]
Length = 788
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 78/151 (51%), Gaps = 6/151 (3%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLD 108
A+ + +FN+L K+ +TL +W +L A+P + L + L P A + A+A GL
Sbjct: 591 AVTFGSFNKLAKLSDTTLALWKEILAALPRATLRIKDIALSDPGCRAALAERCARA-GLP 649
Query: 109 QHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
R+ L ++EH+ + D+ LDT +G TT+ + LW G PVVTL GE SRV
Sbjct: 650 MDRLALLPGSPTRQEHLSQYGTIDIALDTFPYHGTTTTCEALWMGVPVVTLIGEQHHSRV 709
Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L++L A +Y A+ L +D
Sbjct: 710 GLSVLSSLDATAWAATDKADYIGKAVALASD 740
>gi|381160359|ref|ZP_09869591.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
gi|380878423|gb|EIC20515.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Thiorhodovibrio sp. 970]
Length = 955
Score = 73.9 bits (180), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 55/155 (35%), Positives = 79/155 (50%), Gaps = 8/155 (5%)
Query: 50 PEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF----PAVGEANIQATAQAL 105
P + +FNQ K+ +L++W +L+AVP + L+L P V E I T
Sbjct: 749 PAGRLWLASFNQFAKVREESLELWSEILRAVPQAHLFLKDKKAADPMVRE-RILGTLARH 807
Query: 106 GLDQHRI-LFSNVAAKEEHVRRGQLADVCLD-TPLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+ R+ + S + +H+ D+ LD TP + TT D LW GTP+VTL G+
Sbjct: 808 GIHADRVKMASRTNSWSDHMGLYNYVDLALDPTPFTSA-TTGFDALWMGTPLVTLLGDQP 866
Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
A R AAS L LG + IA T ++Y IA+ L D
Sbjct: 867 AGRQAASLLTGLGRTDWIACTQEDYVRIALDLLDD 901
>gi|167740689|ref|ZP_02413463.1| putative TPR domain protein [Burkholderia pseudomallei 14]
Length = 629
Score = 73.9 bits (180), Expect = 4e-11, Method: Composition-based stats.
Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 418 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 476
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR +
Sbjct: 477 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 536
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA ++ I L +D
Sbjct: 537 MSHVGLESFIADDIDDFVAKGIALASD 563
>gi|167721711|ref|ZP_02404947.1| putative TPR domain protein [Burkholderia pseudomallei DM98]
Length = 622
Score = 73.9 bits (180), Expect = 4e-11, Method: Composition-based stats.
Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 411 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 469
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR +
Sbjct: 470 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 529
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA ++ I L +D
Sbjct: 530 MSHVGLESFIADDIDDFVAKGIALASD 556
>gi|167896374|ref|ZP_02483776.1| putative TPR domain protein [Burkholderia pseudomallei 7894]
gi|167920964|ref|ZP_02508055.1| putative TPR domain protein [Burkholderia pseudomallei BCC215]
gi|254261993|ref|ZP_04953047.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 1710a]
gi|254220682|gb|EET10066.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 1710a]
Length = 776
Score = 73.9 bits (180), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR +
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA ++ I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710
>gi|254194528|ref|ZP_04900959.1| tetratricopeptide repeat protein [Burkholderia pseudomallei S13]
gi|169651278|gb|EDS83971.1| tetratricopeptide repeat protein [Burkholderia pseudomallei S13]
Length = 776
Score = 73.9 bits (180), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR +
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA ++ I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710
>gi|126441749|ref|YP_001060867.1| hypothetical protein BURPS668_3873 [Burkholderia pseudomallei 668]
gi|126221242|gb|ABN84748.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 668]
Length = 776
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR +
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA ++ I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710
>gi|300868250|ref|ZP_07112881.1| TPR repeat-containing protein [Oscillatoria sp. PCC 6506]
gi|300333774|emb|CBN58065.1| TPR repeat-containing protein [Oscillatoria sp. PCC 6506]
Length = 728
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 85/166 (51%), Gaps = 9/166 (5%)
Query: 41 ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
I R+ + + + +VY + K + ++ V +LK VPNSIL A+G+ +
Sbjct: 522 IDLRKSHRIGLEQVVYLCVSPGRKFNRELVKAQVAILKQVPNSIL---IHKAIGDIEVFQ 578
Query: 101 TA-----QALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
+A +A G+ +HRI F V +EEH LADV LD+ NG T +++ LW P
Sbjct: 579 SAYHQACEAEGIGKHRIKFLPRVPTEEEHRLIYLLADVFLDSYPYNGGTHTLEALWFNLP 638
Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
VVT GE SR+ S + L ++ + +EY + ++LG + D
Sbjct: 639 VVTRVGEQFLSRMGYSFMQALKMETGVSWSWEEYTEWGVKLGNNLD 684
>gi|167913028|ref|ZP_02500119.1| putative TPR domain protein [Burkholderia pseudomallei 112]
Length = 776
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR +
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA ++ I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710
>gi|134281637|ref|ZP_01768345.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 305]
gi|167826278|ref|ZP_02457749.1| putative TPR domain protein [Burkholderia pseudomallei 9]
gi|226193032|ref|ZP_03788642.1| tetratricopeptide repeat protein [Burkholderia pseudomallei
Pakistan 9]
gi|134247304|gb|EBA47390.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 305]
gi|225934632|gb|EEH30609.1| tetratricopeptide repeat protein [Burkholderia pseudomallei
Pakistan 9]
Length = 776
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR +
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA ++ I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710
>gi|53720932|ref|YP_109918.1| TPR domain-containing protein [Burkholderia pseudomallei K96243]
gi|167817901|ref|ZP_02449581.1| putative TPR domain protein [Burkholderia pseudomallei 91]
gi|167847785|ref|ZP_02473293.1| putative TPR domain protein [Burkholderia pseudomallei B7210]
gi|167904747|ref|ZP_02491952.1| putative TPR domain protein [Burkholderia pseudomallei NCTC 13177]
gi|254298777|ref|ZP_04966228.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 406e]
gi|386863576|ref|YP_006276525.1| hypothetical protein BP1026B_I3558 [Burkholderia pseudomallei
1026b]
gi|418534702|ref|ZP_13100540.1| TPR domain-containing protein [Burkholderia pseudomallei 1026a]
gi|52211346|emb|CAH37335.1| putative TPR domain protein [Burkholderia pseudomallei K96243]
gi|157808480|gb|EDO85650.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 406e]
gi|385358823|gb|EIF64806.1| TPR domain-containing protein [Burkholderia pseudomallei 1026a]
gi|385660704|gb|AFI68127.1| TPR domain-containing protein [Burkholderia pseudomallei 1026b]
Length = 776
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR +
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA ++ I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710
>gi|320530500|ref|ZP_08031558.1| conserved domain protein [Selenomonas artemidis F0399]
gi|320137333|gb|EFW29257.1| conserved domain protein [Selenomonas artemidis F0399]
Length = 565
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 74/151 (49%), Gaps = 7/151 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV----GEANIQATAQALGLDQ 109
IV+ + N KI+ LQ+W +L+ VP + L LLK G A + A G+
Sbjct: 363 IVFGSLNNFTKINDRVLQVWAEILRRVPTARL-LLKTEIFSVSDGAAEARRRIAAAGIPP 421
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ A +++ D+ LDT G T+ D L+ G PVVT GETL SR A
Sbjct: 422 ERV--ETEGASADYLAAYGRIDIALDTFPYPGGGTTCDALYMGVPVVTRAGETLGSRFGA 479
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S L +G LIA T +EY + A+ L D D
Sbjct: 480 SLLRNIGADALIAYTEEEYIERAVFLAQDFD 510
>gi|428769585|ref|YP_007161375.1| hypothetical protein Cyan10605_1212 [Cyanobacterium aponinum PCC
10605]
gi|428683864|gb|AFZ53331.1| hypothetical protein Cyan10605_1212 [Cyanobacterium aponinum PCC
10605]
Length = 746
Score = 73.6 bits (179), Expect = 4e-11, Method: Composition-based stats.
Identities = 50/158 (31%), Positives = 80/158 (50%), Gaps = 6/158 (3%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILW--LLKFPAVGEANIQA 100
+R+Q +P+DA++Y N K++P +QM + +++ V NS L + K V + + +
Sbjct: 543 TRKQLNIPDDAVIYLNMQNAAKLNPHIVQMQMEIIQNVKNSYLLFNVRKDKNVLKKYLYS 602
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLA--DVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A + RI F + + + R LA D+ LDT NG T +D LW P+VT
Sbjct: 603 LADKFQGVKDRIRF--IPSYPPAIYRANLAIADIILDTYPFNGAVTVLDSLWQNIPLVTR 660
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
G+ +R S L LG E IA T ++Y + I+ G
Sbjct: 661 VGQQYHARQGYSFLKNLGIEEGIAWTDRKYIEWGIKFG 698
>gi|398805152|ref|ZP_10564133.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Polaromonas sp. CF318]
gi|398092314|gb|EJL82729.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Polaromonas sp. CF318]
Length = 759
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 72/148 (48%), Gaps = 3/148 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQATAQALGLDQH 110
+ + FN L K++ + +++W +L VP S L+L A + + A A G++
Sbjct: 555 VTFGCFNNLTKMNDAVVKLWAQILNTVPASRLFLKARQLQQASAQREVMARFAAHGIEPG 614
Query: 111 RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS 170
R++ + +E ++ Q D+ LD G TT+++ LW G PV+TL GE SR
Sbjct: 615 RLILEDYVPRENYLAAYQRVDIALDPFPYPGGTTTVEALWMGVPVLTLAGERFLSRQGVG 674
Query: 171 QLATLGCPELIARTHKEYQDIAIRLGTD 198
+ G PE +A ++Y A+ D
Sbjct: 675 LMMNAGLPEWVASDPRDYLSRAVAHAGD 702
>gi|374292328|ref|YP_005039363.1| hypothetical protein AZOLI_1877 [Azospirillum lipoferum 4B]
gi|357424267|emb|CBS87134.1| conserved protein of unknown function; glycosyltransferase and TPR
domains [Azospirillum lipoferum 4B]
Length = 631
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 72/149 (48%), Gaps = 5/149 (3%)
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLDQHR 111
+ + N L K++ +W +L AVP S L LL+ P + + +++A AL G D R
Sbjct: 431 FGSLNALPKLNAQVAALWTRLLAAVPESRL-LLRTPGLDDPDLRARTLALFTAAGADPER 489
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
I A E + DV LD +G T+++ LW G PVVTL G+ +R A +
Sbjct: 490 IELRGAAPHAEFLATYGEIDVALDPFPYSGGLTTLEALWMGVPVVTLGGDRFCARHAVTH 549
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTDRD 200
L + G P L + Y +A L +D D
Sbjct: 550 LTSAGLPALAVEGEEAYLAMAAALVSDSD 578
>gi|254183905|ref|ZP_04890496.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 1655]
gi|184214437|gb|EDU11480.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 1655]
Length = 837
Score = 73.6 bits (179), Expect = 4e-11, Method: Composition-based stats.
Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 626 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 684
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR +
Sbjct: 685 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 744
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA ++ I L +D
Sbjct: 745 MSHVGLESFIADDIDDFVAKGIALASD 771
>gi|402302461|ref|ZP_10821573.1| glycosyltransferases group 1 [Selenomonas sp. FOBRC9]
gi|400380528|gb|EJP33345.1| glycosyltransferases group 1 [Selenomonas sp. FOBRC9]
Length = 565
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 74/151 (49%), Gaps = 7/151 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV----GEANIQATAQALGLDQ 109
IV+ + N KI+ LQ+W +L+ VP + L LLK G A + A G+
Sbjct: 363 IVFGSLNNFTKINDRVLQVWAEILRRVPTARL-LLKTEIFSVSDGAAEARRRIAAAGIPP 421
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ A +++ D+ LDT G T+ D L+ G PVVT GETL SR A
Sbjct: 422 ERV--ETEGASADYLAAYGRIDIALDTFPYPGGGTTCDALYMGVPVVTRAGETLGSRFGA 479
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S L +G LIA T +EY + A+ L D D
Sbjct: 480 SLLRNIGADALIAYTEEEYIERAVFLAQDFD 510
>gi|76812180|ref|YP_331511.1| hypothetical protein BURPS1710b_0094 [Burkholderia pseudomallei
1710b]
gi|76581633|gb|ABA51108.1| TPR domain protein [Burkholderia pseudomallei 1710b]
Length = 821
Score = 73.6 bits (179), Expect = 5e-11, Method: Composition-based stats.
Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 610 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 668
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR +
Sbjct: 669 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 728
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA ++ I L +D
Sbjct: 729 MSHVGLESFIADDIDDFVAKGIALASD 755
>gi|197103965|ref|YP_002129342.1| TPR domain-containing protein [Phenylobacterium zucineum HLK1]
gi|196477385|gb|ACG76913.1| TPR domain protein [Phenylobacterium zucineum HLK1]
Length = 761
Score = 73.6 bits (179), Expect = 5e-11, Method: Composition-based stats.
Identities = 48/148 (32%), Positives = 72/148 (48%), Gaps = 5/148 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQH 110
+ + N YK +P L+ W +L PNS ++ E+ ++ A A G+
Sbjct: 573 VTFGTANNPYKYNPDVLRTWGRILARTPNSRFLFIRPEGGSESFRRHMTAAFAAEGVAPE 632
Query: 111 RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS 170
RILF V + +H+ D+ LDT G TT+ + LW G PVV+L GE L R++ S
Sbjct: 633 RILFEPV--RGQHLPHYNRLDMSLDTFPQTGGTTTCESLWMGAPVVSLVGEALYERLSYS 690
Query: 171 QLATLGCPELIARTHKEYQDIAIRLGTD 198
L +G +L A T Y++ A L D
Sbjct: 691 VLNNIGLGDLCAETVAGYEETACALAAD 718
>gi|186477739|ref|YP_001859209.1| TPR repeat-containing protein [Burkholderia phymatum STM815]
gi|184194198|gb|ACC72163.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia phymatum
STM815]
Length = 789
Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 71/139 (51%), Gaps = 5/139 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ----ALGLDQ 109
+ + F +L KI P+ + W +L+AVP + L ++K +G A+ + +A A G+D
Sbjct: 596 VTFGYFGKLVKITPNVIAAWSRLLRAVPGAKL-MMKSHELGAAHARRSAAEQFAAQGIDA 654
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R++ + + E+++ D+ L G TT+ + LW GTPVV L G+ + +
Sbjct: 655 LRLILEGGSPRHEYLKAYNRVDIMLSPFPYPGGTTTAEALWMGTPVVALQGDRFVTHICE 714
Query: 170 SQLATLGCPELIARTHKEY 188
S L G E IAR EY
Sbjct: 715 SVLNAAGLGEWIARDEDEY 733
>gi|126452788|ref|YP_001068175.1| hypothetical protein BURPS1106A_3955 [Burkholderia pseudomallei
1106a]
gi|242316562|ref|ZP_04815578.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 1106b]
gi|403520597|ref|YP_006654731.1| hypothetical protein BPC006_I3996 [Burkholderia pseudomallei
BPC006]
gi|126226430|gb|ABN89970.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 1106a]
gi|242139801|gb|EES26203.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 1106b]
gi|403076239|gb|AFR17819.1| TPR repeat-containing protein [Burkholderia pseudomallei BPC006]
Length = 776
Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR +
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA ++ I L +D
Sbjct: 684 MSHVGLESFIAGDIDDFVAKGIALASD 710
>gi|428219241|ref|YP_007103706.1| hypothetical protein Pse7367_3029 [Pseudanabaena sp. PCC 7367]
gi|427991023|gb|AFY71278.1| Tetratricopeptide TPR_2 repeat-containing protein [Pseudanabaena
sp. PCC 7367]
Length = 1639
Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 84/165 (50%), Gaps = 10/165 (6%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA----VGEANIQ 99
RQ+ G+ + +VY K +P+++ +L VPNSI+ L+ A +A
Sbjct: 645 RQKLGITDQQVVYLYAAPAQKSNPASIAAQAQILAQVPNSII--LRKGAQDLDTAKALYA 702
Query: 100 ATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
L + RI+F +EEH Q+ADV LD+ NG T +++ LW PVVTL
Sbjct: 703 EYCDPLAVSSDRIIFLPRTNTEEEHRIIYQIADVGLDSYPYNGGTHNLESLWFNLPVVTL 762
Query: 159 PGETLASRVAASQLATLGCPEL---IARTHKEYQDIAIRLGTDRD 200
G+ +R+ S +AT+G + IA+ EY A+RLG DRD
Sbjct: 763 VGDQSFARMGYSFVATIGLSDKKAGIAQNWHEYIAEAVRLGGDRD 807
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 52/165 (31%), Positives = 77/165 (46%), Gaps = 10/165 (6%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL--------WLLKFP-AVG 94
R +G +AIV+ + YKI P W VL AVPNSIL W K+P
Sbjct: 1403 RASWGANPEAIVFISGANFYKILPELRHAWAKVLAAVPNSILVLYPFGGNWRRKYPIQTF 1462
Query: 95 EANIQATAQALGLDQHRIL-FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGT 153
+++ G+ R++ N+ + + Q AD+ LD+ G +++D L G
Sbjct: 1463 VSHLLEVLAWYGVKPERLVTLRNIPTRADVKVCLQQADIYLDSFPYGGANSTIDPLEVGL 1522
Query: 154 PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
P V + GE L ++ A+ L LG LI EY D A++L TD
Sbjct: 1523 PTVAMDGECLRNQQGAALLRQLGLDGLIVNNEAEYIDRAVQLATD 1567
>gi|326402277|ref|YP_004282358.1| hypothetical protein ACMV_01290 [Acidiphilium multivorum AIU301]
gi|325049138|dbj|BAJ79476.1| hypothetical protein ACMV_01290 [Acidiphilium multivorum AIU301]
Length = 733
Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 77/149 (51%), Gaps = 5/149 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
+ + FN L K+ TL +W +L+ V N+ L LL+ P E I + A ALG+D
Sbjct: 538 VTFGCFNNLTKLTDETLSLWGRILEQVDNAHL-LLRCPQFSEDGIPERFISRAHALGVDT 596
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ A E + + D+ LD +G T+ + ++ G PV+TL G+ A+R +
Sbjct: 597 SRVELRGRAPHPEFIEGYKDVDIALDPFPYSGGLTTCESMFMGVPVITLAGDFFAARHSV 656
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G + + + ++Y D A+ + +D
Sbjct: 657 SHLSNVGLTDCVTESPEQYIDRAVAMSSD 685
>gi|348670339|gb|EGZ10161.1| hypothetical protein PHYSODRAFT_523651 [Phytophthora sojae]
Length = 502
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 64/128 (50%), Gaps = 16/128 (12%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG---------------EAN 97
++ NFN++ K++ S W+ +L+ VP S L LL P N
Sbjct: 296 GFLFVNFNKIDKLEASVFSTWMGILRRVPRSSLLLLD-PGRHVLDGNMTQSVTSHEIRKN 354
Query: 98 IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ AQA G+ RI F + K EH++R + + LDT + H+T+ D L+ G PV+T
Sbjct: 355 LWREAQAQGISTRRIRFVSRTPKLEHLQRHRTGGLFLDTFIYGAHSTATDALYAGLPVLT 414
Query: 158 LPGETLAS 165
L G++ AS
Sbjct: 415 LAGDSFAS 422
>gi|148259123|ref|YP_001233250.1| hypothetical protein Acry_0103 [Acidiphilium cryptum JF-5]
gi|146400804|gb|ABQ29331.1| TPR repeat-containing protein [Acidiphilium cryptum JF-5]
Length = 733
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 77/149 (51%), Gaps = 5/149 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
+ + FN L K+ TL +W +L+ V N+ L LL+ P E I + A ALG+D
Sbjct: 538 VTFGCFNNLTKLTDETLSLWGRILEQVDNAHL-LLRCPQFSEDGIPERFISRAHALGVDT 596
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ A E + + D+ LD +G T+ + ++ G PV+TL G+ A+R +
Sbjct: 597 SRVELRGRAPHPEFIEGYKDVDIALDPFPYSGGLTTCESMFMGVPVITLAGDFFAARHSV 656
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G + + + ++Y D A+ + +D
Sbjct: 657 SHLSNVGLTDCVTESPEQYIDRAVAMSSD 685
>gi|425472063|ref|ZP_18850914.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
gi|389881954|emb|CCI37543.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
Length = 712
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 80/162 (49%), Gaps = 9/162 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
R+ +P DA +Y + K P T+++ + +L VPNS L+K VG+ IQ
Sbjct: 514 REDLEIPPDATIYFSVQNGMKRHPDTIKLQMKILAQVPNSYF-LIK--GVGKTEKIQELF 570
Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ G++ R+ F E H Q+ADV LDT NG TT+++VLW G P+VT
Sbjct: 571 TEIAIREGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 630
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
L GE +R + + + G I EY ++LG DR
Sbjct: 631 LVGEQFYARNSYTFMINAGIEAGIGWNEVEYIHWGVKLGLDR 672
>gi|113476143|ref|YP_722204.1| hypothetical protein Tery_2526 [Trichodesmium erythraeum IMS101]
gi|110167191|gb|ABG51731.1| Tetratricopeptide TPR_2 [Trichodesmium erythraeum IMS101]
Length = 1154
Score = 73.2 bits (178), Expect = 5e-11, Method: Composition-based stats.
Identities = 48/165 (29%), Positives = 82/165 (49%), Gaps = 13/165 (7%)
Query: 42 TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
++R G+ D +VY K +P ++ + +L+ P SIL G+ ++
Sbjct: 948 STRNSLGIDSDQMVYLCIAPGRKTNPKMVRAQLKILQHTPKSILI-----RKGQGDVDVI 1002
Query: 102 AQA-------LGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGT 153
Q+ LG+D +RI+F +EEH Q+ADV LD+ NG T +++ LW+
Sbjct: 1003 YQSYLEESENLGIDFNRIMFLGQTQTEEEHRAIYQIADVLLDSYPYNGGTHNLEALWSNL 1062
Query: 154 PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
P++T G+ SR+ S L + +A + +EY + I+LG D
Sbjct: 1063 PIITRAGKQYLSRMGYSFLQNVNLDVGVAWSWEEYTEWGIKLGQD 1107
>gi|338988961|ref|ZP_08633852.1| TPR repeat-containing protein [Acidiphilium sp. PM]
gi|338206109|gb|EGO94354.1| TPR repeat-containing protein [Acidiphilium sp. PM]
Length = 719
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 77/149 (51%), Gaps = 5/149 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
+ + FN L K+ TL +W +L+ V N+ L LL+ P E I + A ALG+D
Sbjct: 524 VTFGCFNNLTKLTDETLSLWGRILEQVDNAHL-LLRCPQFSEDGIPERFISRAHALGVDT 582
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ A E + + D+ LD +G T+ + ++ G PV+TL G+ A+R +
Sbjct: 583 SRVELRGRAPHPEFIEGYKDVDIALDPFPYSGGLTTCESMFMGVPVITLAGDFFAARHSV 642
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L+ +G + + + ++Y D A+ + +D
Sbjct: 643 SHLSNVGLTDCVTESPEQYIDRAVAMSSD 671
>gi|209527307|ref|ZP_03275816.1| Methyltransferase type 11 [Arthrospira maxima CS-328]
gi|209492233|gb|EDZ92579.1| Methyltransferase type 11 [Arthrospira maxima CS-328]
Length = 924
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 90/184 (48%), Gaps = 13/184 (7%)
Query: 30 ATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK 89
+ G E+PQ RQ G+ D +++ + +KI P ++ W ++ VPNS+L L+
Sbjct: 565 SYGPELPQVRTTVDRQSLGIGSDRVIFISGANFFKIIPDLIETWAKIIAGVPNSVLVLMP 624
Query: 90 F-----------PAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRR-GQLADVCLDTP 137
F + + N T + ++ +L N + V+ ++AD+ LD+
Sbjct: 625 FGPNWSNTYPKKSFIKQLNHHFTQHGIQTERLVVLDPNPTPNRDDVKEYFKIADIYLDSY 684
Query: 138 LCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIART-HKEYQDIAIRLG 196
+G T+ M+ L G P+V+ G + S + + L +L P+L+ T K Y D+A+ LG
Sbjct: 685 PFSGTTSLMEPLQVGLPIVSRRGSSFRSSMGVAILQSLNIPDLVTLTSEKSYIDLAVALG 744
Query: 197 TDRD 200
T+ +
Sbjct: 745 TNSE 748
>gi|418394627|ref|ZP_12968742.1| TPR domain-containing protein [Burkholderia pseudomallei 354a]
gi|418554742|ref|ZP_13119513.1| TPR domain-containing protein [Burkholderia pseudomallei 354e]
gi|385369925|gb|EIF75216.1| TPR domain-containing protein [Burkholderia pseudomallei 354e]
gi|385374781|gb|EIF79601.1| TPR domain-containing protein [Burkholderia pseudomallei 354a]
Length = 776
Score = 73.2 bits (178), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARAR 623
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR +
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA ++ I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710
>gi|237814265|ref|YP_002898716.1| TPR domain protein [Burkholderia pseudomallei MSHR346]
gi|237505900|gb|ACQ98218.1| TPR domain protein [Burkholderia pseudomallei MSHR346]
Length = 776
Score = 72.8 bits (177), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 74/147 (50%), Gaps = 3/147 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
+ + +FN+L K+ + +W +++AVP S + L P G A I+ Q G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ F + ++++ DVCLDT G TT ++ LW G P +T+ GETL SR +
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683
Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
++ +G IA + I L +D
Sbjct: 684 MSHVGLESFIADDIDNFVAKGIALASD 710
>gi|427712404|ref|YP_007061028.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Synechococcus sp. PCC 6312]
gi|427376533|gb|AFY60485.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Synechococcus sp. PCC 6312]
Length = 738
Score = 72.8 bits (177), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 79/161 (49%), Gaps = 9/161 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI----- 98
R +P DA++Y K P TL+ +L VPNS L+K GE I
Sbjct: 539 RADLDIPIDAVIYFTAQNGLKQHPETLKQQCQILAQVPNSYF-LIKL--RGEKGILRELF 595
Query: 99 QATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
++ AQ G+ R+ F + + E H QLAD+ LD+ NG TT+++ LW P+VT
Sbjct: 596 ESIAQEEGVSLSRLRFVDFDSDELTHRANLQLADIVLDSYPYNGATTTLETLWLEIPLVT 655
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G+ A+R + + + G E IA + EY I+LG D
Sbjct: 656 QVGQQFAARNSYTFMVNAGITEGIAWSAPEYVAWGIKLGQD 696
>gi|117925099|ref|YP_865716.1| hypothetical protein Mmc1_1802 [Magnetococcus marinus MC-1]
gi|117608855|gb|ABK44310.1| TPR repeat-containing protein [Magnetococcus marinus MC-1]
Length = 822
Score = 72.8 bits (177), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 72/151 (47%), Gaps = 5/151 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLDQ 109
I Y FN L K+ P TL++W +LK +PNS L LK +V +A + ALG+
Sbjct: 632 ITYGCFNNLAKLSPRTLRLWGEILKRLPNSRL-ALKSASVCDAETREALVKQLAALGVSA 690
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R++F + ++ Q D+ LD NG T++ LW G P+VT G RV
Sbjct: 691 QRLIFIQPSDYGRYLEDYQFVDIALDPLPFNGGITTLQGLWQGVPIVTRWGLGQRDRVGG 750
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S L L IA + + Y + A+ D D
Sbjct: 751 SILTDLDLVSWIADSDQAYIECAVAKAADLD 781
>gi|422303122|ref|ZP_16390476.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|389791933|emb|CCI12279.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
Length = 719
Score = 72.8 bits (177), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 83/162 (51%), Gaps = 8/162 (4%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
R+ G+ DA++Y + K +P+ +++ + ++K+VPNS + V + N
Sbjct: 519 REDLGINNDAVIYLSSQTAIKRNPAMIRLQMQIIKSVPNSYFLI---QGVADDNSLLDLF 575
Query: 99 QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A +G++ +RI + E + +ADV LDT NG TT+++ LW G P+V
Sbjct: 576 CQIAAEVGVETNRIKMLPLYQTETYRANLAIADVVLDTYPFNGGTTTLETLWMGIPLVVK 635
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
G+ +SR + + G E I+ + +EY I+LG D++
Sbjct: 636 VGQQWSSRNGYTLMMNAGITEGISWSDEEYVQWGIKLGLDKN 677
>gi|83311709|ref|YP_421973.1| O-linked N-acetylglucosamine transferase [Magnetospirillum
magneticum AMB-1]
gi|82946550|dbj|BAE51414.1| Predicted O-linked N-acetylglucosamine transferase
[Magnetospirillum magneticum AMB-1]
Length = 639
Score = 72.8 bits (177), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 77/156 (49%), Gaps = 8/156 (5%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG----EANIQATAQA 104
L ++ + +FN + K++ + W +L VPNS LWL K+P + + ++ A
Sbjct: 438 LERGSVTFGSFNNIAKLNEQVIAAWARILNRVPNSRLWL-KWPHLALDEVASPLRQAFAA 496
Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDT-PLCNGHTTSMDVLWTGTPVVTLPGETL 163
G+D R+ + + + D+ LD P C G TS + LW G PVVTLPG
Sbjct: 497 HGVDPGRLELRGNSPPAQLLAEYAEIDIALDPFPYC-GAFTSCEALWMGVPVVTLPGPRP 555
Query: 164 ASRVAASQLATLGCPELIARTH-KEYQDIAIRLGTD 198
SR + LA +G + +AR Y+D+A+ L D
Sbjct: 556 FSRQTLALLAAMGLEDELARPDLAAYEDLAVALAAD 591
>gi|452821043|gb|EME28078.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
SPINDLY-lie protein [Galdieria sulphuraria]
Length = 1154
Score = 72.4 bits (176), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 77/153 (50%), Gaps = 6/153 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQATAQALGLDQH 110
I + +FN L K +T+ +W +L VP S L L P + + +A+G+
Sbjct: 958 ITFGSFNVLAKTQANTIALWSKILHLVPKSRLLLKAKPFASSFARRRFEYVFEAVGITPD 1017
Query: 111 RILFSNVAAK-EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLASRV 167
R+ + + H+ L D+CLD G TT+ + L+ G PVV+L G+ A V
Sbjct: 1018 RLDLLPLLPETRNHLETYSLVDICLDPFPYAGTTTTCEALYMGVPVVSLSAAGQNHAHSV 1077
Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+ L ++G EL+A + +EY DIA+ L +D D
Sbjct: 1078 GETLLKSIGHSELVAHSEEEYVDIAVSLASDLD 1110
>gi|423065081|ref|ZP_17053871.1| glycosyl transferase family 2 [Arthrospira platensis C1]
gi|406714324|gb|EKD09492.1| glycosyl transferase family 2 [Arthrospira platensis C1]
Length = 2280
Score = 72.4 bits (176), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 52/172 (30%), Positives = 79/172 (45%), Gaps = 11/172 (6%)
Query: 36 PQ-SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
PQ + V R Q+G + IV+ + YKI P + WV +L VPNSIL L F
Sbjct: 1746 PQPATVKPHRSQWGANDQTIVFMSGANFYKIIPEQGETWVKILAQVPNSILVLYPFAPSW 1805
Query: 95 EAN---------IQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLCNGHTT 144
N + AT G+ + R++ V+ + D+ LD+ +G +
Sbjct: 1806 SNNYASSQFIRRMHATLSKYGVAEKRLVLIKTLPSRTDVKACLEQGDIYLDSYPHSGSNS 1865
Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
+D L G PVV G+TL S+ A+ + +LG P+LI Y +A+ L
Sbjct: 1866 LVDPLEVGLPVVVRDGDTLRSKHGAAMMRSLGIPDLITDDESSYISLAVTLA 1917
>gi|378948679|ref|YP_005206167.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Pseudomonas fluorescens F113]
gi|359758693|gb|AEV60772.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Pseudomonas fluorescens F113]
Length = 1142
Score = 72.4 bits (176), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 71/149 (47%), Gaps = 5/149 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLDQ 109
I Y FN K++ L W +++A +S L LLK A G + ++ QA G+ +
Sbjct: 432 ITYGCFNNPTKVNDVLLAQWAELMRATADSRL-LLKGGAFGNSELREHVQAFMAAQGIAR 490
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+L + + D+ LD +G T+ + L G PVVTLPG T A R +A
Sbjct: 491 ERVLVEGPVGHKHLLETYNRIDIALDPWPYSGGLTTCEALLMGVPVVTLPGPTFAGRHSA 550
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ L G PEL+ + EY+ I L D
Sbjct: 551 THLVNAGLPELVVDSWDEYRQRVIGLAGD 579
>gi|209515740|ref|ZP_03264603.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia sp. H160]
gi|209503767|gb|EEA03760.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia sp. H160]
Length = 1004
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 74/146 (50%), Gaps = 1/146 (0%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE-ANIQATAQALGLDQHRI 112
+ + +FN++ K+ + +W ++ AVPNS + + P G + + G+ + R+
Sbjct: 779 VTFGSFNRMDKLRRDVVALWSRLMHAVPNSRMVIGAMPRDGSLGKLPDWFEEEGIARDRL 838
Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
F A+ ++++ D CLD+ +G TT++ LW G P +TLPGET+ R + +
Sbjct: 839 DFMPRASVPVYLQQHHRVDFCLDSFPFSGLTTALHSLWMGVPTLTLPGETVPGRSGLTAM 898
Query: 173 ATLGCPELIARTHKEYQDIAIRLGTD 198
+G +AR ++ + L +D
Sbjct: 899 THVGLANFVARDKDDFVRKGVALASD 924
>gi|113478271|ref|YP_724332.1| hypothetical protein Tery_4945 [Trichodesmium erythraeum IMS101]
gi|110169319|gb|ABG53859.1| TPR repeat [Trichodesmium erythraeum IMS101]
Length = 750
Score = 72.4 bits (176), Expect = 1e-10, Method: Composition-based stats.
Identities = 55/174 (31%), Positives = 87/174 (50%), Gaps = 9/174 (5%)
Query: 31 TGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF 90
+G E ++ T R+ + D IVY K + + V +LK VPNSIL
Sbjct: 512 SGFERIETSKTTLRKSQRIGLDQIVYLCLASGRKFNRDLVNGQVAILKQVPNSIL---VH 568
Query: 91 PAVGEANIQATA-----QALGLDQHRILFSNV-AAKEEHVRRGQLADVCLDTPLCNGHTT 144
A+G+ + +A +A G+ HR+ F + + +EEH LAD+ LD+ NG T
Sbjct: 569 KALGDIEVFQSAYHEACKAEGVSIHRVKFMPIFSTEEEHRSIYSLADILLDSYPYNGGTH 628
Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+++ LW PVVT GE SR+ S L +L ++ + +EY + ++LG D
Sbjct: 629 TLEALWFNLPVVTYAGEQFLSRMGYSFLESLDIKLGVSWSWEEYINWGVKLGQD 682
>gi|376006070|ref|ZP_09783407.1| putative glycosyl transferase [Arthrospira sp. PCC 8005]
gi|375325526|emb|CCE19160.1| putative glycosyl transferase [Arthrospira sp. PCC 8005]
Length = 2557
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/172 (30%), Positives = 79/172 (45%), Gaps = 11/172 (6%)
Query: 36 PQ-SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
PQ + V R Q+G + IV+ + YKI P + WV +L VPNSIL L F
Sbjct: 2023 PQPATVKPHRSQWGANDQTIVFMSGANFYKIIPEQGETWVKILAQVPNSILVLYPFAPSW 2082
Query: 95 EAN---------IQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLCNGHTT 144
N + AT G+ + R++ V+ + D+ LD+ +G +
Sbjct: 2083 SNNYASSQFIRRMHATLSKYGVAEKRLVLIKTLPSRTDVKACLEQGDIYLDSYPHSGSNS 2142
Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
+D L G PVV G+TL S+ A+ + +LG P+LI Y +A+ L
Sbjct: 2143 LVDPLEVGLPVVVRDGDTLRSKHGAAMMRSLGIPDLITDDESSYISLAVTLA 2194
>gi|313897099|ref|ZP_07830646.1| tetratricopeptide repeat family protein [Selenomonas sp. oral taxon
137 str. F0430]
gi|312974546|gb|EFR40014.1| tetratricopeptide repeat family protein [Selenomonas sp. oral taxon
137 str. F0430]
Length = 565
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 75/149 (50%), Gaps = 7/149 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK---FPAV-GEANIQATAQALGLDQ 109
+V+ + N KI+ L++W +L+ VP + L LLK F G A + A GL
Sbjct: 363 VVFGSLNNFTKINDRVLKVWAEILRRVPTARL-LLKTEIFSVTDGAAEARRRIAAAGLPL 421
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ A +++ D+ LDT G T+ D L+ G PVVTL GETL SR A
Sbjct: 422 ARV--ETEGASADYLAAYGRIDIALDTFPYPGGGTTCDALYMGVPVVTLAGETLGSRFGA 479
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L +G LIA T +EY + A+ L D
Sbjct: 480 SLLRNIGADALIAHTAEEYIERAVFLAQD 508
>gi|16124374|ref|NP_418938.1| hypothetical protein CC_0119 [Caulobacter crescentus CB15]
gi|221233057|ref|YP_002515493.1| porphyrin biosynthesis protein [Caulobacter crescentus NA1000]
gi|13421226|gb|AAK22106.1| TPR domain protein [Caulobacter crescentus CB15]
gi|220962229|gb|ACL93585.1| porphyrin biosynthesis protein [Caulobacter crescentus NA1000]
Length = 747
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 71/150 (47%), Gaps = 3/150 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE---ANIQATAQALGLDQH 110
+ + L K++ + +W +L+AVP S L L P V A+IQA G+
Sbjct: 547 MTFGCLGNLSKMNGEVVALWSQILRAVPESKLLLKAKPFVDAQVVADIQARFAREGVSSD 606
Query: 111 RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS 170
R++ +++ E+ D+ LDT G TTS+D LW G PV+TL G+ +R+ S
Sbjct: 607 RLILEGPSSRAEYFETYNRIDLVLDTFPYPGGTTSVDALWMGVPVLTLAGDRFLARLGES 666
Query: 171 QLATLGCPELIARTHKEYQDIAIRLGTDRD 200
G +AR +Y A+ + RD
Sbjct: 667 IARNAGQSAWVARDRADYLAKAVAFASVRD 696
>gi|209525307|ref|ZP_03273849.1| glycosyl transferase family 2 [Arthrospira maxima CS-328]
gi|209494322|gb|EDZ94635.1| glycosyl transferase family 2 [Arthrospira maxima CS-328]
Length = 2681
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/172 (30%), Positives = 82/172 (47%), Gaps = 11/172 (6%)
Query: 36 PQ-SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
PQ + V R Q+G + IV+ + YKI P + WV +L VPNSIL L F
Sbjct: 2147 PQPATVKPHRSQWGANDQTIVFMSGANFYKIIPEQGETWVKILAQVPNSILVLYPFAPSW 2206
Query: 95 EAN---------IQATAQALGLDQHR-ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTT 144
N + AT G+ + R +L ++ ++ + + D+ LD+ +G +
Sbjct: 2207 SNNYASSQFIRRMHATLSKYGVAEKRLVLIKSLPSRTDVKACLEQGDIYLDSYPHSGSNS 2266
Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
+D L G PVV G+TL S+ A+ + +LG P+LI Y +A+ L
Sbjct: 2267 LVDPLEVGLPVVVRDGDTLRSKHGAAMMRSLGIPDLITDDESSYISLAVTLA 2318
>gi|422343491|ref|ZP_16424419.1| hypothetical protein HMPREF9432_00479 [Selenomonas noxia F0398]
gi|355378798|gb|EHG25978.1| hypothetical protein HMPREF9432_00479 [Selenomonas noxia F0398]
Length = 497
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 75/154 (48%), Gaps = 4/154 (2%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA--NIQATAQALG 106
+P D+ V+ +FN KI LQ+W +L+AVP + L L E +Q A+ G
Sbjct: 302 VPHDSPVFASFNNFAKITDEMLQIWGGILRAVPGARLLLKNVHPQRETLHRMQRRAEQAG 361
Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
+D R+ ++++R AD+ LDT G T+ + L G PVVTL G +R
Sbjct: 362 IDPVRLELR--PGSKDYLRDYLDADIILDTYPYQGGGTTCEALCMGLPVVTLAGTRHGAR 419
Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S L G EL A + + Y + A+ L DR+
Sbjct: 420 FGVSLLQNAGLGELAADSVQAYAERAVLLAQDRE 453
>gi|428221793|ref|YP_007105963.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Synechococcus sp. PCC 7502]
gi|427995133|gb|AFY73828.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Synechococcus sp. PCC 7502]
Length = 773
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/175 (28%), Positives = 87/175 (49%), Gaps = 13/175 (7%)
Query: 34 EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV 93
E PQ ++ R Q+ + EDAIVY LYK P ++ + K VP + + +
Sbjct: 561 ESPQYLL--ERSQFQISEDAIVYLCCQSLYKYLPKYDYIFAAIAKRVPKAKFAFI-YSNN 617
Query: 94 GEANIQATAQALGL---------DQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTT 144
G+ + Q L + ++H ++ + ++E++ ++AD+ LDT +G T
Sbjct: 618 GDYVTRQFKQRLDIAFARQNQNSEEHCLIMPRLN-RDEYLSLNRIADIFLDTIDWSGGNT 676
Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
+++ + PVVTLP + + R + + L L E IA + EY DIA++LG D
Sbjct: 677 TLEAISCNLPVVTLPTQFMRGRHSYAMLQMLEVTETIASSESEYIDIAVKLGLDH 731
>gi|372273166|ref|ZP_09509214.1| sulfotransferase [Marinobacterium stanieri S30]
Length = 1017
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 73/154 (47%), Gaps = 3/154 (1%)
Query: 48 GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ---A 104
L + I + FN KI+ L W +++ PNS L L F ++ Q A
Sbjct: 424 ALNNEYITFGCFNNPTKINKVILSCWADIMHLTPNSRLLLKSFQFNSPELVKRITQEMEA 483
Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
+G+D+ R++ + +E +R D+ LDT +G T+ + + G PVVT G T A
Sbjct: 484 MGIDKDRLILEGPSPHQELLRSYNRVDIALDTWPYSGGLTTCEAMLMGVPVVTCTGPTFA 543
Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
R +AS + G EL+ + +Y+ + L +D
Sbjct: 544 GRHSASHIKNAGMYELVTDSFSQYKTRVLELTSD 577
>gi|430759455|ref|YP_007215312.1| TPR domain protein, putative component of TonB system
[Thioalkalivibrio nitratireducens DSM 14787]
gi|430009079|gb|AGA31831.1| TPR domain protein, putative component of TonB system
[Thioalkalivibrio nitratireducens DSM 14787]
Length = 739
Score = 72.0 bits (175), Expect = 1e-10, Method: Composition-based stats.
Identities = 50/152 (32%), Positives = 79/152 (51%), Gaps = 8/152 (5%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN----IQATAQALGLDQ 109
+ + FN KI + +W +L AVP+S L L+K + + + + +A GL
Sbjct: 358 VTFGTFNNYAKISREVIAVWARILAAVPDSRL-LIKSTGLDSSETRTRLMSLFEAAGLSN 416
Query: 110 --HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
RI L + V + EEH+R D+ LDT +G TT+ LW G PV+TL G + R
Sbjct: 417 PGARIDLMNRVRSTEEHLRIYDRVDLALDTFPYSGTTTTCQALWMGVPVLTLYGRSHVGR 476
Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
V+AS L + +L+A + ++Y A +LG +
Sbjct: 477 VSASVLRQMALDDLVAGSEEDYIARAGQLGRE 508
>gi|22298553|ref|NP_681800.1| hypothetical protein tll1010 [Thermosynechococcus elongatus BP-1]
gi|22294733|dbj|BAC08562.1| tll1010 [Thermosynechococcus elongatus BP-1]
Length = 1126
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 77/158 (48%), Gaps = 4/158 (2%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
R Q GL +A+V+ + KI P L++ +++AVPN++L ++K+ + E ++
Sbjct: 914 RDQLGLSAEAVVFLCAQRTSKIQPEILKLQFAIVEAVPNAVL-VVKYHRGSPIFEEWCRS 972
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
A+ G D ++ F E H DV LDT G S++ LW P+VT G
Sbjct: 973 VAEVEGFDLDKLYFLTPNLPEIHRANLYDVDVVLDTYPYAGGAMSLEALWLEVPIVTKVG 1032
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ +R + L +G E IA +EY IR GT+
Sbjct: 1033 QQFVARHTYTFLKNVGVEEGIAFNDEEYVHWGIRFGTE 1070
>gi|75906573|ref|YP_320869.1| hypothetical protein Ava_0348 [Anabaena variabilis ATCC 29413]
gi|75700298|gb|ABA19974.1| TPR repeat protein [Anabaena variabilis ATCC 29413]
Length = 563
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 82/161 (50%), Gaps = 2/161 (1%)
Query: 39 IVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI 98
+V+ +R +GL +D I+Y +K P ++ + +P + L+ + E +
Sbjct: 364 LVLKTRADFGLSDDDILYLCCQAPFKYLPQYDFIFAEIASRIPQAKFIFLRGTLLQE-RL 422
Query: 99 QATAQALGLD-QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
Q LGL + +F N+ + +++ L+D+ LDT +G T+++ + P+VT
Sbjct: 423 QKAFGNLGLKFEDYCVFLNIPERLDYLMINLLSDIYLDTFTWSGGNTTLEAIACNLPIVT 482
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
PGE + R + S L LG + IA+ EY DIA++LG +
Sbjct: 483 CPGEFMRGRHSDSFLKMLGVTDTIAQNEGEYIDIAVKLGQN 523
>gi|440791074|gb|ELR12328.1| SPY protein [Acanthamoeba castellanii str. Neff]
Length = 378
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 83/176 (47%), Gaps = 18/176 (10%)
Query: 35 VPQSIVITSRQQYGLP--EDA-IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
+P + + R+ GLP ED ++ NF + YK+D W+ +L N++L LL
Sbjct: 182 IPFELQLPDRKALGLPPKEDGTFIFFNFAEPYKVDRQVFATWMQILNVTQNTVL-LLPSG 240
Query: 92 AVGEAN--IQATAQALGLDQHRILFSNV---------AAKEEHVRRGQLADVCLDT-PLC 139
A E N ++ A G+D R++F+ + H+ D+ LDT P
Sbjct: 241 ATNERNQNMKKYMSARGIDPKRLIFTTINGVPWQLTSMTDTRHIL--SFVDLYLDTFPFG 298
Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
+ ++ L+ PV+ L G + + V S L TLG PELIA++ +Y A RL
Sbjct: 299 TDVLSLVETLYHEVPVLALQGRSFSEMVGTSFLKTLGLPELIAQSKTKYVATAARL 354
>gi|428316233|ref|YP_007114115.1| hypothetical protein Osc7112_1150 [Oscillatoria nigro-viridis PCC
7112]
gi|428239913|gb|AFZ05699.1| hypothetical protein Osc7112_1150 [Oscillatoria nigro-viridis PCC
7112]
Length = 899
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 82/167 (49%), Gaps = 14/167 (8%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF-----------PA 92
R+ G+ + A+V+ + +KI P + W ++ AVPNS+L L F P
Sbjct: 691 RKSLGISDTAVVFISGANTFKIIPELRETWAKIIAAVPNSVLVLYPFGNTWSGDYVKQPF 750
Query: 93 VGEANIQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLCNGHTTSMDVLWT 151
+ + + A +D+ R++ N A E V+ QLADV LD+ G +++D L
Sbjct: 751 INK--MSAIFDKYSIDRTRLILLNTLANREDVKALLQLADVYLDSYPYAGANSTVDPLEV 808
Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G P V G L SR A+ L + +LIA + + Y ++++ LG +
Sbjct: 809 GLPTVVREGNNLRSRQGAAILRDIQLFDLIADSEESYINLSVALGIN 855
>gi|219113085|ref|XP_002186126.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
[Phaeodactylum tricornutum CCAP 1055/1]
gi|209582976|gb|ACI65596.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
[Phaeodactylum tricornutum CCAP 1055/1]
Length = 812
Score = 71.6 bits (174), Expect = 2e-10, Method: Composition-based stats.
Identities = 60/190 (31%), Positives = 88/190 (46%), Gaps = 25/190 (13%)
Query: 32 GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
G PQ + G + A VYCNFN+ K +P T++ W+ +L+ VP+SIL LL P
Sbjct: 534 GTGSPQENRCLAPPDVGPTDVAFVYCNFNKFLKNNPETVRGWIQILRQVPDSILCLLDNP 593
Query: 92 AVG---------------EANIQATAQALGLDQ--HRILFSNVAAKE-EHVRRGQLADVC 133
G + N + Q D +R+ F +H +R + D C
Sbjct: 594 RDGIPYLHKFIHEAAGTSDGNSPDSFQPGDGDDLVNRVHFLPWEPNPFDHQQRNR--DFC 651
Query: 134 ---LDTPLCNGHTTSMDVLWTGTPVVTLP-GETLASRVAASQLATLGCPELIA-RTHKEY 188
LD+ NGHT + D L+ G P+VT G+ +++RV S LG L A +Y
Sbjct: 652 NAMLDSHPYNGHTVAQDALYAGVPIVTRSDGDDMSARVTTSANLVLGLSHLNAVHGPAQY 711
Query: 189 QDIAIRLGTD 198
IA+ LGT+
Sbjct: 712 VAIAVALGTN 721
>gi|91785677|ref|YP_560883.1| TPR domain-containing protein [Burkholderia xenovorans LB400]
gi|91689631|gb|ABE32831.1| putative TPR domain protein [Burkholderia xenovorans LB400]
Length = 790
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 77/152 (50%), Gaps = 13/152 (8%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL-------G 106
I + +FN+L K+ + +W +L+A P + + L G Q L G
Sbjct: 571 ITFGSFNRLNKLRRDVIALWAELLRAQPTARMLL------GAIATDEDEQLLIDWFAGEG 624
Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
+++ R++F ++ ++++ D+CLDT G TT ++ LW G P +T+ G+TLASR
Sbjct: 625 IERGRLMFRRRSSIPVYLQQHFHVDICLDTFPYTGSTTVLNSLWMGVPTLTIAGDTLASR 684
Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ ++ +G E +A +E+ + L +D
Sbjct: 685 AGTTWMSHVGLEEFVAANKEEFVARGVALASD 716
>gi|407715135|ref|YP_006835700.1| hypothetical protein BUPH_03787 [Burkholderia phenoliruptrix
BR3459a]
gi|407237319|gb|AFT87518.1| TPR domain-containing protein [Burkholderia phenoliruptrix BR3459a]
Length = 787
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 76/146 (52%), Gaps = 1/146 (0%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA-VGEANIQATAQALGLDQHRI 112
I + +FN+L K+ + +W +L+A+P++ + L E + A G+D+ R+
Sbjct: 568 ITFGSFNRLNKLRRDVIALWAELLRALPDAKMLLGAIATDQDEQMLIDWFTAEGIDRERL 627
Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
F ++ ++++ D+CLDT G TT ++ LW G P +T+ G TLASR + +
Sbjct: 628 TFRRRSSIPVYLQQHFHVDICLDTFPYTGSTTVLNSLWMGVPTLTIAGSTLASRAGTTWM 687
Query: 173 ATLGCPELIARTHKEYQDIAIRLGTD 198
+ +G + + +E+ + L +D
Sbjct: 688 SHVGLEQFVVANKEEFVARGVALASD 713
>gi|288962056|ref|YP_003452366.1| tetratricopeptide TPR_2 repeat protein [Azospirillum sp. B510]
gi|288914336|dbj|BAI75822.1| tetratricopeptide TPR_2 repeat protein [Azospirillum sp. B510]
Length = 721
Score = 71.6 bits (174), Expect = 2e-10, Method: Composition-based stats.
Identities = 45/144 (31%), Positives = 73/144 (50%), Gaps = 5/144 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA----VGEANIQATAQALGLDQ 109
+ + +FN L K++ L W +L +P++ L L+K A V A + A G+ +
Sbjct: 533 VTFGSFNILTKLNDDVLAAWAAILNRMPDARL-LMKTKALSCPVTAALWRCRLTAAGIAE 591
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ +H+R DV LD +G TT+++ LW G PV+TLPGET +SR +
Sbjct: 592 DRVTMVGATNSLDHMRWCASVDVALDPFPFSGSTTTLETLWMGVPVITLPGETFSSRHSL 651
Query: 170 SQLATLGCPELIARTHKEYQDIAI 193
+ L G + +A +Y D A+
Sbjct: 652 AFLTVAGVADCVATDPADYVDRAV 675
>gi|119487585|ref|ZP_01621195.1| TPR repeat protein [Lyngbya sp. PCC 8106]
gi|119455754|gb|EAW36890.1| TPR repeat protein [Lyngbya sp. PCC 8106]
Length = 1107
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 84/163 (51%), Gaps = 13/163 (7%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN------ 97
R+Q + +D +VY K + ++ V +LKAVP+S++ G+ +
Sbjct: 905 RKQLKIDQDKLVYLCVAPGRKTNFEMIEAQVKILKAVPDSLM-----IRKGQGDHLLIRE 959
Query: 98 -IQATAQALGLDQHRILFSNVAA-KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
+ + G+D +RILF + +EEH Q+ DV LD+ NG T +++ LW+ P+
Sbjct: 960 MYHQSCEEQGVDFNRILFIGLTKTEEEHRAIYQVVDVLLDSYPYNGGTHNLEALWSNLPI 1019
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+T GE SR+ + L + +A + +EY ++ I+LGTD
Sbjct: 1020 LTRAGEQYLSRMGYAFLQAVNLDVGVAWSWEEYTELGIKLGTD 1062
>gi|94314154|ref|YP_587363.1| hypothetical protein Rmet_5235 [Cupriavidus metallidurans CH34]
gi|93358006|gb|ABF12094.1| conserved hypothetical protein; TPR domain protein [Cupriavidus
metallidurans CH34]
Length = 797
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 74/150 (49%), Gaps = 5/150 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL---GLDQH 110
I Y +FN+L K+ + +W +L A P++ + + G+A+ Q G+D
Sbjct: 579 ITYGSFNRLNKVSRQVVALWARILHADPSARMIIGAIE--GDADQQTCLTRFADEGIDAG 636
Query: 111 RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS 170
R+ F ++++ D+CLDT G TT+++ LW G P VT+ G + SR +AS
Sbjct: 637 RLSFRTRTETPVYLQQHHHVDICLDTFPYTGATTTINALWMGVPTVTIAGNSPLSRGSAS 696
Query: 171 QLATLGCPELIARTHKEYQDIAIRLGTDRD 200
L LG + IA ++ A+ L D D
Sbjct: 697 WLGQLGLHQYIANDADDFVQRALALSKDLD 726
>gi|377819539|ref|YP_004975910.1| hypothetical protein BYI23_A000950 [Burkholderia sp. YI23]
gi|357934374|gb|AET87933.1| TPR repeat-containing protein [Burkholderia sp. YI23]
Length = 767
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 77/149 (51%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG----EANIQATAQALGLDQ 109
+ + +FN++ K+ P + +W +L A+P+S + + G EA+ G+ +
Sbjct: 561 VTFGSFNRINKVRPEVIALWAKILHAMPSSKMLIGAIRREGDRSDEAHYADQFAKQGISR 620
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R++F +A ++++ D+CLD + TT+++ LW G P +T+ G +L SR A
Sbjct: 621 DRLMFVPRSATAVYLQQHHQVDLCLDAFPFSASTTTLNALWMGVPTLTMAGNSLVSRGAT 680
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ L+ +G + + ++ A+ + +D
Sbjct: 681 TWLSHVGLESFVTKDKDDFVQSALSIVSD 709
>gi|302878311|ref|YP_003846875.1| hypothetical protein Galf_1081 [Gallionella capsiferriformans ES-2]
gi|302581100|gb|ADL55111.1| Tetratricopeptide TPR_1 repeat-containing protein [Gallionella
capsiferriformans ES-2]
Length = 961
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 73/154 (47%), Gaps = 3/154 (1%)
Query: 48 GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQATAQA 104
L + I + F++L K+ T Q W +V+ +VP+S + + AV A + Q
Sbjct: 759 ALTKKTITFGTFSRLEKVTEQTWQTWADVVLSVPDSCMLIKNAEMDHAVARARVAGYFQR 818
Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
G+ R++F A +H+ D+CLDT G T+++ L G PV+TL T
Sbjct: 819 AGVALDRLIFHGRTAWNDHMAVFNQVDICLDTFPQGGGVTTLEGLMMGVPVITLHSPTFV 878
Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
R S L LG + +A T ++Y IA + D
Sbjct: 879 GRTGVSILTALGLVDWVAETPEQYVKIAKQKAQD 912
>gi|357059078|ref|ZP_09119923.1| hypothetical protein HMPREF9334_01640 [Selenomonas infelix ATCC
43532]
gi|355372986|gb|EHG20324.1| hypothetical protein HMPREF9334_01640 [Selenomonas infelix ATCC
43532]
Length = 906
Score = 71.2 bits (173), Expect = 2e-10, Method: Composition-based stats.
Identities = 50/151 (33%), Positives = 76/151 (50%), Gaps = 7/151 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA----LGLDQ 109
I + +FNQ K+ L W +++ VPN+ L LLK A G + A Q LG D
Sbjct: 713 IQFASFNQHAKLRDDMLLAWREIMERVPNAWL-LLKNKAYGRRGVVAALQERLRRLGFDM 771
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
+R++F A ++++R D+ LDT G TS D L+ G PVV+ E ++R
Sbjct: 772 NRVMFE--VATRDYMQRYLDVDIALDTFPWPGGGTSCDALYMGVPVVSYYTERHSTRFTY 829
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S LA +G +L + ++Y A+ L D D
Sbjct: 830 SLLANMGLADLASTKLEDYIATAVALAGDID 860
>gi|425444417|ref|ZP_18824468.1| TPR repeat protein [Microcystis aeruginosa PCC 9443]
gi|389735843|emb|CCI00716.1| TPR repeat protein [Microcystis aeruginosa PCC 9443]
Length = 873
Score = 71.2 bits (173), Expect = 2e-10, Method: Composition-based stats.
Identities = 51/163 (31%), Positives = 79/163 (48%), Gaps = 8/163 (4%)
Query: 43 SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS--ILWLLKFPAVGEANIQA 100
+R + GL ED I+Y N L+K P ++ + + VPNS I + V
Sbjct: 672 TRLEMGLTEDKIIYLNCQSLFKYLPENDDIFPRIAQQVPNSQFIFICHRSEFVTHCFQSR 731
Query: 101 TAQA-----LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
+QA L +R++ + ++++ R LAD+ LD +G T+++ + PV
Sbjct: 732 LSQAFNKYGLNWQDYRVMMPQLE-QDDYFRLNLLADIYLDNLSWSGGNTTLEAIACQLPV 790
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
VT PGE + R + + L LG E IA Y +IAIRLG D
Sbjct: 791 VTCPGEFMRGRHSYAILKKLGITETIATDKNHYIEIAIRLGLD 833
>gi|357059077|ref|ZP_09119922.1| hypothetical protein HMPREF9334_01639 [Selenomonas infelix ATCC
43532]
gi|355372985|gb|EHG20323.1| hypothetical protein HMPREF9334_01639 [Selenomonas infelix ATCC
43532]
Length = 488
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 72/154 (46%), Gaps = 4/154 (2%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA--NIQATAQALG 106
+P D V+ +FN KI L++W +L AVP + L L EA ++ A+ G
Sbjct: 293 VPHDPPVFGSFNNFSKITDEMLRLWGEILAAVPRARLLLKNVHPSREALCRMRTRAERAG 352
Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
+D R+ E++R D+ LDT G T+ + L+ G PVVT+ G +R
Sbjct: 353 IDMERLDLR--PGTREYLRDYLEVDIILDTYPYQGGGTTCEALFMGLPVVTMAGTRHGAR 410
Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
A L +G EL+ Y A+ L +DR+
Sbjct: 411 FGAGLLHNVGLAELVTEDSAAYTACAVGLASDRE 444
>gi|168066691|ref|XP_001785267.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663137|gb|EDQ49919.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 910
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 79/150 (52%), Gaps = 6/150 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP----AVGEANIQATAQALGLDQ 109
+ + +FN L KI P L++W +L +P++ L + P +V EA + A + LG++
Sbjct: 635 VTFGSFNNLAKITPRVLRVWARILIKMPSARLVVKCKPFCCDSVREAFL-AKLEELGVES 693
Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
RI L + +H++ L D+ LDT G TT+ + L+ G P VT+ G+ A V
Sbjct: 694 LRIDLLPLILLNHDHMQAYCLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGKVHAHNVG 753
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ L +G L+ +T EY D AI L ++
Sbjct: 754 VTLLHQVGLENLVVKTEDEYVDKAIELASN 783
>gi|86606504|ref|YP_475267.1| hypothetical protein CYA_1854 [Synechococcus sp. JA-3-3Ab]
gi|86555046|gb|ABD00004.1| TPR repeat protein [Synechococcus sp. JA-3-3Ab]
Length = 749
Score = 71.2 bits (173), Expect = 2e-10, Method: Composition-based stats.
Identities = 51/163 (31%), Positives = 83/163 (50%), Gaps = 10/163 (6%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQAT 101
RQ +GLP++A+VY L K P ++ + + VP + L P A+G+ Q
Sbjct: 539 RQDFGLPQEAVVYLTCQYLGKYLPQYDYLFPAIARQVPRAQFVFLALPNAAIGQRFWQRL 598
Query: 102 AQ-----ALGLDQH-RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
A L ++ H RIL ++++ + ADV LD+ +G T+++ + G PV
Sbjct: 599 ASIFSSYGLAVEDHCRIL--PALDHQDYLDLNRCADVMLDSYGWSGGITTLEAIAAGLPV 656
Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
VT PG L R + L +G LIA + Y+++A++LGTD
Sbjct: 657 VTCPGRFLRGRHTYAILKRMGLELLIAPDLQRYEELAVQLGTD 699
>gi|428306904|ref|YP_007143729.1| hypothetical protein Cri9333_3391 [Crinalium epipsammum PCC 9333]
gi|428248439|gb|AFZ14219.1| Tetratricopeptide TPR_1 repeat-containing protein [Crinalium
epipsammum PCC 9333]
Length = 1016
Score = 70.9 bits (172), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 53/184 (28%), Positives = 87/184 (47%), Gaps = 28/184 (15%)
Query: 36 PQSIVITS---RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA 92
PQS++ TS R + L +DA+VY + L+K P ++ + K VP S +K
Sbjct: 800 PQSVLPTSLKKRADFSLEDDALVYLSCQSLFKYLPQHDYIFAAIAKCVPQSQFVFIK--- 856
Query: 93 VGEANIQATAQALGLDQHRIL---FSNVAAKEE-------------HVRRGQLADVCLDT 136
++A+ HR L F++ K E ++ ++DV LDT
Sbjct: 857 ------SDISEAITRKLHRRLQKAFASFGLKSEEYCVMVPPLNQVDYLNLNLVSDVFLDT 910
Query: 137 PLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
+G T+++ + P+VT PGE + R + L LG E IA + EY +IA+RLG
Sbjct: 911 LGWSGGNTTLEAISCNLPIVTCPGEFMRGRHSYGILQMLGVTETIASSEAEYIEIAVRLG 970
Query: 197 TDRD 200
+++
Sbjct: 971 LEQE 974
>gi|288549794|ref|ZP_05968179.2| sulfotransferase [Enterobacter cancerogenus ATCC 35316]
gi|288317412|gb|EFC56350.1| sulfotransferase [Enterobacter cancerogenus ATCC 35316]
Length = 1390
Score = 70.9 bits (172), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 3/143 (2%)
Query: 59 FNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE---ANIQATAQALGLDQHRILFS 115
FN KI+ L W +L +VP+S L+L F E + T + G+ + R+
Sbjct: 416 FNNASKINDFLLNQWAVLLHSVPDSRLFLKSFNFDNENLKERVLTTLEGHGIARERVRIE 475
Query: 116 NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATL 175
+A +E + D+ LD +G T+ + L G PVVTLPG T A R +AS L
Sbjct: 476 GMAPHQELLACYNEVDIALDPWPYSGGLTTCEALAMGVPVVTLPGPTFAGRHSASHLVNA 535
Query: 176 GCPELIARTHKEYQDIAIRLGTD 198
G EL+A + + DI + L D
Sbjct: 536 GLQELVASDWQNFIDITVGLTQD 558
>gi|440753447|ref|ZP_20932650.1| methyltransferase, FkbM family domain protein [Microcystis
aeruginosa TAIHU98]
gi|440177940|gb|ELP57213.1| methyltransferase, FkbM family domain protein [Microcystis
aeruginosa TAIHU98]
Length = 1246
Score = 70.9 bits (172), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 76/158 (48%), Gaps = 8/158 (5%)
Query: 48 GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL------LKFPAVGEA-NIQA 100
L + I + N K++P L +W +L+A+P S L L L P ++ +
Sbjct: 505 ALSSEYITFGCLNNFSKLNPFLLSLWAKILQALPQSRLILKSHYHNLDDPEEKQSVELFL 564
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
Q L L+Q ++ S A E++ D+ LDT NG TT+ D LW G PV+TL G
Sbjct: 565 QEQGLNLEQVELIDSPTLA-EDYFALYHRIDIHLDTFPYNGCTTTCDALWMGVPVLTLAG 623
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ R+ S L +G + IA + +EY + AI D
Sbjct: 624 DRKIQRMGNSLLQAIGLGDWIAHSPEEYVNKAITFAQD 661
>gi|395010255|ref|ZP_10393652.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Acidovorax sp. CF316]
gi|394311671|gb|EJE48985.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Acidovorax sp. CF316]
Length = 726
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 74/149 (49%), Gaps = 6/149 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQ 109
I + + N L K+ L +W +L+AVP S L + L P A Q + LGLD
Sbjct: 401 ITFGSCNNLGKLTDEVLALWGQILEAVPGSRLLIEGKNLDQPDFANAYRQ-RCRGLGLDP 459
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ + +++ ++ D+ LD G TT+ DVLW G P+V++ G + SR+
Sbjct: 460 DRLELVALNGDNQYLTYHRI-DIALDPFPLTGGTTTFDVLWMGLPIVSMVGSSFKSRMGV 518
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
L+ LG E +A T ++Y IA L D
Sbjct: 519 GLLSYLGRTEWLAETSEDYVRIASGLAAD 547
>gi|159028890|emb|CAO90695.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length = 1261
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 76/158 (48%), Gaps = 8/158 (5%)
Query: 48 GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL------LKFPAVGEA-NIQA 100
L + I + N K++P L +W +L+A+P S L L L P ++ +
Sbjct: 520 ALSSEYITFGCLNNFSKLNPFLLSLWAKILQALPQSRLILKSHYHNLDDPEEKQSVELFL 579
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
Q L L+Q ++ S A E++ D+ LDT NG TT+ D LW G PV+TL G
Sbjct: 580 QEQGLNLEQVELIDSPTLA-EDYFALYHRIDIHLDTFPYNGCTTTCDALWMGVPVLTLAG 638
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ R+ S L +G + IA + +EY + AI D
Sbjct: 639 DRKIQRMGNSLLQAIGLGDWIAHSPEEYVNKAITFAQD 676
>gi|376007446|ref|ZP_09784641.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375324082|emb|CCE20394.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 1103
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 83/161 (51%), Gaps = 9/161 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI----- 98
RQQ +P +A+ Y K + ++ VN+L++VPNS+L G+A +
Sbjct: 901 RQQLNIPPNAVAYLCVAPGRKTNGEMIKAQVNILRSVPNSVLIR---KGQGDAQLLREMY 957
Query: 99 QATAQALGLDQHRILFSNVAA-KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ LG+D +R++F + +EEH ++ADV LD+ NG T +++ LW+ PVVT
Sbjct: 958 NQACEDLGVDINRLIFLGLTKTEEEHRAIYKVADVMLDSYPYNGGTHNLEALWSELPVVT 1017
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G SR+ + L ++ +A + +EY + I G +
Sbjct: 1018 RSGRQYLSRMGYAFLKSVNLDIGVAWSWEEYTQLGIEFGQN 1058
>gi|209522682|ref|ZP_03271240.1| TPR repeat-containing protein [Arthrospira maxima CS-328]
gi|209496731|gb|EDZ97028.1| TPR repeat-containing protein [Arthrospira maxima CS-328]
Length = 1103
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 83/161 (51%), Gaps = 9/161 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI----- 98
RQQ +P +A+ Y K + ++ VN+L++VPNS+L G+A +
Sbjct: 901 RQQLNIPPNAVAYLCVAPGRKTNGEMIKAQVNILRSVPNSVLIR---KGQGDAQLLREMY 957
Query: 99 QATAQALGLDQHRILFSNVAA-KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ LG+D +R++F + +EEH ++ADV LD+ NG T +++ LW+ PVVT
Sbjct: 958 NQACEDLGVDINRLIFLGLTKTEEEHRAIYKVADVMLDSYPYNGGTHNLEALWSELPVVT 1017
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G SR+ + L ++ +A + +EY + I G +
Sbjct: 1018 RSGRQYLSRMGYAFLKSVNLDIGVAWSWEEYTQLGIEFGQN 1058
>gi|423063206|ref|ZP_17051996.1| TPR repeat-containing protein [Arthrospira platensis C1]
gi|406715328|gb|EKD10484.1| TPR repeat-containing protein [Arthrospira platensis C1]
Length = 1103
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 83/161 (51%), Gaps = 9/161 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI----- 98
RQQ +P +A+ Y K + ++ VN+L++VPNS+L G+A +
Sbjct: 901 RQQLNIPPNAVAYLCVAPGRKTNGEMIKAQVNILRSVPNSVLIR---KGQGDAQLLREMY 957
Query: 99 QATAQALGLDQHRILFSNVAA-KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
+ LG+D +R++F + +EEH ++ADV LD+ NG T +++ LW+ PVVT
Sbjct: 958 NQACEDLGVDINRLIFLGLTKTEEEHRAIYKVADVMLDSYPYNGGTHNLEALWSELPVVT 1017
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G SR+ + L ++ +A + +EY + I G +
Sbjct: 1018 RSGRQYLSRMGYAFLKSVNLDIGVAWSWEEYTQLGIEFGQN 1058
>gi|443646868|ref|ZP_21129546.1| methyltransferase, FkbM family domain protein [Microcystis
aeruginosa DIANCHI905]
gi|443335697|gb|ELS50161.1| methyltransferase, FkbM family domain protein [Microcystis
aeruginosa DIANCHI905]
Length = 1246
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 76/158 (48%), Gaps = 8/158 (5%)
Query: 48 GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL------LKFPAVGEA-NIQA 100
L + I + N K++P L +W +L+A+P S L L L P ++ +
Sbjct: 505 ALSSEYITFGCLNNFSKLNPFLLSLWAKILQALPQSRLILKSHYHNLDDPEEKQSVELFL 564
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
Q L L+Q ++ S A E++ D+ LDT NG TT+ D LW G PV+TL G
Sbjct: 565 QEQGLNLEQVELIDSPTLA-EDYFALYHRIDIHLDTFPYNGCTTTCDALWMGVPVLTLAG 623
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ R+ S L +G + IA + +EY + AI D
Sbjct: 624 DRKIQRMGNSLLQAIGLGDWIAHSPEEYVNKAITFAQD 661
>gi|427732279|ref|YP_007078516.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Nostoc sp. PCC 7524]
gi|427368198|gb|AFY50919.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Nostoc sp. PCC 7524]
Length = 1323
Score = 70.5 bits (171), Expect = 3e-10, Method: Composition-based stats.
Identities = 48/181 (26%), Positives = 84/181 (46%), Gaps = 12/181 (6%)
Query: 32 GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
G E + R G+PEDA+V+ + +K P + W ++ V NS+L LL F
Sbjct: 955 GTEEGKLTTPVERNSLGVPEDAVVFISGANYFKTVPELMGTWAKIISKVSNSVLVLLPFG 1014
Query: 92 AVGEAN---------IQATAQALGLDQHRILFSN---VAAKEEHVRRGQLADVCLDTPLC 139
N +++ GL R++ + V +E+ ++ADV LD+
Sbjct: 1015 LNWSNNYPKIEFIHHLKSIFSKHGLATERLIVLDPQPVPDREDMKEYYKIADVYLDSYPF 1074
Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G T+ ++ L PV+ G S + A+ + TL P L+A + + Y ++A+ LGT+
Sbjct: 1075 AGTTSLIEPLQVNLPVIARQGNCFRSAMGAAIIQTLDIPNLVADSEESYIELAVALGTNS 1134
Query: 200 D 200
+
Sbjct: 1135 E 1135
>gi|330827422|ref|XP_003291797.1| hypothetical protein DICPUDRAFT_39606 [Dictyostelium purpureum]
gi|325077989|gb|EGC31666.1| hypothetical protein DICPUDRAFT_39606 [Dictyostelium purpureum]
Length = 717
Score = 70.5 bits (171), Expect = 4e-10, Method: Composition-based stats.
Identities = 53/153 (34%), Positives = 74/153 (48%), Gaps = 6/153 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF-PAVGEANIQATAQAL---GLDQ 109
I + +FN + K L W +L+ P S LLK P V E Q+ + L G +
Sbjct: 532 ITFGSFNIMAKYSDKCLNNWRIILEKSPPSTRLLLKSKPFVCEKTKQSFLKKLAKMGFNT 591
Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR-V 167
+I L +++H++ ++ D+ LDT G TT+ + LW G PVVTL S V
Sbjct: 592 SQIDLIGLFPQQKDHLQYYKMMDISLDTFPYAGTTTTCEALWMGVPVVTLYSPNCHSHNV 651
Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S L L P LIA + EY DIA+ L D D
Sbjct: 652 GKSILTNLDIPSLIAHSESEYVDIALSLSKDID 684
>gi|429736729|ref|ZP_19270617.1| hypothetical protein HMPREF9163_01491 [Selenomonas sp. oral taxon
138 str. F0429]
gi|429154357|gb|EKX97091.1| hypothetical protein HMPREF9163_01491 [Selenomonas sp. oral taxon
138 str. F0429]
Length = 905
Score = 70.5 bits (171), Expect = 4e-10, Method: Composition-based stats.
Identities = 46/151 (30%), Positives = 77/151 (50%), Gaps = 7/151 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLDQ 109
I++ +FNQ KI L W +++ +PNS L LLK A + + TA + +G D
Sbjct: 712 ILFASFNQYRKIRDDMLVAWRQIMERIPNSRL-LLKNSAYQQPGVAMTAYKRLKEMGFDM 770
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ F ++++ R D+ LDT G T+ D L+ G PV++ E ++R +
Sbjct: 771 SRVTFE--PGTKDYMMRYLDVDIALDTFPWPGGGTTCDALYMGVPVISYYTERHSTRFSY 828
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S LA +G EL + ++Y + A+ L + D
Sbjct: 829 SILANMGLAELASERMEDYIETAVALAGNLD 859
>gi|395762104|ref|ZP_10442773.1| TPR domain-containing protein [Janthinobacterium lividum PAMC
25724]
Length = 1070
Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 65/136 (47%), Gaps = 1/136 (0%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP-AVGEANIQATAQALGLDQHRI 112
+ + +FN++ KI+ + W +L+A+P++ + L P + I + G+ R+
Sbjct: 861 LTFGSFNRITKINRKVIARWSMLLRAIPDARMLLAAMPTSAPHPLIMRWLEEEGIAPERL 920
Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
F + + D+CLDT G TT+ LW G P +T G TL SR+ S L
Sbjct: 921 TFEGRTGITQFMEMHHRVDICLDTFPYGGGTTTFHALWMGVPTLTTAGTTLPSRIGTSIL 980
Query: 173 ATLGCPELIARTHKEY 188
+G E IA+ +Y
Sbjct: 981 LQVGLEEFIAKDDDDY 996
>gi|302878312|ref|YP_003846876.1| hypothetical protein Galf_1082 [Gallionella capsiferriformans ES-2]
gi|302581101|gb|ADL55112.1| Tetratricopeptide TPR_1 repeat-containing protein [Gallionella
capsiferriformans ES-2]
Length = 793
Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 76/145 (52%), Gaps = 5/145 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLDQ 109
+ + +FN+L K + W +L AVP S + +LK + +A + A G++
Sbjct: 597 VTFGSFNRLVKSSEAVYAAWAKILLAVPGSRM-ILKTHELDDAVVSARVAGYFVRAGVEV 655
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
RI+ ++E H+ D+ LD G T+++ L G PV+TL T+ RV+A
Sbjct: 656 GRIVLQGKTSREAHLAAFNQIDIALDPFPHGGGMTAIEGLMMGVPVITLRWPTMVGRVSA 715
Query: 170 SQLATLGCPELIARTHKEYQDIAIR 194
S + TLG + IA+T +EY D+A++
Sbjct: 716 SIMTTLGLTDWIAQTPQEYVDMAVQ 740
>gi|392377847|ref|YP_004985006.1| conserved protein of unknown function; putative TPR domains
[Azospirillum brasilense Sp245]
gi|356879328|emb|CCD00240.1| conserved protein of unknown function; putative TPR domains
[Azospirillum brasilense Sp245]
Length = 634
Score = 70.5 bits (171), Expect = 4e-10, Method: Composition-based stats.
Identities = 47/150 (31%), Positives = 72/150 (48%), Gaps = 6/150 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA----TAQALGLDQ 109
+ + +FN K+ P +++W +L VP++ L LK A G+A + G+D
Sbjct: 434 VTFGSFNNAAKVTPEVMRVWSAILARVPSARL-CLKSRAFGDAPTRERYLRQFAGNGVDP 492
Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ L + + H++ D+ LD NG TT+ + LW G PV+TL G +RV
Sbjct: 493 GRVDLLPPMDVIDHHLQAYDRIDIGLDPFPYNGTTTTCEALWMGVPVITLAGRHHVARVG 552
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
AS L G E IA Y + A+ L D
Sbjct: 553 ASLLTQCGLAEFIATDEAGYMETAVALAGD 582
>gi|83593683|ref|YP_427435.1| glycosyl transferase family protein [Rhodospirillum rubrum ATCC
11170]
gi|386350429|ref|YP_006048677.1| glycosyl transferase family protein [Rhodospirillum rubrum F11]
gi|83576597|gb|ABC23148.1| Glycosyl transferase [Rhodospirillum rubrum ATCC 11170]
gi|346718865|gb|AEO48880.1| glycosyl transferase [Rhodospirillum rubrum F11]
Length = 1837
Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 68/141 (48%), Gaps = 4/141 (2%)
Query: 63 YKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG---EANIQATAQALGLDQHRILFSNVAA 119
++I S L+ W ++L + N+ L +L A A QA G+ R+++ A
Sbjct: 982 FRISRSCLETWADLLAGMDNARLMILGRQAHDWEFRAGFSRLLQACGIAPERVVYRFQQA 1041
Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE-TLASRVAASQLATLGCP 178
+H+ D+ LDT NG D LW G PVVTL G LA R AS LA C
Sbjct: 1042 YADHLHSFAGIDIALDTFPANGGLCLADALWMGVPVVTLAGTGLLAERQGASLLAAADCG 1101
Query: 179 ELIARTHKEYQDIAIRLGTDR 199
+ IAR+ +Y IA +L DR
Sbjct: 1102 DWIARSPADYLAIARKLAGDR 1122
>gi|167589230|ref|ZP_02381618.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia ubonensis Bu]
Length = 760
Score = 70.5 bits (171), Expect = 4e-10, Method: Composition-based stats.
Identities = 49/150 (32%), Positives = 72/150 (48%), Gaps = 5/150 (3%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALGLD 108
++ + + N K+ + + +W VL AVP S L LLK P + EA ++ A A G+
Sbjct: 559 SVTFGSLNNANKLGDAVIALWARVLHAVPGSRL-LLKSPQLDEAALRVSLAARFAAHGIT 617
Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ +A+ H+ D+ LD G TTS++ LW G P+VT G S +
Sbjct: 618 AERLPMRGGSARLAHLSAFNDIDIALDPFPYPGGTTSLEGLWMGVPLVTRRGGRFLSHIG 677
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L TL PE IAR +Y A +D
Sbjct: 678 ESILHTLDMPEWIARDDDDYVAKAAAFASD 707
>gi|292670017|ref|ZP_06603443.1| TPR domain/SEC-C domain protein [Selenomonas noxia ATCC 43541]
gi|292648318|gb|EFF66290.1| TPR domain/SEC-C domain protein [Selenomonas noxia ATCC 43541]
Length = 468
Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 74/154 (48%), Gaps = 4/154 (2%)
Query: 49 LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA--NIQATAQALG 106
+P D+ V+ +FN KI LQ+W +L+AVP + L L E +Q A+ G
Sbjct: 273 VPHDSPVFASFNNFAKITDEMLQIWGGILRAVPGAHLLLKNVHPQRETLRRMQRRAERAG 332
Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
+D R+ ++++R AD+ LDT G T+ + L G PVVTL G +R
Sbjct: 333 IDPVRLELR--PGSKDYLRDYLDADIILDTYPYQGGGTTCEALCMGLPVVTLAGTRHGAR 390
Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S L G EL A + + Y + A L DR+
Sbjct: 391 FGVSLLQNAGLGELAADSVQAYAERAALLAQDRE 424
>gi|409993375|ref|ZP_11276518.1| hypothetical protein APPUASWS_19737 [Arthrospira platensis str.
Paraca]
gi|291568213|dbj|BAI90485.1| TPR domain protein [Arthrospira platensis NIES-39]
gi|409935756|gb|EKN77277.1| hypothetical protein APPUASWS_19737 [Arthrospira platensis str.
Paraca]
Length = 1104
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 82/161 (50%), Gaps = 9/161 (5%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI----- 98
RQQ +P +A+ Y K + ++ VN+L++VPNS+L G+A +
Sbjct: 902 RQQLNIPPNAVAYLCVAPGRKTNGEMIKAQVNILRSVPNSVLIR---KGQGDAQLLREMY 958
Query: 99 QATAQALGLDQHRILFSNVA-AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
Q +G+D +R++F + +EEH ++ADV LD+ NG T +++ LW+ PVVT
Sbjct: 959 NQACQEVGVDLNRLIFLGLTQTEEEHRAIYKVADVMLDSYPYNGGTHNLEALWSELPVVT 1018
Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
G SR+ + L + +A + +EY + I G +
Sbjct: 1019 RSGRQYLSRMGYAFLKAVNLDIGVAWSWEEYTQLGIEFGHN 1059
>gi|392381948|ref|YP_005031145.1| conserved protein of unknown function; glycosyltransferase and TPR
domains [Azospirillum brasilense Sp245]
gi|356876913|emb|CCC97706.1| conserved protein of unknown function; glycosyltransferase and TPR
domains [Azospirillum brasilense Sp245]
Length = 636
Score = 70.1 bits (170), Expect = 4e-10, Method: Composition-based stats.
Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 5/131 (3%)
Query: 56 YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLDQHR 111
+ +FN L K++ +W VL AVP S L LL+ P + +A A AL GLD R
Sbjct: 436 FGSFNALPKLNAEVAALWARVLGAVPGSRL-LLRTPGLDDAGTAARTLALFERAGLDPAR 494
Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
+ A E + DV LD +G T+++ LW G PVVTL G+ +R + +
Sbjct: 495 VDLRGGAPHREFLAGYGEVDVALDPFPYSGGLTTLEALWMGVPVVTLDGDRFCARHSVTH 554
Query: 172 LATLGCPELIA 182
LA+ G +L+A
Sbjct: 555 LASAGLSDLVA 565
>gi|288960866|ref|YP_003451206.1| TPR repeat-containing protein [Azospirillum sp. B510]
gi|288913174|dbj|BAI74662.1| TPR repeat-containing protein [Azospirillum sp. B510]
Length = 974
Score = 70.1 bits (170), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 72/153 (47%), Gaps = 12/153 (7%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
I + +FN + K++ +T+ +W +L AVP + L +LK+ +G+ QH I
Sbjct: 769 ITFGSFNNITKVNAATIALWARLLAAVPTARL-VLKWRGLGDGGGAGLRLLAAFAQHGIE 827
Query: 114 FSNVAAK-------EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
+ + +++V DV LD NG TT D LW G PV+ G T R
Sbjct: 828 AERLDLRGITPDPYQDYV----TIDVALDPAFANGGTTICDALWMGVPVLNQSGPTKIGR 883
Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
A+ L +G EL+ R EY +RL TDR
Sbjct: 884 WGATMLDAVGLGELVTRDDGEYLARGVRLATDR 916
>gi|425460198|ref|ZP_18839680.1| Genome sequencing data, contig C315 [Microcystis aeruginosa PCC
9808]
gi|389827193|emb|CCI21793.1| Genome sequencing data, contig C315 [Microcystis aeruginosa PCC
9808]
Length = 1246
Score = 70.1 bits (170), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 75/158 (47%), Gaps = 8/158 (5%)
Query: 48 GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL------LKFPAVGEA-NIQA 100
L + I + N K++P L +W +L+A+P S L L L P ++ +
Sbjct: 505 ALSSEYITFGCLNNFSKLNPFLLSLWAKILQALPQSRLILKSHYHNLDDPEEKQSVELFL 564
Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
Q L+Q ++ S A E++ D+ LDT NG TT+ D LW G PV+TL G
Sbjct: 565 QEQGFNLEQVELIDSPTLA-EDYFALYHRIDIHLDTFPYNGCTTTCDALWMGVPVLTLAG 623
Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ R+ S L +G + IA + +EY + AI D
Sbjct: 624 DRKIQRMGNSLLQAIGLEDWIAHSPEEYVNKAITFAQD 661
>gi|170694349|ref|ZP_02885503.1| TPR repeat-containing protein [Burkholderia graminis C4D1M]
gi|170140772|gb|EDT08946.1| TPR repeat-containing protein [Burkholderia graminis C4D1M]
Length = 791
Score = 70.1 bits (170), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 75/148 (50%), Gaps = 1/148 (0%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA-VGEANIQATAQALGLDQHRI 112
+ Y +FN+L K+ + +W +L+A P+S + + + + E A G+ R+
Sbjct: 575 VTYGSFNRLNKLSQEVIALWSVILRAEPSSRMVIGAISSKLDEPTYLEWFAAEGITADRL 634
Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
F + ++++ D+CLDT G TT+++ LW G P +T+PG ++ SR A L
Sbjct: 635 TFCPRGSLPLYMQQHHQVDLCLDTFPYTGSTTTLNALWMGVPTITMPGISMPSRGGACWL 694
Query: 173 ATLGCPELIARTHKEYQDIAIRLGTDRD 200
+G + I R +++ ++ L D D
Sbjct: 695 EHVGLEQFIVRDKEDFVRKSLELTRDLD 722
>gi|281200878|gb|EFA75092.1| hypothetical protein PPL_11166 [Polysphondylium pallidum PN500]
Length = 667
Score = 69.7 bits (169), Expect = 6e-10, Method: Composition-based stats.
Identities = 45/151 (29%), Positives = 73/151 (48%), Gaps = 4/151 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL---GLDQH 110
I + +N L K+ T +W +L +P + + + + E+ QA + L G+D
Sbjct: 477 ITFGCYNTLSKVQKGTWAVWKLILDRLPTARIAIKAPLFIVESAAQAYLKKLADIGVDTT 536
Query: 111 RILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+L + + H D+ LD NG TTS+D LW G P V++ G T V
Sbjct: 537 RVLLKTYSMETGNHYDSYNEMDISLDPFPYNGTTTSLDSLWMGVPFVSMSGVTHVHNVGR 596
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S L +G EL+ +EY +IA++L +D +
Sbjct: 597 SILTNVGLKELVGSNPEEYVNIAVQLASDTE 627
>gi|17228054|ref|NP_484602.1| hypothetical protein alr0558 [Nostoc sp. PCC 7120]
gi|17129903|dbj|BAB72516.1| alr0558 [Nostoc sp. PCC 7120]
Length = 1013
Score = 69.7 bits (169), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 79/167 (47%), Gaps = 12/167 (7%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF-----------PA 92
R G+P +A+V+ + +K P + W ++ VPNS+L LL F
Sbjct: 656 RNSLGIPGNAVVFISGANYFKTVPELMDTWAKIISIVPNSVLVLLPFGPNWSSNYPKAEF 715
Query: 93 VGEANIQATAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
+ N + LG ++ R+L V +E+ ++ADV LD+ G T+ ++ L
Sbjct: 716 INHLNSIFSRHGLGTERLRVLDPQPVPDREDMKEYYKIADVYLDSYPFAGTTSLIEPLQV 775
Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
PV+ G S + A+ + TL P+L+A + Y ++A+ LG +
Sbjct: 776 NLPVIARQGSCFRSAMGAAIIQTLNIPDLVADSEDSYIELAVALGNN 822
>gi|186681177|ref|YP_001864373.1| group 1 glycosyl transferase [Nostoc punctiforme PCC 73102]
gi|186463629|gb|ACC79430.1| glycosyl transferase, group 1 [Nostoc punctiforme PCC 73102]
Length = 2289
Score = 69.7 bits (169), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 84/161 (52%), Gaps = 2/161 (1%)
Query: 40 VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ 99
VI +R + L +DA++Y +K P ++ + +P + L+ + E ++
Sbjct: 2092 VIKTRSDFRLEDDAVIYLCCQAPFKYLPQYDFIFAEIACRLPQAKFVFLR-GTLLEPRLK 2150
Query: 100 ATAQALGLD-QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
A+GL+ + +F ++ + +++ L+DV LDT +G T+++ + P+VT
Sbjct: 2151 RAFAAVGLNSEDYCVFLSIPERLDYLMINLLSDVYLDTFTWSGGNTTLEAIACNLPIVTC 2210
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
PGE + R + S L LG + IA+ EY +IA++LG D+
Sbjct: 2211 PGEFMRGRHSDSFLKMLGVTDTIAQNEAEYIEIAVKLGLDQ 2251
>gi|401565692|ref|ZP_10806514.1| glycosyltransferase family 41 domain protein [Selenomonas sp.
FOBRC6]
gi|400185544|gb|EJO19772.1| glycosyltransferase family 41 domain protein [Selenomonas sp.
FOBRC6]
Length = 906
Score = 69.7 bits (169), Expect = 6e-10, Method: Composition-based stats.
Identities = 46/151 (30%), Positives = 77/151 (50%), Gaps = 7/151 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLDQ 109
I++ +FNQ KI L W +++ +PNS L LLK A + + TA + +G D
Sbjct: 713 ILFASFNQYRKIRDDMLVAWHQIMERIPNSRL-LLKNSAYQQPGVAMTAYKRLKEMGFDM 771
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ F ++++ R D+ LDT G T+ D L+ G PV++ E ++R +
Sbjct: 772 SRVTFE--PGTKDYMMRYLDVDIALDTFPWPGGGTTCDALYMGVPVISYYTERHSTRFSY 829
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
S LA +G EL + ++Y + A+ L + D
Sbjct: 830 SILANMGLAELASERMEDYIETAVVLAGNLD 860
>gi|429736728|ref|ZP_19270616.1| tetratricopeptide repeat protein [Selenomonas sp. oral taxon 138
str. F0429]
gi|429154356|gb|EKX97090.1| tetratricopeptide repeat protein [Selenomonas sp. oral taxon 138
str. F0429]
Length = 1079
Score = 69.7 bits (169), Expect = 6e-10, Method: Composition-based stats.
Identities = 51/156 (32%), Positives = 75/156 (48%), Gaps = 15/156 (9%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRI 112
+I++ +FN K++ L+ W +L V S L L +A++ + A A RI
Sbjct: 359 SIIFGSFNNFTKLNDEVLRAWAEILHRVEGSRLLL-------KADVFSYADARAEVMQRI 411
Query: 113 LFSN---VAAKEEHVRRGQLA-----DVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
+ V E R LA D+ LD G T+ D L+ G PVVT+ GE+L
Sbjct: 412 EAAGIPLVCVDTEGASRDYLAAYNRMDIALDPFPYPGGGTTCDALYMGVPVVTMRGESLG 471
Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
SR AS + +G LIA+T +EY D A+ L D +
Sbjct: 472 SRFGASLVENIGAGALIAQTTEEYIDRAVSLARDTE 507
Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats.
Identities = 49/152 (32%), Positives = 76/152 (50%), Gaps = 9/152 (5%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK-----FPAVGEANIQATAQALGLD 108
I++ FNQ K L +W +++ +P S L LLK PA+ EA + + LG D
Sbjct: 887 ILFGVFNQYRKFTDEMLLLWREIMERMPTSKL-LLKSQIFFAPAMVEA-ARMRLERLGFD 944
Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R++ A +++ R D+ LDT G T+ D L+ G PVVTL GE ++R +
Sbjct: 945 LTRVILE--PATTDYMERYLDVDIALDTYPWPGGGTTCDALYMGVPVVTLYGERRSTRFS 1002
Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
+ L +G EL ++ +Y A+ L D D
Sbjct: 1003 YALLTHVGLEELAVQSSADYVAYAVTLAGDLD 1034
>gi|119488074|ref|ZP_01621518.1| TPR repeat protein [Lyngbya sp. PCC 8106]
gi|119455363|gb|EAW36502.1| TPR repeat protein [Lyngbya sp. PCC 8106]
Length = 759
Score = 69.7 bits (169), Expect = 6e-10, Method: Composition-based stats.
Identities = 50/179 (27%), Positives = 86/179 (48%), Gaps = 22/179 (12%)
Query: 34 EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV 93
E+P+ + +R +GL D ++Y + L+K P Q++V + + VP L +F V
Sbjct: 551 EIPE--LEKTRSNFGLKNDTVIYLSCQSLFKYLPQYDQIFVEIAQQVP-----LAQFVFV 603
Query: 94 GEANIQATAQALGLDQHRILFSNVA-------------AKEEHVRRGQLADVCLDTPLCN 140
N A ++ + ++ F++ K ++++ +AD+ LDT
Sbjct: 604 SHWN-SAISEKF-RKRLKLAFADFGLNSEDYCTILPRLEKSDYLQLNLIADIGLDTIQFT 661
Query: 141 GHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
G T++D + P+VT GE + SR A L +G E I R KEY IA++LG +R
Sbjct: 662 GFLTTLDSIACNLPIVTCEGELMRSRQTAGILKRIGVTETIVRNEKEYIKIAVQLGLNR 720
>gi|172059231|ref|YP_001806883.1| hypothetical protein BamMC406_0166 [Burkholderia ambifaria MC40-6]
gi|171991748|gb|ACB62667.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia ambifaria
MC40-6]
Length = 754
Score = 69.7 bits (169), Expect = 6e-10, Method: Composition-based stats.
Identities = 49/142 (34%), Positives = 67/142 (47%), Gaps = 5/142 (3%)
Query: 51 EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALG 106
E + + N KI + + +W VL AVP S L LLK + EA +I A A G
Sbjct: 558 EGVVTFGCLNNANKIGDAVVALWSRVLHAVPGSRL-LLKSAQLDEAALRTSIAARFAAHG 616
Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
+ R+L + + H+ D+ LD G TTSM+ LW G P +T G+ S
Sbjct: 617 IPVERLLLRGGSKRLAHIATYNDIDIVLDPFPYPGGTTSMEGLWMGAPFITRRGDRFLSH 676
Query: 167 VAASQLATLGCPELIARTHKEY 188
+ S L TLG PE IA +Y
Sbjct: 677 IGESILHTLGMPEWIADNDDDY 698
>gi|330815133|ref|YP_004358838.1| hypothetical protein bgla_1g01810 [Burkholderia gladioli BSR3]
gi|327367526|gb|AEA58882.1| TPR domain protein [Burkholderia gladioli BSR3]
Length = 814
Score = 69.7 bits (169), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 73/146 (50%), Gaps = 1/146 (0%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-IQATAQALGLDQHRI 112
+ + +FN+L K+ + +W +L AVP S + + P G + + A G+ R+
Sbjct: 592 VTFGSFNRLNKLRHDVIAVWARILHAVPGSRMRIGSIPRDGGVDMLLGWFSAEGISHDRL 651
Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
A ++++ D LDT G TT+++ LW G P +T+ G+TLASR A +
Sbjct: 652 DLQPRAPAAVYLQQHHHVDFGLDTFPYTGSTTALNALWMGVPTLTIRGDTLASRAGAVWM 711
Query: 173 ATLGCPELIARTHKEYQDIAIRLGTD 198
+++G + +A ++ AI L D
Sbjct: 712 SSVGLEQFVADDVDDFVARAIALAGD 737
>gi|115524525|ref|YP_781436.1| hypothetical protein RPE_2518 [Rhodopseudomonas palustris BisA53]
gi|115518472|gb|ABJ06456.1| Tetratricopeptide TPR_2 repeat protein [Rhodopseudomonas palustris
BisA53]
Length = 676
Score = 69.7 bits (169), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 68/134 (50%), Gaps = 5/134 (3%)
Query: 59 FNQLYKIDPSTLQMWVNVLKAVPNSIL----WLLKFPAVGEANIQATAQALGLDQHRILF 114
FN++ KI ++ Q W +L VPN+ W L A+ N+ A LG+ RI
Sbjct: 487 FNRIGKISDASAQAWAQILHLVPNARFVVKHWGLD-DALTRDNLLARFARLGVPPERIAL 545
Query: 115 SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLAT 174
++ +H+ D+CLDT NG ++ + L G PVV L G T +SRVAA+ L
Sbjct: 546 RGTTSRPDHLAALNGVDICLDTFPQNGGVSTWEALQMGVPVVALIGATASSRVAAAILTA 605
Query: 175 LGCPELIARTHKEY 188
+G + IA + + Y
Sbjct: 606 VGMTDWIADSPESY 619
>gi|428769583|ref|YP_007161373.1| hypothetical protein Cyan10605_1210 [Cyanobacterium aponinum PCC
10605]
gi|428683862|gb|AFZ53329.1| hypothetical protein Cyan10605_1210 [Cyanobacterium aponinum PCC
10605]
Length = 725
Score = 69.7 bits (169), Expect = 7e-10, Method: Composition-based stats.
Identities = 46/162 (28%), Positives = 78/162 (48%), Gaps = 5/162 (3%)
Query: 44 RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP-AVGEANIQ--- 99
++ G+ AI+Y + K P +++ + +L+ V +S + + + NI+
Sbjct: 520 KESLGISNSAIIYLSLQTGLKRHPEYIRLQMKILRQVADSYFLVSGYKNELSMNNIKNLF 579
Query: 100 -ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
AQ G++ RI F +++ + DV LDT NG TT++D LW P+VT
Sbjct: 580 TQIAQQEGVNPDRIKFLPYMPLQDYRANLFIGDVVLDTYPFNGATTTLDALWLNIPLVTR 639
Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
G+ SR + L LG E +A T +EY ++ GTD +
Sbjct: 640 VGQQFHSRQGYTFLQNLGITEGMAYTDEEYIQWGVKFGTDEE 681
>gi|225163601|ref|ZP_03725909.1| tetratricopeptide TPR_2 repeat protein [Diplosphaera colitermitum
TAV2]
gi|224801803|gb|EEG20091.1| tetratricopeptide TPR_2 repeat protein [Diplosphaera colitermitum
TAV2]
Length = 652
Score = 69.3 bits (168), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 71/141 (50%), Gaps = 6/141 (4%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA----LGLD 108
++ + FN K + W +L AVP S L LLK + +A ++A A GL
Sbjct: 455 SVTFGCFNSPTKFTDTLFATWARLLAAVPGSSL-LLKGRDLEDAGVRAHLFARMNRAGLP 513
Query: 109 QHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
+ R L A H+ + D+ LDT NG TT+ + LW G PV+T+ G+ A+RV
Sbjct: 514 EERTELLPRTADTASHLALYKRVDIALDTFPYNGTTTTCESLWMGRPVITIGGDRHAARV 573
Query: 168 AASQLATLGCPELIARTHKEY 188
+AS L +G PE IA +Y
Sbjct: 574 SASLLTAIGRPEWIASCPDDY 594
>gi|448747261|ref|ZP_21728922.1| Methyltransferase FkbM [Halomonas titanicae BH1]
gi|445565173|gb|ELY21285.1| Methyltransferase FkbM [Halomonas titanicae BH1]
Length = 1072
Score = 69.3 bits (168), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 65/134 (48%), Gaps = 3/134 (2%)
Query: 70 LQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQHRILFSNVAAKEEHVRR 126
L+ W ++ A+PNS L L E I+ T G++ R+L + E +
Sbjct: 3 LEQWAKIMLALPNSKLMLKSMQYKSEERCQKIKDTMALYGIESDRLLIEGPSPHAELLDA 62
Query: 127 GQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHK 186
D+ LD +G T+ + G PVV+LPG T A R +A+ L G PEL+ + +
Sbjct: 63 YNKVDISLDPWPYSGGLTTCESFLMGVPVVSLPGPTFAGRHSATHLINAGMPELVVSSWE 122
Query: 187 EYQDIAIRLGTDRD 200
EY++ + L +D D
Sbjct: 123 EYRERVLELASDLD 136
>gi|347821315|ref|ZP_08874749.1| hypothetical protein VeAt4_19630, partial [Verminephrobacter
aporrectodeae subsp. tuberculatae At4]
Length = 728
Score = 69.3 bits (168), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 73/149 (48%), Gaps = 6/149 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQ 109
+ + + N L K+ L +W VL A P S L + L P +A Q Q LGL+
Sbjct: 412 VTFGSCNNLGKLSDEVLALWGRVLAATPGSRLLIEGKNLDRPDFADAYRQ-RCQGLGLEP 470
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ + +++ ++ D+ LD +G TT+ D LW G PVV++ G + R+
Sbjct: 471 QRLDLVALRNANQYLTYHRI-DIALDPFPLSGGTTTFDALWMGLPVVSMVGASFKGRMGV 529
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
S L LG E +A T ++Y IA RL D
Sbjct: 530 SLLTHLGRSEWLAETPQDYVRIAQRLAAD 558
>gi|159472004|ref|XP_001694146.1| O-linked N-acetylglucosamine transferase [Chlamydomonas
reinhardtii]
gi|158277313|gb|EDP03082.1| O-linked N-acetylglucosamine transferase [Chlamydomonas
reinhardtii]
Length = 837
Score = 69.3 bits (168), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 72/145 (49%), Gaps = 7/145 (4%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQH 110
I + +FN L KI P L++W +L AVP S L L P EA ++ ALG++
Sbjct: 644 ITFGSFNNLAKITPQVLRVWAAILAAVPRSRLVLKNKPFACEAARSHLLRQLAALGVEGW 703
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L +H+ + L D+ LD G TT+ + L+ G P +TL G A V
Sbjct: 704 RVDLLPLAPGNSQHLSQYALMDISLDPFPYAGTTTTTESLYMGVPTLTLAGRCHAHNVGV 763
Query: 170 SQLATLG---CPELIARTHKEYQDI 191
S L +G P +AR+ +EY ++
Sbjct: 764 SLLTAVGLHPLPPWVARSEEEYVEL 788
>gi|390960058|ref|YP_006423815.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Terriglobus roseus DSM 18391]
gi|390414976|gb|AFL90480.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Terriglobus roseus DSM 18391]
Length = 598
Score = 69.3 bits (168), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 70/147 (47%), Gaps = 1/147 (0%)
Query: 48 GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE-ANIQATAQALG 106
L +C ++ K++ + +W +L+ P S + L P E ++A G
Sbjct: 397 ALASGVFTFCCVARINKLNRPHVALWARMLQCAPASRMLLATLPGGAEPERVKAWFAEAG 456
Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
+ +++F++ EE +R +AD+CLDT NG TT L G P +TL G L SR
Sbjct: 457 VGAEQLIFAHARTVEEQLRLHHMADLCLDTFPYNGATTISHALCLGVPTLTLRGAVLGSR 516
Query: 167 VAASQLATLGCPELIARTHKEYQDIAI 193
+ AS + +G E IA + Y A+
Sbjct: 517 LGASINSHVGITEFIAEDGEAYVRKAV 543
>gi|115350209|ref|YP_772048.1| hypothetical protein Bamb_0153 [Burkholderia ambifaria AMMD]
gi|115280197|gb|ABI85714.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia ambifaria
AMMD]
Length = 754
Score = 69.3 bits (168), Expect = 8e-10, Method: Composition-based stats.
Identities = 50/152 (32%), Positives = 69/152 (45%), Gaps = 5/152 (3%)
Query: 51 EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALG 106
E + + N KI + + +W VL AVP S L LLK + EA ++ A A G
Sbjct: 558 EGVVTFGCLNNANKIGDAVVALWSRVLHAVPGSRL-LLKSAQLDEAALRTSLAARFAAHG 616
Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
+ R+L + + H+ D+ LD G TTSM+ LW G P +T G+ S
Sbjct: 617 IPAERLLLRGGSKRLAHIATYNDIDIVLDPFPYPGGTTSMEGLWMGAPFITRRGDRFLSH 676
Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ S L TLG PE IA +Y A D
Sbjct: 677 IGESILHTLGMPEWIADNDDDYVAKAAAFAQD 708
>gi|292670019|ref|ZP_06603445.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541]
gi|292648320|gb|EFF66292.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541]
Length = 1079
Score = 69.3 bits (168), Expect = 9e-10, Method: Composition-based stats.
Identities = 48/145 (33%), Positives = 73/145 (50%), Gaps = 7/145 (4%)
Query: 53 AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLD 108
+IV+ +FN K++ L +W +L+ VP S L LLK A+ + A +A G+
Sbjct: 361 SIVFGSFNNFTKLNDRVLSVWAEILRRVPESRL-LLKTDVFSYADSRREALRRIEAAGIP 419
Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
R+ A +++ D+ LD G T+ D L+ G PVVTL G +L SR
Sbjct: 420 LVRV--DAEGASADYLAAYARVDIALDPFPYPGGGTTCDALYMGVPVVTLAGASLGSRFG 477
Query: 169 ASQLATLGCPELIARTHKEYQDIAI 193
AS L +G EL A T ++Y +A+
Sbjct: 478 ASLLENIGAKELAAHTEEDYIALAV 502
Score = 62.8 bits (151), Expect = 8e-08, Method: Composition-based stats.
Identities = 52/177 (29%), Positives = 82/177 (46%), Gaps = 18/177 (10%)
Query: 35 VPQSIVITSRQQYGLPEDA---------IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
+P V R GLP A I + FNQ K + L +W +++ VP + L
Sbjct: 862 LPSQFVYVPRA--GLPAAAETPARSRGYITFGVFNQYRKYTDAMLTVWREIMERVPTARL 919
Query: 86 WLLKFPAVGEANIQATAQA----LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNG 141
L+K + TA+A LG D R+ A ++++R D+ LDT G
Sbjct: 920 -LIKSQVFFSPAMTETARARMKRLGFDLRRVALE--PATTDYMQRYLDVDIALDTYPWPG 976
Query: 142 HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
T+ D L+ G PVVT+ G ++R + + LA +G +L +T +Y + A+ L D
Sbjct: 977 GGTTCDALYMGVPVVTMYGARRSTRFSYALLAHIGQTDLAVQTPADYIERAVSLAGD 1033
>gi|168061951|ref|XP_001782948.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162665566|gb|EDQ52246.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 899
Score = 69.3 bits (168), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 75/149 (50%), Gaps = 4/149 (2%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQH 110
+ + +FN L KI P L++W +L VP + L + P ++ + + LG+
Sbjct: 635 VTFGSFNNLAKITPRVLRVWARILIKVPTARLVVKCKPFCCDSVRDTFLSKLEELGVKSL 694
Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+ L + +H++ L D+ LDT G TT+ + L+ G P VT+ G+ A V
Sbjct: 695 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGKVHAHNVGV 754
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
+ L +G L+ RT +Y D A++L ++
Sbjct: 755 TLLHQVGLENLVVRTEDDYVDKAVQLASN 783
>gi|399155209|ref|ZP_10755276.1| hypothetical protein gproSAA_05251, partial [gamma proteobacterium
SCGC AAA007-O20]
Length = 163
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 54/91 (59%)
Query: 106 GLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLAS 165
G+ R++F E++ R ++AD+ LDT N TT+ D L G PV+T G + AS
Sbjct: 17 GIHPSRLIFGKRLPTPEYLARYRVADLFLDTLPYNAGTTASDALRMGLPVLTCIGNSFAS 76
Query: 166 RVAASQLATLGCPELIARTHKEYQDIAIRLG 196
RVAAS L + PELI T ++Y+ +AI+L
Sbjct: 77 RVAASILNAVNLPELITTTQEQYESLAIQLA 107
>gi|398805153|ref|ZP_10564134.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Polaromonas sp. CF318]
gi|398092315|gb|EJL82730.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
[Polaromonas sp. CF318]
Length = 733
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 71/149 (47%), Gaps = 5/149 (3%)
Query: 54 IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLDQ 109
+ + FN L K+ + + +W +L+AVP S L L+ + EA+ + T A G+D
Sbjct: 524 VTFACFNALPKMGDAVVALWARILQAVPGSRLHLMA-SQLSEASTRQTTLERFMAHGIDA 582
Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
R+L + ++++ Q D+ LD G TT+ + LW G PV+TL G T SR
Sbjct: 583 TRLLIQGPMPRIKYLQTYQRVDIALDPFPYTGGTTTAEALWMGVPVLTLAGNTFLSRQGV 642
Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
L G E IA +Y A+ D
Sbjct: 643 GLLMNAGLSEWIASDADDYVKRAVAHAGD 671
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.131 0.385
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,964,749,775
Number of Sequences: 23463169
Number of extensions: 113467777
Number of successful extensions: 288947
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1446
Number of HSP's successfully gapped in prelim test: 51
Number of HSP's that attempted gapping in prelim test: 286377
Number of HSP's gapped (non-prelim): 1593
length of query: 200
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 65
effective length of database: 9,191,667,552
effective search space: 597458390880
effective search space used: 597458390880
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)