BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy15362
         (200 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|242023889|ref|XP_002432363.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           putative [Pediculus humanus corporis]
 gi|212517786|gb|EEB19625.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           putative [Pediculus humanus corporis]
          Length = 1041

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 174/200 (87%), Positives = 189/200 (94%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MIASGQ+QTSVNGIV+QNGLATNQTN K ATGEEVPQSIVIT+RQQY LP+DA+VYCNFN
Sbjct: 788 MIASGQIQTSVNGIVVQNGLATNQTNNKAATGEEVPQSIVITTRQQYNLPDDAVVYCNFN 847

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQMW+N+LK VPN+ILWLL+FPAVGE+NI ATAQ LG+   RILFSNVAAK
Sbjct: 848 QLYKIDPITLQMWMNILKHVPNAILWLLRFPAVGESNIHATAQQLGVSPARILFSNVAAK 907

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 908 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 967

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IAR+ ++YQDIAIRLGTDR+
Sbjct: 968 IARSRQDYQDIAIRLGTDRE 987


>gi|357612087|gb|EHJ67792.1| hypothetical protein KGM_21815 [Danaus plexippus]
          Length = 1054

 Score =  367 bits (943), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 172/200 (86%), Positives = 186/200 (93%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIASGQVQTSVNG++LQNGLAT QTN K ATGEEVPQSIVIT+RQQYGLP+DA+VYCNFN
Sbjct: 802  MIASGQVQTSVNGVILQNGLATTQTNNKAATGEEVPQSIVITTRQQYGLPDDAVVYCNFN 861

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL MWV +LK VPNS+LWLL+FPAVGE N+QATAQ LGL   RI+FSNVAAK
Sbjct: 862  QLYKIDPLTLHMWVYILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLPPGRIIFSNVAAK 921

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTTSMD+LWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 922  EEHVRRGQLADVCLDTPLCNGHTTSMDILWTGTPVVTLPGETLASRVAASQLNTLGCPEL 981

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IART +EYQDIA+RLGTDR+
Sbjct: 982  IARTRQEYQDIAVRLGTDRE 1001


>gi|270004555|gb|EFA01003.1| hypothetical protein TcasGA2_TC003916 [Tribolium castaneum]
          Length = 1054

 Score =  367 bits (941), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 171/200 (85%), Positives = 188/200 (94%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MIASGQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+RQQYGLP+DA+VYCNFN
Sbjct: 800 MIASGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRQQYGLPDDAVVYCNFN 859

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TL MWVN+LKAVPNS+LWLL+FPAVGE N+QATAQ LGL   RILFSNVAAK
Sbjct: 860 QLYKIDPMTLHMWVNILKAVPNSVLWLLRFPAVGEPNLQATAQQLGLPPGRILFSNVAAK 919

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT PGETLASRVAASQLATLGCPEL
Sbjct: 920 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTFPGETLASRVAASQLATLGCPEL 979

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IAR+ +EYQ+IAI+LGTD++
Sbjct: 980 IARSRQEYQEIAIKLGTDKE 999


>gi|189235894|ref|XP_967579.2| PREDICTED: similar to AGAP006254-PA [Tribolium castaneum]
          Length = 1086

 Score =  366 bits (940), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 171/200 (85%), Positives = 188/200 (94%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIASGQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+RQQYGLP+DA+VYCNFN
Sbjct: 832  MIASGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRQQYGLPDDAVVYCNFN 891

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL MWVN+LKAVPNS+LWLL+FPAVGE N+QATAQ LGL   RILFSNVAAK
Sbjct: 892  QLYKIDPMTLHMWVNILKAVPNSVLWLLRFPAVGEPNLQATAQQLGLPPGRILFSNVAAK 951

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT PGETLASRVAASQLATLGCPEL
Sbjct: 952  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTFPGETLASRVAASQLATLGCPEL 1011

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IAR+ +EYQ+IAI+LGTD++
Sbjct: 1012 IARSRQEYQEIAIKLGTDKE 1031


>gi|383866051|ref|XP_003708485.1| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit-like
            [Megachile rotundata]
          Length = 1094

 Score =  363 bits (931), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 171/200 (85%), Positives = 184/200 (92%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+IVIT+RQQYGLPEDA+VYCNFN
Sbjct: 832  MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIVITTRQQYGLPEDAVVYCNFN 891

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL   RILFSNVAAK
Sbjct: 892  QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 951

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 952  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 1011

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IART +EYQDIAIRLGTDR+
Sbjct: 1012 IARTRQEYQDIAIRLGTDRE 1031


>gi|350396796|ref|XP_003484668.1| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit-like
            [Bombus impatiens]
          Length = 1065

 Score =  362 bits (930), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 170/200 (85%), Positives = 184/200 (92%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+I+IT+RQQYGLPEDA+VYCNFN
Sbjct: 802  MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIMITTRQQYGLPEDAVVYCNFN 861

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL   RILFSNVAAK
Sbjct: 862  QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 921

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 922  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 981

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IART +EYQDIAIRLGTDR+
Sbjct: 982  IARTRQEYQDIAIRLGTDRE 1001


>gi|328777929|ref|XP_003249419.1| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit isoform 1
            [Apis mellifera]
          Length = 1065

 Score =  362 bits (930), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 170/200 (85%), Positives = 184/200 (92%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+I+IT+RQQYGLPEDA+VYCNFN
Sbjct: 802  MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIMITTRQQYGLPEDAVVYCNFN 861

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL   RILFSNVAAK
Sbjct: 862  QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 921

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 922  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 981

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IART +EYQDIAIRLGTDR+
Sbjct: 982  IARTRQEYQDIAIRLGTDRE 1001


>gi|380019749|ref|XP_003693765.1| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit-like
            isoform 2 [Apis florea]
          Length = 1065

 Score =  362 bits (930), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 170/200 (85%), Positives = 184/200 (92%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+I+IT+RQQYGLPEDA+VYCNFN
Sbjct: 802  MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIMITTRQQYGLPEDAVVYCNFN 861

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL   RILFSNVAAK
Sbjct: 862  QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 921

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 922  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 981

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IART +EYQDIAIRLGTDR+
Sbjct: 982  IARTRQEYQDIAIRLGTDRE 1001


>gi|340715916|ref|XP_003396453.1| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit-like
            isoform 1 [Bombus terrestris]
          Length = 1065

 Score =  362 bits (930), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 170/200 (85%), Positives = 184/200 (92%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+I+IT+RQQYGLPEDA+VYCNFN
Sbjct: 802  MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIMITTRQQYGLPEDAVVYCNFN 861

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL   RILFSNVAAK
Sbjct: 862  QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 921

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 922  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 981

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IART +EYQDIAIRLGTDR+
Sbjct: 982  IARTRQEYQDIAIRLGTDRE 1001


>gi|158295621|ref|XP_316319.4| AGAP006254-PA [Anopheles gambiae str. PEST]
 gi|157016124|gb|EAA10760.4| AGAP006254-PA [Anopheles gambiae str. PEST]
          Length = 1120

 Score =  362 bits (929), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 169/200 (84%), Positives = 187/200 (93%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIASGQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IV+T+RQQYGLP+DAIVYCNFN
Sbjct: 868  MIASGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVVTTRQQYGLPDDAIVYCNFN 927

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TLQ WV +LK VPNS+LWLL+FPAVGEANIQATAQ +G+   RI+FSNVAAK
Sbjct: 928  QLYKIDPLTLQSWVTILKHVPNSVLWLLRFPAVGEANIQATAQQMGIAAGRIIFSNVAAK 987

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP ETLASRVAASQLATLGCPEL
Sbjct: 988  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPAETLASRVAASQLATLGCPEL 1047

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IA++ +EYQDIAI+LGTDR+
Sbjct: 1048 IAKSRQEYQDIAIKLGTDRE 1067


>gi|340715918|ref|XP_003396454.1| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit-like
            isoform 2 [Bombus terrestris]
          Length = 1095

 Score =  362 bits (929), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 170/200 (85%), Positives = 184/200 (92%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+I+IT+RQQYGLPEDA+VYCNFN
Sbjct: 832  MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIMITTRQQYGLPEDAVVYCNFN 891

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL   RILFSNVAAK
Sbjct: 892  QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 951

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 952  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 1011

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IART +EYQDIAIRLGTDR+
Sbjct: 1012 IARTRQEYQDIAIRLGTDRE 1031


>gi|380019747|ref|XP_003693764.1| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit-like
            isoform 1 [Apis florea]
          Length = 1095

 Score =  362 bits (929), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 170/200 (85%), Positives = 184/200 (92%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+I+IT+RQQYGLPEDA+VYCNFN
Sbjct: 832  MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIMITTRQQYGLPEDAVVYCNFN 891

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL   RILFSNVAAK
Sbjct: 892  QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 951

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 952  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 1011

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IART +EYQDIAIRLGTDR+
Sbjct: 1012 IARTRQEYQDIAIRLGTDRE 1031


>gi|307193762|gb|EFN76443.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110
            kDa subunit [Harpegnathos saltator]
          Length = 1180

 Score =  362 bits (929), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 170/200 (85%), Positives = 184/200 (92%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+IVIT+RQQYGLPEDA+VYCNFN
Sbjct: 918  MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIVITTRQQYGLPEDAVVYCNFN 977

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL   RILFSNVAAK
Sbjct: 978  QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 1037

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 1038 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 1097

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            +ART +EYQDIAIRLGTDR+
Sbjct: 1098 VARTRQEYQDIAIRLGTDRE 1117


>gi|110760723|ref|XP_623820.2| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit isoform 2
            [Apis mellifera]
          Length = 1095

 Score =  362 bits (928), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 170/200 (85%), Positives = 184/200 (92%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+I+IT+RQQYGLPEDA+VYCNFN
Sbjct: 832  MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIMITTRQQYGLPEDAVVYCNFN 891

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL   RILFSNVAAK
Sbjct: 892  QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 951

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 952  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 1011

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IART +EYQDIAIRLGTDR+
Sbjct: 1012 IARTRQEYQDIAIRLGTDRE 1031


>gi|332029693|gb|EGI69572.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110
            kDa subunit [Acromyrmex echinatior]
          Length = 1087

 Score =  361 bits (927), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 169/200 (84%), Positives = 184/200 (92%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+IVIT+RQQYGLPEDA+VYCNFN
Sbjct: 824  MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIVITTRQQYGLPEDAVVYCNFN 883

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL MW ++LK VPNS+LWLL+FPAVGE N+QATAQ LGL   RILFSNVAAK
Sbjct: 884  QLYKIDPLTLHMWAHILKHVPNSVLWLLRFPAVGEPNLQATAQQLGLAPGRILFSNVAAK 943

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 944  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 1003

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            +ART +EYQDI+IRLGTDR+
Sbjct: 1004 VARTRQEYQDISIRLGTDRE 1023


>gi|328700630|ref|XP_001950113.2| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like
           isoform 1 [Acyrthosiphon pisum]
          Length = 1045

 Score =  359 bits (922), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 171/201 (85%), Positives = 187/201 (93%), Gaps = 1/201 (0%)

Query: 1   MIASGQVQTSVNGIVLQNGLAT-NQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
           MIASGQ QTSVNG+VLQNGLA  +  N K ATGEEVPQ+IVITSRQQY LPEDA+VYCNF
Sbjct: 791 MIASGQAQTSVNGVVLQNGLAVVDNGNNKAATGEEVPQNIVITSRQQYSLPEDAVVYCNF 850

Query: 60  NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
           NQLYKIDP+TL MWVN+LKAVPNS+LWLL+FPAVGE +IQ+TA +LG++  RILFSNVAA
Sbjct: 851 NQLYKIDPNTLLMWVNILKAVPNSVLWLLRFPAVGETHIQSTAASLGIEPGRILFSNVAA 910

Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
           KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE
Sbjct: 911 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 970

Query: 180 LIARTHKEYQDIAIRLGTDRD 200
           LIART +EYQDIAIRLGTD++
Sbjct: 971 LIARTREEYQDIAIRLGTDKE 991


>gi|328700632|ref|XP_003241333.1| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit-like
            isoform 2 [Acyrthosiphon pisum]
          Length = 1090

 Score =  359 bits (922), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 171/201 (85%), Positives = 187/201 (93%), Gaps = 1/201 (0%)

Query: 1    MIASGQVQTSVNGIVLQNGLAT-NQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
            MIASGQ QTSVNG+VLQNGLA  +  N K ATGEEVPQ+IVITSRQQY LPEDA+VYCNF
Sbjct: 836  MIASGQAQTSVNGVVLQNGLAVVDNGNNKAATGEEVPQNIVITSRQQYSLPEDAVVYCNF 895

Query: 60   NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
            NQLYKIDP+TL MWVN+LKAVPNS+LWLL+FPAVGE +IQ+TA +LG++  RILFSNVAA
Sbjct: 896  NQLYKIDPNTLLMWVNILKAVPNSVLWLLRFPAVGETHIQSTAASLGIEPGRILFSNVAA 955

Query: 120  KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
            KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE
Sbjct: 956  KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 1015

Query: 180  LIARTHKEYQDIAIRLGTDRD 200
            LIART +EYQDIAIRLGTD++
Sbjct: 1016 LIARTREEYQDIAIRLGTDKE 1036


>gi|312381341|gb|EFR27109.1| hypothetical protein AND_06385 [Anopheles darlingi]
          Length = 833

 Score =  359 bits (922), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 167/200 (83%), Positives = 187/200 (93%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MIASGQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IV+T+RQQYGLP+DAIVYCNFN
Sbjct: 580 MIASGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVVTTRQQYGLPDDAIVYCNFN 639

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQ W+ +LK VPNS+LWLL+FPAVGEANIQATAQ +G+   RI+FSNVAAK
Sbjct: 640 QLYKIDPLTLQSWIYILKHVPNSVLWLLRFPAVGEANIQATAQQMGIAAGRIIFSNVAAK 699

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP ETLASRVAASQLATLGCPEL
Sbjct: 700 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPAETLASRVAASQLATLGCPEL 759

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IA++ +EYQDIA++LGTDR+
Sbjct: 760 IAKSRQEYQDIAVKLGTDRE 779


>gi|157106778|ref|XP_001649477.1| o-linked n-acetylglucosamine transferase, ogt [Aedes aegypti]
 gi|108868774|gb|EAT32999.1| AAEL014746-PA [Aedes aegypti]
          Length = 793

 Score =  359 bits (922), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 167/200 (83%), Positives = 185/200 (92%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MIASGQ+QTS+NG+V+QNGLAT QTN K ATGEEVPQ+IV+T+RQQYGLP+DA+VYCNFN
Sbjct: 539 MIASGQIQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVVTTRQQYGLPDDAVVYCNFN 598

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TL  WVN+LK VPNS+LWLL+FPAVGE NIQA AQ LGL   RI+FSNVAAK
Sbjct: 599 QLYKIDPHTLASWVNILKHVPNSVLWLLRFPAVGETNIQAAAQQLGLSPGRIIFSNVAAK 658

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP ETLASRVAASQLATLGCPEL
Sbjct: 659 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPAETLASRVAASQLATLGCPEL 718

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IAR+ +EYQDIAI+LGTD++
Sbjct: 719 IARSRQEYQDIAIKLGTDKE 738


>gi|229577290|ref|NP_001153341.1| O-glycosyltransferase [Nasonia vitripennis]
          Length = 1061

 Score =  357 bits (917), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 167/200 (83%), Positives = 182/200 (91%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MIASGQ Q SVNG+V+QNG+AT Q N KTATGEEVPQ+IVIT+RQQYGLPEDA+VYCNFN
Sbjct: 799 MIASGQCQMSVNGVVVQNGMATTQVNNKTATGEEVPQNIVITTRQQYGLPEDAVVYCNFN 858

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TL MW  +LK VPNS+LWLL+FPAVGE N+Q TAQ LGL   RILFSNVAAK
Sbjct: 859 QLYKIDPLTLHMWAYILKNVPNSVLWLLRFPAVGEQNLQTTAQQLGLAPGRILFSNVAAK 918

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 919 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPEL 978

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           +ART +EYQ+IA+RLGTDR+
Sbjct: 979 VARTRQEYQEIAVRLGTDRE 998


>gi|307173081|gb|EFN64211.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110
            kDa subunit [Camponotus floridanus]
          Length = 1092

 Score =  355 bits (912), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 165/200 (82%), Positives = 182/200 (91%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIASGQ Q SVNG+V+QNG+ T Q N K ATGEEVPQ+IVIT+RQQYGLPEDA+VYCNFN
Sbjct: 830  MIASGQCQMSVNGVVVQNGMTTTQVNNKAATGEEVPQNIVITTRQQYGLPEDAVVYCNFN 889

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL MW ++LK VPN++LWLL+FPAVGE N+Q+TAQ LGL   RILFSNVAAK
Sbjct: 890  QLYKIDPLTLHMWAHILKHVPNAVLWLLRFPAVGEPNLQSTAQQLGLTPGRILFSNVAAK 949

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCP+L
Sbjct: 950  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLNTLGCPDL 1009

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            +ART +EYQDIAIRLGTDR+
Sbjct: 1010 VARTRQEYQDIAIRLGTDRE 1029


>gi|170036979|ref|XP_001846338.1| o-linked N-acetylglucosamine transferase, ogt [Culex
           quinquefasciatus]
 gi|167879966|gb|EDS43349.1| o-linked N-acetylglucosamine transferase, ogt [Culex
           quinquefasciatus]
          Length = 838

 Score =  355 bits (912), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 164/200 (82%), Positives = 185/200 (92%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MIASGQ+QTS+NG+V+QNGLAT QTN K ATGEEVPQ+IV+T+RQQYGLP+DA++YCNFN
Sbjct: 579 MIASGQIQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVVTTRQQYGLPDDAVIYCNFN 638

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TL  W N+LK VPN++LWLL+FPAVGEANIQA AQ LG+   RI+FSNVAAK
Sbjct: 639 QLYKIDPHTLTSWCNILKHVPNAVLWLLRFPAVGEANIQAAAQQLGVAPGRIIFSNVAAK 698

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP ETLASRVAASQLATLGCPEL
Sbjct: 699 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPAETLASRVAASQLATLGCPEL 758

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IAR+ +EYQDIAI+LGTD++
Sbjct: 759 IARSRQEYQDIAIKLGTDKE 778


>gi|321463604|gb|EFX74619.1| hypothetical protein DAPPUDRAFT_324191 [Daphnia pulex]
          Length = 1043

 Score =  353 bits (905), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 162/200 (81%), Positives = 183/200 (91%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI SGQVQTSVNG+V+QNGLATNQ NT+ ATGEEVPQSIV+T+RQQYGLP+DAI+YCNFN
Sbjct: 789 MIGSGQVQTSVNGVVVQNGLATNQANTRAATGEEVPQSIVVTTRQQYGLPDDAIIYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQMWVN+LK VPN++LWLL+FP VGE N+ ATAQALGL   RILFSNVAAK
Sbjct: 849 QLYKIDPPTLQMWVNILKRVPNAVLWLLRFPTVGETNMLATAQALGLGSGRILFSNVAAK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLAD+CLDTPLCNGHTT MDVLW GTP+VTLPGETLASRVAASQL T+GCPEL
Sbjct: 909 EEHVRRGQLADICLDTPLCNGHTTGMDVLWAGTPMVTLPGETLASRVAASQLTTIGCPEL 968

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IA + +EY+DIA++LGTD +
Sbjct: 969 IASSPQEYEDIAVKLGTDSE 988


>gi|195430888|ref|XP_002063480.1| GK21381 [Drosophila willistoni]
 gi|194159565|gb|EDW74466.1| GK21381 [Drosophila willistoni]
          Length = 1059

 Score =  348 bits (893), Expect = 7e-94,   Method: Compositional matrix adjust.
 Identities = 163/200 (81%), Positives = 182/200 (91%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DAIVYCNFN
Sbjct: 806  MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAIVYCNFN 865

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TLQ WV +LK VP S+LWLL+FPAVGE NI+ +   LG+   RI+FSNVAAK
Sbjct: 866  QLYKIDPQTLQSWVIILKNVPKSVLWLLRFPAVGEQNIKKSVSDLGISPDRIIFSNVAAK 925

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 926  EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 985

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IART +EYQDIAIRLGT+R+
Sbjct: 986  IARTREEYQDIAIRLGTERE 1005


>gi|195175533|ref|XP_002028500.1| GL13214 [Drosophila persimilis]
 gi|194103718|gb|EDW25761.1| GL13214 [Drosophila persimilis]
          Length = 881

 Score =  347 bits (891), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 162/200 (81%), Positives = 179/200 (89%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MIASGQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DAIVYCNFN
Sbjct: 628 MIASGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAIVYCNFN 687

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TL+ WV +LK VP S+LWLL+FPAVGE NI+ T   LG+   R++FSNVAAK
Sbjct: 688 QLYKIDPQTLESWVEILKNVPKSVLWLLRFPAVGEQNIKKTVSDLGISPDRVIFSNVAAK 747

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 748 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLVTLGCPEL 807

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IART  EYQDIAIRLGT R+
Sbjct: 808 IARTRDEYQDIAIRLGTQRE 827


>gi|198456072|ref|XP_002136393.1| GA29107 [Drosophila pseudoobscura pseudoobscura]
 gi|198142772|gb|EDY71489.1| GA29107 [Drosophila pseudoobscura pseudoobscura]
          Length = 652

 Score =  347 bits (890), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 162/200 (81%), Positives = 179/200 (89%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MIASGQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DAIVYCNFN
Sbjct: 399 MIASGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAIVYCNFN 458

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TL+ WV +LK VP S+LWLL+FPAVGE NI+ T   LG+   R++FSNVAAK
Sbjct: 459 QLYKIDPQTLESWVEILKNVPKSVLWLLRFPAVGEQNIKKTVSDLGISPDRVIFSNVAAK 518

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL TLGCPEL
Sbjct: 519 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLVTLGCPEL 578

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IART  EYQDIAIRLGT R+
Sbjct: 579 IARTRDEYQDIAIRLGTQRE 598


>gi|195382543|ref|XP_002049989.1| GJ20442 [Drosophila virilis]
 gi|194144786|gb|EDW61182.1| GJ20442 [Drosophila virilis]
          Length = 1050

 Score =  345 bits (885), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 159/200 (79%), Positives = 181/200 (90%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MIASGQVQTS+NG+++QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DAIVYCNFN
Sbjct: 800 MIASGQVQTSLNGVLVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAIVYCNFN 859

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQ WV +LK VP S+LWLL+FPAVGE NI+ +   LG+   R++FSNVAAK
Sbjct: 860 QLYKIDPQTLQSWVVILKNVPKSVLWLLRFPAVGEQNIKKSVSDLGVSSERVIFSNVAAK 919

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPV+TLPGETLASRVAASQLATLGCPEL
Sbjct: 920 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVITLPGETLASRVAASQLATLGCPEL 979

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IA + +EYQDIAIRLGTD++
Sbjct: 980 IANSREEYQDIAIRLGTDKE 999


>gi|195122698|ref|XP_002005848.1| GI18858 [Drosophila mojavensis]
 gi|193910916|gb|EDW09783.1| GI18858 [Drosophila mojavensis]
          Length = 1052

 Score =  345 bits (885), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 160/200 (80%), Positives = 182/200 (91%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DAIVYCNFN
Sbjct: 802  MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAIVYCNFN 861

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TLQ WV +LK VP S+LWLL+FPAVGE NI+ +   LG+   R++FSNVAAK
Sbjct: 862  QLYKIDPQTLQSWVIILKNVPKSVLWLLRFPAVGEQNIKKSVSDLGISSDRVIFSNVAAK 921

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 922  EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 981

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IA++ +EYQDIAIRLGT+R+
Sbjct: 982  IAQSREEYQDIAIRLGTERE 1001


>gi|195356295|ref|XP_002044614.1| GM11101 [Drosophila sechellia]
 gi|194132318|gb|EDW53892.1| GM11101 [Drosophila sechellia]
          Length = 333

 Score =  343 bits (881), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 158/200 (79%), Positives = 180/200 (90%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DA+VYCNFN
Sbjct: 83  MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAVVYCNFN 142

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TL+ WV +LK VP S+LWLL+FPAVGE NI+ T    G+   R++FSNVAAK
Sbjct: 143 QLYKIDPQTLESWVEILKNVPKSVLWLLRFPAVGEQNIKKTVSDFGISPDRVIFSNVAAK 202

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 203 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 262

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IART +EYQ+IAIRLGT ++
Sbjct: 263 IARTREEYQNIAIRLGTKKE 282


>gi|17647755|ref|NP_523620.1| super sex combs, isoform B [Drosophila melanogaster]
 gi|24585827|ref|NP_724406.1| super sex combs, isoform A [Drosophila melanogaster]
 gi|24585829|ref|NP_724407.1| super sex combs, isoform C [Drosophila melanogaster]
 gi|6942068|gb|AAF32311.1|AF217788_1 O-glycosyltransferase [Drosophila melanogaster]
 gi|7302245|gb|AAF57338.1| super sex combs, isoform B [Drosophila melanogaster]
 gi|10728167|gb|AAG22338.1| super sex combs, isoform A [Drosophila melanogaster]
 gi|10728168|gb|AAG22339.1| super sex combs, isoform C [Drosophila melanogaster]
 gi|189182206|gb|ACD81879.1| SD06381p [Drosophila melanogaster]
          Length = 1059

 Score =  343 bits (880), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 158/200 (79%), Positives = 180/200 (90%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DA+VYCNFN
Sbjct: 809  MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAVVYCNFN 868

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL+ WV +LK VP S+LWLL+FPAVGE NI+ T    G+   R++FSNVAAK
Sbjct: 869  QLYKIDPQTLESWVEILKNVPKSVLWLLRFPAVGEQNIKKTVSDFGISPDRVIFSNVAAK 928

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 929  EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 988

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IART +EYQ+IAIRLGT ++
Sbjct: 989  IARTREEYQNIAIRLGTKKE 1008


>gi|16769108|gb|AAL28773.1| LD16758p [Drosophila melanogaster]
          Length = 333

 Score =  343 bits (880), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 158/200 (79%), Positives = 180/200 (90%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DA+VYCNFN
Sbjct: 83  MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAVVYCNFN 142

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TL+ WV +LK VP S+LWLL+FPAVGE NI+ T    G+   R++FSNVAAK
Sbjct: 143 QLYKIDPQTLESWVEILKNVPKSVLWLLRFPAVGEQNIKKTVSDFGISPDRVIFSNVAAK 202

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 203 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 262

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IART +EYQ+IAIRLGT ++
Sbjct: 263 IARTREEYQNIAIRLGTKKE 282


>gi|5052534|gb|AAD38597.1|AF145622_1 BcDNA.GH04245 [Drosophila melanogaster]
          Length = 1011

 Score =  342 bits (877), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 158/197 (80%), Positives = 178/197 (90%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DA+VYCNFN
Sbjct: 809  MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAVVYCNFN 868

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TL+ WV +LK VP S+LWLL+FPAVGE NI+ T    G+   R++FSNVAAK
Sbjct: 869  QLYKIDPQTLESWVEILKNVPKSVLWLLRFPAVGEQNIKKTVSDFGISPDRVIFSNVAAK 928

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 929  EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 988

Query: 181  IARTHKEYQDIAIRLGT 197
            IART +EYQ+IAIRLGT
Sbjct: 989  IARTREEYQNIAIRLGT 1005


>gi|195093725|ref|XP_001997748.1| GH22221 [Drosophila grimshawi]
 gi|193905681|gb|EDW04548.1| GH22221 [Drosophila grimshawi]
          Length = 666

 Score =  342 bits (877), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 159/200 (79%), Positives = 180/200 (90%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DAIVYCNFN
Sbjct: 435 MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAIVYCNFN 494

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQ WV +LK VP S+LWLL+FPAVGE NI+ +   LG+   RI+FSNVAAK
Sbjct: 495 QLYKIDPQTLQSWVIILKNVPKSVLWLLRFPAVGEQNIKKSVSDLGISADRIIFSNVAAK 554

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 555 EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 614

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IA + + YQ+IAIRLGT+R+
Sbjct: 615 IAHSRESYQEIAIRLGTERE 634


>gi|195028406|ref|XP_001987067.1| GH21711 [Drosophila grimshawi]
 gi|193903067|gb|EDW01934.1| GH21711 [Drosophila grimshawi]
          Length = 1053

 Score =  341 bits (874), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 159/200 (79%), Positives = 180/200 (90%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MIA+GQVQTS+NG+V+QNGLAT QTN K ATGEEVPQ+IVIT+R+QY LP+DAIVYCNFN
Sbjct: 803  MIATGQVQTSLNGVVVQNGLATTQTNNKAATGEEVPQNIVITTRRQYMLPDDAIVYCNFN 862

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TLQ WV +LK VP S+LWLL+FPAVGE NI+ +   LG+   RI+FSNVAAK
Sbjct: 863  QLYKIDPQTLQSWVIILKNVPKSVLWLLRFPAVGEQNIKKSVSDLGISADRIIFSNVAAK 922

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLAD+CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL
Sbjct: 923  EEHVRRGQLADICLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 982

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IA + + YQ+IAIRLGT+R+
Sbjct: 983  IAHSRESYQEIAIRLGTERE 1002


>gi|427794599|gb|JAA62751.1| Putative o-linked n-acetylglucosamine transferase ogt, partial
           [Rhipicephalus pulchellus]
          Length = 1026

 Score =  335 bits (860), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 155/201 (77%), Positives = 180/201 (89%), Gaps = 1/201 (0%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI+SGQVQ  VNG+++ NGLA+ Q NT+ ATGEE P++I++T RQQYGLPEDA+VYCNFN
Sbjct: 776 MISSGQVQAQVNGVLVHNGLASTQVNTRAATGEEAPRAILVTCRQQYGLPEDAVVYCNFN 835

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQH-RILFSNVAA 119
           QLYK+DP+TL+MWVN+L+ VPNS+LWLL+FPAVGEA++QA AQA GL Q  RI+FSNVAA
Sbjct: 836 QLYKLDPATLRMWVNILQRVPNSVLWLLRFPAVGEAHVQAWAQAEGLTQPGRIIFSNVAA 895

Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
           KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTPVVT PGETLASRVAASQL  LG PE
Sbjct: 896 KEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPVVTWPGETLASRVAASQLHCLGVPE 955

Query: 180 LIARTHKEYQDIAIRLGTDRD 200
           L+AR+ +EYQDIA+RLGTDRD
Sbjct: 956 LVARSRREYQDIAVRLGTDRD 976


>gi|387016114|gb|AFJ50176.1| O-linked GlcNAc transferase [Crotalus adamanteus]
          Length = 933

 Score =  330 bits (847), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 148/198 (74%), Positives = 173/198 (87%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPED++VYCNFN
Sbjct: 676 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDSVVYCNFN 735

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 736 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 795

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGCPEL
Sbjct: 796 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCPEL 855

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 856 IAKSRQEYEDIAVKLGTD 873


>gi|363732735|ref|XP_003641145.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Gallus gallus]
          Length = 1045

 Score =  328 bits (842), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 788 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 847

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ LGL Q+RI+FS VA K
Sbjct: 848 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNLGLSQNRIIFSPVAPK 907

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 908 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 967

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 968 IAKSRQEYEDIAVKLGTD 985


>gi|118089398|ref|XP_001232519.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Gallus gallus]
          Length = 1035

 Score =  328 bits (842), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 778 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 837

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ LGL Q+RI+FS VA K
Sbjct: 838 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNLGLSQNRIIFSPVAPK 897

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 898 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 957

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 958 IAKSRQEYEDIAVKLGTD 975


>gi|326918804|ref|XP_003205676.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like,
           partial [Meleagris gallopavo]
          Length = 1038

 Score =  328 bits (841), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 781 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 840

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ LGL Q+RI+FS VA K
Sbjct: 841 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNLGLSQNRIIFSPVAPK 900

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 901 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 960

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 961 IAKSRQEYEDIAVKLGTD 978


>gi|149042178|gb|EDL95885.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase), isoform CRA_a [Rattus
           norvegicus]
 gi|149042179|gb|EDL95886.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase), isoform CRA_a [Rattus
           norvegicus]
          Length = 1046

 Score =  328 bits (841), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 969 IAKSRQEYEDIAVKLGTD 986


>gi|354502726|ref|XP_003513433.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit
           [Cricetulus griseus]
          Length = 1046

 Score =  328 bits (841), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 969 IAKSRQEYEDIAVKLGTD 986


>gi|449267896|gb|EMC78787.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit, partial [Columba livia]
          Length = 1036

 Score =  328 bits (841), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 779 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 838

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ LGL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNLGLSQNRIIFSPVAPK 898

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 959 IAKSRQEYEDIAVKLGTD 976


>gi|431914414|gb|ELK15671.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit [Pteropus alecto]
          Length = 1046

 Score =  328 bits (840), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 969 IAKSRQEYEDIAVKLGTD 986


>gi|46909607|ref|NP_631883.2| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit [Mus musculus]
 gi|146325019|sp|Q8CGY8.2|OGT1_MOUSE RecName: Full=UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit;
           AltName: Full=O-GlcNAc transferase subunit p110;
           AltName: Full=O-linked N-acetylglucosamine transferase
           110 kDa subunit; Short=OGT
 gi|34785719|gb|AAH57319.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase) [Mus musculus]
 gi|148682191|gb|EDL14138.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase), isoform CRA_b [Mus
           musculus]
          Length = 1046

 Score =  328 bits (840), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 969 IAKSRQEYEDIAVKLGTD 986


>gi|139948535|ref|NP_058803.2| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit [Rattus norvegicus]
 gi|149042180|gb|EDL95887.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase), isoform CRA_b [Rattus
           norvegicus]
 gi|149042181|gb|EDL95888.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase), isoform CRA_b [Rattus
           norvegicus]
          Length = 1036

 Score =  328 bits (840), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 959 IAKSRQEYEDIAVKLGTD 976


>gi|13775066|gb|AAK39123.1|AF363030_1 UDP-N-acetylglucosaminyltransferase [Mus musculus]
          Length = 1046

 Score =  328 bits (840), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 969 IAKSRQEYEDIAVKLGTD 986


>gi|27499606|gb|AAO17363.1| O-linked GlcNAc transferase [Mus musculus]
          Length = 1046

 Score =  328 bits (840), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 969 IAKSRQEYEDIAVKLGTD 986


>gi|395858889|ref|XP_003801787.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Otolemur garnettii]
          Length = 1046

 Score =  328 bits (840), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 969 IAKSRQEYEDIAVKLGTD 986


>gi|148682190|gb|EDL14137.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase), isoform CRA_a [Mus
           musculus]
          Length = 1036

 Score =  328 bits (840), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 959 IAKSRQEYEDIAVKLGTD 976


>gi|395858891|ref|XP_003801788.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Otolemur garnettii]
          Length = 1036

 Score =  328 bits (840), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 959 IAKSRQEYEDIAVKLGTD 976


>gi|417412996|gb|JAA52852.1| Putative o-linked n-acetylglucosamine transferase ogt, partial
           [Desmodus rotundus]
          Length = 874

 Score =  328 bits (840), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 617 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 676

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 677 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 736

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 737 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 796

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 797 IAKSRQEYEDIAVKLGTD 814


>gi|444707375|gb|ELW48655.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit [Tupaia chinensis]
          Length = 1007

 Score =  328 bits (840), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 750 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 809

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 810 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 869

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 870 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 929

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 930 IAKNRQEYEDIAVKLGTD 947


>gi|440901878|gb|ELR52744.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit, partial [Bos grunniens mutus]
          Length = 1037

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 780 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 839

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 840 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 899

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 900 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 959

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 960 IAKNRQEYEDIAVKLGTD 977


>gi|297304121|ref|XP_002806323.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit [Macaca
           mulatta]
          Length = 963

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 706 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 765

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 766 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 825

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 826 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 885

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 886 IAKNRQEYEDIAVKLGTD 903


>gi|301791309|ref|XP_002930623.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like
           isoform 2 [Ailuropoda melanoleuca]
 gi|410988810|ref|XP_004000670.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Felis catus]
 gi|355708635|gb|AES03331.1| O-linked N-acetylglucosamine transferase [Mustela putorius furo]
          Length = 1046

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986


>gi|31873825|emb|CAD97853.1| hypothetical protein [Homo sapiens]
          Length = 1046

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986


>gi|432118087|gb|ELK37988.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit [Myotis davidii]
          Length = 1046

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986


>gi|151554489|gb|AAI49783.1| OGT protein [Bos taurus]
          Length = 908

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 651 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 710

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 711 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 770

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 771 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 830

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 831 IAKNRQEYEDIAVKLGTD 848


>gi|344282022|ref|XP_003412774.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Loxodonta africana]
          Length = 1046

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986


>gi|403305181|ref|XP_003943148.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Saimiri boliviensis boliviensis]
          Length = 1046

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986


>gi|281337757|gb|EFB13341.1| hypothetical protein PANDA_021118 [Ailuropoda melanoleuca]
          Length = 1037

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 780 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 839

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 840 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 899

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 900 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 959

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 960 IAKNRQEYEDIAVKLGTD 977


>gi|32307148|ref|NP_858058.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit isoform 1 [Homo sapiens]
 gi|397498834|ref|XP_003820179.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Pan paniscus]
 gi|426396356|ref|XP_004064413.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Gorilla gorilla gorilla]
 gi|68067509|sp|O15294.3|OGT1_HUMAN RecName: Full=UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit;
           AltName: Full=O-GlcNAc transferase subunit p110;
           AltName: Full=O-linked N-acetylglucosamine transferase
           110 kDa subunit; Short=OGT
 gi|18250915|emb|CAC86128.1| UDP-N-acetylglucosamine: polypeptide-N-acetylglucosaminyl
           transferase [Homo sapiens]
 gi|23315618|gb|AAH38180.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase) [Homo sapiens]
 gi|30268372|emb|CAD89970.1| hypothetical protein [Homo sapiens]
 gi|119625691|gb|EAX05286.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase), isoform CRA_b [Homo
           sapiens]
 gi|410227760|gb|JAA11099.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase) [Pan troglodytes]
 gi|410227764|gb|JAA11101.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase) [Pan troglodytes]
 gi|410257116|gb|JAA16525.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase) [Pan troglodytes]
 gi|410342219|gb|JAA40056.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase) [Pan troglodytes]
          Length = 1046

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986


>gi|301791307|ref|XP_002930622.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like
           isoform 1 [Ailuropoda melanoleuca]
 gi|410988812|ref|XP_004000671.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Felis catus]
          Length = 1036

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 959 IAKNRQEYEDIAVKLGTD 976


>gi|149758509|ref|XP_001493422.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Equus caballus]
 gi|291407677|ref|XP_002720149.1| PREDICTED: O-linked GlcNAc transferase isoform 2 [Oryctolagus
           cuniculus]
 gi|296235743|ref|XP_002763023.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Callithrix jacchus]
 gi|348570518|ref|XP_003471044.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like
           isoform 2 [Cavia porcellus]
 gi|402910496|ref|XP_003917911.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Papio anubis]
 gi|426257214|ref|XP_004022227.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Ovis aries]
 gi|378405191|sp|P81436.2|OGT1_RABIT RecName: Full=UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit;
           AltName: Full=O-GlcNAc transferase subunit p110;
           AltName: Full=O-linked N-acetylglucosamine transferase
           110 kDa subunit; Short=OGT
 gi|355757457|gb|EHH60982.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit [Macaca fascicularis]
 gi|380817296|gb|AFE80522.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit isoform 1 [Macaca mulatta]
 gi|383422239|gb|AFH34333.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit isoform 1 [Macaca mulatta]
 gi|384949880|gb|AFI38545.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit isoform 1 [Macaca mulatta]
          Length = 1046

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986


>gi|344282024|ref|XP_003412775.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Loxodonta africana]
          Length = 1036

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 959 IAKNRQEYEDIAVKLGTD 976


>gi|149758511|ref|XP_001493438.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Equus caballus]
 gi|291407675|ref|XP_002720148.1| PREDICTED: O-linked GlcNAc transferase isoform 1 [Oryctolagus
           cuniculus]
 gi|296235745|ref|XP_002763024.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Callithrix jacchus]
 gi|348570516|ref|XP_003471043.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like
           isoform 1 [Cavia porcellus]
 gi|402910498|ref|XP_003917912.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Papio anubis]
 gi|426257212|ref|XP_004022226.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Ovis aries]
 gi|380817298|gb|AFE80523.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit isoform 2 [Macaca mulatta]
 gi|383422241|gb|AFH34334.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit isoform 2 [Macaca mulatta]
          Length = 1036

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 959 IAKNRQEYEDIAVKLGTD 976


>gi|403305183|ref|XP_003943149.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Saimiri boliviensis boliviensis]
          Length = 1036

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 959 IAKNRQEYEDIAVKLGTD 976


>gi|32307150|ref|NP_858059.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit isoform 2 [Homo sapiens]
 gi|397498836|ref|XP_003820180.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Pan paniscus]
 gi|426396358|ref|XP_004064414.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Gorilla gorilla gorilla]
 gi|15680175|gb|AAH14434.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase) [Homo sapiens]
 gi|18250914|emb|CAC86127.1| UDP-N-acatylglucosamine: polypeptide-N-acetylglucosaminyl
           transferase [Homo sapiens]
 gi|119625692|gb|EAX05287.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase), isoform CRA_c [Homo
           sapiens]
 gi|123993895|gb|ABM84549.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase) [synthetic construct]
 gi|124000677|gb|ABM87847.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase) [synthetic construct]
 gi|224487755|dbj|BAH24112.1| O-linked N-acetylglucosamine (GlcNAc) transferase [synthetic
           construct]
 gi|410227762|gb|JAA11100.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase) [Pan troglodytes]
 gi|410257114|gb|JAA16524.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase) [Pan troglodytes]
          Length = 1036

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 959 IAKNRQEYEDIAVKLGTD 976


>gi|332265872|ref|XP_003281937.1| PREDICTED: LOW QUALITY PROTEIN: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit
           [Nomascus leucogenys]
          Length = 1050

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 793 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 852

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 853 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 912

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 913 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 972

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 973 IAKNRQEYEDIAVKLGTD 990


>gi|126342739|ref|XP_001362317.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Monodelphis domestica]
          Length = 1035

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 778 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 837

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 838 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 897

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 898 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 957

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 958 IAKNRQEYEDIAVKLGTD 975


>gi|147899141|ref|NP_001091539.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit [Bos taurus]
 gi|146186901|gb|AAI40543.1| OGT protein [Bos taurus]
 gi|296470806|tpg|DAA12921.1| TPA: O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase) [Bos taurus]
          Length = 1036

 Score =  327 bits (839), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 959 IAKNRQEYEDIAVKLGTD 976


>gi|410056634|ref|XP_521123.4| PREDICTED: LOW QUALITY PROTEIN: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit [Pan
           troglodytes]
          Length = 1022

 Score =  327 bits (838), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 765 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 824

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 825 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 884

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 885 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 944

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 945 IAKNRQEYEDIAVKLGTD 962


>gi|126342737|ref|XP_001362233.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Monodelphis domestica]
          Length = 1045

 Score =  327 bits (838), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 788 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 847

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 848 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 907

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 908 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 967

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 968 IAKNRQEYEDIAVKLGTD 985


>gi|449499306|ref|XP_002189786.2| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit, partial
           [Taeniopygia guttata]
          Length = 1038

 Score =  327 bits (838), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 172/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 781 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 840

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ LGL Q+RI+FS VA K
Sbjct: 841 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNLGLAQNRIIFSPVAPK 900

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 901 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 960

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 961 IAKSRQEYEDIAVKLGTD 978


>gi|2266994|gb|AAB63466.1| O-linked GlcNAc transferase [Homo sapiens]
 gi|6911265|gb|AAF31458.1| HRNT1 [Homo sapiens]
 gi|18250916|emb|CAC86129.1| UDP-N-acetylglucosamine: polypeptide-N-acetylglucosaminyl
           transferase [Homo sapiens]
 gi|119625690|gb|EAX05285.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase), isoform CRA_a [Homo
           sapiens]
          Length = 920

 Score =  327 bits (838), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 663 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 722

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 723 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 782

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 783 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 842

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 843 IAKNRQEYEDIAVKLGTD 860


>gi|319443727|pdb|3PE3|A Chain A, Structure Of Human O-Glcnac Transferase And Its Complex
           With A Peptide Substrate
 gi|319443728|pdb|3PE3|B Chain B, Structure Of Human O-Glcnac Transferase And Its Complex
           With A Peptide Substrate
 gi|319443729|pdb|3PE3|C Chain C, Structure Of Human O-Glcnac Transferase And Its Complex
           With A Peptide Substrate
 gi|319443730|pdb|3PE3|D Chain D, Structure Of Human O-Glcnac Transferase And Its Complex
           With A Peptide Substrate
 gi|319443731|pdb|3PE4|A Chain A, Structure Of Human O-Glcnac Transferase And Its Complex
           With A Peptide Substrate
 gi|319443733|pdb|3PE4|C Chain C, Structure Of Human O-Glcnac Transferase And Its Complex
           With A Peptide Substrate
 gi|357380651|pdb|3TAX|A Chain A, A Neutral Diphosphate Mimic Crosslinks The Active Site Of
           Human O- Glcnac Transferase
 gi|357380653|pdb|3TAX|C Chain C, A Neutral Diphosphate Mimic Crosslinks The Active Site Of
           Human O- Glcnac Transferase
 gi|409973752|pdb|4AY5|A Chain A, Human O-Glcnac Transferase (Ogt) In Complex With Udp And
           Glycopeptide
 gi|409973753|pdb|4AY5|B Chain B, Human O-Glcnac Transferase (Ogt) In Complex With Udp And
           Glycopeptide
 gi|409973754|pdb|4AY5|C Chain C, Human O-Glcnac Transferase (Ogt) In Complex With Udp And
           Glycopeptide
 gi|409973755|pdb|4AY5|D Chain D, Human O-Glcnac Transferase (Ogt) In Complex With Udp And
           Glycopeptide
 gi|409973760|pdb|4AY6|A Chain A, Human O-Glcnac Transferase (Ogt) In Complex With
           Udp-5sglcnac And Substrate Peptide
 gi|409973761|pdb|4AY6|B Chain B, Human O-Glcnac Transferase (Ogt) In Complex With
           Udp-5sglcnac And Substrate Peptide
 gi|409973762|pdb|4AY6|C Chain C, Human O-Glcnac Transferase (Ogt) In Complex With
           Udp-5sglcnac And Substrate Peptide
 gi|409973763|pdb|4AY6|D Chain D, Human O-Glcnac Transferase (Ogt) In Complex With
           Udp-5sglcnac And Substrate Peptide
 gi|410563064|pdb|4GYW|A Chain A, Crystal Structure Of Human O-Glcnac Transferase In Complex
           With Udp And A Glycopeptide
 gi|410563066|pdb|4GYW|C Chain C, Crystal Structure Of Human O-Glcnac Transferase In Complex
           With Udp And A Glycopeptide
 gi|410563068|pdb|4GYY|A Chain A, Crystal Structure Of Human O-Glcnac Transferase With
           Udp-5sglcnac And A Peptide Substrate
 gi|410563070|pdb|4GYY|C Chain C, Crystal Structure Of Human O-Glcnac Transferase With
           Udp-5sglcnac And A Peptide Substrate
 gi|410563101|pdb|4GZ3|A Chain A, Crystal Structure Of Human O-Glcnac Transferase With Udp
           And A Thioglycopeptide
 gi|410563103|pdb|4GZ3|C Chain C, Crystal Structure Of Human O-Glcnac Transferase With Udp
           And A Thioglycopeptide
 gi|410563105|pdb|4GZ5|A Chain A, Crystal Structure Of Human O-Glcnac Transferase With
           Udp-Glcnac
 gi|410563106|pdb|4GZ5|B Chain B, Crystal Structure Of Human O-Glcnac Transferase With
           Udp-Glcnac
 gi|410563107|pdb|4GZ5|C Chain C, Crystal Structure Of Human O-Glcnac Transferase With
           Udp-Glcnac
 gi|410563108|pdb|4GZ5|D Chain D, Crystal Structure Of Human O-Glcnac Transferase With
           Udp-Glcnac
 gi|410563109|pdb|4GZ6|A Chain A, Crystal Structure Of Human O-Glcnac Transferase With
           Udp-5sglcnac
 gi|410563110|pdb|4GZ6|B Chain B, Crystal Structure Of Human O-Glcnac Transferase With
           Udp-5sglcnac
 gi|410563111|pdb|4GZ6|C Chain C, Crystal Structure Of Human O-Glcnac Transferase With
           Udp-5sglcnac
 gi|410563112|pdb|4GZ6|D Chain D, Crystal Structure Of Human O-Glcnac Transferase With
           Udp-5sglcnac
          Length = 723

 Score =  327 bits (838), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 471 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 530

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 531 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 590

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 591 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 650

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 651 IAKNRQEYEDIAVKLGTD 668


>gi|395546237|ref|XP_003774996.1| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit
            [Sarcophilus harrisii]
          Length = 1075

 Score =  327 bits (838), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 818  MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 877

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 878  QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 937

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 938  EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 997

Query: 181  IARTHKEYQDIAIRLGTD 198
            IA+  +EY+DIA++LGTD
Sbjct: 998  IAKNRQEYEDIAVKLGTD 1015


>gi|6562168|emb|CAB62528.1| hypothetical protein [Homo sapiens]
          Length = 665

 Score =  327 bits (838), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 408 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 467

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 468 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 527

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 528 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 587

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 588 IAKNRQEYEDIAVKLGTD 605


>gi|328908909|gb|AEB61122.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kda subunit-like protein, partial [Equus caballus]
          Length = 313

 Score =  327 bits (837), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 149/198 (75%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 56  MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 115

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 116 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 175

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 176 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 235

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 236 IAKNRQEYEDIAVKLGTD 253


>gi|351710822|gb|EHB13741.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110
            kDa subunit [Heterocephalus glaber]
          Length = 1270

 Score =  326 bits (836), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 148/198 (74%), Positives = 170/198 (85%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP +I++T+R QYGLP+DAIVYCNFN
Sbjct: 1013 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPHTIIVTTRSQYGLPDDAIVYCNFN 1072

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 1073 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 1132

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 1133 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 1192

Query: 181  IARTHKEYQDIAIRLGTD 198
            IA+  +EY+DIA++LGTD
Sbjct: 1193 IAKNRQEYEDIAVKLGTD 1210



 Score =  286 bits (732), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 131/171 (76%), Positives = 148/171 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 740 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 799

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 800 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 859

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           EEHVRRGQLADVCLDT LCNGHTT MDVLW GTP+VT+PGETLASRVAASQ
Sbjct: 860 EEHVRRGQLADVCLDTLLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQ 910


>gi|74007686|ref|XP_538075.2| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Canis lupus familiaris]
          Length = 1036

 Score =  326 bits (835), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 148/198 (74%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  ++Y+DIA++LGTD
Sbjct: 959 IAKNRQDYEDIAVKLGTD 976


>gi|66347879|ref|NP_001018117.1| O-linked N-acetylglucosamine transferase isoform 4 [Danio rerio]
 gi|56967378|gb|AAW31873.1| O-GlcNAc transferase variant 4 [Danio rerio]
          Length = 1036

 Score =  326 bits (835), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 147/200 (73%), Positives = 171/200 (85%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++IV+T+R QYGLPED+IVYCNFN
Sbjct: 777 MINQGQIQVTINGFTVSNGLATTQINNKAATGEEVPRTIVVTTRSQYGLPEDSIVYCNFN 836

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQMW N+LK VPNS++WLL+FPAVGE NIQ  AQ LGL   RI+FS VA K
Sbjct: 837 QLYKIDPPTLQMWANILKRVPNSVIWLLRFPAVGEPNIQQYAQNLGLPASRIIFSPVAPK 896

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGCPEL
Sbjct: 897 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCPEL 956

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IA++ +EY+D+A++LGTD +
Sbjct: 957 IAQSRQEYEDVAVKLGTDME 976


>gi|89886173|ref|NP_001034837.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit [Sus scrofa]
 gi|122142735|sp|Q27HV0.1|OGT1_PIG RecName: Full=UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit;
           AltName: Full=O-GlcNAc transferase subunit p110;
           AltName: Full=O-linked N-acetylglucosamine transferase
           110 kDa subunit; Short=OGT
 gi|89114276|gb|ABD61726.1| O-linked N-acetylglucosamine transferase [Sus scrofa]
          Length = 1046

 Score =  326 bits (835), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 148/198 (74%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +E++DIA++LGTD
Sbjct: 969 IAKNRQEFEDIAVKLGTD 986


>gi|74007670|ref|XP_849392.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Canis lupus familiaris]
          Length = 1046

 Score =  326 bits (835), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 148/198 (74%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  ++Y+DIA++LGTD
Sbjct: 969 IAKNRQDYEDIAVKLGTD 986


>gi|66347871|ref|NP_001018116.1| O-linked N-acetylglucosamine transferase isoform 3 [Danio rerio]
 gi|56967376|gb|AAW31872.1| O-GlcNAc transferase variant 3 [Danio rerio]
          Length = 1046

 Score =  326 bits (835), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 147/200 (73%), Positives = 171/200 (85%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++IV+T+R QYGLPED+IVYCNFN
Sbjct: 787 MINQGQIQVTINGFTVSNGLATTQINNKAATGEEVPRTIVVTTRSQYGLPEDSIVYCNFN 846

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQMW N+LK VPNS++WLL+FPAVGE NIQ  AQ LGL   RI+FS VA K
Sbjct: 847 QLYKIDPPTLQMWANILKRVPNSVIWLLRFPAVGEPNIQQYAQNLGLPASRIIFSPVAPK 906

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGCPEL
Sbjct: 907 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCPEL 966

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IA++ +EY+D+A++LGTD +
Sbjct: 967 IAQSRQEYEDVAVKLGTDME 986


>gi|190340092|gb|AAI63923.1| O-linked N-acetylglucosamine (GlcNAc) transferase
           (UDP-N-acetylglucosamine:polypeptide-N-
           acetylglucosaminyl transferase) [Danio rerio]
          Length = 1046

 Score =  325 bits (834), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 147/200 (73%), Positives = 171/200 (85%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++IV+T+R QYGLPED+IVYCNFN
Sbjct: 787 MINQGQIQVTINGFTVSNGLATTQINNKAATGEEVPRTIVVTTRSQYGLPEDSIVYCNFN 846

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQMW N+LK VPNS++WLL+FPAVGE NIQ  AQ LGL   RI+FS VA K
Sbjct: 847 QLYKIDPPTLQMWANILKRVPNSVIWLLRFPAVGEPNIQQYAQNLGLPASRIIFSPVAPK 906

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGCPEL
Sbjct: 907 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCPEL 966

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IA++ +EY+D+A++LGTD +
Sbjct: 967 IAQSRQEYEDVAVKLGTDME 986


>gi|327287038|ref|XP_003228236.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like
           isoform 2 [Anolis carolinensis]
          Length = 1036

 Score =  325 bits (834), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 147/198 (74%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 778 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 837

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP+TLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q RI+FS VA K
Sbjct: 838 QLYKIDPATLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQSRIIFSPVAPK 897

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 898 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 957

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 958 IAKSRQEYEDIAVKLGTD 975


>gi|327287040|ref|XP_003228237.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like
           isoform 3 [Anolis carolinensis]
          Length = 1046

 Score =  325 bits (834), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 147/198 (74%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 788 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 847

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP+TLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q RI+FS VA K
Sbjct: 848 QLYKIDPATLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQSRIIFSPVAPK 907

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 908 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 967

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 968 IAKSRQEYEDIAVKLGTD 985


>gi|66793439|ref|NP_001019747.1| O-linked N-acetylglucosamine (GlcNAc) transferase [Xenopus
           (Silurana) tropicalis]
 gi|60618530|gb|AAH90599.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           [Xenopus (Silurana) tropicalis]
          Length = 1035

 Score =  325 bits (832), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 146/198 (73%), Positives = 170/198 (85%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 778 MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 837

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q RI+FS VA K
Sbjct: 838 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQSRIIFSPVAPK 897

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PG+TLASRVAASQL  LGCPEL
Sbjct: 898 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGDTLASRVAASQLTCLGCPEL 957

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  ++Y+D A++LGTD
Sbjct: 958 IAKGRQDYEDTAVKLGTD 975


>gi|377657074|gb|AFB74087.1| O-linked N-acetylglucosamine transferase [Bubalus bubalis]
          Length = 1046

 Score =  324 bits (831), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 148/198 (74%), Positives = 170/198 (85%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCL TPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLGTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986


>gi|224587609|gb|ACN58689.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit [Salmo salar]
          Length = 401

 Score =  323 bits (827), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 147/200 (73%), Positives = 168/200 (84%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++IV+T+R QYGLPED+IVYCNFN
Sbjct: 142 MINQGQIQVTINGFTVSNGLATTQLNNKAATGEEVPRTIVVTTRSQYGLPEDSIVYCNFN 201

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ  GL   RI+FS VA K
Sbjct: 202 QLYKIDPPTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNFGLPGSRIIFSPVAPK 261

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGCPEL
Sbjct: 262 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLQCLGCPEL 321

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IA T +EY+D+A++LG D +
Sbjct: 322 IAHTRQEYEDVAVKLGCDME 341


>gi|3914191|sp|P56558.1|OGT1_RAT RecName: Full=UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit;
           AltName: Full=O-GlcNAc transferase subunit p110;
           AltName: Full=O-linked N-acetylglucosamine transferase
           110 kDa subunit; Short=OGT
 gi|1931579|gb|AAC53121.1| O-GlcNAc transferase, p110 subunit [Rattus norvegicus]
          Length = 1036

 Score =  323 bits (827), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 148/198 (74%), Positives = 171/198 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 779 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 838

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQM  N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 839 QLYKIDPSTLQMGANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 898

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC EL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLEL 958

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA++ +EY+DIA++LGTD
Sbjct: 959 IAKSRQEYEDIAVKLGTD 976


>gi|348516824|ref|XP_003445937.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Oreochromis niloticus]
          Length = 1038

 Score =  320 bits (821), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 144/200 (72%), Positives = 172/200 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q + K ATGEEVP++IV+T+R QYGLPED+IVYCNFN
Sbjct: 779 MINQGQIQVTINGFTVSNGLATTQISNKAATGEEVPRTIVVTTRSQYGLPEDSIVYCNFN 838

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQMWVN+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL   RI+FS VA K
Sbjct: 839 QLYKIDPPTLQMWVNILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPGSRIIFSPVAPK 898

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGCP+L
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLNCLGCPDL 958

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IA++ ++Y+D+A++LG+D +
Sbjct: 959 IAQSRQDYEDVAVKLGSDME 978


>gi|348516822|ref|XP_003445936.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           1 [Oreochromis niloticus]
          Length = 1048

 Score =  320 bits (821), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 144/200 (72%), Positives = 172/200 (86%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q + K ATGEEVP++IV+T+R QYGLPED+IVYCNFN
Sbjct: 789 MINQGQIQVTINGFTVSNGLATTQISNKAATGEEVPRTIVVTTRSQYGLPEDSIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQMWVN+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL   RI+FS VA K
Sbjct: 849 QLYKIDPPTLQMWVNILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPGSRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGCP+L
Sbjct: 909 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLNCLGCPDL 968

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IA++ ++Y+D+A++LG+D +
Sbjct: 969 IAQSRQDYEDVAVKLGSDME 988


>gi|292620687|ref|XP_001921543.2| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit isoform 1
            [Danio rerio]
          Length = 1102

 Score =  320 bits (819), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 146/200 (73%), Positives = 169/200 (84%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MI  GQ+Q ++NG  + NGLAT Q N K ATGEEV ++IV+T+R QYGLP+D+IVYCNFN
Sbjct: 843  MINQGQIQVTINGFTVSNGLATTQINNKAATGEEVLKTIVVTTRSQYGLPDDSIVYCNFN 902

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP TLQMW N+LK VPNS++WLL+FPAVGE NIQ  AQ LGL   RI+FS VA K
Sbjct: 903  QLYKIDPPTLQMWANILKRVPNSVIWLLRFPAVGEPNIQQYAQNLGLPASRIIFSPVAPK 962

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
            EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLAS VAASQL  LGCPEL
Sbjct: 963  EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASCVAASQLTCLGCPEL 1022

Query: 181  IARTHKEYQDIAIRLGTDRD 200
            IAR+ +EY+DIA++LGTD +
Sbjct: 1023 IARSRQEYEDIAVKLGTDME 1042


>gi|351700623|gb|EHB03542.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit [Heterocephalus glaber]
          Length = 1046

 Score =  319 bits (818), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 146/198 (73%), Positives = 168/198 (84%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP +I++T+R QYGLP+DAIVYCNFN
Sbjct: 789 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPHTIIVTTRSQYGLPDDAIVYCNFN 848

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA K
Sbjct: 849 QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPK 908

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDT LCNGHTT MDVLW GTP+VT+PGETLASRVA SQL  LGC EL
Sbjct: 909 EEHVRRGQLADVCLDTLLCNGHTTGMDVLWAGTPMVTMPGETLASRVATSQLTCLGCLEL 968

Query: 181 IARTHKEYQDIAIRLGTD 198
           IA+  +EY+DIA++LGTD
Sbjct: 969 IAKNRQEYEDIAVKLGTD 986


>gi|189527635|ref|XP_001921570.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           2 [Danio rerio]
          Length = 1045

 Score =  319 bits (818), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 146/200 (73%), Positives = 169/200 (84%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEV ++IV+T+R QYGLP+D+IVYCNFN
Sbjct: 786 MINQGQIQVTINGFTVSNGLATTQINNKAATGEEVLKTIVVTTRSQYGLPDDSIVYCNFN 845

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQMW N+LK VPNS++WLL+FPAVGE NIQ  AQ LGL   RI+FS VA K
Sbjct: 846 QLYKIDPPTLQMWANILKRVPNSVIWLLRFPAVGEPNIQQYAQNLGLPASRIIFSPVAPK 905

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLAS VAASQL  LGCPEL
Sbjct: 906 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASCVAASQLTCLGCPEL 965

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IAR+ +EY+DIA++LGTD +
Sbjct: 966 IARSRQEYEDIAVKLGTDME 985


>gi|47222947|emb|CAF99103.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1037

 Score =  319 bits (817), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 144/200 (72%), Positives = 169/200 (84%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q   K ATGEEV +++V+T+R QYGLPEDAIVYCNFN
Sbjct: 780 MINQGQIQVTINGFTVSNGLATTQITNKAATGEEVLRTVVVTTRSQYGLPEDAIVYCNFN 839

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQMW N+L  VPNS+LWLL+FPAVGE NIQ  AQ +GL   RI+FS VA K
Sbjct: 840 QLYKIDPPTLQMWANILTRVPNSVLWLLRFPAVGEPNIQQYAQKVGLPASRIIFSPVAPK 899

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVA SQL+ LGCPEL
Sbjct: 900 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVATSQLSCLGCPEL 959

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IA++H+EY+DIA++LG+D +
Sbjct: 960 IAQSHEEYEDIAVKLGSDME 979


>gi|148230096|ref|NP_001087833.1| O-linked N-acetylglucosamine (GlcNAc) transferase [Xenopus laevis]
 gi|51949982|gb|AAH82353.1| Ogt-prov protein [Xenopus laevis]
          Length = 1063

 Score =  317 bits (812), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 147/216 (68%), Positives = 172/216 (79%), Gaps = 18/216 (8%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 788  MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 847

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDPSTLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q RI+FS VA K
Sbjct: 848  QLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQSRIIFSPVAPK 907

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP------------------GET 162
            EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+P                  G+T
Sbjct: 908  EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGKESAESLGLHNKDCFLSGDT 967

Query: 163  LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            LASRVAASQL  LGCPELIA++ ++Y+DIA++LGTD
Sbjct: 968  LASRVAASQLTCLGCPELIAKSRQDYEDIAVKLGTD 1003


>gi|260833062|ref|XP_002611476.1| hypothetical protein BRAFLDRAFT_117202 [Branchiostoma floridae]
 gi|229296847|gb|EEN67486.1| hypothetical protein BRAFLDRAFT_117202 [Branchiostoma floridae]
          Length = 1022

 Score =  317 bits (812), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 142/200 (71%), Positives = 169/200 (84%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           M+  GQV T +NG+V+QNGLAT+Q N K ATGEE P ++++T+R QYGLPED++VYCNFN
Sbjct: 771 MVTQGQVSTCMNGVVVQNGLATSQVNNKAATGEEPPANVIVTTRVQYGLPEDSVVYCNFN 830

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP+TLQMWVN+L  VPNS+LWLL+FPAVGE N+   A  +GL   RI+FS+VA K
Sbjct: 831 QLYKIDPATLQMWVNILNRVPNSVLWLLRFPAVGEQNVLNAATQMGLSPGRIIFSHVAPK 890

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLD+PLCNGHTT MDVLW GTP++TLPGETLASRVAASQL  LGCPEL
Sbjct: 891 EEHVRRGQLADVCLDSPLCNGHTTGMDVLWAGTPMITLPGETLASRVAASQLGCLGCPEL 950

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           +A T  EY+DIA++LG D D
Sbjct: 951 VANTRAEYEDIAVQLGNDAD 970


>gi|432879797|ref|XP_004073552.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like
           [Oryzias latipes]
          Length = 1016

 Score =  316 bits (810), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 144/200 (72%), Positives = 168/200 (84%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++N   + NGLAT Q N K ATGEEVP++IV+T+R QYGLPED+IVYCNFN
Sbjct: 757 MINQGQIQVTINNFTVSNGLATTQINNKAATGEEVPRTIVVTTRSQYGLPEDSIVYCNFN 816

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQMW N+LK V NS+LWLL+FPAVGE NIQ  AQ LGL   RI+FS VA K
Sbjct: 817 QLYKIDPPTLQMWANILKRVSNSVLWLLRFPAVGEPNIQQYAQNLGLPASRIIFSPVAPK 876

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGCPEL
Sbjct: 877 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLQCLGCPEL 936

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           IA+  ++Y++IA++LG+D +
Sbjct: 937 IAQNRQDYEEIAVKLGSDME 956


>gi|66347873|ref|NP_001018115.1| O-linked N-acetylglucosamine transferase isoform 2 [Danio rerio]
 gi|56967374|gb|AAW31871.1| O-GlcNAc transferase variant 2 [Danio rerio]
          Length = 1052

 Score =  315 bits (808), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 147/216 (68%), Positives = 171/216 (79%), Gaps = 16/216 (7%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQT----------------NTKTATGEEVPQSIVITSR 44
           MI  GQ+Q ++NG  + NGLAT Q                 N K ATGEEVP++IV+T+R
Sbjct: 777 MINQGQIQVTINGFTVSNGLATTQIFTVEEVVVSGTVALQINNKAATGEEVPRTIVVTTR 836

Query: 45  QQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA 104
            QYGLPED+IVYCNFNQLYKIDP TLQMW N+LK VPNS++WLL+FPAVGE NIQ  AQ 
Sbjct: 837 SQYGLPEDSIVYCNFNQLYKIDPPTLQMWANILKRVPNSVIWLLRFPAVGEPNIQQYAQN 896

Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
           LGL   RI+FS VA KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLA
Sbjct: 897 LGLPASRIIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLA 956

Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           SRVAASQL  LGCPELIA++ +EY+D+A++LGTD +
Sbjct: 957 SRVAASQLTCLGCPELIAQSRQEYEDVAVKLGTDME 992


>gi|62821820|ref|NP_001017359.1| O-linked N-acetylglucosamine transferase isoform 1 [Danio rerio]
 gi|56967372|gb|AAW31870.1| O-GlcNAc transferase variant 1 [Danio rerio]
          Length = 1062

 Score =  315 bits (807), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 147/216 (68%), Positives = 171/216 (79%), Gaps = 16/216 (7%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQT----------------NTKTATGEEVPQSIVITSR 44
            MI  GQ+Q ++NG  + NGLAT Q                 N K ATGEEVP++IV+T+R
Sbjct: 787  MINQGQIQVTINGFTVSNGLATTQIFTVEEVVVSGTVALQINNKAATGEEVPRTIVVTTR 846

Query: 45   QQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA 104
             QYGLPED+IVYCNFNQLYKIDP TLQMW N+LK VPNS++WLL+FPAVGE NIQ  AQ 
Sbjct: 847  SQYGLPEDSIVYCNFNQLYKIDPPTLQMWANILKRVPNSVIWLLRFPAVGEPNIQQYAQN 906

Query: 105  LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
            LGL   RI+FS VA KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLA
Sbjct: 907  LGLPASRIIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLA 966

Query: 165  SRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            SRVAASQL  LGCPELIA++ +EY+D+A++LGTD +
Sbjct: 967  SRVAASQLTCLGCPELIAQSRQEYEDVAVKLGTDME 1002


>gi|327287036|ref|XP_003228235.1| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit-like
            isoform 1 [Anolis carolinensis]
          Length = 1066

 Score =  314 bits (804), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 148/218 (67%), Positives = 171/218 (78%), Gaps = 20/218 (9%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDA+VYCNFN
Sbjct: 788  MINRGQIQITINGFNISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAVVYCNFN 847

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
            QLYKIDP+TLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q RI+FS VA K
Sbjct: 848  QLYKIDPATLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQSRIIFSPVAPK 907

Query: 121  EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT--------------------LPG 160
            EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT                    LPG
Sbjct: 908  EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGEALRGSVYLFANDPFLPLPG 967

Query: 161  ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ETLASRVAASQL  LGC ELIA++ +EY+DIA++LGTD
Sbjct: 968  ETLASRVAASQLTCLGCLELIAKSRQEYEDIAVKLGTD 1005


>gi|410914070|ref|XP_003970511.1| PREDICTED: LOW QUALITY PROTEIN: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like
           [Takifugu rubripes]
          Length = 1036

 Score =  314 bits (804), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 142/200 (71%), Positives = 168/200 (84%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q + NG  + NGLAT Q + K ATGEEV ++IV+T+R QYGLPED+IVYCNFN
Sbjct: 779 MINQGQIQVTXNGFTVSNGLATTQISNKAATGEEVLRTIVVTTRSQYGLPEDSIVYCNFN 838

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQMW N+LK VPNS+LWLL+FPAVGE NIQ  AQ +GL   R++FS VA K
Sbjct: 839 QLYKIDPPTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPASRVIFSPVAPK 898

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLASRVA SQL  LGCPEL
Sbjct: 899 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVATSQLNCLGCPEL 958

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           +A++ +EY+DIA++LG+D +
Sbjct: 959 VAQSRQEYEDIAVKLGSDME 978


>gi|348516828|ref|XP_003445939.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit isoform
           4 [Oreochromis niloticus]
          Length = 1054

 Score =  310 bits (794), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 144/216 (66%), Positives = 171/216 (79%), Gaps = 16/216 (7%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTN----------------TKTATGEEVPQSIVITSR 44
           MI  GQ+Q ++NG  + NGLAT Q                   K ATGEEVP++IV+T+R
Sbjct: 779 MINQGQIQVTINGFTVSNGLATTQVRVTFRRTLLASSASCKLNKAATGEEVPRTIVVTTR 838

Query: 45  QQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA 104
            QYGLPED+IVYCNFNQLYKIDP TLQMWVN+LK VPNS+LWLL+FPAVGE NIQ  AQ 
Sbjct: 839 SQYGLPEDSIVYCNFNQLYKIDPPTLQMWVNILKRVPNSVLWLLRFPAVGEPNIQQYAQN 898

Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
           +GL   RI+FS VA KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLA
Sbjct: 899 MGLPGSRIIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLA 958

Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           SRVAASQL  LGCP+LIA++ ++Y+D+A++LG+D +
Sbjct: 959 SRVAASQLNCLGCPDLIAQSRQDYEDVAVKLGSDME 994


>gi|348516826|ref|XP_003445938.1| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit isoform 3
            [Oreochromis niloticus]
          Length = 1064

 Score =  310 bits (794), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 144/216 (66%), Positives = 171/216 (79%), Gaps = 16/216 (7%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTN----------------TKTATGEEVPQSIVITSR 44
            MI  GQ+Q ++NG  + NGLAT Q                   K ATGEEVP++IV+T+R
Sbjct: 789  MINQGQIQVTINGFTVSNGLATTQVRVTFRRTLLASSASCKLNKAATGEEVPRTIVVTTR 848

Query: 45   QQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA 104
             QYGLPED+IVYCNFNQLYKIDP TLQMWVN+LK VPNS+LWLL+FPAVGE NIQ  AQ 
Sbjct: 849  SQYGLPEDSIVYCNFNQLYKIDPPTLQMWVNILKRVPNSVLWLLRFPAVGEPNIQQYAQN 908

Query: 105  LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
            +GL   RI+FS VA KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLA
Sbjct: 909  MGLPGSRIIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLA 968

Query: 165  SRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            SRVAASQL  LGCP+LIA++ ++Y+D+A++LG+D +
Sbjct: 969  SRVAASQLNCLGCPDLIAQSRQDYEDVAVKLGSDME 1004


>gi|443704233|gb|ELU01378.1| hypothetical protein CAPTEDRAFT_181048 [Capitella teleta]
          Length = 1019

 Score =  308 bits (790), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 140/200 (70%), Positives = 169/200 (84%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  G+   ++  I +QNGL TNQTN K+ATGEE+P SI++++RQQYGL  DA+VYCNFN
Sbjct: 766 MIQEGKPTCTLEDIGIQNGLTTNQTNNKSATGEEIPPSIILSARQQYGLSNDAVVYCNFN 825

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP+TL++WVN+LKAVPNS++WLL+FPAVGE N+   A+ LGL   +ILFS VA K
Sbjct: 826 QLYKIDPATLEVWVNILKAVPNSVMWLLRFPAVGETNVVDFAKRLGLTADKILFSPVAPK 885

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+P ETLASRVAASQL  LGCPEL
Sbjct: 886 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPNETLASRVAASQLHCLGCPEL 945

Query: 181 IARTHKEYQDIAIRLGTDRD 200
           +A+  +EYQ IAI+LGTD++
Sbjct: 946 VAKNREEYQRIAIKLGTDKE 965


>gi|391328604|ref|XP_003738777.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit
           [Metaseiulus occidentalis]
          Length = 1034

 Score =  304 bits (779), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 141/199 (70%), Positives = 168/199 (84%), Gaps = 1/199 (0%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           ++A  +    V+G+VLQ+G A    N + ATGEE P SI++T+RQQY LPEDAIVYCNFN
Sbjct: 786 VLAGTEEAADVDGVVLQSGAAAT-ANPREATGEEAPSSILVTTRQQYELPEDAIVYCNFN 844

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TLQMW+N+LKAVPNS+LWLL+FPAVGE  I A   A GL+++R++FSNVAAK
Sbjct: 845 QLYKIDPDTLQMWINILKAVPNSVLWLLRFPAVGEQQILAQVIAAGLEKNRVIFSNVAAK 904

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VTLP E+LASRVA+SQL  LGCPEL
Sbjct: 905 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTLPLESLASRVASSQLTCLGCPEL 964

Query: 181 IARTHKEYQDIAIRLGTDR 199
           IA+  K+Y++IAIRLGTD+
Sbjct: 965 IAKDRKDYENIAIRLGTDQ 983


>gi|390344140|ref|XP_784504.3| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like
           [Strongylocentrotus purpuratus]
          Length = 1041

 Score =  301 bits (772), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 141/203 (69%), Positives = 169/203 (83%), Gaps = 5/203 (2%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNT-----KTATGEEVPQSIVITSRQQYGLPEDAIV 55
           +I SGQVQ ++NGI + NGLA  Q        K+ATGEEVP   +IT+R  YGLP+DAIV
Sbjct: 771 LIQSGQVQGTINGITVCNGLALTQVGDYKSIPKSATGEEVPSYPLITTRNMYGLPDDAIV 830

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFS 115
           +CNFNQLYKIDP+TL+MWVN+++ VPNS+LWLL+FPA GE ++ ATA  LG+ +  ++FS
Sbjct: 831 FCNFNQLYKIDPATLRMWVNIIQKVPNSVLWLLRFPAAGEPHLLATATQLGMPKGSLVFS 890

Query: 116 NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATL 175
            VA+KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP++TLPGETLASRVAASQL TL
Sbjct: 891 AVASKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMITLPGETLASRVAASQLQTL 950

Query: 176 GCPELIARTHKEYQDIAIRLGTD 198
           GCPELIA + +EY+DIAIRLGTD
Sbjct: 951 GCPELIASSKQEYEDIAIRLGTD 973


>gi|324502359|gb|ADY41039.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
            [Ascaris suum]
          Length = 1100

 Score =  301 bits (771), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 139/201 (69%), Positives = 165/201 (82%), Gaps = 1/201 (0%)

Query: 1    MIASGQVQTSVNGIVLQNGLAT-NQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
            MI SG +  +V G+ +QNGL T NQT+ K ATGEEVP +I++TSRQQYGLPEDAIV+CNF
Sbjct: 836  MIGSGMIAGNVEGVPVQNGLTTLNQTHVKAATGEEVPHTILVTSRQQYGLPEDAIVFCNF 895

Query: 60   NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
            NQLYKIDPSTL MW  +LK VPNS+LWLL+FP  GEAN+       G+D  R++FSNVAA
Sbjct: 896  NQLYKIDPSTLTMWCEILKLVPNSVLWLLRFPFHGEANVIRFCAERGIDARRVVFSNVAA 955

Query: 120  KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
            KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP++T+P ETLASRVA+SQL  LG PE
Sbjct: 956  KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMITMPLETLASRVASSQLVALGVPE 1015

Query: 180  LIARTHKEYQDIAIRLGTDRD 200
            L+A+  ++Y  IA RLGTDR+
Sbjct: 1016 LVAKDREDYIRIAARLGTDRE 1036


>gi|339242851|ref|XP_003377351.1| putative tetratricopeptide repeat-containing domain protein
           [Trichinella spiralis]
 gi|316973855|gb|EFV57404.1| putative tetratricopeptide repeat-containing domain protein
           [Trichinella spiralis]
          Length = 1062

 Score =  300 bits (769), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 138/199 (69%), Positives = 166/199 (83%), Gaps = 1/199 (0%)

Query: 1   MIASGQVQTSVNGIVLQNGL-ATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
           M  +GQVQTSV G+ +QNG+ A  QT+ K ATGEEVPQSI+ITSR QYGLPEDAIVYCNF
Sbjct: 734 MFYTGQVQTSVEGVPVQNGVTALTQTHIKAATGEEVPQSILITSRHQYGLPEDAIVYCNF 793

Query: 60  NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
           NQ+YKIDP TL+MW  +L+ VPNS+LWLL+FP+  E N+    ++ G++  RI F+NVAA
Sbjct: 794 NQMYKIDPMTLKMWCEILQRVPNSVLWLLRFPSHAEPNVLKFCESQGVNTKRIKFNNVAA 853

Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
           KEEHVRRGQLAD+CLDTPLCNGHTT MD+LW GTP+VTLP ET ASRVA+SQL  LGCPE
Sbjct: 854 KEEHVRRGQLADLCLDTPLCNGHTTGMDMLWAGTPMVTLPLETFASRVASSQLTALGCPE 913

Query: 180 LIARTHKEYQDIAIRLGTD 198
           LIA+T ++Y +IA+RLGTD
Sbjct: 914 LIAKTREDYVNIAVRLGTD 932


>gi|405973204|gb|EKC37930.1| hypothetical protein CGI_10009704 [Crassostrea gigas]
          Length = 269

 Score =  300 bits (767), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 137/199 (68%), Positives = 164/199 (82%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           M  +G   + +N +V+QNG+ TNQTN K ATGEE+PQ+I+I++R QYGLPEDA++YCNFN
Sbjct: 12  MAQTGLSHSCINDVVIQNGVTTNQTNNKAATGEEIPQNIIISARSQYGLPEDAVIYCNFN 71

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAK 120
           QLYKIDP TL MW+ +LK VP+ +LWLL+FPAVGE N+  TA   GL   RI+FS VA K
Sbjct: 72  QLYKIDPQTLHMWIEILKQVPDGVLWLLRFPAVGETNVLQTAANAGLSPGRIIFSPVAPK 131

Query: 121 EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPEL 180
           EEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VTL GETLASRVA+SQL TLGCPEL
Sbjct: 132 EEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTLLGETLASRVASSQLNTLGCPEL 191

Query: 181 IARTHKEYQDIAIRLGTDR 199
           IA+T ++Y  IA +LGTD+
Sbjct: 192 IAKTSEDYIRIASKLGTDK 210


>gi|393909951|gb|EFO25409.2| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase [Loa
            loa]
          Length = 1094

 Score =  296 bits (759), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 137/201 (68%), Positives = 164/201 (81%), Gaps = 1/201 (0%)

Query: 1    MIASGQVQTSVNGIVLQNGLAT-NQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
            MI SG +  +V G+ +QNGL T NQT+ K ATGEEVP +I++TSRQQYGLP+DAIV+CNF
Sbjct: 832  MIGSGMIAGNVEGVPVQNGLTTLNQTHVKAATGEEVPHTILVTSRQQYGLPDDAIVFCNF 891

Query: 60   NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
            NQLYKIDP TL MW ++LK+VPNSILWLL+FP  GE N+        +D  RI+FSNVAA
Sbjct: 892  NQLYKIDPPTLSMWCDILKSVPNSILWLLRFPYHGEPNVMRFCAEHNIDTRRIVFSNVAA 951

Query: 120  KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
            KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP++T+P ETLASRVA+SQL  LG PE
Sbjct: 952  KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMITMPLETLASRVASSQLYALGVPE 1011

Query: 180  LIARTHKEYQDIAIRLGTDRD 200
            L+A+  ++Y  IA RLGTDR+
Sbjct: 1012 LVAKDREDYIKIAKRLGTDRE 1032


>gi|170571769|ref|XP_001891856.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
            [Brugia malayi]
 gi|158603386|gb|EDP39326.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
            putative [Brugia malayi]
          Length = 1136

 Score =  296 bits (758), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 137/201 (68%), Positives = 163/201 (81%), Gaps = 1/201 (0%)

Query: 1    MIASGQVQTSVNGIVLQNGLAT-NQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
            MI SG +  +V G+ +QNGL T NQT+ K ATGEEVP +I++TSRQQYGLP+DAIV+CNF
Sbjct: 874  MIGSGMIAGNVEGVPVQNGLTTLNQTHVKAATGEEVPHTILVTSRQQYGLPDDAIVFCNF 933

Query: 60   NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
            NQLYKIDP TL MW ++LK VPNSILWLL+FP  GE N+        +D  RI+FSNVAA
Sbjct: 934  NQLYKIDPPTLSMWCDILKLVPNSILWLLRFPYHGEPNVMRFCAERNIDTRRIVFSNVAA 993

Query: 120  KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
            KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP++T+P ETLASRVA+SQL  LG PE
Sbjct: 994  KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMITMPLETLASRVASSQLYALGVPE 1053

Query: 180  LIARTHKEYQDIAIRLGTDRD 200
            L+A+  ++Y  IA RLGTDR+
Sbjct: 1054 LVAKDREDYIKIAKRLGTDRE 1074


>gi|402582718|gb|EJW76663.1| hypothetical protein WUBG_12430 [Wuchereria bancrofti]
          Length = 284

 Score =  294 bits (753), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 137/201 (68%), Positives = 163/201 (81%), Gaps = 1/201 (0%)

Query: 1   MIASGQVQTSVNGIVLQNGLAT-NQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
           MI SG +  +V G+ +QNGL T NQT+ K ATGEEVP +I++TSRQQYGLP+DAIV+CNF
Sbjct: 22  MIGSGMIAGNVEGVPVQNGLTTLNQTHVKAATGEEVPHTILVTSRQQYGLPDDAIVFCNF 81

Query: 60  NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
           NQLYKIDP TL MW ++LK VPNSILWLL+FP  GE N+        +D  RI+FSNVAA
Sbjct: 82  NQLYKIDPPTLSMWCDILKLVPNSILWLLRFPYHGEPNVMRFCAERNIDTRRIVFSNVAA 141

Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
           KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP++T+P ETLASRVA+SQL  LG PE
Sbjct: 142 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMITMPLETLASRVASSQLYALGVPE 201

Query: 180 LIARTHKEYQDIAIRLGTDRD 200
           L+A+  ++Y  IA RLGTDR+
Sbjct: 202 LVAKDREDYIKIAKRLGTDRE 222


>gi|358341529|dbj|GAA37010.2| polypeptide N-acetylglucosaminyltransferase [Clonorchis sinensis]
          Length = 1056

 Score =  291 bits (746), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 133/202 (65%), Positives = 161/202 (79%), Gaps = 2/202 (0%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI +     ++NG+ + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFN
Sbjct: 785 MIRTNSPSCAINGVTIHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFN 844

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD--QHRILFSNVA 118
           QLYKIDPST++MWV +LK VPNS+LWLL+FPA GEA   + A  +GL   Q RI+FSNVA
Sbjct: 845 QLYKIDPSTMRMWVEILKNVPNSVLWLLRFPAAGEAGALSAATEMGLQQVQRRIIFSNVA 904

Query: 119 AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCP 178
            KEEHVRRGQ+ADVCLDTPLCNGHTT MDVLW G PVVTLP ETLASRVAASQL  LGCP
Sbjct: 905 PKEEHVRRGQVADVCLDTPLCNGHTTGMDVLWAGCPVVTLPLETLASRVAASQLHALGCP 964

Query: 179 ELIARTHKEYQDIAIRLGTDRD 200
           EL+A T ++Y  IA +LG +R+
Sbjct: 965 ELVANTQEDYVRIASKLGNNRE 986


>gi|2266992|gb|AAB63465.1| O-linked GlcNAc transferase [Caenorhabditis elegans]
          Length = 1151

 Score =  285 bits (729), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 133/201 (66%), Positives = 161/201 (80%), Gaps = 1/201 (0%)

Query: 1    MIASGQVQTSV-NGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
            MI +GQ+  +V   + +QNGL  +Q + K ATGEE+P S+++TSR QY LP+DAIV+CNF
Sbjct: 893  MIMTGQMTMNVMEDMNVQNGLGQSQMHHKAATGEEIPNSVLLTSRAQYQLPDDAIVFCNF 952

Query: 60   NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
            NQLYKIDPSTL MW+ +L+ VP SILWLL+FP  GE +I+      GLD  RI+FSNVAA
Sbjct: 953  NQLYKIDPSTLDMWIKILENVPKSILWLLRFPYQGEEHIRKYCVERGLDPSRIVFSNVAA 1012

Query: 120  KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
            KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP+VT+P E+LASRVA SQL  LG PE
Sbjct: 1013 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMVTMPLESLASRVATSQLYALGVPE 1072

Query: 180  LIARTHKEYQDIAIRLGTDRD 200
            L+A+T +EY  IA+RLGTD D
Sbjct: 1073 LVAKTRQEYVSIAVRLGTDAD 1093


>gi|115532690|ref|NP_001040860.1| Protein OGT-1, isoform a [Caenorhabditis elegans]
 gi|33112401|sp|O18158.2|OGT1_CAEEL RecName: Full=UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase; AltName: Full=O-GlcNAc;
            AltName: Full=OGT
 gi|351065633|emb|CCD61614.1| Protein OGT-1, isoform a [Caenorhabditis elegans]
          Length = 1151

 Score =  285 bits (729), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 133/201 (66%), Positives = 161/201 (80%), Gaps = 1/201 (0%)

Query: 1    MIASGQVQTSV-NGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
            MI +GQ+  +V   + +QNGL  +Q + K ATGEE+P S+++TSR QY LP+DAIV+CNF
Sbjct: 893  MIMTGQMTMNVMEDMNVQNGLGQSQMHHKAATGEEIPNSVLLTSRAQYQLPDDAIVFCNF 952

Query: 60   NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
            NQLYKIDPSTL MW+ +L+ VP SILWLL+FP  GE +I+      GLD  RI+FSNVAA
Sbjct: 953  NQLYKIDPSTLDMWIKILENVPKSILWLLRFPYQGEEHIRKYCVERGLDPSRIVFSNVAA 1012

Query: 120  KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
            KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP+VT+P E+LASRVA SQL  LG PE
Sbjct: 1013 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMVTMPLESLASRVATSQLYALGVPE 1072

Query: 180  LIARTHKEYQDIAIRLGTDRD 200
            L+A+T +EY  IA+RLGTD D
Sbjct: 1073 LVAKTRQEYVSIAVRLGTDAD 1093


>gi|115532692|ref|NP_001040861.1| Protein OGT-1, isoform b [Caenorhabditis elegans]
 gi|351065634|emb|CCD61615.1| Protein OGT-1, isoform b [Caenorhabditis elegans]
          Length = 973

 Score =  285 bits (729), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 133/201 (66%), Positives = 161/201 (80%), Gaps = 1/201 (0%)

Query: 1   MIASGQVQTSV-NGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
           MI +GQ+  +V   + +QNGL  +Q + K ATGEE+P S+++TSR QY LP+DAIV+CNF
Sbjct: 715 MIMTGQMTMNVMEDMNVQNGLGQSQMHHKAATGEEIPNSVLLTSRAQYQLPDDAIVFCNF 774

Query: 60  NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
           NQLYKIDPSTL MW+ +L+ VP SILWLL+FP  GE +I+      GLD  RI+FSNVAA
Sbjct: 775 NQLYKIDPSTLDMWIKILENVPKSILWLLRFPYQGEEHIRKYCVERGLDPSRIVFSNVAA 834

Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
           KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP+VT+P E+LASRVA SQL  LG PE
Sbjct: 835 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMVTMPLESLASRVATSQLYALGVPE 894

Query: 180 LIARTHKEYQDIAIRLGTDRD 200
           L+A+T +EY  IA+RLGTD D
Sbjct: 895 LVAKTRQEYVSIAVRLGTDAD 915


>gi|198434517|ref|XP_002131769.1| PREDICTED: similar to O-linked N-acetylglucosamine transferase
           [Ciona intestinalis]
          Length = 1042

 Score =  285 bits (728), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 131/185 (70%), Positives = 155/185 (83%), Gaps = 1/185 (0%)

Query: 17  QNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNV 76
           QNGLA  Q + K ATGE+ P  I+++SR QY LP +AIV+CNFNQLYKIDPSTL MW N+
Sbjct: 808 QNGLALVQIDAKAATGEKPPSGIIVSSRSQYRLPNNAIVFCNFNQLYKIDPSTLIMWCNI 867

Query: 77  LKAVPNSILWLLKFPAVGEANIQATA-QALGLDQHRILFSNVAAKEEHVRRGQLADVCLD 135
           LK VPNS+LWLL+FPAVGEAN++  A Q  G++ +RI+FS VA KEEHVRRGQLAD+CLD
Sbjct: 868 LKRVPNSVLWLLRFPAVGEANVKKFARQTCGINANRIIFSPVAPKEEHVRRGQLADICLD 927

Query: 136 TPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
           TPLCNGHTT+MDVLW G P+VTLP ETLASRVA+SQLA LGCPELIA   ++Y+DIA+RL
Sbjct: 928 TPLCNGHTTAMDVLWAGCPMVTLPKETLASRVASSQLACLGCPELIAENSQQYEDIAVRL 987

Query: 196 GTDRD 200
           GTD D
Sbjct: 988 GTDMD 992


>gi|256079484|ref|XP_002576017.1| o-linked n-acetylglucosamine transferase ogt [Schistosoma mansoni]
          Length = 1063

 Score =  283 bits (725), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 134/202 (66%), Positives = 162/202 (80%), Gaps = 2/202 (0%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MI +     ++NG+ + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFN
Sbjct: 803  MIRTNSPSCTINGVTIHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFN 862

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVA 118
            QLYK+DPST++MWV +LK VPNS+LWLL+FPA GEA   A A  +GL Q   RILFSNVA
Sbjct: 863  QLYKVDPSTMRMWVEILKGVPNSVLWLLRFPAAGEAGALAAASEMGLQQVNRRILFSNVA 922

Query: 119  AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCP 178
             KEEHVRRGQ+ADVCLDTPLCNGHTT MDVLW G PVVTLP ETLASRVAASQL TLGCP
Sbjct: 923  PKEEHVRRGQVADVCLDTPLCNGHTTGMDVLWAGCPVVTLPLETLASRVAASQLHTLGCP 982

Query: 179  ELIARTHKEYQDIAIRLGTDRD 200
            EL+A + ++Y  IA +LG +R+
Sbjct: 983  ELVANSQEDYVRIATKLGNNRE 1004


>gi|353231229|emb|CCD77647.1| putative o-linked n-acetylglucosamine transferase, ogt [Schistosoma
            mansoni]
          Length = 1063

 Score =  283 bits (725), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 134/202 (66%), Positives = 162/202 (80%), Gaps = 2/202 (0%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MI +     ++NG+ + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFN
Sbjct: 803  MIRTNSPSCTINGVTIHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFN 862

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVA 118
            QLYK+DPST++MWV +LK VPNS+LWLL+FPA GEA   A A  +GL Q   RILFSNVA
Sbjct: 863  QLYKVDPSTMRMWVEILKGVPNSVLWLLRFPAAGEAGALAAASEMGLQQVNRRILFSNVA 922

Query: 119  AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCP 178
             KEEHVRRGQ+ADVCLDTPLCNGHTT MDVLW G PVVTLP ETLASRVAASQL TLGCP
Sbjct: 923  PKEEHVRRGQVADVCLDTPLCNGHTTGMDVLWAGCPVVTLPLETLASRVAASQLHTLGCP 982

Query: 179  ELIARTHKEYQDIAIRLGTDRD 200
            EL+A + ++Y  IA +LG +R+
Sbjct: 983  ELVANSQEDYVRIATKLGNNRE 1004


>gi|226486966|emb|CAX75348.1| O-glycosyltransferase [Schistosoma japonicum]
          Length = 1063

 Score =  283 bits (725), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 133/202 (65%), Positives = 162/202 (80%), Gaps = 2/202 (0%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            MI +     ++NG+ + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFN
Sbjct: 803  MIRTNSPSCTINGVTIHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFN 862

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVA 118
            QLYK+DPST++MWV +LK VPNS+LWLL+FPA GEA   A A  +GL Q   RILFSNVA
Sbjct: 863  QLYKVDPSTMRMWVEILKGVPNSVLWLLRFPAAGEAGALAAASEMGLQQVNRRILFSNVA 922

Query: 119  AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCP 178
             KEEHVRRGQ+AD+CLDTPLCNGHTT MDVLW G PVVTLP ETLASRVAASQL TLGCP
Sbjct: 923  PKEEHVRRGQVADICLDTPLCNGHTTGMDVLWAGCPVVTLPLETLASRVAASQLHTLGCP 982

Query: 179  ELIARTHKEYQDIAIRLGTDRD 200
            EL+A + ++Y  IA +LG +R+
Sbjct: 983  ELVANSQEDYVRIATKLGNNRE 1004


>gi|226486968|emb|CAX75349.1| O-glycosyltransferase [Schistosoma japonicum]
          Length = 1045

 Score =  283 bits (724), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 133/202 (65%), Positives = 162/202 (80%), Gaps = 2/202 (0%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI +     ++NG+ + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFN
Sbjct: 785 MIRTNSPSCTINGVTIHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFN 844

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVA 118
           QLYK+DPST++MWV +LK VPNS+LWLL+FPA GEA   A A  +GL Q   RILFSNVA
Sbjct: 845 QLYKVDPSTMRMWVEILKGVPNSVLWLLRFPAAGEAGALAAASEMGLQQVNRRILFSNVA 904

Query: 119 AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCP 178
            KEEHVRRGQ+AD+CLDTPLCNGHTT MDVLW G PVVTLP ETLASRVAASQL TLGCP
Sbjct: 905 PKEEHVRRGQVADICLDTPLCNGHTTGMDVLWAGCPVVTLPLETLASRVAASQLHTLGCP 964

Query: 179 ELIARTHKEYQDIAIRLGTDRD 200
           EL+A + ++Y  IA +LG +R+
Sbjct: 965 ELVANSQEDYVRIATKLGNNRE 986


>gi|226486964|emb|CAX75347.1| O-glycosyltransferase [Schistosoma japonicum]
          Length = 1045

 Score =  283 bits (724), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 133/202 (65%), Positives = 162/202 (80%), Gaps = 2/202 (0%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI +     ++NG+ + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFN
Sbjct: 785 MIRTNSPSCTINGVTIHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFN 844

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVA 118
           QLYK+DPST++MWV +LK VPNS+LWLL+FPA GEA   A A  +GL Q   RILFSNVA
Sbjct: 845 QLYKVDPSTMRMWVEILKGVPNSVLWLLRFPAAGEAGALAAASEMGLQQVNRRILFSNVA 904

Query: 119 AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCP 178
            KEEHVRRGQ+AD+CLDTPLCNGHTT MDVLW G PVVTLP ETLASRVAASQL TLGCP
Sbjct: 905 PKEEHVRRGQVADICLDTPLCNGHTTGMDVLWAGCPVVTLPLETLASRVAASQLHTLGCP 964

Query: 179 ELIARTHKEYQDIAIRLGTDRD 200
           EL+A + ++Y  IA +LG +R+
Sbjct: 965 ELVANSQEDYVRIATKLGNNRE 986


>gi|341897224|gb|EGT53159.1| CBN-OGT-1 protein [Caenorhabditis brenneri]
          Length = 1171

 Score =  278 bits (711), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 129/201 (64%), Positives = 160/201 (79%), Gaps = 1/201 (0%)

Query: 1    MIASGQVQTSV-NGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
            MI +GQ+  +V   + +QNGL  +Q + K ATGEE+P S+++TSR QY LP+DAIV+CNF
Sbjct: 910  MIMTGQMTMNVMEDMNVQNGLGQSQMHHKAATGEEIPNSVLLTSRAQYQLPDDAIVFCNF 969

Query: 60   NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
            NQLYKIDP+TL MW+ +L+ VP SILWLL+FP  GE +I+      G++Q RI+FSNVAA
Sbjct: 970  NQLYKIDPATLDMWIKILENVPKSILWLLRFPYQGEEHIRKYCVERGIEQSRIVFSNVAA 1029

Query: 120  KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
            KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP+VT+P ++LASRVA SQL  LG PE
Sbjct: 1030 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMVTMPLDSLASRVATSQLYALGVPE 1089

Query: 180  LIARTHKEYQDIAIRLGTDRD 200
            L+A+T +EY  IA RLG D D
Sbjct: 1090 LVAKTRQEYIRIATRLGNDAD 1110


>gi|357607539|gb|EHJ65577.1| hypothetical protein KGM_15173 [Danaus plexippus]
          Length = 873

 Score =  278 bits (710), Expect = 1e-72,   Method: Composition-based stats.
 Identities = 123/199 (61%), Positives = 156/199 (78%)

Query: 2   IASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQ 61
           I+  Q Q  VN I + NGL+   +  K A+GEE   +I++T+R+QY LPEDA+V+CNFNQ
Sbjct: 623 ISCKQEQVCVNNIFIDNGLSLRLSKKKIASGEERYDNIILTTRRQYNLPEDAVVFCNFNQ 682

Query: 62  LYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKE 121
           LYK DP  L+MW+N+L  VPNS+LWLL FPA GE+N++  AQ  GL   RI+FS +A KE
Sbjct: 683 LYKTDPKALEMWINILNNVPNSVLWLLAFPAAGESNLRHFAQIRGLSPDRIIFSKIAPKE 742

Query: 122 EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELI 181
           EHVRRGQ++DVCLDTPLCNGHTT+MD+LWTGTPVVTLPG+TLASRVA+SQL  L C ELI
Sbjct: 743 EHVRRGQISDVCLDTPLCNGHTTTMDILWTGTPVVTLPGKTLASRVASSQLTALKCTELI 802

Query: 182 ARTHKEYQDIAIRLGTDRD 200
           A++ K Y++IA +LG D +
Sbjct: 803 AKSEKNYEEIATKLGMDAE 821


>gi|268573864|ref|XP_002641909.1| C. briggsae CBR-OGT-1 protein [Caenorhabditis briggsae]
          Length = 1148

 Score =  275 bits (702), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 127/201 (63%), Positives = 157/201 (78%), Gaps = 1/201 (0%)

Query: 1    MIASGQVQTSV-NGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
            MI +GQ+  +V   + +QNGL  +Q + K ATGEE+P S+++TSR QY LP+DAIV+CNF
Sbjct: 890  MIMTGQMTMNVMEDVNVQNGLGQSQMHHKAATGEEIPNSVLLTSRAQYQLPDDAIVFCNF 949

Query: 60   NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
            NQLYKIDP TL MW+ +L+ VP S+LWLL+FP  GE +I+      G++  RI+FSNVAA
Sbjct: 950  NQLYKIDPPTLDMWIKILENVPKSVLWLLRFPYQGEEHIRKYCVERGIEPSRIVFSNVAA 1009

Query: 120  KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
            KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP+VT+P E+LASRVA SQL  LG PE
Sbjct: 1010 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMVTMPLESLASRVATSQLYALGVPE 1069

Query: 180  LIARTHKEYQDIAIRLGTDRD 200
            L+A+T  EY  IA RLG D +
Sbjct: 1070 LVAKTRNEYVQIATRLGNDAE 1090


>gi|76154911|gb|AAX26306.2| SJCHGC00985 protein [Schistosoma japonicum]
          Length = 246

 Score =  272 bits (695), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 129/187 (68%), Positives = 154/187 (82%), Gaps = 2/187 (1%)

Query: 16  LQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVN 75
           + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFNQLYK+DPST++MWV 
Sbjct: 1   IHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFNQLYKVDPSTMRMWVE 60

Query: 76  VLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVAAKEEHVRRGQLADVC 133
           +LK VPNS+LWLL+FPA GEA   A A  +GL Q   RILFSNVA KEEHVRRGQ+AD+C
Sbjct: 61  ILKGVPNSVLWLLRFPAAGEAGALAAASEMGLQQVNRRILFSNVAPKEEHVRRGQVADIC 120

Query: 134 LDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAI 193
           LDTPLCNGHTT MDVLW G PVVTLP ETLASRVAASQL TLGCPEL+A + ++Y  IA 
Sbjct: 121 LDTPLCNGHTTGMDVLWAGCPVVTLPLETLASRVAASQLHTLGCPELVANSQEDYVRIAT 180

Query: 194 RLGTDRD 200
           +LG +R+
Sbjct: 181 KLGNNRE 187


>gi|308501583|ref|XP_003112976.1| CRE-OGT-1 protein [Caenorhabditis remanei]
 gi|308265277|gb|EFP09230.1| CRE-OGT-1 protein [Caenorhabditis remanei]
          Length = 1255

 Score =  271 bits (692), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 126/186 (67%), Positives = 148/186 (79%)

Query: 15   VLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWV 74
            V Q  L  +Q + K ATGEE+P S+++TSR QY LP+DAIV+CNFNQLYKIDPSTL MW+
Sbjct: 1012 VQQLNLFQSQMHHKAATGEEIPNSVLLTSRAQYQLPDDAIVFCNFNQLYKIDPSTLDMWI 1071

Query: 75   NVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCL 134
             +L+ VP SILWLL+FP  GE +I+      GL+  RI+FSNVAAKEEHVRRGQLADVCL
Sbjct: 1072 KILENVPKSILWLLRFPYQGEEHIRKYCVERGLEPSRIVFSNVAAKEEHVRRGQLADVCL 1131

Query: 135  DTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIR 194
            DTPLCNGHTT MD+LWTGTP+VT+P E+LASRVA SQL  LG PEL+A+T  EY DIA R
Sbjct: 1132 DTPLCNGHTTGMDILWTGTPMVTMPLESLASRVATSQLYALGVPELVAKTKAEYVDIATR 1191

Query: 195  LGTDRD 200
            LG D D
Sbjct: 1192 LGNDAD 1197


>gi|156401113|ref|XP_001639136.1| predicted protein [Nematostella vectensis]
 gi|156226262|gb|EDO47073.1| predicted protein [Nematostella vectensis]
          Length = 993

 Score =  265 bits (678), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 118/169 (69%), Positives = 141/169 (83%)

Query: 31  TGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF 90
           +GE +P ++ +T R QYGLPEDA VYCNFNQLYKIDP TL+ W N+L+ VPNS+LWLLKF
Sbjct: 770 SGERIPLTMPLTCRSQYGLPEDAFVYCNFNQLYKIDPKTLKTWANILRRVPNSVLWLLKF 829

Query: 91  PAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLW 150
           PAVGE+NI+A   ++GL Q R++FS VA KEEHVRRGQLADVCLDTPLCNGHTT MDVLW
Sbjct: 830 PAVGESNIKAEVSSMGLSQDRVIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTCMDVLW 889

Query: 151 TGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            G PVVTLP +TLASRVA+SQL  LGC ELIA +  +Y++IA+RLG D+
Sbjct: 890 AGCPVVTLPLDTLASRVASSQLNALGCSELIADSRGDYEEIAVRLGNDK 938


>gi|340376830|ref|XP_003386934.1| PREDICTED: UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110 kDa subunit-like
           [Amphimedon queenslandica]
          Length = 1029

 Score =  255 bits (651), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 122/194 (62%), Positives = 150/194 (77%), Gaps = 5/194 (2%)

Query: 6   QVQTSVNGIVLQNGLA-TNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYK 64
           ++Q S+N +  Q  +A T Q     A    +P  + +T+R QYGLPE A+VYCNFNQLYK
Sbjct: 789 KIQASLNQLSQQTAIASTPQPEHPLA----MPTDLPLTTRAQYGLPEGAVVYCNFNQLYK 844

Query: 65  IDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHV 124
           IDP+TL+ W N+LK VPNSI+WLL+FPAVGE N+ AT Q LG+   RI+FS VA KEEHV
Sbjct: 845 IDPATLESWANILKRVPNSIMWLLRFPAVGEPNVVATTQKLGIQPSRIIFSPVAPKEEHV 904

Query: 125 RRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIART 184
           RRG+LAD+CLDTPLCNGHTT MDVLW GTPV+TLP ETLASRVAASQL +LG PEL+A +
Sbjct: 905 RRGRLADMCLDTPLCNGHTTGMDVLWAGTPVLTLPLETLASRVAASQLISLGFPELVASS 964

Query: 185 HKEYQDIAIRLGTD 198
            +EY+DIA+  G +
Sbjct: 965 RQEYEDIAVTYGNN 978


>gi|196006385|ref|XP_002113059.1| hypothetical protein TRIADDRAFT_56833 [Trichoplax adhaerens]
 gi|190585100|gb|EDV25169.1| hypothetical protein TRIADDRAFT_56833 [Trichoplax adhaerens]
          Length = 996

 Score =  247 bits (631), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 117/207 (56%), Positives = 147/207 (71%), Gaps = 12/207 (5%)

Query: 4   SGQVQTSVNGI---------VLQNG---LATNQTNTKTATGEEVPQSIVITSRQQYGLPE 51
           S  + TS N I         V+ NG   +  N  +     G  VP  +V+T+R QYGLP 
Sbjct: 738 SSHINTSTNLISNGTLGSNAVIPNGGPIINGNGDSVPNGPGVNVPSGMVLTTRAQYGLPS 797

Query: 52  DAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHR 111
            AI+YCNFNQLYKIDP  L+ WV +LK VPNSILWLL+FPA GEA ++ T   +GL   R
Sbjct: 798 KAIIYCNFNQLYKIDPKILETWVRILKRVPNSILWLLRFPAAGEAMLKNTILKMGLPADR 857

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           ++F+NVA KEEHVRRGQLAD+CLDTP+CNGHTT MDVLW G P++T+P ETLASRVA+SQ
Sbjct: 858 VVFTNVAPKEEHVRRGQLADICLDTPMCNGHTTGMDVLWAGVPMITMPKETLASRVASSQ 917

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           L +LG  ELIA+   EY+D+A++ G+D
Sbjct: 918 LTSLGVSELIAKDLTEYEDVAVKYGSD 944


>gi|313240003|emb|CBY32363.1| unnamed protein product [Oikopleura dioica]
          Length = 1054

 Score =  246 bits (629), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 116/201 (57%), Positives = 147/201 (73%), Gaps = 1/201 (0%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           M+  GQ    +  I + NGL       + + GE   Q+ V TSR+QYGLP+DAIVYCNFN
Sbjct: 798 MVIKGQQCLQIGDIKIINGLCLPAFEAQASQGEAKIQNAVYTSRRQYGLPDDAIVYCNFN 857

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA-QALGLDQHRILFSNVAA 119
           QLYK+DP+T+  W N+LK VP++++WLL+FPA+GE ++     +   + + RI+FS VAA
Sbjct: 858 QLYKLDPNTMSAWCNILKKVPDAVIWLLRFPALGERHVHDWCWRHHNIPKERIIFSPVAA 917

Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
           KEEHVRRGQLADVCLDTPLCNGHTT MDVLW G P+VTLP E+ ASRVA+SQ+ TLG  E
Sbjct: 918 KEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGCPMVTLPLESFASRVASSQMKTLGLDE 977

Query: 180 LIARTHKEYQDIAIRLGTDRD 200
           LIA  ++ Y+ IAIRLG D D
Sbjct: 978 LIADDYESYESIAIRLGRDVD 998


>gi|313226628|emb|CBY21773.1| unnamed protein product [Oikopleura dioica]
          Length = 1054

 Score =  246 bits (627), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 115/201 (57%), Positives = 147/201 (73%), Gaps = 1/201 (0%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           M+  GQ    +  I + NGL       + + GE   Q+ V TSR+QYGLP+D+IVYCNFN
Sbjct: 798 MVIKGQQCLQIGDIKIINGLCLPAFEAQASQGEAKIQNAVYTSRRQYGLPDDSIVYCNFN 857

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA-QALGLDQHRILFSNVAA 119
           QLYK+DP+T+  W N+LK VP++++WLL+FPA+GE ++     +   + + RI+FS VAA
Sbjct: 858 QLYKLDPNTMSAWCNILKKVPDAVIWLLRFPALGERHVHDWCWRHHNIPKERIIFSPVAA 917

Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
           KEEHVRRGQLADVCLDTPLCNGHTT MDVLW G P+VTLP E+ ASRVA+SQ+ TLG  E
Sbjct: 918 KEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGCPMVTLPLESFASRVASSQMKTLGLDE 977

Query: 180 LIARTHKEYQDIAIRLGTDRD 200
           LIA  ++ Y+ IAIRLG D D
Sbjct: 978 LIADDYESYESIAIRLGRDVD 998


>gi|172087340|ref|XP_001913212.1| UDP-N-acetylglucosaminyltransferase [Oikopleura dioica]
 gi|42601339|gb|AAS21365.1| UDP-N-acetylglucosaminyltransferase [Oikopleura dioica]
          Length = 1070

 Score =  242 bits (617), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 116/217 (53%), Positives = 147/217 (67%), Gaps = 17/217 (7%)

Query: 1    MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
            M+  GQ    +  I + NGL       + + GE   Q+ V TSR+QYGLP+DAIVYCNFN
Sbjct: 798  MVIKGQQCLQIGDIKIINGLCLPAFEAQASQGEAKIQNAVYTSRRQYGLPDDAIVYCNFN 857

Query: 61   QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ-----------------ATAQ 103
            QLYK+DP+T+  W N+LK VP++++WLL+FPA+GE ++                   +  
Sbjct: 858  QLYKLDPNTMSAWCNILKKVPDAVIWLLRFPALGERHVHDWCWRLVSIIFTKPLTNYSCS 917

Query: 104  ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
               + + RI+FS VAAKEEHVRRGQLADVCLDTPLCNGHTT MDVLW G P+VTLP E+ 
Sbjct: 918  HHNIPKERIIFSPVAAKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGCPMVTLPLESF 977

Query: 164  ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            ASRVA+SQ+ TLG  ELIA  ++ Y+ IAIRLG D D
Sbjct: 978  ASRVASSQMKTLGLDELIADDYESYESIAIRLGRDVD 1014


>gi|312071553|ref|XP_003138661.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase [Loa
            loa]
          Length = 1205

 Score =  239 bits (609), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 119/201 (59%), Positives = 142/201 (70%), Gaps = 26/201 (12%)

Query: 1    MIASGQVQTSVNGIVLQNGLAT-NQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNF 59
            MI SG +  +V G+ +QNGL T NQT+ K ATGEEVP +I++TSRQQYGLP+DAIV    
Sbjct: 968  MIGSGMIAGNVEGVPVQNGLTTLNQTHVKAATGEEVPHTILVTSRQQYGLPDDAIV---- 1023

Query: 60   NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA 119
                                 PNSILWLL+FP  GE N+        +D  RI+FSNVAA
Sbjct: 1024 ---------------------PNSILWLLRFPYHGEPNVMRFCAEHNIDTRRIVFSNVAA 1062

Query: 120  KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
            KEEHVRRGQLADVCLDTPLCNGHTT MD+LWTGTP++T+P ETLASRVA+SQL  LG PE
Sbjct: 1063 KEEHVRRGQLADVCLDTPLCNGHTTGMDILWTGTPMITMPLETLASRVASSQLYALGVPE 1122

Query: 180  LIARTHKEYQDIAIRLGTDRD 200
            L+A+  ++Y  IA RLGTDR+
Sbjct: 1123 LVAKDREDYIKIAKRLGTDRE 1143


>gi|167527386|ref|XP_001748025.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773443|gb|EDQ87082.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1173

 Score =  223 bits (568), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 97/167 (58%), Positives = 129/167 (77%), Gaps = 2/167 (1%)

Query: 32   GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
            G+E  Q ++   R+ YGLPED +++CNFNQLYKIDP     W+N+LK VPNS+LW+L+FP
Sbjct: 880  GQEATQPML--HRRFYGLPEDKVIFCNFNQLYKIDPLMFDTWLNILKRVPNSVLWILRFP 937

Query: 92   AVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
              GE N+Q      GL   R++FS VA K EHVRRG LAD+CLDT +CNGHTT MD+LW 
Sbjct: 938  PAGEQNLQKRIIEAGLTLDRVIFSPVAGKIEHVRRGALADICLDTHVCNGHTTGMDILWA 997

Query: 152  GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            GTP+++LPG+TLASRVA+S L TLGCPELIAR+H++Y ++A++  ++
Sbjct: 998  GTPMISLPGDTLASRVASSLLKTLGCPELIARSHEDYVNLAVQYASN 1044


>gi|326432574|gb|EGD78144.1| O-linked N-acetylglucosamine transferase [Salpingoeca sp. ATCC 50818]
          Length = 1496

 Score =  214 bits (545), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 93/153 (60%), Positives = 120/153 (78%)

Query: 43   SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            +R  YGLP+D I+ CNFNQLYKIDP T  +W+++LKA  N++LW+L+FP  GE N++   
Sbjct: 1033 TRAYYGLPQDKIILCNFNQLYKIDPMTFSLWLDILKATTNTVLWILRFPPAGEQNLRMHV 1092

Query: 103  QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            QA GLDQ R++FS VA K EHVRRG LAD+CLDT  CNGHTT MD+LW GTP++TLPG+T
Sbjct: 1093 QAAGLDQSRVIFSPVAGKVEHVRRGALADLCLDTHACNGHTTGMDILWAGTPMLTLPGDT 1152

Query: 163  LASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
            LASRVA+S +  LGCPELIA + ++Y + A+ L
Sbjct: 1153 LASRVASSLVTALGCPELIASSRQDYFNKAVDL 1185


>gi|301094298|ref|XP_002896255.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase,
           putative [Phytophthora infestans T30-4]
 gi|262109650|gb|EEY67702.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase,
           putative [Phytophthora infestans T30-4]
          Length = 977

 Score =  199 bits (506), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 89/176 (50%), Positives = 126/176 (71%), Gaps = 1/176 (0%)

Query: 26  NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
           N    +  +V  +    +R  YG+PED  V+CNFNQ+YKIDP T   W+N+LK VPNS+L
Sbjct: 719 NDHKQSARDVLDTEKCPTRADYGVPEDKFVFCNFNQVYKIDPVTFTTWMNILKRVPNSVL 778

Query: 86  WLLKFPAVGEANIQATAQALGL-DQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTT 144
           WLL+FP + EANI+A A+A G+ DQ R++F++VA K+EH++RG LAD+ LDTP CN HTT
Sbjct: 779 WLLRFPPIAEANIRAEARARGVKDQTRLIFTDVAPKDEHLKRGYLADLFLDTPECNAHTT 838

Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             D+LW GTP+VT+P + +A+RVA+S L      ELI  + +EY+++A+ L +D D
Sbjct: 839 GCDILWGGTPMVTMPKDRMATRVASSLLRAANMSELITNSLEEYEELAVALASDMD 894


>gi|348687602|gb|EGZ27416.1| hypothetical protein PHYSODRAFT_553979 [Phytophthora sojae]
          Length = 986

 Score =  196 bits (499), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 88/176 (50%), Positives = 125/176 (71%), Gaps = 1/176 (0%)

Query: 26  NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
           N    +  +V  +    +R  YG+P+D  V+CNFNQ+YKIDP T   W+N+LK VPNS+L
Sbjct: 719 NDHKQSARDVLDTEKCPTRADYGVPDDKFVFCNFNQVYKIDPVTFTTWMNILKRVPNSVL 778

Query: 86  WLLKFPAVGEANIQATAQALGL-DQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTT 144
           WLL+FP + EANI+A A+A G+ DQ R++F++VA K EH++RG LAD+ LDTP CN HTT
Sbjct: 779 WLLRFPPIAEANIRAEARARGVKDQTRLIFTDVAPKNEHLKRGYLADLFLDTPECNAHTT 838

Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             D+LW GTP+VT+P + +A+RVA+S L      ELI  + +EY+++A+ L +D D
Sbjct: 839 GCDILWGGTPMVTMPKDRMATRVASSLLRAANMSELITNSLEEYEELAVALASDMD 894


>gi|325187990|emb|CCA22532.1| predicted protein putative [Albugo laibachii Nc14]
          Length = 968

 Score =  194 bits (494), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 88/173 (50%), Positives = 119/173 (68%)

Query: 26  NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
           N    +  EV        R+QYG+P+   V+CNFNQ+YKIDP T   W+NVLK VPNSIL
Sbjct: 721 NDHKQSAREVLDVSKCPKREQYGVPQGKFVFCNFNQVYKIDPDTFTTWMNVLKRVPNSIL 780

Query: 86  WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
           WLL+FP V EANI+A A+  G+   R+ F++VA KEEH++RG LAD+ LDTP CN HTT 
Sbjct: 781 WLLRFPPVAEANIRAQARERGVPDDRLQFTDVAKKEEHLKRGHLADLFLDTPECNAHTTG 840

Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            D+LW GTP++TL  + +A+RVA+S L      E+I  + +EY+++A+ L TD
Sbjct: 841 CDILWGGTPMITLAKDRMATRVASSLLRAANLDEMITNSLEEYEELAVTLATD 893


>gi|298710892|emb|CBJ26401.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           family GT41 [Ectocarpus siliculosus]
          Length = 1080

 Score =  183 bits (465), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 83/175 (47%), Positives = 118/175 (67%), Gaps = 2/175 (1%)

Query: 26  NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
           N    +   V    ++ +R  YG+PED  V CNFNQLYK+DP+    W++VLK VPN++L
Sbjct: 721 NDHKQSARYVLDRTLLPTRATYGVPEDRFVLCNFNQLYKMDPAIFSTWMSVLKRVPNAVL 780

Query: 86  WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
           WLL+FP  GEANI+  A+  G+ + ++ F++VA KEEH++RG LAD+ LDTP CN HTT 
Sbjct: 781 WLLRFPPAGEANIRMEARKRGVREEQLHFTDVATKEEHIKRGYLADLFLDTPSCNAHTTG 840

Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCP--ELIARTHKEYQDIAIRLGTD 198
            D+LW+GTP++T+ G  +A+RVA S L   G     LI  + +EY++ A+ L TD
Sbjct: 841 CDILWSGTPMLTMAGSKMATRVAPSLLKAAGAEGTGLIVESLEEYEERAVSLATD 895


>gi|328699992|ref|XP_003241117.1| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit-like
            isoform 2 [Acyrthosiphon pisum]
 gi|328699994|ref|XP_003241118.1| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit-like
            isoform 3 [Acyrthosiphon pisum]
 gi|328699996|ref|XP_003241119.1| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit-like
            isoform 4 [Acyrthosiphon pisum]
 gi|328699998|ref|XP_001952129.2| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit-like
            isoform 1 [Acyrthosiphon pisum]
          Length = 1108

 Score =  182 bits (463), Expect = 5e-44,   Method: Composition-based stats.
 Identities = 80/166 (48%), Positives = 113/166 (68%)

Query: 34   EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV 93
            E+ +S+    R  + LPE+ +V+CNF++LYKIDP T +MW+N+L  VPNS+LWLL    +
Sbjct: 904  ELEESVKSYYRSTFNLPENVVVFCNFSKLYKIDPFTFRMWLNILNNVPNSVLWLLHLSDI 963

Query: 94   GEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGT 153
             E N++  A  L  D  RI+F++   K +H+ R QLAD+ LDT LCNGH   +D +W G 
Sbjct: 964  AENNLRTFADHLNFDTSRIIFADFIPKYQHLNRIQLADIYLDTHLCNGHIACLDAIWAGV 1023

Query: 154  PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            PV+TLPG+T  SRV ASQL TLG  + IA+  + Y +IAI+LG ++
Sbjct: 1024 PVITLPGDTYESRVTASQLTTLGIADTIAQNEENYIEIAIQLGLNK 1069


>gi|147857906|emb|CAN80393.1| hypothetical protein VITISV_001595 [Vitis vinifera]
          Length = 566

 Score =  182 bits (461), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 85/157 (54%), Positives = 108/157 (68%), Gaps = 1/157 (0%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LWLL+FPA GE  +++ A 
Sbjct: 348 RSDYGLPEDKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRSYAV 407

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A GL   RI+F++VA K EH+RR  LAD+ LDTPLCN HTT  D+LW G P+VTLP E +
Sbjct: 408 AQGLQPDRIIFTDVAMKHEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMVTLPLEKM 467

Query: 164 ASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
           A+RVA S  LAT    E+I  + KEY++ A+ L  +R
Sbjct: 468 ATRVAGSLCLATGLGEEMIVSSMKEYEEKAVSLAMNR 504


>gi|225430656|ref|XP_002270163.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC [Vitis vinifera]
 gi|296085167|emb|CBI28662.3| unnamed protein product [Vitis vinifera]
          Length = 986

 Score =  181 bits (460), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 85/157 (54%), Positives = 108/157 (68%), Gaps = 1/157 (0%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LWLL+FPA GE  +++ A 
Sbjct: 768 RSDYGLPEDKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRSYAV 827

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A GL   RI+F++VA K EH+RR  LAD+ LDTPLCN HTT  D+LW G P+VTLP E +
Sbjct: 828 AQGLQPDRIIFTDVAMKHEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMVTLPLEKM 887

Query: 164 ASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
           A+RVA S  LAT    E+I  + KEY++ A+ L  +R
Sbjct: 888 ATRVAGSLCLATGLGEEMIVSSMKEYEEKAVSLAMNR 924


>gi|384254151|gb|EIE27625.1| TPR-like protein [Coccomyxa subellipsoidea C-169]
          Length = 945

 Score =  181 bits (460), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 88/159 (55%), Positives = 109/159 (68%), Gaps = 1/159 (0%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           + SR + GLPED IVY   NQLYK DP T   W N+L+ VPNS+LWLL+FP  GE  I+A
Sbjct: 734 LPSRTEVGLPEDRIVYSCSNQLYKYDPETFTTWCNILRRVPNSVLWLLRFPPYGEPRIRA 793

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            A A G+D  RI+F++VAAK  H+RR  +ADV LDTPLCN HTT  DVLW G P+VTLP 
Sbjct: 794 EAAARGVDPARIIFTDVAAKPLHIRRSGIADVFLDTPLCNAHTTGCDVLWGGCPMVTLPL 853

Query: 161 ETLASRVAASQLATLGC-PELIARTHKEYQDIAIRLGTD 198
           E +ASRVAAS     G   E++  +  EY+D A+ LG +
Sbjct: 854 ERMASRVAASLCYATGLGHEMVVSSQHEYEDRAVELGLN 892


>gi|326498109|dbj|BAJ94917.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 986

 Score =  181 bits (459), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 82/157 (52%), Positives = 110/157 (70%), Gaps = 1/157 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLP+D  ++  FNQLYK+DP     W N++K VPNS+LWLL+FPA GE  ++A A
Sbjct: 767 KRSDYGLPKDKFIFACFNQLYKMDPEIFDTWCNIVKRVPNSVLWLLRFPATGEMRVKAHA 826

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   +I+F++VA K EH+RR +LAD+ LDTPLCN HTT  D+LW G P++TLP E 
Sbjct: 827 AARGVSPDQIIFTDVAMKHEHIRRSELADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 886

Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTD 198
           +A+RVA S  LAT    E+I  + KEY+D A+ L T+
Sbjct: 887 MATRVAGSLCLATGLGDEMIVSSTKEYEDRAVELATN 923


>gi|326523469|dbj|BAJ92905.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 368

 Score =  181 bits (458), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 82/157 (52%), Positives = 110/157 (70%), Gaps = 1/157 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLP+D  ++  FNQLYK+DP     W N++K VPNS+LWLL+FPA GE  ++A A
Sbjct: 149 KRSDYGLPKDKFIFACFNQLYKMDPEIFDTWCNIVKRVPNSVLWLLRFPATGEMRVKAHA 208

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   +I+F++VA K EH+RR +LAD+ LDTPLCN HTT  D+LW G P++TLP E 
Sbjct: 209 AARGVSPDQIIFTDVAMKHEHIRRSELADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 268

Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTD 198
           +A+RVA S  LAT    E+I  + KEY+D A+ L T+
Sbjct: 269 MATRVAGSLCLATGLGDEMIVSSTKEYEDRAVELATN 305


>gi|357441255|ref|XP_003590905.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
 gi|355479953|gb|AES61156.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
          Length = 986

 Score =  180 bits (457), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 84/158 (53%), Positives = 107/158 (67%), Gaps = 1/158 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LWLLKFPA GE  ++A A
Sbjct: 767 KRSDYGLPEDKFLFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLKFPAAGEMRLRAYA 826

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   +I+F++VA K EH+RR  LAD+ LDTPLCN HTT  D+LW G P+VTLP E 
Sbjct: 827 AAQGVQPDQIIFTDVAMKGEHIRRSSLADLFLDTPLCNAHTTGTDILWAGLPMVTLPLEK 886

Query: 163 LASRVAASQLATLGC-PELIARTHKEYQDIAIRLGTDR 199
           +A+RVA S   + G   E+I  + KEY+D A+ L  +R
Sbjct: 887 MATRVAGSLCISTGLGEEMIVSSMKEYEDRAVSLALNR 924


>gi|357441257|ref|XP_003590906.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
 gi|355479954|gb|AES61157.1| O-linked GlcNAc transferase like protein [Medicago truncatula]
          Length = 744

 Score =  180 bits (457), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 84/158 (53%), Positives = 107/158 (67%), Gaps = 1/158 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LWLLKFPA GE  ++A A
Sbjct: 525 KRSDYGLPEDKFLFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLKFPAAGEMRLRAYA 584

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   +I+F++VA K EH+RR  LAD+ LDTPLCN HTT  D+LW G P+VTLP E 
Sbjct: 585 AAQGVQPDQIIFTDVAMKGEHIRRSSLADLFLDTPLCNAHTTGTDILWAGLPMVTLPLEK 644

Query: 163 LASRVAASQLATLGC-PELIARTHKEYQDIAIRLGTDR 199
           +A+RVA S   + G   E+I  + KEY+D A+ L  +R
Sbjct: 645 MATRVAGSLCISTGLGEEMIVSSMKEYEDRAVSLALNR 682


>gi|356535232|ref|XP_003536152.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC-like [Glycine max]
          Length = 988

 Score =  180 bits (456), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 84/158 (53%), Positives = 108/158 (68%), Gaps = 1/158 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LWLL+FPA GE  ++A A
Sbjct: 769 KRSDYGLPEDKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRAYA 828

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   +I+F++VA K EH+RR  LAD+ LD+PLCN HTT  D+LW G P+VTLP E 
Sbjct: 829 AAQGVQPDQIIFTDVATKNEHIRRSSLADLFLDSPLCNAHTTGTDILWAGLPMVTLPLEK 888

Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
           +A+RVA S  LAT    E+I  + KEY+D A+ L  +R
Sbjct: 889 MATRVAGSLCLATGLGDEMIVSSMKEYEDRAVSLALNR 926


>gi|357162940|ref|XP_003579571.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC-like [Brachypodium
           distachyon]
          Length = 983

 Score =  179 bits (455), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 84/174 (48%), Positives = 113/174 (64%), Gaps = 1/174 (0%)

Query: 26  NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
           N       +V   +    R  YGLPED  ++  FNQLYK+DP     W N++K VPNS+L
Sbjct: 747 NDYKQKNRDVFSPLCPHKRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNIVKRVPNSVL 806

Query: 86  WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
           WLL+FPA GE  ++A A A G+   +I+F++VA K EH+RR +LAD+ LDTPLCN HTT 
Sbjct: 807 WLLRFPATGEMRVKAHAAARGVSPDQIIFTDVAVKHEHIRRSELADLFLDTPLCNAHTTG 866

Query: 146 MDVLWTGTPVVTLPGETLASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTD 198
            D+LW G P++TL  E +A+RVA S  LAT    E+I  + KEY+D A+ L T+
Sbjct: 867 TDILWAGLPMITLALEKMATRVAGSLCLATGLGDEMIVSSTKEYEDRAVELATN 920


>gi|428175412|gb|EKX44302.1| hypothetical protein GUITHDRAFT_95096, partial [Guillardia theta
           CCMP2712]
          Length = 286

 Score =  178 bits (452), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 79/139 (56%), Positives = 101/139 (72%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR QYG+PED IV CNFNQLYKIDP  L +W  +L  +P ++LWLL+FP  GEA I+  A
Sbjct: 143 SRAQYGIPEDKIVLCNFNQLYKIDPLILDVWCRILHKLPETVLWLLRFPHAGEAGIRREA 202

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G+   RI+F++VA K EH+RRG  AD+ LD+  CN HTT  D+LW+GTP+VT P E 
Sbjct: 203 RARGIADDRIIFTDVANKSEHIRRGVCADLFLDSYQCNAHTTGCDILWSGTPIVTTPQEK 262

Query: 163 LASRVAASQLATLGCPELI 181
           ++ RVAAS +  LGC ELI
Sbjct: 263 MSCRVAASLVNALGCKELI 281


>gi|224123642|ref|XP_002319130.1| predicted protein [Populus trichocarpa]
 gi|222857506|gb|EEE95053.1| predicted protein [Populus trichocarpa]
          Length = 923

 Score =  178 bits (452), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 83/158 (52%), Positives = 109/158 (68%), Gaps = 1/158 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LWLL+FPA GE  ++A A
Sbjct: 707 KRSDYGLPEDKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRAYA 766

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   +I+F++VA K+EH+RR  LAD+ LDTPLCN HTT  D+LW G P+VT+P E 
Sbjct: 767 VAQGVQPDQIIFTDVAMKQEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMVTMPLEK 826

Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
           +A+RVA S  LAT    E+I  + KEY++ A+ L  +R
Sbjct: 827 MATRVAGSLCLATGLGDEMIVSSMKEYEERAVSLALNR 864


>gi|452824446|gb|EME31449.1| polypeptide N-acetylglucosaminyltransferase [Galdieria sulphuraria]
          Length = 1024

 Score =  178 bits (451), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 87/168 (51%), Positives = 111/168 (66%), Gaps = 7/168 (4%)

Query: 40  VITSRQQYGLPEDA-------IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA 92
           +I +R+ YGLP+         +++ NFNQLYK+DPSTL +W N+LK VP S +WLL+FP 
Sbjct: 776 IILTRESYGLPKSVYEGGDCTVLFANFNQLYKLDPSTLNIWCNILKRVPGSKIWLLRFPP 835

Query: 93  VGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTG 152
             EA + A      +   RI+F++VA KEEH+ R  LAD+ LDTP+CN HTT+ D LW+G
Sbjct: 836 AAEARLLAQVSKRHISTDRIIFTDVAPKEEHIARCGLADIFLDTPVCNAHTTATDALWSG 895

Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            PVVT P   LASRVAAS L   G  ELIAR   EYQDIAI L  + +
Sbjct: 896 LPVVTGPTVRLASRVAASVLHAAGLDELIARDMNEYQDIAIYLAYNYE 943


>gi|302765547|ref|XP_002966194.1| peptide-N-acetylglucosaminyltransferase [Selaginella
           moellendorffii]
 gi|300165614|gb|EFJ32221.1| peptide-N-acetylglucosaminyltransferase [Selaginella
           moellendorffii]
          Length = 985

 Score =  177 bits (450), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 84/174 (48%), Positives = 111/174 (63%), Gaps = 1/174 (0%)

Query: 26  NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
           N       +V  S  +  R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS L
Sbjct: 750 NDYKQKNRDVLDSTNLPKRSDYGLPEDKFIFACFNQLYKMDPEIFTTWCNILKRVPNSAL 809

Query: 86  WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
           WLL+FPA GE  ++A A A G+   +I+F++VAAK EH+RR  LAD+ LDTPLCN HTT 
Sbjct: 810 WLLRFPAAGEDRLKAFAVAQGVGPGQIIFTDVAAKNEHIRRSALADLFLDTPLCNAHTTG 869

Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGC-PELIARTHKEYQDIAIRLGTD 198
            DVLW G P++TLP E +A+RVA S     G   E++ +  +EY+D A+ L  +
Sbjct: 870 TDVLWAGLPMITLPLEKMATRVAMSLCYAAGFGDEMVVKKMQEYEDRAVMLAEN 923


>gi|255562045|ref|XP_002522031.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
           communis]
 gi|223538835|gb|EEF40435.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
           communis]
          Length = 979

 Score =  177 bits (449), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 82/158 (51%), Positives = 109/158 (68%), Gaps = 1/158 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LWLL+FPA GE  +++ A
Sbjct: 760 KRSDYGLPEDKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRSYA 819

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            + G+   +I+F++VA K+EH+RR  LAD+ LDTPLCN HTT  D+LW G P+VTLP E 
Sbjct: 820 VSQGVQPEQIIFTDVAMKQEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMVTLPLEK 879

Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
           +A+RVA S  LAT    E+I  + KEY++ A+ L  +R
Sbjct: 880 MATRVAGSLCLATGLGDEMIVSSMKEYEEKAVSLALNR 917


>gi|413936918|gb|AFW71469.1| hypothetical protein ZEAMMB73_864118 [Zea mays]
          Length = 706

 Score =  177 bits (449), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 80/155 (51%), Positives = 104/155 (67%), Gaps = 1/155 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LWLL+FPA GE  ++A A
Sbjct: 487 KRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNILKRVPNSALWLLRFPAAGETRVRAYA 546

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   +I+F++VA K EH+RR  LAD+ LDTPLCN HTT  D+LW G P++TLP E 
Sbjct: 547 AARGVRSDQIVFTDVAMKNEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 606

Query: 163 LASRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
           +A+RVA S     G   E+I  + K+Y+D A+ L 
Sbjct: 607 MATRVAGSLCVATGLGEEMIVSSMKKYEDRAVELA 641


>gi|242065108|ref|XP_002453843.1| hypothetical protein SORBIDRAFT_04g019560 [Sorghum bicolor]
 gi|241933674|gb|EES06819.1| hypothetical protein SORBIDRAFT_04g019560 [Sorghum bicolor]
          Length = 1011

 Score =  177 bits (448), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 81/155 (52%), Positives = 104/155 (67%), Gaps = 1/155 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LWLL+FPA GE  ++A A
Sbjct: 792 KRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNILKRVPNSALWLLRFPAAGETRVRAHA 851

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   +I+F++VA K EH+RR  LAD+ LDTPLCN HTT  D+LW G P++TLP E 
Sbjct: 852 VARGVRSDQIIFTDVAMKNEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 911

Query: 163 LASRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
           +A+RVA S     G   E+I  + KEY+D A+ L 
Sbjct: 912 MATRVAGSLCVATGLGEEMIVSSMKEYEDRAVDLA 946


>gi|297833104|ref|XP_002884434.1| hypothetical protein ARALYDRAFT_896453 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330274|gb|EFH60693.1| hypothetical protein ARALYDRAFT_896453 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 977

 Score =  177 bits (448), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 83/158 (52%), Positives = 108/158 (68%), Gaps = 1/158 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP  +  W N+LK VPNS LWLL+FPA GE   +  A
Sbjct: 758 KRSDYGLPEDKFIFACFNQLYKMDPEIVNTWCNILKRVPNSALWLLRFPAAGEMRFRTYA 817

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   +I+F++VA K EH+RR  LADV LDTPLCNGHTT  DVLW G P++TLP E 
Sbjct: 818 AAQGVQSDQIIFTDVAMKSEHIRRSVLADVILDTPLCNGHTTGTDVLWAGVPMITLPLEK 877

Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
           +A+RVA S  LAT    E+I  + +EY++ A+ L  ++
Sbjct: 878 MATRVAGSLCLATGLGHEMIVNSLEEYEEKAVSLALNK 915


>gi|224144666|ref|XP_002325368.1| predicted protein [Populus trichocarpa]
 gi|222862243|gb|EEE99749.1| predicted protein [Populus trichocarpa]
          Length = 873

 Score =  177 bits (448), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 82/158 (51%), Positives = 110/158 (69%), Gaps = 1/158 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LWLL+FPA GE  ++A A
Sbjct: 654 KRLDYGLPEDKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRAYA 713

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G++  +I+F++VA K+EH+RR  LAD+ LDTPLCN HTT  D+LW G P+VT+P E 
Sbjct: 714 VAQGVNPDQIIFTDVAMKQEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMVTMPLEK 773

Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
           +A+RVA S  LAT    E+I  + KEY++ A+ L  ++
Sbjct: 774 MATRVAGSLCLATGLGDEMIVSSMKEYEERAVSLALNK 811


>gi|293334567|ref|NP_001167956.1| uncharacterized protein LOC100381672 [Zea mays]
 gi|223945123|gb|ACN26645.1| unknown [Zea mays]
          Length = 1009

 Score =  176 bits (447), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 80/155 (51%), Positives = 104/155 (67%), Gaps = 1/155 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LWLL+FPA GE  ++A A
Sbjct: 790 KRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNILKRVPNSALWLLRFPAAGETRVRAYA 849

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   +I+F++VA K EH+RR  LAD+ LDTPLCN HTT  D+LW G P++TLP E 
Sbjct: 850 AARGVRSDQIVFTDVAMKNEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 909

Query: 163 LASRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
           +A+RVA S     G   E+I  + K+Y+D A+ L 
Sbjct: 910 MATRVAGSLCVATGLGEEMIVSSMKKYEDRAVELA 944


>gi|328714601|ref|XP_001947755.2| PREDICTED: UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase 110 kDa subunit-like
            [Acyrthosiphon pisum]
          Length = 1076

 Score =  176 bits (447), Expect = 4e-42,   Method: Composition-based stats.
 Identities = 79/156 (50%), Positives = 105/156 (67%)

Query: 44   RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
            R  + LPE+ +V+CNF++LYKIDP TL+MW+ +L  VP S+LWLL    V E N++  A 
Sbjct: 876  RSTFNLPENVVVFCNFSKLYKIDPFTLRMWITILNNVPKSVLWLLHLNDVAENNLRTFAD 935

Query: 104  ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             L  D  RI+F++   K +H+ R QLAD+ LDT L NGH   +D LW G PV+TLPG+T 
Sbjct: 936  VLNFDTSRIIFADFIPKYQHLNRIQLADIYLDTHLYNGHIACLDALWAGVPVITLPGDTY 995

Query: 164  ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASRV  SQL TLG  + IA+  + Y DIAI LG ++
Sbjct: 996  ASRVTTSQLTTLGITDTIAQNEENYIDIAIELGLNK 1031


>gi|323452442|gb|EGB08316.1| putative O-linked GlcNAc transferase [Aureococcus anophagefferens]
          Length = 1151

 Score =  176 bits (446), Expect = 5e-42,   Method: Composition-based stats.
 Identities = 83/176 (47%), Positives = 114/176 (64%), Gaps = 1/176 (0%)

Query: 26  NTKTATGEEVPQSIVITSRQQYGLPEDA-IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSI 84
           N    +   +  +  +  R  YG+PE A  V+CNFNQ+YKIDP  L +W  +L  V  SI
Sbjct: 770 NDHAQSASYIFDAAALPKRADYGVPERASFVFCNFNQIYKIDPGILDVWCRILTRVDGSI 829

Query: 85  LWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTT 144
           LWLL+FP  GEANI+A A+  G+   RI F+ V+AK+EH++RG LAD+ LDTP CN HTT
Sbjct: 830 LWLLRFPPTGEANIRAEARKRGVSDDRIHFTAVSAKDEHIKRGVLADLFLDTPQCNAHTT 889

Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             D+LW G P+VT  G  +A+RVAAS L   G  +L+A     Y+++A+ L TD++
Sbjct: 890 GCDILWGGCPMVTCLGGKMATRVAASLLGAAGLADLVAPDLGGYEELAVALATDKE 945


>gi|60599458|gb|AAX26274.1| unknown [Schistosoma japonicum]
          Length = 180

 Score =  176 bits (446), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 84/139 (60%), Positives = 106/139 (76%), Gaps = 2/139 (1%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI +     ++NG+ + NGL T QT +K + GEEVP+ +++TSRQ YGLPEDA+V+CNFN
Sbjct: 34  MIRTNSPSCTINGVTIHNGLVTQQTQSKASAGEEVPRDLILTSRQHYGLPEDAVVFCNFN 93

Query: 61  QLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVA 118
           QLYK+DPST++MWV +LK VPNS+LWLL+FPA GEA   A A  +GL Q   RILFSNVA
Sbjct: 94  QLYKVDPSTMRMWVEILKGVPNSVLWLLRFPAAGEAGALAAASEMGLQQVNRRILFSNVA 153

Query: 119 AKEEHVRRGQLADVCLDTP 137
            KEEHVRRGQ++   L TP
Sbjct: 154 PKEEHVRRGQVSSYFLITP 172


>gi|357142312|ref|XP_003572530.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC-like [Brachypodium
           distachyon]
          Length = 1001

 Score =  175 bits (444), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 81/157 (51%), Positives = 107/157 (68%), Gaps = 1/157 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LW+L+FPA GE  ++A A
Sbjct: 782 KRSDYGLPEDKFIFACFNQLYKMDPEIFDRWCNILKRVPNSALWILRFPAAGETRVRAHA 841

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   +I+F++VA K EH+RR  LAD+ LDTPLCN HTT  D+LW G P++TLP E 
Sbjct: 842 AARGVRSDQIIFTDVAMKTEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 901

Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTD 198
           +A+RVA S  LAT    E+I  + +EY+D A+ L  +
Sbjct: 902 MATRVAGSLCLATGLGEEMIVSSMQEYEDRAVALAQN 938


>gi|89242353|gb|ABD64595.1| O-linked N-acetylglucosamine transferase, partial [Sus scrofa]
 gi|89242355|gb|ABD64596.1| O-linked N-acetylglucosamine transferase, partial [Sus scrofa]
          Length = 103

 Score =  175 bits (444), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 83/103 (80%), Positives = 90/103 (87%)

Query: 77  LKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDT 136
           LK VPNS+LWLL+FPAVGE NIQ  AQ +GL Q+RI+FS VA KEEHVRRGQLADVCLDT
Sbjct: 1   LKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPKEEHVRRGQLADVCLDT 60

Query: 137 PLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
           PLCNGHTT MDVLW GTP+VT+PGETLASRVAASQL  LGC E
Sbjct: 61  PLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLE 103


>gi|218190772|gb|EEC73199.1| hypothetical protein OsI_07263 [Oryza sativa Indica Group]
          Length = 1004

 Score =  174 bits (441), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 84/157 (53%), Positives = 108/157 (68%), Gaps = 3/157 (1%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LWLL+FPA GE  ++A A
Sbjct: 783 KRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNILKRVPNSALWLLRFPAAGETRVRAHA 842

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   +I+F++VA K EH+RR  LAD+ LDTPLCN HTT  D+LW G P++TLP E 
Sbjct: 843 AARGVRPDQIIFTDVAMKNEHIRRSSLADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 902

Query: 163 LASRVAASQ-LAT-LGCPELIAR-THKEYQDIAIRLG 196
           +A+RVA S  LAT LG   +++R   KEY+D A+ L 
Sbjct: 903 MATRVAGSLCLATGLGEEMIVSRQVMKEYEDRAVDLA 939


>gi|222622881|gb|EEE57013.1| hypothetical protein OsJ_06783 [Oryza sativa Japonica Group]
          Length = 1004

 Score =  174 bits (441), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 84/157 (53%), Positives = 108/157 (68%), Gaps = 3/157 (1%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LWLL+FPA GE  ++A A
Sbjct: 783 KRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNILKRVPNSALWLLRFPAAGETRVRAHA 842

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   +I+F++VA K EH+RR  LAD+ LDTPLCN HTT  D+LW G P++TLP E 
Sbjct: 843 AARGVRPDQIIFTDVAMKNEHIRRSSLADLFLDTPLCNAHTTGTDILWAGLPMITLPLEK 902

Query: 163 LASRVAASQ-LAT-LGCPELIAR-THKEYQDIAIRLG 196
           +A+RVA S  LAT LG   +++R   KEY+D A+ L 
Sbjct: 903 MATRVAGSLCLATGLGEEMIVSRQVMKEYEDRAVDLA 939


>gi|403344986|gb|EJY71848.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
            [Oxytricha trifallax]
          Length = 1211

 Score =  173 bits (439), Expect = 3e-41,   Method: Composition-based stats.
 Identities = 75/160 (46%), Positives = 111/160 (69%), Gaps = 1/160 (0%)

Query: 42   TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
            + R+QYGLP D  ++ NFNQLYK+DP+T  +W+ +L  VPNSILW++++PA  + N+   
Sbjct: 915  SCREQYGLPNDKFIFANFNQLYKLDPNTFTVWMKILGRVPNSILWIVEYPAEAKENLLKE 974

Query: 102  AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP-G 160
            A A G+D  RI+ +  A K EH+ R  LAD+ LD P+ NGHTT+ D+LW+G P++T P  
Sbjct: 975  AVARGIDPSRIIMTTKAPKHEHIERCHLADLSLDNPITNGHTTTCDLLWSGLPIITYPFS 1034

Query: 161  ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            E + SRVAAS    L CPE+I  ++++Y++ A++L T  D
Sbjct: 1035 ENMPSRVAASICQALECPEMIVTSYQDYEEYAVKLATPTD 1074


>gi|15229253|ref|NP_187074.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC [Arabidopsis
           thaliana]
 gi|75336082|sp|Q9M8Y0.1|SEC_ARATH RecName: Full=Probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC; AltName:
           Full=Protein SECRET AGENT
 gi|6721161|gb|AAF26789.1|AC016829_13 putative O-linked GlcNAc transferase [Arabidopsis thaliana]
 gi|18139887|gb|AAL60196.1|AF441079_1 O-linked N-acetyl glucosamine transferase [Arabidopsis thaliana]
 gi|20259324|gb|AAM13988.1| putative O-linked GlcNAc transferase [Arabidopsis thaliana]
 gi|21436429|gb|AAM51415.1| putative O-linked GlcNAc transferase [Arabidopsis thaliana]
 gi|110742062|dbj|BAE98963.1| O-linked GlcNAc transferase like protein [Arabidopsis thaliana]
 gi|332640535|gb|AEE74056.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC [Arabidopsis
           thaliana]
          Length = 977

 Score =  173 bits (439), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 82/158 (51%), Positives = 107/158 (67%), Gaps = 1/158 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP  +  W N+LK VPNS LWLL+FPA GE   +  A
Sbjct: 758 KRSDYGLPEDKFIFACFNQLYKMDPEIVNTWCNILKRVPNSALWLLRFPAAGEMRFRTYA 817

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   +I+F++VA K EH+RR  LADV LDTPLCNGHTT  DVLW G P++TLP E 
Sbjct: 818 AAQGVQPDQIIFTDVAMKSEHIRRSVLADVILDTPLCNGHTTGTDVLWAGVPMITLPLEK 877

Query: 163 LASRVAASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
           +A+RVA S  LAT     +I  + +EY++ A+ L  ++
Sbjct: 878 MATRVAGSLCLATGLGHGMIVNSLEEYEEKAVSLALNK 915


>gi|357131539|ref|XP_003567394.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC-like [Brachypodium
           distachyon]
          Length = 980

 Score =  172 bits (437), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 80/174 (45%), Positives = 110/174 (63%), Gaps = 1/174 (0%)

Query: 26  NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
           N      ++    +    R  YGLPED  ++  FNQLYK+DP     W N++K VPNS L
Sbjct: 744 NDYKKKNQDALDQVCPHKRADYGLPEDKFIFACFNQLYKMDPDIFGTWCNIVKRVPNSAL 803

Query: 86  WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
           WLL+FP  GE  ++A A + G+   +I+F++VAAK EH+RR  LAD+ LDTPLCNGHTT 
Sbjct: 804 WLLRFPPAGEMRLRAYAISKGVRADQIIFTDVAAKSEHIRRSALADLFLDTPLCNGHTTG 863

Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGC-PELIARTHKEYQDIAIRLGTD 198
            D+LW G P++TLP E +A+RVA S     G   E+I  + KEY++ A+ L  +
Sbjct: 864 TDILWAGLPMITLPLEKMATRVAGSLCRATGIGEEMIVSSLKEYEERAVFLAKN 917


>gi|168054571|ref|XP_001779704.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668902|gb|EDQ55500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 922

 Score =  172 bits (436), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 80/155 (51%), Positives = 105/155 (67%), Gaps = 1/155 (0%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R  YGLPED  ++  FNQLYK+DP     W  +LK VPNS LWLL+FPA GE  ++A A 
Sbjct: 704 RSDYGLPEDKFLFACFNQLYKMDPEIFSTWCRILKRVPNSALWLLRFPAAGETRLKAFAI 763

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A G+   +I+F++VAAK EH+RR  LAD+ LD+PLCN HTT  DVLW G P+VTLP E +
Sbjct: 764 AQGVRPDQIIFTDVAAKNEHIRRSGLADLFLDSPLCNAHTTGTDVLWAGLPMVTLPLEKM 823

Query: 164 ASRVAASQLATLGC-PELIARTHKEYQDIAIRLGT 197
           A+RVA S     G   E++  + +EY++ A+ L T
Sbjct: 824 ATRVAMSLCYAAGFGEEMVVSSMQEYEERAVMLAT 858


>gi|168044085|ref|XP_001774513.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674225|gb|EDQ60737.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 932

 Score =  170 bits (430), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 78/160 (48%), Positives = 105/160 (65%), Gaps = 1/160 (0%)

Query: 40  VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ 99
           +   R  Y LPED  ++  FNQLYK+DP     W  +LK VPNS LWLL+FPA GE  ++
Sbjct: 710 ISMKRSDYSLPEDKFLFACFNQLYKMDPEIFSTWCRILKRVPNSALWLLRFPAAGETRLK 769

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           A A A G+   + +F++VAAK EH+RR  LAD+ LD+PLCN HTT  DVLW G P+VTLP
Sbjct: 770 AFAIAQGVRPEQFIFTDVAAKNEHIRRSALADLFLDSPLCNAHTTGTDVLWAGLPMVTLP 829

Query: 160 GETLASRVAASQLATLGC-PELIARTHKEYQDIAIRLGTD 198
            E +A+RVA S     G   E++  + +EY++ A+ L T+
Sbjct: 830 LEKMATRVAMSLCYAAGFGEEMVVSSMQEYEERAVMLATN 869


>gi|449449593|ref|XP_004142549.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC-like [Cucumis
           sativus]
          Length = 975

 Score =  167 bits (422), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 78/152 (51%), Positives = 104/152 (68%), Gaps = 1/152 (0%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
           LP+   ++  FNQLYK+DP     W N+LK VPNS LWLL+FPA GE  ++A A A G+ 
Sbjct: 762 LPQGKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRAYAVAQGVQ 821

Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
             +I+F++VA K EH+RR  LAD+ LDTPLCN HTT  D+LW G P++TLP E +A+RVA
Sbjct: 822 PEQIIFTDVAMKNEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMITLPLEKMATRVA 881

Query: 169 ASQ-LATLGCPELIARTHKEYQDIAIRLGTDR 199
            S  LAT    E+I  + KEY++ A+ L  +R
Sbjct: 882 GSLCLATGLGDEMIVSSMKEYEEKAVTLALNR 913


>gi|323449294|gb|EGB05183.1| hypothetical protein AURANDRAFT_54803 [Aureococcus anophagefferens]
          Length = 897

 Score =  166 bits (419), Expect = 7e-39,   Method: Composition-based stats.
 Identities = 73/157 (46%), Positives = 105/157 (66%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R +YGLPEDA ++  F QL K+DP    +W+++LK VPNSIL L+K P     +++  A+
Sbjct: 687 RSKYGLPEDAFIFAYFGQLNKVDPIAYGVWMSILKRVPNSILLLIKSPESAVVHLKKAAR 746

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
            L + + R++F + A K EH+ RG LAD+ LDTP CN    +MD LWTGTPV+TL G ++
Sbjct: 747 DLQVHEQRLIFMDSAPKREHILRGVLADLLLDTPACNAFDATMDALWTGTPVITLLGNSI 806

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           A+R +AS    +GCP+L+  T  EY+D+A+ L  D D
Sbjct: 807 ATRASASLCTAVGCPDLVTSTLDEYEDLAVSLALDAD 843


>gi|349605516|gb|AEQ00726.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit-like protein, partial [Equus caballus]
          Length = 158

 Score =  164 bits (416), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 76/97 (78%), Positives = 86/97 (88%)

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           AQ +GL Q+RI+FS VA KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGE
Sbjct: 2   AQNMGLPQNRIIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGE 61

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           TLASRVAASQL  LGC ELIA+  +EY+DIA++LGTD
Sbjct: 62  TLASRVAASQLTCLGCLELIAKNRQEYEDIAVKLGTD 98


>gi|449532483|ref|XP_004173210.1| PREDICTED: LOW QUALITY PROTEIN: probable
           UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC-like, partial
           [Cucumis sativus]
          Length = 434

 Score =  163 bits (413), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 71/127 (55%), Positives = 90/127 (70%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R  YGLPE   ++  FNQLYK+DP     W N+LK VPNS LWLL+FPA GE  ++A A 
Sbjct: 294 RSDYGLPEGKFIFACFNQLYKMDPEIFNTWCNILKRVPNSALWLLRFPAAGEMRLRAYAV 353

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A G+   +I+F++VA K EH+RR  LAD+ LDTPLCN HTT  D+LW G P++TLP E +
Sbjct: 354 AQGVQPEQIIFTDVAMKNEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMITLPLEKM 413

Query: 164 ASRVAAS 170
           A+RVA S
Sbjct: 414 ATRVAGS 420


>gi|425442300|ref|ZP_18822552.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
           9717]
 gi|389716770|emb|CCH99039.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
           9717]
          Length = 1039

 Score =  162 bits (411), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 74/158 (46%), Positives = 99/158 (62%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R  YGLPE   V+C FN  YKI+P    +W+ +L  VP S+LWL    A  EAN++  A
Sbjct: 830 TRSHYGLPESGFVFCCFNHTYKIEPQIFTVWMEILANVPGSVLWLFSRVAEAEANLRREA 889

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G+D  R++F+++  K EH+ R QLAD+ LDT   N HTT  D LW G P++T PG T
Sbjct: 890 QARGIDGDRLIFAHLQPKSEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIITCPGAT 949

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             SRV AS L  +G PELI +  +EY+++AI L    D
Sbjct: 950 FPSRVGASLLTAMGLPELITKNLEEYKNLAINLAKSPD 987


>gi|400754587|ref|YP_006562955.1| hypothetical protein PGA2_c17130 [Phaeobacter gallaeciensis 2.10]
 gi|398653740|gb|AFO87710.1| hypothetical protein PGA2_c17130 [Phaeobacter gallaeciensis 2.10]
          Length = 585

 Score =  162 bits (410), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 83/194 (42%), Positives = 117/194 (60%), Gaps = 8/194 (4%)

Query: 4   SGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLY 63
            G+   S   + + N    N  +   A  EE P      +R   GLPEDA+V+C+FN   
Sbjct: 342 EGRRHFSEKILYMPNCYQVN--DNSRAHPEEKP------TRADMGLPEDAVVFCSFNNHN 393

Query: 64  KIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEH 123
           K+ P+   +W+++LK V NS+LW L    +  ANI   A+A G+   RI+F+   +  +H
Sbjct: 394 KVTPAEFDIWMDLLKDVDNSVLWFLAAADIVRANILKEAEARGVPADRIVFAGRCSTPDH 453

Query: 124 VRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIAR 183
           V R  LAD+ LDT  CN HTT+ ++LW+G PVVT PGE  A+RVAAS +  +GCPELIA 
Sbjct: 454 VARLPLADIFLDTFACNAHTTASEMLWSGVPVVTKPGEQFAARVAASIVTAVGCPELIAE 513

Query: 184 THKEYQDIAIRLGT 197
           T +EY+ +A+RL T
Sbjct: 514 TDEEYRALALRLAT 527


>gi|26338580|dbj|BAC32961.1| unnamed protein product [Mus musculus]
          Length = 153

 Score =  161 bits (407), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 74/93 (79%), Positives = 84/93 (90%)

Query: 106 GLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLAS 165
           GL Q+RI+FS VA KEEHVRRGQLADVCLDTPLCNGHTT MDVLW GTP+VT+PGETLAS
Sbjct: 1   GLPQNRIIFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLAS 60

Query: 166 RVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           RVAASQL  LGC ELIA++ +EY+DIA++LGTD
Sbjct: 61  RVAASQLTCLGCLELIAKSRQEYEDIAVKLGTD 93


>gi|166368783|ref|YP_001661056.1| hypothetical protein MAE_60420 [Microcystis aeruginosa NIES-843]
 gi|166091156|dbj|BAG05864.1| TPR repeat protein [Microcystis aeruginosa NIES-843]
          Length = 631

 Score =  161 bits (407), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 73/158 (46%), Positives = 99/158 (62%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R  YGLPE   V+C FN  YKI+P    +W+ +L  VP S+LWL    A  EAN++  A
Sbjct: 422 TRSHYGLPESGFVFCCFNHTYKIEPQIFTVWMQILANVPGSVLWLFSRVAEAEANLRREA 481

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G++  R++F+++  K EH+ R QLAD+ LDT   N HTT  D LW G P++T PG T
Sbjct: 482 QARGIEGDRLIFAHLQPKSEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIITCPGAT 541

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             SRV AS L  +G PELI +  +EY+++AI L    D
Sbjct: 542 FPSRVGASLLTAIGLPELITKNLEEYKNLAINLAKSPD 579


>gi|422303914|ref|ZP_16391264.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC (fragment)
           [Microcystis aeruginosa PCC 9806]
 gi|389791071|emb|CCI13110.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC (fragment)
           [Microcystis aeruginosa PCC 9806]
          Length = 288

 Score =  160 bits (406), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 73/158 (46%), Positives = 99/158 (62%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R  YGLPE   V+C FN  YKI+P    +W+ +L  VP S+LWL    A  EAN++  A
Sbjct: 79  TRSHYGLPESGFVFCCFNHTYKIEPQIFTVWMEILDNVPGSVLWLFSRVAEAEANLRREA 138

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G++  R++F+++  K EH+ R QLAD+ LDT   N HTT  D LW G P++T PG T
Sbjct: 139 QARGIEGDRLIFAHLQPKPEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIITCPGTT 198

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             SRV AS L  +G PELI +  +EY+++AI L    D
Sbjct: 199 FPSRVGASLLTAIGLPELITKNLEEYKNLAINLAKSPD 236


>gi|399992912|ref|YP_006573152.1| hypothetical protein PGA1_c17330 [Phaeobacter gallaeciensis DSM
           17395 = CIP 105210]
 gi|398657467|gb|AFO91433.1| hypothetical protein PGA1_c17330 [Phaeobacter gallaeciensis DSM
           17395 = CIP 105210]
          Length = 585

 Score =  160 bits (406), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 83/194 (42%), Positives = 117/194 (60%), Gaps = 8/194 (4%)

Query: 4   SGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLY 63
            G+   S   + + N    N  +   A  EE P      +R   GLPE+A+V+C+FN   
Sbjct: 342 EGRRHFSEKILYMPNCYQVN--DNSRAHPEEKP------TRADMGLPENAVVFCSFNNHN 393

Query: 64  KIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEH 123
           K+ P+   +W+++LK V NS+LW L    V  ANI   A+A G+   RI+F+   +  +H
Sbjct: 394 KVTPAEFDIWMDLLKDVDNSVLWFLAAADVVRANILKEAEARGVPADRIVFAGRCSTPDH 453

Query: 124 VRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIAR 183
           V R  LAD+ LDT  CN HTT+ ++LW+G PVVT PGE  A+RVAAS +  +GCPELIA 
Sbjct: 454 VARLPLADIFLDTFACNAHTTASEMLWSGVPVVTKPGEQFAARVAASIVTAVGCPELIAE 513

Query: 184 THKEYQDIAIRLGT 197
           T +EY+ +A+RL T
Sbjct: 514 TDEEYRALALRLAT 527


>gi|425449527|ref|ZP_18829366.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
           7941]
 gi|389763752|emb|CCI09797.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
           7941]
          Length = 1038

 Score =  160 bits (404), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 72/158 (45%), Positives = 99/158 (62%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R  YGLPE   V+C FN  YKI+P    +W+ +L  VP S+LWL    A  EAN++  A
Sbjct: 829 TRSHYGLPESGFVFCCFNHTYKIEPQVFTVWMQILANVPGSVLWLFSRVAEAEANLRREA 888

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G++  R++F+++  K EH+ R QLAD+ LDT   N HTT  D LW G P++T PG T
Sbjct: 889 KARGIEGDRLIFAHLEPKSEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIITCPGAT 948

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             SRV AS L  +G PELI +  +EY+++AI L    D
Sbjct: 949 FPSRVGASLLTAIGLPELITKNLEEYKNLAINLAKSPD 986


>gi|440755140|ref|ZP_20934342.1| TPR domain protein [Microcystis aeruginosa TAIHU98]
 gi|440175346|gb|ELP54715.1| TPR domain protein [Microcystis aeruginosa TAIHU98]
          Length = 388

 Score =  159 bits (403), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 72/158 (45%), Positives = 99/158 (62%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R  YGLPE   V+C FN  YKI+P    +W+ +L  VP S+LWL    A  EAN++  A
Sbjct: 179 TRSHYGLPESGFVFCCFNHTYKIEPQIFTVWMEILANVPGSVLWLFSRVAEAEANLRREA 238

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G++  R++F+++  K EH+ R QLAD+ LDT   N HTT  D LW G P++T PG T
Sbjct: 239 KARGIEGDRLIFAHLEPKSEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIITCPGAT 298

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             SRV AS L  +G PELI +  +EY+++AI L    D
Sbjct: 299 FPSRVGASLLTAIGLPELITKNLEEYKNLAINLAKSPD 336


>gi|390440632|ref|ZP_10228858.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC (fragment)
           [Microcystis sp. T1-4]
 gi|389836094|emb|CCI32984.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC (fragment)
           [Microcystis sp. T1-4]
          Length = 450

 Score =  159 bits (403), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 73/158 (46%), Positives = 99/158 (62%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R  YGLPE   V+C FN  YKI+P    +W+ +L  VP S+LWL    A  EAN++  A
Sbjct: 241 TRSHYGLPESGFVFCCFNHTYKIEPQIFTVWMEILANVPGSVLWLFSRVAEAEANLRREA 300

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G++  R++F+++  K EH+ R QLAD+ LDT   N HTT  D LW G P++T PG T
Sbjct: 301 QARGIEGDRLIFAHLQPKLEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIITCPGAT 360

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             SRV AS L  +G PELI +  +EY+++AI L    D
Sbjct: 361 FPSRVGASLLTAIGLPELITKNLEEYKNLAINLAKSPD 398


>gi|425471484|ref|ZP_18850344.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC (fragment)
           [Microcystis aeruginosa PCC 9701]
 gi|389882649|emb|CCI36920.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SEC (fragment)
           [Microcystis aeruginosa PCC 9701]
          Length = 404

 Score =  158 bits (400), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 73/158 (46%), Positives = 99/158 (62%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R QYGLPE   V+C FN  YKI+P    +W+ +L  VP S+LWL    A  EAN++  A
Sbjct: 195 TRSQYGLPESGFVFCCFNHTYKIEPQIFTVWMQILANVPGSVLWLFSRVAEAEANLRREA 254

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G++  R++F+++  K EH+ R QLAD+ LDT   N HTT  D LW G P++T  G T
Sbjct: 255 QARGIEGDRLIFAHLQPKPEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIITCRGAT 314

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             SRV AS L  +G PELI +  +EY+++AI L    D
Sbjct: 315 FPSRVGASLLTAIGLPELITKNLEEYKNLAINLAKSPD 352


>gi|425437323|ref|ZP_18817742.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
            9432]
 gi|389677704|emb|CCH93372.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
            9432]
          Length = 1059

 Score =  156 bits (394), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 72/158 (45%), Positives = 99/158 (62%)

Query: 43   SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            +R QYGLPE   V+C FN  YKI+P    +W+ +L  VP S+LWL    A  EAN++  A
Sbjct: 850  TRSQYGLPESGFVFCCFNHTYKIEPQIFTVWMEILANVPGSVLWLFSRVAEAEANLRREA 909

Query: 103  QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            +A G++  R++F+++  K EH+ R QLAD+ LDT   N HTT  D LW G P++T   ET
Sbjct: 910  KARGIEGDRLIFAHLEPKSEHLARHQLADLFLDTLYYNAHTTGSDALWAGLPIMTCLEET 969

Query: 163  LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
              SRV AS L  +G PELI +  +EY+++AI L    D
Sbjct: 970  FPSRVGASLLTAIGLPELITKNLEEYKNLAINLAKSPD 1007


>gi|425448104|ref|ZP_18828084.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
           9443]
 gi|389731183|emb|CCI04721.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
           9443]
          Length = 1040

 Score =  156 bits (394), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 71/158 (44%), Positives = 99/158 (62%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R QYGLP+   V+C FN  YKI+P    +W+ +L  VP S+LWL    A  EAN++  A
Sbjct: 831 TRSQYGLPKSGFVFCCFNHTYKIEPQIFTVWMEILANVPGSVLWLFSRVAEAEANLRREA 890

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G++  R++F+++  K EH+ R QLAD+ +DT   N HTT  D LW G P++T  GET
Sbjct: 891 KARGIEGDRLIFAHLEPKSEHLARHQLADLFIDTLYYNAHTTGSDALWAGLPIITCLGET 950

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             SRV AS L   G PELI +  +EY+++AI L    D
Sbjct: 951 FPSRVGASLLTATGLPELITKNLEEYKNLAINLAKSPD 988


>gi|425453499|ref|ZP_18833256.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
           9807]
 gi|389802856|emb|CCI18142.1| TPR repeat protein (modular protein) [Microcystis aeruginosa PCC
           9807]
          Length = 1039

 Score =  155 bits (393), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 70/158 (44%), Positives = 99/158 (62%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R QYGLP+   V+C FN  YKI+P    +W+ +L  VP S+LWL    A  EAN++  A
Sbjct: 830 TRSQYGLPKSGFVFCCFNHTYKIEPQIFTVWMEILANVPGSVLWLFSRVAEAEANLRREA 889

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G++  R++F+++  K EH+ R QLAD+ +DT   N HTT  D LW G P++T  GET
Sbjct: 890 KARGIEGDRLIFAHLEPKSEHLARHQLADLFIDTLYYNAHTTGSDALWAGLPIITCLGET 949

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             SRV  S L  +G PELI +  +EY+++AI L    D
Sbjct: 950 FPSRVGGSLLTAIGLPELITKNLEEYKNLAINLAKSPD 987


>gi|320033304|gb|EFW15252.1| UDP-N-acetylglucosaminyltransferase [Coccidioides posadasii str.
           Silveira]
          Length = 279

 Score =  154 bits (390), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 74/163 (45%), Positives = 109/163 (66%), Gaps = 13/163 (7%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
           LP+D I++ NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ +A A   +
Sbjct: 55  LPDDTIIFGNFNQLYKIEPTTFRSWLRILARIPNAILWLLRFPDLGEQNLRQSAVAWAGE 114

Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
               RI+F++VA K  H+ R ++ D+ LDTP CN HTTS D+LW+GTP++TLP     + 
Sbjct: 115 GTASRIIFTDVAPKHAHISRARICDLFLDTPECNAHTTSADILWSGTPLITLPRYKYKMC 174

Query: 165 SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
           SR+A+S L +             ELI  + +EY++ AIRLG D
Sbjct: 175 SRMASSILTSALPQNAAGREAAAELITSSEEEYENNAIRLGLD 217


>gi|119189521|ref|XP_001245367.1| hypothetical protein CIMG_04808 [Coccidioides immitis RS]
          Length = 1610

 Score =  154 bits (389), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 74/163 (45%), Positives = 109/163 (66%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D I++ NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ +A A   +
Sbjct: 1386 LPDDTIIFGNFNQLYKIEPTTFRSWLRILARIPNAILWLLRFPDLGEQNLRQSAVAWAGE 1445

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTTS D+LW+GTP++TLP     + 
Sbjct: 1446 GTASRIIFTDVAPKHAHISRARICDLFLDTPECNAHTTSADILWSGTPLITLPRYKYKMC 1505

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
            SR+A+S L +             ELI  + +EY++ AIRLG D
Sbjct: 1506 SRMASSILTSALPQNAAGREAAAELITSSEEEYENNAIRLGLD 1548


>gi|254475400|ref|ZP_05088786.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110kDa subunit, putative [Ruegeria sp. R11]
 gi|214029643|gb|EEB70478.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110kDa subunit, putative [Ruegeria sp. R11]
          Length = 581

 Score =  154 bits (389), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 69/161 (42%), Positives = 106/161 (65%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           P+     +R   GLP+D +V+C FN   K+ P+   +W+++LK V  S+LW L       
Sbjct: 366 PRPEQTPTRADMGLPQDGVVFCAFNNHNKVSPAEFDIWMDLLKQVDGSVLWFLAGAEPLR 425

Query: 96  ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
           ANI+  A+A G+   RI+F++  +  +H+ R  LAD+ LDT  CN HTT+ +++W+G PV
Sbjct: 426 ANIRKEAEARGVSAERIVFADRCSTPDHIARLPLADIFLDTFACNAHTTASELMWSGVPV 485

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
           VT+PG+  A+RVAAS ++ + CPELIA++  EY+DIA+RL 
Sbjct: 486 VTMPGQQFAARVAASIVSAVNCPELIAQSVDEYRDIALRLA 526


>gi|303323021|ref|XP_003071502.1| TPR Domain containing protein [Coccidioides posadasii C735 delta
            SOWgp]
 gi|240111204|gb|EER29357.1| TPR Domain containing protein [Coccidioides posadasii C735 delta
            SOWgp]
          Length = 1676

 Score =  154 bits (388), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 74/163 (45%), Positives = 109/163 (66%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D I++ NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ +A A   +
Sbjct: 1452 LPDDTIIFGNFNQLYKIEPTTFRSWLRILARIPNAILWLLRFPDLGEQNLRQSAVAWAGE 1511

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTTS D+LW+GTP++TLP     + 
Sbjct: 1512 GTASRIIFTDVAPKHAHISRARICDLFLDTPECNAHTTSADILWSGTPLITLPRYKYKMC 1571

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
            SR+A+S L +             ELI  + +EY++ AIRLG D
Sbjct: 1572 SRMASSILTSALPQNAAGREAAAELITSSEEEYENNAIRLGLD 1614


>gi|392868271|gb|EAS34034.2| UDP-N-acetylglucosaminyltransferase [Coccidioides immitis RS]
          Length = 1676

 Score =  154 bits (388), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 74/163 (45%), Positives = 109/163 (66%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D I++ NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ +A A   +
Sbjct: 1452 LPDDTIIFGNFNQLYKIEPTTFRSWLRILARIPNAILWLLRFPDLGEQNLRQSAVAWAGE 1511

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTTS D+LW+GTP++TLP     + 
Sbjct: 1512 GTASRIIFTDVAPKHAHISRARICDLFLDTPECNAHTTSADILWSGTPLITLPRYKYKMC 1571

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
            SR+A+S L +             ELI  + +EY++ AIRLG D
Sbjct: 1572 SRMASSILTSALPQNAAGREAAAELITSSEEEYENNAIRLGLD 1614


>gi|434406348|ref|YP_007149233.1| TIGR03032 family protein [Cylindrospermum stagnale PCC 7417]
 gi|428260603|gb|AFZ26553.1| TIGR03032 family protein [Cylindrospermum stagnale PCC 7417]
          Length = 1083

 Score =  154 bits (388), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 72/155 (46%), Positives = 100/155 (64%), Gaps = 2/155 (1%)

Query: 43   SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLL-KFPAVGEANIQAT 101
            +R QYGLPE   V+C FN  YKI+P     W+ +L  VP  +LWLL +FP V + N++  
Sbjct: 871  TRSQYGLPESGFVFCCFNNNYKIEPLIFDAWMRILADVPEGVLWLLPRFP-VAKENLRRE 929

Query: 102  AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
            A+  G+ + R++F++V  K EH+ R +LAD+ LDT   N HT++ D LW G PV+T PG 
Sbjct: 930  AEMRGISEERLIFADVEPKPEHLARHRLADLFLDTLYYNAHTSATDALWAGLPVITCPGT 989

Query: 162  TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
            T ASRVAAS L  +G PEL+    +EY+ +AI L 
Sbjct: 990  TFASRVAASLLTAVGLPELVTANLQEYEQLAINLA 1024


>gi|169605775|ref|XP_001796308.1| hypothetical protein SNOG_05914 [Phaeosphaeria nodorum SN15]
 gi|160706839|gb|EAT86978.2| hypothetical protein SNOG_05914 [Phaeosphaeria nodorum SN15]
          Length = 1609

 Score =  152 bits (384), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 74/163 (45%), Positives = 111/163 (68%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP++AI+  NFNQLYKI+P+T + W+ +L+ +PN+ILWLL+FP +GE N++ TA      
Sbjct: 1377 LPDNAIILGNFNQLYKIEPTTFRTWLRILERLPNAILWLLRFPDLGETNLKQTAYMWAGA 1436

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +  +RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1437 EVANRIIFTDVAQKSQHISRARVCDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1496

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGTD 198
            SR+AAS L               ELIA   ++Y++ A+RLG+D
Sbjct: 1497 SRMAASILKGALPKSSEGVQAAKELIAANDEDYEEKAVRLGSD 1539


>gi|440474923|gb|ELQ43638.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
            [Magnaporthe oryzae Y34]
 gi|440489796|gb|ELQ69415.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
            [Magnaporthe oryzae P131]
          Length = 1524

 Score =  152 bits (383), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 73/160 (45%), Positives = 108/160 (67%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+DAI+  NFNQLYKIDP+T + W+ +L  VP ++LWLL+FP +GE N++ TA+A   +
Sbjct: 1297 LPDDAIILGNFNQLYKIDPTTFRTWLRILSQVPKAVLWLLRFPELGECNLRNTARAWAGE 1356

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
            +   RI+F++VA K++H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP     + 
Sbjct: 1357 KVASRIIFTDVAPKQQHISRARVCDLFLDTPECNAHTTAADILWSSTPLLTLPRYEYKMC 1416

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               ELIA   K+Y++ A+RL
Sbjct: 1417 SRMAASILKGALPKSEEGARAATELIASDEKDYEESAVRL 1456


>gi|389623941|ref|XP_003709624.1| UDP-N-acetylglucosaminyltransferase [Magnaporthe oryzae 70-15]
 gi|351649153|gb|EHA57012.1| UDP-N-acetylglucosaminyltransferase [Magnaporthe oryzae 70-15]
          Length = 1541

 Score =  151 bits (382), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 73/160 (45%), Positives = 108/160 (67%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+DAI+  NFNQLYKIDP+T + W+ +L  VP ++LWLL+FP +GE N++ TA+A   +
Sbjct: 1314 LPDDAIILGNFNQLYKIDPTTFRTWLRILSQVPKAVLWLLRFPELGECNLRNTARAWAGE 1373

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
            +   RI+F++VA K++H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP     + 
Sbjct: 1374 KVASRIIFTDVAPKQQHISRARVCDLFLDTPECNAHTTAADILWSSTPLLTLPRYEYKMC 1433

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               ELIA   K+Y++ A+RL
Sbjct: 1434 SRMAASILKGALPKSEEGARAATELIASDEKDYEESAVRL 1473


>gi|367036987|ref|XP_003648874.1| glycosyltransferase family 41 protein [Thielavia terrestris NRRL
            8126]
 gi|346996135|gb|AEO62538.1| glycosyltransferase family 41 protein [Thielavia terrestris NRRL
            8126]
          Length = 1601

 Score =  151 bits (381), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 74/160 (46%), Positives = 106/160 (66%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+DAI+  NFNQLYKIDP+T + W+ +L A P + LWLL+FP +GE +++ TA+    +
Sbjct: 1354 LPDDAIILGNFNQLYKIDPTTFRTWLRILAAAPKAYLWLLRFPELGETHLRRTARDWSGE 1413

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
                RI+F++VA K++H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP     + 
Sbjct: 1414 GVARRIIFTDVAPKQQHISRARVCDLFLDTPECNAHTTAADILWSSTPLLTLPRYEYKMC 1473

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               ELIAR  KEY++ AIRL
Sbjct: 1474 SRMAASILKGALPKGEEGEQAARELIARDEKEYEEFAIRL 1513


>gi|429864035|gb|ELA38419.1| udp-n-acetylglucosaminyltransferase [Colletotrichum gloeosporioides
            Nara gc5]
          Length = 1093

 Score =  150 bits (380), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 76/160 (47%), Positives = 109/160 (68%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+DAI+  NFNQLYKIDP+T + W+ +L  VP +ILWLL+FP +GE N++ TA+A   D
Sbjct: 867  LPDDAIILGNFNQLYKIDPTTFRTWLRILSQVPKAILWLLRFPELGETNLRRTAKAWAGD 926

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ +P++TLP     + 
Sbjct: 927  EAPSRIIFTDVAPKNQHISRARVCDLFLDTPECNAHTTAADVLWSSSPLLTLPRYPYKMC 986

Query: 165  SRVAASQL--ATLGCP-------ELIARTHKEYQDIAIRL 195
            SR+AAS L  A    P       +LIA + +EY+  AI+L
Sbjct: 987  SRMAASILKGALPRSPSGLEAANDLIASSEEEYEAFAIKL 1026


>gi|384487344|gb|EIE79524.1| hypothetical protein RO3G_04229 [Rhizopus delemar RA 99-880]
          Length = 456

 Score =  150 bits (380), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 69/154 (44%), Positives = 104/154 (67%), Gaps = 5/154 (3%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
           LP+D I++ NFNQLYK++P T ++W+ +L+ +PNSILWLL+FP  GE ++++ A      
Sbjct: 211 LPDDVIIFANFNQLYKLEPGTFRLWLRILERIPNSILWLLRFPPAGEHHLKSCAAEWAGP 270

Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
           Q   R++F++VA K  H+ RG++AD+ LDTP CN HTT+ D+LW+GTP+VT P     + 
Sbjct: 271 QVAQRVVFTDVAPKHIHIHRGRIADIFLDTPECNAHTTAADILWSGTPIVTWPKYLHKMC 330

Query: 165 SRVAAS-QLATLGCPELIARTHKEYQDIAIRLGT 197
           SRV AS  +AT    E+I    ++Y+   + L T
Sbjct: 331 SRVGASIAMATGFGKEMIVMNEQQYETRTVELAT 364


>gi|396488362|ref|XP_003842860.1| hypothetical protein LEMA_P086200.1 [Leptosphaeria maculans JN3]
 gi|312219437|emb|CBX99381.1| hypothetical protein LEMA_P086200.1 [Leptosphaeria maculans JN3]
          Length = 1999

 Score =  150 bits (380), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 76/163 (46%), Positives = 108/163 (66%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA--QALG 106
            LPEDAI+  NFNQLYKI+P+T + W+ +L+ +P +ILWLL+FP +GE N++ TA   A  
Sbjct: 1766 LPEDAIILANFNQLYKIEPTTFRTWLRILERLPRAILWLLRFPDLGETNLKNTALKWAGP 1825

Query: 107  LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1826 AVASRIMFTDVAQKSQHISRARVCDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1885

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGTD 198
            SR+AAS L               ELIA + ++Y+D A+RL  D
Sbjct: 1886 SRMAASILKGALPKNDAGVRAAMELIASSDEDYEDKAVRLARD 1928


>gi|451849276|gb|EMD62580.1| glycosyltransferase family 41 protein [Cochliobolus sativus ND90Pr]
          Length = 1644

 Score =  150 bits (379), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 73/163 (44%), Positives = 109/163 (66%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L ++AI+  NFNQLYKI+P+T + W+ +L+ +PN++LWLL+FP +GE N++ TA      
Sbjct: 1421 LSDNAIILANFNQLYKIEPTTFRTWLRILEKIPNAVLWLLRFPDLGETNLKQTALMWAGA 1480

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1481 EVASRIMFTDVAQKHQHISRARVCDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKHKMC 1540

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGTD 198
            SR+AAS L               ELIA + +EY+D+A+RL  D
Sbjct: 1541 SRMAASILKGALPKTPAGAQAAKELIASSDEEYEDMAVRLTRD 1583


>gi|403161889|ref|XP_003322196.2| hypothetical protein PGTG_03733 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
 gi|375171971|gb|EFP77777.2| hypothetical protein PGTG_03733 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 1596

 Score =  150 bits (379), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 74/155 (47%), Positives = 108/155 (69%), Gaps = 7/155 (4%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L E  +++ NFNQLYKIDP   + W+++L A+PNSILWLL+FPA GE +++ TA+     
Sbjct: 1312 LDEQTVIFANFNQLYKIDPLIFKYWLDILAAIPNSILWLLRFPAPGEPHLKITAERWAGR 1371

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET---- 162
            +   RI F++VA K  H++RG++AD+ LD+  CN HTT+ D+LW+GTP++TLP  T    
Sbjct: 1372 EVAGRIRFTDVAPKPIHIKRGRIADLFLDSTECNAHTTAADILWSGTPILTLPRPTHAHK 1431

Query: 163  LASRVAAS-QLATLGCPELIARTHKEYQDIAIRLG 196
            + SRVAAS  LAT    ++IA +  +Y++ AIR G
Sbjct: 1432 MCSRVAASIALATGYGSQMIATSMDDYKERAIRFG 1466


>gi|156037528|ref|XP_001586491.1| hypothetical protein SS1G_12478 [Sclerotinia sclerotiorum 1980]
 gi|154697886|gb|EDN97624.1| hypothetical protein SS1G_12478 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 1584

 Score =  150 bits (378), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 74/160 (46%), Positives = 106/160 (66%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  VP +ILWLL+FP +GE+N++ATA+    +
Sbjct: 1358 LSDDAIILGNFNQLYKIEPTTFRTWLRILDKVPRAILWLLRFPDLGESNLKATARDWAGE 1417

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1418 SVASRIWFTDVAPKHQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYKYKMC 1477

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               ELIAR  ++Y++ AIRL
Sbjct: 1478 SRMAASILKGALPRSEEGKRAASELIARDEEQYEEFAIRL 1517


>gi|452001265|gb|EMD93725.1| glycosyltransferase family 41 protein [Cochliobolus heterostrophus
            C5]
          Length = 1544

 Score =  149 bits (377), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 73/163 (44%), Positives = 109/163 (66%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L ++AI+  NFNQLYKI+P+T + W+ +L+ +PN++LWLL+FP +GE N++ TA      
Sbjct: 1321 LSDNAIILANFNQLYKIEPTTFRTWLRILEKIPNAVLWLLRFPDLGETNLKQTALMWAGA 1380

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1381 EVASRIMFTDVAQKHQHISRARVCDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1440

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGTD 198
            SR+AAS L               ELIA + +EY+D+A+RL  D
Sbjct: 1441 SRMAASILKGALPKTPAGAQAAKELIASSDEEYEDMAVRLTRD 1483


>gi|154245494|ref|YP_001416452.1| hypothetical protein Xaut_1547 [Xanthobacter autotrophicus Py2]
 gi|154159579|gb|ABS66795.1| conserved hypothetical protein [Xanthobacter autotrophicus Py2]
          Length = 459

 Score =  149 bits (377), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 72/158 (45%), Positives = 96/158 (60%), Gaps = 1/158 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR++ GLP D  V+C FNQ YKI P    +W  +L  VP+S+LWLL  P     N+++ A
Sbjct: 251 SRREAGLPADGFVFCCFNQPYKISPQMFDIWCRLLACVPDSVLWLLDTPHAA-GNLRSEA 309

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+D  R++F     + EH+RR QLAD+ LDT   N HTT+ D LW G P VT PG+T
Sbjct: 310 IARGIDARRLVFGANMGQAEHLRRLQLADLMLDTSPFNAHTTASDALWVGLPFVTCPGDT 369

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             SRVA S L  +G P+L+  +   Y D+A  L TD +
Sbjct: 370 FPSRVAGSILHAIGLPDLVTDSPDAYFDLAYTLATDEE 407


>gi|254464640|ref|ZP_05078051.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110kDa subunit [Rhodobacterales bacterium Y4I]
 gi|206685548|gb|EDZ46030.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110kDa subunit [Rhodobacterales bacterium Y4I]
          Length = 287

 Score =  149 bits (376), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 77/177 (43%), Positives = 102/177 (57%), Gaps = 6/177 (3%)

Query: 24  QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
           Q N    T   VP      SR + GLPEDA V+C+FN  YK+ P    +W+ +LK VP+S
Sbjct: 66  QPNDNKRTASRVP------SRAEAGLPEDAFVFCSFNSPYKVSPEEFDIWMKLLKLVPDS 119

Query: 84  ILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
           +LW     A  +ANI   A+  G+   RI+F+  A +E+H+ R QLADV LDT   N HT
Sbjct: 120 VLWFYAPRAETKANILKEAKKRGVAADRIVFAGFAGQEDHLARLQLADVFLDTFAVNAHT 179

Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           T+ D LW G PVVT  G+  A+R A S L   G  EL A T + YQ +A++L  + D
Sbjct: 180 TASDALWAGVPVVTKTGKQFAARFATSILNAAGLRELAASTPQRYQALALKLARNPD 236


>gi|310790631|gb|EFQ26164.1| tetratricopeptide [Glomerella graminicola M1.001]
          Length = 1607

 Score =  149 bits (376), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 76/160 (47%), Positives = 108/160 (67%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D+I+  NFNQLYKIDP+T + W+ +L  V N+ILWLL+FP +GE N++ TA+A    
Sbjct: 1381 LPDDSIIMGNFNQLYKIDPTTFRTWLRILSKVDNAILWLLRFPELGETNLRRTAEAWAGQ 1440

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1441 EVAKRIIFTDVAPKNQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1500

Query: 165  SRVAASQL--ATLGCPE-------LIARTHKEYQDIAIRL 195
            SR+AAS L  A    PE       LIA +  EY++ A+RL
Sbjct: 1501 SRMAASILRGALPHGPEGVEAAEDLIAGSEDEYEEFAVRL 1540


>gi|258565995|ref|XP_002583742.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237907443|gb|EEP81844.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 1554

 Score =  149 bits (375), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 72/163 (44%), Positives = 107/163 (65%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP++ I+  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ +A A   +
Sbjct: 1330 LPDNTIILGNFNQLYKIEPTTFRSWLRILARIPNAILWLLRFPDLGEQNLRQSAVAWAGE 1389

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTTS D+LW+GTP++T P     + 
Sbjct: 1390 ATASRIIFTDVAPKHAHISRARVCDLFLDTPECNAHTTSADILWSGTPLITFPRYKYKMC 1449

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
            SR+A+S L +             ELI  + +EY++ AIRLG D
Sbjct: 1450 SRMASSILTSALPQNAAGREAASELITSSEEEYENNAIRLGLD 1492


>gi|407921016|gb|EKG14185.1| Tetratricopeptide TPR-1 [Macrophomina phaseolina MS6]
          Length = 1605

 Score =  149 bits (375), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 73/162 (45%), Positives = 108/162 (66%), Gaps = 13/162 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D I+  NFNQLYKI+P+T + W+ +L  +P +ILWLL+FP VGE +++ TA A    
Sbjct: 1362 LPDDVIILGNFNQLYKIEPTTFRTWLRILANLPKAILWLLRFPDVGETHLKQTALAWAGP 1421

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   R++F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1422 EVASRVIFTDVAPKHQHISRARVCDLFLDTPECNAHTTAADVLWSGTPLLTLPRYPYKMC 1481

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGT 197
            SR+AAS L               ELIA+  +EY+ +AI+LG+
Sbjct: 1482 SRMAASILKGALPDNDAGQQAAKELIAQNEEEYERLAIKLGS 1523


>gi|343428307|emb|CBQ71837.1| related to UDP-N-acetylglucosaminyltransferase [Sporisorium reilianum
            SRZ2]
          Length = 2070

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 70/152 (46%), Positives = 105/152 (69%), Gaps = 5/152 (3%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D +++ +FNQLYK DPS  ++W+ +L  VP SILWLL+FPA GE ++   A+A   D
Sbjct: 1700 LPDDYVIFADFNQLYKCDPSLFRLWLRILARVPRSILWLLRFPAAGEHHLLREARAYAGD 1759

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
                R++F++VA K  H+ RG++AD+ LDT  CN HTT++D+LW+ TPV+T P     + 
Sbjct: 1760 AVAARVIFTDVAPKHIHIHRGRIADLFLDTTECNAHTTAVDILWSATPVLTWPRHMHKMC 1819

Query: 165  SRVAASQLATLGC-PELIARTHKEYQDIAIRL 195
            SRVAAS ++  G   E+  R+ +EY++ A+RL
Sbjct: 1820 SRVAASIVSATGFGDEMTVRSAEEYEERAVRL 1851


>gi|288963186|ref|YP_003453465.1| hypothetical protein AZL_f01610 [Azospirillum sp. B510]
 gi|288915438|dbj|BAI76921.1| hypothetical protein AZL_f01610 [Azospirillum sp. B510]
          Length = 669

 Score =  148 bits (373), Expect = 1e-33,   Method: Composition-based stats.
 Identities = 72/155 (46%), Positives = 98/155 (63%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R  YGLPE++ V+C+FN L K+ P+   +W+ +L  VP S+LWL      GE N++  A 
Sbjct: 456 RAAYGLPENSFVFCSFNALQKLSPAIFALWMRLLARVPGSVLWLYGGAPDGERNLRREAA 515

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A G+   R+LF++     +H+ R ++AD+CLDT    GHTT+ D LW G PVVT  G T 
Sbjct: 516 ARGVAPDRLLFASKLPLADHLARYRVADLCLDTLPYTGHTTTSDALWMGCPVVTCLGGTF 575

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           ASRVAAS L   G PELI  T +EY+ +A+RL  D
Sbjct: 576 ASRVAASLLTAAGLPELITHTLEEYETLAVRLAED 610


>gi|380485005|emb|CCF39637.1| UDP-N-acetylglucosamine-peptideN-acetylglucosaminyltransferase
           [Colletotrichum higginsianum]
          Length = 401

 Score =  148 bits (373), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 72/161 (44%), Positives = 105/161 (65%), Gaps = 13/161 (8%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
           LP+D I+  NFNQLYKIDP+T + W+ +L  V  SILWLL+FP +GE N++ TA+A   +
Sbjct: 175 LPDDTIIMGNFNQLYKIDPTTFRTWLRILSKVDKSILWLLRFPELGETNLRRTAKAWAGE 234

Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
           +   RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP     + 
Sbjct: 235 EVARRIIFTDVAPKNQHISRARVCDLFLDTPECNAHTTAADILWSSTPLLTLPRYPYKMC 294

Query: 165 SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLG 196
           SR+AAS L               +LIA +  EY++ A+RL 
Sbjct: 295 SRMAASILKGALPNDAEGLRAAKDLIAGSEDEYEEFAVRLA 335


>gi|402081136|gb|EJT76281.1| UDP-N-acetylglucosaminyltransferase [Gaeumannomyces graminis var.
            tritici R3-111a-1]
          Length = 1576

 Score =  148 bits (373), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 71/160 (44%), Positives = 108/160 (67%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D I+  NFNQLYKIDP+T + W+ +L AVP +ILWLL+FP +GE N++ TA+    +
Sbjct: 1349 LPDDTIILGNFNQLYKIDPTTFRAWLRILAAVPKAILWLLRFPDLGEVNLKRTAKEWAGE 1408

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
            Q  +RILF++VA K++H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP     + 
Sbjct: 1409 QVANRILFTDVAPKQQHISRARVCDLFLDTPECNAHTTAADILWSSTPLLTLPRYEYKMC 1468

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               +L+A    +Y+++A++L
Sbjct: 1469 SRMAASILKGALPNSEEGLRAASDLVAEDDVDYENLAVKL 1508


>gi|239609400|gb|EEQ86387.1| UDP-N-acetylglucosaminyltransferase [Ajellomyces dermatitidis ER-3]
          Length = 1687

 Score =  148 bits (373), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 75/163 (46%), Positives = 107/163 (65%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ TA A   +
Sbjct: 1463 LRDDAIILGNFNQLYKIEPTTFRTWLRILAHIPNAILWLLRFPELGEHNLKQTAVAWAGE 1522

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1523 ATASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1582

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
            SR+A+S L +             ELIA + +EY+  AI LG D
Sbjct: 1583 SRMASSILTSALPQTEAGRQAAHELIASSDEEYEKRAISLGLD 1625


>gi|302404586|ref|XP_003000130.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
           [Verticillium albo-atrum VaMs.102]
 gi|261360787|gb|EEY23215.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
           [Verticillium albo-atrum VaMs.102]
          Length = 234

 Score =  148 bits (373), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 72/163 (44%), Positives = 105/163 (64%), Gaps = 13/163 (7%)

Query: 48  GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGL 107
           G  +D ++  NFNQLYKIDPST + W+ +L  VP ++LWLL+FP +GE N++ TA+A   
Sbjct: 8   GFSDDTVILGNFNQLYKIDPSTFRSWLRILSRVPKAVLWLLRFPELGETNLKRTAKAWAG 67

Query: 108 DQ--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET--L 163
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++T P  +  +
Sbjct: 68  AEVASRIIFTDVAPKHLHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTFPRYSFKM 127

Query: 164 ASRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGT 197
            SR+AAS L               ELIA + +EY+ +AIRL +
Sbjct: 128 CSRMAASILKGALPKGPEGQRAAQELIASSDEEYESLAIRLAS 170


>gi|327355846|gb|EGE84703.1| UDP-N-acetylglucosaminyltransferase [Ajellomyces dermatitidis ATCC
            18188]
          Length = 1687

 Score =  148 bits (373), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 75/163 (46%), Positives = 107/163 (65%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ TA A   +
Sbjct: 1463 LRDDAIILGNFNQLYKIEPTTFRTWLRILAHIPNAILWLLRFPELGEHNLKQTAVAWAGE 1522

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1523 ATASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1582

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
            SR+A+S L +             ELIA + +EY+  AI LG D
Sbjct: 1583 SRMASSILTSALPQTEAGRQAAHELIASSDEEYEKRAISLGLD 1625


>gi|261196356|ref|XP_002624581.1| UDP-N-acetylglucosaminyltransferase [Ajellomyces dermatitidis
            SLH14081]
 gi|239595826|gb|EEQ78407.1| UDP-N-acetylglucosaminyltransferase [Ajellomyces dermatitidis
            SLH14081]
          Length = 1659

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 75/163 (46%), Positives = 107/163 (65%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ TA A   +
Sbjct: 1435 LRDDAIILGNFNQLYKIEPTTFRTWLRILAHIPNAILWLLRFPELGEHNLKQTAVAWAGE 1494

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1495 ATASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1554

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
            SR+A+S L +             ELIA + +EY+  AI LG D
Sbjct: 1555 SRMASSILTSALPQTEAGRQAAHELIASSDEEYEKRAISLGLD 1597


>gi|367024217|ref|XP_003661393.1| glycosyltransferase family 41 protein [Myceliophthora thermophila
            ATCC 42464]
 gi|347008661|gb|AEO56148.1| glycosyltransferase family 41 protein [Myceliophthora thermophila
            ATCC 42464]
          Length = 1574

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 73/160 (45%), Positives = 105/160 (65%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+DAI+  NFNQLYKIDP+T + W+ +L A P + LWLL+FP +GE +++ TA+  G +
Sbjct: 1348 LPDDAILLGNFNQLYKIDPTTFRSWLRILAAAPKAYLWLLRFPELGETHLRRTARDWGGE 1407

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
                RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP     + 
Sbjct: 1408 GVASRIIFTDVAPKLQHISRARVCDLFLDTPECNAHTTAADILWSNTPLLTLPRYEYKMC 1467

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               ELIAR   EY++ A+RL
Sbjct: 1468 SRMAASILKGALPKGEAGEQAARELIARDEAEYEEFAVRL 1507


>gi|375104105|ref|ZP_09750366.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Burkholderiales bacterium JOSHI_001]
 gi|374664836|gb|EHR69621.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Burkholderiales bacterium JOSHI_001]
          Length = 630

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 73/153 (47%), Positives = 98/153 (64%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R  YGLPEDA VYC+FN  +K  P     W+ +LKAVPNS+LWLL      EAN++A A+
Sbjct: 418 RNTYGLPEDAFVYCSFNNNFKFTPEVFGAWMRMLKAVPNSVLWLLADNRWAEANMRAFAK 477

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G+D  R++F+   +  E+V R QLAD+ LDT   N  TT+ DVLW GTP++TL G + 
Sbjct: 478 ENGVDAARLIFAPRVSPAEYVARFQLADLVLDTFPYNAGTTANDVLWAGTPILTLSGRSY 537

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
            SR+  S L  +G PELI  +  EY+  AI++G
Sbjct: 538 ISRMCGSLLTAVGLPELITTSLAEYEQRAIQIG 570


>gi|342321253|gb|EGU13187.1| TPR Domain containing protein [Rhodotorula glutinis ATCC 204091]
          Length = 2015

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 69/153 (45%), Positives = 104/153 (67%), Gaps = 5/153 (3%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D +++ N+NQLYKIDP   ++W+ +LK  PNS+LWLL+FPA GEA+++ TA      
Sbjct: 1754 LRDDTVIFANWNQLYKIDPFIFRIWLEILKKHPNSVLWLLRFPAPGEAHLKETALRWAGK 1813

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   R++F++VA K +H+ RG++AD+ LDT  CN HTT+ D+LW+GTP++T P     + 
Sbjct: 1814 EVADRVIFTDVANKNDHIHRGRIADLFLDTTECNAHTTAADILWSGTPILTFPRHAHKMC 1873

Query: 165  SRVAAS-QLATLGCPELIARTHKEYQDIAIRLG 196
            SRVAAS  +AT   P++I     EY+  A+ L 
Sbjct: 1874 SRVAASIAMATGFGPQMIVNNEHEYERRALELA 1906


>gi|428179162|gb|EKX48034.1| hypothetical protein GUITHDRAFT_68991 [Guillardia theta CCMP2712]
          Length = 493

 Score =  147 bits (371), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 74/178 (41%), Positives = 106/178 (59%), Gaps = 1/178 (0%)

Query: 24  QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
           QTN+      E+  S  + SR+ +GLPE A+V+C FN+L +I       W+ +LK VPNS
Sbjct: 249 QTNSFQDLYPEILSSSCLPSRKDHGLPEHALVFCTFNRLGRITEHIFHAWIRILKRVPNS 308

Query: 84  ILWLLKFPAVGEANIQATAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNGH 142
           +LWL K P +    +   A+  G+D  R +F+  +  K EH++R  LADV LDT + NGH
Sbjct: 309 VLWLYKHPTMAVLRLLRAAREQGIDATRFVFAGPMMPKIEHLKRLTLADVYLDTHIYNGH 368

Query: 143 TTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           TT  D LW G P+VTL G+T  SRV AS    +G  E+IA     Y++ A+ LG +++
Sbjct: 369 TTGSDFLWAGVPMVTLQGDTFPSRVGASLARAVGMQEMIATDLAGYEEKAVELGNNKE 426


>gi|240279775|gb|EER43280.1| UDP-N-acetylglucosaminyltransferase [Ajellomyces capsulatus H143]
 gi|325092907|gb|EGC46217.1| UDP-N-acetylglucosaminyltransferase [Ajellomyces capsulatus H88]
          Length = 1696

 Score =  147 bits (371), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 74/163 (45%), Positives = 108/163 (66%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ TA A   +
Sbjct: 1472 LRDDAIILGNFNQLYKIEPTTFRTWLRILARIPNAILWLLRFPDLGEQNLKQTAVAWAGE 1531

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1532 ATASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1591

Query: 165  SRVAASQLAT---------LGCPELIARTHKEYQDIAIRLGTD 198
            SR+A+S L +         L   +LI+ +  EY++ AI LG D
Sbjct: 1592 SRMASSILTSALPQTEAGRLAARDLISSSDDEYENRAISLGLD 1634


>gi|225562952|gb|EEH11231.1| UDP-N-acetylglucosaminyltransferase [Ajellomyces capsulatus G186AR]
          Length = 1696

 Score =  147 bits (371), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 74/163 (45%), Positives = 108/163 (66%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ TA A   +
Sbjct: 1472 LRDDAIILGNFNQLYKIEPTTFRTWLRILARIPNAILWLLRFPDLGEQNLKQTAVAWAGE 1531

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1532 ATASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1591

Query: 165  SRVAASQLAT---------LGCPELIARTHKEYQDIAIRLGTD 198
            SR+A+S L +         L   +LI+ +  EY++ AI LG D
Sbjct: 1592 SRMASSILTSALPQTEAGRLAARDLISSSDDEYENRAISLGLD 1634


>gi|428179456|gb|EKX48327.1| hypothetical protein GUITHDRAFT_105934 [Guillardia theta CCMP2712]
          Length = 813

 Score =  147 bits (371), Expect = 2e-33,   Method: Composition-based stats.
 Identities = 73/163 (44%), Positives = 103/163 (63%), Gaps = 3/163 (1%)

Query: 34  EVPQSIVITSRQQYG-LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA 92
           ++P S V  SR+ +  LP   I+   FNQLYKIDP   Q W+ +LK  P+++LWLL+FP 
Sbjct: 606 DIPSSRV--SRKNFAELPAHGILLACFNQLYKIDPLVFQTWMKILKQHPHALLWLLEFPP 663

Query: 93  VGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTG 152
           V   NI+  A A G+   R++F     K E++ RG LAD+ LDTPL N HT++ D+LW G
Sbjct: 664 VAVENIRKEAIARGVSSDRLVFGKTLPKREYLERGALADLFLDTPLVNAHTSATDILWAG 723

Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
            P++TL  E++ SRVAAS     G PE++A   +EY+ +A  L
Sbjct: 724 VPMLTLSRESMISRVAASLCLAAGRPEMVAINLEEYEQVASAL 766


>gi|194335759|ref|YP_002017553.1| hypothetical protein Ppha_0637 [Pelodictyon phaeoclathratiforme BU-1]
 gi|194308236|gb|ACF42936.1| TPR repeat-containing protein [Pelodictyon phaeoclathratiforme BU-1]
          Length = 4489

 Score =  147 bits (371), Expect = 3e-33,   Method: Composition-based stats.
 Identities = 70/158 (44%), Positives = 100/158 (63%)

Query: 41   ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
            + +R++ GLPE   V+C FN  YKI P T + W+ +LK V  S+LWLL+      AN++ 
Sbjct: 2654 VFTRKELGLPEIGFVFCCFNNNYKITPVTFEGWMRILKRVEGSVLWLLEDNPKAAANLRK 2713

Query: 101  TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             A   G+D  R++F+      EH+ R ++AD+ LDT  CN HTT+ D LW G P++TL G
Sbjct: 2714 EATERGVDAERLIFAKRMPLAEHLARHRIADLFLDTFPCNAHTTASDALWAGLPLLTLAG 2773

Query: 161  ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ET A+RVAAS L  +  PELI  T ++Y+ +AI L T+
Sbjct: 2774 ETFAARVAASLLNAIQLPELITFTQEKYEALAIELATN 2811



 Score =  144 bits (364), Expect = 1e-32,   Method: Composition-based stats.
 Identities = 69/158 (43%), Positives = 97/158 (61%)

Query: 41   ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
            + +R++ GLPE   V+C FN  YKI P T   W+ +LK V  S+LWLL        N++ 
Sbjct: 3416 VFTRKECGLPEKGFVFCCFNNNYKITPVTFDGWMRILKRVEGSVLWLLGGNPKAVENLRK 3475

Query: 101  TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             +   G+D  R++F+      EH+ R +LAD+ +DT  CN HTT+ D LW G P++TL G
Sbjct: 3476 ESTERGVDAERLIFAKRMPLAEHLARHRLADLFIDTFPCNAHTTASDALWAGLPLLTLAG 3535

Query: 161  ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ET A+RVAAS L  +  PELI  T +EY+ +AI L T+
Sbjct: 3536 ETFAARVAASLLNAIQLPELITSTQEEYEALAIELATN 3573



 Score =  142 bits (358), Expect = 7e-32,   Method: Composition-based stats.
 Identities = 67/158 (42%), Positives = 95/158 (60%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R++ GLPE   V+C FN  YKI P     W+ +L+ V  S+ W+ +       N++  A
Sbjct: 545 TRKECGLPEKGFVFCCFNNNYKITPFVFDSWMRILRQVEGSVFWIFEDNPKAAENLRKEA 604

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+D +R++FS      EH+ R +LAD+ LDT  CN HTT+ D LW G P++TL GET
Sbjct: 605 TRRGVDSNRLIFSKRIPLAEHLARHRLADLFLDTHPCNAHTTASDALWAGLPLLTLAGET 664

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             +RVAAS L  +  PELI  T +EY+ +AI L T+ D
Sbjct: 665 FVARVAASLLNAIQLPELITCTQEEYEALAIELATNPD 702



 Score =  142 bits (357), Expect = 1e-31,   Method: Composition-based stats.
 Identities = 67/158 (42%), Positives = 96/158 (60%)

Query: 41   ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
            + +R++ GLPE   V+C FN  YKI P T   W+ +LK V  S+LWLL        N++ 
Sbjct: 2029 VFTRKECGLPEKGFVFCCFNNNYKITPVTFDGWMRILKRVEGSVLWLLGGNPKAVENLRK 2088

Query: 101  TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             +   G+D  R++F+      EH+ R ++AD+ +DT  CN HTT+ D LW G P++TL G
Sbjct: 2089 ESTERGVDAERLIFAKRMPLAEHLARHRIADLFIDTFPCNAHTTASDALWAGLPLLTLAG 2148

Query: 161  ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ET  +RVAAS L  +  PELI  T +EY+ +AI L T+
Sbjct: 2149 ETFVARVAASLLNAIQLPELITSTQEEYEALAIELATN 2186



 Score =  140 bits (354), Expect = 2e-31,   Method: Composition-based stats.
 Identities = 68/158 (43%), Positives = 96/158 (60%)

Query: 41   ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
            + +R++ GLPE   ++C FN  YKI P T   W+ +L+ V  S+LWLL+       N++ 
Sbjct: 4279 VFTRKECGLPEKGFIFCCFNNNYKITPVTFDGWMRLLRQVEGSVLWLLEDNVTAADNLRK 4338

Query: 101  TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             A   G+D  RI+F+      EH+ R +L  + LDT  CN HTT+ D LW G PV+TL G
Sbjct: 4339 EAIKRGIDAARIVFAQRMPLAEHLARQRLGYLFLDTFPCNAHTTASDALWVGLPVLTLAG 4398

Query: 161  ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            E+ ASRVAAS L  +  PELI  T +EY+ +AI + T+
Sbjct: 4399 ESFASRVAASLLNAMQLPELITSTQEEYEALAIDVATN 4436



 Score =  138 bits (348), Expect = 1e-30,   Method: Composition-based stats.
 Identities = 66/156 (42%), Positives = 95/156 (60%)

Query: 43   SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            +R++ GLPE   V+C FN  YKI P     W+ +L+ V  S+LWL +       N++  A
Sbjct: 1337 TRKELGLPEIGFVFCCFNNNYKITPLIFDGWMRILRQVEGSVLWLFEDTQRAAQNLRREA 1396

Query: 103  QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
               G+D  R++F+      EH+ R ++AD+ LDT  CN HTT+ D LW G P++TL GET
Sbjct: 1397 IEKGVDAERLIFAKRMPLAEHLARHRIADLFLDTFPCNAHTTASDALWAGLPLLTLAGET 1456

Query: 163  LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             A+RVAAS L  +  PELI  T ++Y+ +AI L T+
Sbjct: 1457 FAARVAASLLNAIQLPELITFTQEKYEALAIELATN 1492


>gi|154280266|ref|XP_001540946.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150412889|gb|EDN08276.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 1165

 Score =  147 bits (371), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 74/163 (45%), Positives = 108/163 (66%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ TA A   +
Sbjct: 941  LRDDAIILGNFNQLYKIEPTTFRTWLRILARIPNAILWLLRFPDLGEQNLKQTAVAWAGE 1000

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1001 ATASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1060

Query: 165  SRVAASQLAT---------LGCPELIARTHKEYQDIAIRLGTD 198
            SR+A+S L +         L   +LI+ +  EY++ AI LG D
Sbjct: 1061 SRMASSILTSALPQTEAGRLAARDLISSSDDEYENRAISLGLD 1103


>gi|189197921|ref|XP_001935298.1| UDP-N-acetylglucosaminyltransferase [Pyrenophora tritici-repentis
            Pt-1C-BFP]
 gi|187981246|gb|EDU47872.1| UDP-N-acetylglucosaminyltransferase [Pyrenophora tritici-repentis
            Pt-1C-BFP]
          Length = 1529

 Score =  147 bits (370), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 71/163 (43%), Positives = 107/163 (65%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L ++AI+  NFNQLYKI+P+T + W+ +L+ +PN++LWLL+FP +GE N++ TA      
Sbjct: 1300 LSDNAIILANFNQLYKIEPTTFRTWLRILEKIPNAVLWLLRFPDLGETNLKQTAHLWAGP 1359

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1360 EVASRIMFTDVAQKHQHISRARVCDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1419

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGTD 198
            SR+AAS L               ELIA    EY+++A+RL  +
Sbjct: 1420 SRMAASILKGALPKTPAGAQAAKELIASNDNEYEEMAVRLARN 1462


>gi|330927740|ref|XP_003301979.1| hypothetical protein PTT_13645 [Pyrenophora teres f. teres 0-1]
 gi|311322904|gb|EFQ89927.1| hypothetical protein PTT_13645 [Pyrenophora teres f. teres 0-1]
          Length = 1669

 Score =  147 bits (370), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 71/163 (43%), Positives = 107/163 (65%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L ++AI+  NFNQLYKI+P+T + W+ +L+ +PN++LWLL+FP +GE N++ TA      
Sbjct: 1440 LSDNAIILANFNQLYKIEPTTFRTWLRILEKIPNAVLWLLRFPDLGETNLKQTAHLWAGA 1499

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1500 EVASRIMFTDVAQKHQHISRARVCDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1559

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGTD 198
            SR+AAS L               ELIA    EY+++A+RL  +
Sbjct: 1560 SRMAASILKGALPKTPAGAQAAKELIASNDDEYEEMAVRLARN 1602


>gi|407772277|ref|ZP_11119579.1| hypothetical protein TH2_00220 [Thalassospira profundimaris WP0211]
 gi|407284230|gb|EKF09746.1| hypothetical protein TH2_00220 [Thalassospira profundimaris WP0211]
          Length = 680

 Score =  146 bits (368), Expect = 5e-33,   Method: Composition-based stats.
 Identities = 71/165 (43%), Positives = 102/165 (61%), Gaps = 2/165 (1%)

Query: 32  GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
           G E P++    +R + GLPED  V+C+FN  YKI P    +W+ +LK V  S+LWL +  
Sbjct: 461 GREFPENT--QTRAELGLPEDGFVFCSFNANYKISPREFDIWMRLLKKVDGSVLWLFEGS 518

Query: 92  AVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
                N++  A   G+D  R++F+    +E+H+ R + AD+ LDT   N HTT+ D LWT
Sbjct: 519 EDAMLNLRKEAGNRGVDPARLVFAGFLPEEQHLARHKHADLFLDTFNVNAHTTASDALWT 578

Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
           G P+VTLPG+  A+RVAAS L     PELIA++ +EY+ IA+ L 
Sbjct: 579 GLPLVTLPGKQFAARVAASVLKAANLPELIAKSEEEYEAIALDLA 623


>gi|295663513|ref|XP_002792309.1| UDP-N-acetylglucosaminyltransferase [Paracoccidioides sp. 'lutzii'
            Pb01]
 gi|226278979|gb|EEH34545.1| UDP-N-acetylglucosaminyltransferase [Paracoccidioides sp. 'lutzii'
            Pb01]
          Length = 1577

 Score =  145 bits (367), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 75/162 (46%), Positives = 107/162 (66%), Gaps = 15/162 (9%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ TA A   +
Sbjct: 1353 LRDDAIILGNFNQLYKIEPTTFRTWLRILSRIPNAILWLLRFPDLGEHNLKQTAIAWAGE 1412

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1413 AIASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1472

Query: 165  SRVAASQLAT----------LGCPELIARTHKEYQDIAIRLG 196
            SR+A+S L +            C ELIA + +EY+  AI LG
Sbjct: 1473 SRMASSILTSALPQTEAGRQAAC-ELIASSEEEYESRAISLG 1513


>gi|86137175|ref|ZP_01055753.1| TPR repeat protein [Roseobacter sp. MED193]
 gi|85826499|gb|EAQ46696.1| TPR repeat protein [Roseobacter sp. MED193]
          Length = 741

 Score =  145 bits (367), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 66/158 (41%), Positives = 100/158 (63%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+  GLP++  V+C+FN  YKI P    +W+ +L  V +S+LWLL      + N++  A
Sbjct: 507 TRKDCGLPDNGFVFCSFNSSYKITPVEFDIWMRLLDQVEDSVLWLLDCSETSKTNLRKEA 566

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G D  R++F+   A+EEH+ R + AD+ LDT + N HTT+ D LW G PV+TLPG  
Sbjct: 567 KRRGQDPDRLIFAPRIAQEEHLARHRAADLFLDTFVVNAHTTASDALWAGLPVLTLPGRQ 626

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            ASRV AS ++ +G PE+IA++  +Y+  A+ L  D D
Sbjct: 627 FASRVGASLVSAMGLPEMIAKSAADYEARALELANDLD 664


>gi|395334322|gb|EJF66698.1| TPR-like protein [Dichomitus squalens LYAD-421 SS1]
          Length = 1243

 Score =  145 bits (367), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 71/153 (46%), Positives = 101/153 (66%), Gaps = 5/153 (3%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            +P+D +++ NFNQLYK+DPS   +WV +L+ VP SILWLL+FP+ GE ++  +A+  G D
Sbjct: 991  VPQDVVIFANFNQLYKVDPSIFLVWVRILRKVPRSILWLLRFPSAGEEHLLRSARLWGGD 1050

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
                RI F++VA K+ HV R + AD+ LDT  CN HT + DVLWTGTP++T P     + 
Sbjct: 1051 DVASRIHFTDVAKKDWHVYRARAADIFLDTAECNAHTIAADVLWTGTPILTWPKHRHKMC 1110

Query: 165  SRVAASQLATLGCPE-LIARTHKEYQDIAIRLG 196
            SRVAAS L   G  + +I  + +EY+  A+ L 
Sbjct: 1111 SRVAASMLQATGFADHMIVHSAEEYEARAVALA 1143


>gi|83765645|dbj|BAE55788.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1612

 Score =  145 bits (366), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 72/160 (45%), Positives = 109/160 (68%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  +PN++LWLL+FP +GE N++ TA A   +
Sbjct: 1388 LRDDAIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLKETAVAWAGE 1447

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1448 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1507

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
            SR+A+S L++             ELIA + ++Y++ AIRL
Sbjct: 1508 SRMASSILSSALPDSDDGQKAREELIATSDEDYEEKAIRL 1547


>gi|238483389|ref|XP_002372933.1| UDP-N-acetylglucosaminyltransferase [Aspergillus flavus NRRL3357]
 gi|220700983|gb|EED57321.1| UDP-N-acetylglucosaminyltransferase [Aspergillus flavus NRRL3357]
 gi|391864779|gb|EIT74073.1| O-linked N-acetylglucosamine transferase OGT [Aspergillus oryzae
            3.042]
          Length = 1668

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 72/160 (45%), Positives = 109/160 (68%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  +PN++LWLL+FP +GE N++ TA A   +
Sbjct: 1444 LRDDAIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLKETAVAWAGE 1503

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1504 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1563

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
            SR+A+S L++             ELIA + ++Y++ AIRL
Sbjct: 1564 SRMASSILSSALPDSDDGQKAREELIATSDEDYEEKAIRL 1603


>gi|317139826|ref|XP_001817790.2| UDP-N-acetylglucosaminyltransferase [Aspergillus oryzae RIB40]
          Length = 1457

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 72/160 (45%), Positives = 109/160 (68%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  +PN++LWLL+FP +GE N++ TA A   +
Sbjct: 1233 LRDDAIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLKETAVAWAGE 1292

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1293 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1352

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
            SR+A+S L++             ELIA + ++Y++ AIRL
Sbjct: 1353 SRMASSILSSALPDSDDGQKAREELIATSDEDYEEKAIRL 1392


>gi|225677798|gb|EEH16082.1| UDP-N-acetylglucosaminyltransferase [Paracoccidioides brasiliensis
            Pb03]
          Length = 1671

 Score =  145 bits (365), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 74/162 (45%), Positives = 107/162 (66%), Gaps = 15/162 (9%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ TA A   +
Sbjct: 1447 LRDDAIILGNFNQLYKIEPTTFRTWLRILSRIPNAILWLLRFPDLGEHNLKQTAIAWAGE 1506

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1507 AIASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1566

Query: 165  SRVAASQLAT----------LGCPELIARTHKEYQDIAIRLG 196
            SR+A+S L +            C ELIA + +EY+  A+ LG
Sbjct: 1567 SRMASSILTSALPQTEAGRQAAC-ELIASSEEEYESRAVSLG 1607


>gi|346979671|gb|EGY23123.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
            [Verticillium dahliae VdLs.17]
          Length = 1546

 Score =  145 bits (365), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 71/160 (44%), Positives = 104/160 (65%), Gaps = 13/160 (8%)

Query: 51   EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
            +D ++  NFNQLYKIDPST + W+ +L  VP ++LWLL+FP +GE N++ TA+A    + 
Sbjct: 1323 DDTVILGNFNQLYKIDPSTFRSWLRILSRVPKAVLWLLRFPELGETNLKRTAKAWAGAEV 1382

Query: 110  -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET--LASR 166
              RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++T P  +  + SR
Sbjct: 1383 ASRIIFTDVAPKHLHIARARVCDLFLDTPECNAHTTAADVLWSSTPLLTFPRYSFKMCSR 1442

Query: 167  VAASQL---------ATLGCPELIARTHKEYQDIAIRLGT 197
            +AAS L               ELIA + +EY+ +AIRL +
Sbjct: 1443 MAASILKGALPKGPEGQRAAQELIASSDEEYESLAIRLAS 1482


>gi|226287468|gb|EEH42981.1| UDP-N-acetylglucosaminyltransferase [Paracoccidioides brasiliensis
            Pb18]
          Length = 1701

 Score =  145 bits (365), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 74/162 (45%), Positives = 107/162 (66%), Gaps = 15/162 (9%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ TA A   +
Sbjct: 1477 LRDDAIILGNFNQLYKIEPTTFRTWLRILSRIPNAILWLLRFPDLGEHNLKQTAIAWAGE 1536

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1537 AIASRIVFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTLPRYKYKMC 1596

Query: 165  SRVAASQLAT----------LGCPELIARTHKEYQDIAIRLG 196
            SR+A+S L +            C ELIA + +EY+  A+ LG
Sbjct: 1597 SRMASSILTSALPQTEAGRQAAC-ELIASSEEEYESRAVSLG 1637


>gi|374999539|ref|YP_004975627.1| hypothetical protein AZOLI_p60044 [Azospirillum lipoferum 4B]
 gi|357428510|emb|CBS91467.1| conserved protein of unknown function; putative TPR domain
           [Azospirillum lipoferum 4B]
          Length = 679

 Score =  145 bits (365), Expect = 1e-32,   Method: Composition-based stats.
 Identities = 69/158 (43%), Positives = 99/158 (62%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           + SR + GLP D +V+  FN  YKI P+   +W+ +LK VP S+LWL +   +  AN+ A
Sbjct: 467 VPSRAECGLPADGVVFAAFNTAYKISPAMFALWMRILKRVPGSVLWLFEANPLAAANLTA 526

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
           +AQA G+D  R++F+      +H+ R ++AD+ LDT    GHTT+ D LW G PVVT  G
Sbjct: 527 SAQAQGVDPARLVFAPPRPLPDHIARYRVADLALDTLPYTGHTTTSDALWAGCPVVTCLG 586

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            T ASRVAAS L   G P+ + R+  EY+++A+ L  D
Sbjct: 587 GTFASRVAASLLNAAGLPDTVTRSLAEYEEMAVALAGD 624


>gi|332526962|ref|ZP_08403051.1| TPR repeat-containing protein [Rubrivivax benzoatilyticus JA2]
 gi|332111401|gb|EGJ11384.1| TPR repeat-containing protein [Rubrivivax benzoatilyticus JA2]
          Length = 622

 Score =  145 bits (365), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 70/157 (44%), Positives = 99/157 (63%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR++ GLPE A V+C FNQ YK++P+ L +W  +L  VP+S+LWLL++      N+ A  
Sbjct: 408 SRREEGLPEGATVFCCFNQSYKLNPAMLDLWARILAQVPDSVLWLLEWNGQAPRNLLAEL 467

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+ + R++F    + E+H+ R Q AD+ LDT  CN HTT+ + LW G PV+T+PG+T
Sbjct: 468 AARGIARERVVFGKRVSIEDHLARLQCADLFLDTWPCNAHTTASEALWAGVPVLTVPGQT 527

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASRVAAS L      EL+      Y D+A+ L  DR
Sbjct: 528 FASRVAASLLHACELDELVCGDGDAYVDLAVGLARDR 564


>gi|154316161|ref|XP_001557402.1| hypothetical protein BC1G_03666 [Botryotinia fuckeliana B05.10]
 gi|347836428|emb|CCD51000.1| glycosyltransferase family 41 protein [Botryotinia fuckeliana]
          Length = 1576

 Score =  144 bits (364), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 72/160 (45%), Positives = 104/160 (65%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  VP +ILWLL+FP +GE+N++ATA+    +
Sbjct: 1350 LSDDAIILGNFNQLYKIEPTTFRTWLRILDKVPKAILWLLRFPDLGESNLKATAREWAGE 1409

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1410 SVASRIWFTDVAPKHLHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYKYKMC 1469

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               ELI++   +Y++ AIRL
Sbjct: 1470 SRMAASILKGALPRSEEGDRAASELISKDEAQYEEFAIRL 1509


>gi|449551087|gb|EMD42051.1| glycosyltransferase family 41 protein [Ceriporiopsis subvermispora B]
          Length = 1365

 Score =  144 bits (364), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 69/150 (46%), Positives = 100/150 (66%), Gaps = 5/150 (3%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            +P+D +++ NFNQLYKIDP    +W+ +L  VP S+LWLL+FPA GE +I+ TA+    +
Sbjct: 1114 MPQDVVIFANFNQLYKIDPGIFAVWLRILVQVPRSVLWLLRFPAAGEEHIKCTARMWANE 1173

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
            +   RILF++V  KEEHV R ++AD+ LDT  CN HT + DVLWTGTP++T P     + 
Sbjct: 1174 EVASRILFTDVTRKEEHVYRTRVADLFLDTVECNAHTIAADVLWTGTPIITFPKHRHKMC 1233

Query: 165  SRVAASQLATLGC-PELIARTHKEYQDIAI 193
            SRVAAS     G   +++A +  +Y+  A+
Sbjct: 1234 SRVAASMAHATGFGDQMVASSMDDYETRAV 1263


>gi|329895776|ref|ZP_08271152.1| TPR repeat-containing protein [gamma proteobacterium IMCC3088]
 gi|328922138|gb|EGG29495.1| TPR repeat-containing protein [gamma proteobacterium IMCC3088]
          Length = 807

 Score =  144 bits (364), Expect = 1e-32,   Method: Composition-based stats.
 Identities = 68/159 (42%), Positives = 98/159 (61%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           TSR  +GLPE+A V C FN  +KI P    +W+ +L  +  ++LWLLK     E N++  
Sbjct: 591 TSRADFGLPENAFVLCCFNNNFKISPREFDIWMRILNRLEKAVLWLLKTNKWTEHNLRKE 650

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           A+  G+D  RI+F+    + EH+ R + AD+ +DT  CN HTT+ D LW G PVVT  G 
Sbjct: 651 AKQRGIDPSRIVFAERLPQSEHLARHKHADLFIDTFNCNAHTTASDALWAGLPVVTKQGS 710

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             A+RV+AS L ++G PELI  T +EY+ + + L TD +
Sbjct: 711 QFAARVSASLLNSIGLPELIVNTQEEYEALILELATDTN 749


>gi|381166068|ref|ZP_09875286.1| conserved hypothetical protein [Phaeospirillum molischianum DSM
           120]
 gi|380684800|emb|CCG40098.1| conserved hypothetical protein [Phaeospirillum molischianum DSM
           120]
          Length = 671

 Score =  144 bits (364), Expect = 2e-32,   Method: Composition-based stats.
 Identities = 73/163 (44%), Positives = 96/163 (58%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           P S     R  YGLP+ A V+C FNQ YKI+P     W+ +L+ VP+S+LWLL      E
Sbjct: 455 PISPDSAERAAYGLPDGAFVFCCFNQAYKIEPIMFGRWMRILERVPDSVLWLLGDSVAME 514

Query: 96  ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
            N++  AQA G+D  R++F+    K  H+ R + A + LDT   N HTT+ D LW G P+
Sbjct: 515 TNLRREAQARGIDPARLVFAAREPKPRHLARHRHAGLGLDTLFYNAHTTASDALWAGLPI 574

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +T PGE  ASRV AS L  LG PELI      Y++ A+ L TD
Sbjct: 575 LTTPGEAFASRVGASLLGALGLPELICPDLDAYEEKAVALATD 617


>gi|358366822|dbj|GAA83442.1| UDP-N-acetylglucosaminyltransferase [Aspergillus kawachii IFO 4308]
          Length = 1670

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 71/160 (44%), Positives = 108/160 (67%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I+  NFNQLYKI+P+T + W+ +L  +PN++LWLL+FP +GE N++ TA A   +
Sbjct: 1446 LSDDTIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLRETAVAWAGE 1505

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1506 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1565

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
            SR+A+S L++             +LIA + ++Y+D AIRL
Sbjct: 1566 SRMASSILSSALPKSDAGREAQSDLIAMSDEDYEDKAIRL 1605


>gi|406867525|gb|EKD20563.1| UDP-N-acetylglucosaminyltransferase [Marssonina brunnea f. sp.
            'multigermtubi' MB_m1]
          Length = 1710

 Score =  144 bits (363), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 72/162 (44%), Positives = 105/162 (64%), Gaps = 13/162 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  VP +ILWLL+FP +GE+N++ TAQ    +
Sbjct: 1410 LSDDAIILGNFNQLYKIEPTTFRTWLRILDRVPKAILWLLRFPDLGESNLKRTAQKWAGE 1469

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1470 NVASRIWFTDVAPKHQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYKYKMC 1529

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGT 197
            SR+AAS L               ELIA    +Y+++AI+L +
Sbjct: 1530 SRMAASILKGALPRGEEGRRAAEELIAEDDTQYEELAIKLAS 1571


>gi|296811472|ref|XP_002846074.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
            [Arthroderma otae CBS 113480]
 gi|238843462|gb|EEQ33124.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
            [Arthroderma otae CBS 113480]
          Length = 1698

 Score =  144 bits (363), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 70/161 (43%), Positives = 104/161 (64%), Gaps = 13/161 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D ++  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP VGE N++ TA+A   +
Sbjct: 1473 LKDDTVILGNFNQLYKIEPTTFRTWLRILAGIPNAILWLLRFPDVGEQNLRQTAKAWAGE 1532

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P     + 
Sbjct: 1533 ATASRIIFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTFPRYKYKMC 1592

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLG 196
            SR+A+S L +             +LI+ +  EY+  AI LG
Sbjct: 1593 SRMASSILTSALPQTEDGKQAARQLISSSEAEYEQTAIELG 1633


>gi|320588655|gb|EFX01123.1| udp-n-acetylglucosaminyltransferase [Grosmannia clavigera kw1407]
          Length = 1733

 Score =  144 bits (363), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/161 (45%), Positives = 100/161 (62%), Gaps = 13/161 (8%)

Query: 51   EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
            +D ++  NFNQLYKIDP+T + W+ +L  VPN+ILWLL+FP  GEAN++ TA     +  
Sbjct: 1514 DDTVILGNFNQLYKIDPTTFRSWLRILAGVPNAILWLLRFPEAGEANLRRTALTWAGESV 1573

Query: 110  -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLASR 166
              RI+F++VA K  H+ R  + D+ LDTP CN HTT+ DVLW+GTP++TLP     + SR
Sbjct: 1574 ACRIIFTDVAPKHLHIARASVCDIFLDTPECNAHTTAADVLWSGTPLLTLPRYNYKMCSR 1633

Query: 167  VAASQL---------ATLGCPELIARTHKEYQDIAIRLGTD 198
            +AAS L               ELIA    +Y+  AIRL  +
Sbjct: 1634 IAASILRGALPKDAEGKQAAAELIAVDESDYEKSAIRLANN 1674


>gi|388852878|emb|CCF53563.1| related to UDP-N-acetylglucosaminyltransferase [Ustilago hordei]
          Length = 2087

 Score =  144 bits (362), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 67/153 (43%), Positives = 100/153 (65%), Gaps = 5/153 (3%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D +++ +FNQLYK DP   ++W+ +LK VP SILWLL+FPA GE ++   A+    D
Sbjct: 1690 LPDDYVIFADFNQLYKCDPMLFKLWLRILKRVPKSILWLLRFPAAGEHHLLRQARRYAGD 1749

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
            +   R++F++VA K  H+ RG++AD+ LDT  CN HTT+ D+LW+ TPV+T P     + 
Sbjct: 1750 EVASRVIFTDVAPKHIHIHRGRIADLFLDTTECNAHTTAADILWSATPVLTWPRHMHKMC 1809

Query: 165  SRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
            SRVAAS +   G   E+   + +EY+D A+   
Sbjct: 1810 SRVAASIVHATGFGEEMTVNSEQEYEDRAVEFA 1842


>gi|407772275|ref|ZP_11119577.1| hypothetical protein TH2_00210 [Thalassospira profundimaris WP0211]
 gi|407284228|gb|EKF09744.1| hypothetical protein TH2_00210 [Thalassospira profundimaris WP0211]
          Length = 628

 Score =  144 bits (362), Expect = 3e-32,   Method: Composition-based stats.
 Identities = 68/155 (43%), Positives = 94/155 (60%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           T+R  +GLPE   V+C+FN  YKI P    +W+ +L  V  S+LWL K       N++  
Sbjct: 416 TTRADFGLPETGFVFCSFNNTYKITPREFDIWMRLLSKVEGSVLWLFKGNDYAAENLRKE 475

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           AQ  G+D  R++F++   + EH+ R + AD+ LDT   N HTT+ D LW G P+VTLPGE
Sbjct: 476 AQKRGIDPGRLVFADKMPEPEHLARHKHADLLLDTFNVNAHTTASDALWAGLPLVTLPGE 535

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
             A+RVAAS L     PELIA+   +Y+ IA+ L 
Sbjct: 536 QFAARVAASILTAANLPELIAKDEADYEAIALDLA 570


>gi|313201112|ref|YP_004039770.1| hypothetical protein MPQ_1373 [Methylovorus sp. MP688]
 gi|312440428|gb|ADQ84534.1| Tetratricopeptide TPR_2 repeat protein [Methylovorus sp. MP688]
          Length = 697

 Score =  143 bits (361), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 97/156 (62%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R  YGLPE+A VYC FNQ +KI P+    W+ +L+ VP+S+LWLL+  A   AN+Q  A
Sbjct: 500 ARVDYGLPEEAFVYCCFNQSFKITPTVFACWMRILQQVPDSVLWLLESNATATANLQQAA 559

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+D  R++F+   A  +H+ R  LAD+ LDT   N HTT+ D LW   PV+T  G+T
Sbjct: 560 KTAGIDPARLVFAPRVAMADHLARHALADLFLDTLPYNAHTTASDALWMCLPVLTCSGDT 619

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRVA S L  +  PELIA     Y+ +AI L  D
Sbjct: 620 FASRVAGSLLHAVNLPELIAPDMAAYEAMAISLRHD 655


>gi|398390604|ref|XP_003848762.1| hypothetical protein MYCGRDRAFT_101342 [Zymoseptoria tritici IPO323]
 gi|339468638|gb|EGP83738.1| UDP-N-ACETYLGLUCOSAMINE--peptide N-ACETYLGLUCOSAMINYLtransferase
            [Zymoseptoria tritici IPO323]
          Length = 1526

 Score =  143 bits (361), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 72/161 (44%), Positives = 105/161 (65%), Gaps = 13/161 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L ++ I++ NFNQLYKIDP+T + W+ +L  VPNSILWLL+FP +GE+++ ATA+     
Sbjct: 1295 LSDETIIFGNFNQLYKIDPTTFRTWLRILARVPNSILWLLRFPDLGESHLLATARLWAGS 1354

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   R++F++VA K  H+ R ++ D+ LDT  CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1355 EVASRVIFTDVAPKHLHISRARVCDLVLDTAECNAHTTAADVLWSGTPLLTLPRYAYKMC 1414

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLG 196
            SR+AAS L               ELIA   ++Y+D A+ LG
Sbjct: 1415 SRMAASILRGALPKNEEGDQAAVELIAHNEEDYEDKAVALG 1455


>gi|119357636|ref|YP_912280.1| hypothetical protein Cpha266_1840 [Chlorobium phaeobacteroides DSM
           266]
 gi|119354985|gb|ABL65856.1| TPR repeat-containing protein [Chlorobium phaeobacteroides DSM 266]
          Length = 3560

 Score =  143 bits (360), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 69/158 (43%), Positives = 93/158 (58%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           + +R + GLPE   V+C FN  YKI P+T   W+ +L  VP S+LWL +       N+  
Sbjct: 562 VFTRAECGLPESGFVFCCFNNNYKITPATFDGWMRILGQVPGSVLWLFEDNPAVTGNLGR 621

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            A   G+   R++F+      EH+ R +L D+ LDT  CN HTT+ D LW G PV+T  G
Sbjct: 622 EAATRGIAGDRLVFAKRMPLPEHLARHRLGDLFLDTLPCNAHTTASDALWAGLPVLTCMG 681

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           E+ ASRVAAS L  +G PELI  T +EY+ +AI L  D
Sbjct: 682 ESFASRVAASVLTAIGLPELITTTQEEYEALAIELALD 719



 Score =  138 bits (348), Expect = 1e-30,   Method: Composition-based stats.
 Identities = 67/158 (42%), Positives = 95/158 (60%)

Query: 41   ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
            + +R + GLPE   V+C FN  YKI P+T   W+ +L  VP S+LWL +  A    N++ 
Sbjct: 3347 VFTRAECGLPESGFVFCCFNNTYKITPATFDGWMRILGQVPGSVLWLFEENAKAAENLRR 3406

Query: 101  TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             A + G+D  R++F       E++ R ++AD+ LDT   N  TT+ D LW G PV+TL G
Sbjct: 3407 EAASRGVDAGRLIFGKRLPVAEYLARYRVADLFLDTLPYNAGTTASDALWAGLPVLTLRG 3466

Query: 161  ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            E+ ASR+AAS L  +G PELI    +EY+ +AI L  D
Sbjct: 3467 ESFASRMAASLLTAIGLPELITSGQEEYESLAIELALD 3504



 Score =  138 bits (348), Expect = 1e-30,   Method: Composition-based stats.
 Identities = 74/182 (40%), Positives = 104/182 (57%), Gaps = 7/182 (3%)

Query: 20   LATNQTN-TKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLK 78
            L++ Q N TK    E V       +R + GLPE   V+C FN  YKI P+T   W+ +L 
Sbjct: 1973 LSSYQVNDTKRRIAERV------FTRAECGLPESGFVFCCFNNTYKITPATFDGWMRILG 2026

Query: 79   AVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPL 138
             VP S+LWL +  A    N++  A + G+D  R++F       E++ R ++AD+ LDT  
Sbjct: 2027 QVPGSVLWLYEENAKAAENLRREAASRGVDAGRLIFGKRLPVAEYLARYRVADLFLDTLP 2086

Query: 139  CNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             N  TT+ D LW G PV+TL GE+ ASR+AAS L  +G PELI    +EY+ +AI L  D
Sbjct: 2087 YNAGTTASDALWAGLPVLTLRGESFASRMAASLLTAIGLPELITSGQEEYESLAIELALD 2146

Query: 199  RD 200
             +
Sbjct: 2147 SE 2148



 Score =  136 bits (342), Expect = 6e-30,   Method: Composition-based stats.
 Identities = 67/158 (42%), Positives = 94/158 (59%)

Query: 41   ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
            + +R + GLPE   V+C FN  YKI P+T   W+ +L  VP S+LWL +  A    N++ 
Sbjct: 1241 VFTRAECGLPESGFVFCCFNNTYKITPATFDGWMRILGQVPGSVLWLFEENAKAAENLRR 1300

Query: 101  TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             A + G+D  R++F       E++ R ++AD+ LDT   N  TT+ D LW G PV+TL G
Sbjct: 1301 EAASRGVDAGRLIFGKRLPVAEYLARYRVADLFLDTLPYNAGTTASDALWAGLPVLTLRG 1360

Query: 161  ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            E+ ASR+AAS L  +G PELI    +EY+  AI L  D
Sbjct: 1361 ESFASRMAASLLTAIGLPELITSGQEEYEARAIELALD 1398



 Score =  131 bits (329), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 71/180 (39%), Positives = 102/180 (56%), Gaps = 7/180 (3%)

Query: 20   LATNQTN-TKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLK 78
            L++ Q N TK    E V       +R + GLPE   V+C FN  YKI P+T   W+ +L 
Sbjct: 2652 LSSYQVNDTKRRIAERV------FTRAECGLPESGFVFCCFNNNYKITPATFDGWMRILG 2705

Query: 79   AVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPL 138
             V  S+L+L        +N++  A++ G+ + R++F       EH+ R ++AD+ LDT  
Sbjct: 2706 QVEGSVLFLYTDNEAAASNLKKEAESRGVKRDRLIFGKRLPLAEHLARYRVADLFLDTNP 2765

Query: 139  CNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             N  TT+ D LW G PV+TL GE+ ASR+AAS L  +G PELI    +EY+ +AI L  D
Sbjct: 2766 YNAGTTASDALWAGLPVLTLRGESFASRMAASLLTAIGLPELITSGQEEYESLAIELALD 2825


>gi|336472604|gb|EGO60764.1| hypothetical protein NEUTE1DRAFT_57508 [Neurospora tetrasperma FGSC
            2508]
 gi|350294161|gb|EGZ75246.1| hypothetical protein NEUTE2DRAFT_104796 [Neurospora tetrasperma FGSC
            2509]
          Length = 1655

 Score =  143 bits (360), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 72/160 (45%), Positives = 105/160 (65%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D I+  NFNQLYKIDP+T + W+ +L  VP +ILWLL+FP +GE N++ TA+    +
Sbjct: 1406 LPDDRIILGNFNQLYKIDPTTFRTWLRILAHVPKAILWLLRFPELGENNLRRTAKLWAGE 1465

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
            +   RI+F++VA K++H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1466 EVASRIIFTDVAPKQQHIARARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYEYKMC 1525

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               +LIA    EY++ AI+L
Sbjct: 1526 SRMAASILKGALPKSEAGRQAAEDLIAGDDYEYENRAIKL 1565


>gi|116206628|ref|XP_001229123.1| hypothetical protein CHGG_02607 [Chaetomium globosum CBS 148.51]
 gi|88183204|gb|EAQ90672.1| hypothetical protein CHGG_02607 [Chaetomium globosum CBS 148.51]
          Length = 1434

 Score =  143 bits (360), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 71/160 (44%), Positives = 104/160 (65%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+DAI+  NFNQLYKIDP+T + W+ +L A P + LWLL+FP +GE +++ TA+    +
Sbjct: 1209 LPDDAILLGNFNQLYKIDPTTFRSWLRILAAAPKAYLWLLRFPELGETHLRRTARDWSGE 1268

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K +H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP     + 
Sbjct: 1269 GVASRIIFTDVAPKLQHISRARVCDLFLDTPECNAHTTAADILWSNTPLLTLPRYEYKMC 1328

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               ELIAR   EY++ A++L
Sbjct: 1329 SRMAASILKGALPKGPEGERAAQELIARDEAEYEEFAVQL 1368


>gi|315044201|ref|XP_003171476.1| hypothetical protein MGYG_06022 [Arthroderma gypseum CBS 118893]
 gi|311343819|gb|EFR03022.1| hypothetical protein MGYG_06022 [Arthroderma gypseum CBS 118893]
          Length = 1698

 Score =  143 bits (360), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 68/161 (42%), Positives = 104/161 (64%), Gaps = 13/161 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D ++  NFNQLYKI+P+T + W+ +L  +PN++LWLL+FP VGE N++ TA+A   +
Sbjct: 1473 LKDDTVILGNFNQLYKIEPTTFRTWLRILAGIPNAVLWLLRFPDVGEQNLRQTAKAWAGE 1532

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P     + 
Sbjct: 1533 ATASRIIFTDVAPKHAHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTFPRYKYKMC 1592

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLG 196
            SR+A+S L +             +L++ +  EY+  AI LG
Sbjct: 1593 SRMASSILTSALPQTEDGKDAARQLVSSSEAEYEKTAIELG 1633


>gi|71006212|ref|XP_757772.1| hypothetical protein UM01625.1 [Ustilago maydis 521]
 gi|46097017|gb|EAK82250.1| hypothetical protein UM01625.1 [Ustilago maydis 521]
          Length = 2239

 Score =  143 bits (360), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 67/153 (43%), Positives = 101/153 (66%), Gaps = 5/153 (3%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D +++ +FNQLYK DP   ++W+ +L+ VP SILWLL+FPA GE ++   A+    D
Sbjct: 1862 LPDDYVIFADFNQLYKCDPMLFRLWLRILERVPKSILWLLRFPAAGEEHLMHEARRYAGD 1921

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
            +   R++F++VA K  H+ RG++AD+ LDT  CN HTT+ D+LW+ TP++T P     + 
Sbjct: 1922 RVACRVIFTDVAPKHIHIHRGRIADLFLDTIECNAHTTAADILWSATPMLTWPRHVHKMC 1981

Query: 165  SRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
            SRVAAS +   G   E+   + KEY+D A+ L 
Sbjct: 1982 SRVAASIVHATGFGDEMTVHSEKEYEDRAVELA 2014


>gi|300022675|ref|YP_003755286.1| hypothetical protein Hden_1151 [Hyphomicrobium denitrificans ATCC
           51888]
 gi|299524496|gb|ADJ22965.1| TPR repeat-containing protein [Hyphomicrobium denitrificans ATCC
           51888]
          Length = 818

 Score =  143 bits (360), Expect = 5e-32,   Method: Composition-based stats.
 Identities = 70/176 (39%), Positives = 105/176 (59%), Gaps = 5/176 (2%)

Query: 24  QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
           Q N +  T  E+P      +R   GLPEDA V+C+FN  YK++ +   +W+++L+ V  S
Sbjct: 595 QPNDRKRTISELP-----VTRADAGLPEDAFVFCSFNNSYKLNATMFDVWMSLLRNVDGS 649

Query: 84  ILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
           +LWLL   A    N++  A   G+D  R++F++     EH+ R +LAD+ LD   CN HT
Sbjct: 650 VLWLLVPTATCRENLRREAAQRGVDPDRLVFASRKPIAEHLARHRLADLFLDALPCNAHT 709

Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           T+ D LW G PV+T  GET + RVAAS L  +G PEL+ +   +Y ++A+ L  D+
Sbjct: 710 TASDALWAGLPVITATGETFSGRVAASLLTAVGLPELVTKNLDDYAELALALARDK 765


>gi|452845777|gb|EME47710.1| glycosyltransferase family 41 protein [Dothistroma septosporum NZE10]
          Length = 1653

 Score =  143 bits (360), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 71/159 (44%), Positives = 105/159 (66%), Gaps = 13/159 (8%)

Query: 51   EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
            +D I++ NFNQLYKI+P+T + W+ +L  VPN+ILWLL+FP +GE+N+ ATA+  G    
Sbjct: 1425 DDTIIFGNFNQLYKIEPTTFRTWLRILARVPNAILWLLRFPDLGESNLLATARMWGGPDV 1484

Query: 110  -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLASR 166
              R+ F++VA K  H+ R ++ D+ LDT  CN HTT+ DVLW+GTP++TLP     + SR
Sbjct: 1485 ASRVTFTDVAPKHLHISRARVCDLVLDTAECNAHTTAADVLWSGTPLLTLPRYEYKMCSR 1544

Query: 167  VAASQL---------ATLGCPELIARTHKEYQDIAIRLG 196
            +AAS L               EL+A + ++Y++ AI+LG
Sbjct: 1545 MAASILKGALPKGDEGDRAAEELVATSEEDYEEKAIKLG 1583


>gi|346322980|gb|EGX92578.1| UDP-N-acetylglucosaminyltransferase [Cordyceps militaris CM01]
          Length = 1592

 Score =  143 bits (360), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 73/161 (45%), Positives = 103/161 (63%), Gaps = 13/161 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L ED ++  NFNQLYKIDPST + W+ +L  VP ++LWLL+FP  GEAN++ TA+     
Sbjct: 1283 LQEDRVILANFNQLYKIDPSTFRSWLRILAQVPKAVLWLLRFPEAGEANLRRTAELWAGP 1342

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   R++F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++T P     + 
Sbjct: 1343 EVASRLIFTDVAQKSQHISRARICDLFLDTPECNAHTTAADVLWSSTPLLTFPRYPYKMC 1402

Query: 165  SRVAASQL--ATLGCP-------ELIARTHKEYQDIAIRLG 196
            SR+AAS L  A    P       ELIA + K Y+ +AI L 
Sbjct: 1403 SRMAASILRGALPKSPEGRHAAGELIATSEKRYEKVAIDLA 1443


>gi|443899251|dbj|GAC76582.1| animal-type fatty acid synthase and related proteins [Pseudozyma
            antarctica T-34]
          Length = 2043

 Score =  143 bits (360), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 67/153 (43%), Positives = 101/153 (66%), Gaps = 5/153 (3%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D +++ +FNQLYK DP   ++W+ +LK VP SILWLL+FPA GE ++   A+    D
Sbjct: 1723 LPDDYVIFADFNQLYKCDPMLFKLWLRILKRVPKSILWLLRFPAAGEHHLLREARQYAGD 1782

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
                R++F++VA K  H+ RG++AD+ LDT  CN HTT+ D+LW+ TPV+T P     + 
Sbjct: 1783 DVAARVIFTDVAPKHIHIHRGRIADLFLDTTECNAHTTAADILWSATPVLTWPRHMHKMC 1842

Query: 165  SRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
            SRVAAS +   G   E+   + +EY++ A++L 
Sbjct: 1843 SRVAASIVHATGFGDEMTVHSEREYEERAVQLA 1875


>gi|326476337|gb|EGE00347.1| UDP-N-acetylglucosaminyltransferase [Trichophyton tonsurans CBS
            112818]
          Length = 1527

 Score =  143 bits (360), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 70/161 (43%), Positives = 103/161 (63%), Gaps = 13/161 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D ++  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP VGE N++ TA+A   +
Sbjct: 1302 LKDDTVILGNFNQLYKIEPTTFRTWLRILAGIPNAILWLLRFPDVGEYNLRQTAKAWAGE 1361

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P     + 
Sbjct: 1362 ATASRIVFTDVAPKHTHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTFPRYKYKMC 1421

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLG 196
            SR+A+S L +             +LI+    EY+  AI LG
Sbjct: 1422 SRMASSILTSALPQTEDGKQAARQLISSDEVEYEKTAIELG 1462


>gi|336266788|ref|XP_003348161.1| hypothetical protein SMAC_04006 [Sordaria macrospora k-hell]
          Length = 1443

 Score =  143 bits (360), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 71/160 (44%), Positives = 105/160 (65%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D I+  NFNQLYKIDP+T + W+ +L+ VP ++LWLL+FP +GE N++ TA+    +
Sbjct: 1197 LPDDRIILGNFNQLYKIDPTTFRSWLRILQDVPKAVLWLLRFPELGENNLRRTAKQWAGE 1256

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
            +   RI+F++VA K++H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1257 EVASRIIFTDVAPKQQHIARARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYEYKMC 1316

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               +LIA    EY++ AI L
Sbjct: 1317 SRMAASILKGALPKSEAGRRAAEDLIADDDYEYENKAIEL 1356


>gi|380091097|emb|CCC11303.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1514

 Score =  142 bits (359), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 71/160 (44%), Positives = 105/160 (65%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D I+  NFNQLYKIDP+T + W+ +L+ VP ++LWLL+FP +GE N++ TA+    +
Sbjct: 1268 LPDDRIILGNFNQLYKIDPTTFRSWLRILQDVPKAVLWLLRFPELGENNLRRTAKQWAGE 1327

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
            +   RI+F++VA K++H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1328 EVASRIIFTDVAPKQQHIARARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYEYKMC 1387

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               +LIA    EY++ AI L
Sbjct: 1388 SRMAASILKGALPKSEAGRRAAEDLIADDDYEYENKAIEL 1427


>gi|255935459|ref|XP_002558756.1| Pc13g03180 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211583376|emb|CAP91387.1| Pc13g03180 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1515

 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 70/163 (42%), Positives = 108/163 (66%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I+  NFNQLYK++P+T + W+ +L  +PN++LWLL+FP  GE N++  A+A   +
Sbjct: 1292 LSDDTIILGNFNQLYKVEPTTFRTWLRILARIPNAVLWLLRFPDTGEQNLRDIAKAWAGE 1351

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1352 ETASRIIFTDVAPKNTHIARAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1411

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
            SR+A+S L++             ELIA + ++Y++ AIRL  D
Sbjct: 1412 SRMASSILSSALPSSEAGQRAREELIASSDEDYENKAIRLCLD 1454


>gi|302661258|ref|XP_003022298.1| hypothetical protein TRV_03509 [Trichophyton verrucosum HKI 0517]
 gi|291186238|gb|EFE41680.1| hypothetical protein TRV_03509 [Trichophyton verrucosum HKI 0517]
          Length = 1697

 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 70/161 (43%), Positives = 103/161 (63%), Gaps = 13/161 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D ++  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP VGE N++ TA+A   +
Sbjct: 1472 LKDDTVILGNFNQLYKIEPTTFRTWLRILAGIPNAILWLLRFPDVGEHNLRQTAKAWAGE 1531

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P     + 
Sbjct: 1532 ATASRIVFTDVAPKHTHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTFPRYKYKMC 1591

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLG 196
            SR+A+S L +             +LI+    EY+  AI LG
Sbjct: 1592 SRMASSILTSALPQTEDGKQAARQLISSDEVEYEKTAIELG 1632


>gi|326481193|gb|EGE05203.1| UDP-N-acetylglucosaminyltransferase [Trichophyton equinum CBS 127.97]
          Length = 1564

 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 70/161 (43%), Positives = 103/161 (63%), Gaps = 13/161 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D ++  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP VGE N++ TA+A   +
Sbjct: 1339 LKDDTVILGNFNQLYKIEPTTFRTWLRILAGIPNAILWLLRFPDVGEYNLRQTAKAWAGE 1398

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P     + 
Sbjct: 1399 ATASRIVFTDVAPKHTHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTFPRYKYKMC 1458

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLG 196
            SR+A+S L +             +LI+    EY+  AI LG
Sbjct: 1459 SRMASSILTSALPQTEDGKQAARQLISSDEVEYEKTAIELG 1499


>gi|317025352|ref|XP_001388903.2| UDP-N-acetylglucosaminyltransferase [Aspergillus niger CBS 513.88]
          Length = 1670

 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 70/160 (43%), Positives = 108/160 (67%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I+  NFNQLYKI+P+T + W+ +L  +PN++LWLL+FP +GE N++ TA A   +
Sbjct: 1446 LSDDTIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLRETAVAWAGE 1505

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1506 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1565

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
            SR+A+S L++             +LIA + ++Y++ AIRL
Sbjct: 1566 SRMASSILSSALPKSDAGREAQSDLIAMSDEDYENKAIRL 1605


>gi|134055003|emb|CAK37011.1| unnamed protein product [Aspergillus niger]
          Length = 1546

 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 70/160 (43%), Positives = 108/160 (67%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I+  NFNQLYKI+P+T + W+ +L  +PN++LWLL+FP +GE N++ TA A   +
Sbjct: 1322 LSDDTIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLRETAVAWAGE 1381

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1382 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1441

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
            SR+A+S L++             +LIA + ++Y++ AIRL
Sbjct: 1442 SRMASSILSSALPKSDAGREAQSDLIAMSDEDYENKAIRL 1481


>gi|342879983|gb|EGU81213.1| hypothetical protein FOXB_08246 [Fusarium oxysporum Fo5176]
          Length = 1491

 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 70/162 (43%), Positives = 106/162 (65%), Gaps = 13/162 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I+  NFNQLYKI+P+T + W+ +L  VP +++WLL+FP +GEAN++ TA+A   +
Sbjct: 1265 LSDDTIIMGNFNQLYKIEPTTFRTWLRILAQVPKAVIWLLRFPELGEANLRRTAKAWAGE 1324

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   R++F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1325 EVASRLIFTDVAPKSQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1384

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGT 197
            SR+AAS L               ELIA + +EY+  A+ L T
Sbjct: 1385 SRMAASILKGALPKSNEGQEAAAELIAASEEEYEQRAVELAT 1426


>gi|425766660|gb|EKV05262.1| UDP-N-acetylglucosaminyltransferase [Penicillium digitatum Pd1]
 gi|425775265|gb|EKV13543.1| UDP-N-acetylglucosaminyltransferase [Penicillium digitatum PHI26]
          Length = 1642

 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 67/163 (41%), Positives = 106/163 (65%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I+  NFNQLYK++P+T + W+ +L  +PN++LWLL+FP  GE N++  A+A   +
Sbjct: 1419 LSDDTIILGNFNQLYKVEPTTFRTWLRILARIPNAVLWLLRFPDTGEKNLRDIAKAWAGE 1478

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP------- 159
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP       
Sbjct: 1479 ETASRIIFTDVAPKNTHIARAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1538

Query: 160  ----GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
                   L+S + +S++      +LIA + ++Y++ AIRL  D
Sbjct: 1539 SRMASSILSSALPSSEVGQRAREDLIASSDEDYENKAIRLCLD 1581


>gi|302897315|ref|XP_003047536.1| hypothetical protein NECHADRAFT_105493 [Nectria haematococca mpVI
            77-13-4]
 gi|256728467|gb|EEU41823.1| hypothetical protein NECHADRAFT_105493 [Nectria haematococca mpVI
            77-13-4]
          Length = 1538

 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 71/160 (44%), Positives = 106/160 (66%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  VP ++LWLL+FP +GEAN++ TA+A    
Sbjct: 1312 LADDAIIMGNFNQLYKIEPTTFRSWLRILAQVPKAVLWLLRFPELGEANLRRTAKAWAGT 1371

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   R++F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1372 EVASRLIFTDVAPKSQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1431

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               ELIA + +EY+  A++L
Sbjct: 1432 SRMAASILKGALPKSEDGRKAAAELIASSEEEYEQRAVQL 1471


>gi|164426318|ref|XP_961401.2| hypothetical protein NCU01315 [Neurospora crassa OR74A]
 gi|157071287|gb|EAA32165.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 1655

 Score =  142 bits (358), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 71/160 (44%), Positives = 105/160 (65%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D I+  NFNQLYKIDP+T + W+ +L  VP ++LWLL+FP +GE N++ TA+    +
Sbjct: 1406 LPDDRIILGNFNQLYKIDPTTFRTWLRILAHVPKAVLWLLRFPELGENNLRRTAKLWAGE 1465

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
            +   RI+F++VA K++H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1466 EVASRIIFTDVAPKQQHIARARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYEYKMC 1525

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               +LIA    EY++ AI+L
Sbjct: 1526 SRMAASILKGALPKSEAGRQAAEDLIAGDDYEYENRAIKL 1565


>gi|449300658|gb|EMC96670.1| glycosyltransferase family 41 protein [Baudoinia compniacensis UAMH
            10762]
          Length = 1439

 Score =  142 bits (358), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 73/159 (45%), Positives = 104/159 (65%), Gaps = 13/159 (8%)

Query: 51   EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
            ++ I++ NFNQLYKIDP+T + W+ VL AVPNSILWLL+FP +GE+++ ATA+     + 
Sbjct: 1210 DETIIFGNFNQLYKIDPTTFRTWLRVLAAVPNSILWLLRFPDLGESHLMATARLWATTEV 1269

Query: 110  -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET--LASR 166
              R++F++VA K  H+ R ++ D+ LDT  CN HTT+ DVLW+GTP++TLP  +  + SR
Sbjct: 1270 ASRLIFTDVAPKHLHISRARICDLFLDTAECNAHTTAADVLWSGTPLLTLPRYSYKMCSR 1329

Query: 167  VAASQL---------ATLGCPELIARTHKEYQDIAIRLG 196
            +AAS L               ELIA    +Y+  AI LG
Sbjct: 1330 MAASILRGALPKTGEGVQAARELIATDEDDYESKAIALG 1368


>gi|327296958|ref|XP_003233173.1| UDP-N-acetylglucosaminyltransferase [Trichophyton rubrum CBS 118892]
 gi|326464479|gb|EGD89932.1| UDP-N-acetylglucosaminyltransferase [Trichophyton rubrum CBS 118892]
          Length = 1678

 Score =  142 bits (358), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 70/161 (43%), Positives = 103/161 (63%), Gaps = 13/161 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D ++  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP VGE N++ TA+A   +
Sbjct: 1453 LKDDTVILGNFNQLYKIEPTTFRTWLRILAGIPNAILWLLRFPDVGEHNLRQTAKAWAGE 1512

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P     + 
Sbjct: 1513 ATASRIVFTDVAPKHTHISRARICDLFLDTPECNAHTTAADVLWSGTPLLTFPRYKYKMC 1572

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLG 196
            SR+A+S L +             +LI+    EY+  AI LG
Sbjct: 1573 SRMASSILTSALPQTEDGKQAARQLISSDEVEYEKTAIELG 1613


>gi|350638065|gb|EHA26421.1| hypothetical protein ASPNIDRAFT_172247 [Aspergillus niger ATCC 1015]
          Length = 1603

 Score =  142 bits (358), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 70/160 (43%), Positives = 108/160 (67%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I+  NFNQLYKI+P+T + W+ +L  +PN++LWLL+FP +GE N++ TA A   +
Sbjct: 1379 LSDDTIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLRETAVAWAGE 1438

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1439 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1498

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
            SR+A+S L++             +LIA + ++Y++ AIRL
Sbjct: 1499 SRMASSILSSALPKSDAGREAQSDLIAMSDEDYENKAIRL 1538


>gi|440636477|gb|ELR06396.1| hypothetical protein GMDG_02113 [Geomyces destructans 20631-21]
          Length = 1643

 Score =  142 bits (358), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 69/162 (42%), Positives = 106/162 (65%), Gaps = 13/162 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D I+  NFNQLYKI+P+T + W+ +L  VP ++LWLL+FP +GE+N++ TA+    +
Sbjct: 1421 LPDDVIILGNFNQLYKIEPTTFRTWLRILANVPKAVLWLLRFPDLGESNLRRTARLWAGE 1480

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
            +   RILF++VA K  H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP     + 
Sbjct: 1481 EVASRILFTDVAPKNLHISRARICDLFLDTPECNAHTTAADILWSSTPLLTLPRYEYKMC 1540

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGT 197
            SR+AAS L               ELIA   ++Y++ A++L +
Sbjct: 1541 SRMAASILKGALPRSDEGFDAASELIATDEEQYEEFAVKLAS 1582


>gi|328766887|gb|EGF76939.1| hypothetical protein BATDEDRAFT_33767 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 452

 Score =  142 bits (358), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 64/126 (50%), Positives = 88/126 (69%), Gaps = 4/126 (3%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
           L ED +++ NFNQLYK+DP     W+ +L  VPNSILWLL+FP  GEA+++  A  L  +
Sbjct: 100 LAEDTVIFANFNQLYKVDPHIFATWMRILSRVPNSILWLLRFPPAGEAHLRRKAVELVGE 159

Query: 109 --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE--TLA 164
               R++F++VA K  H+ RG++ADV LDTP CN HTT+ D+LW+GTP++T P     + 
Sbjct: 160 SVSRRLIFTDVAPKHLHIHRGRIADVFLDTPECNAHTTAADILWSGTPIITYPKYDFKMC 219

Query: 165 SRVAAS 170
           SRVAAS
Sbjct: 220 SRVAAS 225


>gi|338739161|ref|YP_004676123.1| hypothetical protein HYPMC_2335 [Hyphomicrobium sp. MC1]
 gi|337759724|emb|CCB65555.1| TPR repeat-containing protein [Hyphomicrobium sp. MC1]
          Length = 835

 Score =  142 bits (357), Expect = 9e-32,   Method: Composition-based stats.
 Identities = 66/157 (42%), Positives = 97/157 (61%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR  +GLPEDA V+C+FN  YK++ +   +W+ +L+ VP S+LWLL        N++  A
Sbjct: 610 SRADFGLPEDAFVFCSFNNSYKLNATMFDVWMPLLQKVPGSVLWLLVPNTTCAENLRREA 669

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G+D  R++F+      EH+ R + AD+ LD   CN HTT+ D LW G PV+T  G+T
Sbjct: 670 EARGVDPSRLVFAKRMPIAEHLARHRFADLFLDALPCNAHTTTTDALWAGLPVLTCLGDT 729

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            A RVAAS L+ +  PELI     +Y D+A+ L  ++
Sbjct: 730 FAGRVAASLLSAIELPELITTNLADYSDLALELAQNK 766


>gi|453087624|gb|EMF15665.1| glycosyltransferase family 41 protein [Mycosphaerella populorum
            SO2202]
          Length = 1685

 Score =  142 bits (357), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 72/162 (44%), Positives = 107/162 (66%), Gaps = 13/162 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I++ NFNQLYKI+P+T + W+ +L  VPNSILWLL+FP +GE+++ ATA+     
Sbjct: 1452 LSDDTIIFGNFNQLYKIEPTTFRTWLRILARVPNSILWLLRFPDLGESHLLATARLWAGR 1511

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
            +   R++F++VA K  H+ R ++ D+ LDT  CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1512 EVASRVIFTDVAPKHLHISRARVCDLVLDTAECNAHTTAADVLWSGTPLLTLPRYQYKMC 1571

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLGT 197
            SR+AAS L               ELIA+  ++Y++ A+ LGT
Sbjct: 1572 SRMAASILKGALPKNEEGRRAETELIAKHEEDYEEKAVALGT 1613


>gi|67515967|ref|XP_657869.1| hypothetical protein AN0265.2 [Aspergillus nidulans FGSC A4]
 gi|40746982|gb|EAA66138.1| hypothetical protein AN0265.2 [Aspergillus nidulans FGSC A4]
 gi|259489517|tpe|CBF89854.1| TPA: UDP-N-acetylglucosaminyltransferase (AFU_orthologue;
            AFUA_1G03380) [Aspergillus nidulans FGSC A4]
          Length = 1596

 Score =  142 bits (357), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 70/160 (43%), Positives = 107/160 (66%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I+  NFNQLYKI+P+T + W+ +L ++PN++LWLL+FP +GE N++ TA A   +
Sbjct: 1372 LSDDTIILGNFNQLYKIEPTTFRTWLRILASIPNAVLWLLRFPDIGEQNLRETAVAWAGE 1431

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1432 ETASRIIFTDVAPKNTHIARAKILDLFLDTPECNAHTTATDVLWSGTPLLTLPRYKYKMC 1491

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
            SR+A+S L++             ELIA + ++Y+  AI L
Sbjct: 1492 SRMASSILSSALPKTESGQQAREELIALSDEDYEKKAIHL 1531


>gi|115388427|ref|XP_001211719.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114195803|gb|EAU37503.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 1573

 Score =  141 bits (356), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 71/160 (44%), Positives = 107/160 (66%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +DAI+  NFNQLYKI+P+T + W+ +L  +PN++LWLL+FP +GE N++ TA A   +
Sbjct: 1349 LRDDAIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLRDTAVAWAGE 1408

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P     + 
Sbjct: 1409 ETASRIIFTDVAPKNTHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTYPRYKYKMC 1468

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
            SR+A+S L++             ELIA +  +Y++ AIRL
Sbjct: 1469 SRMASSILSSALPDSDAGNQAREELIAVSDDDYENKAIRL 1508


>gi|340904950|gb|EGS17318.1| transferase-like protein [Chaetomium thermophilum var. thermophilum
            DSM 1495]
          Length = 1621

 Score =  141 bits (356), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 70/160 (43%), Positives = 102/160 (63%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+DA++  NFNQLYKIDP+T + W+ +L A P + LWLL+FP +GE N+  TA+A   +
Sbjct: 1386 LPDDAVILGNFNQLYKIDPTTFRTWLRILAACPKAYLWLLRFPDLGERNLMETARAWSGN 1445

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RI F++VA K++H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1446 AVASRIRFTDVAPKQDHISRAKVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1505

Query: 165  SRVAAS---------QLATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS          L      EL+    +EY++ A+ L
Sbjct: 1506 SRMAASILRGALPRGPLGDQAARELVCADEREYEERAVAL 1545


>gi|358057401|dbj|GAA96750.1| hypothetical protein E5Q_03421 [Mixia osmundae IAM 14324]
          Length = 1930

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 70/153 (45%), Positives = 103/153 (67%), Gaps = 5/153 (3%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L  + +++ NFNQLYKIDPST  +W+ +L+AVPNS+LWLL+FPA GE N+  TA+  G +
Sbjct: 1690 LAPETVIFANFNQLYKIDPSTFCIWLKILQAVPNSVLWLLRFPAAGEPNLLDTAKRWGGE 1749

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
                RILF++VA+K+EH+ RG++AD+ +DT     HTT +DVLW+G+P+V  P   E + 
Sbjct: 1750 AIAKRILFTSVASKDEHLNRGRVADLFIDTLELGSHTTCVDVLWSGSPIVVSPRKREKMG 1809

Query: 165  SRVAASQLATLGCPE-LIARTHKEYQDIAIRLG 196
            SRV+ S +   G  + L+     +Y+  A  LG
Sbjct: 1810 SRVSTSIVTASGFADALVVEDEAQYERRAAELG 1842


>gi|46110068|ref|XP_382092.1| hypothetical protein FG01916.1 [Gibberella zeae PH-1]
          Length = 1596

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 70/160 (43%), Positives = 104/160 (65%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I+  NFNQLYKI+P+T + W+ +L  VP +++WLL+FP +GEAN++ TA+    +
Sbjct: 1370 LADDTIIMGNFNQLYKIEPTTFRTWLRILAQVPKAVIWLLRFPELGEANLRRTAKEWAGE 1429

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   R+LF++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1430 EVASRLLFTDVAPKSQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1489

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               ELIA   KEY+  A++L
Sbjct: 1490 SRMAASILKGALPKSDEGRQAAAELIAGDEKEYEQRAVKL 1529


>gi|253999006|ref|YP_003051069.1| hypothetical protein Msip34_1296 [Methylovorus glucosetrophus
           SIP3-4]
 gi|253985685|gb|ACT50542.1| Tetratricopeptide TPR_2 repeat protein [Methylovorus glucosetrophus
           SIP3-4]
          Length = 697

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 71/156 (45%), Positives = 96/156 (61%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R  YGLPE+A VYC FNQ +KI P+    W+ +L+ VP+ +LWLL+  A    N+Q  A
Sbjct: 500 ARVDYGLPEEAFVYCCFNQSFKITPAVFACWMRILQQVPDGVLWLLESNATATLNLQQAA 559

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G+D  R++F+   A  +H+ R  LAD+ LDT   N HTT+ D LW   PV+T  G+T
Sbjct: 560 KAAGVDPARLVFAPRVAMADHLARHALADLFLDTLPYNAHTTASDALWMCLPVLTCSGDT 619

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRVA S L  +  PELIA     Y+ +AI L  D
Sbjct: 620 FASRVAGSLLHAVNLPELIAPDMAAYEAMAINLKHD 655


>gi|452986315|gb|EME86071.1| glycosyltransferase family 41 protein, partial [Pseudocercospora
            fijiensis CIRAD86]
          Length = 1463

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 71/161 (44%), Positives = 106/161 (65%), Gaps = 13/161 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I++ NFNQLYKI+P+T + W+ +L  VPNSILWLL+FP +GE+++ ATA+     
Sbjct: 1237 LADDTIIFGNFNQLYKIEPTTFRTWLRILARVPNSILWLLRFPDLGESHLLATARMWAGA 1296

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   R++F++VA K  H+ R ++ D+ +DT  CN HTT+ DVLW+GTP++TLP     + 
Sbjct: 1297 EVASRVIFTDVAPKHLHISRARICDLVVDTAECNAHTTAADVLWSGTPLLTLPRYPYKMC 1356

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRLG 196
            SR+AAS L               ELIA + +EY++ A+ LG
Sbjct: 1357 SRMAASILKGALPKNAEGERAARELIAASEEEYEEKAVALG 1397


>gi|84503473|ref|ZP_01001528.1| TPR repeat protein [Oceanicola batsensis HTCC2597]
 gi|84388151|gb|EAQ01104.1| TPR repeat protein [Oceanicola batsensis HTCC2597]
          Length = 630

 Score =  141 bits (355), Expect = 2e-31,   Method: Composition-based stats.
 Identities = 68/164 (41%), Positives = 99/164 (60%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           P+  +  +R  +GLPE+  V C FN   KI P    +W+ +L+AVP S+LWLL+      
Sbjct: 410 PEPRITGTRADHGLPEEGPVLCCFNAGRKIGPQEFDLWMRLLRAVPASVLWLLRSNPRMV 469

Query: 96  ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
            N++  A   G+D  R++F++    E H+ R   AD+ LDT   NGHTT+ D LW G PV
Sbjct: 470 GNLRGEAARRGVDPDRLVFADPCPHESHIARYVHADIVLDTFRYNGHTTTSDALWAGVPV 529

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           VT+ G   A+RVAAS L+  G P+L+  T +EY+ +++ L TDR
Sbjct: 530 VTMAGRQFAARVAASLLSASGLPDLVTETVQEYEALSLALATDR 573


>gi|171692141|ref|XP_001910995.1| hypothetical protein [Podospora anserina S mat+]
 gi|170946019|emb|CAP72820.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1635

 Score =  140 bits (354), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 69/160 (43%), Positives = 103/160 (64%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+DAI+  NFNQLYKIDP+T + W+ +L + P + LWLL+FP +GE +++ TA+    +
Sbjct: 1405 LPDDAIILGNFNQLYKIDPTTFRTWLRILASCPKAHLWLLRFPELGETHLRRTAKDWAGE 1464

Query: 109  --QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
                RI F++VA K++H+ R ++ D+ LDTP CN HTT+ D+LW+ TP++TLP     + 
Sbjct: 1465 AVASRIHFTDVAPKQQHISRARVCDLFLDTPECNAHTTAADILWSSTPLLTLPRYEYKMC 1524

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               ELIA   K+Y++ A+ L
Sbjct: 1525 SRMAASILKGALPKSEEGERAAKELIAEDEKQYEEFAVGL 1564


>gi|296422902|ref|XP_002840997.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295637225|emb|CAZ85188.1| unnamed protein product [Tuber melanosporum]
          Length = 1503

 Score =  140 bits (354), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 80/201 (39%), Positives = 115/201 (57%), Gaps = 24/201 (11%)

Query: 8    QTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDP 67
            Q++ +    Q G    QT       E  PQ           L  D ++  NFNQLYKI+P
Sbjct: 1243 QSAPDSKSRQLGWEEEQTRRWAKRKELFPQ-----------LKPDNVILGNFNQLYKIEP 1291

Query: 68   STLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ--HRILFSNVAAKEEHVR 125
            +T + W+ +L  VP ++LWLL+FP +GE+N++A A A   ++   RI+F++VA K +H+ 
Sbjct: 1292 TTFRTWLRILSRVPKAVLWLLRFPDLGESNLKALAVAWAGEEIASRIIFTDVAPKLQHIS 1351

Query: 126  RGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLASRVAASQL--ATLGCPE-- 179
            R Q+ D+ LDTP CN HTT+ DVLW+GTP++T P     + SR+AAS L  A    PE  
Sbjct: 1352 RAQVCDIFLDTPECNAHTTAADVLWSGTPLLTFPRHKHKMCSRIAASILRAAVPQTPEGK 1411

Query: 180  -----LIARTHKEYQDIAIRL 195
                 LI  + +EY+D A  L
Sbjct: 1412 AMANSLIVDSEEEYEDRAAAL 1432


>gi|121703103|ref|XP_001269816.1| UDP-N-acetylglucosaminyltransferase [Aspergillus clavatus NRRL 1]
 gi|119397959|gb|EAW08390.1| UDP-N-acetylglucosaminyltransferase [Aspergillus clavatus NRRL 1]
          Length = 1669

 Score =  140 bits (354), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 70/160 (43%), Positives = 106/160 (66%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I+  NFNQLYKI+P+T + W+ +L  +PN+ILWLL+FP +GE N++ TA A    
Sbjct: 1445 LSDDTIILGNFNQLYKIEPTTFRTWLRILARIPNAILWLLRFPELGEQNLRETAIAWAGQ 1504

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P     + 
Sbjct: 1505 ETASRIIFTDVAPKNTHIARAKILDLFLDTPECNAHTTATDVLWSGTPLLTFPRYKYKMC 1564

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
            SR+A+S L++             EL+A + ++Y++ AIRL
Sbjct: 1565 SRMASSILSSALPNSEAGHKARTELMAVSDEDYEEKAIRL 1604


>gi|452963349|gb|EME68422.1| O-linked N-acetylglucosamine transferase [Magnetospirillum sp.
           SO-1]
          Length = 602

 Score =  140 bits (354), Expect = 2e-31,   Method: Composition-based stats.
 Identities = 68/165 (41%), Positives = 100/165 (60%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           P++     + ++GLP DA V   FN  +KI+     +W++VL+A P+++LW ++F     
Sbjct: 388 PRAAAREPKAKWGLPGDAFVVAVFNNSFKINAEAFAVWISVLQAQPDAVLWFVEFHPAAT 447

Query: 96  ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
           A+++A AQA+G+D  R++F+   ++ EH+ R   AD+ LDT  C GHTT+ D LW G P+
Sbjct: 448 ASLRAMAQAVGIDPARLVFAPRLSQAEHLARLSAADLFLDTWPCGGHTTASDALWAGVPL 507

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           V   G T ASRVA S L  LG  ELIA +   Y  +A  L  DRD
Sbjct: 508 VAWAGRTFASRVAGSLLHALGFDELIAESQGAYYALAQHLAKDRD 552


>gi|83311456|ref|YP_421720.1| SPY protein [Magnetospirillum magneticum AMB-1]
 gi|82946297|dbj|BAE51161.1| SPY protein [Magnetospirillum magneticum AMB-1]
          Length = 798

 Score =  140 bits (353), Expect = 3e-31,   Method: Composition-based stats.
 Identities = 70/166 (42%), Positives = 95/166 (57%)

Query: 32  GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
           G+  P    +  R  YGLPEDA VYC FN  +K       +W ++L+AVP  +LWL +  
Sbjct: 580 GDLAPVGEPVQPRSAYGLPEDAFVYCGFNNPFKFRAEVFDLWADILRAVPQGVLWLREDN 639

Query: 92  AVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
                N+     A G+D  R++F+      EH+ R +LAD+ LD    N HTT+ D LW 
Sbjct: 640 DYSRNNLGREIAARGIDPARLIFAQRTDFAEHMARHRLADLFLDCLPYNAHTTASDALWA 699

Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           G PV+T  GET ASRVA S L+ LG PELI  + +EY++ AI L +
Sbjct: 700 GLPVLTRVGETFASRVAGSLLSGLGLPELITESAEEYRERAIALAS 745


>gi|383757802|ref|YP_005436787.1| TPR repeat protein [Rubrivivax gelatinosus IL144]
 gi|381378471|dbj|BAL95288.1| TPR repeat protein [Rubrivivax gelatinosus IL144]
          Length = 639

 Score =  140 bits (353), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 69/153 (45%), Positives = 96/153 (62%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R + GLPEDA+V C FNQ YKI P  L +W  VL  +P ++LWLL + A G+  + A  +
Sbjct: 432 RAELGLPEDALVLCCFNQSYKITPEVLALWARVLHELPQAVLWLLAWNADGQRRLLAALE 491

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           ALG+   R++F+     ++H+ R + AD+ LDT   N HTT+ + LW   PV+T+PGET 
Sbjct: 492 ALGIGAERVVFAERLPVDQHIARLRAADLFLDTWPYNAHTTASESLWAAVPVLTVPGETF 551

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
           ASRVAAS +A  G PEL     ++Y   A+ LG
Sbjct: 552 ASRVAASLVAACGLPELACADAEDYVRTAVELG 584


>gi|148258992|ref|YP_001243577.1| hypothetical protein BBta_7846 [Bradyrhizobium sp. BTAi1]
 gi|146411165|gb|ABQ39671.1| hypothetical protein BBta_7846 [Bradyrhizobium sp. BTAi1]
          Length = 452

 Score =  140 bits (353), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 70/160 (43%), Positives = 95/160 (59%), Gaps = 1/160 (0%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           + SR + GLPE   V+C FNQ YK  PS   +W  +L A P+S+LWLL      + N++ 
Sbjct: 233 LPSRAEAGLPETGFVFCCFNQAYKFTPSVFDLWCRLLAATPDSVLWLLA-SDQAQGNLRG 291

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            A   G+   R++F+    + EH+RR QLAD+ LDT     HTT+ D LW G P+VT  G
Sbjct: 292 EALRRGVSPDRLVFAPQIGQSEHLRRLQLADLVLDTAPYGAHTTASDALWAGVPIVTCAG 351

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           +TLASRVA S L  +G PELIA    +Y  +A+ L  + D
Sbjct: 352 DTLASRVAGSLLHAVGLPELIASDEADYVAVALTLAAEPD 391


>gi|70990478|ref|XP_750088.1| UDP-N-acetylglucosaminyltransferase [Aspergillus fumigatus Af293]
 gi|66847720|gb|EAL88050.1| UDP-N-acetylglucosaminyltransferase [Aspergillus fumigatus Af293]
 gi|159130569|gb|EDP55682.1| UDP-N-acetylglucosaminyltransferase [Aspergillus fumigatus A1163]
          Length = 1634

 Score =  140 bits (353), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 70/160 (43%), Positives = 105/160 (65%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I+  NFNQLYKI+P+T + W+ +L  +PN++LWLL+FP +GE N++ TA A    
Sbjct: 1410 LSDDTIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLRETAIAWAGR 1469

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P     + 
Sbjct: 1470 ETASRIIFTDVAPKNAHISRAKILDLFLDTPECNAHTTATDVLWSGTPLLTYPRYKYKMC 1529

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
            SR+A+S L++             EL A + ++Y+D AIRL
Sbjct: 1530 SRMASSILSSALPDSDAGRKARDELTAVSDEDYEDKAIRL 1569


>gi|291614668|ref|YP_003524825.1| hypothetical protein Slit_2210 [Sideroxydans lithotrophicus ES-1]
 gi|291584780|gb|ADE12438.1| TPR repeat-containing protein [Sideroxydans lithotrophicus ES-1]
          Length = 736

 Score =  140 bits (352), Expect = 3e-31,   Method: Composition-based stats.
 Identities = 68/162 (41%), Positives = 103/162 (63%), Gaps = 3/162 (1%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           PQ +   +R++ GLP    V+C  N  YKI P  L +W+ +LK VP S+LWL K  A  E
Sbjct: 505 PQQL---TREELGLPAHGFVFCCHNSSYKITPVDLDIWMRLLKRVPGSVLWLYKSSAEVE 561

Query: 96  ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
           AN++  A++ G++ HR++F+N A    ++ R ++AD+ LDT   N  TT+ + LW G PV
Sbjct: 562 ANLRREAESRGVEPHRLVFANQAPHATYLARYRMADLFLDTAFYNAQTTAAEALWAGLPV 621

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           +T PG T+ASRVA+  L  +G  E+IA + ++Y++ A RL T
Sbjct: 622 LTCPGVTMASRVASGLLHAIGLEEMIAGSPQQYEECAYRLAT 663


>gi|407772276|ref|ZP_11119578.1| hypothetical protein TH2_00215 [Thalassospira profundimaris WP0211]
 gi|407284229|gb|EKF09745.1| hypothetical protein TH2_00215 [Thalassospira profundimaris WP0211]
          Length = 629

 Score =  140 bits (352), Expect = 4e-31,   Method: Composition-based stats.
 Identities = 66/155 (42%), Positives = 92/155 (59%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           TSR  YGLP+   V+C+FN  YKI P    +W+ +L  V  S+LWL K       N++  
Sbjct: 416 TSRADYGLPDSGFVFCSFNNTYKISPREFDIWMRLLNQVEGSVLWLFKGNDYAAQNLRKE 475

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           AQ   +D  R++ +   ++ EH+ R +  D+ LDT   N HTT+ D LW G P+VTLPGE
Sbjct: 476 AQKRDVDPDRLICAEKLSQSEHLARQKHGDLLLDTFNVNAHTTASDALWAGLPLVTLPGE 535

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
             A+RVAAS L+    PELIA+   +Y+ IA+ L 
Sbjct: 536 QFAARVAASILSAANLPELIAKDEADYEAIALDLA 570


>gi|428164760|gb|EKX33775.1| hypothetical protein GUITHDRAFT_81153 [Guillardia theta CCMP2712]
          Length = 851

 Score =  140 bits (352), Expect = 4e-31,   Method: Composition-based stats.
 Identities = 63/155 (40%), Positives = 96/155 (61%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +RQ+Y LP   I++ NF QLYK++PS   +WV ++K   N+ LWLLKFP      +   A
Sbjct: 599 TRQEYKLPSSGIIFGNFGQLYKVEPSLFDVWVRIIKRTSNTSLWLLKFPKEAVKRLLKEA 658

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              GL + R++ +++   + H+    +ADV LDT + NGHTT  D LW+G P+V+L GE 
Sbjct: 659 DKRGLPRDRLVLTSLLPIDSHLAIKAVADVALDTNMFNGHTTGADTLWSGLPLVSLSGEQ 718

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           + SR  AS    LG    +AR+ ++Y++IA+RL +
Sbjct: 719 MRSRAGASMAYALGVTRWLARSLEDYEEIAVRLAS 753


>gi|296448837|ref|ZP_06890677.1| conserved hypothetical protein [Methylosinus trichosporium OB3b]
 gi|296253657|gb|EFH00844.1| conserved hypothetical protein [Methylosinus trichosporium OB3b]
          Length = 452

 Score =  140 bits (352), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 69/155 (44%), Positives = 92/155 (59%), Gaps = 1/155 (0%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   GLPE  +V+C FNQ YK+ P    +W  +L A P S+LWLL  P + E N++  A 
Sbjct: 246 RASVGLPEQGVVFCCFNQAYKLTPEIFDVWCRLLDAAPGSVLWLLAAP-MAEGNLRNEAW 304

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G++ +R++F+    + +H+ R QLAD+ LDT   N HTT+ D LW G P+VT  G T 
Sbjct: 305 KRGVNGNRLVFAPDMGQGDHLARLQLADLVLDTAPYNAHTTASDALWAGVPIVTCSGSTF 364

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           ASRVA S L  +G PELI      Y D+A RL  D
Sbjct: 365 ASRVAGSVLRAVGMPELITEDLDGYFDLASRLAND 399


>gi|322695669|gb|EFY87473.1| UDP-N-acetylglucosaminyltransferase [Metarhizium acridum CQMa 102]
          Length = 1746

 Score =  140 bits (352), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 73/160 (45%), Positives = 100/160 (62%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L  DAI+  NFNQLYKIDP+T + W+ +L  VP ++LWLL+FP  GE+N++ATA+A    
Sbjct: 1520 LGNDAIILGNFNQLYKIDPTTFRAWLRILARVPRAVLWLLRFPEPGESNLRATAKAWAGP 1579

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K +H+ R  + D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1580 EVAERIIFTDVAPKSQHISRATVCDMFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1639

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               ELI  T  EY+  A  L
Sbjct: 1640 SRMAASILRGALPRSLEGQQAAEELITYTEAEYEQRAADL 1679


>gi|242803242|ref|XP_002484134.1| UDP-N-acetylglucosaminyltransferase [Talaromyces stipitatus ATCC
            10500]
 gi|218717479|gb|EED16900.1| UDP-N-acetylglucosaminyltransferase [Talaromyces stipitatus ATCC
            10500]
          Length = 1666

 Score =  139 bits (351), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 69/163 (42%), Positives = 106/163 (65%), Gaps = 13/163 (7%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+DAI+  NFNQLYKI+P+T + W+ +L  +P ++LWLL+FP +GE N++  A     +
Sbjct: 1438 LPDDAIILGNFNQLYKIEPTTFRTWLRILSDIPKAVLWLLRFPELGEQNLKDCAVKWANE 1497

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K+ H+ R Q+ D+ LDTP CN HTT+ D+LW+GTP++T P     + 
Sbjct: 1498 EIASRIIFTDVAPKQAHIARAQVVDLFLDTPECNAHTTAADILWSGTPMLTYPRYKYKMC 1557

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRLGTD 198
            SR+A+S L++             ELIA + ++Y+  A RL  D
Sbjct: 1558 SRMASSILSSALPDTEAGHQARKELIAISDEDYRAKASRLCRD 1600


>gi|298704865|emb|CBJ28382.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           family GT41 [Ectocarpus siliculosus]
          Length = 916

 Score =  139 bits (351), Expect = 5e-31,   Method: Composition-based stats.
 Identities = 62/147 (42%), Positives = 93/147 (63%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           +V CNFN+L+K+DP T   W+ VL+ VP ++LWL+        N+   A+  G+D+ R++
Sbjct: 701 VVLCNFNRLHKLDPHTFGAWMEVLRTVPGTVLWLIDGGETARTNLLRQARLAGVDEWRVV 760

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F+ +  KEEH++R +LAD+ +DTPL N HT   D LW G P+VTL G  +ASRV AS + 
Sbjct: 761 FAPLVGKEEHLQRLRLADLFVDTPLYNAHTVGCDALWAGVPMVTLRGAKMASRVGASLVE 820

Query: 174 TLGCPELIARTHKEYQDIAIRLGTDRD 200
             G PEL+  + +EY  + + L  D +
Sbjct: 821 AAGMPELVTDSLEEYTQLVLALARDNE 847


>gi|323447882|gb|EGB03789.1| putative O-linked GlcNAc transferase [Aureococcus anophagefferens]
          Length = 971

 Score =  139 bits (350), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 65/158 (41%), Positives = 96/158 (60%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+ YGLP+DA V+  F    +IDP    +W+++LK VPNS+LWL          ++  A
Sbjct: 760 ARETYGLPQDAFVFAYFGDSDRIDPIIFSVWMSILKRVPNSLLWLHMNSETVIDRLKKEA 819

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +   + + R++FSN   + EHV    LAD+ LDTP CN    ++D LW GTPV+ L G T
Sbjct: 820 RGHQVIEQRLIFSNSVPRREHVFHAMLADIVLDTPACNALDATLDALWAGTPVIALLGNT 879

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           +A+R +AS  + +GC ELI+    +Y+D+AI L  D D
Sbjct: 880 IATRTSASLCSAVGCHELISANLGDYEDLAISLAIDSD 917


>gi|212539952|ref|XP_002150131.1| UDP-N-acetylglucosaminyltransferase [Talaromyces marneffei ATCC
            18224]
 gi|210067430|gb|EEA21522.1| UDP-N-acetylglucosaminyltransferase [Talaromyces marneffei ATCC
            18224]
          Length = 1667

 Score =  139 bits (350), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 69/164 (42%), Positives = 107/164 (65%), Gaps = 15/164 (9%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+DAI+  NFNQLYKI+P+T + W+ +L  +P ++LWLL+FP +GE N++  A     +
Sbjct: 1434 LPDDAIILGNFNQLYKIEPTTFRTWLRILSDIPKAVLWLLRFPELGEQNLKDCAVQWANE 1493

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K+ H+ R Q+ D+ LDTP CN HTT+ D+LW+GTP++T P     + 
Sbjct: 1494 EIASRIIFTDVAPKQAHIARAQVVDLFLDTPECNAHTTAADILWSGTPMLTFPRYKYKMC 1553

Query: 165  SRVAASQLATLGCP----------ELIARTHKEYQDIAIRLGTD 198
            SR+A+S L++   P          EL+A + ++Y+  A RL  D
Sbjct: 1554 SRMASSILSS-ALPDTEAGRQARNELVAISDEDYRAKASRLCRD 1596


>gi|392571498|gb|EIW64670.1| TPR-like protein [Trametes versicolor FP-101664 SS1]
          Length = 1237

 Score =  139 bits (350), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 68/153 (44%), Positives = 99/153 (64%), Gaps = 5/153 (3%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP+D I++ NFNQLYKIDP    +W+ +L+ VP S+LWLL+FP  GE  ++ TA+    +
Sbjct: 984  LPQDVIIFANFNQLYKIDPGIFLVWLRILRQVPRSVLWLLRFPGAGEEQLRRTAKMWAGE 1043

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
            +   RI F++VA K+ HV R ++AD+ LDT  CN HT + DVLWTGTP++T P     + 
Sbjct: 1044 EVAARIRFTDVAKKDWHVFRARVADLFLDTAECNAHTIAADVLWTGTPILTWPKHRHKMC 1103

Query: 165  SRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
            SRVAAS +   G   ++   + +EY+  A+ L 
Sbjct: 1104 SRVAASMVNATGFGAQMTVHSAEEYERRAVALA 1136


>gi|418056785|ref|ZP_12694836.1| Tetratricopeptide TPR_2 repeat-containing protein [Hyphomicrobium
           denitrificans 1NES1]
 gi|353207557|gb|EHB72964.1| Tetratricopeptide TPR_2 repeat-containing protein [Hyphomicrobium
           denitrificans 1NES1]
          Length = 818

 Score =  139 bits (349), Expect = 9e-31,   Method: Composition-based stats.
 Identities = 69/176 (39%), Positives = 103/176 (58%), Gaps = 5/176 (2%)

Query: 24  QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
           Q N +     E P      +R  +GLPEDA V+C+FN  YK++ +   +W+++L+ V  S
Sbjct: 595 QPNDRKRKISETP-----VTRADFGLPEDAFVFCSFNNSYKLNATMFDVWMSLLRRVAGS 649

Query: 84  ILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
           +LWLL   +    N++  A A G+D  R++F++     EH+ R +LAD+ LD   CN HT
Sbjct: 650 VLWLLVPTSACRENLRREAAARGVDPDRLVFASRLPIAEHLARHRLADLFLDALPCNAHT 709

Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           T+ D LW G PV+T  GET + RVAAS L   G PEL+  +  +Y + A+ L  D+
Sbjct: 710 TASDALWAGLPVLTAMGETFSGRVAASLLTAAGLPELVTGSLADYAEAALALAQDK 765


>gi|443326782|ref|ZP_21055424.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Xenococcus sp. PCC 7305]
 gi|442793575|gb|ELS03020.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Xenococcus sp. PCC 7305]
          Length = 1023

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 64/157 (40%), Positives = 96/157 (61%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           + SR++  LP    ++C FN  YKI P    +W+ +LKAVP+S+LWLL        N++ 
Sbjct: 525 VPSREECNLPPQGFIFCGFNNSYKITPEFFNIWMRLLKAVPHSVLWLLSKNQDMTENLRQ 584

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            AQA G+D  R++F+      +H+ R QLAD+ LD   CN HTT+ D LW G PV+T  G
Sbjct: 585 EAQARGVDPDRLVFAPKKDLPDHLARQQLADLFLDNLPCNAHTTTSDALWVGLPVLTCAG 644

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           ++ A+RVA S L  +G PEL+     +Y+ +A+++ T
Sbjct: 645 QSFAARVAGSLLHAIGLPELVTYNLPDYEALALKIAT 681


>gi|83312909|ref|YP_423173.1| O-linked N-acetylglucosamine transferase [Magnetospirillum
           magneticum AMB-1]
 gi|82947750|dbj|BAE52614.1| Predicted O-linked N-acetylglucosamine transferase
           [Magnetospirillum magneticum AMB-1]
          Length = 591

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 66/164 (40%), Positives = 101/164 (61%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           P+++    + ++GLPE+++V   FN  +KI+  TL +W++VL+A P+++LW ++F     
Sbjct: 377 PRALTREPKAKWGLPEESLVVAVFNNTFKINAETLAVWISVLQAQPDAVLWFVEFHPAAT 436

Query: 96  ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
           A+++A   A+G+D  R++F+   ++ EH+ R   AD+ LDT  C GHTT+ D LW G PV
Sbjct: 437 ASLRAMISAVGIDPARLIFAPRLSQAEHMARLSAADLFLDTWPCAGHTTASDALWAGVPV 496

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           V   G T ASRVA S L  LG  ELI  +   Y  +A  L  DR
Sbjct: 497 VAWVGRTFASRVAGSLLHALGLDELITESQGAYHALAQHLAKDR 540


>gi|145588309|ref|YP_001154906.1| hypothetical protein Pnuc_0121 [Polynucleobacter necessarius subsp.
           asymbioticus QLW-P1DMWA-1]
 gi|145046715|gb|ABP33342.1| TPR repeat-containing protein [Polynucleobacter necessarius subsp.
           asymbioticus QLW-P1DMWA-1]
          Length = 732

 Score =  138 bits (348), Expect = 1e-30,   Method: Composition-based stats.
 Identities = 69/160 (43%), Positives = 96/160 (60%)

Query: 38  SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
           S +I SR++ GLPE   V+C FN  YKI P+ L  WV +L AV  S+LWL +   +  AN
Sbjct: 515 SPLIKSRKELGLPESGFVFCCFNNNYKITPAVLDGWVKILLAVEGSVLWLYEDNPIAVAN 574

Query: 98  IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
           ++  A   GLD  R +F+      +H+ R + A++ LDT  CN HTT+ D LW G PV+T
Sbjct: 575 LKQEALTRGLDAGRFIFAGRMDSADHLARYKNANLFLDTTPCNAHTTASDALWAGLPVLT 634

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           L GE+  +RVAAS    +G   L   T +EY+ +AI+L T
Sbjct: 635 LAGESFGARVAASLNNAVGLSGLTVETQEEYEALAIQLAT 674


>gi|119357637|ref|YP_912281.1| hypothetical protein Cpha266_1841 [Chlorobium phaeobacteroides DSM
            266]
 gi|119354986|gb|ABL65857.1| TPR repeat-containing protein [Chlorobium phaeobacteroides DSM 266]
          Length = 3035

 Score =  138 bits (347), Expect = 1e-30,   Method: Composition-based stats.
 Identities = 67/158 (42%), Positives = 95/158 (60%)

Query: 41   ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
            + +R + GLPE   V+C FN  YKI P+T   W+ +L  VP S+LWL +  A    N++ 
Sbjct: 2816 VFTRAECGLPESGFVFCCFNNTYKITPATFDGWMRILGQVPGSVLWLYEENAKAAENLRR 2875

Query: 101  TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             A + G+D  R++F       E++ R ++AD+ LDT   N  TT+ D LW G PV+TL G
Sbjct: 2876 EAASRGVDAGRLIFGKRLPVAEYLARYRVADLFLDTLPYNAGTTASDALWAGLPVLTLRG 2935

Query: 161  ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            E+ ASR+AAS L  +G PELI    +EY+ +AI L  D
Sbjct: 2936 ESFASRMAASLLTAIGLPELITSGQEEYESLAIELALD 2973



 Score =  137 bits (346), Expect = 2e-30,   Method: Composition-based stats.
 Identities = 67/158 (42%), Positives = 95/158 (60%)

Query: 41   ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
            + +R + GLPE   V+C FN  YKI P+T   W+ +L  VP S+LWL +  A    N++ 
Sbjct: 1390 VFTRAECGLPESGFVFCCFNNNYKITPATFDGWMRILGQVPGSVLWLFEENAKAAENLRR 1449

Query: 101  TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             A + G+D  R++F       E++ R ++AD+ LDT   N  TT+ D LW G PV+TL G
Sbjct: 1450 EAASRGVDAGRLIFGKRLPVAEYLARYRVADLFLDTLPYNAGTTASDALWAGLPVLTLRG 1509

Query: 161  ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            E+ ASR+AAS L  +G PELI    +EY+ +AI L  D
Sbjct: 1510 ESFASRMAASLLTAIGLPELITSGQEEYESLAIELALD 1547



 Score =  133 bits (335), Expect = 4e-29,   Method: Composition-based stats.
 Identities = 64/158 (40%), Positives = 95/158 (60%)

Query: 41   ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
            + +R + GLPE   V+C FN  YKI P+T   W+ +L  VP S+L+L        +N++ 
Sbjct: 2035 VFTRAECGLPESGFVFCCFNNNYKITPATFDGWMRILGQVPGSVLFLYTDNEAAASNLKK 2094

Query: 101  TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             A++ G+ + R++F       EH+ R ++AD+ LDT   N  TT+ D LW G PV+TL G
Sbjct: 2095 EAESRGVKRDRLIFGKRLPLAEHLARYRVADLFLDTNPYNAGTTASDALWAGLPVLTLRG 2154

Query: 161  ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            E+ ASR+AAS L  +G PELI    +EY+ +A+ L  D
Sbjct: 2155 ESFASRMAASLLTAIGLPELITSGQEEYEALAVELALD 2192



 Score =  130 bits (328), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 65/158 (41%), Positives = 94/158 (59%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           + +R + GLPE   V+C FN  YKI P+T   W+ +L  V  S+L+L        +N++ 
Sbjct: 575 VFTRAECGLPESGFVFCCFNNNYKITPATFDGWMRILGQVEGSVLFLYTDNEAAASNLKK 634

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            A++ G+ + R++F       EH+ R ++AD+ LDT   N  TT+ D LW G PV+TL G
Sbjct: 635 EAESRGVKRDRLIFGKRLPLAEHLARYRVADLFLDTLPYNAGTTASDALWAGLPVLTLRG 694

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           E+ ASR+AAS L  +G PELI  T +EY+  AI L  D
Sbjct: 695 ESFASRMAASLLTAIGLPELITTTQEEYEARAIELALD 732


>gi|390604347|gb|EIN13738.1| hypothetical protein PUNSTDRAFT_140213 [Punctularia strigosozonata
            HHB-11173 SS5]
          Length = 1253

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 68/150 (45%), Positives = 99/150 (66%), Gaps = 5/150 (3%)

Query: 52   DAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ-- 109
            D +++ NF+QLYKI P+T  MW+ +L+ VP SILWLL+FPA G+ ++  TA A   D+  
Sbjct: 1006 DFVIFANFSQLYKICPATFSMWLKILERVPRSILWLLRFPAAGQEHLLRTASAWANDEVA 1065

Query: 110  HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLASRV 167
             RI F++V  K +HV RG++AD+ LDT  CN HT + D LW+GTP++T P     +ASRV
Sbjct: 1066 ARIRFTDVTDKHQHVIRGRVADLFLDTIECNAHTVAADALWSGTPLITWPRYSHKMASRV 1125

Query: 168  AASQLATLGCPE-LIARTHKEYQDIAIRLG 196
             AS +   G  + ++  + +EYQD A+ L 
Sbjct: 1126 GASIVNATGFGDRMVVHSEEEYQDRAVFLA 1155


>gi|393240238|gb|EJD47765.1| protein prenylyltransferase [Auricularia delicata TFB-10046 SS5]
          Length = 1522

 Score =  137 bits (346), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 71/153 (46%), Positives = 101/153 (66%), Gaps = 5/153 (3%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I++ NFNQLYKIDP     W+ +L+ VPNSILWLL+FPA GE N+++ A+    +
Sbjct: 1285 LHDDTIIFANFNQLYKIDPVVFATWLRILRQVPNSILWLLRFPAAGENNLRSFARLWAGE 1344

Query: 109  QH--RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE--TLA 164
            +   RI F++V AKE H++R ++AD+ LDT  CN HT + DVLW+GTP++T P     + 
Sbjct: 1345 EFSARIRFTDVCAKERHIQRSRVADLFLDTIECNAHTIATDVLWSGTPILTWPKYKFKMC 1404

Query: 165  SRVAAS-QLATLGCPELIARTHKEYQDIAIRLG 196
            SRVAAS   AT    ++   + +EY+  AI L 
Sbjct: 1405 SRVAASIARATGFGQQMTVSSLEEYEARAIALA 1437


>gi|400596152|gb|EJP63936.1| tetratricopeptide-like protein [Beauveria bassiana ARSEF 2860]
          Length = 811

 Score =  137 bits (345), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 72/160 (45%), Positives = 102/160 (63%), Gaps = 13/160 (8%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
           L +D ++  NFNQLYKIDPST + W+ +L  VP +ILWLL+FP  GEAN++ TA+     
Sbjct: 585 LQDDTVILANFNQLYKIDPSTFRSWLRILAQVPKAILWLLRFPEAGEANLRRTAELWAGP 644

Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
           +   R++F++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++T P     + 
Sbjct: 645 EVASRLVFTDVAQKSQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTYPRYPYKMC 704

Query: 165 SRVAASQL--ATLGCP-------ELIARTHKEYQDIAIRL 195
           +R+AAS L  A    P       ELIA + K Y+  AI L
Sbjct: 705 ARMAASILKGALPRTPEGQRVAGELIATSEKMYEKSAIEL 744


>gi|119497099|ref|XP_001265316.1| UDP-N-acetylglucosaminyltransferase [Neosartorya fischeri NRRL 181]
 gi|119413478|gb|EAW23419.1| UDP-N-acetylglucosaminyltransferase [Neosartorya fischeri NRRL 181]
          Length = 1661

 Score =  137 bits (344), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 69/160 (43%), Positives = 104/160 (65%), Gaps = 13/160 (8%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I+  NFNQLYKI+P+T + W+ +L  +PN++LWLL+FP +GE N++ TA A    
Sbjct: 1437 LSDDTIILGNFNQLYKIEPTTFRTWLRILARIPNAVLWLLRFPDLGEQNLRETAIAWAGR 1496

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++T P     + 
Sbjct: 1497 ETASRIIFTDVAPKNAHIARAKILDLFLDTPECNAHTTATDVLWSGTPLLTYPRYKYKMC 1556

Query: 165  SRVAASQLATL---------GCPELIARTHKEYQDIAIRL 195
            SR+A+S L++             EL A + ++Y+  AIRL
Sbjct: 1557 SRMASSILSSALPDSDAGRKARDELTAVSDEDYEGKAIRL 1596


>gi|300693287|ref|YP_003749260.1| hypothetical protein RPSI07_mp0265 [Ralstonia solanacearum PSI07]
 gi|299075324|emb|CBJ34618.1| conserved hypothethical protein with TPR repeat domain [Ralstonia
           solanacearum PSI07]
          Length = 675

 Score =  137 bits (344), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 67/155 (43%), Positives = 95/155 (61%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   GLP D  V+C FNQ YKI  +    W  +L   P S+LWLL+  A  +A +  +A+
Sbjct: 471 RADVGLPADGFVFCCFNQAYKITEARANTWFTILSRTPGSVLWLLEPDASAKAALLESAR 530

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G++ HR++F+   A+ EH+ R QLAD+ LDT     HTT+ D+LW G P++T  G+T+
Sbjct: 531 RHGVESHRLVFAPQVAQREHIARLQLADLALDTFPYTSHTTASDLLWAGVPLLTRMGDTM 590

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           ASRVAAS L   G  +L+  T ++Y D A+RL  D
Sbjct: 591 ASRVAASILQAAGLHDLVVTTEEDYVDAAVRLAGD 625


>gi|344171339|emb|CCA83825.1| conserved hypothethical protein with TPR repeat domain [blood
           disease bacterium R229]
          Length = 646

 Score =  137 bits (344), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 67/155 (43%), Positives = 95/155 (61%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   GLP D  V+C FNQ YKI  +    W  +L   P S+LWLL+  A  +A +  +A+
Sbjct: 442 RADVGLPADGFVFCCFNQAYKITEARANTWFTILSRTPGSVLWLLEPDASAKAALLESAR 501

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G++ HR++F+   A+ EH+ R QLAD+ LDT     HTT+ D+LW G P++T  G+T+
Sbjct: 502 RHGVESHRLVFAPQVAQREHIARLQLADLALDTFPYTSHTTASDLLWAGVPLLTRMGDTM 561

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           ASRVAAS L   G  +L+  T ++Y D A+RL  D
Sbjct: 562 ASRVAASILQAAGLHDLVVTTEEDYVDAAVRLAGD 596


>gi|145588676|ref|YP_001155273.1| hypothetical protein Pnuc_0489 [Polynucleobacter necessarius subsp.
           asymbioticus QLW-P1DMWA-1]
 gi|145047082|gb|ABP33709.1| TPR repeat-containing protein [Polynucleobacter necessarius subsp.
           asymbioticus QLW-P1DMWA-1]
          Length = 618

 Score =  137 bits (344), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 69/156 (44%), Positives = 91/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R + GLPE   VYC FN  YKI P T   W+ +LKAV  S+LWL++     E N++A A
Sbjct: 412 TRAELGLPESGFVYCCFNNNYKITPETFDSWMRILKAVEGSVLWLIQDNVPAEENLKAEA 471

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+   RI+F+      EH+ R ++AD+ LDT   N HTT+ D LW G PV+TL G T
Sbjct: 472 LKRGISSDRIIFAQRLPLPEHLARHKIADLFLDTLPYNAHTTASDALWAGVPVLTLLGNT 531

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
              RVAAS L  +G  EL+  T + Y+  AI L  D
Sbjct: 532 FPGRVAASLLNAVGLAELVTHTPQAYEQRAIELARD 567


>gi|398825494|ref|ZP_10583785.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Bradyrhizobium sp. YR681]
 gi|398223362|gb|EJN09707.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Bradyrhizobium sp. YR681]
          Length = 757

 Score =  137 bits (344), Expect = 3e-30,   Method: Composition-based stats.
 Identities = 64/163 (39%), Positives = 98/163 (60%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           P +  + +R++ GLP    ++C FN  YKI P    +W+ +L+ V  S+LWL    +  E
Sbjct: 539 PDAAHMPTREEAGLPCHGFIFCCFNSSYKITPEIFDIWMRLLRQVSGSVLWLYADRSSAE 598

Query: 96  ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
            N++  A A G+D  R++F+   A+ +H+ R +LAD+ LDT   N HTT+ D LW G PV
Sbjct: 599 VNLRQAAAAHGIDPARLVFARRMAQGDHLARHRLADLFLDTLPYNAHTTAADALWAGLPV 658

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           VT PG++ A RVA+S L  +G PEL+      Y+ +A+RL  +
Sbjct: 659 VTCPGKSFAGRVASSLLRGIGMPELVTSDLGSYERLALRLALE 701


>gi|345564020|gb|EGX47001.1| hypothetical protein AOL_s00097g47 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1822

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 60/126 (47%), Positives = 92/126 (73%), Gaps = 4/126 (3%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D ++  NFNQLYKI+P+T + W+ +L  VPN++LWLL+FP +GE+N+   A+    D
Sbjct: 1598 LSDDKVILGNFNQLYKIEPTTFRTWLRILMQVPNAVLWLLRFPDLGESNLLRFARLWAGD 1657

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE--TLA 164
            +   RI+F++VAAK++H+ R ++ D+ LDTP CN HTT+ DVLW+G+P++T P     + 
Sbjct: 1658 EVASRIIFTDVAAKDQHISRARVCDLFLDTPECNAHTTAADVLWSGSPLLTFPRHEYKMC 1717

Query: 165  SRVAAS 170
            SR+AAS
Sbjct: 1718 SRIAAS 1723


>gi|398379826|ref|ZP_10537946.1| hypothetical protein PMI03_03578 [Rhizobium sp. AP16]
 gi|397722458|gb|EJK83002.1| hypothetical protein PMI03_03578 [Rhizobium sp. AP16]
          Length = 647

 Score =  136 bits (343), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 67/155 (43%), Positives = 98/155 (63%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           T+R Q+GLPEDA V+ +FN   KI    + +W N+LK   NS+LWLL      EAN+ A 
Sbjct: 416 TTRAQHGLPEDAFVFASFNGNRKITSQMVDVWCNILKRTKNSVLWLLCNGPRAEANLWAR 475

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
            +A G+++ R++F+     E+H+ R QLAD+ LDT   NGHTT+ + LW G PV+T+ G 
Sbjct: 476 LEARGVNRKRVVFTTRIRYEDHIDRQQLADLGLDTFPVNGHTTTSEQLWGGLPVLTMKGT 535

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
             ASRV+ S L  +G PEL+A   + Y+++A+   
Sbjct: 536 NFASRVSESLLNAIGVPELVAADVQAYENMAVEFA 570


>gi|392378325|ref|YP_004985485.1| conserved protein of unknown function; putative TPR domain
           [Azospirillum brasilense Sp245]
 gi|356879807|emb|CCD00735.1| conserved protein of unknown function; putative TPR domain
           [Azospirillum brasilense Sp245]
          Length = 827

 Score =  136 bits (343), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 69/156 (44%), Positives = 93/156 (59%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR   GLPED  V+C FN  YK+ P+   +W  +L+AVP S+LWL     +   N++  A
Sbjct: 624 SRADCGLPEDGFVFCCFNNPYKLTPAVFDVWARLLRAVPGSVLWLYAGNPLVAGNLRGEA 683

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   R++F+      EH+ R +LAD+ LDT   N HTT+ D LWTG PVVT  G T
Sbjct: 684 AARGVAPDRLVFAPPRPLVEHLARHRLADLFLDTLPYNAHTTASDALWTGLPVVTCRGAT 743

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A RVAAS L T+G PEL+  +   Y+ +A+ L  D
Sbjct: 744 FAGRVAASLLDTVGLPELVTDSPAAYEALALGLARD 779


>gi|222084872|ref|YP_002543401.1| glycosyltransferase domain-containing protein [Agrobacterium
           radiobacter K84]
 gi|221722320|gb|ACM25476.1| glycosyltransferase TPR domain protein [Agrobacterium radiobacter
           K84]
          Length = 673

 Score =  136 bits (343), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 67/155 (43%), Positives = 98/155 (63%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           T+R Q+GLPEDA V+ +FN   KI    + +W N+LK   NS+LWLL      EAN+ A 
Sbjct: 442 TTRAQHGLPEDAFVFASFNGNRKITSQMVDVWCNILKRTKNSVLWLLCNGPRAEANLWAR 501

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
            +A G+++ R++F+     E+H+ R QLAD+ LDT   NGHTT+ + LW G PV+T+ G 
Sbjct: 502 LEARGVNRKRVVFTTRIRYEDHIDRQQLADLGLDTFPVNGHTTTSEQLWGGLPVLTMKGT 561

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
             ASRV+ S L  +G PEL+A   + Y+++A+   
Sbjct: 562 NFASRVSESLLNAIGVPELVAADVQAYENMAVEFA 596


>gi|86138806|ref|ZP_01057378.1| Glycosyl transferase group 1:TPR repeat [Roseobacter sp. MED193]
 gi|85824453|gb|EAQ44656.1| Glycosyl transferase group 1:TPR repeat [Roseobacter sp. MED193]
          Length = 629

 Score =  136 bits (343), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 64/158 (40%), Positives = 95/158 (60%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           I +R + GLPED  V+C+ N  YK+ P    +W+ +L AVP+S+LWLL      + N+  
Sbjct: 406 IPTRTELGLPEDKFVFCSLNNSYKVTPVEYDIWMRLLHAVPDSVLWLLAANDYVQKNLID 465

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            A A G+   R+ F+   +   H+ R   AD+ LDT  C  HTT+ + LW+G P++T PG
Sbjct: 466 EAAARGIGPERLFFAGRVSTTAHLARLPQADLFLDTFNCCAHTTASETLWSGVPLITKPG 525

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +  ASRVAAS L  +GC +LI  + + Y D+A++L  D
Sbjct: 526 DQFASRVAASILTAIGCEDLITDSAENYYDLALKLAQD 563


>gi|126737136|ref|ZP_01752871.1| TPR repeat protein [Roseobacter sp. SK209-2-6]
 gi|126721721|gb|EBA18424.1| TPR repeat protein [Roseobacter sp. SK209-2-6]
          Length = 738

 Score =  136 bits (343), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 62/156 (39%), Positives = 92/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R   GLP+D  V+C+FN  YKI P    +W+ +L     S+LWLL+       N +  A
Sbjct: 503 TRADCGLPDDGFVFCSFNNSYKITPREFDIWMRLLAQTDGSVLWLLQTSQSSTENFRQEA 562

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+D  R++F+    + EH+ R Q+AD+ LD+   N HTT  D LW G P++TLPG+ 
Sbjct: 563 EKRGIDASRLIFAPPLPQAEHLARQQVADLFLDSFTVNAHTTGSDALWAGVPILTLPGKQ 622

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RV AS L  +G PE+IA +  +Y+  A+ L  D
Sbjct: 623 FAARVGASLLHAVGLPEMIATSEADYEKRALELAHD 658


>gi|424880058|ref|ZP_18303690.1| hypothetical protein Rleg8DRAFT_1588 [Rhizobium leguminosarum bv.
           trifolii WU95]
 gi|392516421|gb|EIW41153.1| hypothetical protein Rleg8DRAFT_1588 [Rhizobium leguminosarum bv.
           trifolii WU95]
          Length = 638

 Score =  136 bits (342), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 66/156 (42%), Positives = 95/156 (60%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +RQQ GLP+DA ++ +FN   KI P T+  W  +LK  PNS+LWL+      +AN+    
Sbjct: 418 TRQQLGLPDDAFIFASFNGNRKITPETIGSWCRILKRAPNSVLWLMANTPRNQANLLKQF 477

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G+   RI+F   A  EEH+ R Q AD+ +DT   NGHTT+ + LW G PV+T+ G  
Sbjct: 478 QAAGISAKRIIFCPRAPYEEHIDRQQAADIGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 537

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRV+ S L  +  P+L+A   + Y+D+A+ L  +
Sbjct: 538 FASRVSESLLRAIDLPDLVADDLQAYEDMAVELAEN 573


>gi|23016713|ref|ZP_00056466.1| COG3914: Predicted O-linked N-acetylglucosamine transferase,
           SPINDLY family [Magnetospirillum magnetotacticum MS-1]
          Length = 602

 Score =  135 bits (341), Expect = 7e-30,   Method: Composition-based stats.
 Identities = 67/157 (42%), Positives = 95/157 (60%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           + ++GLPEDA V   FN  +K +  TL +WV VL+A P+++LW ++F     A+++A   
Sbjct: 396 KAKWGLPEDAFVIAVFNNSFKFNAETLAVWVAVLQAQPDAVLWFVEFHPAATASLRAMMG 455

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A+G+D  R++F+    + EH+ R   AD+ LDT  C GHTT+ D LW G P+V   G T 
Sbjct: 456 AVGIDASRLIFAPRLPQAEHMLRLSAADLFLDTWPCAGHTTASDALWAGVPLVAWAGRTF 515

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           ASRVA S L  LG  ELIA +   Y  +A  L  DR+
Sbjct: 516 ASRVAGSLLHALGLDELIAESQGGYHALAQHLAKDRE 552


>gi|456358776|dbj|BAM93221.1| hypothetical protein S58_72570 [Agromonas oligotrophica S58]
          Length = 453

 Score =  135 bits (341), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 71/177 (40%), Positives = 100/177 (56%), Gaps = 8/177 (4%)

Query: 22  TNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVP 81
           + Q + +TA G          +R + GLPE   V+C FNQ YK+ P+   +W  +L A P
Sbjct: 230 SYQPHGRTALGPP-------PTRAEVGLPEAGFVFCCFNQAYKLTPTVFDLWCRLLDATP 282

Query: 82  NSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNG 141
           +S+LWLL      E N++  A   G+   R++F+    + EH+RR QLAD+ LDT     
Sbjct: 283 DSVLWLLA-SDQAEGNLRGEALRRGVAPGRLVFAPEMKQSEHLRRLQLADLVLDTAPYGA 341

Query: 142 HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           HTT+ D LW G P+VT  G+T ASRVA S L  +G PELIA    +Y  +A+ L  +
Sbjct: 342 HTTASDALWAGVPIVTCAGDTFASRVAGSLLHAVGLPELIAADEADYVAVALTLAAE 398


>gi|158425090|ref|YP_001526382.1| hypothetical protein AZC_3466 [Azorhizobium caulinodans ORS 571]
 gi|158331979|dbj|BAF89464.1| TPR repeat protein [Azorhizobium caulinodans ORS 571]
          Length = 779

 Score =  135 bits (341), Expect = 7e-30,   Method: Composition-based stats.
 Identities = 68/161 (42%), Positives = 94/161 (58%)

Query: 38  SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
           S  + SR   GLPE   V+C FN  YKI P+   +W+ +L  VP+S+LWL +  A    N
Sbjct: 567 SETLPSRAACGLPESGFVFCCFNNTYKITPAVFSVWMRLLDEVPDSVLWLYEANAAARDN 626

Query: 98  IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
           +   A   G++  RI+F+  A   +H+ R   AD+ LDT   N HTT+ D LW G P+VT
Sbjct: 627 LAYEAAKFGIEPDRIIFAPNALLADHLARHAHADLFLDTLPYNAHTTASDALWAGVPIVT 686

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             GET A+RVAAS L  +G PELI  +  +Y+ +A+ L  D
Sbjct: 687 CAGETFAARVAASLLDAVGMPELITTSLADYEALALALARD 727


>gi|402771865|ref|YP_006591402.1| hypothetical protein BN69_1300 [Methylocystis sp. SC2]
 gi|401773885|emb|CCJ06751.1| TPR repeat protein [Methylocystis sp. SC2]
          Length = 451

 Score =  135 bits (341), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 67/156 (42%), Positives = 91/156 (58%), Gaps = 1/156 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR   GLP +  VYC FNQ YK+ P    +W  +L+A P ++LWL     + E N++   
Sbjct: 244 SRAAAGLPAEGFVYCCFNQAYKLTPFIFDLWARLLEATPGAVLWL-SAAMLAEGNLRNEM 302

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+D  R++F+    + EH+ R QLAD+ LDT     HTT+ D LW G P+VT  G+T
Sbjct: 303 RRRGIDAARMIFAPHLPQAEHLARLQLADLALDTAPFGSHTTASDALWAGVPIVTCAGDT 362

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             SRVA S L  +G PELIA    EY +IA+ L  D
Sbjct: 363 FPSRVAGSLLHAIGMPELIAADFDEYLEIALVLAGD 398


>gi|387131239|ref|YP_006294129.1| TPR domain-containing protein [Methylophaga sp. JAM7]
 gi|386272528|gb|AFJ03442.1| TPR domain protein, putative component of TonB system [Methylophaga
           sp. JAM7]
          Length = 646

 Score =  135 bits (341), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 63/155 (40%), Positives = 98/155 (63%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           + R+ +GLP++AIV C+FNQLYK+  +T Q W+++L  +P ++LW+L     G   +Q +
Sbjct: 415 SCREDHGLPQEAIVLCSFNQLYKVTEATWQSWLSILVRIPETVLWVLAPSQTGRVVLQQS 474

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
             A G+D  R++F++   + +++ R Q AD+ LDT + N HTT++D LW G P+V + G 
Sbjct: 475 LVAHGIDSQRLIFADRTGQAKNLERMQHADLMLDTEIYNAHTTAVDALWAGVPIVAVKGG 534

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
             ASRV+AS L   G PELI     E  D+A+ L 
Sbjct: 535 HFASRVSASLLQACGLPELITHNLDEMADLAVALS 569


>gi|384263551|ref|YP_005418740.1| Predicted O-linked N-acetylglucosamine transferase [Rhodospirillum
           photometricum DSM 122]
 gi|378404654|emb|CCG09770.1| Predicted O-linked N-acetylglucosamine transferase [Rhodospirillum
           photometricum DSM 122]
          Length = 682

 Score =  135 bits (340), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 68/158 (43%), Positives = 93/158 (58%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           + SR + GLPE+A+VYC FN  YKI   T + W++VL AVP+ +LWLL         +QA
Sbjct: 431 VPSRAEAGLPEEAVVYCCFNAAYKITRFTFERWISVLAAVPDGVLWLLDPGRTTRERLQA 490

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            AQA G+D  R++F+       H+ R  LAD+ LDT     HTT+ D LW G PV+T  G
Sbjct: 491 EAQARGIDPARLVFAPRVPNPVHLARYALADLFLDTAPYGAHTTASDALWRGVPVLTWSG 550

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
              ASRV  S +   G P+L+ ++ ++Y   AI LG D
Sbjct: 551 RAFASRVCGSLVRAAGLPDLVCKSPEDYTARAIMLGQD 588


>gi|319761242|ref|YP_004125179.1| tetratricopeptide tpr_1 repeat-containing protein [Alicycliphilus
           denitrificans BC]
 gi|317115803|gb|ADU98291.1| Tetratricopeptide TPR_1 repeat-containing protein [Alicycliphilus
           denitrificans BC]
          Length = 734

 Score =  135 bits (340), Expect = 9e-30,   Method: Composition-based stats.
 Identities = 72/157 (45%), Positives = 95/157 (60%), Gaps = 2/157 (1%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK-FPAVGEANIQAT 101
           +R+  GLP    V+C FN  YKI P  L  W  +L AV +S+LWLL+  PAV   N+   
Sbjct: 527 TREALGLPATGFVFCCFNNNYKILPQMLDGWGRILHAVEDSVLWLLEDNPAVSR-NLLRE 585

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           AQA G+   R++F+     +EH+ R +LAD+ LDT  CN HTT+ D LW G PV+T  G+
Sbjct: 586 AQARGIAPQRLVFAQRMPLDEHLARHRLADLFLDTLPCNAHTTASDALWAGLPVLTCAGQ 645

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           + ASRVAAS L  +G PEL+  T   Y+  AI L  D
Sbjct: 646 SFASRVAASLLHAVGLPELVTETQGAYEARAIALARD 682


>gi|330823109|ref|YP_004386412.1| hypothetical protein Alide2_0477 [Alicycliphilus denitrificans
           K601]
 gi|329308481|gb|AEB82896.1| Tetratricopeptide TPR_1 repeat-containing protein [Alicycliphilus
           denitrificans K601]
          Length = 734

 Score =  135 bits (340), Expect = 9e-30,   Method: Composition-based stats.
 Identities = 72/157 (45%), Positives = 95/157 (60%), Gaps = 2/157 (1%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK-FPAVGEANIQAT 101
           +R+  GLP    V+C FN  YKI P  L  W  +L AV +S+LWLL+  PAV   N+   
Sbjct: 527 TREALGLPATGFVFCCFNNNYKILPQMLDGWGRILHAVEDSVLWLLEDNPAVSR-NLLRE 585

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           AQA G+   R++F+     +EH+ R +LAD+ LDT  CN HTT+ D LW G PV+T  G+
Sbjct: 586 AQARGIAPQRLVFAQRMPLDEHLARHRLADLFLDTLPCNAHTTASDALWAGLPVLTCAGQ 645

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           + ASRVAAS L  +G PEL+  T   Y+  AI L  D
Sbjct: 646 SFASRVAASLLHAVGLPELVTETQGAYEARAIALARD 682


>gi|440225535|ref|YP_007332626.1| TPR repeat-containing protein [Rhizobium tropici CIAT 899]
 gi|440037046|gb|AGB70080.1| TPR repeat-containing protein [Rhizobium tropici CIAT 899]
          Length = 647

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 68/155 (43%), Positives = 98/155 (63%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           T+R Q+ LPEDA V+ +FN   KI    + +W N+LK   NS+LWLL      EAN+ A 
Sbjct: 416 TTRAQHKLPEDAFVFASFNGNRKITSQMVDVWCNILKRSKNSVLWLLCNGPRAEANLWAR 475

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
            +A G+ + R++F+     E+H+ R QLAD+ LDT   NGHTT+ + LW G PV+T+ G 
Sbjct: 476 LEARGISRKRVVFTTRIRYEDHIDRQQLADLGLDTFPVNGHTTTSEQLWGGLPVLTVKGT 535

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
             ASRV+ S L  +G PEL+A   + Y+D+A++L 
Sbjct: 536 NFASRVSESLLNAIGLPELVAADIQTYEDMAVQLA 570


>gi|319791963|ref|YP_004153603.1| hypothetical protein Varpa_1276 [Variovorax paradoxus EPS]
 gi|315594426|gb|ADU35492.1| Tetratricopeptide repeat [Variovorax paradoxus EPS]
          Length = 833

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 65/159 (40%), Positives = 97/159 (61%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
            +R  +GLPEDA V CNF+Q +KI P T   WV +++ +PN++LWL + P   E N++A 
Sbjct: 617 ATRADHGLPEDAFVLCNFSQSFKIQPETFAAWVRIVRRIPNAVLWLAQGPNGFETNLRAQ 676

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
            +A GL   R++ S     + H+ R  LAD+ +DT   +   T+ DVLW G P++ L G 
Sbjct: 677 WEAAGLPAERLIVSPRMPIDRHLARIGLADLFIDTFPYSSGATANDVLWAGVPLLALTGT 736

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           T+ SR+A S L+ +G PEL+A T  EY + A+   T R+
Sbjct: 737 TMVSRMAGSLLSAIGLPELVATTRDEYVEKAVHYATHRN 775


>gi|217978051|ref|YP_002362198.1| hypothetical protein Msil_1891 [Methylocella silvestris BL2]
 gi|217503427|gb|ACK50836.1| conserved hypothetical protein [Methylocella silvestris BL2]
          Length = 492

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 69/158 (43%), Positives = 92/158 (58%), Gaps = 1/158 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R Q GLPE   V+C FNQ YKI P+   +W  +L   P S+LWLL+     E N++  A
Sbjct: 280 TRAQAGLPEQGFVFCCFNQAYKITPAVFDLWRRLLDEFPGSVLWLLR-AEQAEGNLRGEA 338

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+   R++F+    + +H+ R QLAD+ LDT     HTT+ D LW G P+VT  GET
Sbjct: 339 LRRGVSPDRLIFAPDMNQVDHLGRLQLADLVLDTSPYGAHTTASDALWAGVPIVTFAGET 398

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             SRVA S L  +G PELIA   +EY  IA  L ++ D
Sbjct: 399 FPSRVAGSLLHAVGMPELIAADAEEYLAIASVLASEPD 436


>gi|182677437|ref|YP_001831583.1| hypothetical protein Bind_0441 [Beijerinckia indica subsp. indica
           ATCC 9039]
 gi|182633320|gb|ACB94094.1| TPR repeat-containing protein [Beijerinckia indica subsp. indica
           ATCC 9039]
          Length = 1085

 Score =  135 bits (339), Expect = 1e-29,   Method: Composition-based stats.
 Identities = 65/156 (41%), Positives = 94/156 (60%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R   GLPED  V+C FN  YK+ P    +W+ +L+AVP S+ WLL+   +   N+Q  A
Sbjct: 559 TRADCGLPEDGFVFCCFNNSYKLTPVFFDLWMRLLQAVPGSVFWLLESGPLVRENLQHEA 618

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+D +R++F+   A  +H+ R +LAD+ LD    N HTT+ D LW G PV+T  GET
Sbjct: 619 EKRGIDPNRLVFAPRIAIPDHLARHRLADLFLDCLPYNAHTTTSDALWAGLPVLTCAGET 678

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A RVA S L  +G PELI  +  +Y+  A+ L  +
Sbjct: 679 FAGRVAGSLLQAVGLPELITTSLADYEARALHLARN 714


>gi|322709482|gb|EFZ01058.1| UDP-N-acetylglucosaminyltransferase [Metarhizium anisopliae ARSEF 23]
          Length = 1605

 Score =  135 bits (339), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 64/128 (50%), Positives = 90/128 (70%), Gaps = 4/128 (3%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L  DAI+  NFNQLYKIDP+  + W+ +L  VP ++LWLL+FP  GE+N++ATA+A    
Sbjct: 1379 LGNDAIILGNFNQLYKIDPTIFRAWLRILARVPRAVLWLLRFPEPGESNLRATAKAWAGP 1438

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   RI+F++VA K +H+ R  + D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1439 EVADRIIFTDVAPKSQHISRATVCDMFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1498

Query: 165  SRVAASQL 172
            SR+AAS L
Sbjct: 1499 SRMAASIL 1506


>gi|148555054|ref|YP_001262636.1| hypothetical protein Swit_2139 [Sphingomonas wittichii RW1]
 gi|148500244|gb|ABQ68498.1| TPR repeat-containing protein [Sphingomonas wittichii RW1]
          Length = 654

 Score =  135 bits (339), Expect = 1e-29,   Method: Composition-based stats.
 Identities = 69/170 (40%), Positives = 97/170 (57%)

Query: 30  ATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK 89
           AT +  P +     R   GLPE   V+ +FN  YKI P    +W+ +L  V  S+LWLL 
Sbjct: 432 ATDDRRPIADRPPRRADVGLPEQGFVFASFNASYKITPDLFAIWMRLLDRVEGSVLWLLG 491

Query: 90  FPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVL 149
             A   AN++  A+  G+D  R+LF+      EH+ R +LAD+ LD+ +CN HTT+ D L
Sbjct: 492 DDADAAANLRGHARRHGIDPARLLFAAKLPVAEHLARQKLADLFLDSFVCNAHTTASDAL 551

Query: 150 WTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           W G P+VT  G   A+RV AS L  +G PEL+  +  +Y+ +A  L TDR
Sbjct: 552 WAGLPIVTRIGAGFAARVCASLLRAVGLPELVTDSAADYERLAFALATDR 601


>gi|421587794|ref|ZP_16033148.1| hypothetical protein RCCGEPOP_04026 [Rhizobium sp. Pop5]
 gi|403707632|gb|EJZ22587.1| hypothetical protein RCCGEPOP_04026 [Rhizobium sp. Pop5]
          Length = 638

 Score =  135 bits (339), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 95/156 (60%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R++ GLPEDA ++ +FN   KI P T+  W  +LK  PNS+LWL+      +AN+    
Sbjct: 418 TREKLGLPEDAFIFASFNGNRKITPETIDSWCRILKRAPNSVLWLMANSPRNQANLLKRF 477

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q+ G+   RI+F   A  EEH+ R Q AD+ +DT   NGHTT+ + LW G PV+T+ G  
Sbjct: 478 QSSGISSKRIIFCPRAPYEEHIDRQQAADIGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 537

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRV+ S L  +  P+L+A   + Y+D+A+ L  +
Sbjct: 538 FASRVSESLLRAIDLPDLVADDLQAYEDMAVELAEN 573


>gi|372488421|ref|YP_005027986.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Dechlorosoma suillum PS]
 gi|359354974|gb|AEV26145.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Dechlorosoma suillum PS]
          Length = 625

 Score =  135 bits (339), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 68/175 (38%), Positives = 102/175 (58%), Gaps = 7/175 (4%)

Query: 25  TNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSI 84
           T++     E  P      SR + GLPE  +V+C+F+  YK+ P    +W+N+L+ VP S+
Sbjct: 404 TDSGVKISERTP------SRSECGLPEQGVVFCSFSHDYKVSPPVFDIWMNLLRQVPGSV 457

Query: 85  LWLLKFPAVGEANIQATAQALGLDQHRILFS-NVAAKEEHVRRGQLADVCLDTPLCNGHT 143
           LWL+    V + N++  AQA G+D  R++F+  V   E+H+ R + AD+ LDT   N HT
Sbjct: 458 LWLVSRNEVSQGNLRKEAQARGVDPSRLVFAGRVPLVEDHLARYRQADIFLDTHPYNAHT 517

Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           T+ D L  G PVVT  G    +RVA S +  +G PEL  ++  EY+ +A+ L TD
Sbjct: 518 TAADALMAGLPVVTYMGNAFPARVAGSLVHAVGLPELATKSLAEYEALALALATD 572


>gi|392378891|ref|YP_004986051.1| protein of unknown function; putative TPR domain [Azospirillum
           brasilense Sp245]
 gi|356880373|emb|CCD01322.1| protein of unknown function; putative TPR domain [Azospirillum
           brasilense Sp245]
          Length = 670

 Score =  135 bits (339), Expect = 1e-29,   Method: Composition-based stats.
 Identities = 67/158 (42%), Positives = 93/158 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR   GLPE  +V+C FN  YKI P+   +W  +L+AVP+S+LWLL       AN++  A
Sbjct: 459 SRAACGLPERGVVFCCFNAAYKITPALFDLWCRLLRAVPDSVLWLLDSHPEASANLRREA 518

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+   R++F+      EH+ R +LAD+ LDT     HTT+ D LW G PV+T+ G+ 
Sbjct: 519 VRRGVAADRLVFAPRRPPAEHLARYRLADLFLDTTPVGAHTTASDALWAGLPVLTVAGKG 578

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            ASRV  S L  +G PEL  R+  +Y+  A+RL  D D
Sbjct: 579 FASRVGMSLLRAVGLPELAVRSPADYEAAAVRLAGDPD 616


>gi|384083989|ref|ZP_09995164.1| Tetratricopeptide TPR_4 [Acidithiobacillus thiooxidans ATCC 19377]
          Length = 708

 Score =  135 bits (339), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 64/158 (40%), Positives = 96/158 (60%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +RQ+ GLPE  +V+C  N  YK+DP    +W  +L+ +P S LWLL+ P     N++  A
Sbjct: 507 TRQEAGLPEQGLVFCCMNNSYKLDPFVFSIWCRLLEQIPGSHLWLLQGPQEMVTNLREAA 566

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   R++F+   ++++H+ R Q AD+ LDT   N HTT+ D LW G PV+T+ GET
Sbjct: 567 SAQGIQSERLIFAPRVSRQQHLTRLQCADMFLDTRFYNAHTTATDALWAGVPVLTVAGET 626

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            ++RVAAS +  L  PEL+      Y+  A+RL  + D
Sbjct: 627 FSARVAASLVHALEMPELVQPDWAGYEAEALRLAQNPD 664


>gi|85710265|ref|ZP_01041330.1| TPR repeat protein [Erythrobacter sp. NAP1]
 gi|85688975|gb|EAQ28979.1| TPR repeat protein [Erythrobacter sp. NAP1]
          Length = 694

 Score =  134 bits (338), Expect = 2e-29,   Method: Composition-based stats.
 Identities = 68/166 (40%), Positives = 97/166 (58%), Gaps = 2/166 (1%)

Query: 35  VPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
           VPQ     +R  +GLP DA V+C FN  YKI      +W+  L AV  S+LWL +  AV 
Sbjct: 479 VPQPS--DTRADHGLPGDAFVFCCFNASYKITRDRFALWMRALGAVEGSVLWLYRSDAVA 536

Query: 95  EANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
           EAN++  A+  G+D HR++F+    + +H+ R + A++ LDT     HTT+ D LW G P
Sbjct: 537 EANLRMEAKRCGIDPHRLVFAYHLPRTQHLARHRHANLALDTSAYGAHTTASDCLWAGLP 596

Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           +VT  G+  A+RVAAS L   G  EL+  +  EY+ + + L TD D
Sbjct: 597 IVTRAGDQFAARVAASVLHAAGLDELVTHSDAEYETLIVALATDAD 642


>gi|302511405|ref|XP_003017654.1| hypothetical protein ARB_04536 [Arthroderma benhamiae CBS 112371]
 gi|291181225|gb|EFE37009.1| hypothetical protein ARB_04536 [Arthroderma benhamiae CBS 112371]
          Length = 1707

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 70/171 (40%), Positives = 103/171 (60%), Gaps = 23/171 (13%)

Query: 49   LPEDAIVYCNFNQLYK----------IDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI 98
            L +D ++  NFNQLYK          I+P+T + W+ +L  +PN+ILWLL+FP VGE N+
Sbjct: 1472 LKDDTVILGNFNQLYKNMNKLTMSCKIEPTTFRTWLRILAGIPNAILWLLRFPDVGEHNL 1531

Query: 99   QATAQALGLDQ--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVV 156
            + TA+A   +    RI+F++VA K  H+ R ++ D+ LDTP CN HTT+ DVLW+GTP++
Sbjct: 1532 RQTAKAWAGEATASRIVFTDVAPKHTHISRARICDLFLDTPECNAHTTAADVLWSGTPLL 1591

Query: 157  TLP--GETLASRVAASQLATL---------GCPELIARTHKEYQDIAIRLG 196
            T P     + SR+A+S L +             +LI+    EY+  AI LG
Sbjct: 1592 TFPRYKYKMCSRMASSILTSALPQTEDGKQAARQLISSDEVEYEKTAIELG 1642


>gi|358377457|gb|EHK15141.1| hypothetical protein TRIVIDRAFT_196588 [Trichoderma virens Gv29-8]
          Length = 1566

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 67/158 (42%), Positives = 100/158 (63%), Gaps = 13/158 (8%)

Query: 51   EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
            +D I+  NFNQLYKIDP+T + W+ +L   P +ILWLL+FP +GE N++ TA++    + 
Sbjct: 1342 DDVIILGNFNQLYKIDPTTFRSWLRILARTPKAILWLLRFPELGETNLRQTAESWAGAEV 1401

Query: 110  -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET--LASR 166
              R++F++VA K +H+ R ++ D+ LDT  CN HTT+ DVLW+ TP++TLP  +  + SR
Sbjct: 1402 ASRLIFTDVAPKSQHITRARVCDLFLDTAECNAHTTAADVLWSSTPLLTLPRYSYKMCSR 1461

Query: 167  VAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            +AAS                 ELIA +  EY+D A +L
Sbjct: 1462 MAASIFRGALPKTAEGRQAAEELIADSETEYEDSATQL 1499


>gi|241203113|ref|YP_002974209.1| hypothetical protein Rleg_0359 [Rhizobium leguminosarum bv.
           trifolii WSM1325]
 gi|240857003|gb|ACS54670.1| conserved hypothetical protein [Rhizobium leguminosarum bv.
           trifolii WSM1325]
          Length = 657

 Score =  134 bits (337), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 94/156 (60%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+Q GLP+DA ++ +FN   KI P T+  W  +LK  PNS+LWL+      +AN+    
Sbjct: 437 TREQLGLPDDAFIFASFNGNRKITPETIDSWCRILKRAPNSVLWLMANTPRNQANLLKQF 496

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+   RI+F   A  E+H+ R Q AD+ +DT   NGHTT+ + LW G PV+T+ G  
Sbjct: 497 QTAGISAKRIIFCPRAPYEQHIDRQQAADIGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 556

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRV+ S L  +  PEL+A   + Y+D+A+ L  +
Sbjct: 557 FASRVSESLLRAIDLPELVAPDLRAYEDMAVELAEN 592


>gi|86356319|ref|YP_468211.1| glycosyltransferase domain-containing protein [Rhizobium etli CFN
           42]
 gi|86280421|gb|ABC89484.1| putative glycosyltransferase TPR domain protein [Rhizobium etli CFN
           42]
          Length = 544

 Score =  133 bits (335), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 93/156 (59%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+Q GLPEDA ++ +FN   KI P  +  W  +LK  PNS+LWL+      +AN+    
Sbjct: 324 TREQLGLPEDAFIFASFNGNRKITPEVVNSWCRILKRAPNSVLWLMANSPRNQANLSKHF 383

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+   RI+F   A  E+H+ R Q AD+ +DT   NGHTT+ + LW G PV+T+ G  
Sbjct: 384 QTAGISPKRIIFCPRAPYEDHISRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 443

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRV+ S L  +  PEL+A   + Y+D+A+ L  +
Sbjct: 444 FASRVSESLLKAIDLPELVASDLQAYEDLAVELAQN 479


>gi|428181597|gb|EKX50460.1| hypothetical protein GUITHDRAFT_103694 [Guillardia theta CCMP2712]
          Length = 807

 Score =  133 bits (335), Expect = 4e-29,   Method: Composition-based stats.
 Identities = 67/174 (38%), Positives = 98/174 (56%), Gaps = 1/174 (0%)

Query: 24  QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
           Q N   A+    P+  +   R++  LPE A V+CNFN LYKI+P   ++WV ++  VP+S
Sbjct: 484 QVNDHRASYPIRPEKTISEIRRKERLPEHAFVFCNFNGLYKIEPQVFEIWVRIMHRVPDS 543

Query: 84  ILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
           +LWL+       ANI   A+  G+   RI+ +      EH+ R + AD+ LD    N HT
Sbjct: 544 VLWLVNENNESNANILRQAEVRGISSSRIILAEREDFAEHIFRVRAADMFLDCLTYNAHT 603

Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLG-CPELIARTHKEYQDIAIRLG 196
           T++D LW G PV+TLPG  +  R+A   L++ G   E   RT  EY++IA+ L 
Sbjct: 604 TAVDSLWGGLPVMTLPGWNMNMRLATGALSSFGPASETCVRTESEYEEIAVALA 657


>gi|414168065|ref|ZP_11424269.1| hypothetical protein HMPREF9696_02124 [Afipia clevelandensis ATCC
           49720]
 gi|410888108|gb|EKS35912.1| hypothetical protein HMPREF9696_02124 [Afipia clevelandensis ATCC
           49720]
          Length = 661

 Score =  133 bits (335), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 63/155 (40%), Positives = 93/155 (60%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R + GLP+D  V+C FN  YKI P T   W+ +L+ VP S+LWL++   +  AN++  A
Sbjct: 451 TRTELGLPQDGFVFCCFNNNYKITPQTFAGWMRILERVPGSVLWLIEDNPLASANLRKEA 510

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+D  R++F+     E+H+ R + AD+ LDT   N HTT+ D LW   PV+T  G T
Sbjct: 511 GVHGVDPSRLVFAERVPPEDHLARHRCADLFLDTLPYNAHTTASDALWARLPVLTCTGGT 570

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
            A RVAAS L  +  PEL+  +  +Y+ +A+ L T
Sbjct: 571 FAGRVAASLLLNVALPELVTSSQDDYERLAVELAT 605


>gi|338975698|ref|ZP_08631047.1| TPR domain-containing protein [Bradyrhizobiaceae bacterium SG-6C]
 gi|338231007|gb|EGP06148.1| TPR domain-containing protein [Bradyrhizobiaceae bacterium SG-6C]
          Length = 661

 Score =  133 bits (335), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 63/155 (40%), Positives = 93/155 (60%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R + GLP+D  V+C FN  YKI P T   W+ +L+ VP S+LWL++   +  AN++  A
Sbjct: 451 TRTELGLPQDGFVFCCFNNNYKITPQTFAGWMRILERVPGSVLWLIEDNPLASANLRKEA 510

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+D  R++F+     E+H+ R + AD+ LDT   N HTT+ D LW   PV+T  G T
Sbjct: 511 GVHGVDPSRLVFAERVPPEDHLARHRCADLFLDTLPYNAHTTASDALWARLPVLTCTGGT 570

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
            A RVAAS L  +  PEL+  +  +Y+ +A+ L T
Sbjct: 571 FAGRVAASLLLNVALPELVTSSQDDYERLAVELAT 605


>gi|409051565|gb|EKM61041.1| glycosyltransferase family 41 protein [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 1339

 Score =  133 bits (334), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 67/153 (43%), Positives = 95/153 (62%), Gaps = 5/153 (3%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            LP   +++ NFNQLYKIDP     W+ +L  VPNSILWLL+FPA GE ++   AQ    +
Sbjct: 1088 LPHTVVIFANFNQLYKIDPGVFAAWLRILVRVPNSILWLLRFPAQGEEHLLRQAQMWAGN 1147

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG--ETLA 164
            +   R+ F++VA K  HV RG++AD+ LDT  CN HT + D+LW GTP++T P     + 
Sbjct: 1148 EVASRVKFTDVARKHVHVHRGRVADLFLDTMECNAHTIAGDILWPGTPLITFPKHPHKMC 1207

Query: 165  SRVAASQLATLGC-PELIARTHKEYQDIAIRLG 196
            SRVAAS     G   E++  + + Y++ A+ L 
Sbjct: 1208 SRVAASMANATGFGDEMVVDSLEAYENRAVALA 1240


>gi|344174927|emb|CCA87561.1| conserved hypothethical protein with TPR repeat domain [Ralstonia
           syzygii R24]
          Length = 675

 Score =  133 bits (334), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 67/155 (43%), Positives = 93/155 (60%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   GLP D  V C FNQ YKI  +    W  +L   P S+LWLL+  A  +A +   A+
Sbjct: 471 RAGVGLPADGFVLCCFNQAYKITEARANTWFTILSRTPGSVLWLLEPDASAKAALLERAR 530

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G++ HR++F+   A+ EH+ R QLAD+ LDT     HTT+ D+LW G P++T  G+T+
Sbjct: 531 RHGVESHRLVFAPQVAQREHIARLQLADLALDTFPYTSHTTASDLLWAGVPLLTRMGDTM 590

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           ASRVAAS L   G  +L+  T ++Y D A+RL  D
Sbjct: 591 ASRVAASILQAAGLHDLVVTTEEDYVDAAVRLAGD 625


>gi|299135312|ref|ZP_07028503.1| Tetratricopeptide TPR_2 repeat protein [Afipia sp. 1NLS2]
 gi|298590289|gb|EFI50493.1| Tetratricopeptide TPR_2 repeat protein [Afipia sp. 1NLS2]
          Length = 700

 Score =  132 bits (333), Expect = 5e-29,   Method: Composition-based stats.
 Identities = 75/186 (40%), Positives = 107/186 (57%), Gaps = 8/186 (4%)

Query: 16  LQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVN 75
           L N    N T T+      +P   V   R+ +GLP+   V+C+FN   KI+P T   W+ 
Sbjct: 470 LPNCYQVNDTTTRV-----IPNDPVF--RRDHGLPDAGFVFCSFNNNNKINPGTFTDWMT 522

Query: 76  VLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLD 135
           +L+ VP S+LWLLK   V   N++  A A G++ +R++F+  A    H+ R +LAD+ LD
Sbjct: 523 ILRRVPGSVLWLLKNSDVVAQNLRREASARGINPNRLVFAPRADLPRHLARHRLADLFLD 582

Query: 136 TPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE-LIARTHKEYQDIAIR 194
           T   N HTT+ D L  G PV+TL GE  A RVAAS L T+G  + L+  + +EY + A+ 
Sbjct: 583 TLPYNAHTTTSDALLAGLPVLTLQGEAFAGRVAASILTTVGLQDGLVTFSREEYVERAVA 642

Query: 195 LGTDRD 200
           L TD D
Sbjct: 643 LATDAD 648


>gi|428179024|gb|EKX47897.1| hypothetical protein GUITHDRAFT_69287, partial [Guillardia theta
           CCMP2712]
          Length = 437

 Score =  132 bits (333), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 104/177 (58%), Gaps = 2/177 (1%)

Query: 24  QTNTKTATGEEVPQSIVITSRQQYGLPED-AIVYCNFNQLYKIDPSTLQMWVNVLKAVPN 82
           QTN+      E+     + +R  + LP+   +V+CNF +L +I     Q+W+N+L+ VPN
Sbjct: 220 QTNSFRHLYPEILDQKNLPTRDDHQLPDHPTVVFCNFCRLGRITADLFQIWMNILRRVPN 279

Query: 83  SILWLLKFPAVGEANIQATAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNG 141
           S++WL K P      +Q  A+  G+   R++F    + K EH++R  LAD+CLDT + NG
Sbjct: 280 SVIWLYKHPKAAVPRLQLQAKRSGIPPERLVFGPPCSPKIEHLKRVTLADLCLDTIVYNG 339

Query: 142 HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           HTT+ D+LW G P VT+ G+   S VA     ++G PE+      EY+++A++LG D
Sbjct: 340 HTTASDMLWAGVPFVTVSGDNWPSLVATCIAKSVGMPEMAVSNLAEYEELAVKLGND 396


>gi|222147822|ref|YP_002548779.1| hypothetical protein Avi_1086 [Agrobacterium vitis S4]
 gi|221734810|gb|ACM35773.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 632

 Score =  132 bits (333), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 64/155 (41%), Positives = 91/155 (58%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R  +GLPE  +V+C+FNQ  KI     + W+ +LK+V  S+LWL+    V   N++A A 
Sbjct: 434 RSLHGLPEQGVVFCSFNQAVKIRYQVFKTWMEILKSVDGSVLWLIDMLPVTRDNLRAAAV 493

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
            LG+   R++F+      EH+RR   AD+ LDT  CNGHTT+ D LW G PV+T  G   
Sbjct: 494 RLGVAPERLIFAPKKPLSEHLRRLPYADIALDTGPCNGHTTTADALWAGVPVLTWKGTNF 553

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           A RV+ S L+ +G  EL+A    E+  +A+ L  D
Sbjct: 554 AGRVSESLLSAVGLTELVADDLTEFGRLAVELAQD 588


>gi|393218462|gb|EJD03950.1| TPR-like protein [Fomitiporia mediterranea MF3/22]
          Length = 1383

 Score =  132 bits (333), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 66/151 (43%), Positives = 94/151 (62%), Gaps = 5/151 (3%)

Query: 51   EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
            +D I++ NFNQ+YK+DPST   W+ +LK VPNSILWLL+FP  GE ++  TA+     + 
Sbjct: 1143 DDTIIFANFNQMYKLDPSTYATWLRILKRVPNSILWLLRFPPAGEEHLLRTAEQWASAEI 1202

Query: 110  -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET--LASR 166
              R+ F+ V  K+EH+ R  +AD+ LDT  CN HT + DVLW GTP++T P  T  + SR
Sbjct: 1203 ASRVRFTGVVPKDEHIERVGVADIFLDTVECNAHTIAADVLWGGTPIITWPKYTFKMCSR 1262

Query: 167  VAAS-QLATLGCPELIARTHKEYQDIAIRLG 196
            V AS   AT     ++  +   Y++ A+ L 
Sbjct: 1263 VGASIAFATGFGNRMVVDSLAAYEERAVSLA 1293


>gi|116250489|ref|YP_766327.1| hypothetical protein RL0717 [Rhizobium leguminosarum bv. viciae
           3841]
 gi|115255137|emb|CAK06211.1| putative conserved hypothetical TPR repeat protein [Rhizobium
           leguminosarum bv. viciae 3841]
          Length = 657

 Score =  132 bits (331), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 94/156 (60%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+Q GLPEDA ++ +FN   KI P T+  W  +LK  PNS+LWL+      +AN+    
Sbjct: 437 TREQLGLPEDAFIFASFNGNRKITPETIGSWCRILKRAPNSVLWLMANTPRNQANLLKQF 496

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+   RI+F   A  E+H+ R Q AD+ +DT   NGHTT+ + LW G PV+T+ G  
Sbjct: 497 QGAGISPKRIIFCPRAPYEQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 556

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRV+ S L  +  P+L+A   + Y+D+A+ L  +
Sbjct: 557 FASRVSESLLRAIDLPDLVAPDLQAYEDMAVELAEN 592


>gi|424873732|ref|ZP_18297394.1| hypothetical protein Rleg5DRAFT_5272 [Rhizobium leguminosarum bv.
           viciae WSM1455]
 gi|393169433|gb|EJC69480.1| hypothetical protein Rleg5DRAFT_5272 [Rhizobium leguminosarum bv.
           viciae WSM1455]
          Length = 657

 Score =  132 bits (331), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 94/156 (60%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+Q GLPEDA ++ +FN   KI P T+  W  +LK  PNS+LWL+      +AN+    
Sbjct: 437 TREQLGLPEDAFIFASFNGNRKITPETIGSWCRILKRAPNSVLWLMANTPRNQANLLKQF 496

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+   RI+F   A  E+H+ R Q AD+ +DT   NGHTT+ + LW G PV+T+ G  
Sbjct: 497 QGAGISPKRIIFCPRAPYEQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 556

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRV+ S L  +  P+L+A   + Y+D+A+ L  +
Sbjct: 557 FASRVSESLLRAIDLPDLVAPDLQAYEDMAVELAEN 592


>gi|194335763|ref|YP_002017557.1| hypothetical protein Ppha_0641 [Pelodictyon phaeoclathratiforme
           BU-1]
 gi|194308240|gb|ACF42940.1| TPR repeat-containing protein [Pelodictyon phaeoclathratiforme
           BU-1]
          Length = 1094

 Score =  132 bits (331), Expect = 1e-28,   Method: Composition-based stats.
 Identities = 64/156 (41%), Positives = 92/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R++ GLPE   V+  FN   KI P+T   W+ +L  V  S+LWLL+       N++  A
Sbjct: 607 TRKECGLPEQGFVFACFNNNNKITPATFDGWMRILGQVEGSVLWLLEDNVAAADNLRNEA 666

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+D  R++F+      EH+ R +L D+ LDT  CN HTT+ D LW G P++TL GE+
Sbjct: 667 MQRGVDAARLVFAKRMPLAEHLARQRLGDLFLDTFPCNAHTTASDALWAGLPLLTLLGES 726

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A RVAAS L  +  PELI    +EY+ +AI L ++
Sbjct: 727 FAGRVAASLLNAMQLPELITSKQEEYEALAIELASN 762


>gi|255616588|ref|XP_002539762.1| hypothetical protein RCOM_1987210 [Ricinus communis]
 gi|223502601|gb|EEF22621.1| hypothetical protein RCOM_1987210 [Ricinus communis]
          Length = 268

 Score =  131 bits (330), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 68/157 (43%), Positives = 94/157 (59%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           I +R   GL     V+C FN  YKI P     W+ +L  V  S+LWLL+  A   AN++ 
Sbjct: 57  IFTRGDVGLAPAGFVFCCFNNNYKITPDMFDRWMRILDRVEGSMLWLLEASAESAANLRK 116

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            A A G+   R++F+  A+  +H+ R  LAD+ LD+   N HTT+ D LW G PV+T+ G
Sbjct: 117 EAIARGVAAERLVFAQRASLPDHLARHCLADLFLDSTPYNAHTTASDALWAGLPVLTVLG 176

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           ET ASRVAAS L  +G PELIA + + Y+ +A+ L T
Sbjct: 177 ETFASRVAASLLHAVGLPELIAESPEAYEQMAVDLAT 213


>gi|417097828|ref|ZP_11959414.1| hypothetical protein RHECNPAF_1760042 [Rhizobium etli CNPAF512]
 gi|327193044|gb|EGE59956.1| hypothetical protein RHECNPAF_1760042 [Rhizobium etli CNPAF512]
          Length = 638

 Score =  131 bits (330), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 92/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+Q GLPEDA ++ +FN   KI P  +  W  +LK  PNS+LWL+      +AN+    
Sbjct: 418 TREQLGLPEDAFIFASFNGNRKITPEVVNSWCRILKRAPNSVLWLMANSPRNQANLSKHF 477

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+   RI+F   A  ++H+ R Q AD+ +DT   NGHTT+ + LW G PV+T  G  
Sbjct: 478 QTAGISPKRIIFCPRAPYDQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTFKGTN 537

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRV+ S L  +  PEL+A   + Y+D+A+ L  +
Sbjct: 538 FASRVSESLLQAIDLPELVAGDLQAYEDLAVELAQN 573


>gi|399040605|ref|ZP_10735943.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Rhizobium sp. CF122]
 gi|398061392|gb|EJL53188.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Rhizobium sp. CF122]
          Length = 641

 Score =  131 bits (330), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 66/159 (41%), Positives = 95/159 (59%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           T+R + GLPE A V+ +FN   KI+ + L +W N+LK  PNS+LWL+      + N+   
Sbjct: 419 TTRAKVGLPEGAFVFASFNANRKINTAILDVWCNILKRAPNSVLWLMLSSPRTQTNLLNY 478

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
               GL+  R++F    + EEH+ R Q+AD+ +DT   NGHTT+ + LW G PV+T+ G 
Sbjct: 479 MNKRGLESDRVIFCPRVSYEEHIDRQQVADLGVDTFPVNGHTTTSEQLWGGLPVLTVKGT 538

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             ASRV+ S L  +G PEL+A   K Y+D A+ L    D
Sbjct: 539 NFASRVSESLLNAIGLPELVASDIKAYEDKAVELANAPD 577


>gi|222147876|ref|YP_002548833.1| hypothetical protein Avi_1161 [Agrobacterium vitis S4]
 gi|221734864|gb|ACM35827.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 637

 Score =  131 bits (329), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 65/159 (40%), Positives = 95/159 (59%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           + R  +G+PE A V+ +FN ++KI P T+ +W  VL+A P+S+LW+L   A+   N++A 
Sbjct: 424 SRRADHGVPEHAFVFASFNGVHKITPQTMSLWARVLRAAPDSLLWMLCPDAIARTNLEAA 483

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
             A G+D  RILF+      +HV R  LAD+ LDT  CNGHTT+ D+LW G PV+T  G 
Sbjct: 484 FVAEGIDPARILFAAKQDYGDHVHRLPLADLALDTFPCNGHTTTSDMLWGGLPVLTKRGH 543

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             A RV+ S L  +G  +L+A   + +  +A  L    D
Sbjct: 544 CFAGRVSESLLKAVGLDQLVADDEEAFVSLAAELARQPD 582


>gi|126737924|ref|ZP_01753654.1| TPR repeat protein [Roseobacter sp. SK209-2-6]
 gi|126721317|gb|EBA18021.1| TPR repeat protein [Roseobacter sp. SK209-2-6]
          Length = 603

 Score =  131 bits (329), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 61/164 (37%), Positives = 97/164 (59%)

Query: 37  QSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA 96
           Q  V+ SR + GLPE   ++C+ N   K+ P+   +W+ +L  VP S+LW+L      + 
Sbjct: 383 QPDVVPSRTELGLPEGKFIFCSLNNPNKVTPAEFDVWMKLLCNVPESVLWILAPNEEIKN 442

Query: 97  NIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVV 156
           N+   A A G+   R++F+   + ++H+ R + AD+ LD   CN HTT+ + +W G P+V
Sbjct: 443 NLTREANARGVGGERLIFAERVSMDDHLARMRQADLFLDAFNCNAHTTASEAVWAGVPLV 502

Query: 157 TLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           T  G   A+RVAAS +  +GCP+L+  T  EY D+A +L T+ D
Sbjct: 503 TKAGNQFAARVAASVVTAIGCPDLVTETIDEYYDLAYKLATEAD 546


>gi|407774610|ref|ZP_11121908.1| hypothetical protein TH2_11919 [Thalassospira profundimaris WP0211]
 gi|407282652|gb|EKF08210.1| hypothetical protein TH2_11919 [Thalassospira profundimaris WP0211]
          Length = 734

 Score =  131 bits (329), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 63/158 (39%), Positives = 97/158 (61%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R + GLPED  V+C  N  YKI P+   +W+ V+  V  S+LWL     V + N++A A
Sbjct: 525 TRAELGLPEDGFVFCCLNNNYKIMPTEFAIWMRVMAKVEGSVLWLWCNNDVAKENLRAAA 584

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+   R++F+    + EH+ R + AD+ +DT   N HTT+ D LW G PVVTL G+ 
Sbjct: 585 EKQGISGDRLIFAGYMPQSEHLARLRHADLFIDTFHVNAHTTASDALWAGLPVVTLAGKQ 644

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            A+RVAAS L+ +G  ELI+ T + Y+++ ++L  + D
Sbjct: 645 FAARVAASLLSAVGLRELISETPEAYEELILKLAQNPD 682


>gi|257092967|ref|YP_003166608.1| TPR repeat-containing protein [Candidatus Accumulibacter phosphatis
           clade IIA str. UW-1]
 gi|257045491|gb|ACV34679.1| TPR repeat-containing protein [Candidatus Accumulibacter phosphatis
           clade IIA str. UW-1]
          Length = 608

 Score =  131 bits (329), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 64/157 (40%), Positives = 97/157 (61%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+  GLP+DA V+C+FN  YK  P     W+N+L+ V  S+LWLL   A  EAN++  A
Sbjct: 405 TRESCGLPQDAFVFCSFNNNYKYTPEMFTTWMNILRRVAGSVLWLLADNAWAEANLRQEA 464

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+D  R++F++  A E ++ R ++AD+ LD+   N  TT+ D LW G PV+T  G T
Sbjct: 465 ITRGIDPARLIFASRVAPENYLARYRVADLFLDSFPFNAGTTANDALWMGLPVLTRSGRT 524

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASR+A + L      EL+  T ++Y++ A+ LG+DR
Sbjct: 525 FASRMAGALLTACRLSELVTYTLQDYEEKAVSLGSDR 561


>gi|295688661|ref|YP_003592354.1| hypothetical protein Cseg_1236 [Caulobacter segnis ATCC 21756]
 gi|295430564|gb|ADG09736.1| TPR repeat-containing protein [Caulobacter segnis ATCC 21756]
          Length = 676

 Score =  130 bits (328), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 63/158 (39%), Positives = 92/158 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR   GLP+DA V+C FN + KI P     W+ +LK  P S+LWLL       A ++ +A
Sbjct: 411 SRADAGLPDDAFVFCCFNGVQKITPHMFDRWIQILKRTPGSVLWLLDSNPEANARLRDSA 470

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G+D+ RI+F+       H+ R +LAD+ LDT     HTT+ D LW   PV+T  G +
Sbjct: 471 EAKGVDRTRIVFAPKLQNAYHLARYRLADLFLDTTPYGAHTTASDALWMAVPVLTWSGRS 530

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            ASRV  S + + G PEL+  + + Y + A+ +G DR+
Sbjct: 531 FASRVCGSLVRSAGLPELVVDSGEAYVEKAVEIGADRE 568


>gi|424915500|ref|ZP_18338864.1| hypothetical protein Rleg9DRAFT_3046 [Rhizobium leguminosarum bv.
           trifolii WSM597]
 gi|392851676|gb|EJB04197.1| hypothetical protein Rleg9DRAFT_3046 [Rhizobium leguminosarum bv.
           trifolii WSM597]
          Length = 657

 Score =  130 bits (328), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 63/154 (40%), Positives = 91/154 (59%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+Q GLPE+A ++ +FN   KI P  +  W  +LK  PNS+LWL+      +AN+    
Sbjct: 437 TREQLGLPEEAFIFASFNGNRKITPEVVNSWCRILKRAPNSVLWLMANSPRNQANLLKHF 496

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+   RI+F   A  EEH+ R Q AD+ +DT   NGHTT+ + LW G PV+T  G  
Sbjct: 497 QTAGISPKRIIFCPRAPYEEHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTFKGTN 556

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
            ASRV+ S L  +  P+L+A   + Y+D+A+ L 
Sbjct: 557 FASRVSESLLRAIDLPDLVASDLQAYEDMAVELA 590


>gi|378734069|gb|EHY60528.1| polypeptide N-acetylglucosaminyltransferase [Exophiala dermatitidis
            NIH/UT8656]
          Length = 1641

 Score =  130 bits (327), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 71/163 (43%), Positives = 102/163 (62%), Gaps = 17/163 (10%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L + A++  NFNQLYKIDP+T  M++ +LK VPN+ILWLL+FP +GE  +   A+     
Sbjct: 1400 LSDTAVILGNFNQLYKIDPATFDMYLQILKMVPNAILWLLRFPDLGEQKLLQYARDWASP 1459

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE---TL 163
            +   RI+F++VAAK  H+ R  + D+ LDTP CN HTT+ DV+W+GTP++T  G+    +
Sbjct: 1460 EVASRIVFTDVAAKGTHIMRASVVDLFLDTPECNAHTTAADVIWSGTPILTW-GKWKYKM 1518

Query: 164  ASRVAASQLATLGCPE----------LIARTHKEYQDIAIRLG 196
             SR+A S +A+   PE          L+  + KEY D AI L 
Sbjct: 1519 CSRMAGSIVAS-ALPEGREGDEARRDLLVNSEKEYIDAAIELA 1560


>gi|336383400|gb|EGO24549.1| glycosyltransferase family 41 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 939

 Score =  130 bits (327), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 67/155 (43%), Positives = 96/155 (61%), Gaps = 7/155 (4%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
           LP D +++ NFNQLYKIDP     W+ +L  VP SILWLL+FPA GEA++  TAQ     
Sbjct: 697 LPLDVVIFANFNQLYKIDPVIFATWLRILAQVPRSILWLLRFPAAGEAHLLRTAQMWAGQ 756

Query: 109 Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE----T 162
           +   RI F++VA KE+H+ R ++ D+ LDT  C+ HT + DVLW+GTP++          
Sbjct: 757 EVASRIHFTDVARKEQHIYRCRVVDLFLDTVECSAHTVAADVLWSGTPIIACAWSEHRYK 816

Query: 163 LASRVAASQLATLGCPE-LIARTHKEYQDIAIRLG 196
           + SRVAAS     G  + ++  + +EY++ AI L 
Sbjct: 817 MCSRVAASVANATGFGQHMVVSSREEYEERAISLA 851


>gi|402490310|ref|ZP_10837099.1| hypothetical protein RCCGE510_21284 [Rhizobium sp. CCGE 510]
 gi|401810336|gb|EJT02709.1| hypothetical protein RCCGE510_21284 [Rhizobium sp. CCGE 510]
          Length = 623

 Score =  130 bits (327), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 66/161 (40%), Positives = 95/161 (59%), Gaps = 1/161 (0%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           P+   +T R+Q GLPEDA ++ +FN   KI P T+  W  +LK   NS+LWL+      +
Sbjct: 397 PKPRAVT-REQLGLPEDAFIFASFNGNRKITPETIDSWCRILKRAQNSVLWLMANTPRNQ 455

Query: 96  ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
           AN+    Q  G+   RI+F   A  EEH+ R Q AD+ +DT   NGHTT+ + LW G PV
Sbjct: 456 ANLLKQFQTAGISPKRIIFCPRAPYEEHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPV 515

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
           +T+ G   ASRV+ S L  +  P+L+A   + Y+D+A+ L 
Sbjct: 516 LTVKGTNFASRVSESLLRAIDLPDLVADDLQAYEDMAVELA 556


>gi|315497291|ref|YP_004086095.1| hypothetical protein Astex_0244 [Asticcacaulis excentricus CB 48]
 gi|315415303|gb|ADU11944.1| hypothetical protein Astex_0244 [Asticcacaulis excentricus CB 48]
          Length = 675

 Score =  130 bits (327), Expect = 3e-28,   Method: Composition-based stats.
 Identities = 66/158 (41%), Positives = 89/158 (56%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R + GLP+ A V+C FN  +KI   T   W+ VL  V  S+LWLL         ++A A
Sbjct: 415 TRAEAGLPDKAFVFCCFNGTHKITRRTFDRWLAVLDRVAGSVLWLLSGSNSSHLRLKAYA 474

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G+D  R++F+   A   H+ R  LAD+ LDT     HTT  D LW G PV+TL G +
Sbjct: 475 QARGIDPSRLIFAEKLANPAHLARYPLADLFLDTSPYGAHTTCSDALWMGVPVLTLSGRS 534

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            ASRV  S L   G PEL+    +++ D+A+ LG D D
Sbjct: 535 FASRVCGSLLRAAGLPELVTTRPEDFVDMAVSLGRDSD 572


>gi|209547935|ref|YP_002279852.1| hypothetical protein Rleg2_0327 [Rhizobium leguminosarum bv.
           trifolii WSM2304]
 gi|209533691|gb|ACI53626.1| conserved hypothetical protein [Rhizobium leguminosarum bv.
           trifolii WSM2304]
          Length = 657

 Score =  130 bits (327), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 93/156 (59%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+Q GLPE+A ++ +FN   KI P  +  W  +LK  PNS+LWL+      +AN+    
Sbjct: 437 TREQLGLPEEAFIFASFNGNRKITPEVVNSWCRILKRAPNSVLWLMANSPRNQANLLKHF 496

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+   RI+F   A  E+H+ R Q AD+ +DT   NGHTT+ + LW G PV+T+ G  
Sbjct: 497 QTAGISSKRIIFCPRAPYEQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 556

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRV+ S L  +  PEL+A   + Y+D+A+ L  +
Sbjct: 557 FASRVSESLLRAIDLPELVAGDLQAYEDMAVELAQN 592


>gi|345869369|ref|ZP_08821327.1| Tetratricopeptide TPR_2 repeat-containing protein [Thiorhodococcus
           drewsii AZ1]
 gi|343923292|gb|EGV33984.1| Tetratricopeptide TPR_2 repeat-containing protein [Thiorhodococcus
           drewsii AZ1]
          Length = 844

 Score =  130 bits (326), Expect = 3e-28,   Method: Composition-based stats.
 Identities = 68/156 (43%), Positives = 88/156 (56%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPE   V+C FN  YKI     + W  +L AVP S+LWL     V   N++  A
Sbjct: 630 SRTEAGLPEQGFVFCCFNDPYKITSEIFERWCAILNAVPGSVLWLYAKTRVVADNLRREA 689

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  GLD  R+ F++  ++ EH+ R  LAD+ LDT   N HTT+ D LW G PV+T  GET
Sbjct: 690 ERRGLDSERLYFASKVSQPEHLARLALADLVLDTLPYNAHTTASDALWVGVPVLTCQGET 749

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             SRVAAS L   G PELI      Y+  A+ L  +
Sbjct: 750 FPSRVAASLLRAAGLPELITGDLAGYEARAVALAKE 785


>gi|239815273|ref|YP_002944183.1| hypothetical protein Vapar_2288 [Variovorax paradoxus S110]
 gi|239801850|gb|ACS18917.1| TPR repeat-containing protein [Variovorax paradoxus S110]
          Length = 740

 Score =  130 bits (326), Expect = 4e-28,   Method: Composition-based stats.
 Identities = 65/159 (40%), Positives = 92/159 (57%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           ++R   GLP    V+C FN  YKI P+    W+ +L+AVP+S+LWL +  +     +Q  
Sbjct: 530 SARMAAGLPASGFVFCCFNNTYKITPTVFATWMRILRAVPDSVLWLYETDSRVGPRLQEE 589

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           A   G++  R++F+      EH+ R +LAD+ LDT  CN HTT  D LW G P++T  G 
Sbjct: 590 AVKNGIEPGRLVFAPRQPLGEHLCRHELADLFLDTFPCNAHTTGSDALWAGLPLLTCLGT 649

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           T ASRVAAS L  +G PE++  T   Y   AI L ++ D
Sbjct: 650 TFASRVAASLLHAVGLPEMVMPTLDAYATRAIELASEPD 688


>gi|344338856|ref|ZP_08769787.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiocapsa marina
           5811]
 gi|343801438|gb|EGV19381.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiocapsa marina
           5811]
          Length = 788

 Score =  130 bits (326), Expect = 4e-28,   Method: Composition-based stats.
 Identities = 66/154 (42%), Positives = 89/154 (57%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   GLPE+  V+C FN   KI P   + W  +L AVP S+LWL         N++  A+
Sbjct: 574 RAAAGLPEEGFVFCCFNNPRKITPEVFERWCRLLHAVPGSLLWLFASQDAVIDNLKQEAE 633

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G+D  R++ +    ++EH+ R  LAD+ LDT   N HTT+ D LW G PV+T  G+T 
Sbjct: 634 RRGIDPRRLVLAPRVPQQEHLARLALADLMLDTLPYNAHTTASDALWMGVPVLTCVGDTF 693

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           ASRVAAS L   G PELI  +  +Y+  A+RL T
Sbjct: 694 ASRVAASLLGAAGLPELITSSLDDYEAEALRLAT 727


>gi|375104380|ref|ZP_09750641.1| hypothetical protein BurJ1DRAFT_1023 [Burkholderiales bacterium
           JOSHI_001]
 gi|374665111|gb|EHR69896.1| hypothetical protein BurJ1DRAFT_1023 [Burkholderiales bacterium
           JOSHI_001]
          Length = 464

 Score =  130 bits (326), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 64/147 (43%), Positives = 89/147 (60%), Gaps = 1/147 (0%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           T+RQ  GLPE A+V CNFNQ YK  P    +W  +L+  P S+LWLL      + N++  
Sbjct: 255 TTRQDAGLPEGALVLCNFNQAYKFTPQVFGLWCELLERAPGSVLWLLA-DEQAQGNLRQE 313

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           A A G++ HR++F+    + +H+ R  LAD+ LDT   N HTT+ D LW G PV+T  G+
Sbjct: 314 ALARGINPHRLVFAPPLPQAQHLARLALADLVLDTLPYNAHTTASDALWAGVPVLTCAGD 373

Query: 162 TLASRVAASQLATLGCPELIARTHKEY 188
           T A+RVA S L  +G PEL+     +Y
Sbjct: 374 TFAARVAGSLLHAVGLPELVTHRLDDY 400


>gi|351730791|ref|ZP_08948482.1| tetratricopeptide tpr_1 repeat-containing protein [Acidovorax
           radicis N35]
          Length = 809

 Score =  129 bits (325), Expect = 5e-28,   Method: Composition-based stats.
 Identities = 65/158 (41%), Positives = 96/158 (60%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           + +R++  LP    V+C FN   KI P     W+ +L+AVP+S+LWLL      E +++ 
Sbjct: 600 VFTREEARLPATGRVFCCFNNNQKILPQVFDSWMRILQAVPDSVLWLLADNPAVEGSLRR 659

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            AQA G+   R++F+     ++H+ R +LAD+ LDT   N HTT+ D LW G PV+T  G
Sbjct: 660 EAQARGVAPERLVFAQRLPLDQHLARHRLADLFLDTLPYNAHTTASDALWAGLPVLTQLG 719

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           E+ A+RVAAS L  +G PEL+  +  EY+ +AI L  D
Sbjct: 720 ESFAARVAASLLHAVGLPELVTHSAAEYEALAISLARD 757


>gi|259419160|ref|ZP_05743077.1| TPR repeat-containing protein [Silicibacter sp. TrichCH4B]
 gi|259345382|gb|EEW57236.1| TPR repeat-containing protein [Silicibacter sp. TrichCH4B]
          Length = 616

 Score =  129 bits (325), Expect = 5e-28,   Method: Composition-based stats.
 Identities = 64/158 (40%), Positives = 93/158 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+ + LPE++ V+  FN  YK+ P    +W+N+LK VP+S+LW     A     ++  A
Sbjct: 406 SRESHDLPEESFVFSTFNNPYKVTPREFDIWMNLLKEVPDSVLWYYVSNADIIDRLRKEA 465

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G+D  RI+ +     E H+ R + AD+ LDT   N HTT+ D LW G P+VT  GE 
Sbjct: 466 EARGVDGARIIPTGRMQPEYHLARLKHADLFLDTFNVNAHTTASDALWAGLPLVTKTGEQ 525

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            A+RVA S L   G  +L+  T K+Y D+A+R+  D D
Sbjct: 526 FAARVAGSILTAAGLEDLVTTTEKKYHDVALRIAQDPD 563


>gi|288957552|ref|YP_003447893.1| hypothetical protein AZL_007110 [Azospirillum sp. B510]
 gi|288909860|dbj|BAI71349.1| hypothetical protein AZL_007110 [Azospirillum sp. B510]
          Length = 888

 Score =  129 bits (325), Expect = 5e-28,   Method: Composition-based stats.
 Identities = 67/154 (43%), Positives = 87/154 (56%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR   GLPED  V+C FN  +K+ P+ L  W  +L AVP+S+LWL         N++   
Sbjct: 660 SRAACGLPEDGFVFCCFNSTHKLTPALLDGWARILAAVPDSLLWLYAGNPQAADNLRREG 719

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G D  R++F+      EH+ R +LAD+ LDT   N HTT+ D LW G PV+T  G T
Sbjct: 720 EARGNDPRRLVFAAPLPHAEHLARHRLADLFLDTLPYNAHTTASDALWAGLPVLTRRGTT 779

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
            A RVAAS L   G PELI    + Y+  AI L 
Sbjct: 780 FAGRVAASLLRAAGLPELIVEDQQAYEAAAISLA 813


>gi|424888611|ref|ZP_18312214.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Rhizobium leguminosarum bv. trifolii WSM2012]
 gi|393174160|gb|EJC74204.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Rhizobium leguminosarum bv. trifolii WSM2012]
          Length = 677

 Score =  129 bits (325), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 62/156 (39%), Positives = 93/156 (59%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+Q GLPE+  ++ +FN   KI P T+  W  +LK  PNS+LWL+      +AN+    
Sbjct: 457 TREQLGLPEETFIFASFNGNRKITPETIDSWCRILKRAPNSVLWLMANTPRNQANLLKQF 516

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+   RI+F   A  E+H+ R Q AD+ +DT   NGHTT+ + LW G PV+T+ G  
Sbjct: 517 QTAGISPKRIIFCPRAPYEQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 576

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRV+ S L  +  P+L+A   + Y+D+A+ L  +
Sbjct: 577 FASRVSESLLRAIDLPDLVASDLQAYEDMAVELAEN 612


>gi|300696849|ref|YP_003747510.1| conserved protein of unknown function with TPR repeat domain
           [Ralstonia solanacearum CFBP2957]
 gi|299073573|emb|CBJ53093.1| conserved protein of unknown function with TPR repeat domain
           [Ralstonia solanacearum CFBP2957]
          Length = 672

 Score =  129 bits (325), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 92/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R + GLP D +V+C FNQ YKI  +  Q W  +L   P S+LWLL+  A     +Q  A
Sbjct: 467 NRAEVGLPADGLVFCCFNQAYKITEARAQTWFAILSRTPGSVLWLLEPDASARTALQEEA 526

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+   R++F+   A+  H+ R QLAD+ LDT     HTT+ D+LW G P++T  G+T
Sbjct: 527 GRHGIASERLVFAPQVAQRAHIARLQLADLALDTFPYTSHTTASDLLWAGVPLLTRAGDT 586

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +ASRVAAS L   G  +L+  T  +Y + A+RL  D
Sbjct: 587 MASRVAASILQAAGLHDLVVTTEDDYVNAAVRLAGD 622


>gi|226941622|ref|YP_002796696.1| TPR repeat protein [Laribacter hongkongensis HLHK9]
 gi|226716549|gb|ACO75687.1| TPR repeat protein [Laribacter hongkongensis HLHK9]
          Length = 839

 Score =  129 bits (325), Expect = 5e-28,   Method: Composition-based stats.
 Identities = 62/157 (39%), Positives = 95/157 (60%)

Query: 40  VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ 99
           +I SR++ GLPE+A V+C+FN L K+ P    +W  +L++V  S+LWL+      + NI 
Sbjct: 350 LIPSRKEMGLPENAFVFCSFNTLVKLTPEMFDVWCRLLQSVSGSVLWLIATDKKAQENIV 409

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
             A   G+D  R +F++  A+ +H+ R  LAD+ LD+     HTT+ D LW G P++T+ 
Sbjct: 410 REAMERGVDASRFVFASQVAQSQHLARLTLADLFLDSFPVVAHTTASDSLWAGLPILTMA 469

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
           GE+  SRVA S L T+G  ELI  ++ +Y   A+ L 
Sbjct: 470 GESFVSRVAGSILKTIGVDELIVYSYDQYFQKALFLA 506


>gi|12718407|emb|CAC28786.1| related to UDP-N-ACETYLGLUCOSAMINE--PEPTIDE
            N-ACETYLGLUCOSAMINYLTRANSFERASE [Neurospora crassa]
          Length = 1519

 Score =  129 bits (324), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 71/183 (38%), Positives = 105/183 (57%), Gaps = 36/183 (19%)

Query: 49   LPEDAIVYCNFNQLYK-----------------------IDPSTLQMWVNVLKAVPNSIL 85
            LP+D I+  NFNQLYK                       IDP+T + W+ +L  VP ++L
Sbjct: 1282 LPDDRIILGNFNQLYKVRFLNFFQGIACAAPLTNVLVQQIDPTTFRTWLRILAHVPKAVL 1341

Query: 86   WLLKFPAVGEANIQATAQALGLDQ--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
            WLL+FP +GE N++ TA+    ++   RI+F++VA K++H+ R ++ D+ LDTP CN HT
Sbjct: 1342 WLLRFPELGENNLRRTAKLWAGEEVASRIIFTDVAPKQQHIARARVCDLFLDTPECNAHT 1401

Query: 144  TSMDVLWTGTPVVTLPG--ETLASRVAASQL---------ATLGCPELIARTHKEYQDIA 192
            T+ DVLW+ TP++TLP     + SR+AAS L               +LIA    EY++ A
Sbjct: 1402 TAADVLWSSTPLLTLPRYEYKMCSRMAASILKGALPKSEAGRQAAEDLIAGDDYEYENRA 1461

Query: 193  IRL 195
            I+L
Sbjct: 1462 IKL 1464


>gi|409436081|ref|ZP_11263278.1| putative glycosyltransferase TPR domain protein (SPINDLY-related)
           [Rhizobium mesoamericanum STM3625]
 gi|408752217|emb|CCM74427.1| putative glycosyltransferase TPR domain protein (SPINDLY-related)
           [Rhizobium mesoamericanum STM3625]
          Length = 641

 Score =  129 bits (323), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 65/159 (40%), Positives = 94/159 (59%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           T+R Q GLP  A V+ +FN   KI+ + L +W N+LK  PNS+LWL+      + N+   
Sbjct: 419 TTRAQVGLPVGAFVFASFNANRKINAAILDVWCNILKRAPNSVLWLMLSSPRTQNNLLNY 478

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
               G++  R++F      EEH+ R Q+AD+ +DT   NGHTT+ + LW G PV+T+ G 
Sbjct: 479 MNKKGVESDRVIFCPRVPYEEHIDRQQMADLGIDTFPVNGHTTTSEQLWGGLPVLTVKGT 538

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             ASRV+ S L  +G PEL+A   K Y+D+A+ L    D
Sbjct: 539 NFASRVSESLLNAIGLPELVASDIKVYEDMAVELANTPD 577


>gi|171057695|ref|YP_001790044.1| hypothetical protein Lcho_1008 [Leptothrix cholodnii SP-6]
 gi|170775140|gb|ACB33279.1| TPR repeat-containing protein [Leptothrix cholodnii SP-6]
          Length = 672

 Score =  129 bits (323), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 69/156 (44%), Positives = 90/156 (57%)

Query: 40  VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ 99
           V  SRQ+ GLP+DA+V  +FNQ YK  P     W  +L A P ++LW+L   A  +A ++
Sbjct: 456 VAWSRQRCGLPDDALVLASFNQSYKTTPEVFAAWCRILAAQPRALLWMLVPDADTQARLR 515

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
             A   G+D  R++F+     E H  R   AD+ LDT  C+GHTT+ D LW G PV+TL 
Sbjct: 516 EVAAGHGVDPQRVVFAPFVDIESHRARLPQADLILDTFPCSGHTTTSDALWAGVPVLTLT 575

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
           G   A+RVAAS + TLG  ELI      Y D AI L
Sbjct: 576 GRNFAARVAASLVHTLGLDELICDDLATYVDRAIEL 611


>gi|424898309|ref|ZP_18321883.1| hypothetical protein Rleg4DRAFT_4290 [Rhizobium leguminosarum bv.
           trifolii WSM2297]
 gi|393182536|gb|EJC82575.1| hypothetical protein Rleg4DRAFT_4290 [Rhizobium leguminosarum bv.
           trifolii WSM2297]
          Length = 657

 Score =  129 bits (323), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 62/156 (39%), Positives = 93/156 (59%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+Q GLPE+A ++ +FN   KI P T+  W  +LK  P S+LWL+      +AN+    
Sbjct: 437 TREQLGLPEEAFIFASFNGNRKITPETIDSWCRILKRAPTSVLWLMANTPRNQANLLKQF 496

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+   RI+F   A  E+H+ R Q AD+ +DT   NGHTT+ + LW G PV+T+ G  
Sbjct: 497 QTAGISPKRIIFCPRAPYEQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTVKGTN 556

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRV+ S L  +  P+L+A   + Y+D+A+ L  +
Sbjct: 557 FASRVSESLLRAIDLPDLVASDLQAYEDMAVELAEN 592


>gi|452964763|gb|EME69797.1| SPY protein [Magnetospirillum sp. SO-1]
          Length = 794

 Score =  129 bits (323), Expect = 1e-27,   Method: Composition-based stats.
 Identities = 65/158 (41%), Positives = 92/158 (58%), Gaps = 2/158 (1%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R   GLPED ++ C F+  YKI+P    +W+ VL  VP S+LWLL   A   AN+   A
Sbjct: 589 ARGACGLPEDGVILCAFHNAYKINPPLFDVWMRVLAQVPGSVLWLLDGSA--RANLCREA 646

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G+D  R++F+       ++ R +LAD+ LDT   N H ++ D LW G PVVT  G T
Sbjct: 647 KACGIDPGRLVFAPRVGIGAYLARHRLADLYLDTLPYNAHGSAADALWMGVPVVTCLGRT 706

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
              RVAAS L   G PEL+ RT  +Y+++ + L  D +
Sbjct: 707 FPGRVAASVLKAAGLPELVTRTPAQYEELVLALARDSE 744


>gi|171058438|ref|YP_001790787.1| hypothetical protein Lcho_1755 [Leptothrix cholodnii SP-6]
 gi|170775883|gb|ACB34022.1| Tetratricopeptide TPR_2 repeat protein [Leptothrix cholodnii SP-6]
          Length = 647

 Score =  128 bits (322), Expect = 1e-27,   Method: Composition-based stats.
 Identities = 63/155 (40%), Positives = 92/155 (59%), Gaps = 1/155 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPE  +V+C+FN  YKI P     W+ +L A P S+LWL+   A  + N++A A
Sbjct: 439 SRTECGLPESGLVFCSFNHDYKISPHIFAAWMRILAATPGSVLWLMSRGAASQRNLRAAA 498

Query: 103 QALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           QA G+   R++F+  V   E+H+ R + AD+ LDT   N HTT+ D L  G PV+T  G 
Sbjct: 499 QAQGVAPERLVFAQRVPRVEDHLARYRQADLFLDTHPYNAHTTAADALLAGLPVLTYSGN 558

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
              +RVA S L   G P+L+  +  +Y+ +A+RL 
Sbjct: 559 AFPARVAGSLLHAAGLPDLVTHSLSDYEALAVRLA 593


>gi|398955966|ref|ZP_10676713.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Pseudomonas sp. GM33]
 gi|398150432|gb|EJM39025.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Pseudomonas sp. GM33]
          Length = 611

 Score =  128 bits (322), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 65/160 (40%), Positives = 94/160 (58%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           + +R + GLP++  V+C+FN  YK +      W+ +L+  P+S+LWLL      + N+ A
Sbjct: 412 LPTRAECGLPQERFVFCSFNNNYKFNEEVFDCWMRILQRAPDSVLWLLADNQWAQENLCA 471

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            AQA G+D  R+LF+   A E+++ R  +AD+ LD    N  TT+ D LW G PV+T  G
Sbjct: 472 RAQAHGVDPARLLFAPRVAPEQYLARYSVADLFLDAYPFNAGTTANDALWMGLPVLTRSG 531

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            T ASR+A S L  L  PELI  T  EY++ A+ L T  D
Sbjct: 532 RTFASRMAGSLLTALDLPELITTTLAEYEERAVELATRAD 571


>gi|218512972|ref|ZP_03509812.1| hypothetical protein Retl8_04370 [Rhizobium etli 8C-3]
          Length = 173

 Score =  128 bits (321), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 62/156 (39%), Positives = 91/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+Q GLPEDA ++ +FN   KI    +  W  +LK  PNS+LWL+      +AN+    
Sbjct: 6   TREQLGLPEDAFIFASFNGNRKITHEVVNSWCRILKRAPNSVLWLMANSPRNQANLSKHF 65

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+   RI+F   A  ++H+ R Q AD+ +DT   NGHTT+ + LW G PV+T  G  
Sbjct: 66  QTAGISPKRIIFCPRAPYDQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTFKGTN 125

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRV+ S L  +  PEL+A   + Y+D+A+ L  +
Sbjct: 126 FASRVSESLLQAIDLPELVAGDLQAYEDLAVELAQN 161


>gi|288958280|ref|YP_003448621.1| hypothetical protein AZL_014390 [Azospirillum sp. B510]
 gi|288910588|dbj|BAI72077.1| hypothetical protein AZL_014390 [Azospirillum sp. B510]
          Length = 635

 Score =  128 bits (321), Expect = 1e-27,   Method: Composition-based stats.
 Identities = 64/156 (41%), Positives = 88/156 (56%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           +  R   GLP D +V+C FN  YKI P     W  +L+ VP S+LWLL+ PA    N++ 
Sbjct: 422 VPERADCGLPADGVVFCAFNASYKIGPELFGRWCRILEGVPGSMLWLLEGPAEVALNLRR 481

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            A A G+   R++F+      +H+ R +LAD+ LD+     HTT+ D LW G PV+T+PG
Sbjct: 482 AASARGVAPERLVFAPRLPGPDHLARHRLADLFLDSSPVGAHTTASDALWAGLPVLTVPG 541

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
            T A RVAAS L  +  PEL+      Y+  A RL 
Sbjct: 542 RTFAGRVAASLLRAVDLPELVLPDWDAYEATARRLA 577


>gi|452963367|gb|EME68440.1| O-linked N-acetylglucosamine transferase [Magnetospirillum sp.
           SO-1]
          Length = 673

 Score =  127 bits (320), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 64/157 (40%), Positives = 89/157 (56%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPED  V+C FN   KI     + W+ +L +VP S+LWLL+        +   A
Sbjct: 409 SRSEVGLPEDGAVFCCFNSQQKISRLMFERWMRILHSVPGSVLWLLEANDEVHNRLWEKA 468

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G+ ++R++F       +H+ R  LAD+ LDT     HTT+ D LW   P++TL G +
Sbjct: 469 EACGIGRNRVIFGRRLPSPDHLARMTLADLFLDTFPYGAHTTASDALWMSVPILTLSGRS 528

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASRV  S L + G PEL+  T  EY + AI LG DR
Sbjct: 529 FASRVGGSLLRSAGLPELVCSTPDEYVETAIALGNDR 565


>gi|392377850|ref|YP_004985009.1| conserved protein of unknown function [Azospirillum brasilense
           Sp245]
 gi|356879331|emb|CCD00243.1| conserved protein of unknown function [Azospirillum brasilense
           Sp245]
          Length = 645

 Score =  127 bits (320), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 67/163 (41%), Positives = 91/163 (55%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           P    +  R   GLP D  V+C FN  +KI PS   +W+ VL  VP S+LWL +    G 
Sbjct: 428 PLDAPVPDRAACGLPPDGFVFCAFNAPFKITPSLFGLWMRVLARVPGSVLWLQQPGRDGT 487

Query: 96  ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
            N++  A   G+D  R++F+    + EH+ R +LAD+ LD+    GHTT  D LW G PV
Sbjct: 488 DNLRREAARRGVDPGRLVFAPHRPQAEHLARYRLADLFLDSFPYTGHTTVSDALWMGLPV 547

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           VT  G+T ASRVAA  L   G PE +  +   Y+ +A+RL  D
Sbjct: 548 VTRMGDTFASRVAAGLLNAAGLPETVTTSFDGYEALAVRLAGD 590


>gi|222147877|ref|YP_002548834.1| hypothetical protein Avi_1162 [Agrobacterium vitis S4]
 gi|221734865|gb|ACM35828.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 633

 Score =  127 bits (320), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 61/155 (39%), Positives = 94/155 (60%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  +GLPE+  V+ +FN   KI P ++ +W +++ AVP+S+LW+L   A  +AN     
Sbjct: 416 KRADHGLPEEVFVFASFNSPAKISPQSITLWASIMNAVPDSLLWILCSGAQLQANFAEEF 475

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
             LG+ + RI+F+      +H+ R  LAD+ LDT   NGHTT+ D+LW G PV+T  G +
Sbjct: 476 ARLGIGRDRIVFAQGVDYPDHLSRVGLADLALDTFPYNGHTTTSDLLWGGLPVLTKKGRS 535

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
            A+RV+ S L  +G PEL+AR  +E+ + A+    
Sbjct: 536 FAARVSESLLTAIGLPELVARDGEEFVERAVEFAA 570


>gi|190890370|ref|YP_001976912.1| hypothetical protein RHECIAT_CH0000745 [Rhizobium etli CIAT 652]
 gi|190695649|gb|ACE89734.1| hypothetical conserved protein [Rhizobium etli CIAT 652]
          Length = 638

 Score =  127 bits (320), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 62/154 (40%), Positives = 90/154 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+Q GLPEDA ++ +FN   KI    +  W  +LK  PNS+LWL+      +AN+    
Sbjct: 418 TREQLGLPEDAFIFASFNGNRKITHEVVNSWCRILKRAPNSVLWLMANSPRNQANLSKHF 477

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+   RI+F   A  ++H+ R Q AD+ +DT   NGHTT+ + LW G PV+T  G  
Sbjct: 478 QTAGISPKRIIFCPRAPYDQHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLPVLTFKGTN 537

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
            ASRV+ S L  +  PEL+A   + Y+D+A+ L 
Sbjct: 538 FASRVSESLLQAIDLPELVAGDLQAYEDLAVELA 571


>gi|83312890|ref|YP_423154.1| O-linked N-acetylglucosamine transferase [Magnetospirillum
           magneticum AMB-1]
 gi|82947731|dbj|BAE52595.1| Predicted O-linked N-acetylglucosamine transferase
           [Magnetospirillum magneticum AMB-1]
          Length = 636

 Score =  127 bits (320), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 63/157 (40%), Positives = 91/157 (57%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPED +V+C FN   KI P     W+++L +VP S+LWLL+     +  +   A
Sbjct: 372 SRAEVGLPEDKMVFCCFNSQQKISPLMFDRWMHILHSVPESVLWLLECGEEIKNRLWEHA 431

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+ + R++F       +H+ R  LAD+ LDT     HTT+ D LW   P++TL G +
Sbjct: 432 ERGGIARERVIFGKRLPSPDHLARMTLADLFLDTFPYGAHTTASDALWMSVPILTLSGHS 491

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASRV  S   + G PEL+  T +EY ++AI LG DR
Sbjct: 492 FASRVGGSLSRSAGLPELVCSTPEEYVEMAIALGNDR 528


>gi|374291680|ref|YP_005038715.1| hypothetical protein AZOLI_1155 [Azospirillum lipoferum 4B]
 gi|357423619|emb|CBS86479.1| protein of unknown function; putative TPR domain [Azospirillum
           lipoferum 4B]
          Length = 673

 Score =  127 bits (320), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 64/156 (41%), Positives = 87/156 (55%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           + SR   GLP DA+V+C FN  YKI P     W  +L+ VP S+LWLL+ PA    N++ 
Sbjct: 460 VPSRADCGLPADAVVFCAFNAPYKIGPELFGRWCRILQRVPGSVLWLLEGPAEVAVNLRR 519

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            A   G+   R++F+       H+ R +LAD+ LD+     HTT+ D LW G PV+T+PG
Sbjct: 520 AAAERGIAAERLVFAPRLPGPAHLARHRLADLFLDSSPVGAHTTASDALWAGLPVLTVPG 579

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
            + A RVAAS L  +G PE        Y+  A RL 
Sbjct: 580 HSFAGRVAASLLHAVGLPESAVPDWDAYEATAQRLA 615


>gi|46200779|ref|ZP_00056483.2| COG3914: Predicted O-linked N-acetylglucosamine transferase,
           SPINDLY family [Magnetospirillum magnetotacticum MS-1]
          Length = 673

 Score =  127 bits (320), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 67/174 (38%), Positives = 96/174 (55%), Gaps = 6/174 (3%)

Query: 26  NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
           + K A   E P      SR   GLPEDA+V+C FN   KI     + W+++L +VP S+L
Sbjct: 398 DRKRAVASEPP------SRTDLGLPEDAMVFCCFNSQQKISKMMFERWMHILNSVPGSVL 451

Query: 86  WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
           WLL+     +  +   A+  G+ + R++F    A  +H+ R   AD+ LDT     HTT+
Sbjct: 452 WLLESGEEIQGRLWDHAERCGIARDRLIFGKRLASPDHLARMTQADLFLDTFPYGAHTTA 511

Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            D LW   P++TL G + ASRV  S   + G PEL+  T +EY ++AI LG DR
Sbjct: 512 SDALWMSVPILTLSGRSFASRVGGSLSRSAGLPELVCSTPEEYVEMAIALGNDR 565


>gi|40063717|gb|AAR38498.1| TPR repeat protein [uncultured marine bacterium 583]
          Length = 733

 Score =  127 bits (319), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 63/154 (40%), Positives = 91/154 (59%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R + GLP    ++C FN  YKI PST   W+ +LKAV +S+LWL +       N++  A 
Sbjct: 525 RYELGLPNTGFIFCCFNNSYKITPSTFTGWMRILKAVEDSVLWLFENNNNTAKNLKKEAI 584

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G+++ R++F+     E+H+ R + AD+ +DT   N HTT+ D L  G PV+T  G + 
Sbjct: 585 KFGINEDRLVFAKYMPVEDHLNRIKQADLFIDTLPYNAHTTASDALRMGIPVLTCIGSSF 644

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           ASRVAAS L  +  PELI  T ++Y+  AI L T
Sbjct: 645 ASRVAASLLNAVNLPELITTTQEQYESFAIELAT 678


>gi|40063714|gb|AAR38495.1| TPR repeat protein [uncultured marine bacterium 583]
          Length = 697

 Score =  127 bits (319), Expect = 3e-27,   Method: Composition-based stats.
 Identities = 63/154 (40%), Positives = 91/154 (59%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R + GLP    ++C FN  YKI PST   W+ +LKAV +S+LWL +       N++  A 
Sbjct: 489 RYELGLPNTGFIFCCFNNNYKITPSTFTGWMRILKAVEDSVLWLFENNNNTAKNLKKEAI 548

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G+++ R++F+     E+H+ R + AD+ +DT   N HTT+ D L  G PV+T  G + 
Sbjct: 549 KFGINEDRLVFATHMPVEDHLNRIKQADLFIDTLPYNAHTTASDALRMGIPVLTCIGSSF 608

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           ASRVAAS L  +  PELI  T ++Y+  AI L T
Sbjct: 609 ASRVAASLLNAVNLPELITTTQEQYESFAIELAT 642


>gi|374291195|ref|YP_005038230.1| hypothetical protein AZOLI_0612 [Azospirillum lipoferum 4B]
 gi|357423134|emb|CBS85979.1| conserved protein of unknown function; putative TPR domain
           [Azospirillum lipoferum 4B]
          Length = 877

 Score =  127 bits (318), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 66/156 (42%), Positives = 86/156 (55%), Gaps = 6/156 (3%)

Query: 26  NTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
           + + A  E  P       R + GLP D  V+C FN  YK+ P+    W  +L AVP S+L
Sbjct: 640 DRRRAIAEATPM------RAECGLPADGFVFCCFNSAYKLTPALFDGWARILAAVPGSVL 693

Query: 86  WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
           WL        AN++  A+A G+D  R++F+      EH+ R +LAD+ LDT   N HTT+
Sbjct: 694 WLYAGNPQVAANLRREAEARGVDPRRLVFAKPLPHAEHLARHRLADLFLDTLPYNAHTTA 753

Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELI 181
            D LW G PV+T  G T A RVAAS L   G PELI
Sbjct: 754 SDALWAGLPVLTRCGTTFAGRVAASLLRAAGLPELI 789


>gi|443326054|ref|ZP_21054721.1| TIGR03032 family protein [Xenococcus sp. PCC 7305]
 gi|442794327|gb|ELS03747.1| TIGR03032 family protein [Xenococcus sp. PCC 7305]
          Length = 1299

 Score =  126 bits (317), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 62/157 (39%), Positives = 95/157 (60%), Gaps = 2/157 (1%)

Query: 41   ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
            + +R+++GLP+DA V+C+FN  YKI+P     W+ +L+ VP S+LWL      G  N++A
Sbjct: 1097 VMTREEFGLPDDAFVFCSFNAHYKINPELFDAWLRILEQVPQSVLWLAS--GSGRDNLRA 1154

Query: 101  TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             A+  GL   R++F+     EE++ R  LAD+ LDT + N  +T+  VLW+G P++T PG
Sbjct: 1155 EAKKRGLAADRLIFAEKIPHEEYLARYALADLYLDTLIYNAGSTAAAVLWSGLPMLTCPG 1214

Query: 161  ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
             T ASR+ AS     G    I  + +EY+  A+ L T
Sbjct: 1215 NTNASRMGASICLAGGLETAICNSLEEYEQKAVYLAT 1251


>gi|445499828|ref|ZP_21466683.1| putative UDP-N-acetylglucosamine-peptide
           N-acetylglucosaminyltransferase [Janthinobacterium sp.
           HH01]
 gi|444789823|gb|ELX11371.1| putative UDP-N-acetylglucosamine-peptide
           N-acetylglucosaminyltransferase [Janthinobacterium sp.
           HH01]
          Length = 1067

 Score =  126 bits (316), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 62/157 (39%), Positives = 96/157 (61%), Gaps = 2/157 (1%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLL-KFPAVGEANIQAT 101
           +R    LPEDA V+ +FN  +K  P    +W+N+L+ VPNS+LWL+  +P V E N+   
Sbjct: 417 TRASVNLPEDAFVFVSFNNNFKFTPDLFTVWMNILRRVPNSVLWLVADYPEVRE-NLYRY 475

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           A+  G+ + R++F++ A   E++ R QLAD+ LDT   N  TT+ D LW G P++T  G+
Sbjct: 476 AEEAGIARERLIFNSRAVPAEYLARYQLADLFLDTFPFNAGTTASDALWAGLPLLTCAGQ 535

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           T +SR+A S L  +  P+LI     +Y++ A+ L  D
Sbjct: 536 TFSSRMAGSLLRAVDLPQLITYNFADYEEKAVALAND 572


>gi|421900323|ref|ZP_16330686.1| tetratricopeptide-like reapts harboring protein [Ralstonia
           solanacearum MolK2]
 gi|206591529|emb|CAQ57141.1| tetratricopeptide-like reapts harboring protein [Ralstonia
           solanacearum MolK2]
          Length = 672

 Score =  125 bits (315), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 64/155 (41%), Positives = 90/155 (58%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   GLP D +V+C FNQ YKI  +  Q W  +L   P ++LWLL+  A   A +   A 
Sbjct: 468 RAAVGLPADGLVFCCFNQAYKITEARAQTWFAILGRTPGAVLWLLEPDASARAALLEKAG 527

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G+   R++F+   A+  H+ R QLAD+ LDT     HTT+ D+LW G P++T  G+T+
Sbjct: 528 RHGIASERLVFAPQVAQRAHIARLQLADLALDTFPYTSHTTASDLLWAGVPLLTRAGDTM 587

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           ASRVAAS L   G  +L+  T  +Y + A+RL  D
Sbjct: 588 ASRVAASILQAAGLHDLVVTTEDDYVNAAVRLAGD 622


>gi|344257396|gb|EGW13500.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit [Cricetulus griseus]
          Length = 1444

 Score =  125 bits (314), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 54/75 (72%), Positives = 63/75 (84%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 733 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 792

Query: 61  QLYKIDPSTLQMWVN 75
           QLYKIDPSTLQMW N
Sbjct: 793 QLYKIDPSTLQMWAN 807


>gi|83745815|ref|ZP_00942872.1| Predicted O-linked N-acetylglucosamine transferase [Ralstonia
           solanacearum UW551]
 gi|207739799|ref|YP_002258192.1| tetratricopeptide-like reapts harboring protein [Ralstonia
           solanacearum IPO1609]
 gi|83727505|gb|EAP74626.1| Predicted O-linked N-acetylglucosamine transferase [Ralstonia
           solanacearum UW551]
 gi|206593181|emb|CAQ60087.1| tetratricopeptide-like reapts harboring protein [Ralstonia
           solanacearum IPO1609]
          Length = 672

 Score =  125 bits (314), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 64/155 (41%), Positives = 90/155 (58%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   GLP D +V+C FNQ YKI  +  Q W  +L   P ++LWLL+  A   A +   A 
Sbjct: 468 RAAVGLPADGLVFCCFNQAYKITEARAQTWFAILGRTPGAVLWLLEPDASARAALLEKAG 527

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G+   R++F+   A+  H+ R QLAD+ LDT     HTT+ D+LW G P++T  G+T+
Sbjct: 528 RHGIASERLVFAPQVAQRAHIARLQLADLALDTFPYTSHTTASDLLWAGVPLLTRAGDTM 587

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           ASRVAAS L   G  +L+  T  +Y + A+RL  D
Sbjct: 588 ASRVAASILQAAGLHDLVVTTEDDYVNAAVRLAGD 622


>gi|188992878|ref|YP_001904888.1| hypothetical protein xccb100_3483 [Xanthomonas campestris pv.
           campestris str. B100]
 gi|167734638|emb|CAP52848.1| Conserved hypothetical protein [Xanthomonas campestris pv.
           campestris]
          Length = 568

 Score =  125 bits (314), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 92/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR Q GLPE  +V C FN  YK++P ++   + VL+ VP+S+LWLL  P   +A ++A A
Sbjct: 367 SRTQCGLPEQGVVLCCFNNSYKLNPQSMARMLAVLREVPDSVLWLLSGPGEADARLRAFA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+D  R++F       +++ R + AD+ LDT   N HTT+ D LWTG PV+T PGET
Sbjct: 427 HAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWTGCPVLTTPGET 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDAAFVAKAVALASD 522


>gi|340788763|ref|YP_004754228.1| TPR repeat protein [Collimonas fungivorans Ter331]
 gi|340554030|gb|AEK63405.1| TPR repeat protein [Collimonas fungivorans Ter331]
          Length = 730

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 67/155 (43%), Positives = 89/155 (57%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           RQ  GLP DA V+C+FNQ +K+ PS    W  +L AVP+S+LWL    A   +N++  AQ
Sbjct: 517 RQSEGLPNDAFVFCSFNQCFKLTPSLWDDWCQILNAVPHSVLWLAPMNAAACSNLRREAQ 576

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G+   R++F+      +H  R  LAD+ LDT   N  TT+ D L  G P+VT  G T 
Sbjct: 577 QRGVAPERLVFAARRPLAQHQSRLALADLALDTMPYNSGTTASDALRAGVPLVTAAGSTF 636

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            SR+AAS L  LG   LIA   + Y D+AI L +D
Sbjct: 637 VSRMAASLLHNLGMDGLIAADRRSYVDLAIALASD 671


>gi|21230340|ref|NP_636257.1| hypothetical protein XCC0866 [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66769666|ref|YP_244428.1| hypothetical protein XC_3364 [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|213424033|pdb|2VSY|A Chain A, Xanthomonas Campestris Putative Ogt (Xcc0866),
           Apostructure
 gi|213424034|pdb|2VSY|B Chain B, Xanthomonas Campestris Putative Ogt (Xcc0866),
           Apostructure
 gi|215261005|pdb|2JLB|A Chain A, Xanthomonas Campestris Putative Ogt (Xcc0866), Complex
           With Udp-Glcnac Phosphonate Analogue
 gi|215261006|pdb|2JLB|B Chain B, Xanthomonas Campestris Putative Ogt (Xcc0866), Complex
           With Udp-Glcnac Phosphonate Analogue
 gi|304445671|pdb|2XGM|A Chain A, Substrate And Product Analogues As Human O-Glcnac
           Transferase Inhibitors.
 gi|304445672|pdb|2XGM|B Chain B, Substrate And Product Analogues As Human O-Glcnac
           Transferase Inhibitors.
 gi|304445673|pdb|2XGO|A Chain A, Xcogt In Complex With Udp-S-Glcnac
 gi|304445674|pdb|2XGO|B Chain B, Xcogt In Complex With Udp-S-Glcnac
 gi|304445675|pdb|2XGS|A Chain A, Xcogt In Complex With C-Udp
 gi|304445676|pdb|2XGS|B Chain B, Xcogt In Complex With C-Udp
 gi|21111893|gb|AAM40181.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. ATCC 33913]
 gi|66574998|gb|AAY50408.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. 8004]
          Length = 568

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 92/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR Q GLPE  +V C FN  YK++P ++   + VL+ VP+S+LWLL  P   +A ++A A
Sbjct: 367 SRTQCGLPEQGVVLCCFNNSYKLNPQSMARMLAVLREVPDSVLWLLSGPGEADARLRAFA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+D  R++F       +++ R + AD+ LDT   N HTT+ D LWTG PV+T PGET
Sbjct: 427 HAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWTGCPVLTTPGET 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDAAFVAKAVALASD 522


>gi|190016248|pdb|2VSN|A Chain A, Structure And Topological Arrangement Of An O-Glcnac
           Transferase Homolog: Insight Into Molecular Control Of
           Intracellular Glycosylation
 gi|190016249|pdb|2VSN|B Chain B, Structure And Topological Arrangement Of An O-Glcnac
           Transferase Homolog: Insight Into Molecular Control Of
           Intracellular Glycosylation
          Length = 568

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 92/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR Q GLPE  +V C FN  YK++P ++   + VL+ VP+S+LWLL  P   +A ++A A
Sbjct: 367 SRTQCGLPEQGVVLCCFNNSYKLNPQSMARMLAVLREVPDSVLWLLSGPGEADARLRAFA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+D  R++F       +++ R + AD+ LDT   N HTT+ D LWTG PV+T PGET
Sbjct: 427 HAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWTGCPVLTTPGET 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDAAFVAKAVALASD 522


>gi|399066550|ref|ZP_10748490.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Novosphingobium sp. AP12]
 gi|398027979|gb|EJL21504.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Novosphingobium sp. AP12]
          Length = 741

 Score =  125 bits (314), Expect = 1e-26,   Method: Composition-based stats.
 Identities = 61/137 (44%), Positives = 86/137 (62%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R ++GLP+   V+ +FNQ YKI P+   +W+ +L+ VP S+LWLL   A  +AN++  A+
Sbjct: 534 RTRWGLPDTGFVFASFNQSYKISPAEWDIWMGLLREVPGSVLWLLDTGAQVQANLRREAE 593

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G++  R++FS   A  EH+ R   AD+ LDT   N HTT+ D LW G PV+TL G   
Sbjct: 594 VRGVEPERLVFSKPIAHAEHLGRLIHADLFLDTFAVNAHTTASDALWAGLPVLTLAGRQF 653

Query: 164 ASRVAASQLATLGCPEL 180
           A+RVAAS +   G PEL
Sbjct: 654 AARVAASLVHAAGLPEL 670


>gi|194364498|ref|YP_002027108.1| hypothetical protein Smal_0720 [Stenotrophomonas maltophilia
           R551-3]
 gi|194347302|gb|ACF50425.1| Tetratricopeptide TPR_2 repeat protein [Stenotrophomonas
           maltophilia R551-3]
          Length = 570

 Score =  125 bits (313), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 66/156 (42%), Positives = 90/156 (57%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R   GLPE  +V+C FN  YK++P ++     VL+AVP S+LWLL  P   +A ++A A
Sbjct: 364 TRADCGLPEHGVVFCCFNNSYKLNPRSMGRAFAVLQAVPGSVLWLLSGPGQADARLRAAA 423

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q+ GLD  R++F       +++ R QLAD+ LDT   N HTT+ D LW G PV+T PG T
Sbjct: 424 QSAGLDPMRLVFMPKLPHPQYLARYQLADLFLDTNPYNAHTTASDALWAGCPVLTCPGAT 483

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG   + A     +   A  LG D
Sbjct: 484 FAARVAGSLNHHLGLARMNAADDAAFIATASVLGND 519


>gi|222148903|ref|YP_002549860.1| hypothetical protein Avi_2579 [Agrobacterium vitis S4]
 gi|221735889|gb|ACM36852.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 588

 Score =  125 bits (313), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 65/158 (41%), Positives = 89/158 (56%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           TSR    LPEDA V  +FN + KI P T ++W  +L A+P+SILW+L         + A 
Sbjct: 365 TSRSLLDLPEDAFVLASFNMVRKISPQTARLWARMLDAIPSSILWVLCAGREQRDRLTAF 424

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
               G+++ R+ F+   +   H+ R Q AD+ LDT   NGHTT+ D LW G PV+T  G 
Sbjct: 425 MTGCGIERSRLYFTGAESYAPHIARMQAADLGLDTYPYNGHTTTSDKLWAGLPVITFKGS 484

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
             ASRV+ S L  LG P+L+A T  +   +A  L  DR
Sbjct: 485 NFASRVSESLLTALGVPQLVAETPDDMVRLAADLAQDR 522


>gi|402564953|ref|YP_006614298.1| tpr repeat-containing protein [Burkholderia cepacia GG4]
 gi|402246150|gb|AFQ46604.1| tpr repeat-containing protein [Burkholderia cepacia GG4]
          Length = 604

 Score =  125 bits (313), Expect = 1e-26,   Method: Composition-based stats.
 Identities = 61/155 (39%), Positives = 91/155 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR   GLPED  V+C+FN  YK  P     W+ +L+ VP S+LWLL      EAN++  A
Sbjct: 407 SRASCGLPEDGFVFCSFNNNYKFTPEVFGAWMRILQRVPASVLWLLADNPWAEANLRKEA 466

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+D  R++F++  + E ++ R  +AD+ LD+   N  TT+ D LW G PV+TL G++
Sbjct: 467 ARYGIDGGRLVFASRVSPENYLARFAVADLFLDSFPFNAGTTANDALWMGLPVLTLSGKS 526

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
            ASR+A + L   G  ELI    + Y++ A+ L +
Sbjct: 527 FASRMAGAMLTAAGIEELITYDLQAYEEKAVALAS 561


>gi|386335303|ref|YP_006031473.1| tetratricopeptide-like reapts harboring protein [Ralstonia
           solanacearum Po82]
 gi|334197753|gb|AEG70937.1| tetratricopeptide-like reapts harboring protein [Ralstonia
           solanacearum Po82]
          Length = 672

 Score =  125 bits (313), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 63/155 (40%), Positives = 90/155 (58%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R + GLP D +V+C FNQ YKI  +  Q W  +L   P ++LWLL+  A   A +   A 
Sbjct: 468 RAELGLPADGLVFCCFNQAYKITEARAQTWFAILGRTPGAVLWLLEPDASARAALLEKAG 527

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G+   R++F+    +  H+ R QLAD+ LDT     HTT+ D+LW G P++T  G+T+
Sbjct: 528 RHGIASERLVFAPQVVQRAHIARLQLADLALDTFPYTSHTTASDLLWAGVPLLTRAGDTM 587

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           ASRVAAS L   G  +L+  T  +Y + A+RL  D
Sbjct: 588 ASRVAASILQAAGLHDLVVTTEDDYVNAAVRLAGD 622


>gi|296448170|ref|ZP_06890068.1| conserved hypothetical protein [Methylosinus trichosporium OB3b]
 gi|296254301|gb|EFH01430.1| conserved hypothetical protein [Methylosinus trichosporium OB3b]
          Length = 676

 Score =  124 bits (312), Expect = 2e-26,   Method: Composition-based stats.
 Identities = 63/156 (40%), Positives = 87/156 (55%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SRQ+ GLPE A V+C+ N + K+   T   W+ +L AVP+S+LWLL         I+A A
Sbjct: 417 SRQEAGLPESAFVFCSLNGMQKLTALTFDRWMTILAAVPDSVLWLLTGTQATNERIRAAA 476

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   RI+F+       H+ R  LAD+ LD      HTT+ D LW G P++T PG +
Sbjct: 477 SARGVAPERIVFAQKMPNPRHLARYPLADLFLDNFPYGAHTTAADSLWMGVPILTFPGRS 536

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRV AS +   G  EL+    ++Y   AI LG +
Sbjct: 537 FASRVCASLVRAAGVEELVCADAEDYVARAIELGRN 572


>gi|259415576|ref|ZP_05739497.1| tetratricopeptide TPR_2 repeat protein [Silicibacter sp. TrichCH4B]
 gi|259348806|gb|EEW60568.1| tetratricopeptide TPR_2 repeat protein [Silicibacter sp. TrichCH4B]
          Length = 582

 Score =  124 bits (311), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 90/155 (58%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R ++GLPED  ++ NFN  +K+ PS    W+++L+ VPNS+L          A + + A+
Sbjct: 371 RSKFGLPEDKFIFANFNHPHKVGPSEFATWMDILRQVPNSVLLFYAGKNDLSAELASRAE 430

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A G+D  RIL     +++EH+ R    D+CLD    N HTT+ D LW G P++TL G   
Sbjct: 431 AHGIDAARILPCGSLSQQEHLERIAQVDLCLDCFAYNAHTTASDALWAGVPLLTLCGRQF 490

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           A+RVA S ++  G PEL   + + Y   A+R+ T+
Sbjct: 491 AARVATSIMSAAGVPELSTDSKEAYVAAAVRIATN 525


>gi|91775742|ref|YP_545498.1| TPR repeat-containing protein [Methylobacillus flagellatus KT]
 gi|91709729|gb|ABE49657.1| TPR repeat [Methylobacillus flagellatus KT]
          Length = 700

 Score =  124 bits (311), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 91/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR   G+PEDA VYC FNQ +KI P     W+ +L+A P S+LWLL+  +    N++  A
Sbjct: 502 SRASCGIPEDAFVYCCFNQSFKITPEVFSCWMRLLQATPGSVLWLLEGHSNACNNLRQAA 561

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+   R++F+     ++H+ R   AD+ LDT   N HTT+ D LW G P++T  G+ 
Sbjct: 562 EHHGVAASRLVFAPRVPMDQHLARHAHADLFLDTLPYNAHTTASDALWMGVPLITCSGDA 621

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRVAAS L  +G  EL+      Y+ +A +L  D
Sbjct: 622 FASRVAASLLQAVGLKELVTTDLFAYETLARQLAHD 657


>gi|344206149|ref|YP_004791290.1| hypothetical protein [Stenotrophomonas maltophilia JV3]
 gi|343777511|gb|AEM50064.1| Tetratricopeptide TPR_1 repeat-containing protein [Stenotrophomonas
           maltophilia JV3]
          Length = 568

 Score =  124 bits (311), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 89/156 (57%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPE  +V+C FN  YK++P ++     VL+AVP S+LWLL  P   +  ++  A
Sbjct: 364 SRTECGLPEHGVVFCCFNNSYKLNPRSMGRAFAVLQAVPGSVLWLLSGPGQADTRLRTAA 423

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA GLD  R++F       +++ R QLAD+ LDT   N HTT+ D LW G PV+T PG+T
Sbjct: 424 QAAGLDPARLVFMAKLPHPQYLARYQLADLFLDTHPYNAHTTASDALWAGCPVLTCPGDT 483

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG   +       +   A  LG D
Sbjct: 484 FAARVAGSLNHHLGLARMNVADDAAFIATASALGND 519


>gi|325921456|ref|ZP_08183311.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Xanthomonas gardneri ATCC 19865]
 gi|325548003|gb|EGD19002.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Xanthomonas gardneri ATCC 19865]
          Length = 568

 Score =  124 bits (311), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 91/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR Q GLPE  +V C FN  YK++P ++   + VL+AVP S+LWLL  P   +A ++A A
Sbjct: 367 SRTQCGLPEHGVVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGEADARLRAVA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA  +D  R++F       +++ R + AD+ LDT   N HTT+ D LW G PV+T PGET
Sbjct: 427 QAQDVDAQRLVFMPKLPHAQYLARYRHADLFLDTHPYNAHTTASDALWVGCPVLTTPGET 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLAEMNVADDAAFVAKAVALASD 522


>gi|304312282|ref|YP_003811880.1| hypothetical protein HDN1F_26540 [gamma proteobacterium HdN1]
 gi|301798015|emb|CBL46237.1| Conserved hypothetical protein [gamma proteobacterium HdN1]
          Length = 730

 Score =  124 bits (311), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 58/157 (36%), Positives = 96/157 (61%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R++YGLP    V C+ N  +KI P    +W+N+L+ VPNS+LWLL      + N+   A
Sbjct: 402 TRKEYGLPARKTVLCSLNNNHKITPEVFDVWMNILRRVPNSVLWLLADNEWAKENLIKEA 461

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G+   +++F+  + + +++ R  +AD+ LDT   N  TT+ D LW G PV+T+ G  
Sbjct: 462 KARGIPAKQLVFAERSGQGDYLARYAVADLFLDTFPFNAGTTANDALWMGLPVLTMSGRA 521

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASR+A + L   G PELI    + Y+++A++L +D+
Sbjct: 522 FASRMAGALLTAAGLPELITHDLQGYEELAVQLASDK 558


>gi|299135117|ref|ZP_07028308.1| Tetratricopeptide TPR_2 repeat protein [Afipia sp. 1NLS2]
 gi|298590094|gb|EFI50298.1| Tetratricopeptide TPR_2 repeat protein [Afipia sp. 1NLS2]
          Length = 732

 Score =  124 bits (310), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 61/156 (39%), Positives = 93/156 (59%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+  GLP   +V+C FN  +KI P    +W+ +L  V  S+LWL +  A  +AN+   A
Sbjct: 518 SREDVGLPASGLVFCCFNNTFKITPEVFDIWMRLLGQVEGSVLWLFEGNATAKANLIKEA 577

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+   R++F+ +A    ++ R QLAD+ LD    NGH+T+ D L  G PV+T  G +
Sbjct: 578 VARGISPERLVFAPLADYATYLARTQLADLFLDNNYWNGHSTASDALRCGVPVITCQGTS 637

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRVA+S L  +  PELI  + ++Y+ +A++L  D
Sbjct: 638 FASRVASSLLRAIEMPELITTSLEDYEALALKLACD 673


>gi|99080945|ref|YP_613099.1| hypothetical protein TM1040_1104 [Ruegeria sp. TM1040]
 gi|99037225|gb|ABF63837.1| TPR repeat [Ruegeria sp. TM1040]
          Length = 616

 Score =  124 bits (310), Expect = 3e-26,   Method: Composition-based stats.
 Identities = 61/161 (37%), Positives = 94/161 (58%)

Query: 40  VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ 99
           V  +R  + LPE+  V+ +FN  YK+ P    +W+++LK VP+S+LW     A     ++
Sbjct: 403 VADTRASHDLPEEGFVFSSFNNPYKVTPREFGIWMDLLKEVPDSVLWFYVSKAEIIDRLR 462

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
             A++ G+D  RI+ +     E H+ R + AD+ LDT   N HTT+ D LW G PVVT  
Sbjct: 463 KEAESRGVDGARIIPTGRMQPEYHLARLKHADLFLDTFNVNAHTTASDALWAGLPVVTKT 522

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           GE  A+RVA S L+  G  +L+  + K+Y ++A+R+  D D
Sbjct: 523 GEQFAARVAGSILSAAGLEDLVTHSEKKYYEVALRIAQDPD 563


>gi|410462143|ref|ZP_11315745.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Desulfovibrio magneticus str. Maddingley MBC34]
 gi|409984752|gb|EKO41039.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Desulfovibrio magneticus str. Maddingley MBC34]
          Length = 672

 Score =  124 bits (310), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 61/157 (38%), Positives = 90/157 (57%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPE+ +V+C FN  +KI  +T   W+ +L   P S+LWLL         +   A
Sbjct: 409 SRAEAGLPEEGMVFCCFNGAHKIHRATFDRWLEILARTPGSVLWLLGGDEGVSKRLGDYA 468

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G+ + R++F+   A   H+ R  LAD+ LDT     HTT+ D LW+G PV+T+ G +
Sbjct: 469 EAKGIARERLVFAKKTANAAHLARYPLADLFLDTTPYGAHTTASDALWSGVPVLTVSGRS 528

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASRV  S +   G PEL+  T ++Y + A+  G DR
Sbjct: 529 FASRVCGSLVRAAGLPELVCATTQDYVERAVAYGNDR 565


>gi|114769247|ref|ZP_01446873.1| TPR repeat [Rhodobacterales bacterium HTCC2255]
 gi|114550164|gb|EAU53045.1| TPR repeat [Rhodobacterales bacterium HTCC2255]
          Length = 688

 Score =  123 bits (309), Expect = 3e-26,   Method: Composition-based stats.
 Identities = 61/156 (39%), Positives = 92/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR ++GLPE+ IV+C FN  YKI      +W+ +L     S+LWL K     + NIQ  A
Sbjct: 473 SRSEFGLPENEIVFCCFNNNYKITADEFNIWMRILIKNKKSVLWLKKSNKWSKENIQLAA 532

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G++  RI+F+     E+H+   +LAD+ +DT   N HTT+ + LW G PV+T  G+ 
Sbjct: 533 KERGVNPKRIIFAEKLPIEKHLASYKLADIFIDTFSYNAHTTATEALWAGLPVITKIGKG 592

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S L+ L   ELI  +  EY+++  +L +D
Sbjct: 593 FAARVAGSLLSALDLKELITASEIEYENLIKKLASD 628


>gi|16125359|ref|NP_419923.1| hypothetical protein CC_1107 [Caulobacter crescentus CB15]
 gi|221234101|ref|YP_002516537.1| O-linked GLCNAC transferase [Caulobacter crescentus NA1000]
 gi|13422417|gb|AAK23091.1| TPR domain protein [Caulobacter crescentus CB15]
 gi|220963273|gb|ACL94629.1| O-linked GLCNAC transferase [Caulobacter crescentus NA1000]
          Length = 672

 Score =  123 bits (309), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 60/159 (37%), Positives = 89/159 (55%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           + +R + GLPED  V+C FN   KI P     W+ +LK    S+LWLL       A ++ 
Sbjct: 409 VPTRTEAGLPEDGFVFCCFNGTQKITPHVFDRWMEILKRTSGSVLWLLDSNPETNARLRD 468

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            A+A G+D+ R++F+       H+ R +LAD+ LDT     HTT+ D LW   PV+T  G
Sbjct: 469 AAEARGVDRMRLVFAPKMPNAFHLARYRLADLFLDTTPYGAHTTASDALWMAVPVLTWSG 528

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            + ASRV  S + + G PEL+  +   Y + A+ +G+DR
Sbjct: 529 RSFASRVCGSLVRSAGLPELVVDSAAAYVEKAVEIGSDR 567


>gi|386717186|ref|YP_006183512.1| glycosyl transferases group 1:TPR repeat [Stenotrophomonas
           maltophilia D457]
 gi|384076748|emb|CCH11333.1| Glycosyl transferases group 1:TPR repeat [Stenotrophomonas
           maltophilia D457]
          Length = 568

 Score =  123 bits (309), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 70/175 (40%), Positives = 94/175 (53%), Gaps = 6/175 (3%)

Query: 24  QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
           Q +  T T E  P      SR + GLPE  +V+C FN  YK++P ++     VL+AVP S
Sbjct: 351 QPSDNTRTLEPSP------SRAECGLPEHGVVFCCFNNSYKLNPRSMGRAFAVLQAVPGS 404

Query: 84  ILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
           +LWLL  P   +A ++  AQA GLD  R++F        ++ R  LAD+ LDT   N HT
Sbjct: 405 LLWLLSGPGQADARLRTAAQAAGLDPARLVFMAKLPHPRYLARYPLADLFLDTHPYNAHT 464

Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           T+ D LW G PV+T PG+T A+RVA S    LG   +       +   A  LG D
Sbjct: 465 TASDALWAGCPVLTCPGDTFAARVAGSLNHHLGLARMNVADDAAFIATASALGND 519


>gi|445499833|ref|ZP_21466688.1| putative UDP-N-acetylglucosamine-peptide
           N-acetylglucosaminyltransferase [Janthinobacterium sp.
           HH01]
 gi|444789828|gb|ELX11376.1| putative UDP-N-acetylglucosamine-peptide
           N-acetylglucosaminyltransferase [Janthinobacterium sp.
           HH01]
          Length = 1058

 Score =  123 bits (309), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 60/156 (38%), Positives = 89/156 (57%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R    LPE A V+C+FN  YK  P     W+N+L+ VP S+LWL+    +   N++  A
Sbjct: 413 TRASVKLPEHAFVFCSFNNSYKFKPELFAHWMNILRRVPGSVLWLVADTPLVADNLKRHA 472

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+D  R++F+  A   +++ R QLAD+ LDT   N  TT+ D LW G P++T  G T
Sbjct: 473 AEAGIDSDRLIFAERALPADYLARYQLADLFLDTYPFNAGTTASDALWAGLPLLTCAGPT 532

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASR+A S L  +  P+LI     +Y++ A+ L  D
Sbjct: 533 FASRMAGSLLRAVDLPQLITYNFADYEEQAVALAND 568


>gi|253996225|ref|YP_003048289.1| hypothetical protein Mmol_0852 [Methylotenera mobilis JLW8]
 gi|253982904|gb|ACT47762.1| Tetratricopeptide TPR_2 repeat protein [Methylotenera mobilis JLW8]
          Length = 701

 Score =  123 bits (308), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 62/154 (40%), Positives = 90/154 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +RQ   LP++A V+C+FNQ +KI      +W+ +L  +PNS+LWLL      + N+   A
Sbjct: 504 TRQACNLPDNAFVFCSFNQSFKITADVFSIWLRLLNKLPNSVLWLLDCNTWAKQNLWLAA 563

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+ + RI+F+      +H+ R   AD+ LDT   N HTT  D LW G PV+T  G+T
Sbjct: 564 QRSGIAKERIVFAPRVNIADHLARHIHADLFLDTQPYNAHTTCSDALWMGLPVLTCAGDT 623

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
            +SRVA S L   G P+LI  +  EY+  A++L 
Sbjct: 624 FSSRVAGSLLQAAGLPQLITHSLVEYEARALQLA 657


>gi|398872865|ref|ZP_10628142.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Pseudomonas sp. GM74]
 gi|398201465|gb|EJM88343.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Pseudomonas sp. GM74]
          Length = 611

 Score =  123 bits (308), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 63/160 (39%), Positives = 92/160 (57%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           + +R + GLP++  V+C+FN  YK +      W+ +L+  P+S+LWLL      + N+ A
Sbjct: 412 LPTRAECGLPQERFVFCSFNNNYKFNEEVFDCWMRILQRAPDSVLWLLADNPWAQENLCA 471

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            AQA G++  R+LF+   A  +++ R   AD+ LD    N  TT+ D LW G PV+T  G
Sbjct: 472 RAQAHGVNPARLLFAPRVAPAQYLARYSAADLFLDAYPFNAGTTANDALWMGLPVLTRSG 531

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            T ASR+A S L  L  PELI  T  EY++ A+ L T  D
Sbjct: 532 RTFASRMAGSLLTALDLPELITTTLAEYEERAVELATQAD 571


>gi|16127948|ref|NP_422512.1| hypothetical protein CC_3718 [Caulobacter crescentus CB15]
 gi|221236770|ref|YP_002519207.1| hypothetical protein CCNA_03834 [Caulobacter crescentus NA1000]
 gi|13425486|gb|AAK25680.1| TPR domain protein [Caulobacter crescentus CB15]
 gi|220965943|gb|ACL97299.1| tetratricopeptide repeat family protein [Caulobacter crescentus
           NA1000]
          Length = 596

 Score =  123 bits (308), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 65/160 (40%), Positives = 88/160 (55%)

Query: 39  IVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI 98
           I   SR   GLPE   V+C FN   KI P     W+ +L+A P+S+LWL         N+
Sbjct: 401 IPAPSRADMGLPERVRVFCCFNNPAKITPEVFATWMAILRAAPDSVLWLYAGAPGAADNL 460

Query: 99  QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
           +  A+A G+   R++F+  A  E H+ R  LAD+ LDT     HTT+ D L  G PV+TL
Sbjct: 461 RGHARAAGVAPERLVFAEPAPHEAHLARHVLADLVLDTWPYGAHTTASDALRMGVPVLTL 520

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           PGE+ ASRV AS    +G  ELIA +  +Y   A+ +  D
Sbjct: 521 PGESFASRVGASLATAIGMTELIATSRADYVAKALAMAAD 560


>gi|303246263|ref|ZP_07332543.1| conserved hypothetical protein [Desulfovibrio fructosovorans JJ]
 gi|302492326|gb|EFL52198.1| conserved hypothetical protein [Desulfovibrio fructosovorans JJ]
          Length = 676

 Score =  123 bits (308), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 60/156 (38%), Positives = 87/156 (55%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R + GLPE+ +VYC FN L+K+   T   W+++L  VP S+LWLL      E  ++  A
Sbjct: 408 TRAEAGLPEEGMVYCCFNGLHKVHRFTFDRWLSILAGVPGSVLWLLGSSETVEKRLRDYA 467

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+   RI+F+       H+ R  LAD+ LDT     HTT+ D LW G PV+TL G +
Sbjct: 468 AGRGIAGERIVFAPKVKNPNHLARYPLADLFLDTTPYGAHTTASDALWMGVPVLTLSGRS 527

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASRV  S +   G PE++  T + + + A+  G D
Sbjct: 528 FASRVCGSLVRAAGLPEMVCETPEAFVERAVAFGKD 563


>gi|118594932|ref|ZP_01552279.1| SPY protein [Methylophilales bacterium HTCC2181]
 gi|118440710|gb|EAV47337.1| SPY protein [Methylophilales bacterium HTCC2181]
          Length = 621

 Score =  123 bits (308), Expect = 5e-26,   Method: Composition-based stats.
 Identities = 64/160 (40%), Positives = 93/160 (58%), Gaps = 1/160 (0%)

Query: 33  EEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA 92
           ++ P++  I  +  YG+ +DA V+ +F Q  KI  S  ++W+ +LK    SILWLL+  A
Sbjct: 416 DDRPKTQAI-KKSVYGIDDDAFVFASFGQSIKITESMFRLWIKLLKQKDGSILWLLESNA 474

Query: 93  VGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTG 152
             E NI   A+  G+D  RI F+      +H+ R  + D+ LDT   N HT+S D +W G
Sbjct: 475 QAENNIHKYAKKEGVDSVRIKFAPKVNFNDHINRHSIIDLFLDTYPYNAHTSSSDAIWAG 534

Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIA 192
            PV+TL G+T ASRVA S L  + C  LI +T +EY  +A
Sbjct: 535 CPVLTLSGKTFASRVAGSILKEISCEMLITKTEEEYFKVA 574


>gi|348687110|gb|EGZ26924.1| hypothetical protein PHYSODRAFT_308484 [Phytophthora sojae]
          Length = 1367

 Score =  123 bits (308), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 61/159 (38%), Positives = 98/159 (61%), Gaps = 5/159 (3%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA-- 100
           +R Q+GLP DA+V+CNFN + K++  +  +W+++LK VP S+LWLL    V  + ++   
Sbjct: 528 NRSQHGLPSDAVVFCNFNTINKMESESFSVWMSILKQVPKSVLWLLAPSGVDASRVKELL 587

Query: 101 --TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
              A A G+   R++F+    K  H+ R  +AD+ LD+ + N H+T+ D LW   P+VT+
Sbjct: 588 HDQAMAHGVLPSRVIFAPRVDKFSHLARVTVADLFLDSFIYNAHSTAADALWANVPIVTI 647

Query: 159 PGETLASRVAASQLA-TLGCPELIARTHKEYQDIAIRLG 196
            G+T  SRVAAS +   +  PELI+ + K+Y+  A+ L 
Sbjct: 648 WGDTFPSRVAASLIQNAIPFPELISHSVKDYERTAVYLA 686


>gi|83311603|ref|YP_421867.1| O-linked N-acetylglucosamine transferase [Magnetospirillum
           magneticum AMB-1]
 gi|82946444|dbj|BAE51308.1| Predicted O-linked N-acetylglucosamine transferase
           [Magnetospirillum magneticum AMB-1]
          Length = 658

 Score =  122 bits (307), Expect = 6e-26,   Method: Composition-based stats.
 Identities = 60/160 (37%), Positives = 92/160 (57%)

Query: 40  VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ 99
           V  SR   GLP+D +V C+FN   K+D +T  +W+ +L  +P ++LWLL  P +    ++
Sbjct: 448 VAPSRASLGLPDDRLVLCSFNGHRKLDRATFTLWLEILAELPQAVLWLLSPPDLARQRLE 507

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           A A   G+D  R++++    + +H+ R   AD+ +D  +C  HTT+ D L  G P++T  
Sbjct: 508 AAAAKAGIDSARLIWAPSLPRPDHLARLPAADLFVDALVCGAHTTAADSLRMGVPLITAA 567

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           G  L SRVAAS L  LG P+L   +    + +AI LG DR
Sbjct: 568 GNRLGSRVAASLLHALGLPDLAVESPSALKALAIELGRDR 607


>gi|239908057|ref|YP_002954798.1| hypothetical protein DMR_34210 [Desulfovibrio magneticus RS-1]
 gi|239797923|dbj|BAH76912.1| hypothetical protein [Desulfovibrio magneticus RS-1]
          Length = 672

 Score =  122 bits (307), Expect = 6e-26,   Method: Composition-based stats.
 Identities = 62/157 (39%), Positives = 89/157 (56%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPE A+V+C FN  +KI  +T   W+ +L   P S+LWLL         +   A
Sbjct: 409 SRAEAGLPEGAMVFCCFNGAHKIHRATFDRWLEILARAPGSVLWLLGGDEGVSKRLSDYA 468

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+ + R++F+   A   H+ R  LAD+ LDT     HTT+ D LW+G PV+T+ G +
Sbjct: 469 AAKGVARERLVFAKKTANAAHLARYPLADLFLDTTPYGAHTTASDALWSGVPVLTVSGRS 528

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASRV  S +   G PEL+  T ++Y + A+  G DR
Sbjct: 529 FASRVCGSLVRAAGLPELVCETTQDYVERAVAYGNDR 565


>gi|333981959|ref|YP_004511169.1| hypothetical protein [Methylomonas methanica MC09]
 gi|333806000|gb|AEF98669.1| Tetratricopeptide TPR_2 repeat-containing protein [Methylomonas
           methanica MC09]
          Length = 604

 Score =  122 bits (307), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 60/158 (37%), Positives = 92/158 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+   LP +  V+C+FN  YK  P   + W+N+L+ VP+S LWLL      E N++  A
Sbjct: 407 SRESCNLPAEGFVFCSFNNNYKYTPELFETWMNILRRVPDSTLWLLADNPWSEINLRNHA 466

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+D +R++F+   A E+++ R  +AD+ LD+   N  TT+ D LW G PV+TL G +
Sbjct: 467 KNFGIDPNRLIFAGRVAPEDYLARYAVADLFLDSFPFNAGTTANDALWMGLPVLTLTGRS 526

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            ASR+A + L     PELI     +Y++ A+ L    D
Sbjct: 527 FASRMAGALLTAAELPELITYNLLDYENKAVELAQAPD 564


>gi|356960833|ref|ZP_09063815.1| TPR repeat-containing protein, partial [gamma proteobacterium SCGC
           AAA001-B15]
          Length = 262

 Score =  122 bits (307), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 93/156 (59%), Gaps = 1/156 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R + GLP +  V+C FN  YKI P T   W+ +L  V  S+LWL +  +    N++  A
Sbjct: 52  TRAEIGLPINGFVFCCFNNHYKITPDTFIGWMRILSQVDGSVLWLSEANSTAVNNLKKEA 111

Query: 103 QALGLDQHRILFS-NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           +  G+D+ R++F+  +  KE+H+ R + AD+ +DT   N HTT+ D L  G PV+T  G 
Sbjct: 112 KKNGVDEDRLIFAPRLTFKEDHLNRIKQADLFIDTLPYNAHTTASDALRMGLPVLTCIGS 171

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           + ASRVAAS L  +  PELI  T ++Y+  AI L T
Sbjct: 172 SFASRVAASLLNAVNLPELITTTQEQYESFAIELAT 207


>gi|384426734|ref|YP_005636091.1| hypothetical protein XCR_1056 [Xanthomonas campestris pv. raphani
           756C]
 gi|341935834|gb|AEL05973.1| tetratricopeptide repeat domain protein [Xanthomonas campestris pv.
           raphani 756C]
          Length = 568

 Score =  122 bits (307), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 91/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR Q GLP   +V C FN  YK++P ++   + VL+AVP+S+LWLL  P   +A ++A A
Sbjct: 367 SRTQCGLPAQGVVLCCFNNSYKLNPRSMARMLAVLRAVPDSVLWLLSGPGEADARLRAFA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+D  R++F       +++ R + AD+ LDT   N HTT+ D LW G PV+T PGET
Sbjct: 427 HAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWVGCPVLTTPGET 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVVDDAAFVAKAVALASD 522


>gi|50543546|ref|XP_499939.1| YALI0A10296p [Yarrowia lipolytica]
 gi|49645804|emb|CAG83866.1| YALI0A10296p [Yarrowia lipolytica CLIB122]
          Length = 1330

 Score =  122 bits (307), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 67/158 (42%), Positives = 95/158 (60%), Gaps = 8/158 (5%)

Query: 44   RQQY--GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
            RQQ    +P++A++  +FNQLYKI P TL MW+ +L+  PN+ LWLL+FP  GE+ I+  
Sbjct: 1103 RQQIFPDIPKEAVLLGSFNQLYKITPETLFMWLLILQRQPNAYLWLLQFPPAGESKIREM 1162

Query: 102  AQAL--GLDQ--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
            A     G D    RILF+ V  K   V R ++ D+ LDTP CN HTT+ DV+W GTP++T
Sbjct: 1163 ASKWYQGTDDINKRILFTPVTDKNVFVTRSRVCDLFLDTPECNAHTTAADVIWNGTPILT 1222

Query: 158  LP--GETLASRVAASQLATLGCPELIARTHKEYQDIAI 193
             P     ++SRVA+S +A        A     Y+D+ +
Sbjct: 1223 FPKHDHKMSSRVASSIIAASIPHPPSAEAEAMYRDLVV 1260


>gi|405378056|ref|ZP_11031985.1| hypothetical protein PMI11_01952 [Rhizobium sp. CF142]
 gi|397325406|gb|EJJ29742.1| hypothetical protein PMI11_01952 [Rhizobium sp. CF142]
          Length = 639

 Score =  122 bits (307), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 62/161 (38%), Positives = 93/161 (57%), Gaps = 3/161 (1%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           PQ +   +R+Q GLPE+  ++ +FN   KI    +  W  +LK  P+S+LWL+      +
Sbjct: 414 PQPV---TREQLGLPENTFIFASFNGNRKITSQVVDSWSRILKRAPDSVLWLMANTTRNK 470

Query: 96  ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
            N+    QA G+   RI+F   A  E+H+ R Q AD+ +DT   NGHTT+ + LW G PV
Sbjct: 471 ENLLERFQANGISAKRIIFCPRAPYEQHISRQQAADLGIDTFPVNGHTTTSEQLWGGLPV 530

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
           +T+ G   ASRV+ S L  +  PEL+A   + Y+D+A+ L 
Sbjct: 531 LTVKGTNFASRVSESLLRAIDLPELVADDLQAYEDLAVELA 571


>gi|254522827|ref|ZP_05134882.1| TPR repeat protein [Stenotrophomonas sp. SKA14]
 gi|219720418|gb|EED38943.1| TPR repeat protein [Stenotrophomonas sp. SKA14]
          Length = 568

 Score =  122 bits (306), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 89/156 (57%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R + GLPE  +V+C FN  YK++P ++     VL+AVP S+LWLL  P   +A ++A A
Sbjct: 364 TRAECGLPERGVVFCCFNNSYKLNPRSMGRAFAVLQAVPGSVLWLLSGPGQADARLRAAA 423

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  GLD  R++F       +++ R  LAD+ LDT   N HTT+ D LW G PV+T PG+T
Sbjct: 424 QKAGLDPARLVFMAKLPHPQYLARYPLADLFLDTHPYNAHTTASDALWAGCPVLTCPGDT 483

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG   +       +   A  LG D
Sbjct: 484 FAARVAGSLNHHLGLARMNVADDAAFIATASALGND 519


>gi|428175509|gb|EKX44399.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
            [Guillardia theta CCMP2712]
          Length = 1433

 Score =  122 bits (306), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 63/159 (39%), Positives = 95/159 (59%), Gaps = 3/159 (1%)

Query: 44   RQQYG-LPED-AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV-GEANIQA 100
            R  YG LP D   ++  FNQL+KI+P     W+  ++  P + LWL+K  +   E N++A
Sbjct: 1211 RLAYGSLPPDIPFLFGAFNQLHKIEPDIWDTWMEAMRNAPWAHLWLIKIHSRDAELNLKA 1270

Query: 101  TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             A+ LG+++ R+       +EEH+     AD+ LDTP  N H+T  DVLW+GTP++T+PG
Sbjct: 1271 RARQLGVEEERVHVVQGYNEEEHLYVKAAADLFLDTPSYNAHSTGCDVLWSGTPLLTIPG 1330

Query: 161  ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            + + SRVAAS    +GC E   RT  EY ++A+ L   +
Sbjct: 1331 DKMGSRVAASLNRAIGCEETTVRTEDEYAELAVALAKGK 1369


>gi|424667172|ref|ZP_18104197.1| hypothetical protein A1OC_00730 [Stenotrophomonas maltophilia
           Ab55555]
 gi|401069841|gb|EJP78362.1| hypothetical protein A1OC_00730 [Stenotrophomonas maltophilia
           Ab55555]
          Length = 570

 Score =  122 bits (306), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 88/156 (56%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R   GLP   +V+C FN  YK++P ++     VL+AVP S+LWLL  P   +A ++  A
Sbjct: 364 TRANCGLPAQGVVFCCFNNSYKLNPRSMGRAFAVLQAVPGSVLWLLSGPGQADARLRTAA 423

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA GLD  R++F       +++ R QLAD+ LDT   N HTT+ D LW G PV+T PG+T
Sbjct: 424 QAAGLDPARLVFMAKLPHPQYLARYQLADLFLDTHPYNAHTTASDALWAGCPVLTCPGDT 483

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG   +       +   A  LG D
Sbjct: 484 FAARVAGSLNHHLGMARMNVADDAAFITTASALGND 519


>gi|301123575|ref|XP_002909514.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase 110
           kDa subunit, putative [Phytophthora infestans T30-4]
 gi|262100276|gb|EEY58328.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase 110
           kDa subunit, putative [Phytophthora infestans T30-4]
          Length = 1353

 Score =  122 bits (306), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 60/160 (37%), Positives = 101/160 (63%), Gaps = 7/160 (4%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-----N 97
           +R Q+GLP D +V+CNFN + K++  +  +W+++L+ +PNS+LWLL  P+  +A      
Sbjct: 527 NRSQHGLPTDGLVFCNFNTINKMESESFAVWMSILRQIPNSVLWLLA-PSGPDALRVMEK 585

Query: 98  IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
           +   A A G+   R++F+    K  H+ R  +AD+ LD+ + N H+T+ D LW   P+VT
Sbjct: 586 LHDQAMAHGVLPSRVIFAPRVDKYSHLARITVADIFLDSFVYNAHSTAADALWANVPIVT 645

Query: 158 LPGETLASRVAASQLAT-LGCPELIARTHKEYQDIAIRLG 196
           L G+T  SRVAAS +   +  PEL++++ K+Y+ +A+ L 
Sbjct: 646 LWGDTFPSRVAASLIRNAIPYPELVSQSVKDYERMAVYLA 685


>gi|333902093|ref|YP_004475966.1| hypothetical protein Psefu_3916 [Pseudomonas fulva 12-X]
 gi|333117358|gb|AEF23872.1| Tetratricopeptide repeat-containing protein [Pseudomonas fulva
           12-X]
          Length = 611

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 63/158 (39%), Positives = 89/158 (56%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           + SR + GLPED  V+C FN  YK +      W+ +L AVP+S+ WLL      +AN+ A
Sbjct: 411 LPSRSECGLPEDRFVFCCFNNNYKFNEQMFSCWMRILAAVPDSLFWLLADNQWSQANLIA 470

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            A+  G+   R++F+   A + ++ R  +AD+ LD    NG TT+ D LW G PV+T  G
Sbjct: 471 CAERHGVSADRLVFAPRVAPDLYLARYTVADLFLDAYPFNGGTTANDALWMGLPVLTRSG 530

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            T ASR+A + L  L  PELI     +Y+  AI L  D
Sbjct: 531 RTFASRMAGALLTALELPELITENLADYEQRAIELARD 568


>gi|145590007|ref|YP_001156604.1| hypothetical protein Pnuc_1827 [Polynucleobacter necessarius subsp.
           asymbioticus QLW-P1DMWA-1]
 gi|145048413|gb|ABP35040.1| TPR repeat-containing protein [Polynucleobacter necessarius subsp.
           asymbioticus QLW-P1DMWA-1]
          Length = 761

 Score =  122 bits (305), Expect = 1e-25,   Method: Composition-based stats.
 Identities = 62/158 (39%), Positives = 92/158 (58%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           I SR+  GLP+D  V+  FN  YKI P+T   W+N+LKA P S+L+L       + N+  
Sbjct: 551 IFSRESLGLPKDEFVFACFNNNYKILPATFNSWMNILKATPKSVLYLYADNPWSKDNLMK 610

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            A+A G+   R++F      ++++ R +  D+ LDT   N  TT+ D LW G PV+TL G
Sbjct: 611 EAEARGVKADRLIFGGRIDADQYLARYRACDLFLDTAPYNAGTTASDALWAGLPVLTLIG 670

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           ++  SRVA+S L  +G PEL+  +  EY+  AI L  +
Sbjct: 671 QSFPSRVASSLLNAVGLPELVTSSAAEYEIRAIELAMN 708


>gi|428185342|gb|EKX54195.1| hypothetical protein GUITHDRAFT_84189 [Guillardia theta CCMP2712]
          Length = 367

 Score =  121 bits (304), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 59/150 (39%), Positives = 85/150 (56%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
           LP  + V CNFNQ +K+DP T   W ++L  V N++LW+L+   V E N++   QA G+ 
Sbjct: 138 LPATSFVLCNFNQHFKVDPVTFSSWTSLLGNVSNAVLWMLEGTPVSERNLRRELQAAGVS 197

Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
             R++F+  AA ++H+RR  L D+ LDTP  N   T  D L+ G P++ LP      R+ 
Sbjct: 198 PDRLIFARRAAVKDHLRRAALCDLSLDTPNYNSGATGADTLFAGVPILHLPSTKPIGRMM 257

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +S L  L   ELI R H EY  +A R   +
Sbjct: 258 SSMLRALRLQELIVRDHDEYFKLAHRFARE 287


>gi|73540476|ref|YP_294996.1| hypothetical protein Reut_A0772 [Ralstonia eutropha JMP134]
 gi|72117889|gb|AAZ60152.1| TPR repeat protein [Ralstonia eutropha JMP134]
          Length = 611

 Score =  121 bits (304), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 61/154 (39%), Positives = 88/154 (57%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+   LP D  V+C+FN  YK  P+    W+ +L  VP S+LWLL      EAN++  A
Sbjct: 414 SRESCNLPADGFVFCSFNNNYKFTPAVFGTWMRILSRVPGSVLWLLADNEWAEANLRKEA 473

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           + LG+D  R++F+   A E ++ R   AD+ LDT   N  TT+ D LWTG PV+T  G +
Sbjct: 474 EKLGVDGGRLIFAPRVAPENYLARFAAADLFLDTFPFNAGTTANDALWTGLPVLTRSGRS 533

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
            ASR+A + L      ELI  +  +Y++ A+ L 
Sbjct: 534 FASRMAGALLTAAELDELITYSPNDYEETAVALA 567


>gi|297538669|ref|YP_003674438.1| hypothetical protein M301_1481 [Methylotenera versatilis 301]
 gi|297258016|gb|ADI29861.1| TPR repeat-containing protein [Methylotenera versatilis 301]
          Length = 629

 Score =  121 bits (304), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 62/156 (39%), Positives = 91/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+   LPE A V+C FNQ +KI P   ++W+ +L AVPNS+LWLL+     + N+   A
Sbjct: 421 SRESCNLPEGAFVFCCFNQSFKITPEFFKVWMRLLNAVPNSVLWLLESNTWAKQNLINQA 480

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
               +   R++F+   +  +H+ R   AD+ LDT   N HTT  D LW G PV+T  G+T
Sbjct: 481 AKNNITAERLIFAPRMSIADHLARHIHADLFLDTSPYNAHTTCSDGLWMGLPVLTCVGDT 540

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S L     PELI  + ++Y++ A+ L  +
Sbjct: 541 FAARVAGSLLTAADMPELITYSLQDYENKALYLANN 576


>gi|46202065|ref|ZP_00053827.2| COG3914: Predicted O-linked N-acetylglucosamine transferase,
           SPINDLY family [Magnetospirillum magnetotacticum MS-1]
          Length = 722

 Score =  121 bits (304), Expect = 1e-25,   Method: Composition-based stats.
 Identities = 66/170 (38%), Positives = 90/170 (52%), Gaps = 6/170 (3%)

Query: 30  ATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK 89
           +  E VP+      R  +GLPE   V+C FN   K  P    +W+ +LKAVP+S+LWLL 
Sbjct: 506 SMAESVPK------RSDFGLPEAGFVFCCFNNAGKFTPEVFAVWMRLLKAVPDSVLWLLD 559

Query: 90  FPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVL 149
                + NI A   A G+D+ R++ +       H+ R QLAD+ LDT   N H T+ D L
Sbjct: 560 RNGTVKDNILAQVDAHGVDRGRVVLAPRVPLPLHLARQQLADLFLDTLPYNAHVTASDAL 619

Query: 150 WTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           W G P++T  G   A RVA S L  L   ELI    ++Y+  A+ L  DR
Sbjct: 620 WAGLPLLTCLGHAFAGRVAGSLLKVLELDELITTDLEQYEARALELARDR 669


>gi|410692351|ref|YP_003622972.1| putative UDP-N-ACETYLGLUCOSAMINE--PEPTIDE
           N-ACETYLGLUCOSAMINYLTRANSFERASE SUBUNIT [Thiomonas sp.
           3As]
 gi|294338775|emb|CAZ87109.1| putative UDP-N-ACETYLGLUCOSAMINE--PEPTIDE
           N-ACETYLGLUCOSAMINYLTRANSFERASE SUBUNIT [Thiomonas sp.
           3As]
          Length = 753

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 62/153 (40%), Positives = 89/153 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+Q GLP    V+C FN   KI       W+ +L+  P S+LWL +   + + N++  A
Sbjct: 540 AREQAGLPPQGFVFCCFNNTNKISAEVFARWMRILQRTPGSVLWLYRTHDLVDENLRRAA 599

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            ALG++  R++F+    +E H+ R QLAD+ LDT     HTT+ D L  G PV+T+PG+T
Sbjct: 600 AALGVEPQRLVFAPHLPREWHMARLQLADLFLDTTPYGAHTTTGDALRAGVPVLTVPGQT 659

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
            ASRVAAS L     PE I      Y++ A+RL
Sbjct: 660 FASRVAASMLDAARLPECIQPDWPAYEEEAVRL 692


>gi|296134823|ref|YP_003642065.1| Tetratricopeptide TPR_4 [Thiomonas intermedia K12]
 gi|295794945|gb|ADG29735.1| Tetratricopeptide TPR_4 [Thiomonas intermedia K12]
          Length = 733

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 62/153 (40%), Positives = 89/153 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+Q GLP    V+C FN   KI       W+ +L+  P S+LWL +   + + N++  A
Sbjct: 520 AREQAGLPPQGFVFCCFNNTNKISAEVFARWMRILQRTPGSVLWLYRTHDLVDENLRRAA 579

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            ALG++  R++F+    +E H+ R QLAD+ LDT     HTT+ D L  G PV+T+PG+T
Sbjct: 580 AALGVEPQRLVFAPHLPREWHMARLQLADLFLDTTPYGAHTTTGDALRAGVPVLTVPGQT 639

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
            ASRVAAS L     PE I      Y++ A+RL
Sbjct: 640 FASRVAASMLDAARLPECIQPDWPAYEEEAVRL 672


>gi|171463940|ref|YP_001798053.1| hypothetical protein Pnec_1319 [Polynucleobacter necessarius subsp.
           necessarius STIR1]
 gi|171193478|gb|ACB44439.1| TPR repeat-containing protein [Polynucleobacter necessarius subsp.
           necessarius STIR1]
          Length = 295

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 62/144 (43%), Positives = 82/144 (56%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPE   VYC FN  YKI P     W  +L AV  S+LWL++  A  E N++A  
Sbjct: 144 SRTELGLPEAGFVYCCFNNNYKITPEIFDRWTQILLAVEGSVLWLIQDNAPAEKNLKAEV 203

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+   RI+F+   +  EH+ R ++AD+ LDT   N HTT+ D LW G PV+TL G T
Sbjct: 204 LKRGVAPERIIFAKRLSLPEHLARHKMADLFLDTLPYNAHTTASDSLWAGVPVLTLLGST 263

Query: 163 LASRVAASQLATLGCPELIARTHK 186
              RV+AS L  L   EL+ +  K
Sbjct: 264 FPGRVSASLLNALDLQELLLKHQK 287


>gi|408822221|ref|ZP_11207111.1| hypothetical protein PgenN_03833 [Pseudomonas geniculata N1]
          Length = 568

 Score =  120 bits (302), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 88/156 (56%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR   GLPE  +V+C FN  YK++P ++     +L+AVP S+LWL+  P   +A ++A A
Sbjct: 364 SRADCGLPEQGVVFCCFNNSYKLNPRSMGRAFAILQAVPGSVLWLMSGPGQADARLRAAA 423

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  GLD  R++F       +++ R  LAD+ LDT   N HTT+ D LW G PV+T PG+T
Sbjct: 424 QTAGLDPARLVFMAKLPHPQYLARYLLADLFLDTYPYNAHTTASDALWAGCPVLTCPGDT 483

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG   +       +   A  LG D
Sbjct: 484 FAARVAGSLNHHLGLARMNVADDAGFIATASALGND 519


>gi|347736202|ref|ZP_08868901.1| glycosyl transferase group 1:TPR repeat-containing protein
           [Azospirillum amazonense Y2]
 gi|346920380|gb|EGY01508.1| glycosyl transferase group 1:TPR repeat-containing protein
           [Azospirillum amazonense Y2]
          Length = 619

 Score =  120 bits (302), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 67/173 (38%), Positives = 96/173 (55%), Gaps = 6/173 (3%)

Query: 24  QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
           Q N   A  ++ P      +R+  GLP D  VY  F+  YKI       W+ +L  V NS
Sbjct: 405 QVNDSKALVDQSP------TRESVGLPADKFVYTCFSNTYKITEEVFDAWMRILGGVDNS 458

Query: 84  ILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
           +LWLL        N+Q  A+A G+D  R+LF+  A+  +++ R  LADV LDT   N  T
Sbjct: 459 VLWLLARTPWARQNMQERARARGIDPGRLLFAGPASPAQYLARLPLADVFLDTYPYNSGT 518

Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
           T+ D L  G P+VTL G+T ASR+A S L T+G P+ +  +  +Y  +A++LG
Sbjct: 519 TASDALRMGLPIVTLSGKTFASRMAGSLLRTVGLPQGVTTSVDDYVKLAVKLG 571


>gi|383757550|ref|YP_005436535.1| TPR repeat protein [Rubrivivax gelatinosus IL144]
 gi|381378219|dbj|BAL95036.1| TPR repeat protein [Rubrivivax gelatinosus IL144]
          Length = 670

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 66/160 (41%), Positives = 92/160 (57%), Gaps = 8/160 (5%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           +SR + GLPE A V+  FNQ YKI       W  +L+ VP S+LWLL    V +  IQA 
Sbjct: 456 SSRAEAGLPEHAFVFACFNQSYKITEPVFTRWCRILERVPGSVLWLL----VPQVEIQAA 511

Query: 102 AQAL----GLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
            +A     G++  R++F+      EH+ R   AD+ LDT     HTT  D LW G PV+T
Sbjct: 512 LRARAAERGIEPERLIFAPFVTPSEHLARLPQADLFLDTFPYGAHTTCSDALWMGLPVLT 571

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
             G + ++RVAAS LA +G PEL   + + Y+++A+RL T
Sbjct: 572 QIGRSFSARVAASLLAAVGLPELAVESEEAYEELAVRLAT 611


>gi|381166882|ref|ZP_09876095.1| Predicted O-linked N-acetylglucosamine transferase [Phaeospirillum
           molischianum DSM 120]
 gi|380683934|emb|CCG40907.1| Predicted O-linked N-acetylglucosamine transferase [Phaeospirillum
           molischianum DSM 120]
          Length = 658

 Score =  120 bits (301), Expect = 3e-25,   Method: Composition-based stats.
 Identities = 58/138 (42%), Positives = 86/138 (62%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R + GLP+D +V+C+FN   K+D +T  +W+ VL AVP S+LW L  P +    ++  A 
Sbjct: 452 RSELGLPDDGVVFCSFNGHRKLDRATFSLWMRVLAAVPGSVLWQLAPPPIARTRLEQAAI 511

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A G+   R++++     +EH++R   AD+ LD  +C  HTT+ D L  G P++T  G  L
Sbjct: 512 AAGIAPERLIWAPRLPWDEHLKRLPAADLFLDALVCGAHTTAADALRMGVPLLTCAGPRL 571

Query: 164 ASRVAASQLATLGCPELI 181
           ASRVAAS L ++G PELI
Sbjct: 572 ASRVAASLLESIGLPELI 589


>gi|393765846|ref|ZP_10354406.1| tetratricopeptide tpr 2 repeat protein [Methylobacterium sp. GXF4]
 gi|392728738|gb|EIZ86043.1| tetratricopeptide tpr 2 repeat protein [Methylobacterium sp. GXF4]
          Length = 618

 Score =  120 bits (301), Expect = 3e-25,   Method: Composition-based stats.
 Identities = 61/163 (37%), Positives = 92/163 (56%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           P    + +R   GLPE   V+C+F+  YKI P+   +W+ +L+A+P S+LWLL       
Sbjct: 417 PPRSALPARSTEGLPEAGFVFCSFSNSYKIVPTIFAIWMRLLQAMPGSVLWLLADNEPAR 476

Query: 96  ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
            N++  A   G+D  R++F+   + E ++ R  LAD+ LDT   N  TT+ D LW G P+
Sbjct: 477 GNLRREAGRHGVDPQRLVFAQRVSYEAYLARLPLADLFLDTFPFNAGTTASDALWMGLPL 536

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +T  G   ASR+A S L  LG PEL+  +   Y+  A++L  D
Sbjct: 537 MTCSGRAYASRMAGSLLTALGLPELVTDSLDAYERQALQLARD 579


>gi|119898993|ref|YP_934206.1| hypothetical protein azo2703 [Azoarcus sp. BH72]
 gi|119671406|emb|CAL95319.1| conserved hypothetical protein [Azoarcus sp. BH72]
          Length = 774

 Score =  120 bits (301), Expect = 3e-25,   Method: Composition-based stats.
 Identities = 61/156 (39%), Positives = 91/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLP+DA+V   F+Q YKI  ++  +W+ +L+ +P+++LWLL+      A +   A
Sbjct: 566 SRAEAGLPDDALVLAAFHQHYKITRASFALWMRLLRGLPDALLWLLEGAPSAMARLSQEA 625

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G+D  R+ ++       H+ R  +AD+ LD    N HTT+ D LW G P V   G+T
Sbjct: 626 RAHGVDPARLCWAPRVPVVHHLNRLAVADIALDAFPVNAHTTASDALWAGVPQVARRGDT 685

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             SRV+AS L   G PELIA     Y+ + + LGTD
Sbjct: 686 FVSRVSASILDAAGLPELIAADDASYEALVLALGTD 721


>gi|456738066|gb|EMF62743.1| TPR domain protein, putative component of TonB system
           [Stenotrophomonas maltophilia EPM1]
          Length = 570

 Score =  120 bits (300), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 87/156 (55%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R   GLP   +V+C FN  YK++P ++     VL+AVP S+LWLL  P   +A ++  A
Sbjct: 364 TRADCGLPAQGVVFCCFNNSYKLNPRSMGRAFAVLQAVPGSVLWLLSGPGQADARLRTAA 423

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA GLD  R++F       +++ R  LAD+ LDT   N HTT+ D LW G PV+T PG+T
Sbjct: 424 QAAGLDPARLVFMAKLPHPQYLARYPLADLFLDTHPYNAHTTASDALWAGCPVLTYPGDT 483

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG   +       +   A  LG D
Sbjct: 484 FAARVAGSLNHHLGMARMNVADDAAFITTASALGND 519


>gi|255606103|ref|XP_002538503.1| hypothetical protein RCOM_1858570 [Ricinus communis]
 gi|223511823|gb|EEF23880.1| hypothetical protein RCOM_1858570 [Ricinus communis]
          Length = 182

 Score =  120 bits (300), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 85/134 (63%)

Query: 63  YKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEE 122
           +KI P+   +W+ +L+  PNS+LWLL+   + E N++A AQA G+   R++F+      +
Sbjct: 2   FKILPAVFDVWMRLLRETPNSVLWLLQSNPLAERNLKAEAQARGVAAERLIFAPRLPLAQ 61

Query: 123 HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIA 182
           H+ R Q AD+ LDT   N HTT+ D LWTG P+VT  G+T A RVAAS L  +G PELI 
Sbjct: 62  HLARQQWADLLLDTLPYNAHTTASDALWTGVPMVTCLGDTFAGRVAASLLHAVGLPELIT 121

Query: 183 RTHKEYQDIAIRLG 196
            + ++Y   A+RL 
Sbjct: 122 HSLEDYAAWAVRLA 135


>gi|190572912|ref|YP_001970757.1| TPR domain-containing protein [Stenotrophomonas maltophilia K279a]
 gi|190010834|emb|CAQ44443.1| putative TPR domain protein [Stenotrophomonas maltophilia K279a]
          Length = 570

 Score =  119 bits (299), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 87/156 (55%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R   GLP   +V+C FN  YK++P ++     VL+AVP S+LWLL  P   +A ++  A
Sbjct: 364 TRADCGLPAQGVVFCCFNNSYKLNPRSMGRAFAVLQAVPGSVLWLLSGPGQADARLRTAA 423

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA GLD  R++F       +++ R  LAD+ LDT   N HTT+ D LW G PV+T PG+T
Sbjct: 424 QAAGLDPARLVFMAKLPHPQYLARYPLADLFLDTHPYNAHTTASDALWAGCPVLTCPGDT 483

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG   +       +   A  LG D
Sbjct: 484 FAARVAGSLNHHLGLARMNVADDAAFITTASALGND 519


>gi|23013306|ref|ZP_00053217.1| COG3914: Predicted O-linked N-acetylglucosamine transferase,
           SPINDLY family [Magnetospirillum magnetotacticum MS-1]
          Length = 251

 Score =  119 bits (299), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 57/156 (36%), Positives = 93/156 (59%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+  GLPED +V C+FN   K+D ++  +W+ +L  +P ++LW L  P + +  ++  A
Sbjct: 44  SREALGLPEDRLVLCSFNGHRKLDRASFALWLEILADLPQAVLWQLSPPDMAKRRLEDAA 103

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+D  R++++    + EH+ R   AD+ +D  +C  HTT+ D L  G P+VT+ GE 
Sbjct: 104 AKAGIDPARLIWAPSLPRAEHLARLPAADLFVDALVCGAHTTAADSLRMGVPLVTVAGER 163

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           L SRVAAS L  +G PEL  ++    +D+ + LG +
Sbjct: 164 LGSRVAASILNAVGLPELAVQSAAAMRDLVVALGRN 199


>gi|428180553|gb|EKX49420.1| hypothetical protein GUITHDRAFT_67997 [Guillardia theta CCMP2712]
          Length = 256

 Score =  119 bits (298), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 56/148 (37%), Positives = 88/148 (59%), Gaps = 1/148 (0%)

Query: 51  EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQH 110
           + + ++  FN+  KI P   ++W+ VLKAVP S+LWL +F    + N+   A+ LG+   
Sbjct: 97  QTSFIFACFNRNLKIKPDVFEVWMKVLKAVPRSVLWLRRFHIRVQENLLRHAERLGVSNA 156

Query: 111 RILFS-NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            I+F+  +  + +H+ R +LAD+ LDTP   GH+T  DV W G P + +PGE  +SR+ A
Sbjct: 157 SIIFAGRIPDRSKHLARHRLADLFLDTPRFGGHSTLADVFWAGVPSLVIPGEGASSRIGA 216

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGT 197
           S  + L     IAR   +YQ +AI+L +
Sbjct: 217 SMASALNGSVFIARDLHDYQQLAIQLAS 244


>gi|350561806|ref|ZP_08930644.1| Tetratricopeptide TPR_1 repeat-containing protein [Thioalkalivibrio
           thiocyanoxidans ARh 4]
 gi|349780838|gb|EGZ35156.1| Tetratricopeptide TPR_1 repeat-containing protein [Thioalkalivibrio
           thiocyanoxidans ARh 4]
          Length = 765

 Score =  119 bits (298), Expect = 7e-25,   Method: Composition-based stats.
 Identities = 65/160 (40%), Positives = 92/160 (57%), Gaps = 2/160 (1%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK-FPAVGEANIQA 100
           T+R + GL E   V C F+  YK+ P+    W  +L+  P+++LWLL+  PAV  AN++ 
Sbjct: 558 TTRAEQGLSEAGPVLCGFHNPYKVTPTVFDAWCRLLRDSPDAVLWLLEPVPAVA-ANLRR 616

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            A+  G+D  R++F+      EH+ R  LAD+ LDT   N HTT+ D LW G PV+T  G
Sbjct: 617 EAEQRGVDPGRLVFAPRLRNPEHLARLALADLFLDTLPVNAHTTASDALWVGVPVLTCTG 676

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           ET A RV AS LA  G  EL+  + + Y+  A  L  + D
Sbjct: 677 ETFAGRVGASLLAAAGLSELVTDSLETYEAKAQELLENPD 716


>gi|222149271|ref|YP_002550228.1| hypothetical protein Avi_3091 [Agrobacterium vitis S4]
 gi|221736255|gb|ACM37218.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 680

 Score =  119 bits (297), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 62/157 (39%), Positives = 90/157 (57%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R + GLPEDA V+  FN + KI  S    W+ +L+A P S+LWLL         ++ TA
Sbjct: 419 TRAEVGLPEDAFVFACFNGMQKITQSGFTRWMTILQATPGSVLWLLSGNEEVNQRLRDTA 478

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+D  R+LF+  A   +H+ R  +AD+ LDT     H+T+ D +  G PV+T PG++
Sbjct: 479 TRCGVDPARLLFAAKAGNAQHLARIAVADLFLDTFPYGAHSTAADAINMGLPVLTFPGKS 538

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASR  AS +A+ G PELI ++  +Y   AI    DR
Sbjct: 539 FASRFCASVVASAGAPELICQSPDDYVRRAIAFAQDR 575


>gi|428184992|gb|EKX53846.1| hypothetical protein GUITHDRAFT_160947 [Guillardia theta CCMP2712]
          Length = 1178

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 59/136 (43%), Positives = 86/136 (63%), Gaps = 3/136 (2%)

Query: 65   IDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHV 124
            + P  L+ W  +L  VPNS++WLL+ P V E  +    +  G  + RI+F+  AAKEEHV
Sbjct: 985  LRPRILKRWKRILDRVPNSVIWLLRAPQVAEEKVMQWWREAGDYEERIIFTGTAAKEEHV 1044

Query: 125  RRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL--ASRVAASQLATLGCPELIA 182
            +R  LAD+ LDT +   H+T  D LW G P+++ P   L  ASRV+AS L+T+G  E+I 
Sbjct: 1045 KRAGLADIFLDTDIYGAHSTGTDALWAGLPLLS-PTHLLGMASRVSASLLSTIGLEEMIG 1103

Query: 183  RTHKEYQDIAIRLGTD 198
            RT  EY+++A+ LG +
Sbjct: 1104 RTLDEYENLAVALGRN 1119


>gi|392384038|ref|YP_005033234.1| putative TPR repeat protein [Azospirillum brasilense Sp245]
 gi|356880753|emb|CCD01717.1| putative TPR repeat protein [Azospirillum brasilense Sp245]
          Length = 640

 Score =  118 bits (296), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 66/169 (39%), Positives = 91/169 (53%), Gaps = 4/169 (2%)

Query: 32  GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
           G  VP +     R   GLP D  V   FN  YKI P    +W+ +L  +P+++LWL +  
Sbjct: 422 GHGVPPA---PPRSACGLPADGFVLACFNNAYKITPEIWAVWMALLHKIPDAVLWLARTT 478

Query: 92  AVGEANIQATAQALGLDQHRILFSNVAAK-EEHVRRGQLADVCLDTPLCNGHTTSMDVLW 150
           A  E N+   AQ+ G+   RI+F++ A    +H+ R Q AD+ LDT     HTT+ D LW
Sbjct: 479 AEAEQNLLRAAQSAGIITDRIVFASWAPTLGDHLSRLQNADLMLDTLHYGAHTTASDALW 538

Query: 151 TGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            G P+VT PG TL SRVAAS L T G P+ +  +   YQ   +    +R
Sbjct: 539 VGLPLVTCPGHTLQSRVAASLLHTAGLPDFVTESLAHYQSAVLHWANNR 587


>gi|158334416|ref|YP_001515588.1| hypothetical protein AM1_1237 [Acaryochloris marina MBIC11017]
 gi|158304657|gb|ABW26274.1| TPR domain protein [Acaryochloris marina MBIC11017]
          Length = 1865

 Score =  118 bits (296), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 57/155 (36%), Positives = 94/155 (60%)

Query: 43   SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            +R ++G+PE   V+C FN  YKI P    +W+ +L+ VP+S+LWL++       N++  A
Sbjct: 1652 TRAEFGVPESGFVFCCFNSHYKITPDLFDVWMRILQQVPDSVLWLIEGAETVMGNLRTEA 1711

Query: 103  QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            +  G++  R++F+   A E+++ R  LAD+ LDT + N  +T++   W+G P++T PG T
Sbjct: 1712 KQRGVNPDRLVFAPKIAHEDYLARYGLADLYLDTFVYNAGSTAVATCWSGLPMLTCPGPT 1771

Query: 163  LASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
             ASR+ AS  A  G   LI ++  +Y+  AI L T
Sbjct: 1772 NASRMGASICAAAGLESLICKSVADYEQRAIHLAT 1806



 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 52/164 (31%), Positives = 82/164 (50%), Gaps = 9/164 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA------- 96
           R+   + +DA+V+ +    +K+ P     W  +L AVPNS+L L  F A           
Sbjct: 763 RRDLNIADDAVVFVSGANFFKLIPELRITWTQILAAVPNSVLLLYPFSAWSHHYPVSAFR 822

Query: 97  -NIQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLCNGHTTSMDVLWTGTP 154
            ++ A     G+D  R++  +     E V++  Q+ADV LD     G T+ +D L  G P
Sbjct: 823 NHMAALFAGDGIDPSRLIIQDPIPNREQVKQLLQIADVYLDAYPFTGVTSLLDPLEVGLP 882

Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           VVT  G +L SR   + L ++   +LIA   + Y ++AI+LG D
Sbjct: 883 VVTQAGHSLRSRQGPALLESIDLTDLIATDEQGYVELAIQLGQD 926


>gi|72383070|ref|YP_292425.1| TPR repeat-containing protein [Prochlorococcus marinus str. NATL2A]
 gi|72002920|gb|AAZ58722.1| TPR repeat [Prochlorococcus marinus str. NATL2A]
          Length = 750

 Score =  118 bits (296), Expect = 1e-24,   Method: Composition-based stats.
 Identities = 57/158 (36%), Positives = 89/158 (56%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+ + LP+   ++  FN  YKI      +W+N+L+ +  S+LWL K       N+   A
Sbjct: 555 SRKDFNLPDQGFIFTCFNNNYKITEKEFNIWMNLLRKIEGSVLWLYKANQWSMNNLYKEA 614

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
               +D+ RI+FS      +H+ R  L D+ LDT   NG  TS + LWTG PV+T  G+ 
Sbjct: 615 SKRKVDRDRIIFSEKLPMSKHLARHSLGDLALDTFNYNGGVTSSNALWTGLPVLTKIGQN 674

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             +R++AS L +LG PELI  +  EY+D A+ + ++ +
Sbjct: 675 FTARMSASLLTSLGLPELITYSESEYEDKALYIASNSE 712


>gi|253745103|gb|EET01207.1| O-linked GlcNAc transferase [Giardia intestinalis ATCC 50581]
          Length = 1480

 Score =  118 bits (296), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 62/157 (39%), Positives = 89/157 (56%), Gaps = 2/157 (1%)

Query: 44   RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
            R  YG+P +  ++C FNQ+YK D  TL +   +L++VPN+   LLKFP   + +I+A  +
Sbjct: 1252 RALYGIPSNCFLFCTFNQVYKFDMGTLGIIAALLRSVPNAYYALLKFPPASQQHIEAFFR 1311

Query: 104  ALGLD-QHRILFSNV-AAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
                D   RI+F N+   K EH+RR    DV +DT  CNG T  +D LW+G PVV   GE
Sbjct: 1312 HKAPDILDRIIFLNMLPMKVEHIRRYLAVDVFVDTLKCNGSTIVLDALWSGVPVVGFVGE 1371

Query: 162  TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             + SR   S L+ L C +LI  + ++   +  RL  D
Sbjct: 1372 YIISRKTVSFLSVLECTDLICASQEDAVALCTRLAID 1408


>gi|47848089|dbj|BAD21873.1| putative O-linked N-acetyl glucosamine transferase [Oryza sativa
           Japonica Group]
 gi|47848095|dbj|BAD21878.1| putative O-linked N-acetyl glucosamine transferase [Oryza sativa
           Japonica Group]
          Length = 953

 Score =  118 bits (295), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 51/94 (54%), Positives = 65/94 (69%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            R  YGLPED  ++  FNQLYK+DP     W N+LK VPNS LWLL+FPA GE  ++A A
Sbjct: 783 KRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNILKRVPNSALWLLRFPAAGETRVRAHA 842

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDT 136
            A G+   +I+F++VA K EH+RR  LAD+ LDT
Sbjct: 843 AARGVRPDQIIFTDVAMKNEHIRRSSLADLFLDT 876


>gi|78188708|ref|YP_379046.1| TPR repeat-containing protein [Chlorobium chlorochromatii CaD3]
 gi|78170907|gb|ABB28003.1| TPR repeat [Chlorobium chlorochromatii CaD3]
          Length = 706

 Score =  117 bits (294), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 56/156 (35%), Positives = 94/156 (60%), Gaps = 1/156 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE-ANIQAT 101
           +R ++GLPE  +VYC FN +YK++P    +W+ +L  VPNS+LWL+     G    ++A 
Sbjct: 500 TRAEFGLPEKGMVYCCFNGVYKLEPHVFSLWMEILSKVPNSVLWLIDGEESGSNERLRAV 559

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           AQ  G+   R++F+   + EE++   +LAD+ LDT   N   T++     G P++T  GE
Sbjct: 560 AQEAGIAPERLVFAKKRSHEEYLALYRLADLFLDTLSYNAGATAVGAFSAGLPLLTCQGE 619

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
             A+R+ ++    +G PEL+A T  +Y + A++LG+
Sbjct: 620 HYATRMGSALCYAVGLPELVAPTPADYVEFAVQLGS 655


>gi|188575464|ref|YP_001912393.1| hypothetical protein PXO_04549 [Xanthomonas oryzae pv. oryzae
           PXO99A]
 gi|188519916|gb|ACD57861.1| tetratricopeptide repeat domain protein [Xanthomonas oryzae pv.
           oryzae PXO99A]
          Length = 568

 Score =  117 bits (294), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 87/156 (55%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPE   V C FN  YK++P ++   + VL+AVP  +LWLL  P    A ++A A
Sbjct: 367 SRAECGLPELGAVLCCFNNSYKLNPRSMARMLAVLRAVPGCVLWLLSGPGEANARLRAFA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+D  R++F        ++ R + AD+ LDT   N HTT+ D LW G PV+T PGET
Sbjct: 427 QTQGVDPQRLVFMPKLPHPRYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGET 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   AI L +D
Sbjct: 487 FAARVAGSLNQHLGLDEMNVVDDAAFVAKAIALASD 522


>gi|408391244|gb|EKJ70624.1| hypothetical protein FPSE_09134 [Fusarium pseudograminearum CS3096]
          Length = 1564

 Score =  117 bits (293), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 65/160 (40%), Positives = 94/160 (58%), Gaps = 28/160 (17%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
            L +D I+  NFNQLYK               VP +++WLL+FP +GEAN++ TA+    +
Sbjct: 1353 LADDTIIMGNFNQLYK---------------VPKAVIWLLRFPELGEANLRRTAKEWAGE 1397

Query: 109  Q--HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLA 164
            +   R+LF++VA K +H+ R ++ D+ LDTP CN HTT+ DVLW+ TP++TLP     + 
Sbjct: 1398 EVASRLLFTDVAPKSQHISRARVCDLFLDTPECNAHTTAADVLWSSTPLLTLPRYPYKMC 1457

Query: 165  SRVAASQL---------ATLGCPELIARTHKEYQDIAIRL 195
            SR+AAS L               ELIA   KEY+  A++L
Sbjct: 1458 SRMAASILKGALPKSDEGRQAAAELIAGDEKEYEQRAVKL 1497


>gi|72383071|ref|YP_292426.1| TPR repeat-containing protein [Prochlorococcus marinus str. NATL2A]
 gi|72002921|gb|AAZ58723.1| TPR repeat [Prochlorococcus marinus str. NATL2A]
          Length = 909

 Score =  117 bits (293), Expect = 3e-24,   Method: Composition-based stats.
 Identities = 55/158 (34%), Positives = 90/158 (56%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+ + LP+   ++  FN  YKI      +W+N+L+ V  S+LWL K   +   N+   A
Sbjct: 714 SRKDFNLPDQGFIFTCFNNNYKITKKEFNIWMNLLRKVEGSVLWLYKSNQLSMNNLYKEA 773

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
               +D+ RI+F+      +H+ R  L D+ LDT  CNG  T+ D L  G P++T  G++
Sbjct: 774 SKRKIDRDRIIFAEKLPMSKHLARHSLGDLALDTFNCNGGKTTCDALLAGLPLLTKIGQS 833

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             +R++AS L +LG PELI  +  EY+D A+ + ++ +
Sbjct: 834 FTARMSASLLTSLGLPELITYSESEYEDKALYIASNSE 871


>gi|325914251|ref|ZP_08176601.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Xanthomonas vesicatoria ATCC 35937]
 gi|325539506|gb|EGD11152.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Xanthomonas vesicatoria ATCC 35937]
          Length = 568

 Score =  117 bits (292), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 60/138 (43%), Positives = 83/138 (60%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLP    V C FN  YK++P ++   + VL+AVP S+LWLL  P   +A ++A A
Sbjct: 367 SRAECGLPTQGAVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGQVDARLRAAA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G+D  R++F       +++ R + AD+ LDT   N HTT+ D LW G PV+T PGE 
Sbjct: 427 QAHGVDGQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGEP 486

Query: 163 LASRVAASQLATLGCPEL 180
            A+RVA S    LG  E+
Sbjct: 487 FAARVAGSLNQHLGLEEM 504


>gi|384263553|ref|YP_005418742.1| Predicted O-linked N-acetylglucosamine transferase [Rhodospirillum
           photometricum DSM 122]
 gi|378404656|emb|CCG09772.1| Predicted O-linked N-acetylglucosamine transferase [Rhodospirillum
           photometricum DSM 122]
          Length = 652

 Score =  116 bits (291), Expect = 4e-24,   Method: Composition-based stats.
 Identities = 60/154 (38%), Positives = 92/154 (59%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R   GLPED  +YC F+  YKI     + W+++L+ V NS+LW+L      +A+++  A
Sbjct: 456 TRADVGLPEDRFIYCCFSNTYKITEEIFEAWMSILRRVENSVLWVLARNPWAQASMRRHA 515

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G+D  R++F+      +++ R +LAD+ LDT   N  TT+ D L  G P+VTL G T
Sbjct: 516 EARGVDPARLIFAAPTGPAQYLSRLRLADLFLDTYPYNSGTTASDALRMGLPMVTLAGRT 575

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
            +SR+A S L  +G    +A T  +Y +IA RLG
Sbjct: 576 FSSRMAGSLLHQVGLGACVAETLDDYINIAARLG 609


>gi|289667595|ref|ZP_06488670.1| tetratricopeptide repeat domain protein [Xanthomonas campestris pv.
           musacearum NCPPB 4381]
          Length = 568

 Score =  116 bits (291), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 90/156 (57%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPE  +V C FN  YK++  ++   + VL+AVP+S+LWLL  P   +A ++A A
Sbjct: 367 SRAKCGLPEQGVVLCCFNNSYKLNARSMLRMLAVLRAVPDSVLWLLSGPGQADARLRAFA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G+   R++F       +++ R + A + LDT   N HTT+ D LW G PV+T PGET
Sbjct: 427 QAQGVAPQRLIFMPKLPHPQYLARYRHAHLFLDTHPYNAHTTASDALWAGCPVLTTPGET 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   AI L +D
Sbjct: 487 FAARVAGSLNQHLGLDEMNVGDDAAFVAKAIALASD 522


>gi|384418009|ref|YP_005627369.1| hypothetical protein XOC_0996 [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353460923|gb|AEQ95202.1| tetratricopeptide repeat domain protein [Xanthomonas oryzae pv.
           oryzicola BLS256]
          Length = 568

 Score =  116 bits (291), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 88/156 (56%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPE   V C FN  YK++P ++   + VL+AVP  +LWLL  P   +A ++A A
Sbjct: 367 SRAECGLPELGAVLCCFNNSYKLNPRSMARMLAVLRAVPGCVLWLLSGPGEADARLRAFA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+D  R++F        ++ R + A++ LDT   N HTT+ D LW G PV+T PGET
Sbjct: 427 QTQGVDPQRLVFMPKLPHPRYLARYRHANLFLDTHPYNAHTTASDALWAGCPVLTTPGET 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   AI L +D
Sbjct: 487 FAARVAGSLNQHLGLDEMNVVDDAAFVAKAIALASD 522


>gi|289663474|ref|ZP_06485055.1| tetratricopeptide repeat domain protein [Xanthomonas campestris pv.
           vasculorum NCPPB 702]
          Length = 568

 Score =  116 bits (291), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 90/156 (57%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPE  +V C FN  YK++  ++   + VL+AVP+S+LWLL  P   +A ++A A
Sbjct: 367 SRAKCGLPEQGVVLCCFNNSYKLNARSMLRMLAVLRAVPDSVLWLLSGPGQADARLRAFA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G+   R++F       +++ R + A + LDT   N HTT+ D LW G PV+T PGET
Sbjct: 427 QAQGVAPQRLIFMPKLPHPQYLARYRHAHLFLDTHPYNAHTTASDALWAGCPVLTTPGET 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   AI L +D
Sbjct: 487 FAARVAGSLNQHLGLDEMNVGDDAAFVAKAIALASD 522


>gi|222147824|ref|YP_002548781.1| hypothetical protein Avi_1088 [Agrobacterium vitis S4]
 gi|221734812|gb|ACM35775.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 566

 Score =  116 bits (291), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 61/154 (39%), Positives = 90/154 (58%), Gaps = 6/154 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   GLP+D  V  +FN + K+ P T++ W+ ++  +P+SILW+L    + + N++    
Sbjct: 354 RAALGLPQDRFVLASFNSIKKLSPQTVEAWLKIMADLPDSILWILCRSDIAKDNLRTLFA 413

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDT-PLCNGHTTSMDVLWTGTPVVTLPGET 162
             GL   R++F+   A   H+ R   AD+ LD+ P C GHTT+ D LW G PV+ L G+ 
Sbjct: 414 RHGLSAERLIFTPPLAYPYHLARLSAADLVLDSFPYC-GHTTTSDCLWAGVPVLALKGQN 472

Query: 163 LASRVAASQLATLGCPELIARTHKEY----QDIA 192
            ASRV+ S L  LG PEL+A T  +Y    QD+A
Sbjct: 473 FASRVSESLLTALGVPELVAATVDDYIAQAQDLA 506


>gi|159114158|ref|XP_001707304.1| O-linked GlcNAc transferase [Giardia lamblia ATCC 50803]
 gi|384950704|sp|A8BFN4.1|OGT1_GIAIC RecName: Full=UDP-N-acetylglucosamine--peptide
            N-acetylglucosaminyltransferase; Short=GlOGT; AltName:
            Full=O-linked GlcNAc transferase; AltName: Full=O-linked
            N-acetylglucosaminyltransferase
 gi|157435408|gb|EDO79630.1| O-linked GlcNAc transferase [Giardia lamblia ATCC 50803]
          Length = 1480

 Score =  116 bits (290), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 62/157 (39%), Positives = 88/157 (56%), Gaps = 2/157 (1%)

Query: 44   RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
            R  YG+P +  ++C FNQ+YK D  TL +   +L++VPN+   LLKFP   + +I+A  +
Sbjct: 1252 RALYGIPANCFLFCTFNQVYKFDMGTLGIIAALLRSVPNAYYALLKFPPASQLHIEAFFR 1311

Query: 104  ALGLD-QHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
                D   R++F S +  K EH+RR    DV +DT  CNG T  +D LW+G PVV   GE
Sbjct: 1312 HKAPDILDRVIFLSMLPMKVEHIRRYLAVDVFVDTLKCNGSTIVLDALWSGVPVVGFVGE 1371

Query: 162  TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             + SR   S L+ L C +LI  +  E   +  RL  D
Sbjct: 1372 YILSRKTLSFLSVLECKDLICASQGEAVLLCTRLAVD 1408


>gi|428179062|gb|EKX47935.1| hypothetical protein GUITHDRAFT_54742, partial [Guillardia theta
           CCMP2712]
          Length = 377

 Score =  116 bits (290), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 102/177 (57%), Gaps = 2/177 (1%)

Query: 24  QTNTKTATGEEVPQSIVITSRQQYGLPED-AIVYCNFNQLYKIDPSTLQMWVNVLKAVPN 82
           Q N+     ++V +S  +  R+ + LP+  A+V CNF +L +I       W+ +++ V  
Sbjct: 156 QVNSFNDLYQDVLRSETLPKRRDHFLPDKPAVVLCNFCRLGRISQELFDAWLRIMRRVEG 215

Query: 83  SILWLLKFPAVGEANIQATAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNG 141
           SILWL   P      ++  A+       RI+F+   + K +H++R  LAD+ LDT + NG
Sbjct: 216 SILWLYSHPRAAANRLRDYARRQDFPTDRIIFAPPCSPKLQHMKRLTLADLYLDTLVYNG 275

Query: 142 HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           HTT+ DVLW G P+VT+ G    S VA S   ++G PE++A + +EY+++A+ L ++
Sbjct: 276 HTTASDVLWAGVPIVTVKGSNWPSLVATSLARSVGLPEMVATSLEEYEEMAVDLASN 332


>gi|222148768|ref|YP_002549725.1| hypothetical protein Avi_2410 [Agrobacterium vitis S4]
 gi|221735754|gb|ACM36717.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 624

 Score =  116 bits (290), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 58/151 (38%), Positives = 86/151 (56%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           +SR   GLP D +V   FN   K  P TL++W  +LKA P ++LW++        +  A 
Sbjct: 402 SSRMALGLPADRVVIGAFNSQSKNTPETLRLWARILKANPTALLWMMVDGHSARQSTAAF 461

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
            ++LG+ Q ++LF+     E H+ R Q AD  +D+   NGHTT+ D+LW G PV+T  G 
Sbjct: 462 FKSLGVKQSQLLFAPKMLYENHIARAQAADFAIDSFPYNGHTTTSDMLWAGLPVITKRGT 521

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIA 192
             ASRV+ S L  +G  EL+AR    + ++A
Sbjct: 522 NFASRVSESLLKAIGLDELVARDEDNFVELA 552


>gi|58583232|ref|YP_202248.1| hypothetical protein XOO3609 [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84625068|ref|YP_452440.1| hypothetical protein XOO_3411 [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|58427826|gb|AAW76863.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84369008|dbj|BAE70166.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
           311018]
          Length = 568

 Score =  115 bits (289), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 87/156 (55%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPE   V C FN  YK++P ++   + VL+AVP  +LWLL  P    A ++A A
Sbjct: 367 SRAECGLPELGAVLCCFNNSYKLNPRSMARMLAVLRAVPGCVLWLLSGPGEANARLRAFA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G++  R++F        ++ R + AD+ LDT   N HTT+ D LW G PV+T PGET
Sbjct: 427 QTQGVEPQRLVFMPKLPHPRYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGET 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   AI L +D
Sbjct: 487 FAARVAGSLNQHLGLDEMNVVDDAAFVAKAIALASD 522


>gi|326386010|ref|ZP_08207634.1| TPR repeat-containing protein [Novosphingobium nitrogenifigens DSM
           19370]
 gi|326209235|gb|EGD60028.1| TPR repeat-containing protein [Novosphingobium nitrogenifigens DSM
           19370]
          Length = 882

 Score =  115 bits (288), Expect = 1e-23,   Method: Composition-based stats.
 Identities = 61/156 (39%), Positives = 85/156 (54%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR  +GLPE A V+C FN+ YKI       W+ +L AV NS+LWL+      E  ++  A
Sbjct: 671 SRADWGLPEGAFVFCCFNKGYKIARDDFAGWMRILAAVENSVLWLISLSPETEQALRKAA 730

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+D  R++F+  A  + H+ R   AD+ LDT   N HTT+ D LW G PVVT  G  
Sbjct: 731 ADAGVDPARLIFAPRAPHDVHLARHAHADLGLDTLRYNAHTTASDALWAGLPVVTRAGRC 790

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RV AS    +G  +L+      ++ +AI L  D
Sbjct: 791 FAARVGASLNHAVGLSDLVTHDAAGFESLAIALARD 826


>gi|40063060|gb|AAR37916.1| TPR domain protein [uncultured marine bacterium 560]
          Length = 764

 Score =  115 bits (287), Expect = 1e-23,   Method: Composition-based stats.
 Identities = 68/178 (38%), Positives = 93/178 (52%), Gaps = 5/178 (2%)

Query: 20  LATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKA 79
           L + Q N  T +    P  I  T RQ  GLPE   V+C FN  +KI P+T   W  +L+ 
Sbjct: 535 LPSYQVNDSTQS----PPEIFFT-RQDLGLPETGFVFCCFNNTFKITPTTFDSWGRILEH 589

Query: 80  VPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLC 139
           V  SIL +       + N+       G+D  R++F       E++ R ++AD+ LDT   
Sbjct: 590 VEGSILLIYVDNESAKLNLTKEIVLRGIDPSRLIFGEKLPIPEYLARYRVADLFLDTHPY 649

Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           N  TTS D L  G PV+TL G +  SR+AAS +  +  PELI  T KEY+ +AI L T
Sbjct: 650 NAGTTSSDALRMGLPVLTLKGNSFNSRMAASVINAVNLPELITSTQKEYESLAIELAT 707


>gi|380510752|ref|ZP_09854159.1| hypothetical protein XsacN4_06033 [Xanthomonas sacchari NCPPB 4393]
          Length = 570

 Score =  115 bits (287), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 89/156 (57%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+  GLPE   V+C FN  YK+ P ++   + VL+ VP+S+LWLL  P   +  ++A A
Sbjct: 366 TRRDCGLPERGTVFCCFNNSYKLGPRSMARMLEVLRQVPDSVLWLLSGPGQADDRLRAAA 425

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+D  R+ F    A  +++ R + AD+ LDT   N HTT+ D LW G PV+T PG T
Sbjct: 426 ATAGVDPTRLRFMPKLAHPDYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTCPGST 485

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+ A     +   A+RLG D
Sbjct: 486 FAARVAGSLNHHLGLDEMNAADDAAFVATAVRLGRD 521


>gi|294666024|ref|ZP_06731286.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292604216|gb|EFF47605.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 568

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 93/156 (59%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR Q GLP+ A+V C FN  YK++P ++   + VL+AVP S+LWLL  P   +A ++A A
Sbjct: 367 SRAQCGLPDHAVVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGQADARMRAAA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G+D  R++F       +++ R + AD+ LDT   N HTT+ D LW G PV+T PG+T
Sbjct: 427 QAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWVGCPVLTTPGQT 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDAAFVATAVALASD 522


>gi|154244408|ref|YP_001415366.1| hypothetical protein Xaut_0451 [Xanthobacter autotrophicus Py2]
 gi|154158493|gb|ABS65709.1| Tetratricopeptide TPR_2 repeat protein [Xanthobacter autotrophicus
           Py2]
          Length = 574

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 56/139 (40%), Positives = 83/139 (59%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R   GLP+DA V+C FN  +K+DP     +  +LKAVP S+LW+L+       N++  A
Sbjct: 361 TRADCGLPDDAFVFCAFNSAWKLDPEVFSAYTRILKAVPGSVLWVLESRENSADNLRREA 420

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A GLD  R++F+ +   ++H  R  LAD+ LDT     HTT+ D+L  G P+VT  G +
Sbjct: 421 EARGLDPARLVFAPMVPLKDHFARLPLADLFLDTFPYTAHTTASDMLRMGLPMVTRTGRS 480

Query: 163 LASRVAASQLATLGCPELI 181
            ASRVAAS +  +   + I
Sbjct: 481 FASRVAASIMTQMDLADFI 499


>gi|157413635|ref|YP_001484501.1| O-linked N-acetylglucosamine transferase [Prochlorococcus marinus
           str. MIT 9215]
 gi|157388210|gb|ABV50915.1| Predicted O-linked N-acetylglucosamine transferase, SPINDLY family
           [Prochlorococcus marinus str. MIT 9215]
          Length = 632

 Score =  114 bits (286), Expect = 2e-23,   Method: Composition-based stats.
 Identities = 57/156 (36%), Positives = 89/156 (57%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           +  +GL E   V+  FN   KI      +W+ +LK V NSILWL K       N++  A+
Sbjct: 433 KSDFGLSEHNFVFACFNANNKITLVEFNIWMRLLKKVKNSILWLYKSNNYSMMNLKKEAE 492

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G+   RI+F++    E+H+ R + AD+ LDT   N HTT+ D LW   PVVT  G++ 
Sbjct: 493 KQGVQSKRIIFADKMLNEDHLSRIKFADLFLDTFHYNAHTTASDALWAEVPVVTKQGQSF 552

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           ++RV +S L  L   ELI + + EY+++A ++ +D+
Sbjct: 553 SARVCSSLLTALNLEELITKENFEYENLAYKIASDK 588


>gi|452965780|gb|EME70798.1| O-linked N-acetylglucosamine transferase [Magnetospirillum sp.
           SO-1]
          Length = 658

 Score =  114 bits (285), Expect = 2e-23,   Method: Composition-based stats.
 Identities = 56/156 (35%), Positives = 89/156 (57%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           + SR   GLPED +V C+FN   K+D ++  +W+ +L  +P+++LW L  P      ++ 
Sbjct: 449 LPSRAALGLPEDRLVLCSFNGHRKLDRASFTLWLEILAELPDAVLWQLAPPDTARKRLEE 508

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            A+  G+D  R++++    + EH+ R   AD+ +D  +C  HTT+ D L  G P+VT  G
Sbjct: 509 AARNAGIDPARLIWAPALPRPEHLARLPAADLFVDALVCGAHTTAADSLRMGVPLVTAAG 568

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
             L SRVAAS L  +G PEL        +++A+ LG
Sbjct: 569 PRLGSRVAASLLHAVGLPELAVTDAAALKELAVGLG 604


>gi|308161318|gb|EFO63770.1| O-linked GlcNAc transferase [Giardia lamblia P15]
          Length = 1480

 Score =  114 bits (285), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 61/157 (38%), Positives = 88/157 (56%), Gaps = 2/157 (1%)

Query: 44   RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
            R  YG+P +  ++C FNQ+YK D  TL +   +L++VPN+   LLKFP   + +I+A  +
Sbjct: 1252 RALYGIPANCFLFCTFNQVYKFDMGTLGIIAALLRSVPNAYYALLKFPPASQLHIEAFFR 1311

Query: 104  ALGLD-QHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
                D   R++F S +  K EH+RR    DV +DT  CNG T  +D LW+G PVV   GE
Sbjct: 1312 HKAPDILDRVIFLSMLPMKVEHIRRYLAVDVFVDTLKCNGSTIVLDALWSGVPVVGFVGE 1371

Query: 162  TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             + SR   S L+ L C +LI  +  +   +  RL  D
Sbjct: 1372 YILSRKTLSFLSVLECNDLICASQGDAVLLCTRLAVD 1408


>gi|347739128|ref|ZP_08870464.1| TPR repeat-containing protein [Azospirillum amazonense Y2]
 gi|346917662|gb|EGX99948.1| TPR repeat-containing protein [Azospirillum amazonense Y2]
          Length = 355

 Score =  114 bits (285), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 61/160 (38%), Positives = 86/160 (53%), Gaps = 4/160 (2%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           T+R  +GLP D +V+  FN  YK+ P  +  W  +L  VP S+LWLL  P      +QA 
Sbjct: 143 TTRADWGLPADGVVFACFNATYKLTPEVVACWARLLAKVPGSVLWLLD-PGPALPTLQAM 201

Query: 102 AQALGLDQHRILFSNVAAKE---EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
               G+D  R++ +    K    +H+ R   AD+ LDT   N HTT+ D L+ G PV+T 
Sbjct: 202 LSTEGVDPSRLVVAPALPKARMADHLARHACADLFLDTHPVNAHTTAADALYAGLPVLTR 261

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            GE   SRVAAS L  LG P+L+  + + Y+  A+    D
Sbjct: 262 AGEPFISRVAASLLTALGLPDLVTYSLEAYEATALDFARD 301


>gi|348679898|gb|EGZ19714.1| hypothetical protein PHYSODRAFT_558418 [Phytophthora sojae]
          Length = 749

 Score =  114 bits (284), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 58/168 (34%), Positives = 98/168 (58%), Gaps = 7/168 (4%)

Query: 34  EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV 93
           E P    +T R+     E+A V+  F+   K+DPS    W+ +L  VP+S++W +++   
Sbjct: 316 ERPTLFALTGRR-----ENAFVFATFSNWQKMDPSVFSAWMEILSRVPSSVMWFIEYFGR 370

Query: 94  --GEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
               AN++A A+A G++  R++FS +    +H  R + AD+ LDT L NGHTT +D L  
Sbjct: 371 EGAIANLRAEAKAHGINGDRLIFSPMDPWIDHTYRKRAADLILDTSLKNGHTTILDALCA 430

Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           G PV++L G+ +++R  +S L +L   +L   + KEY ++A+ L T +
Sbjct: 431 GVPVISLEGDRMSNRATSSALNSLDLHDLTVNSLKEYVEVAVYLATHK 478


>gi|428179506|gb|EKX48377.1| hypothetical protein GUITHDRAFT_162478 [Guillardia theta CCMP2712]
          Length = 494

 Score =  114 bits (284), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 55/143 (38%), Positives = 82/143 (57%), Gaps = 1/143 (0%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           I+  +FNQLYKI+P    +W+ V++ VP S LWLLKF       +   A A G+ + R++
Sbjct: 293 IILASFNQLYKIEPKIFNVWMEVMRQVPKSKLWLLKFSKAASKYLLKEAAAAGVKEDRLV 352

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F +   K+  +     AD+ LDTPL N HTT  DVLW G P+++LPGE  A RVA   + 
Sbjct: 353 FHSKFPKDIELLAKSHADLFLDTPLFNAHTTGGDVLWAGIPMLSLPGENFAQRVAGGLIR 412

Query: 174 TLGCPEL-IARTHKEYQDIAIRL 195
           ++    + + R   +Y  +A +L
Sbjct: 413 SVRMSNVSLVRNLDDYAHVATKL 435


>gi|23016729|ref|ZP_00056482.1| COG3914: Predicted O-linked N-acetylglucosamine transferase,
           SPINDLY family [Magnetospirillum magnetotacticum MS-1]
          Length = 686

 Score =  114 bits (284), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 60/157 (38%), Positives = 85/157 (54%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+  GLP +A V+C FN L KI       ++++L  VPN +LWLL         ++  A
Sbjct: 410 TREAAGLPANATVFCGFNGLQKITAPMWARFMDILSRVPNGVLWLLDGGERINERLRQEA 469

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+   RI+F+      EH+ R  LAD+ LDT  C  HTT+ D LW G PV+T+ G  
Sbjct: 470 IKHGVTPDRIIFAPKLINAEHLSRYPLADLFLDTSPCGAHTTASDSLWMGVPVLTVAGRG 529

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASRV  S     G  +L+  T +EY + A+ LG D+
Sbjct: 530 FASRVCGSLAVAAGLSDLVCTTFEEYVEKAVELGNDK 566


>gi|124023730|ref|YP_001018037.1| hypothetical protein P9303_20301 [Prochlorococcus marinus str. MIT
           9303]
 gi|123964016|gb|ABM78772.1| Hypothetical protein P9303_20301 [Prochlorococcus marinus str. MIT
           9303]
          Length = 395

 Score =  113 bits (283), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 63/155 (40%), Positives = 93/155 (60%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R  +GLPE + V+  F   YKI PS   +W+ +LK V +S+LWL +     E N++  A+
Sbjct: 199 RSDFGLPESSFVFVCFCANYKITPSVFNVWMRLLKQVDDSVLWLYRSNKWAEINLRRQAE 258

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           +  +D  R++F+      +H+ R  LAD+ LDT   N HTT+ D L  G P++TL G++ 
Sbjct: 259 SRDIDPERLIFAGRLPLNKHLARHSLADLFLDTFNVNAHTTASDALLAGLPLLTLAGKSF 318

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            SRVAAS L TL  PELI  T K+Y++ A+ +  D
Sbjct: 319 TSRVAASLLVTLNLPELITYTIKDYEEKALMIALD 353


>gi|325929456|ref|ZP_08190584.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Xanthomonas perforans 91-118]
 gi|325540231|gb|EGD11845.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Xanthomonas perforans 91-118]
          Length = 568

 Score =  113 bits (282), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 66/156 (42%), Positives = 92/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR Q GLPE A+V C FN  YK++P ++   + VL+AVP S+LWLL  P   +A ++A A
Sbjct: 367 SRAQCGLPEHAVVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGEADARLRAAA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G+   R++F       +++ R + AD+ LDT   N HTT+ D LW G PV+T PG+T
Sbjct: 427 QAQGVHAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGQT 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  EL       +   A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDELNVADDAAFVATAVALASD 522


>gi|428171863|gb|EKX40776.1| hypothetical protein GUITHDRAFT_75246 [Guillardia theta CCMP2712]
          Length = 717

 Score =  113 bits (282), Expect = 5e-23,   Method: Composition-based stats.
 Identities = 58/173 (33%), Positives = 94/173 (54%), Gaps = 3/173 (1%)

Query: 28  KTATGEEVPQSIVITSRQQYGLPEDA--IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
           K +  E++       +     LP  +  +V  NFN L K+ P T  +W+ +++ +P+S+L
Sbjct: 423 KQSQAEQILGDFTAANASNLSLPHRSQQLVLANFNHLQKLGPHTFHLWLRIMQELPDSVL 482

Query: 86  WLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
           WLL+FP   E N++   +   +   R+  S+    + H+   + A + LDT   N H + 
Sbjct: 483 WLLRFPPEAEPNLRREFEEREIGT-RLHLSDKFPSKIHLNVKRAASLLLDTLEYNAHVSG 541

Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +D LW G P+VTL G  +A R  AS L T+  P+L+ART +EY  IA +LG +
Sbjct: 542 LDALWAGLPLVTLAGSNMARRCGASFLRTMKVPQLLARTSEEYVSIATQLGEN 594


>gi|345869929|ref|ZP_08821884.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiorhodococcus
           drewsii AZ1]
 gi|343922316|gb|EGV33018.1| Tetratricopeptide TPR_1 repeat-containing protein [Thiorhodococcus
           drewsii AZ1]
          Length = 693

 Score =  113 bits (282), Expect = 5e-23,   Method: Composition-based stats.
 Identities = 59/156 (37%), Positives = 91/156 (58%), Gaps = 1/156 (0%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE-ANIQATA 102
           R   GLPE A+VYC  N +YKI+     +W+ +L+ VP S+LWLL     G  A ++  A
Sbjct: 484 RAACGLPESAMVYCCLNGIYKIESGVFALWMRILERVPESVLWLLDGGESGSNARLREAA 543

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+D  R++F+   A  E++   +LAD+ LDT + N   T++  L  G PV+T PG  
Sbjct: 544 RTAGIDPDRLIFAPKRAHAEYLAHYRLADLFLDTLVYNAGATAVGALAAGLPVLTCPGGH 603

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+R+ AS    +G PEL+  + ++Y + A+ LG D
Sbjct: 604 YAARMGASLSHAVGLPELVCASPEDYVEQAVALGRD 639


>gi|428184442|gb|EKX53297.1| hypothetical protein GUITHDRAFT_101001 [Guillardia theta CCMP2712]
          Length = 738

 Score =  112 bits (281), Expect = 6e-23,   Method: Composition-based stats.
 Identities = 58/167 (34%), Positives = 93/167 (55%), Gaps = 19/167 (11%)

Query: 48  GLPEDAIVYCNFNQLYKIDPSTLQMWVNVL----------------KAVPNSILWLLKFP 91
           G  +D ++ C++N L+K DP TLQ W+ +L                + VP + LW+++F 
Sbjct: 496 GFEDDDVIMCSWNTLFKCDPDTLQTWITLLVSFIPLPLLVLSQVLKRKVPKAKLWIMQFD 555

Query: 92  AVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
                N+   A   G+D  RI +S    ++   +   LADV +DTPL N HTT +D LW 
Sbjct: 556 PTPPPNVALLAADNGVDVTRIAWSTFFPRQLEFKIKSLADVFVDTPLFNAHTTGVDALWC 615

Query: 152 GTPVVTLPGETLASRVAASQLATLGCPEL---IARTHKEYQDIAIRL 195
           GTP+VT+P E  ++RVAA  +A++   +L   +AR+ ++Y+ +  RL
Sbjct: 616 GTPLVTVPLEDFSARVAAGLVASVSRMQLGTTVARSMEDYEQVLTRL 662


>gi|78046530|ref|YP_362705.1| hypothetical protein XCV0974 [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
 gi|78034960|emb|CAJ22605.1| conserved hypothetical protein [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
          Length = 568

 Score =  112 bits (281), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 66/156 (42%), Positives = 93/156 (59%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR Q GLPE A+V C FN  YK++P ++   + VL+AVP S+LWLL  P   +A ++A A
Sbjct: 367 SRAQCGLPEHAVVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGEADARLRAAA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QALG+   R++F       +++ R + AD+ LDT   N HTT+ D LW G PV+T PG+T
Sbjct: 427 QALGVHAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGQT 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDAAFVATAVALASD 522


>gi|285017296|ref|YP_003375007.1| hypothetical protein XALc_0485 [Xanthomonas albilineans GPE PC73]
 gi|283472514|emb|CBA15019.1| hypothetical protein XALC_0485 [Xanthomonas albilineans GPE PC73]
          Length = 569

 Score =  112 bits (281), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 61/156 (39%), Positives = 85/156 (54%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR   GLPE   V+C FN  YK+ P ++   + VL+ VP+S+LWLL  P   +A ++  A
Sbjct: 367 SRHDCGLPEAGTVFCCFNNSYKLSPRSMARMLAVLRQVPDSVLWLLSGPGQADARLREAA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
               L+  R+ F    A  E++ R + AD+ LDT   N HTT+ D LW G PV+T PG T
Sbjct: 427 CKADLEPTRLRFMPKLAHPEYLSRYRHADLFLDTHPYNAHTTASDALWAGCPVLTCPGNT 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    L   ++       +   A+RLG D
Sbjct: 487 FAARVAGSLNHHLNMDDMNVVDDDAFIATAVRLGRD 522


>gi|375104670|ref|ZP_09750931.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Burkholderiales bacterium JOSHI_001]
 gi|374665401|gb|EHR70186.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Burkholderiales bacterium JOSHI_001]
          Length = 644

 Score =  112 bits (281), Expect = 7e-23,   Method: Composition-based stats.
 Identities = 60/161 (37%), Positives = 85/161 (52%), Gaps = 4/161 (2%)

Query: 43  SRQQYGLPEDA----IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI 98
           +R + GLP+D     +V C FNQ +KI       W  +L  +P ++LWLL++      NI
Sbjct: 432 TRSEAGLPDDGQGGPLVLCGFNQPFKISAPVFDTWCRLLHRLPQAVLWLLEWNGQVRKNI 491

Query: 99  QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
           +  A   G+   R+ ++     + H+ R QLAD+ +DT  CN HTT+ D LW   PVVT 
Sbjct: 492 ELEAAKRGIAPERLAWAPRRHPKHHMARLQLADIFIDTWPCNAHTTASDALWAAVPVVTF 551

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            G T ASRVAAS    +G   LI      Y++  + L  DR
Sbjct: 552 AGRTFASRVAASLNHAVGLDGLICDDVAHYEERIVALAHDR 592


>gi|418300833|ref|ZP_12912644.1| hypothetical protein ATCR1_24890 [Agrobacterium tumefaciens
           CCNWGS0286]
 gi|355532841|gb|EHH02194.1| hypothetical protein ATCR1_24890 [Agrobacterium tumefaciens
           CCNWGS0286]
          Length = 653

 Score =  112 bits (280), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 57/158 (36%), Positives = 90/158 (56%), Gaps = 2/158 (1%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPEDA VY +FN + KI  S    W+ +L   P S+LWLL        ++Q  A
Sbjct: 392 SRAEVGLPEDAFVYASFNGMQKITESCFARWMTILSETPGSLLWLLAGDEDVNEHLQGIA 451

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+   R++F+      +H+ R  +AD+ LDT     H+T+ D + +G PV+T+PG+T
Sbjct: 452 EKSGIAPERLVFAPKVPNPQHIARIGVADLFLDTFPYGAHSTAADAITSGLPVLTMPGKT 511

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            A+R   S +   G PE+I  + ++Y  +A  +G +RD
Sbjct: 512 FAARFCGSIVTAAGVPEMICSSPEDY--VARAIGFERD 547


>gi|409441295|ref|ZP_11268287.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY [Rhizobium
           mesoamericanum STM3625]
 gi|408747107|emb|CCM79485.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY [Rhizobium
           mesoamericanum STM3625]
          Length = 685

 Score =  112 bits (280), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 58/157 (36%), Positives = 88/157 (56%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R + GLPEDA ++  FN + KI  +  + W+ +L A PNSILWLL      +  ++ +A+
Sbjct: 422 RAEVGLPEDAFIFACFNGMQKITAACFERWMAILSATPNSILWLLSGGEDVDQRLRQSAE 481

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G+   R++F+  A   +H+ R  +AD+ LDT     H+T+ D +  G PV+T PG   
Sbjct: 482 QHGVAPERLIFAPKAPNPKHLARIGVADLFLDTFPYGAHSTAGDAVTMGLPVLTFPGNGF 541

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           ASR  AS +A  G PELI     +Y   A+   +DR+
Sbjct: 542 ASRFCASIVAAAGVPELICDGPDDYVGKAVAFASDRE 578


>gi|196232460|ref|ZP_03131313.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
 gi|196223532|gb|EDY18049.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
          Length = 792

 Score =  112 bits (280), Expect = 9e-23,   Method: Composition-based stats.
 Identities = 58/145 (40%), Positives = 80/145 (55%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           + + + N   KI+ +  Q W  VL+    S L LL +P     +  A  +  G++++R+ 
Sbjct: 599 VTFASLNDFGKINRAVFQRWAAVLRKATGSRLLLLTYPGEHRRDTIALLREEGIEENRVE 658

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F     + E++R    AD  LD    NGHTTS+D LW G PVV+L GE + SR   SQL 
Sbjct: 659 FLEPRPRAEYLRYYHRADAMLDPFPYNGHTTSLDALWMGVPVVSLAGERMVSRAGLSQLT 718

Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
            LG PEL+ART  EY +IA RL  D
Sbjct: 719 NLGLPELVARTEDEYVEIATRLAGD 743


>gi|99078404|ref|YP_611662.1| tetratricopeptide TPR_2 [Ruegeria sp. TM1040]
 gi|99035542|gb|ABF62400.1| Tetratricopeptide TPR_2 [Ruegeria sp. TM1040]
          Length = 560

 Score =  112 bits (279), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 57/154 (37%), Positives = 84/154 (54%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R  +GLP+D  V+ +FN  +K+ PS    W+ +L+ VP ++L      A   A +   AQ
Sbjct: 351 RAAFGLPDDRFVFASFNHPHKVGPSEFATWMEILREVPQAVLLFYSGKADLGAALAERAQ 410

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A G++  R+L      +  H+ R    D+CLD    N HTT+ D +W G P++TL G   
Sbjct: 411 AHGVEPSRVLACGPLPQTAHLERIAQVDLCLDCFAYNAHTTASDAVWAGVPLLTLAGRQF 470

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           A+RVA S L+T G PEL   + K+Y   A+ L T
Sbjct: 471 AARVATSILSTAGVPELSTVSTKDYVAKAVHLAT 504


>gi|452963366|gb|EME68439.1| O-linked N-acetylglucosamine transferase [Magnetospirillum sp.
           SO-1]
          Length = 690

 Score =  112 bits (279), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 59/157 (37%), Positives = 84/157 (53%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+  GLP +A V+C FN L K        ++++L  VPN +LWLL         ++  A
Sbjct: 410 TREAAGLPANATVFCGFNGLQKFTAPMWARFMDILSRVPNGVLWLLDGGERINERLRQEA 469

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+   RI+F+      EH+ R  LAD+ LDT  C  HTT+ D LW G PV+T+ G  
Sbjct: 470 IKHGVTPDRIIFAPKLINAEHLSRYPLADLFLDTSPCGAHTTASDALWMGVPVLTVAGRG 529

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASRV  S     G  EL+  T ++Y + A+ LG D+
Sbjct: 530 FASRVCGSLAVAAGLGELVCTTFEDYVEKAVDLGNDK 566


>gi|83312891|ref|YP_423155.1| O-linked N-acetylglucosamine transferase [Magnetospirillum
           magneticum AMB-1]
 gi|82947732|dbj|BAE52596.1| Predicted O-linked N-acetylglucosamine transferase
           [Magnetospirillum magneticum AMB-1]
          Length = 691

 Score =  112 bits (279), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 58/157 (36%), Positives = 85/157 (54%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+  GLP +A V+C FN + KI     + ++++L  VPN +LWLL         ++  A
Sbjct: 410 TREAAGLPANATVFCGFNGVQKITAPMWERFMDILSRVPNGVLWLLDGGERINERLRQEA 469

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+   RI+F+      EH+ R  LAD+ LDT  C  HTT+ D LW G PV+T+ G  
Sbjct: 470 IRHGVTPDRIIFAPKLINAEHLSRYPLADLFLDTSPCGAHTTASDALWMGVPVLTVAGRG 529

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASRV  S     G  +++  T  EY + A+ LG D+
Sbjct: 530 FASRVCGSLAVAAGLGDMVCTTFAEYVEKAVELGNDK 566


>gi|381171822|ref|ZP_09880962.1| putative uncharacterized protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380687782|emb|CCG37449.1| putative uncharacterized protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 568

 Score =  112 bits (279), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 92/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR Q GLP+ A+V C FN  YK++P ++   + VL+AVP S+LWLL  P   +A ++A A
Sbjct: 367 SRAQCGLPDHAVVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGEADARLRAAA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G+   R++F       +++ R + AD+ LDT   N HTT+ D LW G PV+T PG+T
Sbjct: 427 QAQGVHAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGQT 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   AI L +D
Sbjct: 487 FAARVAGSLNHRLGLDEMNVADDATFVAKAIELASD 522


>gi|21241713|ref|NP_641295.1| hypothetical protein XAC0943 [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21107081|gb|AAM35831.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 568

 Score =  112 bits (279), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 92/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR Q GLP+ A+V C FN  YK++P ++   + VL+AVP S+LWLL  P   +A ++A A
Sbjct: 367 SRAQCGLPDHAVVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGEADARLRAAA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G+   R++F       +++ R + AD+ LDT   N HTT+ D LW G PV+T PG+T
Sbjct: 427 QAQGVHAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGQT 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   AI L +D
Sbjct: 487 FAARVAGSLNHRLGLDEMNVADDATFVAKAIELASD 522


>gi|117926636|ref|YP_867253.1| hypothetical protein Mmc1_3361 [Magnetococcus marinus MC-1]
 gi|117610392|gb|ABK45847.1| TPR repeat-containing protein [Magnetococcus marinus MC-1]
          Length = 750

 Score =  111 bits (278), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 94/177 (53%), Gaps = 9/177 (5%)

Query: 24  QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
           Q N +T   +  P      SRQ+ GLP   +V+ +FNQ YKI P  + +W  +L AV + 
Sbjct: 518 QPNPRTLACDPTP------SRQEAGLPPQGVVFGSFNQSYKITPQIVALWSRLLHAVEHG 571

Query: 84  ILWLLK--FPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNG 141
           +LWLL    PA  E ++QA   A G+ + R++F+   +   H+ R  L DV LDT     
Sbjct: 572 VLWLLDPDLPAAVE-HLQAHFAAHGIAKQRVIFAPRLSMAAHMGRLPLVDVALDTFPYTS 630

Query: 142 HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           HTT+ D LW+G P+V   GET ASRV+      +G  EL+    + Y  +A  L  D
Sbjct: 631 HTTASDALWSGVPLVAWRGETFASRVSTCLAINMGMQELVVEDGEHYLALAQALAHD 687


>gi|428181344|gb|EKX50208.1| hypothetical protein GUITHDRAFT_161893 [Guillardia theta CCMP2712]
          Length = 799

 Score =  111 bits (278), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 65/177 (36%), Positives = 102/177 (57%), Gaps = 7/177 (3%)

Query: 20  LATNQTNTKTATGEEVPQSIVITSRQQYGLP-EDAIVYCNFNQLYKIDPSTLQMWVNVLK 78
           L T + +++     + P  +    R++ GLP + A+V+  FN  YKIDP   +MW+ +L+
Sbjct: 569 LQTYRRDSRAGIAVDRPHQL----REELGLPAQGALVFGAFNDHYKIDPQLFRMWMRLLR 624

Query: 79  AVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPL 138
           +   S++W ++    GEA +Q  A   G+D  R++ +    K EH+     +DV LDT L
Sbjct: 625 SSNGSVIWTIRHG--GEAILQKEAVKEGIDPARLVIAEPFPKVEHLAIKGYSDVFLDTHL 682

Query: 139 CNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
            NGH+T+ D+LW   PVVTLPGE +A+R AA  +A       IAR H +Y +++ RL
Sbjct: 683 YNGHSTTSDMLWAQVPVVTLPGEKMAARTAAGIMAAANAAIGIARNHDDYFELSKRL 739


>gi|418518286|ref|ZP_13084435.1| hypothetical protein MOU_15947 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
 gi|418523120|ref|ZP_13089144.1| hypothetical protein WS7_19138 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410700276|gb|EKQ58838.1| hypothetical protein WS7_19138 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410704363|gb|EKQ62847.1| hypothetical protein MOU_15947 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
          Length = 568

 Score =  111 bits (278), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 92/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR Q GLP+ A+V C FN  YK++P ++   + VL+AVP S+LWLL  P   +A ++A A
Sbjct: 367 SRAQCGLPDHAVVLCCFNNSYKLNPRSMARMLAVLRAVPGSVLWLLSGPGEADARLRAAA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G+   R++F       +++ R + AD+ LDT   N HTT+ D LW G PV+T PG+T
Sbjct: 427 QAQGVHAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGQT 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   AI L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDATFVAKAIELASD 522


>gi|30250032|ref|NP_842102.1| TPR repeat-containing glycosyl transferase [Nitrosomonas europaea
           ATCC 19718]
 gi|30139139|emb|CAD86003.1| Glycosyl transferases group 1:TPR repeat [Nitrosomonas europaea
           ATCC 19718]
          Length = 1189

 Score =  111 bits (278), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 55/147 (37%), Positives = 83/147 (56%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+  GLP    V+C+FN  +K        W+N+L+ VPNS+LWLL        N+Q  A
Sbjct: 437 TRKDCGLPARKFVFCSFNNNHKYTLEVFTTWMNILRRVPNSVLWLLADNPWARENLQKQA 496

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G+D  R++F+      +++ R  +AD+ LDT   N  TT+ D LW G PV+T+ G +
Sbjct: 497 KAQGIDPKRLVFAERTMPADYLARYLVADLFLDTFPFNAGTTANDALWMGLPVLTMSGRS 556

Query: 163 LASRVAASQLATLGCPELIARTHKEYQ 189
            ASR+A + L     PELI    + Y+
Sbjct: 557 FASRMAGALLTAADLPELITHDLQTYE 583


>gi|428176488|gb|EKX45372.1| hypothetical protein GUITHDRAFT_163259 [Guillardia theta CCMP2712]
          Length = 826

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 57/172 (33%), Positives = 95/172 (55%), Gaps = 3/172 (1%)

Query: 24  QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
           +  T    G    +  V   R+  GLP +  V  NFNQ+YK++P T ++W ++++    S
Sbjct: 524 EQETSGPAGSPSREEYVERQRRLLGLPANKTVVSNFNQMYKVNPETFKLWADIVREEETS 583

Query: 84  ILWLLKFPAVGEA---NIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCN 140
           + W +     GE      +  A A  +   ++ ++      E +RR  L+D+ LDT   N
Sbjct: 584 LFWQILEGPQGERISRRYELHASAHRIAAEKLFWTYKTEIVEFIRRCGLSDIVLDTFPVN 643

Query: 141 GHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIA 192
            HT +MDVLW GTP+++ PG   ASRVA+S L++LG  +LIAR  +++ +++
Sbjct: 644 AHTVAMDVLWMGTPLLSRPGMLFASRVASSILSSLGQTQLIARNEEDFVELS 695


>gi|445499835|ref|ZP_21466690.1| putative UDP-N-acetylglucosamine-peptide
           N-acetylglucosaminyltransferase [Janthinobacterium sp.
           HH01]
 gi|444789830|gb|ELX11378.1| putative UDP-N-acetylglucosamine-peptide
           N-acetylglucosaminyltransferase [Janthinobacterium sp.
           HH01]
          Length = 957

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 61/156 (39%), Positives = 87/156 (55%), Gaps = 6/156 (3%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQ 99
           +R  YGLPE   VYC F+   +I P     W+ +L  VP+S+LWL    A GE    N++
Sbjct: 424 ARAAYGLPEPGFVYCCFSAPEQITPERYATWMRILHQVPDSVLWLQ---AAGEQVRDNLR 480

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
             A   G+   R+ F++ AA E+ + R Q AD+ LDT        + DVLW G P++T  
Sbjct: 481 VAAMQRGIPSERLCFADPAAPEQQLARYQAADLFLDTAPYAAGAMAGDVLWAGLPLLTST 540

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
           G+  ASR +AS L   G PEL+  + K+Y+  A+RL
Sbjct: 541 GDGSASRTSASLLHAAGLPELVTASEKKYEARAVRL 576


>gi|346723854|ref|YP_004850523.1| O-linked N-acetylglucosamine transferase [Xanthomonas axonopodis
           pv. citrumelo F1]
 gi|346648601|gb|AEO41225.1| O-linked N-acetylglucosamine transferase [Xanthomonas axonopodis
           pv. citrumelo F1]
          Length = 568

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 91/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR Q GLPE A+V C FN  YK++P ++   + VL+ VP S+LWLL  P   +A ++A A
Sbjct: 367 SRAQCGLPEHAVVLCCFNNSYKLNPRSMARMLAVLRVVPGSVLWLLSGPGEADARLRAAA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G+   R++F       +++ R + AD+ LDT   N HTT+ D LW G PV+T PG+T
Sbjct: 427 QAQGVHAQRLVFMPKLPHAQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGQT 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDAAFVATAVALASD 522


>gi|408377703|ref|ZP_11175304.1| hypothetical protein QWE_08941 [Agrobacterium albertimagni AOL15]
 gi|407748694|gb|EKF60209.1| hypothetical protein QWE_08941 [Agrobacterium albertimagni AOL15]
          Length = 679

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 58/157 (36%), Positives = 87/157 (55%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R + GLPEDA V+  FN + KI       W+ +L A P SILWLL         ++  A
Sbjct: 422 TRSEVGLPEDAFVFACFNGMQKITGPMFARWMAILTATPGSILWLLGGSDAVNQRLRDLA 481

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           ++ G+D  R+ F++      H+ R  +AD+ LDT     H+T+ D L  G PV+TLPG++
Sbjct: 482 KSAGVDPARLYFASKVPNPNHLARIAIADLFLDTFPYGAHSTASDALTMGLPVLTLPGKS 541

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASR  +S + + G PELI  +  +Y   A+ L  D+
Sbjct: 542 FASRFCSSIVKSAGIPELICASPDDYVRKAVALANDK 578


>gi|40063716|gb|AAR38497.1| TPR repeat protein [uncultured marine bacterium 583]
          Length = 733

 Score =  110 bits (276), Expect = 2e-22,   Method: Composition-based stats.
 Identities = 59/157 (37%), Positives = 87/157 (55%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           I +R+  GLPE   V+C FN  YKI P+T   W  +L+ V  S+L +     + E N+  
Sbjct: 519 IFTRKDLGLPETGFVFCCFNNTYKITPTTFDGWGRILEQVDGSVLLIYADNELAEINLTK 578

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
                G++  R++F     K E++ R ++AD+ LDT   N  TT+ D L  G PV+T  G
Sbjct: 579 EIVLRGINPSRLIFGKRLPKPEYLARYRVADLFLDTLPYNAGTTASDALRMGLPVLTCMG 638

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           ++ ASR AAS +     PELI  T ++Y+ +AI L T
Sbjct: 639 KSFASREAASVINAFNLPELITTTQEQYESLAIELAT 675


>gi|294626108|ref|ZP_06704715.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292599557|gb|EFF43687.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 568

 Score =  110 bits (276), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 92/156 (58%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR Q GLP+ A+V C FN  YK++  ++   + VL+AVP S+LWLL  P   +A ++A A
Sbjct: 367 SRAQCGLPDHAVVLCCFNNSYKLNTRSMARMLAVLRAVPGSVLWLLSGPGQADARMRAAA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           QA G+D  R++F       +++ R + AD+ LDT   N HTT+ D LW G PV+T PG+T
Sbjct: 427 QAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWAGCPVLTTPGQT 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG  E+       +   A+ L +D
Sbjct: 487 FAARVAGSLNHHLGLDEMNVADDAAFVATAVALASD 522


>gi|254464315|ref|ZP_05077726.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110kDa subunit, putative [Rhodobacterales bacterium Y4I]
 gi|206685223|gb|EDZ45705.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110kDa subunit, putative [Rhodobacterales bacterium Y4I]
          Length = 714

 Score =  110 bits (276), Expect = 2e-22,   Method: Composition-based stats.
 Identities = 58/155 (37%), Positives = 88/155 (56%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   GLP+   V+C+F    +I P    +W+ +L + P+S+LWLL   A   +N++  A 
Sbjct: 500 RADCGLPDAGFVFCSFAAASQITPREFDIWMRLLSSTPDSVLWLLDHGAHAVSNLRRAAA 559

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A G+D  R++F+  A ++ H+ R  LA +CLDT   N    + D L  G PV+TLPG   
Sbjct: 560 ARGVDPDRLIFALPAERKAHLARLTLAGLCLDTFTVNASAAARDALAAGVPVLTLPGRQF 619

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           A+R  AS L   G P+LIA + ++Y++ A  L  D
Sbjct: 620 AARTGASLLRAAGLPDLIADSAEDYEEKAAELAAD 654


>gi|409989551|ref|ZP_11273106.1| hypothetical protein APPUASWS_02083, partial [Arthrospira platensis
           str. Paraca]
 gi|409939587|gb|EKN80696.1| hypothetical protein APPUASWS_02083, partial [Arthrospira platensis
           str. Paraca]
          Length = 632

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 61/158 (38%), Positives = 97/158 (61%), Gaps = 4/158 (2%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE--ANIQA 100
           +R  +GLPE   V+  FN+ YKI P   Q+W+ +L+ V  S+LWL    AV E  ++++ 
Sbjct: 439 NRSDFGLPETGFVFACFNRHYKISPDLFQVWMQLLQEVEGSVLWLSL--AVEETLSHLRQ 496

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
           TA+A G+D  R++F+      +++ R +LAD+ LDT   NG +T++  L+ G PV+T PG
Sbjct: 497 TAKAAGVDPQRLIFAPKIPHNQYLARMELADLGLDTWNYNGGSTTIAALYGGLPVLTKPG 556

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            T ASR+ AS  A+ G  E+I  +  +YQ+ A+   T+
Sbjct: 557 VTNASRMGASICASAGVTEMICNSAVDYQEKALHWATN 594


>gi|291567291|dbj|BAI89563.1| hypothetical protein [Arthrospira platensis NIES-39]
          Length = 1020

 Score =  110 bits (275), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 61/158 (38%), Positives = 97/158 (61%), Gaps = 4/158 (2%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE--ANIQA 100
           +R  +GLPE   V+  FN+ YKI P   Q+W+ +L+ V  S+LWL    AV E  ++++ 
Sbjct: 827 NRSDFGLPETGFVFACFNRHYKISPDLFQVWMQLLQEVEGSVLWLSL--AVEETLSHLRQ 884

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
           TA+A G+D  R++F+      +++ R +LAD+ LDT   NG +T++  L+ G PV+T PG
Sbjct: 885 TAKAAGVDPQRLIFAPKIPHNQYLARMELADLGLDTWNYNGGSTTIAALYGGLPVLTKPG 944

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            T ASR+ AS  A+ G  E+I  +  +YQ+ A+   T+
Sbjct: 945 VTNASRMGASICASAGVTEMICNSAVDYQEKALHWATN 982


>gi|418939709|ref|ZP_13493098.1| hypothetical protein PDO_2859 [Rhizobium sp. PDO1-076]
 gi|375053594|gb|EHS49984.1| hypothetical protein PDO_2859 [Rhizobium sp. PDO1-076]
          Length = 681

 Score =  110 bits (275), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 58/156 (37%), Positives = 87/156 (55%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R + GLPEDA V+  FN + KI  +    W+ +L A P S+LWLL         ++ +A
Sbjct: 420 TRTEAGLPEDAFVFACFNGMQKITANGFARWMKILAATPGSVLWLLTGSDAVNQRLRQSA 479

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+   RI+F+  A   +H+ R  +AD+ LDT     H+T+ D +  G PV+TLPG++
Sbjct: 480 TVCGVAADRIIFAPKAPNAQHLARIGVADLFLDTFPYGAHSTASDAITMGLPVLTLPGKS 539

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            ASR  AS ++  G PELI  T  +Y + A+    D
Sbjct: 540 FASRFCASIVSAAGIPELICSTPDDYVNKAVAYARD 575


>gi|254476420|ref|ZP_05089806.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110kDa subunit, putative [Ruegeria sp. R11]
 gi|214030663|gb|EEB71498.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110kDa subunit, putative [Ruegeria sp. R11]
          Length = 547

 Score =  110 bits (274), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 55/155 (35%), Positives = 83/155 (53%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R ++G+P+D  V+  FN +YKI P     W+ +LKA PNS+L         +  + A  +
Sbjct: 350 RDRFGIPQDRFVFATFNNIYKIGPREFATWMEILKAAPNSVLLFYLSNLDLKDRLIAKVK 409

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A G+D  R++ +    +++H+ R    D+CLD    N HTT+ D +W G P++TL GE  
Sbjct: 410 AAGVDPDRVILTGPLPQKDHLDRISQVDLCLDCFSYNAHTTASDAIWCGVPILTLCGEQF 469

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           A+RVA S L      EL   +  EY  +A  L  D
Sbjct: 470 AARVATSILHAANLAELSVTSVAEYVQLAAALAKD 504


>gi|154244409|ref|YP_001415367.1| hypothetical protein Xaut_0452 [Xanthobacter autotrophicus Py2]
 gi|154158494|gb|ABS65710.1| Tetratricopeptide TPR_2 repeat protein [Xanthobacter autotrophicus
           Py2]
          Length = 569

 Score =  110 bits (274), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 60/158 (37%), Positives = 88/158 (55%), Gaps = 7/158 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE---ANIQA 100
           R + GLP+   V+C FN   K+DP+T  +W+ +L  VP S+LWLL     G+   AN++ 
Sbjct: 354 RAECGLPDTGFVFCAFNNTRKLDPATFALWLRLLMEVPGSVLWLL----AGDDARANLRR 409

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
            A+A G+   R++F+      EH+ R  LAD+ LDT     HTT+ D L  G P+VT  G
Sbjct: 410 EAEAAGVAPERLVFAPHRPLPEHLARMALADLFLDTFPYTAHTTASDALRVGLPLVTRTG 469

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            + AS+VA   L   G  +L+  T   Y+ +A+ L  D
Sbjct: 470 RSFASKVAGRLLQLSGVGDLVTDTPAAYEALALSLARD 507


>gi|298708309|emb|CBJ48372.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           family GT41 [Ectocarpus siliculosus]
          Length = 662

 Score =  109 bits (273), Expect = 5e-22,   Method: Composition-based stats.
 Identities = 65/190 (34%), Positives = 108/190 (56%), Gaps = 17/190 (8%)

Query: 16  LQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPED--AIVYCNFNQLYKIDPSTLQMW 73
           L  GLA     T +++G +  ++++   R+  GLP D    V+ NFN++ K+DP +  +W
Sbjct: 367 LYEGLAP----TSSSSGRKHDRAVL---RRGQGLPGDLSRAVFANFNKVDKLDPESFSLW 419

Query: 74  VNVLKAVPNSILWLLKFPAVG------EANIQATAQALGLDQHRILFSNVAAKEEHVRRG 127
           + +L+ VP S+LWLL+  AV          ++  A++ G+D  RI+++   +K EH+ R 
Sbjct: 420 MQILRRVPGSVLWLLEASAVDSERAAIRTRLRQEAESQGVDGGRIVWAKWVSKSEHLSRH 479

Query: 128 QLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE--LIARTH 185
            LAD+ LDT     H+T+ D L  G P++TL G + ASRV  S L   G  +  L+    
Sbjct: 480 ALADLFLDTLTYGAHSTATDALAGGLPLLTLAGASFASRVGISLLRNAGPSQSALLVAGQ 539

Query: 186 KEYQDIAIRL 195
           +E++D+A+ L
Sbjct: 540 REFEDLAVDL 549


>gi|349605893|gb|AEQ00976.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           110 kDa subunit-like protein, partial [Equus caballus]
          Length = 418

 Score =  109 bits (272), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 47/67 (70%), Positives = 56/67 (83%)

Query: 1   MIASGQVQTSVNGIVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFN 60
           MI  GQ+Q ++NG  + NGLAT Q N K ATGEEVP++I++T+R QYGLPEDAIVYCNFN
Sbjct: 352 MINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFN 411

Query: 61  QLYKIDP 67
           QLYKIDP
Sbjct: 412 QLYKIDP 418


>gi|145588310|ref|YP_001154907.1| hypothetical protein Pnuc_0122 [Polynucleobacter necessarius subsp.
           asymbioticus QLW-P1DMWA-1]
 gi|145046716|gb|ABP33343.1| Tetratricopeptide TPR_2 repeat protein [Polynucleobacter
           necessarius subsp. asymbioticus QLW-P1DMWA-1]
          Length = 660

 Score =  108 bits (271), Expect = 9e-22,   Method: Composition-based stats.
 Identities = 54/155 (34%), Positives = 84/155 (54%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           ++  +G+ ED  V+CN N  +KI  +    W+ +LK  PNS++ L    A    N+   A
Sbjct: 452 AKSDFGIQEDQFVFCNLNNAFKITQAIFDSWLRILKMTPNSVMLLYAENAEATKNLLEYA 511

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+   R +F+    + +++ R ++ D+ LDT   N  TT+ D LW   PV+TL GE+
Sbjct: 512 SKNGVAPQRFVFAERLTRADYLARYKITDLFLDTTPYNAGTTASDALWMDIPVITLQGES 571

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
             SRVA+S L  L  PELI ++   Y+  A+ L T
Sbjct: 572 FPSRVASSLLTHLDLPELIHQSMAGYETQAVELAT 606


>gi|398350156|ref|YP_006395620.1| hypothetical protein USDA257_c02650 [Sinorhizobium fredii USDA 257]
 gi|390125482|gb|AFL48863.1| hypothetical protein USDA257_c02650 [Sinorhizobium fredii USDA 257]
          Length = 684

 Score =  108 bits (270), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 58/157 (36%), Positives = 84/157 (53%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPEDA ++  FN + KI  +    W+ +L A P S LWLL      +  ++  A
Sbjct: 420 SRAEVGLPEDAFIFACFNGMQKITAACFDRWMAILSATPGSFLWLLGGGEDVDQRLKQAA 479

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+   R++F++ A   +H+ R  LAD+ LDT     H+T+ D L  G PV+T PG  
Sbjct: 480 EQHGVAPERLIFASKAPNPKHLARIGLADLFLDTFPYGAHSTAGDALTMGLPVLTFPGNG 539

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASR  +S +A  G PELI     ++   AI    DR
Sbjct: 540 FASRFCSSIVAAAGVPELICDGPDDFVRKAIGFARDR 576


>gi|372487918|ref|YP_005027483.1| hypothetical protein Dsui_1242 [Dechlorosoma suillum PS]
 gi|359354471|gb|AEV25642.1| hypothetical protein Dsui_1242 [Dechlorosoma suillum PS]
          Length = 724

 Score =  107 bits (268), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/166 (37%), Positives = 91/166 (54%), Gaps = 3/166 (1%)

Query: 34  EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL-LKFPA 92
           E P ++ +    + G P   +V+  FNQ +K+D   + +W  VL AVP ++LWL  + PA
Sbjct: 496 EAP-AVPLLEAAELGAPAAPVVFACFNQTWKVDAQAMALWQAVLAAVPEAVLWLHAETPA 554

Query: 93  VGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTG 152
           V  A  +  A+A G+   R+ FS    +  ++ R +     LDTPL +GH T  D L  G
Sbjct: 555 VAAALTRRAAEA-GIGATRLHFSPGTTRPRYLARYRHVHAVLDTPLYHGHGTGADALGMG 613

Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           TPV++L GE   +RV  S +  LG P+L A + + Y   AIRLG D
Sbjct: 614 TPVLSLQGEGFQARVGGSLVRALGLPQLAAESAEAYVATAIRLGRD 659


>gi|170739041|ref|YP_001767696.1| hypothetical protein M446_0702 [Methylobacterium sp. 4-46]
 gi|168193315|gb|ACA15262.1| Tetratricopeptide TPR_2 repeat protein [Methylobacterium sp. 4-46]
          Length = 667

 Score =  107 bits (268), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 56/150 (37%), Positives = 86/150 (57%)

Query: 47  YGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALG 106
           +GLP + +V+  F+  YK+ P     W+ +L+AVP S+LWL+   A    N++  A   G
Sbjct: 453 HGLPAEGLVFGAFSASYKLTPDLFAAWMRLLRAVPGSVLWLVADHAPTRDNLRREAGRHG 512

Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
           +D  R++ +  A  E ++ R +  D+ LDT   N  TT+ D LW G P++TL G   ASR
Sbjct: 513 VDPDRLVLAGRAPYETYLDRLRHIDLFLDTFPYNAGTTASDALWMGVPLLTLAGRAYASR 572

Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLG 196
           +A S L  LG P+L+A +   Y+  A+RL 
Sbjct: 573 MAGSLLTALGLPDLVAASLDAYERQALRLA 602


>gi|424795604|ref|ZP_18221433.1| hypothetical protein XTG29_03097 [Xanthomonas translucens pv.
           graminis ART-Xtg29]
 gi|422795393|gb|EKU24099.1| hypothetical protein XTG29_03097 [Xanthomonas translucens pv.
           graminis ART-Xtg29]
          Length = 523

 Score =  107 bits (268), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 57/156 (36%), Positives = 85/156 (54%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+  GLPE   V+C FN  YK+ P ++   + VL+ VP+S+LWLL  P   +  ++  A
Sbjct: 321 SRRDCGLPEHGTVFCCFNNSYKLTPRSMTRMLAVLRQVPDSVLWLLSGPGQADTRLRDAA 380

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +   +D  R+ F    A  +++ R + AD+ LDT   N HTT+ D LW G P++  PG T
Sbjct: 381 RRADVDPARLRFMPKLAHADYLARYRHADLFLDTHPYNAHTTASDALWAGCPLLICPGTT 440

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG   +       +   A++LG D
Sbjct: 441 FAARVAGSLNHHLGMDAMNVADDDAFIAKAVQLGRD 476


>gi|428168147|gb|EKX37095.1| hypothetical protein GUITHDRAFT_145209 [Guillardia theta CCMP2712]
          Length = 585

 Score =  107 bits (267), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 53/130 (40%), Positives = 78/130 (60%), Gaps = 3/130 (2%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA--VGEANIQA 100
           +RQ++ LP   ++   FN  +KI P  L +W+++L +VP S L  +K+        NI  
Sbjct: 384 TRQEFSLPLSGVLLACFNAFFKITPDILDLWLHILLSVPQSFLLFVKYQYHQAARRNILQ 443

Query: 101 TAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
             +   +   RI+F N  A+  +H+RR  L+DV LDT L N HT ++DVLW+G P+V LP
Sbjct: 444 VVKQKNISSARIIFMNRTASLRDHMRRTSLSDVYLDTRLYNAHTIAVDVLWSGVPIVALP 503

Query: 160 GETLASRVAA 169
           G +LASRV A
Sbjct: 504 GTSLASRVTA 513


>gi|254468836|ref|ZP_05082242.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           putative [beta proteobacterium KB13]
 gi|207087646|gb|EDZ64929.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           putative [beta proteobacterium KB13]
          Length = 624

 Score =  107 bits (267), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 56/155 (36%), Positives = 86/155 (55%), Gaps = 1/155 (0%)

Query: 40  VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ 99
           + T R  Y LP++  ++ +FNQ  KI      +W+N+L+  P S LW+L    + + NI+
Sbjct: 420 ITTPRDTYRLPDNHFIFASFNQNLKITKDIFDVWINILQECPKSSLWILLQNDISKRNIK 479

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
             A+  GLD +RI+ +  A  +EH+ R + AD+ LD    N HTT  D ++   P +T  
Sbjct: 480 EYARKNGLDDNRIIIAEYAPIDEHIGRIEHADLFLDCYPYNAHTTIADAIYQNKPTITHI 539

Query: 160 GETLASRVAASQLATLGCP-ELIARTHKEYQDIAI 193
           GE + SRV AS L  + C  EL+    ++YQ  AI
Sbjct: 540 GEVMQSRVGASLLHAIDCSEELVFDNLQDYQKQAI 574


>gi|406706967|ref|YP_006757320.1| hypothetical protein HIMB5_00014060 [alpha proteobacterium HIMB5]
 gi|406652743|gb|AFS48143.1| tetratricopeptide repeat protein [alpha proteobacterium HIMB5]
          Length = 657

 Score =  107 bits (267), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 63/161 (39%), Positives = 91/161 (56%)

Query: 38  SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
           S+   S+Q  GL E + V+C FN   KI+P    +W+ +LK    S+LWLLK     E N
Sbjct: 445 SVNSISKQSLGLTEGSFVFCCFNSHQKINPIIFDVWLEILKKTEKSVLWLLKDNKFSEKN 504

Query: 98  IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
           I+   +   +D  R++F++    E+H++R Q AD+ LDT   N HTT  D L    PV+T
Sbjct: 505 IKLYCEKNKVDPDRLIFADRVELEDHLQRIQYADLFLDTFPYNAHTTCSDALRMNIPVLT 564

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           L G + ASRVA+S L +L   ELI      Y+D A+++  D
Sbjct: 565 LKGNSFASRVASSLLNSLDLSELITHDLNNYKDAALKIYKD 605


>gi|400755802|ref|YP_006564170.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
           [Phaeobacter gallaeciensis 2.10]
 gi|398654955|gb|AFO88925.1| putative UDP-N-acetylglucosamine-peptide
           N-acetylglucosaminyltransferase [Phaeobacter
           gallaeciensis 2.10]
          Length = 730

 Score =  107 bits (266), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 55/158 (34%), Positives = 84/158 (53%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+  GLP++A V+C+F   + I P    +W+ +L  +  S+LWL   P   +A ++  A
Sbjct: 496 SRRDCGLPDEAFVFCSFTASHAITPREFDIWMRLLTKIDGSVLWLADHPEEAQAALRQAA 555

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
               +D  R++F+     EE V R  +AD+ LD+   N    + D L  G PV+T+ G  
Sbjct: 556 GDRRVDPDRVIFAAPCTGEERVARSMVADLFLDSFTLNAGPAARDALVAGLPVLTMAGRQ 615

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            A+R  AS LA  G  EL   T + Y+  A+ L +DRD
Sbjct: 616 FAARTTASLLAAAGMEELQTTTPQAYEARALELASDRD 653


>gi|428176588|gb|EKX45472.1| hypothetical protein GUITHDRAFT_108736 [Guillardia theta CCMP2712]
          Length = 710

 Score =  107 bits (266), Expect = 4e-21,   Method: Composition-based stats.
 Identities = 48/128 (37%), Positives = 81/128 (63%)

Query: 45  QQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA 104
           ++ G+      Y +FN LYK+ PS    W+ +L  V +SILWLLKFP+  E  ++  A+A
Sbjct: 520 KELGIGPSYFRYASFNSLYKLTPSVFGAWMKILSQVDSSILWLLKFPSSAEKRLRKEAKA 579

Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
            G+D  R++F+++  + EH+++   A + LDT + N H+++ D L +G PV+++ G  + 
Sbjct: 580 RGVDDKRLVFTDLYPRAEHLQKKSGASLFLDTLVYNAHSSAADSLASGVPVLSMAGSGIV 639

Query: 165 SRVAASQL 172
           +RVAAS L
Sbjct: 640 NRVAASLL 647


>gi|227820814|ref|YP_002824784.1| hypothetical protein NGR_c02330 [Sinorhizobium fredii NGR234]
 gi|227339813|gb|ACP24031.1| conserved hypothetical protein contains tetratricopeptide protein
           domain [Sinorhizobium fredii NGR234]
          Length = 685

 Score =  107 bits (266), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 58/157 (36%), Positives = 82/157 (52%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPED  ++  FN   KI  +    W+ +L A P S LWLL      +  ++  A
Sbjct: 420 SRAEAGLPEDGFIFACFNGAQKITAACFDRWMAILAATPGSCLWLLGGGEDVDQRLKQAA 479

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           Q  G+   R++F++ A   +H+ R  LAD+ LDT     H+T+ D L  G PV+T PG  
Sbjct: 480 QQRGVAPERLIFASKAPNPKHLARIGLADLFLDTFPYGAHSTAGDALTMGLPVLTFPGNG 539

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASR  +S +A  G PELI     ++   AI    DR
Sbjct: 540 FASRFCSSIVAAAGVPELICEGPDDFVRKAIGFAKDR 576


>gi|301106148|ref|XP_002902157.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
           [Phytophthora infestans T30-4]
 gi|262098777|gb|EEY56829.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
           [Phytophthora infestans T30-4]
          Length = 744

 Score =  106 bits (265), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 51/155 (32%), Positives = 90/155 (58%), Gaps = 2/155 (1%)

Query: 47  YGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA--NIQATAQA 104
           + L +   V+  F+   K+DPS    W+ +L  VP+S++W  ++     A  N++  A+A
Sbjct: 321 FDLDDYTFVFATFSNWQKMDPSVFSAWMEILARVPSSVMWFQEYFGHEGAITNLRNEAEA 380

Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
            G++  R++FS +    +H  R ++AD+ LDT + NGHTT +D L  G PV+TL G+ ++
Sbjct: 381 HGINGKRLIFSPLDPWIDHTYRKRIADLVLDTSMKNGHTTVLDALCAGVPVITLEGDQMS 440

Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           +R  +S L  L   ++   + KEY ++A+ L T +
Sbjct: 441 NRATSSALNALDLHDMTVHSIKEYVEVAVYLATHK 475


>gi|196232461|ref|ZP_03131314.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
 gi|196223533|gb|EDY18050.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
          Length = 760

 Score =  106 bits (265), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 59/145 (40%), Positives = 76/145 (52%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           + Y +FN   K++P  L++W  VL A   S L LL  P           +  G+  +RI 
Sbjct: 567 VTYGSFNNFCKVNPDVLRLWTRVLAATEGSRLILLSHPGSHRQRTVDIFEREGIAANRIE 626

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F       E++R  Q  D+ LD    NGHTTS+D LW G PVV+L G+  ASR   SQL 
Sbjct: 627 FVEPRPFAEYLRLYQRLDLVLDPFPYNGHTTSLDALWMGVPVVSLAGQLAASRAGLSQLT 686

Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
            LG  EL+ART  EY  I+ RL  D
Sbjct: 687 NLGLTELVARTEDEYVAISTRLAGD 711


>gi|399992733|ref|YP_006572973.1| hypothetical protein PGA1_c15440 [Phaeobacter gallaeciensis DSM
           17395 = CIP 105210]
 gi|398657288|gb|AFO91254.1| hypothetical protein PGA1_c15440 [Phaeobacter gallaeciensis DSM
           17395 = CIP 105210]
          Length = 552

 Score =  106 bits (265), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 56/167 (33%), Positives = 86/167 (51%), Gaps = 2/167 (1%)

Query: 34  EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV 93
            VP+  V  +R  Y LP+D  V+ N N  YK+ P     W+ +LK  P+S+L        
Sbjct: 343 HVPE--VRNTRSDYNLPQDKFVFANLNNTYKVGPREFATWMKILKRTPDSVLLFYMGKDD 400

Query: 94  GEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGT 153
               I    +A G+D  RI+      + +H+ R    D+CLD    N HTT+ D LW G 
Sbjct: 401 LSDVIAQKTEAHGVDPDRIIPCGALPQADHLDRIAQVDLCLDCFSYNAHTTASDALWAGV 460

Query: 154 PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           P++TL G+  A+RVA+S L+    P+L  ++ + + D A+ L  + D
Sbjct: 461 PILTLAGKQFAARVASSILSAAHLPDLSVKSEELFIDKAVSLAKNPD 507


>gi|224010822|ref|XP_002294368.1| o-linked GlcNAc transferase [Thalassiosira pseudonana CCMP1335]
 gi|220969863|gb|EED88202.1| o-linked GlcNAc transferase [Thalassiosira pseudonana CCMP1335]
          Length = 413

 Score =  106 bits (264), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 62/175 (35%), Positives = 94/175 (53%), Gaps = 16/175 (9%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAV---------PNSILWLLKFP 91
           ++ RQ+YG+   A VYC  ++  KIDPST + W+  ++ V         P+ +LWLL+  
Sbjct: 184 VSLRQKYGIDPAAFVYCCHSRPDKIDPSTFRSWLRAIRRVISEQSGTDAPSPVLWLLRSG 243

Query: 92  AVGEANI-QATAQALGLD-QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVL 149
              E N+ Q      G D Q  ++F+ VA + EH++R   AD+ LDTP  N HT   D L
Sbjct: 244 EEMEHNLRQLVVCEFGEDMQDCLVFAEVADRREHLKRLGCADLFLDTPAYNAHTLGCDAL 303

Query: 150 WTGTPVVTL-----PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           + G P+V+L         LASRV AS L +  C  LI     EY+D+ ++  +++
Sbjct: 304 YVGVPMVSLLRNYDGNSKLASRVGASMLISASCEALIVSDMNEYEDMMVKCVSEK 358


>gi|162455198|ref|YP_001617565.1| hypothetical protein sce6916 [Sorangium cellulosum So ce56]
 gi|161165780|emb|CAN97085.1| TPR domain protein [Sorangium cellulosum So ce56]
          Length = 686

 Score =  106 bits (264), Expect = 5e-21,   Method: Composition-based stats.
 Identities = 57/151 (37%), Positives = 79/151 (52%)

Query: 48  GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGL 107
            L E  + +   N   K++   L +W  VL AVP S L L+         +++  +  G+
Sbjct: 491 ALAEGRVNFGCLNHFRKVNEGVLHVWTRVLAAVPRSRLTLMAPQGRARDRVRSIFEEGGV 550

Query: 108 DQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
              RI F +   + +++RR +  D+CLDT   NGHTTS+D LW G PVVTL GET+  R 
Sbjct: 551 QLDRIEFVDRCGRLDYLRRYREIDICLDTFPSNGHTTSLDALWMGVPVVTLAGETVVGRA 610

Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTD 198
              Q   LG PELIA T + Y  +A  L  D
Sbjct: 611 GVCQAMNLGLPELIATTPEHYVRVASSLAED 641


>gi|427717067|ref|YP_007065061.1| hypothetical protein Cal7507_1771 [Calothrix sp. PCC 7507]
 gi|427349503|gb|AFY32227.1| hypothetical protein Cal7507_1771 [Calothrix sp. PCC 7507]
          Length = 1269

 Score =  106 bits (264), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 90/170 (52%), Gaps = 4/170 (2%)

Query: 35   VPQSIVIT----SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF 90
            V  SI IT    +R  +GLP DA V+C FN+  K DP     W+ +L+ VPNS+LWL++ 
Sbjct: 1058 VASSITITDQPLTRADFGLPTDAFVFCCFNRSDKFDPEVFASWMRILQQVPNSVLWLIET 1117

Query: 91   PAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLW 150
                   ++  AQ  G+   R++F+      E++   +LAD+ LDT + N   T +  L+
Sbjct: 1118 TPDVSYTLRDMAQQQGIAPTRLVFTPRLPLAEYLAAYRLADLFLDTFVYNAGATGIHALF 1177

Query: 151  TGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             G P++T PG+   +R+ AS  A  G   LI  +   Y+  A+ L T  D
Sbjct: 1178 AGLPLITRPGKAFVARMGASICAAAGLDLLICDSSAAYEQKAVHLATHPD 1227


>gi|440733676|ref|ZP_20913367.1| hypothetical protein A989_18498 [Xanthomonas translucens DAR61454]
 gi|440359550|gb|ELP96855.1| hypothetical protein A989_18498 [Xanthomonas translucens DAR61454]
          Length = 569

 Score =  106 bits (264), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 57/156 (36%), Positives = 84/156 (53%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+  GLPE   V+C FN  YK+ P ++   + VL+ V  S+LWLL  P   +A +   A
Sbjct: 367 ARRDCGLPEHGTVFCCFNNSYKLTPRSMTRMLAVLRQVQGSVLWLLSGPGQADARLHDAA 426

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +   +D  R+ F    A  +++ R + AD+ LDT   N HTT+ D LW G P++T PG T
Sbjct: 427 RRADVDPARLRFMPKLAHADYLARYRHADLFLDTHPYNAHTTASDALWAGCPLLTCPGTT 486

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG   +       +   A++LG D
Sbjct: 487 FAARVAGSLNHHLGMDAMNVADDDAFVAKAVQLGRD 522


>gi|400754413|ref|YP_006562781.1| hypothetical protein PGA2_c15330 [Phaeobacter gallaeciensis 2.10]
 gi|398653566|gb|AFO87536.1| hypothetical protein PGA2_c15330 [Phaeobacter gallaeciensis 2.10]
          Length = 552

 Score =  105 bits (263), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 89/170 (52%), Gaps = 10/170 (5%)

Query: 35  VPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
           VP+  V  +R  Y LP+D  V+ N N  YK+ P     W+ +LK  P S+L       +G
Sbjct: 344 VPE--VRNTRSDYNLPQDKFVFANLNNTYKVGPREFATWMKILKRTPESVLLFY----MG 397

Query: 95  EAN----IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLW 150
           + +    I    +A G+D  RI+      + +H+ R    D+CLD    N HTT+ D LW
Sbjct: 398 KDDLCDVIAQKTEAHGVDPDRIIPCGALPQADHLDRIAQVDLCLDCFSYNAHTTASDALW 457

Query: 151 TGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            G P++TL G+  A+RVA+S L+    P+L  ++ + + D A+ L  + D
Sbjct: 458 AGVPILTLAGKQFAARVASSILSAAHLPDLSVKSEELFIDKAVSLAKNPD 507


>gi|335035927|ref|ZP_08529257.1| hypothetical protein AGRO_3257 [Agrobacterium sp. ATCC 31749]
 gi|333792491|gb|EGL63858.1| hypothetical protein AGRO_3257 [Agrobacterium sp. ATCC 31749]
          Length = 681

 Score =  105 bits (263), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 56/158 (35%), Positives = 88/158 (55%), Gaps = 2/158 (1%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR   GLPEDA VY +FN + KI  +    W+ +L   P S+LWLL         ++  A
Sbjct: 420 SRADVGLPEDAFVYASFNGMQKITENCFARWMTILSETPGSLLWLLTGDDDVNQRLRDLA 479

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+   R++F+  A   +H+ R  LAD+ LDT     H+T+ D + +G PV+T+ G+T
Sbjct: 480 EKSGVASERLVFAPKAQNPQHIARIGLADLFLDTFPYGAHSTASDAITSGLPVLTMSGKT 539

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            A+R   S +   G PE+I  + ++Y  +A  +G +RD
Sbjct: 540 FAARFCGSIVTAAGVPEMICSSPEDY--VARAIGFERD 575


>gi|40063048|gb|AAR37904.1| TPR domain protein [uncultured marine bacterium 560]
          Length = 732

 Score =  105 bits (262), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 62/178 (34%), Positives = 89/178 (50%), Gaps = 5/178 (2%)

Query: 20  LATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKA 79
           L + Q N  T +    P  I  T RQ  GLPE   V+C FN  +KI P+T   W  +L+ 
Sbjct: 503 LPSYQANDSTQS----PPEIFFT-RQDLGLPETGFVFCCFNNTFKITPTTFDSWGRILEH 557

Query: 80  VPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLC 139
           V  S+L +       + N+       G+D  R++F       E++ R ++ D+ LDT   
Sbjct: 558 VEGSVLLIYANNESAKLNLTKEIVLRGIDPSRLIFGEKLPMSEYLARYRVVDLFLDTHPY 617

Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           N  T +   L  G PV+T  G + ASR+ AS +  +  PELI  T KEY+ +AI L T
Sbjct: 618 NAGTIASGALRMGLPVLTCIGNSFASRMGASVINAVNLPELITSTQKEYESLAIELAT 675


>gi|46202694|ref|ZP_00052759.2| COG3914: Predicted O-linked N-acetylglucosamine transferase,
           SPINDLY family [Magnetospirillum magnetotacticum MS-1]
          Length = 256

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 63/167 (37%), Positives = 90/167 (53%)

Query: 32  GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
           G+  P       R  +GLPEDA V+C  N  +K  P T  +W  +L  VP  +LWLL   
Sbjct: 37  GDHPPVGEPSQPRAAHGLPEDAFVFCCINNSFKFRPQTFDVWAEILHQVPRGVLWLLDRG 96

Query: 92  AVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
            +   N+       G+D  R++F+      EH+ R +LAD+ LD+   N  TT+ D LW+
Sbjct: 97  PLSRDNMNREMAGRGIDPARLVFAQRVDLAEHMARHRLADLFLDSFPFNAITTASDALWS 156

Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           G PV+T  GE+  SR A+S L  LG PEL+  + + Y+  AIRL  +
Sbjct: 157 GLPVLTWMGESFVSRGASSLLLALGLPELVTASPEAYKAEAIRLARE 203


>gi|159185979|ref|NP_356665.2| hypothetical protein Atu3976 [Agrobacterium fabrum str. C58]
 gi|159141113|gb|AAK89450.2| conserved hypothetical protein [Agrobacterium fabrum str. C58]
          Length = 681

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 56/158 (35%), Positives = 88/158 (55%), Gaps = 2/158 (1%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR   GLPEDA VY +FN + KI  +    W+ +L   P S+LWLL         ++  A
Sbjct: 420 SRADVGLPEDAFVYASFNGMQKITENCFARWMTILSETPGSLLWLLTGDDDVNQRLRDLA 479

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+   R++F+  A   +H+ R  LAD+ LDT     H+T+ D + +G PV+T+ G+T
Sbjct: 480 EKSGVASERLVFAPKAQNPQHIARIGLADLFLDTFPYGAHSTASDAITSGLPVLTMYGKT 539

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            A+R   S +   G PE+I  + ++Y  +A  +G +RD
Sbjct: 540 FAARFCGSIVTAAGVPEMICSSPEDY--VARAIGFERD 575


>gi|378824868|ref|YP_005187600.1| hypothetical protein SFHH103_00273 [Sinorhizobium fredii HH103]
 gi|365177920|emb|CCE94775.1| conserved hypothetical protein [Sinorhizobium fredii HH103]
          Length = 685

 Score =  105 bits (261), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 56/156 (35%), Positives = 81/156 (51%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R + GLPED  ++  FN   KI  +    W+ +L A P S LWLL      +  ++  AQ
Sbjct: 421 RAEAGLPEDGFIFACFNGAQKITAACFDRWMAILAATPGSFLWLLGGGEDVDQRLKQAAQ 480

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G+   R++F++ A   +H+ R  LAD+ LDT     H+T+ D L  G PV+T PG   
Sbjct: 481 QRGVAPERLIFASKAPNPKHLARIGLADLFLDTFPYGAHSTAGDALTMGLPVLTFPGNGF 540

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           ASR  +S +A  G PELI     ++   A+    DR
Sbjct: 541 ASRFCSSIVAAAGVPELICDGPDDFVRKAVGFAKDR 576


>gi|222148769|ref|YP_002549726.1| hypothetical protein Avi_2411 [Agrobacterium vitis S4]
 gi|221735755|gb|ACM36718.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 582

 Score =  105 bits (261), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 65/159 (40%), Positives = 91/159 (57%), Gaps = 6/159 (3%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR   GLP D +V   FN   K    T+++W  VLKA P ++LWL+     G    QATA
Sbjct: 366 SRMALGLPADRVVIGAFNSQRKNSLLTMRLWAEVLKANPKALLWLM---IDGHGARQATA 422

Query: 103 ---QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
              + LG+ Q ++LF+   A   H+ R Q AD  +DT  CNGHTT+ D+LW G PV+T+ 
Sbjct: 423 AHFKTLGVKQSQLLFAPKMAYAAHLARAQAADFAIDTFPCNGHTTTSDMLWAGLPVMTMQ 482

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           G   ASRV+ S L  +G P+L+A   +++  +A  L  D
Sbjct: 483 GRNFASRVSESLLNAIGLPQLVASNEQDFVALATALIND 521


>gi|224010090|ref|XP_002294003.1| UDP-N-acetylglucosamine: polypeptide-n-acetylglucosaminyl
           transferase [Thalassiosira pseudonana CCMP1335]
 gi|220970675|gb|EED89012.1| UDP-N-acetylglucosamine: polypeptide-n-acetylglucosaminyl
           transferase [Thalassiosira pseudonana CCMP1335]
          Length = 420

 Score =  105 bits (261), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 92/156 (58%), Gaps = 17/156 (10%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ-------ATAQAL 105
           + VYCNFN+  K +P T++ W+ +L+ VPNSIL LL+ P  G  N++       A+ Q+ 
Sbjct: 218 SFVYCNFNKFLKHNPQTMRSWIRILQEVPNSILCLLENPIEGIPNLRKFVSEATASPQSD 277

Query: 106 GLDQHRILFSNVAAKEEHVRRGQ-LADVCLDTPLCNGHTTSMDVLWTGTPVVTLP-GETL 163
           G++ ++  F       +H RR   L +V LD+   NGHTT+ D L+ G P+VT   G+ +
Sbjct: 278 GIEGNQNPF-------DHQRRTHSLCNVMLDSHPYNGHTTAQDALYAGVPIVTRSDGDDM 330

Query: 164 ASRVAASQLATLGCPELIARTH-KEYQDIAIRLGTD 198
           +SRV+ S    LG  EL A     EY+DIAIRLG D
Sbjct: 331 SSRVSTSANVVLGLEELNAYNGINEYEDIAIRLGND 366


>gi|428186291|gb|EKX55141.1| hypothetical protein GUITHDRAFT_131413 [Guillardia theta CCMP2712]
          Length = 839

 Score =  104 bits (260), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 52/151 (34%), Positives = 91/151 (60%), Gaps = 5/151 (3%)

Query: 52  DAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQATAQALGLD 108
           +  V+C+F + YK+ P   + WV++L   PNSILWLL +P    +    ++  A+ +GLD
Sbjct: 625 ETFVFCSFAKSYKLHPRLFKTWVSILLRSPNSILWLLSYPDGSDLARPRLREHAERMGLD 684

Query: 109 QHRILFSNVAAKEEH-VRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
             R++ + +  ++ H + +G  A + LD+   NGH T+ + L  G PV+ LP    ASR+
Sbjct: 685 PARLVVTGLLERKRHLLLKGTFAHLLLDSFPYNGHLTTAEALAAGIPVLALPQTRPASRL 744

Query: 168 AASQLATLGC-PELIARTHKEYQDIAIRLGT 197
           ++S LA L   P L+AR  ++Y+++A+R+ +
Sbjct: 745 SSSMLAALQLHPRLVARDLEDYEELAVRIAS 775


>gi|254475750|ref|ZP_05089136.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           putative [Ruegeria sp. R11]
 gi|214029993|gb|EEB70828.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           putative [Ruegeria sp. R11]
          Length = 730

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 51/158 (32%), Positives = 83/158 (52%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+  GLP+D  V+C     + I P    +W+ +L  V +S+LW+  +P   +  ++  A
Sbjct: 496 SRRDCGLPDDGFVFCTLAPSHAITPREYDIWMRLLTKVGDSVLWMPDYPTKAQVALRKEA 555

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A G+   R++F       E + R ++AD+ LDT   N   T+ + L  G PV+++ G  
Sbjct: 556 EARGVSPERLIFGPACDGSERMARAKVADLFLDTFTLNAGPTAREALIAGLPVLSMAGRQ 615

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            A+R  AS L   G  EL+  + K Y+  A+ L  DRD
Sbjct: 616 FAARTTASLLTAAGLGELLTTSEKTYEARALELAEDRD 653


>gi|399991359|ref|YP_006571599.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
           [Phaeobacter gallaeciensis DSM 17395 = CIP 105210]
 gi|398655914|gb|AFO89880.1| putative UDP-N-acetylglucosamine-peptide
           N-acetylglucosaminyltransferase [Phaeobacter
           gallaeciensis DSM 17395 = CIP 105210]
          Length = 730

 Score =  103 bits (258), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 54/158 (34%), Positives = 83/158 (52%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+  GLP++A V+C+F   + I P    +W+ +L  +  S+LWL   P   +A ++  A
Sbjct: 496 SRRDCGLPDEAFVFCSFTAGHAITPREFDIWMRLLTKIDGSVLWLADHPEEAQAALRQAA 555

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
               +D  R++F+     EE + R  +AD+ LD+   N    + D L  G PV+T+ G  
Sbjct: 556 GDHRVDPDRVIFAAPCTGEERMARSMVADLFLDSFTLNAGPAARDALVAGLPVLTMAGRQ 615

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            A+R  AS LA  G  EL   T + Y+  A+ L  DRD
Sbjct: 616 FAARTTASLLAAAGMEELQTTTPQAYEARALELAGDRD 653


>gi|417861444|ref|ZP_12506499.1| hypothetical protein Agau_L101243 [Agrobacterium tumefaciens F2]
 gi|338821848|gb|EGP55817.1| hypothetical protein Agau_L101243 [Agrobacterium tumefaciens F2]
          Length = 685

 Score =  103 bits (256), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 53/158 (33%), Positives = 88/158 (55%), Gaps = 2/158 (1%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR + GLPEDA VY +FN + KI  +    W+ +L   P S+LWLL         ++  A
Sbjct: 424 SRAEVGLPEDAFVYASFNGMQKITENCFARWMTILSQTPGSLLWLLTGDDDVNQRLRDVA 483

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+   R++F+      +H+ R  +AD+ LDT     H+T+ D + +G PV+T+ G+T
Sbjct: 484 EKSGVAPERLVFAPKVQNPQHIARIGVADLFLDTFPYGAHSTAADAITSGLPVLTMSGKT 543

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            A+R   S +   G PE++  + ++Y  +A  +G +RD
Sbjct: 544 FAARFCGSIVTAAGVPEMLCSSPEDY--VARAIGFERD 579


>gi|222147875|ref|YP_002548832.1| hypothetical protein Avi_1160 [Agrobacterium vitis S4]
 gi|221734863|gb|ACM35826.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 630

 Score =  102 bits (255), Expect = 7e-20,   Method: Composition-based stats.
 Identities = 55/158 (34%), Positives = 83/158 (52%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+  GLP    V+ +FN   KID   +++W  +LK VP +IL +         N++   
Sbjct: 402 SRESVGLPSGRFVFGSFNGSQKIDRQAIRIWAQILKRVPEAILSISCARVAVADNLRVAF 461

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+D  R++F +     E + R    D+ LDT + NGHTT+ D LW G PV+T  G+ 
Sbjct: 462 AQQGIDAGRLIFFDNCPSAEFLARMSATDLVLDTFIYNGHTTTSDALWAGVPVLTKKGKA 521

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            A RV+ S L  +G PEL+A+   ++   A+ L    D
Sbjct: 522 FAGRVSESLLKAVGLPELVAQDADDFIARAVDLAEHPD 559


>gi|300120051|emb|CBK19605.2| unnamed protein product [Blastocystis hominis]
          Length = 318

 Score =  102 bits (254), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 53/158 (33%), Positives = 80/158 (50%), Gaps = 1/158 (0%)

Query: 14  IVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMW 73
           + L    A N  N           S  +  R  + +P++  +YC  NQLYKI    L  W
Sbjct: 150 VYLPYYYACNHYNKVPDLHPYQSDSATLYLRNTFKVPQEGTLYCFPNQLYKISSDLLDTW 209

Query: 74  VNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAA-KEEHVRRGQLADV 132
            N+L   P+S LWLL+ P  GE +I+    A G+ + R+LFS+    K  ++ R  + D+
Sbjct: 210 ANILHRHPSSYLWLLRHPLEGEESIKLELMARGVHKDRVLFSDFEDNKALYMVRTSICDL 269

Query: 133 CLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS 170
            LD+P+ +   T +D  W+G PVV LPG+    R+  S
Sbjct: 270 ILDSPVWSAGATGLDAYWSGVPVVNLPGDRTVERLGIS 307


>gi|221634584|ref|YP_002523272.1| O-linked acetylglucosamine transferase [Rhodobacter sphaeroides
           KD131]
 gi|221163457|gb|ACM04419.1| O-linked acetylglucosamine transferase [Rhodobacter sphaeroides
           KD131]
          Length = 598

 Score =  102 bits (253), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 82/155 (52%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   GLPE A V+   +  YKI       W  ++ AVP S+LWL++    G+A +    Q
Sbjct: 403 RPGEGLPEAAFVFACASHFYKITEPLFAAWCRIVAAVPGSVLWLVEDTPEGQAALAGRWQ 462

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A GLD HR++F+       +  R  LAD+ LDT   N  T + D L  G PV+TL G T 
Sbjct: 463 AAGLDPHRLIFAPRVDPARYRARLALADLFLDTMPYNAGTIASDALRMGLPVLTLAGRTF 522

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           + R+AAS LA +G  + IA   + Y   A+ + TD
Sbjct: 523 SGRMAASLLAAVGLEDCIAPDLEAYVARAVAIATD 557


>gi|397644930|gb|EJK76613.1| hypothetical protein THAOC_01614, partial [Thalassiosira oceanica]
          Length = 354

 Score =  102 bits (253), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/155 (39%), Positives = 86/155 (55%), Gaps = 7/155 (4%)

Query: 51  EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ-----ATAQAL 105
           E + VYCNFN+  K +P T++ W+ +L+ VPNSIL LL+FPA    N++     +     
Sbjct: 114 EVSFVYCNFNKFLKNNPKTMRSWLRILEEVPNSILCLLEFPAEAIPNLREFILDSVQNKS 173

Query: 106 GLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP-GETLA 164
             D+   +       +   R   L +V LD+   NGHTT+ D L+ G P+VT   G+ +A
Sbjct: 174 TSDRIHFISWEKNPFDHQARSSSLCNVMLDSHPYNGHTTAQDALYAGVPIVTRSDGKDMA 233

Query: 165 SRVAASQLATLGCPELIA-RTHKEYQDIAIRLGTD 198
           SRV+ S    LG   L A     EY++IAIRLGTD
Sbjct: 234 SRVSTSANVVLGLEALNAVGGTSEYEEIAIRLGTD 268


>gi|428177567|gb|EKX46446.1| hypothetical protein GUITHDRAFT_70511, partial [Guillardia theta
           CCMP2712]
          Length = 518

 Score =  102 bits (253), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 88/177 (49%), Gaps = 23/177 (12%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVL-------------------KAVPNSI 84
           R+Q GL EDA+V  NFN LYKI+P +  +W + +                   +  P  +
Sbjct: 323 RRQEGLEEDALVVANFNGLYKIEPESFHIWRDGILRCKERGRGGGRGGGGGASERTPCKV 382

Query: 85  -LWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
            LWL+        NI+  A  L      I+F+     + H+RR   ADV LD+   N HT
Sbjct: 383 QLWLVSEGEESNENIRKEAGPLL--SPLIVFAQRREFDAHIRRAAAADVFLDSFTYNAHT 440

Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLG-CPELIARTHKEYQDIAIRLGTDR 199
           T++D LW G  VVTLPGET+ SRVA   L   G   E + RT  EY ++  RL  DR
Sbjct: 441 TAVDALWGGIAVVTLPGETIVSRVATGVLGGFGPGSETVTRTTGEYAELIGRLLEDR 497


>gi|428178657|gb|EKX47531.1| hypothetical protein GUITHDRAFT_69463, partial [Guillardia theta
           CCMP2712]
          Length = 676

 Score =  102 bits (253), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 53/148 (35%), Positives = 85/148 (57%), Gaps = 4/148 (2%)

Query: 49  LPEDAI---VYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL 105
            P DAI   + CNFNQ YKI P   Q+W+++L  VP+S LWLL +   G+ ++   A+  
Sbjct: 493 FPADAIDSFILCNFNQFYKITPKVWQLWMHLLGQVPDSKLWLLSWNEAGKQSLLRYAKQS 552

Query: 106 GLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLAS 165
           G+   +++F+    ++ HV      D+ +DT     H T+ D L+ G P+++ PGE++AS
Sbjct: 553 GIPSDKLIFTTFFEEKWHVASKANCDLFVDTRPYAAHGTAADALFAGLPIISAPGESMAS 612

Query: 166 RVAASQLATLGCPEL-IARTHKEYQDIA 192
           RV+ S     G  +L IAR  ++Y  +A
Sbjct: 613 RVSLSLSLASGIGDLFIARNDQDYVRLA 640


>gi|88809832|ref|ZP_01125338.1| TPR domain protein [Synechococcus sp. WH 7805]
 gi|88786216|gb|EAR17377.1| TPR domain protein [Synechococcus sp. WH 7805]
          Length = 890

 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/154 (36%), Positives = 85/154 (55%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R   GLPEDA+V+  F++  KI P     W+ +L+ VP+++LW++    + E  ++  A
Sbjct: 696 TRNDIGLPEDAVVFACFHRAEKITPMRFHCWLEILQQVPDALLWIINDQPLVEERLRKKA 755

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +A GL   R++FS         +   LAD+ LDT   +   T++  L  G P+++ PGE 
Sbjct: 756 RAAGLGPQRLVFSPKLESALFSQACSLADLLLDTSPYSSGATAVTALAAGLPLLSCPGEN 815

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
            ASR+ AS  A  G  ELI  T + YQ  AI LG
Sbjct: 816 FASRMGASLCAATGLNELICSTPEAYQQKAIALG 849


>gi|196230691|ref|ZP_03129552.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
 gi|196225032|gb|EDY19541.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
          Length = 684

 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 57/145 (39%), Positives = 78/145 (53%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           I +   N L K++   L+ W  +L AV  S L LL         I    Q LG++ HR+ 
Sbjct: 493 ITFGALNNLTKVNEPLLESWARILGAVKGSRLLLLGSRGSHRQRITDFLQRLGVESHRVE 552

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F     +EE++R  Q  D+ LD+   NGHTTS+D LW G PVV+L G T  SR   S L+
Sbjct: 553 FLLPCRREEYLRYHQRIDLILDSFPYNGHTTSLDALWMGVPVVSLAGNTSVSRGGLSILS 612

Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
            L   EL+AR+  +Y  +A+ L TD
Sbjct: 613 NLALAELVARSADDYVRVAVDLATD 637


>gi|398376282|ref|ZP_10534464.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Rhizobium sp. AP16]
 gi|397727476|gb|EJK87900.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Rhizobium sp. AP16]
          Length = 691

 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/180 (32%), Positives = 93/180 (51%), Gaps = 3/180 (1%)

Query: 17  QNGLATNQTNTKTATGEEVPQSIVIT---SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMW 73
           +N +  ++   + A  + + +  VI    SR + GLPED  VY  FN + KI   T   W
Sbjct: 391 ENEIYFSEKVLRIACNQPIDRKRVIAERPSRAEAGLPEDVFVYACFNGMQKITQETFAHW 450

Query: 74  VNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVC 133
           + +L     S+LWLL      +  +   A+  G+   R++F+  A   +H+ R  +AD+ 
Sbjct: 451 MKILSGTTGSVLWLLTGGDDVDKRLLQLAENAGVAPERLIFAPKAPNAKHLARIAVADLF 510

Query: 134 LDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAI 193
           LDT     H+T+ D L  G PV+T PG++ A+R   S +A  G PELI    ++Y + AI
Sbjct: 511 LDTFPYGAHSTAGDALTMGLPVLTFPGKSFAARFCHSIVAAAGVPELICSNPEDYIERAI 570


>gi|222080835|ref|YP_002540198.1| glycosyltransferase domain-containing protein [Agrobacterium
           radiobacter K84]
 gi|221725514|gb|ACM28603.1| glycosyltransferase TPR domain protein [Agrobacterium radiobacter
           K84]
          Length = 704

 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/180 (32%), Positives = 93/180 (51%), Gaps = 3/180 (1%)

Query: 17  QNGLATNQTNTKTATGEEVPQSIVIT---SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMW 73
           +N +  ++   + A  + + +  VI    SR + GLPED  VY  FN + KI   T   W
Sbjct: 404 ENEIYFSEKVLRIACNQPIDRKRVIAERPSRAEAGLPEDVFVYACFNGMQKITQETFAHW 463

Query: 74  VNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVC 133
           + +L     S+LWLL      +  +   A+  G+   R++F+  A   +H+ R  +AD+ 
Sbjct: 464 MKILAGTTGSVLWLLTGGDDVDKRLLQLAENAGVAPERLIFAPKAPNAKHLARIAVADLF 523

Query: 134 LDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAI 193
           LDT     H+T+ D L  G PV+T PG++ A+R   S +A  G PELI    ++Y + AI
Sbjct: 524 LDTFPYGAHSTAGDALTMGLPVLTFPGKSFAARFCHSIVAAAGVPELICSNPEDYIERAI 583


>gi|428178004|gb|EKX46881.1| hypothetical protein GUITHDRAFT_107237 [Guillardia theta CCMP2712]
          Length = 902

 Score =  101 bits (252), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 56/190 (29%), Positives = 93/190 (48%), Gaps = 34/190 (17%)

Query: 24  QTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS 83
           Q+  K   G+  P+     +   +   +  +++ NFN L K+ P T  +W+ +L  VP+S
Sbjct: 651 QSQMKQVEGDFTPEDAAARNLPHH---QHQVIFANFNHLQKLGPGTFALWMKILGQVPSS 707

Query: 84  ILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
           +LW+L+FP   EA+++  A A  ++  RIL ++               + LDT   N H 
Sbjct: 708 LLWMLRFPREAEAHLKREAAASNINASRILMTD--------------KILLDTLEYNAHV 753

Query: 144 TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIART-----------------HK 186
           + +D LW G P++TL G  +A R  AS L + G   L+ART                  +
Sbjct: 754 SGIDALWIGLPLITLAGSNMARRCGASFLQSQGLDLLLARTQATSHVLHPRTCDGSAEQE 813

Query: 187 EYQDIAIRLG 196
           EY ++A++LG
Sbjct: 814 EYVEMAVKLG 823


>gi|302143604|emb|CBI22357.3| unnamed protein product [Vitis vinifera]
          Length = 351

 Score =  101 bits (251), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/98 (48%), Positives = 62/98 (63%)

Query: 82  NSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNG 141
           +S+LWLL+FPAVGE  +++   A GL   RI+F   A K EH++R  LAD+ LDTPLCN 
Sbjct: 123 SSVLWLLRFPAVGEMRLRSYTVAQGLQPDRIIFIVGAMKHEHIKRSALADLFLDTPLCNA 182

Query: 142 HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
            T   D+LW   P+VTL  E +A+RVA S     G  E
Sbjct: 183 QTIDTDILWASLPMVTLSFEKMATRVAGSLCLATGLDE 220


>gi|196229049|ref|ZP_03127915.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
 gi|196227330|gb|EDY21834.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
          Length = 725

 Score =  101 bits (251), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 56/145 (38%), Positives = 77/145 (53%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           + + + N   KI+   L++W  VL AV +S L LL  P      +    +A G+ + RI 
Sbjct: 533 VTFASLNNYTKINEPLLKLWARVLAAVADSRLLLLSPPGSHRERVLEIMRAEGVREQRIE 592

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F     + E++ R    DV LD    NGHTTS+D LW G PVV+  GE + SR   SQL+
Sbjct: 593 FVAEGPRREYLERYHRVDVMLDPFPYNGHTTSLDALWMGVPVVSRAGEAVVSRGGLSQLS 652

Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
            LG  EL+A + +EY  IA  L  D
Sbjct: 653 NLGLRELVAFSEEEYVKIAAGLAGD 677


>gi|72382664|ref|YP_292019.1| TPR repeat-containing protein [Prochlorococcus marinus str. NATL2A]
 gi|72002514|gb|AAZ58316.1| TPR repeat [Prochlorococcus marinus str. NATL2A]
          Length = 739

 Score =  101 bits (251), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 52/157 (33%), Positives = 84/157 (53%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR++ GLP D+ V+  FN+  KI      +W+ +LK + NS+LWL+K       N+ +  
Sbjct: 545 SREELGLPLDSFVFTCFNRTEKITRKEFDIWMRLLKKIDNSVLWLIKPHKAAIENLYSEL 604

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
               +++ RI+F+      +H+ R +  D+ LDT   N  TT    LW G P++TL G+T
Sbjct: 605 NKQSMNKERIVFAEFMNLNDHLSRHEYGDLFLDTFNYNAGTTGALSLWAGLPIITLAGKT 664

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            +SR +A  L  L    LI     +Y+ +A  L T++
Sbjct: 665 NSSRASAGFLNALDLNGLITYNEYDYESLAYELATNK 701


>gi|433679021|ref|ZP_20510807.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY [Xanthomonas
           translucens pv. translucens DSM 18974]
 gi|430815863|emb|CCP41351.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY [Xanthomonas
           translucens pv. translucens DSM 18974]
          Length = 504

 Score =  101 bits (251), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/156 (37%), Positives = 87/156 (55%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R+  GLPE   V+C FN  YK+ P ++   + VL+ VP+S+LWLL  P   +A ++  A
Sbjct: 302 ARRDCGLPEHGTVFCCFNNSYKLTPRSMTRMLAVLRQVPDSMLWLLSGPGQADARLREAA 361

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +   +D  R+ F    A  +++ R + AD+ LDT   N HTT+ D LW G P++T PG T
Sbjct: 362 RRADVDPARLRFMPKLAHADYLARYRHADLFLDTHPYNAHTTASDALWAGCPLLTCPGTT 421

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            A+RVA S    LG   +       +   A++LG D
Sbjct: 422 FAARVAGSLNHHLGMDAMNVADDAAFAAKAVQLGRD 457


>gi|428184949|gb|EKX53803.1| hypothetical protein GUITHDRAFT_100772 [Guillardia theta CCMP2712]
          Length = 653

 Score =  101 bits (251), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 88/157 (56%), Gaps = 2/157 (1%)

Query: 37  QSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA 96
           Q +   S +Q    ++A+V+ + N LYK+D      W+ V++ +P+S+L LL+      A
Sbjct: 426 QKLSSASVKQDKTSKEAVVFASLNNLYKLDERLFATWMQVMRKIPSSMLVLLQGREGMRA 485

Query: 97  N--IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
           N  +   A+  G+ + R++ +    KEEH+ R     + LDT   + H+T++D LW G P
Sbjct: 486 NARLAQYAERFGISKDRMITALPVKKEEHIARHSQFTLFLDTWKVSSHSTAVDSLWAGLP 545

Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDI 191
           VV LP E + +RV+A+ L  LG  E +AR   +Y DI
Sbjct: 546 VVVLPHEKMQARVSAALLYVLGMVEFLARDVGDYMDI 582


>gi|254526504|ref|ZP_05138556.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110kDa subunit
           [Prochlorococcus marinus str. MIT 9202]
 gi|221537928|gb|EEE40381.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase 110kDa subunit
           [Prochlorococcus marinus str. MIT 9202]
          Length = 701

 Score =  100 bits (250), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 60/155 (38%), Positives = 92/155 (59%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R+ + LPED ++Y  FN  +KI      +W+ +LK +  S LWLL    +   N++  A+
Sbjct: 503 RKDFDLPEDGVIYTCFNNGFKITEKEFNLWMRLLKEIKRSHLWLLSTNLLMIKNLKDEAK 562

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
              +D +RI F+     E+H+ R  L D+ LDT   N HTT+ D LW G PV+TL G++ 
Sbjct: 563 KRNVDPNRISFAKKLPLEQHLARHSLGDIFLDTFNYNAHTTASDALWAGLPVLTLKGKSF 622

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +SRV++S L +LG  ELI  + +EY + A+ LG +
Sbjct: 623 SSRVSSSLLNSLGLNELICLSEEEYFEKALDLGKN 657


>gi|428180751|gb|EKX49617.1| hypothetical protein GUITHDRAFT_135804 [Guillardia theta CCMP2712]
          Length = 782

 Score =  100 bits (250), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 55/164 (33%), Positives = 91/164 (55%), Gaps = 18/164 (10%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF-------PAVGEA---------- 96
           +V  NFN   K+D  TL++W  V+  VPNSILWL+          AVG            
Sbjct: 555 VVIANFNNWKKLDMRTLRVWAKVMGEVPNSILWLMSMLPYEGPSGAVGGGQEELDRCQTD 614

Query: 97  NIQATAQALGLDQHRILFSNVAAKEEH-VRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
           ++++  Q+LG+D  R++ + +   E H + + Q  DV LD+   N H+T+++++W   PV
Sbjct: 615 SLKSILQSLGVDASRVVITGLLPSETHLIAKQQATDVFLDSLAYNAHSTAVELIWANVPV 674

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           +T P   + +RVA+S L   G   L+AR  ++ +D+A+RL  +R
Sbjct: 675 LTCPELKMTARVASSLLLAHGMSALVAREARDMEDVAVRLLRER 718


>gi|78779585|ref|YP_397697.1| TPR repeat-containing protein [Prochlorococcus marinus str. MIT
           9312]
 gi|78713084|gb|ABB50261.1| TPR repeat [Prochlorococcus marinus str. MIT 9312]
          Length = 632

 Score =  100 bits (249), Expect = 4e-19,   Method: Composition-based stats.
 Identities = 54/141 (38%), Positives = 80/141 (56%), Gaps = 1/141 (0%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
           LP  +IV   F+  +K+    +  W+ VL    N+ LWL     + + N+ +  Q+  +D
Sbjct: 438 LPPKSIVLAAFHMAFKLSEEVVNSWIKVLNQTENTYLWLKISNKLAKKNLISHFQSKNVD 497

Query: 109 QHRILFS-NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
             RILF+  V    EH+ R   AD+ LDT   NGH+T ++ +W+  P +TL GE+ ASRV
Sbjct: 498 LKRILFAEKVDLYTEHISRYSKADLFLDTFNFNGHSTLVECIWSELPFITLAGESFASRV 557

Query: 168 AASQLATLGCPELIARTHKEY 188
            AS L +LG  ELIA++  EY
Sbjct: 558 GASILHSLGLSELIAKSTDEY 578


>gi|390575516|ref|ZP_10255611.1| TPR repeat-containing protein [Burkholderia terrae BS001]
 gi|389932565|gb|EIM94598.1| TPR repeat-containing protein [Burkholderia terrae BS001]
          Length = 495

 Score =  100 bits (249), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 61/157 (38%), Positives = 88/157 (56%), Gaps = 11/157 (7%)

Query: 49  LPED---AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS--ILWLLKFPAVGEAN--IQAT 101
           LP D    + +   N   K+    + +W NV+  VP++  ILW+    A G+A   + A 
Sbjct: 298 LPADEAGTVTFGCLNNPAKLTDRAIALWANVIANVPDARMILWV----AEGDARERVSAK 353

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
            +A G+D+ R+ F ++ A+E+++R     DV LDT  CNGHTTS+D LW G PVVT  G 
Sbjct: 354 FEANGVDRSRLQFVDMQAREDYLRTYHRIDVALDTFPCNGHTTSLDALWMGVPVVTRVGS 413

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           T  SR   + LA LG PEL A + + +  IA +L  D
Sbjct: 414 TPISRGGYALLANLGLPELAAESDEAFVKIATQLALD 450


>gi|196232944|ref|ZP_03131793.1| TPR domain protein [Chthoniobacter flavus Ellin428]
 gi|196222922|gb|EDY17443.1| TPR domain protein [Chthoniobacter flavus Ellin428]
          Length = 178

 Score = 99.8 bits (247), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 54/129 (41%), Positives = 73/129 (56%)

Query: 70  LQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQL 129
           L++WV VL+ VP S L LL+         +   +  G+D  R+ F  V A+ E+++    
Sbjct: 2   LKIWVRVLQNVPGSRLLLLRGGGNYAERTRQFFERYGIDAGRVEFLGVRARGEYLKNYHR 61

Query: 130 ADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQ 189
            D+ LDT   NGHTT++D LW G PVV+L GET  SR   SQL  LG  EL+A +   Y 
Sbjct: 62  VDMVLDTFPYNGHTTNLDALWMGVPVVSLAGETAVSRGGLSQLNNLGLTELVAHSEDAYL 121

Query: 190 DIAIRLGTD 198
           +IA RL  D
Sbjct: 122 EIATRLAGD 130


>gi|126464807|ref|YP_001041783.1| TPR repeat-containing protein [Rhodobacter sphaeroides ATCC 17029]
 gi|126106622|gb|ABN79147.1| Tetratricopeptide TPR_2 repeat protein [Rhodobacter sphaeroides
           ATCC 17029]
          Length = 590

 Score = 99.8 bits (247), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 80/155 (51%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   GLPE A V+   +  YKI       W  ++ AVP S+LWL+     G+A +    Q
Sbjct: 395 RPGEGLPEAAFVFACASHFYKITEPLFAAWCRIVAAVPGSVLWLVADTPEGQAALAGRWQ 454

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A GLD HR++F+       +  R  LAD+ LDT   N  T + D L  G PV+TL G T 
Sbjct: 455 AAGLDPHRLIFAPRVDPARYRARLALADLFLDTMPYNAGTIASDALRMGLPVLTLAGRTF 514

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           + R+AAS L  +G  + IA   + Y   A+ + TD
Sbjct: 515 SGRMAASLLTAVGLEDCIAPDLEAYVARAVAIATD 549


>gi|428186635|gb|EKX55485.1| hypothetical protein GUITHDRAFT_99261 [Guillardia theta CCMP2712]
          Length = 656

 Score = 99.4 bits (246), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 49/150 (32%), Positives = 88/150 (58%), Gaps = 1/150 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI-QAT 101
           +R+  GLP D+ +  NFN L K++P   ++++ +L+    + LWL ++P      I Q  
Sbjct: 415 ARKSIGLPADSFLLANFNNLEKLEPRLWRLFLLLLRRHRGAHLWLQQYPPAKANRIRQMF 474

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
            +  G+++ R++ +     ++H+    LAD+ +D+P  N H+T+MD LW   P+ TLP E
Sbjct: 475 KEEEGMEEARLVTTKFFKSDDHIGFKALADIFVDSPSYNAHSTAMDTLWGALPLATLPEE 534

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDI 191
            +A+RV AS L  + C   IAR+ ++ Q++
Sbjct: 535 KMAARVGASLLLAINCSSTIARSMEDLQEL 564


>gi|425472708|ref|ZP_18851549.1| hypothetical protein MICAK_3940006 [Microcystis aeruginosa PCC
           9701]
 gi|389881186|emb|CCI38246.1| hypothetical protein MICAK_3940006 [Microcystis aeruginosa PCC
           9701]
          Length = 268

 Score = 99.4 bits (246), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 58/173 (33%), Positives = 88/173 (50%), Gaps = 9/173 (5%)

Query: 34  EVPQSIVIT---------SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSI 84
           E+P   V T         SR + GLP++  VY  FN+  K  P     W+ +L+AVPN++
Sbjct: 49  ELPHVFVTTPLSFSEPPPSRSELGLPDEGFVYACFNRTDKWSPELFACWLEILQAVPNAV 108

Query: 85  LWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTT 144
           LWL +        ++A A+  G+D  R++F    +  E +   + AD+ LDT L NG  T
Sbjct: 109 LWLAESSEDISQTLRAKAKTAGVDPERLVFLVQRSPWEFISVCRQADLFLDTFLYNGGAT 168

Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           S+  +  G P++T PG+T ASR+  S     G    +  + + YQ  AI  GT
Sbjct: 169 SVCAIQAGVPLLTCPGDTFASRMGMSICHAAGLESFVCESRESYQVQAIYWGT 221


>gi|332561307|ref|ZP_08415622.1| O-linked acetylglucosamine transferase [Rhodobacter sphaeroides
           WS8N]
 gi|332274106|gb|EGJ19423.1| O-linked acetylglucosamine transferase [Rhodobacter sphaeroides
           WS8N]
          Length = 590

 Score = 99.4 bits (246), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 80/155 (51%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   GLPE A V+   +  YKI       W  ++ AVP S+LWL+     G+A +    Q
Sbjct: 395 RPGEGLPEAAFVFACASHFYKITEPLFAAWCRIVAAVPGSVLWLVADTPEGQAALAGRWQ 454

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A GLD HR++F+       +  R  LAD+ LDT   N  T + D L  G PV+TL G T 
Sbjct: 455 AAGLDPHRLIFAPRVDPARYRARLALADLFLDTMPYNAGTIASDALRMGLPVLTLAGRTF 514

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           + R+AAS L  +G  + IA   + Y   A+ + TD
Sbjct: 515 SGRMAASLLTAVGLEDCIAPDLEAYVARAVAIATD 549


>gi|294677605|ref|YP_003578220.1| hypothetical protein [Rhodobacter capsulatus SB 1003]
 gi|3128311|gb|AAC16163.1| hypothetical protein [Rhodobacter capsulatus SB 1003]
 gi|294476425|gb|ADE85813.1| TPR repeat domain protein [Rhodobacter capsulatus SB 1003]
          Length = 617

 Score = 99.4 bits (246), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 56/164 (34%), Positives = 86/164 (52%), Gaps = 1/164 (0%)

Query: 35  VPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
           +PQ   +T R + GLPEDA+V+   +  YK+  +    W  ++  VP S+LW++      
Sbjct: 414 MPQLPAVT-RAEEGLPEDAVVFTCVSHHYKLTEAVWGAWCRIVARVPGSVLWIIDDNPES 472

Query: 95  EANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
            A + A   A GL   R++F+       +  R  LAD+ LDT   N  T + D L  G P
Sbjct: 473 RAALTARWGAAGLAPERLIFAARTDPARYRARLALADLFLDTTPYNAGTIASDALRMGLP 532

Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           ++T  G   A+R+ AS L  +G P+ IA     Y+D+A+ +GTD
Sbjct: 533 LITTRGRAFAARMGASLLTAIGLPDCIAEDLAGYEDLAVAIGTD 576


>gi|420241336|ref|ZP_14745477.1| hypothetical protein PMI07_03260 [Rhizobium sp. CF080]
 gi|398071470|gb|EJL62724.1| hypothetical protein PMI07_03260 [Rhizobium sp. CF080]
          Length = 646

 Score = 99.0 bits (245), Expect = 9e-19,   Method: Composition-based stats.
 Identities = 59/174 (33%), Positives = 94/174 (54%), Gaps = 7/174 (4%)

Query: 30  ATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK 89
           A    +PQ I    R    +P+DA ++  F+  +K  P+T+ +WV VLK  P S L+L+ 
Sbjct: 412 AVHRPMPQPI---DRSLCDMPQDAFIFSCFHSHWKFTPATIDLWVRVLKETPESYLFLIC 468

Query: 90  FPAVGEAN--IQATAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNGHTTSM 146
               G     I+A  +A G+ + R++F + V     ++ R  L D+ LDT   NGHTT+ 
Sbjct: 469 RERFGARKNLIKAFVEA-GIPESRLIFGDRVNNYAAYINRIGLTDLGLDTYPYNGHTTTS 527

Query: 147 DVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           + LW G P++T  G+  ASRV+ S L  LG PE++A    ++   A+    +R+
Sbjct: 528 EKLWAGLPMLTHKGKNFASRVSESLLNALGLPEMVAEDQDDFVRRAVHYYNNRE 581


>gi|196232370|ref|ZP_03131223.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
 gi|196223442|gb|EDY17959.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
          Length = 779

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 55/145 (37%), Positives = 75/145 (51%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           I + + N   K++   L++W  VL AV NS L +L                 G++  RI 
Sbjct: 588 ITFGSLNNFCKVNEPLLKLWARVLTAVNNSRLIILAHEGSHRQRTSEFFTQEGVEPERIE 647

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F     ++E++      D+ LDT   NGHTTS+D LW G PVV+L GE   SR   SQL 
Sbjct: 648 FVTQRPRKEYLELYHRLDIVLDTFPYNGHTTSLDALWMGVPVVSLCGERPVSRAGLSQLN 707

Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
            LG PEL+A T  +Y +IA +L  D
Sbjct: 708 NLGLPELVAFTEDQYVEIATKLAND 732


>gi|397576620|gb|EJK50333.1| hypothetical protein THAOC_30710 [Thalassiosira oceanica]
          Length = 747

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 69/213 (32%), Positives = 101/213 (47%), Gaps = 41/213 (19%)

Query: 28  KTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVL---KAVPNS- 83
           +TA  + V      + R +YG+ + A VYC  ++  KIDP T + W+  L   K   NS 
Sbjct: 465 RTAVAQNVSDDDRRSIRLKYGISQTAFVYCCHSRPDKIDPVTFRSWIRALIRAKDEGNSE 524

Query: 84  -ILWLLKFPAVGEANIQATAQALGLDQHR---ILFSNVAAKEEHVRRGQLADVCLDTPLC 139
            +LWLL+     EAN++ T       QH    ++F+++A +EEH+RR   AD+ LDTP  
Sbjct: 525 PVLWLLRSGKEMEANLR-TFLTREFGQHLDSLLVFADIADREEHLRRLACADLFLDTPAY 583

Query: 140 NGHTTSMDVLWTGTPVVTL--------------------------------PGETLASRV 167
             HT   D L++  P+VTL                                  + LASRV
Sbjct: 584 GAHTVGCDALYSNVPMVTLLRSANDPTKHSFADEDYEPMHSSGSNPERRSITTDKLASRV 643

Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            AS L  +G P LIA   ++Y+D+ +R   D D
Sbjct: 644 GASLLRAVGLPMLIAEDMRQYEDLMVRCAHDID 676


>gi|196232942|ref|ZP_03131791.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
 gi|196222920|gb|EDY17441.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
          Length = 743

 Score = 98.6 bits (244), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 56/143 (39%), Positives = 74/143 (51%)

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFS 115
           + + N   K++  TL +W  VL  VPN+ L LL                 G+   RI F+
Sbjct: 553 FGSLNNFCKVNVETLVLWARVLNHVPNARLVLLCEEGSQRQRTWDLFARHGISAERISFA 612

Query: 116 NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATL 175
              A+ ++++     D+ LDT   NGHTTS+D LW G PVV+L GE   SR   SQL  L
Sbjct: 613 EPRARTDYLKLYYDVDIVLDTFPYNGHTTSLDALWMGVPVVSLVGEVAVSRAGLSQLNNL 672

Query: 176 GCPELIARTHKEYQDIAIRLGTD 198
           G PE +A T  EY +IA RL  D
Sbjct: 673 GLPEWVAHTEDEYVEIATRLAGD 695


>gi|124024786|ref|YP_001013902.1| hypothetical protein NATL1_00731 [Prochlorococcus marinus str.
           NATL1A]
 gi|123959854|gb|ABM74637.1| Hypothetical protein NATL1_00731 [Prochlorococcus marinus str.
           NATL1A]
          Length = 816

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 49/155 (31%), Positives = 85/155 (54%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R+ + LP+   ++  FN+ YKI      +W+N+L  +  S+LWL K       N+   A+
Sbjct: 623 RKDFNLPDQGFIFTCFNENYKITKKEFNIWMNLLIKIEGSVLWLYKSNQCSMNNLYKEAR 682

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
              ++  R++F+   A  +H+ R  L D+ LDT   NG  T+   L  G PV+T  G++ 
Sbjct: 683 KRKVNPDRLIFAERLAMNKHLPRHSLGDLALDTFNYNGGATTSCALLAGLPVLTKIGQSF 742

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            +RV+AS L+++G  ELI  +  EY++ A+ +  +
Sbjct: 743 MARVSASLLSSIGLSELITYSESEYEEKALYIANN 777


>gi|403336494|gb|EJY67446.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           family GT41 [Oxytricha trifallax]
          Length = 559

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 57/160 (35%), Positives = 90/160 (56%), Gaps = 6/160 (3%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+Q G+P +A V+  FNQ ++ID  +L+ W  + + VP SIL  +     G+  I A A
Sbjct: 361 SRRQMGIPRNAFVFGIFNQYFQIDSLSLETWFKIKEQVPGSILLFILVNKEGQQAIVAKA 420

Query: 103 QALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           + L +D + + F +   K + +V R   AD+ LDT   N    + D+LW+G P++TL  +
Sbjct: 421 RQLEIDLNEVYFLSQDIKSQVNVNRMFTADLALDTLQINARIVNTDLLWSGVPLLTLKSD 480

Query: 162 TLASRVAASQLATLGCPE-----LIARTHKEYQDIAIRLG 196
           + A+RV AS L+ LG  E     L+A T ++Y   A+ L 
Sbjct: 481 SWANRVGASMLSGLGLSEQQYDSLVATTVQQYIQKAVDLA 520


>gi|428184152|gb|EKX53008.1| hypothetical protein GUITHDRAFT_84548 [Guillardia theta CCMP2712]
          Length = 346

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/147 (36%), Positives = 81/147 (55%), Gaps = 3/147 (2%)

Query: 52  DAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHR 111
           D++V  N N  YKIDP + + W+++L  V  S+LW+  +  +GE N+   A  +G D  R
Sbjct: 143 DSLVIGNLNAFYKIDPESFRAWLDILARVNQSLLWIQSWNDLGEQNLVMNADIMGGDPSR 202

Query: 112 ILFSNVAAKEEHVRRGQ---LADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
           + FS       +    Q   +ADV LDT L N  TT+ D+ W G P V+LP E L SR+ 
Sbjct: 203 MCFSKPPPALYNRLWNQHSVIADVHLDTFLYNTVTTATDLFWLGVPSVSLPSERLCSRIG 262

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRL 195
           AS     G   L AR  ++Y+ +A+++
Sbjct: 263 ASFAHAAGSECLTARNLEDYKQVAVQI 289


>gi|403362723|gb|EJY81094.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           family GT41 [Oxytricha trifallax]
          Length = 598

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 57/160 (35%), Positives = 90/160 (56%), Gaps = 6/160 (3%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+Q G+P +A V+  FNQ ++ID  +L+ W  + + VP SIL  +     G+  I A A
Sbjct: 359 SRRQMGIPRNAFVFGIFNQYFQIDSLSLETWFKIKEQVPGSILLFILVNKEGQQAIVAKA 418

Query: 103 QALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           + L +D + + F +   K + +V R   AD+ LDT   N    + D+LW+G P++TL  +
Sbjct: 419 RQLEIDLNEVYFLSQDIKSQVNVNRMFTADLALDTLQINARIVNTDLLWSGVPLLTLKSD 478

Query: 162 TLASRVAASQLATLGCPE-----LIARTHKEYQDIAIRLG 196
           + A+RV AS L+ LG  E     L+A T ++Y   A+ L 
Sbjct: 479 SWANRVGASMLSGLGLSEQQYDSLVATTVQQYIQKAVDLA 518


>gi|125654622|ref|YP_001033816.1| O-linked acetylglucosamine transferase [Rhodobacter sphaeroides
           2.4.1]
 gi|77386282|gb|ABA81711.1| predicted O-linked acetylglucosamine transferase [Rhodobacter
           sphaeroides 2.4.1]
          Length = 590

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 57/155 (36%), Positives = 80/155 (51%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   GLPE A V+   +  YKI       W  ++ AVP S+LWL+     G+A +    Q
Sbjct: 395 RPGEGLPEAAFVFACASHFYKITEPLFAAWCRIVAAVPGSVLWLVADTPEGQAALAGRWQ 454

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A GLD HR++F+       +  R  LAD+ LDT   N  T + D L  G P++TL G T 
Sbjct: 455 AAGLDPHRLIFAPRVDPARYRARLALADLFLDTMPYNAGTIASDALRMGLPLLTLAGRTF 514

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           + R+AAS L  +G  + IA   + Y   A+ + TD
Sbjct: 515 SGRMAASLLTAVGLEDCIAPDLEAYVARAVAIATD 549


>gi|403342452|gb|EJY70546.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           family GT41 [Oxytricha trifallax]
          Length = 598

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 57/160 (35%), Positives = 90/160 (56%), Gaps = 6/160 (3%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+Q G+P +A V+  FNQ ++ID  +L+ W  + + VP SIL  +     G+  I A A
Sbjct: 359 SRRQMGIPRNAFVFGIFNQYFQIDSLSLETWFKIKEQVPGSILLFVLVNKEGQQAIVAKA 418

Query: 103 QALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           + L +D + + F +   K + +V R   AD+ LDT   N    + D+LW+G P++TL  +
Sbjct: 419 KQLEIDLNEVYFLSQDIKSQVNVNRMFTADLALDTLQVNARIVNTDLLWSGVPLLTLKSD 478

Query: 162 TLASRVAASQLATLGCPE-----LIARTHKEYQDIAIRLG 196
           + A+RV AS L+ LG  E     L+A T ++Y   A+ L 
Sbjct: 479 SWANRVGASMLSGLGLSEQQYDSLVATTVQQYIQKAVDLA 518


>gi|443475508|ref|ZP_21065455.1| hypothetical protein Pse7429DRAFT_1126 [Pseudanabaena biceps PCC
           7429]
 gi|443019627|gb|ELS33688.1| hypothetical protein Pse7429DRAFT_1126 [Pseudanabaena biceps PCC
           7429]
          Length = 736

 Score = 97.8 bits (242), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/161 (38%), Positives = 90/161 (55%), Gaps = 9/161 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
           R Q  +PEDA++Y    + YK  P  +++ + +LK VPNS L L+K   V +AN      
Sbjct: 536 RSQLDIPEDAVIYFTGQRGYKRHPDNIRLQMKILKNVPNSYL-LIK--GVSDANSTEELF 592

Query: 99  QATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
              A++ G+D  R+ F  +   E EH     +ADV LDT   NG TT+++ LW G P+VT
Sbjct: 593 FRLAESEGVDCDRLKFLPLTMSELEHRANLTIADVVLDTYPYNGATTTLETLWMGIPIVT 652

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             GE  ASR + + +   G  E IA T +EY +  ++LGTD
Sbjct: 653 RVGEQFASRNSYTMMLNAGLVEGIAWTDEEYVEWGVKLGTD 693


>gi|209526542|ref|ZP_03275068.1| TPR repeat-containing protein [Arthrospira maxima CS-328]
 gi|209493048|gb|EDZ93377.1| TPR repeat-containing protein [Arthrospira maxima CS-328]
          Length = 728

 Score = 97.8 bits (242), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 64/163 (39%), Positives = 88/163 (53%), Gaps = 9/163 (5%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI--- 98
           T RQ Y +  D +VY       K +P  ++  VN+LK VP+SIL      A+G+  I   
Sbjct: 526 TLRQAYRISRDQVVYLCVAPGRKFNPDLVKAQVNILKQVPDSIL---VHKALGDTQIIRE 582

Query: 99  --QATAQALGLDQHRIL-FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
              A  +A+G+ QHRI   S  A +EEH +   LADV LD+   NG T S++ LW  TP+
Sbjct: 583 TYAAACKAIGVGQHRIKQISRFATEEEHRKIYMLADVLLDSYPYNGGTHSLEALWFNTPL 642

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           VT  G    SR+  S L  +G    IA + +EYQ+  IRLG +
Sbjct: 643 VTRRGSQFLSRMGYSFLKGVGVEMGIADSWEEYQNCGIRLGIE 685


>gi|196233636|ref|ZP_03132477.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
 gi|196222306|gb|EDY16835.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
          Length = 752

 Score = 97.8 bits (242), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 58/157 (36%), Positives = 81/157 (51%), Gaps = 4/157 (2%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           +S +Q G+    + + + N   K++   LQ+W  VL+ V  S L LL             
Sbjct: 555 SSAKQNGI----VTFGSLNNFCKVNDPMLQLWARVLRGVKGSRLVLLSHAGTHRQRTLDI 610

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
               G+D  RI F+    + +++   +  DV LD    NGHTTS+D LW G PVVTL G+
Sbjct: 611 LTEAGIDPSRIEFATHRPRPKYLELYRHLDVVLDPFPYNGHTTSLDALWMGVPVVTLAGK 670

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           T  SR   SQL+ LG PEL+A T  +Y   AI+L  D
Sbjct: 671 TAVSRAGLSQLSNLGLPELVAFTGDDYVAKAIQLAHD 707


>gi|376002124|ref|ZP_09779971.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375329510|emb|CCE15724.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 728

 Score = 97.4 bits (241), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 64/163 (39%), Positives = 88/163 (53%), Gaps = 9/163 (5%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI--- 98
           T RQ Y +  D +VY       K +P  ++  VN+LK VP+SIL      A+G+  I   
Sbjct: 526 TLRQAYRISRDQVVYLCVAPGRKFNPDLVKAQVNILKQVPDSIL---VHKALGDTQIIRE 582

Query: 99  --QATAQALGLDQHRIL-FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
              A  +A+G+ QHRI   S  A +EEH +   LADV LD+   NG T S++ LW  TP+
Sbjct: 583 TYAAACKAIGVGQHRIKQISRFATEEEHRKIYMLADVLLDSYPYNGGTHSLEALWFNTPL 642

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           VT  G    SR+  S L  +G    IA + +EYQ+  IRLG +
Sbjct: 643 VTRRGSQFLSRMGYSFLKGVGVEMGIADSWEEYQNWGIRLGIE 685


>gi|158521101|ref|YP_001528971.1| hypothetical protein Dole_1087 [Desulfococcus oleovorans Hxd3]
 gi|158509927|gb|ABW66894.1| Tetratricopeptide TPR_2 repeat protein [Desulfococcus oleovorans
           Hxd3]
          Length = 808

 Score = 97.4 bits (241), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 55/154 (35%), Positives = 80/154 (51%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           +R   GLPE   V+C FN +YK++P T   W+ +L+   N++LWL          +   A
Sbjct: 597 TRSDAGLPEAGFVFCCFNAVYKMEPETFGAWMEILREADNAVLWLSAASDAIRQRLTDRA 656

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+D  R++F+      E++ R QLAD+ LDT L    +T++  L  G PV+T  G T
Sbjct: 657 VAHGVDAGRLVFAEKLPHPEYMARYQLADLFLDTFLYTAGSTAVCALHGGVPVLTRTGPT 716

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
            ASR+ AS  A  G  E I    + Y   A+ L 
Sbjct: 717 NASRMGASICAAAGMEETICPDTEAYIQRAVHLA 750


>gi|222150113|ref|YP_002551070.1| glycosyltransferase TPR domain-containing protein [Agrobacterium
           vitis S4]
 gi|221737095|gb|ACM38058.1| glycosyltransferase TPR domain-containing protein [Agrobacterium
           vitis S4]
          Length = 594

 Score = 97.4 bits (241), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 57/157 (36%), Positives = 80/157 (50%), Gaps = 1/157 (0%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SRQQ+ LPED +V+  F+   +I P T  +WVN+L+   NS+LW+         N     
Sbjct: 385 SRQQFDLPEDGVVFAAFHPTEQISPRTADLWVNILRQTENSLLWI-SCGNYARNNFSTWM 443

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
              G+ ++R++F N A   E V    LAD+ LD+   N   +++  L  G PV T  G  
Sbjct: 444 NRQGIAENRLVFRNPARPMEQVGWMTLADIALDSFPYNDGFSTLAALQAGLPVPTFSGGN 503

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            ASRV AS L   G  +LIAR    Y  + I L  D+
Sbjct: 504 FASRVTASLLQACGLEKLIARDADAYVALCIDLARDK 540


>gi|423066382|ref|ZP_17055172.1| TPR repeat-containing protein [Arthrospira platensis C1]
 gi|406712134|gb|EKD07324.1| TPR repeat-containing protein [Arthrospira platensis C1]
          Length = 728

 Score = 97.4 bits (241), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 64/163 (39%), Positives = 88/163 (53%), Gaps = 9/163 (5%)

Query: 42  TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI--- 98
           T RQ Y +  D +VY       K +P  ++  VN+LK VP+SIL      A+G+  I   
Sbjct: 526 TLRQAYRISRDQVVYLCVAPGRKFNPDLVKAQVNILKQVPDSIL---VHKALGDTQIIRE 582

Query: 99  --QATAQALGLDQHRIL-FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
              A  +A+G+ QHRI   S  A +EEH +   LADV LD+   NG T S++ LW  TP+
Sbjct: 583 TYAAACKAIGVGQHRIKQISRFATEEEHRKIYMLADVLLDSYPYNGGTHSLEALWFNTPL 642

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           VT  G    SR+  S L  +G    IA + +EYQ+  IRLG +
Sbjct: 643 VTRRGSQFLSRMGYSFLKGVGVEMGIADSWEEYQNWGIRLGIE 685


>gi|429208817|ref|ZP_19200061.1| hypothetical protein D516_2259 [Rhodobacter sp. AKP1]
 gi|428188242|gb|EKX56810.1| hypothetical protein D516_2259 [Rhodobacter sp. AKP1]
          Length = 590

 Score = 97.1 bits (240), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 57/155 (36%), Positives = 80/155 (51%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   GLPE A V+   +  YKI       W  ++ AVP S+LWL++    G+A +    Q
Sbjct: 395 RPGEGLPEAAFVFACASHFYKITEPLFAAWCRIVAAVPGSVLWLVEDTPEGQAALAGRWQ 454

Query: 104 ALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           A  LD HR++F+       +  R  LAD+ LDT   N  T + D L  G PV+TL G T 
Sbjct: 455 AARLDPHRLIFAPRVDPARYRARLALADLFLDTMPYNAGTIASDALRMGLPVLTLAGRTF 514

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           + R+AAS L  +G  + IA   + Y   A+ + TD
Sbjct: 515 SGRMAASLLTAVGLEDCIAPDLEAYVARAVAIATD 549


>gi|428186472|gb|EKX55322.1| hypothetical protein GUITHDRAFT_131550 [Guillardia theta CCMP2712]
          Length = 864

 Score = 96.7 bits (239), Expect = 4e-18,   Method: Composition-based stats.
 Identities = 57/145 (39%), Positives = 82/145 (56%), Gaps = 1/145 (0%)

Query: 51  EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT-AQALGLDQ 109
           E  +V  NF Q+YKIDPS   +W ++L+   ++ILWLL+FP V    I+        L  
Sbjct: 638 ERKMVLANFGQMYKIDPSLFLLWCDILREDADTILWLLEFPPVASKRIRRRFLSTCRLPP 697

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R++FS +     HV    LAD+ LDT L NGH+T +D LW G P+VT+PG+ +  R  A
Sbjct: 698 SRLVFSAMLPLHLHVPIKSLADLALDTLLFNGHSTGLDTLWAGVPIVTVPGDRMHRRAGA 757

Query: 170 SQLATLGCPELIARTHKEYQDIAIR 194
           S     G    +AR  ++Y ++A R
Sbjct: 758 SMAHAAGATVWLARNERDYLELARR 782


>gi|428775921|ref|YP_007167708.1| hypothetical protein PCC7418_1295 [Halothece sp. PCC 7418]
 gi|428690200|gb|AFZ43494.1| TPR repeat-containing protein [Halothece sp. PCC 7418]
          Length = 736

 Score = 96.7 bits (239), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 61/163 (37%), Positives = 97/163 (59%), Gaps = 9/163 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA--- 100
           R+Q  +P++A+VY +     K  P T+++ + +L  VPNS   L+K  A  EA++QA   
Sbjct: 537 REQLNIPQNAVVYFSAQAGKKRHPETVRLQMKILAQVPNSYF-LIKGLA-DEASVQAFFT 594

Query: 101 -TAQALGLDQHRILFSNVAAKEEHVRRGQLA--DVCLDTPLCNGHTTSMDVLWTGTPVVT 157
             AQ  G+D +R+ F    AKEE + R  LA  DV LDT   NG TT+++ LW   P+VT
Sbjct: 595 HIAQEEGVDSNRLRFLPYVAKEE-IHRANLAIADVVLDTYPYNGATTTLETLWMAVPLVT 653

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             GE  ++R + + + ++G  E IA + +EY +  ++LGTD++
Sbjct: 654 RVGEQFSARNSYTFMKSVGVSEGIAWSDEEYIEWGVKLGTDQE 696


>gi|409991408|ref|ZP_11274672.1| hypothetical protein APPUASWS_10241 [Arthrospira platensis str.
           Paraca]
 gi|291571117|dbj|BAI93389.1| hypothetical protein [Arthrospira platensis NIES-39]
 gi|409937730|gb|EKN79130.1| hypothetical protein APPUASWS_10241 [Arthrospira platensis str.
           Paraca]
          Length = 730

 Score = 96.3 bits (238), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 63/161 (39%), Positives = 88/161 (54%), Gaps = 9/161 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI----- 98
           RQ Y +  D +VY       K +P  ++  VN+LK VP+SIL      A+G+  I     
Sbjct: 528 RQAYRISRDQVVYLCVAPGRKFNPDLVKAQVNILKQVPDSIL---VHKALGDTQIIRETY 584

Query: 99  QATAQALGLDQHRIL-FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
            A  +A+G+ QHRI   S  A +EEH +   LADV LD+   NG T S++ LW  TP+VT
Sbjct: 585 AAACKAIGVGQHRIKQISRFATEEEHRKIYILADVLLDSYPYNGGTHSLEALWFNTPLVT 644

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             G    SR+  S L  +G    IA + ++YQ+ AIRLG +
Sbjct: 645 RRGSQFLSRMGYSFLKAVGVEMGIADSWEDYQNWAIRLGRE 685


>gi|428165389|gb|EKX34385.1| hypothetical protein GUITHDRAFT_147284 [Guillardia theta CCMP2712]
          Length = 595

 Score = 96.3 bits (238), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 52/160 (32%), Positives = 86/160 (53%), Gaps = 18/160 (11%)

Query: 58  NFNQLYKIDPSTLQMWVNVL----------------KAVPNSILWLLKF--PAVGEANIQ 99
           N+NQ +KID   L+ W+++L                K +  S+L L+KF      E N++
Sbjct: 368 NYNQFFKIDKHILRAWIDILRRDEKVLKLFSSFWPLKGLSQSVLVLVKFLFHKQAERNLR 427

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           A  +A G++  R++ +     +EH+ R QL  + LDT L +GHTT++D LW+G PV+  P
Sbjct: 428 AALRANGINATRLILAEKLKNQEHILRSQLMHLHLDTRLQSGHTTTVDALWSGIPVLVWP 487

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
             ++ SR AA  +   G   +IAR  ++Y + A+ L   +
Sbjct: 488 ARSMVSRAAAGIVHGAGLSWMIARNGQDYVEAAMVLSRSK 527


>gi|403346239|gb|EJY72509.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           family GT41 [Oxytricha trifallax]
          Length = 641

 Score = 96.3 bits (238), Expect = 6e-18,   Method: Composition-based stats.
 Identities = 56/160 (35%), Positives = 88/160 (55%), Gaps = 6/160 (3%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR+Q G+P +A V+  FNQ ++ID  +L+ W  + + VP SIL  +     G+  I A A
Sbjct: 402 SRRQMGIPRNAFVFGIFNQYFQIDSLSLETWFKIKEQVPGSILLFILVNKEGQQAIVAKA 461

Query: 103 QALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           + L +D + + F +   K + +V R   AD+ LDT   N    + D+LW+G P++TL  +
Sbjct: 462 RQLEIDLNEVYFLSQDIKSQVNVNRMFTADLALDTLQINARIVNTDLLWSGVPLLTLKSD 521

Query: 162 TLASRVAASQLATLGCP-----ELIARTHKEYQDIAIRLG 196
             A+RV AS L+ LG        L+A T ++Y   A+ L 
Sbjct: 522 NWANRVGASMLSGLGLSVQQYDSLVATTVQQYIQKAVDLA 561


>gi|428210828|ref|YP_007083972.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Oscillatoria acuminata PCC 6304]
 gi|427999209|gb|AFY80052.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Oscillatoria acuminata PCC 6304]
          Length = 744

 Score = 96.3 bits (238), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 57/160 (35%), Positives = 93/160 (58%), Gaps = 6/160 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI----Q 99
           R + G+P+DA+VY +    YK +P  +++ + +++AVPNS   L+K     + ++    +
Sbjct: 539 RDELGIPDDAVVYFSGQTGYKRNPQNVRLQLQIIQAVPNSYF-LIKSSIADQESVKRFFE 597

Query: 100 ATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
             A+  G+ + R+ F   VA++E H     +ADV LDT   NG TT+++ LW G P+VT 
Sbjct: 598 QMAEEEGVKRDRLRFLPPVASEEIHRANLAIADVVLDTYPYNGATTTLETLWMGIPLVTR 657

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            GE  A+R + + +  +G  E IA T ++Y    IRLGTD
Sbjct: 658 VGEQFAARNSYTMMMNVGVSEGIAWTDEDYVAWGIRLGTD 697


>gi|425445186|ref|ZP_18825222.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9443]
 gi|389734920|emb|CCI01527.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9443]
          Length = 717

 Score = 96.3 bits (238), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 62/163 (38%), Positives = 95/163 (58%), Gaps = 7/163 (4%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ--- 99
           +RQ   +P DAI Y       K +P+T+++ ++++KAVPNS  +LL   A  E  I+   
Sbjct: 515 TRQNLNIPADAITYLTLQVGLKRNPATIRLQMHIIKAVPNS--YLLIKGAGSEKLIKDLF 572

Query: 100 -ATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
            + A+  G+D++R+ F S VA +E H    Q+ADV LDT   +G TT+++ LW G PVVT
Sbjct: 573 ISIAKEEGIDENRLRFLSGVATEEIHRANLQIADVVLDTYPYSGATTTLEALWMGIPVVT 632

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             G+  ASR + + +   G  E IA + +EY    I+LG D +
Sbjct: 633 KVGQQWASRNSYTFITYAGISEGIAWSDEEYIQWGIKLGMDEE 675


>gi|119487761|ref|ZP_01621270.1| hypothetical protein L8106_29805 [Lyngbya sp. PCC 8106]
 gi|119455594|gb|EAW36731.1| hypothetical protein L8106_29805 [Lyngbya sp. PCC 8106]
          Length = 744

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 61/160 (38%), Positives = 87/160 (54%), Gaps = 5/160 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA---VGEANIQA 100
           R    +P DA++Y +    YK  P T +M +++LKAVPNS   +  F     V E  IQ 
Sbjct: 541 RDDLNIPGDAVIYLSSQTGYKYHPQTAKMQIHILKAVPNSYFLIKGFKKQDIVKEFFIQ- 599

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
            A+  G++  R++F S V +   H     +ADV LDT   NG TT+++ LW   P+VT  
Sbjct: 600 LAKDEGVNPDRLIFLSGVPSSIIHRANLAIADVLLDTFPYNGATTTLETLWMEVPMVTRV 659

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           G+  ASR + + +   G  E IA T +EY +  IRLGTD 
Sbjct: 660 GQQFASRNSYTMMVNAGINEGIAWTDEEYVEWGIRLGTDE 699


>gi|196232459|ref|ZP_03131312.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
 gi|196223531|gb|EDY18048.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
          Length = 672

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 73/145 (50%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           + +   N   K++  T + W  +L+ V  S L LL  P      +    +  G+D  RI 
Sbjct: 480 VTFGCLNNFCKLNEQTWKSWARILRQVSGSRLILLSAPGSHRQRVVDGLEREGIDPRRIE 539

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F   A +  ++   Q  D+ LD+   NGHTTS+D LW G PVV+  G    SR   SQL+
Sbjct: 540 FVERAPRRGYLEYYQRLDIALDSFPYNGHTTSLDALWMGVPVVSRAGRGPLSRAGWSQLS 599

Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
            LG PEL+A +  +Y  IA+ L  D
Sbjct: 600 NLGLPELVAFSEDDYIRIAVELAGD 624


>gi|409991248|ref|ZP_11274526.1| hypothetical protein APPUASWS_09479 [Arthrospira platensis str.
           Paraca]
 gi|409937884|gb|EKN79270.1| hypothetical protein APPUASWS_09479 [Arthrospira platensis str.
           Paraca]
          Length = 744

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 59/164 (35%), Positives = 86/164 (52%), Gaps = 3/164 (1%)

Query: 38  SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
           S+   +R    +P+DAIVY    +  K +P   ++ + +L+ VPNS   +  F      N
Sbjct: 534 SVPTITRADLAIPDDAIVYLGLQRGPKYNPHIAKLQLEILREVPNSYFLVKGFGQQDSLN 593

Query: 98  --IQATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
                 A   G+ + RI F + V  +EEH     +ADV LDT   NG TT+M+ LW G P
Sbjct: 594 QFFFDIANQQGITRERIKFIAPVKWEEEHRANLLIADVVLDTYPYNGATTTMETLWMGIP 653

Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +VT  GE  A+R + + +   G  E IA T +EY +  +RLGTD
Sbjct: 654 LVTRVGEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTD 697


>gi|291566901|dbj|BAI89173.1| hypothetical protein [Arthrospira platensis NIES-39]
          Length = 744

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 59/164 (35%), Positives = 86/164 (52%), Gaps = 3/164 (1%)

Query: 38  SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
           S+   +R    +P+DAIVY    +  K +P   ++ + +L+ VPNS   +  F      N
Sbjct: 534 SVPTITRADLAIPDDAIVYLGLQRGPKYNPHIAKLQLEILREVPNSYFLVKGFGQQDSLN 593

Query: 98  --IQATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
                 A   G+ + RI F + V  +EEH     +ADV LDT   NG TT+M+ LW G P
Sbjct: 594 QFFFDIANQQGITRERIKFIAPVKWEEEHRANLLIADVVLDTYPYNGATTTMETLWMGIP 653

Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +VT  GE  A+R + + +   G  E IA T +EY +  +RLGTD
Sbjct: 654 LVTRVGEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTD 697


>gi|428217092|ref|YP_007101557.1| hypothetical protein Pse7367_0829 [Pseudanabaena sp. PCC 7367]
 gi|427988874|gb|AFY69129.1| hypothetical protein Pse7367_0829 [Pseudanabaena sp. PCC 7367]
          Length = 774

 Score = 94.0 bits (232), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 60/158 (37%), Positives = 87/158 (55%), Gaps = 3/158 (1%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
           RQ +G+P +A+VY       K  P TLQ  + ++KAVPNS L +  L   A      Q  
Sbjct: 569 RQDFGIPAEAVVYLTAQVGLKRHPDTLQRQLEIIKAVPNSYLAIKGLADNASMTKLCQEM 628

Query: 102 AQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
           A A+ +   R++F  ++A E EH     +ADV LDT   NG TT+++ LW G P+VT  G
Sbjct: 629 AAAMEISYDRLVFLPLSATEAEHRANLTIADVVLDTYPYNGATTTLETLWMGIPIVTRVG 688

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +  ++R A S +   G    IA + +EY +  IRLG+D
Sbjct: 689 KQFSARNAYSLMVNAGISAGIAWSDREYVEWGIRLGSD 726


>gi|376003416|ref|ZP_09781227.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira sp. PCC 8005]
 gi|375328219|emb|CCE16980.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira sp. PCC 8005]
          Length = 744

 Score = 94.0 bits (232), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 59/164 (35%), Positives = 86/164 (52%), Gaps = 3/164 (1%)

Query: 38  SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
           S+   +R    +P+DAIVY    +  K +P   ++ + +L+ VPNS   +  F      N
Sbjct: 534 SVPTMTRADLEIPDDAIVYLGLQRGPKYNPHIAKLQLEILREVPNSYFLVKGFGQQDSLN 593

Query: 98  --IQATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
                 A   G+ + RI F + V  +EEH     +ADV LDT   NG TT+M+ LW G P
Sbjct: 594 QFFFDIANQQGITRERIKFITPVKLEEEHRANLLIADVVLDTYPYNGATTTMETLWMGIP 653

Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +VT  GE  A+R + + +   G  E IA T +EY +  +RLGTD
Sbjct: 654 LVTRVGEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTD 697


>gi|361068591|gb|AEW08607.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165155|gb|AFG65434.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165157|gb|AFG65435.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165159|gb|AFG65436.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165161|gb|AFG65437.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165163|gb|AFG65438.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165165|gb|AFG65439.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165167|gb|AFG65440.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165169|gb|AFG65441.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165171|gb|AFG65442.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165173|gb|AFG65443.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165175|gb|AFG65444.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165177|gb|AFG65445.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165179|gb|AFG65446.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165181|gb|AFG65447.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165183|gb|AFG65448.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
 gi|383165185|gb|AFG65449.1| Pinus taeda anonymous locus CL741Contig1_01 genomic sequence
          Length = 74

 Score = 93.6 bits (231), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 40/60 (66%), Positives = 49/60 (81%)

Query: 111 RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS 170
           +I+F++VAAK EH+RR  LADVCLDTPLCN HTT  DVLW G P++TLP E +A+RVA S
Sbjct: 4   QIIFTDVAAKSEHIRRSSLADVCLDTPLCNAHTTGTDVLWAGVPIITLPLEKMATRVAGS 63


>gi|428175987|gb|EKX44874.1| hypothetical protein GUITHDRAFT_109295 [Guillardia theta CCMP2712]
          Length = 703

 Score = 93.6 bits (231), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 58/185 (31%), Positives = 93/185 (50%), Gaps = 22/185 (11%)

Query: 34  EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV 93
           ++P   V   R ++GLPE A + C+F +  KI    +  WV +LK V +S+LW+ ++  +
Sbjct: 458 DLPSQHVPQMRSRFGLPEVAPLVCSFCEANKISQELVDSWVRILKRVEDSVLWIPRYNPL 517

Query: 94  GEANIQATAQALGLDQHRILFSN-------------------VAAKEEHVRRGQLADVCL 134
            E N++   +  GL+  R+LFS                    V +  E++     ADV L
Sbjct: 518 AEENLRVEFKRRGLNVSRVLFSPPEVRCLGGKCTRHAGNGSLVLSDMEYLAAALCADVVL 577

Query: 135 DT-PLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLG--CPELIARTHKEYQDI 191
           DT P  +   T   +LW   P +T+PG  ++SR  ASQ+ +LG     L+ART  E +D+
Sbjct: 578 DTGPDISSSATFGSMLWAAIPSLTMPGFNMSSRSGASQVLSLGPAAASLVARTTSELEDM 637

Query: 192 AIRLG 196
            + L 
Sbjct: 638 TVLLA 642


>gi|428174094|gb|EKX42992.1| hypothetical protein GUITHDRAFT_163991 [Guillardia theta CCMP2712]
          Length = 1166

 Score = 93.6 bits (231), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 48/151 (31%), Positives = 81/151 (53%), Gaps = 1/151 (0%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
           LPED IV+C  + ++K+D     +W+ +L  VP S L L +     +  ++  A   G+D
Sbjct: 620 LPEDEIVFCALHHVWKMDEDIFSVWMELLLHVPGSKLRLQECAEEAKVTMRQFASRKGVD 679

Query: 109 QHRILFS-NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
             R++F+  +    EHV +     + LD P  N HT++ D LW   PV+T+P + +  R 
Sbjct: 680 PSRLVFNPKIGDYYEHVDKLSKCRLFLDAPKYNAHTSAGDALWANVPVITVPHQMMVHRG 739

Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           AA  + +      I R  ++Y+ +A+RLG+D
Sbjct: 740 AACLVYSTNVTSTIVRNLEDYKVMALRLGSD 770


>gi|428313126|ref|YP_007124103.1| O-linked N-acetylglucosamine transferase, SPINDLY family
           [Microcoleus sp. PCC 7113]
 gi|428254738|gb|AFZ20697.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Microcoleus sp. PCC 7113]
          Length = 742

 Score = 93.2 bits (230), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 59/159 (37%), Positives = 88/159 (55%), Gaps = 5/159 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA---VGEANIQA 100
           R Q  +P DA++Y +  + +K  P T ++ + +LK VPNS   L+K  A     EA  + 
Sbjct: 535 RDQLDIPSDAVIYFSSQRGFKRHPDTARLQMQILKEVPNSYF-LIKGGADEEAIEAFFKE 593

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
            A+A G++  R+ F   V ++  H     +ADV LDT   NG TT+++ LW G P+VT  
Sbjct: 594 LAEAEGVESDRLRFLPGVFSEATHRANLGIADVVLDTYPYNGATTTLETLWMGVPLVTRV 653

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           GE  A+R + + +   G  E IA T +EY +  IRLG D
Sbjct: 654 GEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGIRLGKD 692


>gi|119510218|ref|ZP_01629356.1| hypothetical protein N9414_10548 [Nodularia spumigena CCY9414]
 gi|119465168|gb|EAW46067.1| hypothetical protein N9414_10548 [Nodularia spumigena CCY9414]
          Length = 729

 Score = 93.2 bits (230), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 89/163 (54%), Gaps = 8/163 (4%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ--- 99
           SR+   +PEDAIVY +     K +P  +++ + ++K VPNS   LLK       +++   
Sbjct: 527 SRESLDIPEDAIVYFSSQTGLKRNPQNIRLQMQIIKQVPNSYF-LLKSHRTNHQDLKDFI 585

Query: 100 ---ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVV 156
              A A+ + L+  R L S V    EH     +ADV LDT   NG TT+++ LW G P+V
Sbjct: 586 TPIAEAEGVDLECFRFLPS-VPTDTEHRANLAIADVVLDTYPYNGATTTLETLWMGIPLV 644

Query: 157 TLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           T  GE  A+R + + +  +G  E +AR+ +EY +  +RLG D 
Sbjct: 645 TRVGEQFAARNSYTMMMNVGVTEGLARSDEEYVEWGVRLGKDE 687


>gi|209527310|ref|ZP_03275819.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
 gi|209492236|gb|EDZ92582.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
          Length = 742

 Score = 93.2 bits (230), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 60/160 (37%), Positives = 84/160 (52%), Gaps = 5/160 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---- 99
           RQ   +PE+AIVY    +  K +P   Q+ + +LK VPNS L +  F      N      
Sbjct: 539 RQDIHIPENAIVYFTAQRGPKYNPELAQLQMQILKQVPNSYLVIKGFGEQESINRLFLDL 598

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           A  Q +  DQ R +   VA +E H     +ADV LD+   NG TT+++ LW G P+VT  
Sbjct: 599 AEQQEVSRDQFRFI-GAVALEETHRANLAIADVVLDSYPYNGATTTLETLWMGIPLVTRV 657

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           GE  A+R + + +   G  E IA T +EY +  +RLGTD 
Sbjct: 658 GEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTDE 697


>gi|33865520|ref|NP_897079.1| TPR domain-containing protein [Synechococcus sp. WH 8102]
 gi|33632689|emb|CAE07501.1| TPR domain protein [Synechococcus sp. WH 8102]
          Length = 928

 Score = 92.8 bits (229), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 58/165 (35%), Positives = 88/165 (53%), Gaps = 8/165 (4%)

Query: 37  QSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA 96
           Q +   SR  +GLP++A+V+C FN+  KI       W+ +L+AVP S+LWL   P   ++
Sbjct: 731 QPMAGISRSSFGLPDEAVVFCCFNRGDKITAGIFSSWMTILEAVPGSLLWLAVKPKALQS 790

Query: 97  NIQATAQALGLDQHRILFSNVAAKEEHVRRG----QLADVCLDTPLCNGHTTSMDVLWTG 152
            +QA AQ  G+D HR++    A  ++ V+R       AD+ LDT   N     +  L  G
Sbjct: 791 -LQAQAQKRGIDPHRLV---AAPYQKPVQRFIAAMACADLFLDTAGFNAGAIGVLALNAG 846

Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
            P++T+ G    +R+ AS    +  P+LI  T + YQ  AI L T
Sbjct: 847 LPLITIAGNRFCARMGASLCNAVHQPQLITTTPEAYQQKAIELAT 891


>gi|358635829|dbj|BAL23126.1| hypothetical protein AZKH_0787 [Azoarcus sp. KH32C]
          Length = 586

 Score = 92.8 bits (229), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 48/150 (32%), Positives = 85/150 (56%), Gaps = 5/150 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALGLDQ 109
           +++ +FN L K+    L +W  VL+A+P++ L++ +  A+G+      +Q   +  G+  
Sbjct: 391 LIFGSFNNLSKVSGPVLDLWAEVLRAIPDAGLFM-QARALGDTGTRERVQGQLEDRGVAP 449

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R++ +  ++   H+      DVCLDT   NGH T++D LW G PV+TL G+  A R+ A
Sbjct: 450 ERVVLAGFSSLAAHLNLFNHIDVCLDTFPWNGHMTTLDSLWMGVPVLTLEGDRRAGRMGA 509

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           +    +G  + +ART +++ + AI L  DR
Sbjct: 510 TIQRAIGLDDFVARTSQDFVERAIALDKDR 539


>gi|196234038|ref|ZP_03132873.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
 gi|196221887|gb|EDY16422.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
          Length = 733

 Score = 92.4 bits (228), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 53/145 (36%), Positives = 73/145 (50%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           I +   N   K++   L++W   LKAV NS L LL          +   +  G+D   I 
Sbjct: 542 ITFGCLNNFCKVNDPLLKLWARGLKAVKNSRLILLSHEGSQTQRTRQFLEGEGVDPAHIE 601

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F    +++ ++      D+ LD    NGH+TS+D LW G PVV+L G+   SR   SQL+
Sbjct: 602 FVAPRSRKPYLELYHRLDIALDPFPYNGHSTSLDALWMGVPVVSLCGDRPVSRAGLSQLS 661

Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
            LG PEL+A T   Y DIA  L  D
Sbjct: 662 NLGLPELVASTKDRYVDIATGLAND 686


>gi|196232458|ref|ZP_03131311.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
 gi|196223530|gb|EDY18047.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
          Length = 747

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 51/145 (35%), Positives = 73/145 (50%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           + +   N   K++  TL+ W  VL+ V +S L LL              ++ G++  R+ 
Sbjct: 559 VTFGCLNHFCKVNEPTLRRWAKVLRTVRDSKLMLLSPEGSHRRKTWRILESEGVEARRVE 618

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F     +  ++   Q  DV LDT   NGHTTS+D LW G PVV+L G +  SR   SQL+
Sbjct: 619 FVPRCGRRAYLELYQQMDVALDTFPYNGHTTSLDALWMGVPVVSLAGRSAVSRAGLSQLS 678

Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
            LG  EL+A    E+ +IA  L  D
Sbjct: 679 NLGLAELVASAEDEFVEIAAALAQD 703


>gi|428175656|gb|EKX44545.1| hypothetical protein GUITHDRAFT_109660 [Guillardia theta CCMP2712]
          Length = 603

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 63/214 (29%), Positives = 96/214 (44%), Gaps = 36/214 (16%)

Query: 16  LQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVN 75
           LQ G+  + +       E +  S++  SR++ GLP+   ++  FN LYKI P+    W+ 
Sbjct: 361 LQLGMLFSSSTHAINHPEMLSSSLI--SRKEAGLPDKMFIFACFNTLYKITPTLFLAWLK 418

Query: 76  VLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL---------------------- 113
           +LK VP        F AV  +     A+  G  +  ++                      
Sbjct: 419 ILKNVPKRFQIPSPF-AVDSSQCPLVARRTGRREGTLMQLCTRKRRLKEEALVVMVVVVV 477

Query: 114 -----------FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
                       S+    E+H++    AD+ LDTP+ N H+T+ D LW G PV+TL  E 
Sbjct: 478 VEEDEEEEEDEDSSKVPLEKHLQVKANADLFLDTPMYNAHSTAADALWAGVPVLTLAREK 537

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
           +A+RV AS   +L  PELI R  +EY  +A  L 
Sbjct: 538 MAARVGASLALSLHMPELIVRNMEEYVSVATLLA 571


>gi|440681073|ref|YP_007155868.1| TPR repeat-containing protein [Anabaena cylindrica PCC 7122]
 gi|428678192|gb|AFZ56958.1| TPR repeat-containing protein [Anabaena cylindrica PCC 7122]
          Length = 737

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 54/162 (33%), Positives = 88/162 (54%), Gaps = 4/162 (2%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLL---KFPAVGEANIQ 99
           +R+   +P+DA++Y +    YK +P  +++ + +LK VPNS   +    K   V     +
Sbjct: 534 TRKNLEIPDDAVIYFSAQAGYKRNPDNVRLQMKILKEVPNSYFLIKDIHKEVGVVRQFFE 593

Query: 100 ATAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
             A+  G+ + R+ F N V +   H    Q+ADV LDT   NG TT+M+ LW G P+VT 
Sbjct: 594 KIAEEEGVSKERLRFLNKVPSSAIHRANLQIADVVLDTYPYNGATTTMETLWVGIPLVTK 653

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            GE  ++R + S +   G  E IA + +EY +  ++LG + D
Sbjct: 654 VGEQFSARNSYSMMVNAGITEGIAWSDEEYIEWGVKLGKETD 695


>gi|434404762|ref|YP_007147647.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Cylindrospermum stagnale PCC 7417]
 gi|428259017|gb|AFZ24967.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Cylindrospermum stagnale PCC 7417]
          Length = 742

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 61/160 (38%), Positives = 85/160 (53%), Gaps = 7/160 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---A 100
           RQ   +P DA+VY +  + YK  P T +  + ++K VPNS   L+K  A  EA  Q    
Sbjct: 541 RQDLDIPSDAVVYLSSQRSYKRHPETTKWQMRIIKEVPNSYF-LIKGDADEEAIKQFFYK 599

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLA--DVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
            A+  G+D  R+ F       E V R  LA  DV LDT   NG TT+++ LW G P+VT 
Sbjct: 600 IAEEEGVDSSRLRFLP-QDPSEAVHRANLAIADVVLDTFPYNGATTTLETLWMGIPLVTR 658

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            GE   SR + + +  +G  E +A T +EY +  +RLG D
Sbjct: 659 VGEQFVSRNSYTMMMNVGVTEGLAWTDEEYVEWGVRLGKD 698


>gi|428315860|ref|YP_007113742.1| TPR repeat-containing protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428239540|gb|AFZ05326.1| TPR repeat-containing protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 744

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/159 (37%), Positives = 87/159 (54%), Gaps = 5/159 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE---ANIQA 100
           R +  +P DA+VY +  + +K +P+T ++ + +LK VPNS   LLK  A  E   +    
Sbjct: 541 RDELNIPSDAVVYLSAQRGFKYNPNTARLQMKILKEVPNSYF-LLKGMANQELVKSFFAQ 599

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
            A+  G+D  R+ F   VA    H     +ADV LDT   NG TT+++ LW G P+VT  
Sbjct: 600 LAEEEGVDCDRLRFLPGVALSATHRANLGIADVVLDTYPYNGATTTLETLWMGIPIVTRV 659

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           G+  +SR + + +   G  E IA T +EY +  IRLG D
Sbjct: 660 GQQFSSRNSYTMMINAGITEGIAWTDEEYLEWGIRLGKD 698


>gi|398804002|ref|ZP_10563006.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Polaromonas sp. CF318]
 gi|398095201|gb|EJL85547.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Polaromonas sp. CF318]
          Length = 710

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 47/156 (30%), Positives = 86/156 (55%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
           SR++ GLP DA+V C       +D ++   W+ +L+++P+++LWL  +     AN+   A
Sbjct: 500 SREKAGLPADALVLCATVPSVMLDAASFAAWMKILRSLPDAVLWLPAYSLAIAANLVREA 559

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
            A G+ + R++F++   + + +   + AD+ LDT   N +    D L  G P ++  GE+
Sbjct: 560 NAAGVHESRLIFTHRLTRAQSLACLKHADLFLDTLRFNANLGLEDALRMGVPALSCAGES 619

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +ASR+  S +   G P+ + ++   Y   A+RLG D
Sbjct: 620 MASRLGGSIIRAAGLPQCVMQSRPAYIAEALRLGRD 655


>gi|298492469|ref|YP_003722646.1| hypothetical protein Aazo_4067 ['Nostoc azollae' 0708]
 gi|298234387|gb|ADI65523.1| conserved hypothetical protein ['Nostoc azollae' 0708]
          Length = 747

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 61/160 (38%), Positives = 87/160 (54%), Gaps = 7/160 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN---IQA 100
           R+   +P DAI+Y    + YK     L++ + ++K VPNS L L+K  A  EA+    + 
Sbjct: 546 REDLNIPRDAIIYFMTQKGYKRHRPHLRLQMKIIKEVPNSYL-LIKGDADPEASKVFFEE 604

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRG--QLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
            AQ  G+D +RI F   A + E V R   Q+ADV LDT   NG TT+++ LW G P+VT 
Sbjct: 605 IAQEEGVDFNRIKFLPYA-RSEAVHRANLQIADVVLDTYPYNGATTTLETLWMGVPLVTR 663

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            GE  ++R +   +      E IA T  EY +  +RLG D
Sbjct: 664 VGEQFSARNSYGMMINAAITEGIAWTEDEYVEWGVRLGKD 703


>gi|428186203|gb|EKX55054.1| hypothetical protein GUITHDRAFT_99692 [Guillardia theta CCMP2712]
          Length = 741

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/156 (34%), Positives = 84/156 (53%), Gaps = 4/156 (2%)

Query: 45  QQYG--LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            Q+G  LP    ++  F+ L K+     Q W  +L+AV  S LW++  P  G  N+   A
Sbjct: 503 HQHGVVLPSSGFIFGAFSNLGKVHVRLWQSWCRILQAVNQSHLWIIAEPPFGVDNLIEAA 562

Query: 103 QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           +  G+++ RI+ ++     +H      A V LD+   NGH+T ++ LW G PV+TLP E 
Sbjct: 563 RGFGVEE-RIVATSRIRVVDHASVKHAAGVYLDSRPFNGHSTVVEALWGGIPVLTLPVER 621

Query: 163 LASRVAASQLATLGC-PELIARTHKEYQDIAIRLGT 197
           +A RVAAS L  +G    L  R   +Y+ +AI+L +
Sbjct: 622 MAGRVAASVLHAVGLGRHLTVRDGGDYEALAIKLAS 657


>gi|345871985|ref|ZP_08823926.1| Tetratricopeptide TPR_2 repeat-containing protein [Thiorhodococcus
           drewsii AZ1]
 gi|343919796|gb|EGV30539.1| Tetratricopeptide TPR_2 repeat-containing protein [Thiorhodococcus
           drewsii AZ1]
          Length = 2237

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 55/152 (36%), Positives = 85/152 (55%), Gaps = 6/152 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN----IQATAQALGLDQ 109
           I + +FN + K+ P TL +W NVL+A+P+S L +LK  +V +      +       G+D 
Sbjct: 598 ITFGSFNNIAKLTPDTLDLWGNVLRALPDSRL-VLKSHSVRDTRFWDYMLGRLTERGVDS 656

Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
           HR+ +     +  EH+ + +  D+ LDT    G TT+ + L+ G PVVTL G+  A RV 
Sbjct: 657 HRVKILPRAPSYLEHLSQYREIDIALDTYPYQGTTTTCEALFMGVPVVTLAGDRHAGRVG 716

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            S +  +G P LIA +  +Y  IA+ L TDR+
Sbjct: 717 VSLMNQVGLPSLIANSADDYVRIAVSLATDRE 748



 Score = 86.3 bits (212), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 53/152 (34%), Positives = 86/152 (56%), Gaps = 6/152 (3%)

Query: 54   IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLDQ 109
            I + +FN L K+  +TL++W ++L AVPN+ L +LK P   E ++     A     G+D+
Sbjct: 2032 ITFGSFNNLAKVSSTTLRLWCDLLLAVPNARL-VLKTPTTTEGHVWGRVVAYFIEHGIDR 2090

Query: 110  HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
             R+      +   +H+   ++ D+ LDT   NG TT+ + L+ G PV+TL G+   +RV 
Sbjct: 2091 ERLETLPRASGYLKHLASYRVIDIALDTYPYNGTTTTCEALFMGVPVITLLGDRHVARVG 2150

Query: 169  ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            AS L  +G  +LIA + +EY  I+  L  D+D
Sbjct: 2151 ASLLTRVGLTDLIAGSEEEYIRISKDLSGDKD 2182



 Score = 79.7 bits (195), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 60/177 (33%), Positives = 90/177 (50%), Gaps = 20/177 (11%)

Query: 31   TGEEVPQSIVITSRQQYGLP----EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILW 86
            +GE++P           GLP     + + + +FN L KI  +TL +W  V+KAVP++ L 
Sbjct: 1271 SGEDLP----------VGLPPCLERNRVTFGSFNNLAKITNTTLDLWGGVMKAVPDARL- 1319

Query: 87   LLKFPAVGEANIQATAQALGLDQ----HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNG 141
            LLK     + ++     A   DQ     R+ +     +  EH+ + Q  D+ LD    +G
Sbjct: 1320 LLKTHTASDIDVWNRLVAYLADQGIATDRVEMLPRAPSYMEHLVQYQRLDIALDAFPYHG 1379

Query: 142  HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             TT+ + L+ G PVV+L G+  ASRV  S L   G PELIA + ++Y  IA  L  D
Sbjct: 1380 TTTTCEALFMGVPVVSLMGDRHASRVGGSLLTHAGLPELIAESTEDYIRIARDLARD 1436


>gi|443321167|ref|ZP_21050230.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Gloeocapsa sp. PCC 73106]
 gi|442789133|gb|ELR98803.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Gloeocapsa sp. PCC 73106]
          Length = 726

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 83/163 (50%), Gaps = 13/163 (7%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   G+P DAI+Y N     K  P TL + + ++ AVPNS      F   G  +++   +
Sbjct: 522 RDTLGIPSDAIIYLNIQNALKRHPHTLDLQMQIIGAVPNSY-----FLTKGSGDLEIIKR 576

Query: 104 AL-------GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
                    G+   R+ F      EE H     +ADV LDT   NG TT+++VLW G P+
Sbjct: 577 VFTEAALRHGVSPDRLRFLPFTPDEETHRANLAIADVILDTYPYNGATTTLEVLWMGIPL 636

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           VT  GE  A+R + + +  +G  E IA T +EY +  I+LGTD
Sbjct: 637 VTRVGEQFAARNSYAFMQQVGLNEGIAWTDEEYVEWGIKLGTD 679


>gi|443325693|ref|ZP_21054376.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Xenococcus sp. PCC 7305]
 gi|442794696|gb|ELS04100.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Xenococcus sp. PCC 7305]
          Length = 1020

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/169 (34%), Positives = 84/169 (49%), Gaps = 6/169 (3%)

Query: 35  VPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
           +PQ  +   +Q   L    + + +FN   KI P+T+++W  VLKAVPNS L LLK  A+ 
Sbjct: 805 LPQDFLPEPQQLPALKNGYLTFGSFNNSSKISPTTIRLWSQVLKAVPNSRL-LLKALALE 863

Query: 95  EAN----IQATAQALGLDQHRILFSNVAAK-EEHVRRGQLADVCLDTPLCNGHTTSMDVL 149
           +       +    A G+D  R++      K E         D+ LD    NG TT+   L
Sbjct: 864 DKGTCEYFRGQFVAQGIDGTRVILEKPTLKIENFFESYHKIDIALDPFPYNGGTTTCQAL 923

Query: 150 WTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           W G PV++L G+   SR++ S L  LG PEL   T  +Y  IA+ L  D
Sbjct: 924 WMGVPVISLRGQQFCSRMSHSFLTNLGLPELSVATEADYIAIAVELAQD 972


>gi|307152772|ref|YP_003888156.1| O-linked N-acetylglucosamine transferase [Cyanothece sp. PCC 7822]
 gi|306983000|gb|ADN14881.1| O-linked N-acetylglucosamine transferase SPINDLY family-like
           protein [Cyanothece sp. PCC 7822]
          Length = 718

 Score = 90.9 bits (224), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 62/159 (38%), Positives = 89/159 (55%), Gaps = 5/159 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---A 100
           R+   +P +AI+Y +     K  P T+++ + VLK VPNS L L+K     E   Q   +
Sbjct: 518 REHLNIPSEAIIYLSVQTGLKRHPDTIRLQMKVLKEVPNSYL-LVKGAGQTEKIEQLFLS 576

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
            A   G+  +RI F S VA++E H     +ADV LDT   NG TT+++VLW G P+VT  
Sbjct: 577 IASEEGVSPNRIRFLSTVASEEIHRANLTIADVVLDTFPYNGATTTLEVLWMGIPLVTRV 636

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           G+  A+R + + +   G  E IA T +EY    I+LGTD
Sbjct: 637 GQQFAARNSYTFMIHAGLTEGIAWTDEEYIQWGIKLGTD 675


>gi|218248286|ref|YP_002373657.1| hypothetical protein PCC8801_3536 [Cyanothece sp. PCC 8801]
 gi|218168764|gb|ACK67501.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
          Length = 731

 Score = 90.9 bits (224), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 59/163 (36%), Positives = 89/163 (54%), Gaps = 9/163 (5%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG-EANIQAT 101
           +R+Q  +PEDAI+Y      +K  P  +   + +L+ VPNS L L+K   +G +  IQ  
Sbjct: 526 TREQLNIPEDAIIYWTGQNGFKRHPDMINCQMQILQEVPNSYL-LIK--GIGNDEKIQQL 582

Query: 102 AQALGLDQH----RILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVV 156
            + + L       R  F  + A E  H    Q+AD+ LDT   NG TT+++VLW G P+V
Sbjct: 583 FKTIALKYQVSPDRFRFLPIVADELTHRANLQIADIVLDTYPYNGATTTLEVLWMGIPLV 642

Query: 157 TLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           T  GE  A+R + + +  +G  E IA T++EY D  I+LG + 
Sbjct: 643 TRVGEQFAARNSYTFMTNVGLTEGIAWTNEEYIDWGIQLGKNE 685


>gi|428173813|gb|EKX42713.1| hypothetical protein GUITHDRAFT_141110 [Guillardia theta CCMP2712]
          Length = 870

 Score = 90.5 bits (223), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 51/141 (36%), Positives = 76/141 (53%), Gaps = 9/141 (6%)

Query: 57  CNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA--QALGLDQHRILF 114
           CN+NQ +K+DP    +WV +L  VPNS L +LK+P   E +++  A  Q LGL   R++F
Sbjct: 547 CNWNQHFKLDPEFFAVWVRILMQVPNSTLVMLKYPEESEPHLRYAARKQQLGLTGERLIF 606

Query: 115 SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLAT 174
              A+  EH+ R    D+ LD    N  TT  D LW G P +++P    A+    +QL  
Sbjct: 607 LPKASLTEHLMRTSFCDLFLDNREYNSGTTCTDSLWAGVPTISMPRMKHAAVGGNAQL-- 664

Query: 175 LGCPELIARTHKEYQDIAIRL 195
                L AR  +EY+ +++ L
Sbjct: 665 -----LQARNLEEYESMSVLL 680


>gi|257060387|ref|YP_003138275.1| O-linked N-acetylglucosamine transferase [Cyanothece sp. PCC 8802]
 gi|256590553|gb|ACV01440.1| O-linked N-acetylglucosamine transferase SPINDLY family-like
           protein [Cyanothece sp. PCC 8802]
          Length = 731

 Score = 90.5 bits (223), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 59/163 (36%), Positives = 90/163 (55%), Gaps = 9/163 (5%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG-EANIQAT 101
           +R+Q  +PEDAI+Y      +K  P  +   + +L+ VPNS L L+K   +G +  IQ  
Sbjct: 526 TREQLNIPEDAIIYWTGQNGFKRHPDMINCQMQILQDVPNSYL-LIK--GIGNDEKIQQL 582

Query: 102 AQALGLDQH----RILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVV 156
            + + L       R+ F  + A E  H    Q+AD+ LDT   NG TT+++VLW G P+V
Sbjct: 583 FKTIALKYQVSPDRLRFLPIVADELTHRANLQIADIVLDTYPYNGATTTLEVLWMGIPLV 642

Query: 157 TLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           T  GE  A+R + + +  +G  E IA T++EY D  I+LG + 
Sbjct: 643 TRVGEQFAARNSYTFMTNVGLTEGIAWTNEEYIDWGIQLGKNE 685


>gi|94272257|ref|ZP_01292095.1| Glycosyl transferase, family 2 [delta proteobacterium MLMS-1]
 gi|93450186|gb|EAT01491.1| Glycosyl transferase, family 2 [delta proteobacterium MLMS-1]
          Length = 743

 Score = 90.5 bits (223), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 52/164 (31%), Positives = 84/164 (51%), Gaps = 1/164 (0%)

Query: 37  QSIVITSRQQYGLP-EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           Q  V+ +R + GLP ED ++   FN++ K+       W+++LK  P + LW+L       
Sbjct: 156 QRPVVPARAELGLPGEDVLIVGMFNKINKVRGEVWAAWMSILKLAPKAWLWVLDPGLAAR 215

Query: 96  ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
           AN+   A+   + + RILF+    +  H+ R    D+ LD     GHT++ D L+ G PV
Sbjct: 216 ANLLKAAREHQVAEKRILFAPALPQAGHLSRLGACDLMLDPWPYGGHTSTTDALFAGVPV 275

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           +TL G   A RV+A  L +   P L+  + ++Y   A  L  +R
Sbjct: 276 ITLEGRNFAGRVSAGLLKSANLPALVRGSVQDYIQTAAGLLNNR 319


>gi|257092158|ref|YP_003165799.1| TPR repeat-containing protein [Candidatus Accumulibacter phosphatis
           clade IIA str. UW-1]
 gi|257044682|gb|ACV33870.1| TPR repeat-containing protein [Candidatus Accumulibacter phosphatis
           clade IIA str. UW-1]
          Length = 597

 Score = 90.5 bits (223), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 82/162 (50%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           P +  +  R +  LP +  V+C F+  YK++P+   +W+ +L+ V  S+LWLL      E
Sbjct: 374 PAAGRVPERVELALPAEGFVFCCFSHSYKLNPAMFAVWMRLLRQVAGSVLWLLAAKPEAE 433

Query: 96  ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
             ++  A   G+D  R++F+       H+ R   AD+ LDT   N  TT+   L  G PV
Sbjct: 434 ERLRREADRCGVDPTRLVFAPRIDLAAHLARHAQADLFLDTLPFNAITTTSVALHAGLPV 493

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           ++  G + A+R A S L  +G   L+     +Y+  A+ L T
Sbjct: 494 LSCSGRSFAARGAGSLLQAVGMSALVTGNLSDYEARALELAT 535


>gi|381160366|ref|ZP_09869598.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
 gi|380878430|gb|EIC20522.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
          Length = 731

 Score = 89.7 bits (221), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 51/166 (30%), Positives = 86/166 (51%), Gaps = 6/166 (3%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           P    +  R++   P + + + +FN   K+ P+T+ +W ++L+  PN+ L LL++ ++  
Sbjct: 521 PPEPAVPIRKRNANPRE-LTFGSFNNTIKLSPATIALWSHILRETPNTRL-LLRYASLRH 578

Query: 96  ANIQATA----QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
           A I+        A G+   R+     A++ E +      D+ LD     G  T+ + LW 
Sbjct: 579 AEIRRQLLERFAAHGISAERLTLEGKASRTEMLETYNRVDIALDPTPYGGGITTAEALWM 638

Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
           G PV+TL G     R +AS L T+GCP L+A+  +EY  +AI L T
Sbjct: 639 GVPVITLHGGAWPGRHSASILNTIGCPGLVAKNEEEYVALAISLAT 684


>gi|196232796|ref|ZP_03131647.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
 gi|196223256|gb|EDY17775.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
          Length = 591

 Score = 89.7 bits (221), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 50/145 (34%), Positives = 74/145 (51%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           I +  FN   K +  TL  +  VL +V NS L LL      + ++       G+   RI 
Sbjct: 400 ITFGCFNNFSKFNEETLARFSRVLHSVKNSHLVLLAPEGSAQRHLSERFAHYGIAGDRIE 459

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F+   ++ +++RR    D+ LDT   NG  T+ + LW G PV++L GET   R     L+
Sbjct: 460 FAATGSRADYLRRYHRVDIALDTLPSNGMATTCETLWMGVPVISLVGETSVGRAGLGLLS 519

Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
           T+G PEL+ RT  E+   A +L TD
Sbjct: 520 TIGLPELVTRTPDEFVSTATQLATD 544


>gi|390569448|ref|ZP_10249733.1| TPR repeat-containing protein [Burkholderia terrae BS001]
 gi|389938308|gb|EIN00152.1| TPR repeat-containing protein [Burkholderia terrae BS001]
          Length = 709

 Score = 89.4 bits (220), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 58/152 (38%), Positives = 78/152 (51%), Gaps = 5/152 (3%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLD 108
           AI +  FN L K++ + + +W  VL AVP S L L    L  PAV  A ++  A A G+D
Sbjct: 489 AITFGCFNHLVKLNDAVIALWSRVLDAVPGSCLLLKTRQLDDPAVQRATLERFA-AHGID 547

Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
             R+L    + + E +   Q  D+ LD     G TTS++ LW G PV+T  GE   S V 
Sbjct: 548 GARLLLEGQSPRAELLAAYQRVDIALDPFPYAGGTTSVEALWMGVPVLTRRGERFLSHVG 607

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            S + T G PE IA    +Y   A+R  T RD
Sbjct: 608 ESIVNTTGLPEWIAADDADYLSKALRFSTQRD 639


>gi|90079719|dbj|BAE89539.1| unnamed protein product [Macaca fascicularis]
          Length = 113

 Score = 89.4 bits (220), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 40/53 (75%), Positives = 47/53 (88%)

Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           MDVLW GTP+VT+PGETLASRVAASQL  LGC ELIA+  +EY+DIA++LGTD
Sbjct: 1   MDVLWAGTPMVTMPGETLASRVAASQLTCLGCLELIAKNRQEYEDIAVKLGTD 53


>gi|428217495|ref|YP_007101960.1| hypothetical protein Pse7367_1237 [Pseudanabaena sp. PCC 7367]
 gi|427989277|gb|AFY69532.1| TPR repeat-containing protein [Pseudanabaena sp. PCC 7367]
          Length = 798

 Score = 89.4 bits (220), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 55/159 (34%), Positives = 84/159 (52%), Gaps = 3/159 (1%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG--EANIQAT 101
           R+   +P  ++VY +    YK  P   +  + ++KAVPNS   +  F      +      
Sbjct: 596 RRDLNIPAASVVYLSSQIGYKRHPDHARSQMQIIKAVPNSYFLIKGFADQNTVQKFFYEL 655

Query: 102 AQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
           A   G+++ R++F +N + + EH     +ADV LDT   NG TT+M+ LW G P+VT  G
Sbjct: 656 ADQEGVERDRLIFLANTSLEIEHRANLAIADVVLDTFPYNGATTTMETLWAGVPIVTRVG 715

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           +  A+R + S L   G    IA + +EY D  IRLG+DR
Sbjct: 716 QQFAARNSYSMLVNAGIEAGIAWSDQEYIDWGIRLGSDR 754


>gi|75908610|ref|YP_322906.1| hypothetical protein Ava_2394 [Anabaena variabilis ATCC 29413]
 gi|75702335|gb|ABA22011.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
          Length = 740

 Score = 89.0 bits (219), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 56/167 (33%), Positives = 89/167 (53%), Gaps = 9/167 (5%)

Query: 38  SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
           S+    R+Q  +P DAIVY +  + +K  P T ++ + ++K VPNS   L+K   V E  
Sbjct: 532 SVPTIRREQLDIPRDAIVYFSGQRGFKRHPHTTRLQLKIIKEVPNSYF-LIK--GVSEEE 588

Query: 98  -----IQATAQALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
                 +  A+  G++  ++ F  + + E  H    ++ D+ LDT   NG TT+++ LW 
Sbjct: 589 GIKKFFEQLAEEEGVELSKLRFLPIDSTEPIHRANLEIVDIVLDTYPYNGATTTLETLWM 648

Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             P+VT  GE  A+R + + +   G PE IA T +EY D  +RLG D
Sbjct: 649 CIPMVTRVGEQFAARNSYTMMMNAGIPEGIAWTDEEYVDWGVRLGKD 695


>gi|381160356|ref|ZP_09869588.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
 gi|380878420|gb|EIC20512.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
          Length = 655

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 56/146 (38%), Positives = 81/146 (55%), Gaps = 6/146 (4%)

Query: 58  NFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA----TAQALGLDQHRIL 113
           +FN L K+   TL +W  V++A+P S+L +LK     +A +Q     T +  G++  R+ 
Sbjct: 466 SFNNLAKVGEQTLALWSRVMRALPESML-ILKGRGAEDAAVQTRILETLRGHGIEGARLE 524

Query: 114 F-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
           F     ++ EH+      D+ LDT   N  TT+ D LW GTP+VT+ G  LA R AAS L
Sbjct: 525 FWPYTPSRSEHLALYNQIDLALDTIPFNSATTACDALWMGTPLVTVLGGQLAGRQAASIL 584

Query: 173 ATLGCPELIARTHKEYQDIAIRLGTD 198
             LG PE IAR + E+ D+   L +D
Sbjct: 585 TGLGRPEWIARNNDEFVDVVASLASD 610


>gi|414077428|ref|YP_006996746.1| hypothetical protein ANA_C12192 [Anabaena sp. 90]
 gi|413970844|gb|AFW94933.1| TPR repeat-containing protein [Anabaena sp. 90]
          Length = 733

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 85/162 (52%), Gaps = 6/162 (3%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ--- 99
           SR+   +P D I+Y +     K +P  +++ + ++K VPNS   LLK       ++Q   
Sbjct: 530 SRESLDIPNDGIIYFSSQSGLKRNPDNIRLQMQIIKQVPNSYF-LLKTSRSNHEDMQNFI 588

Query: 100 -ATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
              A+A GLD     F   A  E EH     +AD+ LDT   NG TT+++ LW G P+VT
Sbjct: 589 APLAEAQGLDLECFRFLPTAPTEMEHRANLAIADIVLDTYPYNGATTTLETLWMGVPIVT 648

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
             GE  A+R + + +  +G  E +A + +EY +  +RLG D 
Sbjct: 649 RVGEQFAARNSYTMMMNVGVTEGLAWSDEEYVEWGVRLGKDE 690


>gi|253702411|ref|YP_003023600.1| hypothetical protein GM21_3825 [Geobacter sp. M21]
 gi|251777261|gb|ACT19842.1| Tetratricopeptide TPR_2 repeat protein [Geobacter sp. M21]
          Length = 589

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 57/152 (37%), Positives = 78/152 (51%), Gaps = 6/152 (3%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLD 108
           A+ + +FN L K+ P T+ +W  VL AVP S L LLK     ++ ++    +L    G+ 
Sbjct: 396 AVTFGSFNNLAKVTPETIALWCRVLDAVPGSRL-LLKGRPFADSGVRERIASLFARGGIA 454

Query: 109 QHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
             R+ L         H+ +    D+ LDT   NG TT+ + LW G PVVTL G   A+R 
Sbjct: 455 GERVELHPGEPENSAHLAQYGRVDIALDTFPYNGTTTTCEALWMGVPVVTLAGTRHAART 514

Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            AS L   G  EL+A    EY +IA RL  DR
Sbjct: 515 GASILTNCGLDELVAEDEGEYLEIARRLAADR 546


>gi|300866601|ref|ZP_07111289.1| hypothetical protein OSCI_3020012 [Oscillatoria sp. PCC 6506]
 gi|300335373|emb|CBN56449.1| hypothetical protein OSCI_3020012 [Oscillatoria sp. PCC 6506]
          Length = 1294

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 61/175 (34%), Positives = 88/175 (50%), Gaps = 11/175 (6%)

Query: 35   VPQSI--VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA 92
            VP  I  V  +R  +G  E  +V+ +  + +KI P   + W  ++ AVPNSIL L  F +
Sbjct: 1071 VPSEIAKVHPTRSSWGASEGTVVFMSGARAFKIIPELRETWAKIIAAVPNSILVLHPFRS 1130

Query: 93   VGEA--------NIQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLCNGHT 143
             GE+         I+A     G+D+ RI+   V       R   +LADV LD+    G T
Sbjct: 1131 DGESYPIIPFFNQIRAVFSEKGIDKKRIVVIKVLPNRADFRECLKLADVYLDSFPYAGAT 1190

Query: 144  TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            + ++ L  G P V   G+T  SR + + L  L  PELIA T   Y +IA+ LGT+
Sbjct: 1191 SMVEPLLVGIPTVVKEGQTARSRQSGAILRELQLPELIANTEDAYINIAVALGTN 1245


>gi|307152771|ref|YP_003888155.1| O-linked N-acetylglucosamine transferase [Cyanothece sp. PCC 7822]
 gi|306982999|gb|ADN14880.1| O-linked N-acetylglucosamine transferase SPINDLY family-like
           protein [Cyanothece sp. PCC 7822]
          Length = 724

 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 60/165 (36%), Positives = 87/165 (52%), Gaps = 13/165 (7%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA 102
            RQQ  +P+ A VY +     K  P T+++ + +LKAVP+S L L+K    G  N +   
Sbjct: 520 KRQQLDIPDGAFVYLSVQTGLKRHPDTIRLQMKILKAVPDSYL-LIK----GAGNFEKIQ 574

Query: 103 QAL-------GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
           Q          +  +R+ F + A  EE H     +ADV LDT   NG TT+++VLW G P
Sbjct: 575 QLFVSIASEEDVSPNRLRFLSQAPTEEIHRANLAIADVVLDTYPYNGATTTLEVLWMGIP 634

Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           +VT  GE  A+R + + +   G  E IA T +EY    ++LGTD 
Sbjct: 635 LVTRVGEQFAARNSYTFMINAGLTEGIAWTDEEYIQWGVKLGTDE 679


>gi|75756032|gb|ABA27069.1| TO89-1 [Taraxacum officinale]
          Length = 89

 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 43/79 (54%), Positives = 55/79 (69%), Gaps = 1/79 (1%)

Query: 122 EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ-LATLGCPEL 180
           E++RR  LAD+CLDTPLCN HT   DVLW G P+VTLP E +A+RVA S  LAT    E+
Sbjct: 1   EYIRRSSLADLCLDTPLCNAHTAGTDVLWAGLPMVTLPLEKMATRVAGSLCLATGVGEEM 60

Query: 181 IARTHKEYQDIAIRLGTDR 199
           I  + KEY++ A+ L  +R
Sbjct: 61  IVNSMKEYEERAVYLALNR 79


>gi|376003415|ref|ZP_09781226.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira sp. PCC 8005]
 gi|375328218|emb|CCE16979.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira sp. PCC 8005]
          Length = 746

 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 58/160 (36%), Positives = 84/160 (52%), Gaps = 5/160 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA----VGEANIQ 99
           RQ   +PE+AIVY    +  K +P   Q+ + +LK VPNS L +  F       G     
Sbjct: 543 RQDINIPENAIVYFTAQRGPKYNPELAQLQMQILKQVPNSYLIIKGFDKEQSIAGLFFEL 602

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           A  Q +  +Q R L  +V  ++ H     +ADV LD+   NG TT+++ LW G P+VT  
Sbjct: 603 AEEQGVSREQLR-LTGSVLLEQTHRANLAIADVVLDSYPYNGATTTLETLWMGIPLVTRV 661

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           GE  A+R + + +   G  E IA T +EY +  +RLGTD 
Sbjct: 662 GEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTDE 701


>gi|253996290|ref|YP_003048354.1| hypothetical protein Mmol_0918 [Methylotenera mobilis JLW8]
 gi|253982969|gb|ACT47827.1| Tetratricopeptide TPR_2 repeat protein [Methylotenera mobilis JLW8]
          Length = 693

 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/144 (32%), Positives = 78/144 (54%), Gaps = 5/144 (3%)

Query: 48  GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN---IQATAQA 104
            L  D I +  FN++ KI PS +++W  +L AVPNS L L   P  G  +    +  + A
Sbjct: 488 ALSNDFITFGCFNRVEKITPSVIKLWSGLLNAVPNSKLLLGAMPKDGSYDGVINELASHA 547

Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
           +  D  R++  + +  + +++   L D+CLDT   NG TT+    W G P + L G++L 
Sbjct: 548 ISAD--RLIIHHRSTMDNYLKLHNLVDICLDTFPSNGVTTTCHAAWMGVPTLCLEGQSLM 605

Query: 165 SRVAASQLATLGCPELIARTHKEY 188
           SR A + +  LG PE + ++H ++
Sbjct: 606 SRGAMAVMLHLGLPEFVVKSHDDF 629


>gi|197120103|ref|YP_002140530.1| hypothetical protein [Geobacter bemidjiensis Bem]
 gi|197089463|gb|ACH40734.1| TPR domain protein [Geobacter bemidjiensis Bem]
          Length = 589

 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 56/152 (36%), Positives = 75/152 (49%), Gaps = 4/152 (2%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL---GLDQ 109
           A+ + +FN L K+ P  + +W  VL AVP S L L   P       +  A      G+  
Sbjct: 396 AVTFGSFNNLAKVTPEVIALWCRVLHAVPGSRLLLKGRPFEDSGVRERIASQFSTGGIGG 455

Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
            RI L         H+ +    D+ LDT   NG TT+ + LW G PVVTL G   A+R  
Sbjct: 456 ERIELNPGEPESSAHLAQYGRVDIALDTFPYNGTTTTCEALWMGVPVVTLAGTRHAARTG 515

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           AS LA  G  EL+A   +EY +IA R+  DR+
Sbjct: 516 ASILANCGLDELVAENEEEYLEIARRMAADRE 547


>gi|423066870|ref|ZP_17055660.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
 gi|406711635|gb|EKD06835.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
          Length = 746

 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 58/160 (36%), Positives = 84/160 (52%), Gaps = 5/160 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA----VGEANIQ 99
           RQ   +PE+AIVY    +  K +P   Q+ + +LK VPNS L +  F       G     
Sbjct: 543 RQDINIPENAIVYFTAQRGPKYNPELAQLQMQILKQVPNSYLIIKGFDKEQSIAGLFFEL 602

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           A  Q +  +Q R L  +V  ++ H     +ADV LD+   NG TT+++ LW G P+VT  
Sbjct: 603 AEEQGVSREQLR-LTGSVLLEQTHRANLAIADVVLDSYPYNGATTTLETLWMGIPLVTRV 661

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           GE  A+R + + +   G  E IA T +EY +  +RLGTD 
Sbjct: 662 GEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTDE 701


>gi|428221015|ref|YP_007105185.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Synechococcus sp. PCC 7502]
 gi|427994355|gb|AFY73050.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Synechococcus sp. PCC 7502]
          Length = 604

 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 60/165 (36%), Positives = 86/165 (52%), Gaps = 13/165 (7%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN------ 97
           R    +PE AIVY      YK  P T+++ + ++KAVP+S L ++K    G AN      
Sbjct: 403 RSLLDIPEGAIVYFTSQSGYKRHPDTIKLQLQIIKAVPHSFL-IIK----GTANQETLKT 457

Query: 98  -IQATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
                 + +GLD+ R+ F  +     EH     +ADV LDT   NG TT+M+ LW   P+
Sbjct: 458 LFYEIGEEVGLDRDRLRFLPMTKGVLEHRANLDIADVVLDTYPYNGATTTMETLWMCIPM 517

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           VT  G+  A+R + + +   G  E IA T KEY +  IRLG D +
Sbjct: 518 VTKVGQQFAARNSYTMMMNAGITEGIAWTDKEYVEWGIRLGRDEN 562


>gi|309256371|gb|ADO61013.1| spindly [Helianthus annuus]
          Length = 266

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/149 (36%), Positives = 80/149 (53%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           + + +FN L KI P  LQ+W  +L AVPNS L +   P   ++  Q   +T + LGL+ H
Sbjct: 11  VTFGSFNNLAKITPKVLQVWAKILCAVPNSRLIVKCKPFCCDSVRQKFLSTLEQLGLESH 70

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 71  RVDLVPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMGGLVHAHNVGV 130

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  +G   L+A+T  EY  +AI L +D
Sbjct: 131 SLLTAVGLERLVAKTEDEYIRLAIELASD 159


>gi|94264577|ref|ZP_01288362.1| Glycosyl transferase, family 2 [delta proteobacterium MLMS-1]
 gi|93454995|gb|EAT05229.1| Glycosyl transferase, family 2 [delta proteobacterium MLMS-1]
          Length = 994

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 51/164 (31%), Positives = 83/164 (50%), Gaps = 1/164 (0%)

Query: 37  QSIVITSRQQYGLP-EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           Q  V+ +R + GLP ED ++   FN++ K+       W+++LK  P + LW+L       
Sbjct: 407 QRPVVPARAELGLPDEDVLIVGMFNKINKVRGEVWAAWMSILKLAPKAWLWVLDPGLAAR 466

Query: 96  ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
            N+   A+   + + RILF+    +  H+ R    D+ LD     GHT++ D L+ G PV
Sbjct: 467 DNLLKAAREHQVAEKRILFAPALPQAGHLSRLGACDLMLDPWPYGGHTSTTDALFAGVPV 526

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           +TL G   A RV+A  L +   P L+  + ++Y   A  L  +R
Sbjct: 527 ITLAGRNFAGRVSAGLLKSANLPALVRGSVQDYIQTAAGLLNNR 570


>gi|209527309|ref|ZP_03275818.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
 gi|209492235|gb|EDZ92581.1| Tetratricopeptide TPR_2 repeat protein [Arthrospira maxima CS-328]
          Length = 758

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 57/164 (34%), Positives = 83/164 (50%), Gaps = 3/164 (1%)

Query: 38  SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
           S+   +R    +P+DAIVY    +  K +P   ++ + +L+ VPNS   +  F      +
Sbjct: 548 SVPTITRADLEIPDDAIVYLGLQKGPKYNPHIAKLQLEILREVPNSYFLVKGFGQQDSFS 607

Query: 98  --IQATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
                 A   G+   RI F   V  +E H     +ADV LDT   NG TT+M+ LW G P
Sbjct: 608 KFFFDIANQQGITTDRIKFIPTVKLEENHRANLLIADVVLDTYPYNGATTTMETLWMGIP 667

Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +VT  GE  A+R + + +   G  E IA T +EY +  +RLGTD
Sbjct: 668 LVTRVGEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTD 711


>gi|309256349|gb|ADO61002.1| spindly [Helianthus annuus]
          Length = 266

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/149 (36%), Positives = 80/149 (53%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           + + +FN L KI P  LQ+W  +L AVPNS L +   P   ++  Q   +T + LGL+ H
Sbjct: 11  VTFGSFNNLAKITPKVLQVWAKILCAVPNSRLIVKCKPFCCDSVRQKFLSTLEQLGLESH 70

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 71  RVDLVPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMGGLVHAHNVGV 130

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  +G   L+A+T  EY  +AI L +D
Sbjct: 131 SLLTAVGLXRLVAKTEDEYIRLAIELASD 159


>gi|423066869|ref|ZP_17055659.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
 gi|406711634|gb|EKD06834.1| tetratricopeptide TPR_2 repeat protein [Arthrospira platensis C1]
          Length = 745

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 57/164 (34%), Positives = 83/164 (50%), Gaps = 3/164 (1%)

Query: 38  SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
           S+   +R    +P+DAIVY    +  K +P   ++ + +L+ VPNS   +  F      +
Sbjct: 535 SVPTITRADLEIPDDAIVYLGLQKGPKYNPHIAKLQLEILREVPNSYFLVKGFGQQDSFS 594

Query: 98  --IQATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
                 A   G+   RI F   V  +E H     +ADV LDT   NG TT+M+ LW G P
Sbjct: 595 KFFFDIANQQGITTDRIKFIPTVKLEENHRANLLIADVVLDTYPYNGATTTMETLWMGIP 654

Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +VT  GE  A+R + + +   G  E IA T +EY +  +RLGTD
Sbjct: 655 LVTRVGEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTD 698


>gi|409991249|ref|ZP_11274527.1| hypothetical protein APPUASWS_09484 [Arthrospira platensis str.
           Paraca]
 gi|409937885|gb|EKN79271.1| hypothetical protein APPUASWS_09484 [Arthrospira platensis str.
           Paraca]
          Length = 742

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 57/160 (35%), Positives = 85/160 (53%), Gaps = 5/160 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA----VGEANIQ 99
           R+   +PE+AIVY    +  K +P   Q+ + +LK VPNS L +  F       G     
Sbjct: 539 REDINIPENAIVYFTAQRGPKYNPELAQLQMQILKQVPNSYLVIKGFDKEQSIAGLFFEL 598

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           A  Q +  +Q R L  +V+ ++ H     +ADV LD+   NG TT+++ LW G P+VT  
Sbjct: 599 AEEQGVSSEQLR-LTGSVSTEQTHRANLLIADVVLDSYPYNGATTTLETLWMGIPLVTRV 657

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           GE  A+R + + +   G  E IA T +EY +  +RLGTD 
Sbjct: 658 GEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTDE 697


>gi|291566900|dbj|BAI89172.1| hypothetical protein [Arthrospira platensis NIES-39]
          Length = 742

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 57/160 (35%), Positives = 85/160 (53%), Gaps = 5/160 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA----VGEANIQ 99
           R+   +PE+AIVY    +  K +P   Q+ + +LK VPNS L +  F       G     
Sbjct: 539 REDINIPENAIVYFTAQRGPKYNPELAQLQMQILKQVPNSYLVIKGFDKEQSIAGLFFEL 598

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           A  Q +  +Q R L  +V+ ++ H     +ADV LD+   NG TT+++ LW G P+VT  
Sbjct: 599 AEEQGVSSEQLR-LTGSVSTEQTHRANLLIADVVLDSYPYNGATTTLETLWMGIPLVTRV 657

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           GE  A+R + + +   G  E IA T +EY +  +RLGTD 
Sbjct: 658 GEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGTDE 697


>gi|428221016|ref|YP_007105186.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Synechococcus sp. PCC 7502]
 gi|427994356|gb|AFY73051.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Synechococcus sp. PCC 7502]
          Length = 734

 Score = 87.4 bits (215), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 59/167 (35%), Positives = 90/167 (53%), Gaps = 15/167 (8%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI---- 98
           +R+Q  +P + I+Y +  +  K  P T+++ + +LK VPNS   L+K   + + N     
Sbjct: 533 TREQLEVPANGIIYFSSQKSDKRHPETVRLQMQILKQVPNSYF-LIK--GLSDQNSIKDF 589

Query: 99  ---QATAQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGT 153
               A  + +  DQ R L     ++EE+  RG L  ADV LDT   NG TT+M+VLW+G 
Sbjct: 590 FYEMAELEGVNRDQLRFL---PYSREENEHRGNLPIADVVLDTYPYNGATTTMEVLWSGI 646

Query: 154 PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           P+VT  GE  A+R + + +   G  E IA T +EY    +R G + D
Sbjct: 647 PLVTRVGEQFAARNSYTMMMNAGISEGIAWTPEEYVAWGVRFGLEED 693


>gi|334118799|ref|ZP_08492887.1| hypothetical protein MicvaDRAFT_5391 [Microcoleus vaginatus FGP-2]
 gi|333459029|gb|EGK87644.1| hypothetical protein MicvaDRAFT_5391 [Microcoleus vaginatus FGP-2]
          Length = 746

 Score = 87.4 bits (215), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/156 (37%), Positives = 83/156 (53%), Gaps = 9/156 (5%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL---WLLKFPAVGEANIQ-ATAQA 104
           +P DA++Y      YK +P+T ++ + ++KAVPNS     W  +  ++    IQ A ++ 
Sbjct: 548 IPSDAVIYYCSQMGYKYNPNTARLQMQIIKAVPNSYFVTKWFAEQESLKNFFIQIAESEG 607

Query: 105 LGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           L  D+ R L     A  E V R  L  ADV LDT   NG TT+M+ LW   P+VT  G+ 
Sbjct: 608 LSADRLRFL---PHAPSEAVHRANLGIADVVLDTYPYNGATTTMETLWMCIPLVTRVGQQ 664

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            +SR +   +   G  E IA T +EY +  IRLG D
Sbjct: 665 FSSRNSYGMMMNAGILEGIAWTDEEYVEWGIRLGKD 700


>gi|443312171|ref|ZP_21041791.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Synechocystis sp. PCC 7509]
 gi|442777849|gb|ELR88122.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Synechocystis sp. PCC 7509]
          Length = 730

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 57/161 (35%), Positives = 92/161 (57%), Gaps = 9/161 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA--- 100
           R+   +P DA++Y +    YK +P T+++ + ++K VPNS   L+K  A  E +++    
Sbjct: 521 REHLEIPPDAVIYYSGQTGYKRNPETVRLQMQIIKQVPNSYF-LVKGKA-DETSVKQFFT 578

Query: 101 -TAQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
             A+A G+D +R+ F     + E V R  L  ADV LDT   NG TT+++ LW G P+VT
Sbjct: 579 QVAEAEGVDPNRLKFIP-RDESELVHRANLGVADVVLDTYPYNGATTTLETLWMGVPLVT 637

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             GE  ++R + + +  +G  E IA T +EY +  +RLG +
Sbjct: 638 RVGEQFSARNSYTFMMNVGVTEGIAWTDEEYVEWGVRLGKE 678


>gi|428175309|gb|EKX44200.1| hypothetical protein GUITHDRAFT_140027 [Guillardia theta CCMP2712]
          Length = 806

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 45/118 (38%), Positives = 72/118 (61%), Gaps = 3/118 (2%)

Query: 82  NSILWLLKFP--AVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLC 139
           +S +W+LK       E  ++A A+  G+D+ R+ F+  A  ++HV     AD+ LDT L 
Sbjct: 634 SSQIWILKLNEGTSAEPQLKAEARRQGIDEGRLFFTGSADLKDHVAYKSAADLLLDTDLY 693

Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLG-CPELIARTHKEYQDIAIRLG 196
           N H+T+ D L+ G P++TLPG  ++SR+ AS    LG    ++AR  K+Y++IA+RL 
Sbjct: 694 NAHSTAADALFAGVPLITLPGTRMSSRIGASIANGLGHASVMVARDLKDYEEIAVRLA 751


>gi|403376314|gb|EJY88133.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Oxytricha trifallax]
          Length = 236

 Score = 87.0 bits (214), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 52/155 (33%), Positives = 84/155 (54%), Gaps = 6/155 (3%)

Query: 48  GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGL 107
           G+P +A V+  FNQ ++ID  +L+    + + VP SIL  +     G+ +I A A+ L +
Sbjct: 2   GIPRNAFVFGIFNQYFQIDSLSLETGFKIKQQVPGSILLFVLVNKEGQQDIVAKARQLEV 61

Query: 108 DQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
           D + + F +   K + +V R   AD+ LDT   N    + D+LW+G P++TL  +  A+R
Sbjct: 62  DLNEVYFLSQDIKSQVNVNRMFTADLALDTLQVNARIVNADLLWSGVPLLTLKSDNWANR 121

Query: 167 VAASQLATLGCP-----ELIARTHKEYQDIAIRLG 196
           V AS L+ LG        L+A T ++Y   A+ L 
Sbjct: 122 VGASMLSGLGLSVQQYDSLVASTVQQYIQKAVDLA 156


>gi|186681176|ref|YP_001864372.1| hypothetical protein Npun_F0677 [Nostoc punctiforme PCC 73102]
 gi|186463628|gb|ACC79429.1| Tetratricopeptide TPR_2 repeat protein [Nostoc punctiforme PCC
           73102]
          Length = 736

 Score = 87.0 bits (214), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 58/161 (36%), Positives = 86/161 (53%), Gaps = 7/161 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQA 100
           R    +P DA+VY +  + +K  P+T Q+ + ++K VPNS   L+K  A  EA       
Sbjct: 534 RDSLDIPSDAVVYLSAQRGFKRHPNTAQLQMRIIKEVPNSYF-LIKGLADSEAVQKFFMQ 592

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLA--DVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
            A+  G++  R+ F  +   E  V R  LA  DV LDT   NG TT+++ LW G P+VT 
Sbjct: 593 LAEQEGVECSRLRFLQLDPSES-VHRANLAIADVILDTYPYNGATTTLETLWMGIPLVTR 651

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            G+  A+R + + +   G  E IA T +EY +  +RLG D 
Sbjct: 652 VGQQFAARNSYTMMMNAGIAEGIAWTDEEYVEWGVRLGKDE 692


>gi|119510220|ref|ZP_01629358.1| TPR repeat protein [Nodularia spumigena CCY9414]
 gi|119465170|gb|EAW46069.1| TPR repeat protein [Nodularia spumigena CCY9414]
          Length = 743

 Score = 87.0 bits (214), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 61/167 (36%), Positives = 93/167 (55%), Gaps = 17/167 (10%)

Query: 44  RQQYGLPEDAIVYCNFNQLYK--IDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
           R+Q  +PEDA+VY    + YK  +D + LQ  + ++K VPNS L L+K    GEA+ QA+
Sbjct: 539 REQLDIPEDAVVYFVTQKGYKRHLDHAKLQ--IAIIKEVPNSYL-LIK----GEADEQAS 591

Query: 102 -------AQALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGT 153
                  A++  +D +R  F  + + E  H     LADV LDT   NG TT+++ LW   
Sbjct: 592 EKFYLDLAKSQNVDFNRFRFLPLVSSEAIHRANLSLADVVLDTYPYNGATTTLETLWREI 651

Query: 154 PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           P+VT  G+  A+R + + +   G  E IA + +EY +  +RLG D +
Sbjct: 652 PLVTKVGQQFAARNSYTMMINAGITEGIAWSDEEYVEWGVRLGKDEN 698


>gi|381160369|ref|ZP_09869601.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
 gi|380878433|gb|EIC20525.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
          Length = 721

 Score = 86.7 bits (213), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 53/151 (35%), Positives = 78/151 (51%), Gaps = 5/151 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
           I + +FN + KI  ST+++W  VL    +S L +L+  +  E +++    A   + GL  
Sbjct: 530 ITFGSFNNVAKISDSTIKLWARVLDESKDSRL-VLRDKSFTELSMRKRMLARFGSFGLAA 588

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+       + E++   +  DV LD     G TT+ D LW G PVVTL G+T A R++ 
Sbjct: 589 ERLSLEPALGRREYLESYRHIDVALDPTPYGGGTTTADALWMGVPVVTLRGKTWAGRIST 648

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           S L  LG PE  A+   EY  IA  L  DR+
Sbjct: 649 SILNALGIPEFSAQDENEYVSIASSLARDRE 679


>gi|443475509|ref|ZP_21065456.1| TPR repeat-containing protein [Pseudanabaena biceps PCC 7429]
 gi|443019628|gb|ELS33689.1| TPR repeat-containing protein [Pseudanabaena biceps PCC 7429]
          Length = 741

 Score = 86.3 bits (212), Expect = 6e-15,   Method: Composition-based stats.
 Identities = 57/167 (34%), Positives = 91/167 (54%), Gaps = 5/167 (2%)

Query: 38  SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
           SI   SR+Q  +P+DA+VY      YK  P T+ + + ++ +V NS  +L+K     E  
Sbjct: 532 SIPSISREQLNIPDDAVVYLCTQTSYKRFPDTVHLQMQIINSVSNS-YFLIKGWGEEEGT 590

Query: 98  IQ---ATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGT 153
            +     A+  G+ + R+ F  +A  E EH    ++ADV LDT   NG TT+M+ LW   
Sbjct: 591 KRFFIDIAKKEGVSEDRLRFLPLADSEAEHRANLRIADVVLDTYPYNGATTTMETLWMCI 650

Query: 154 PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           P+VT  G+  ++R + + +  +G  E I+ T +EY +  IRLG D +
Sbjct: 651 PLVTKVGQQFSARNSYTMMMNVGVTEGISWTDEEYVEWGIRLGKDEN 697


>gi|428318228|ref|YP_007116110.1| Tetratricopeptide TPR_1 repeat-containing protein [Oscillatoria
           nigro-viridis PCC 7112]
 gi|428241908|gb|AFZ07694.1| Tetratricopeptide TPR_1 repeat-containing protein [Oscillatoria
           nigro-viridis PCC 7112]
          Length = 729

 Score = 86.3 bits (212), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 64/181 (35%), Positives = 90/181 (49%), Gaps = 15/181 (8%)

Query: 35  VPQSIV-ITSRQQYGLPE-----------DAIVYCNFNQLYKIDPSTLQMWVNVLKAVPN 82
           +P S V ++  Q+YG+             D IVY       K +   ++  + +LK VPN
Sbjct: 504 MPDSFVAVSGFQRYGVDPVALRRSNRIGLDQIVYLCVAPGRKFNYELVKAQIAILKQVPN 563

Query: 83  SIL--WLLKFPAVGEANIQATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLC 139
           SIL    L   AV EA      +A  + +HRI F    A +EEH     LADV LD+   
Sbjct: 564 SILVHKALGDAAVFEAAYHQACEAEKVSKHRIKFLPRFATEEEHRIIYLLADVLLDSYPY 623

Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           NG T +++ LW   PVVT  GE   SR+  S L TLG    +A +  EY ++ ++LG DR
Sbjct: 624 NGGTHTLEALWFNLPVVTRTGEQFLSRMGYSFLQTLGIETGVALSWSEYVEVGVKLGKDR 683

Query: 200 D 200
           +
Sbjct: 684 E 684


>gi|374999547|ref|YP_004975635.1| hypothetical protein AZOLI_p60052 [Azospirillum lipoferum 4B]
 gi|357428518|emb|CBS91475.1| conserved protein of unknown function; putative TPR domains
           [Azospirillum lipoferum 4B]
          Length = 583

 Score = 86.3 bits (212), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 54/149 (36%), Positives = 77/149 (51%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE---ANIQATAQALGLDQH 110
           + + +FN+L KI   T+++W  VL+ VP+S L L   P       A ++A   A G+   
Sbjct: 379 VTFGSFNKLSKITDHTVRLWAEVLRRVPDSRLLLKDRPLSDPGTAAGLRARFAAAGIAPD 438

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +     H+      D+ LD    NG  T+ D LW G P+VTL G   A+RV A
Sbjct: 439 RLDLVGFIKDAAGHLAAYNRIDIALDPHPYNGTITTCDTLWMGAPLVTLAGGRHAARVGA 498

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S +A++G PEL+A T   Y  IA  L  D
Sbjct: 499 SLMASIGLPELVAATPDRYAAIAAELAGD 527


>gi|347736536|ref|ZP_08869150.1| TPR repeat-containing protein [Azospirillum amazonense Y2]
 gi|346919949|gb|EGY01259.1| TPR repeat-containing protein [Azospirillum amazonense Y2]
          Length = 525

 Score = 86.3 bits (212), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 55/149 (36%), Positives = 80/149 (53%), Gaps = 5/149 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLDQ 109
           + +  FN   K++   +  W  +L  VP S L LLK   + EA +QA      +A G+D 
Sbjct: 341 VTFGCFNIAAKLNAEVIACWAQILNRVPGSRL-LLKSRGLDEAPVQARILEQFRAGGIDA 399

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+ F     + EHV   Q  D+ LD     G TT+++ LW G PVVTL G+T ++R + 
Sbjct: 400 ARLDFLGATPQAEHVACMQRVDIALDPFPFAGSTTTLESLWMGLPVVTLTGQTFSARHST 459

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
             L+ LG   L+A T ++Y DIA RL T+
Sbjct: 460 CFLSVLGLTSLMATTVEQYVDIAARLATN 488


>gi|218670286|ref|ZP_03519957.1| hypothetical protein RetlG_00835 [Rhizobium etli GR56]
          Length = 176

 Score = 86.3 bits (212), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%)

Query: 95  EANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
           +AN+    QA G+   RI+F   A  EEH+ R Q AD+ +DT   NGHTT+ + LW G P
Sbjct: 8   QANLLKQFQAAGISPKRIIFCPRAPYEEHIDRQQAADLGIDTFPVNGHTTTSEQLWGGLP 67

Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           V+T+ G   ASRV+ S L  +  PEL+A   + Y+D+A+ L  +
Sbjct: 68  VLTVKGTNFASRVSESLLRAIDLPELVAGDLQAYEDLAVELAQN 111


>gi|377820855|ref|YP_004977226.1| hypothetical protein BYI23_A014110 [Burkholderia sp. YI23]
 gi|357935690|gb|AET89249.1| TPR repeat-containing protein [Burkholderia sp. YI23]
          Length = 798

 Score = 85.9 bits (211), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 57/151 (37%), Positives = 69/151 (45%), Gaps = 1/151 (0%)

Query: 48  GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGL 107
            L    I +   N   K    TL MW  VL A P + L LL  P            ALG+
Sbjct: 603 ALASGHITFGCLNNPCKASDRTLNMWAAVLAATPGARLVLLS-PHGARERAGERLGALGV 661

Query: 108 DQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
           D  RI F +   ++ ++R     D+ LDT   NGHTTS+D LW G PV +  GET  SR 
Sbjct: 662 DIARIEFLDYQPRDAYLRTYDRIDIGLDTFPYNGHTTSLDALWMGVPVPSRAGETAVSRA 721

Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             S L  LG  +L A     Y DI  RL  D
Sbjct: 722 GLSFLMNLGLGDLAAHDDAAYVDIVSRLARD 752


>gi|356570865|ref|XP_003553604.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY-like [Glycine
           max]
          Length = 917

 Score = 85.9 bits (211), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 53/149 (35%), Positives = 81/149 (54%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           + + +FN L KI P  LQ+WV +L A+PNS L +   P   ++  Q   +T + LGL+  
Sbjct: 651 VTFGSFNNLAKITPKVLQVWVRILCAIPNSRLVVKCKPFCSDSVRQIFLSTLEQLGLEPL 710

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 711 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 770

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L+ +G   LIAR   EY ++A +L +D
Sbjct: 771 SLLSKVGLEHLIARNEDEYVELAQQLASD 799


>gi|224028913|gb|ACN33532.1| unknown [Zea mays]
          Length = 144

 Score = 85.9 bits (211), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 40/78 (51%), Positives = 52/78 (66%), Gaps = 1/78 (1%)

Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGC-P 178
           K EH+RR  LAD+ LDTPLCN HTT  D+LW G P++TLP E +A+RVA S     G   
Sbjct: 2   KNEHIRRSALADLFLDTPLCNAHTTGTDILWAGLPMITLPLEKMATRVAGSLCVATGLGE 61

Query: 179 ELIARTHKEYQDIAIRLG 196
           E+I  + K+Y+D A+ L 
Sbjct: 62  EMIVSSMKKYEDRAVELA 79


>gi|427737301|ref|YP_007056845.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Rivularia sp. PCC 7116]
 gi|427372342|gb|AFY56298.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Rivularia sp. PCC 7116]
          Length = 744

 Score = 85.9 bits (211), Expect = 9e-15,   Method: Composition-based stats.
 Identities = 58/160 (36%), Positives = 87/160 (54%), Gaps = 7/160 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---- 99
           R+   +PEDAIVY +     K  P T+++ + +LKAVPNS  +LL      E  IQ    
Sbjct: 542 RENLNIPEDAIVYFSGQGGCKRHPDTVRLQMRILKAVPNS--YLLIKGLADEKAIQNFFN 599

Query: 100 ATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
             A A G++   + F   V+++  H     +ADV LDT   NG TT+++ LW G P+VT 
Sbjct: 600 EIAIAEGVEPKYLRFLPMVSSEATHRANLGIADVVLDTYPYNGATTTLETLWMGIPLVTK 659

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            GE   +R + + +  +G  E I+ + +EY +  IRLG D
Sbjct: 660 VGEQFVARNSYTMMMNVGIKEGISWSDEEYLEWGIRLGKD 699


>gi|197120102|ref|YP_002140529.1| hypothetical protein [Geobacter bemidjiensis Bem]
 gi|197089462|gb|ACH40733.1| TPR domain protein [Geobacter bemidjiensis Bem]
          Length = 1106

 Score = 85.9 bits (211), Expect = 9e-15,   Method: Composition-based stats.
 Identities = 52/151 (34%), Positives = 78/151 (51%), Gaps = 4/151 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQATAQALGLDQH 110
           + + +FN   KI P T+ +W  VL+AVP S L L  +    A     ++ T    G+++ 
Sbjct: 429 VTFGSFNNPAKITPETVLLWSGVLRAVPGSQLLLKGYSLACAETRLRLEETFAGHGIERE 488

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L  N  +  +H+      D+ LD+   NG TT+ + LW G PVVTL G    SRV A
Sbjct: 489 RLELLGNTPSYRDHLSLYDRVDIALDSYPYNGTTTTCEALWMGVPVVTLAGSAHRSRVGA 548

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           S L  LG   L+A   +++  +A  L  DR+
Sbjct: 549 SILQALGLEGLVAHEARKFVVLAQALAGDRE 579


>gi|440683084|ref|YP_007157879.1| methyltransferase FkbM family [Anabaena cylindrica PCC 7122]
 gi|428680203|gb|AFZ58969.1| methyltransferase FkbM family [Anabaena cylindrica PCC 7122]
          Length = 1200

 Score = 85.9 bits (211), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 54/154 (35%), Positives = 80/154 (51%), Gaps = 8/154 (5%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL------LKFPAVGEANIQATAQALGL 107
             + +FN L K+ P T  +W+ +LKAVPNS L L      +  P + E  IQ      G+
Sbjct: 384 FTFSSFNNLRKLTPETFALWIEILKAVPNSRLVLKCASSDVFSPLIAE-KIQTPFVEQGI 442

Query: 108 DQHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
           D  RI L+   AA E+H+      D+ LD+    G TT+ + LW G P +T+ G     R
Sbjct: 443 DPKRIFLYGGYAADEDHLNLYNQVDLHLDSLPYTGCTTTCEALWMGVPTITVAGTRKMER 502

Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           ++AS L ++G  + IA +  EY + AI L  + D
Sbjct: 503 MSASILHSVGLDDCIAYSAVEYVEKAIALARNPD 536


>gi|17228058|ref|NP_484606.1| hypothetical protein all0562 [Nostoc sp. PCC 7120]
 gi|17129907|dbj|BAB72520.1| all0562 [Nostoc sp. PCC 7120]
          Length = 739

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 56/161 (34%), Positives = 86/161 (53%), Gaps = 7/161 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQA 100
           R Q  +P+DA+VY    + +K  P   ++ + ++K VPNS   L+K  +  E+     + 
Sbjct: 538 RDQLDIPDDAVVYFCGQRGFKRHPDITRLQLQIIKEVPNSYF-LIKGISDEESIKNFFEE 596

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
            A+  G+D  R+ F  + A E  V R  L  AD+ LDT   NG TT+++ LW   P+VT 
Sbjct: 597 LAEEEGVDYSRLRFLPIVATES-VHRANLGIADIVLDTYPYNGATTTLETLWMCIPMVTK 655

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            GE  A+R + + +   G  E IA T +EY +  +RLG D 
Sbjct: 656 VGEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGKDE 696


>gi|288963188|ref|YP_003453467.1| TPR repeat-containing protein [Azospirillum sp. B510]
 gi|288915440|dbj|BAI76923.1| TPR repeat-containing protein [Azospirillum sp. B510]
          Length = 654

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 54/155 (34%), Positives = 80/155 (51%), Gaps = 6/155 (3%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ----A 104
           L    + + +FN   K+    +++W  +LK VPN+ L LLK   +G+   +A  +     
Sbjct: 455 LDSGVVTFGSFNNTSKVTAEVVRVWAEILKRVPNARL-LLKSRQMGDDETRARYRNSFIT 513

Query: 105 LGLDQHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
            G+D  R+ L + + A + H+R     D+ LD    NG TT+ + LW G PVVTL G + 
Sbjct: 514 HGIDPDRVELLARIPAADGHLRAYDRLDIALDPFPYNGTTTTCEALWMGVPVVTLAGRSH 573

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            +RV AS L  +G  ELIA    EY   A+ L  D
Sbjct: 574 VARVGASLLTNVGLEELIASDEAEYIAKAVALAGD 608


>gi|196232228|ref|ZP_03131082.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
 gi|196223596|gb|EDY18112.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
          Length = 660

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 54/154 (35%), Positives = 78/154 (50%), Gaps = 8/154 (5%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNS-ILWLLKFPAVGEANIQATAQALGLDQHRI 112
           I +  FN   KI+   LQ+W  +L+ VP+S ++ LLK  A  E  ++A  Q  G+   R+
Sbjct: 464 ITFGCFNNFSKINERVLQLWARILQGVPDSRLILLLKTGAHREKTLEAI-QRHGIAAERL 522

Query: 113 LFSNVA------AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
           +F +        +    ++R    D+ LD    NG TT+ D LW G PV+ L G    SR
Sbjct: 523 VFHDYEPASLERSASHFLQRYAQVDIALDPFPYNGMTTTCDALWMGVPVIALRGTLGISR 582

Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            + S LA LG PE    +  EY + A+ L  DRD
Sbjct: 583 ASFSLLANLGLPEYAVASEDEYCETAVALAGDRD 616


>gi|392380152|ref|YP_004987310.1| conserved protein of unknown function [Azospirillum brasilense
           Sp245]
 gi|356882519|emb|CCD03531.1| conserved protein of unknown function [Azospirillum brasilense
           Sp245]
          Length = 550

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 54/150 (36%), Positives = 77/150 (51%), Gaps = 7/150 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQ 109
             + +FN L K++ STL +W  VL AVP + L L    L    V    + A A A G+++
Sbjct: 350 FTFGSFNNLTKLNASTLALWGRVLAAVPEARLMLKWRGLSGGGVARRVLDAFA-AHGVEE 408

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+L    A       R  L D  LD    NG TT+ D LW G PV+++ GE + SR  A
Sbjct: 409 SRLLLRGTAPDPYEDYR--LLDCALDPVFANGGTTTCDALWMGVPVLSIAGEAMISRWGA 466

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           + L ++G   L+     +Y  +A+RL TDR
Sbjct: 467 TMLGSVGLGGLVVEREDDYVALAVRLATDR 496


>gi|381160360|ref|ZP_09869592.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
 gi|380878424|gb|EIC20516.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
          Length = 914

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 75/145 (51%), Gaps = 4/145 (2%)

Query: 58  NFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL---GLDQHRI-L 113
           +FN L+K+   +L +W  VL A+P + L L  F      +     +AL   G+D+ R+ L
Sbjct: 711 SFNNLFKVREESLALWARVLHALPQAHLALKDFKGANRVHQLRVLKALAAQGIDESRVAL 770

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
                +  EH+      D+ LDT   N  TT+ D LW G P+VTL G+ +A   AAS L+
Sbjct: 771 LRATPSWSEHMAYYNRLDIALDTLPFNSATTAFDALWMGCPLVTLAGDRMAGLQAASALS 830

Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
            LG  + +AR   +Y  IA+ L  D
Sbjct: 831 GLGRTQWVARDEDDYVRIAVELARD 855


>gi|428210827|ref|YP_007083971.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Oscillatoria acuminata PCC 6304]
 gi|427999208|gb|AFY80051.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Oscillatoria acuminata PCC 6304]
          Length = 746

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 57/160 (35%), Positives = 87/160 (54%), Gaps = 7/160 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
           R +  +P+DA+VY +    YK  P T ++ + ++KAVPNS   +  F    ++    IQ 
Sbjct: 543 RDRLNIPQDAVVYFSGQTGYKRHPDTARLQMQIIKAVPNSYFLIKGFAEQESIKRFFIQ- 601

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
            A+  G++  R+ F   A  E  + R  L  ADV LDT   NG TT+++ LW G PVVT 
Sbjct: 602 LAEEEGVEGDRLRFLPPAPSEA-IHRANLGIADVVLDTYPYNGATTTLETLWMGIPVVTK 660

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            G+  A+  + + +  +G  E I+ T +EY +  IRLG D
Sbjct: 661 VGKQFAACNSYTMMMNVGVSEGISWTDQEYVEWGIRLGKD 700


>gi|218437263|ref|YP_002375592.1| hypothetical protein PCC7424_0256 [Cyanothece sp. PCC 7424]
 gi|218169991|gb|ACK68724.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
          Length = 719

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 57/160 (35%), Positives = 82/160 (51%), Gaps = 6/160 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---- 99
           R+   +  DAI+Y +    +K  P T+++ + +LK VPNS  + L      E  IQ    
Sbjct: 520 REDLNIANDAIIYLSTQTGFKRHPDTIRLQMKILKEVPNS--YFLIQGLTDEEKIQQLFT 577

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
             AQ  G++ +R+ F      E +     +ADV LDT   NG TT++D+LW G PVVT  
Sbjct: 578 EIAQQEGVNPNRLRFLPYYPTETYRANLTIADVVLDTYPFNGGTTTLDILWQGIPVVTRV 637

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           G+  ASR   + L   G  E IA   ++Y    I+LGTD 
Sbjct: 638 GQQWASRNGYTLLINAGITEGIAWNDEDYIKWGIQLGTDE 677


>gi|254431323|ref|ZP_05045026.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           putative [Cyanobium sp. PCC 7001]
 gi|197625776|gb|EDY38335.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase,
           putative [Cyanobium sp. PCC 7001]
          Length = 747

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 52/148 (35%), Positives = 77/148 (52%), Gaps = 5/148 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ----ALGLDQ 109
           +V+  FN + K+ PST+ +W  +L+AVP S L LLK P++ +  ++ + Q    A G+  
Sbjct: 539 VVFGCFNNVMKLSPSTIALWARILQAVPESSL-LLKAPSLRDGLVRHSFQERFAAHGIGP 597

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           HR+           ++     D+ LD    NG TT++  LW G PVV L G    SR+ A
Sbjct: 598 HRLRLEGPTELSVMMQTYGQIDIGLDPFPYNGGTTTLQALWMGVPVVVLEGGNFVSRMGA 657

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGT 197
           S L +LG PE I      Y  IA+ L +
Sbjct: 658 SFLTSLGRPEWIGADPDSYARIAVALAS 685


>gi|256772634|emb|CAX46402.1| putative SPINDLY protein [Rosa lucieae]
          Length = 916

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 81/149 (54%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQH 110
           I + +FN L KI P  LQ+W  +L A+PNS L +   P   ++      +T + LGL+  
Sbjct: 662 ITFGSFNNLAKITPKVLQVWARILSAIPNSRLVVKCKPFSCDSVRERFLSTLEQLGLEPL 721

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 722 RVDLLPLILLNYDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 781

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L+T+G   LIA+  +EY  +A++L +D
Sbjct: 782 SILSTVGLGNLIAKNEEEYVQLAVQLASD 810


>gi|113476017|ref|YP_722078.1| hypothetical protein Tery_2387 [Trichodesmium erythraeum IMS101]
 gi|110167065|gb|ABG51605.1| Tetratricopeptide TPR_2 [Trichodesmium erythraeum IMS101]
          Length = 745

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 50/163 (30%), Positives = 88/163 (53%), Gaps = 9/163 (5%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT- 101
            R+  G+P DA+VY    + +K +P T+++ + +LK VP S      F  +G+++  +  
Sbjct: 539 KREDLGIPSDAVVYFTAQRGHKYNPDTVRLQMKILKEVPQS---YFLFKNLGDSDTLSEL 595

Query: 102 ----AQALGLDQHRIL-FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVV 156
               A++ G+   R++    V+ ++ H     +AD+ LDT   NG TT+++ LW   P+V
Sbjct: 596 LIDIAESEGISSDRLIPIEEVSLEQTHRANLGIADIVLDTYPYNGATTTLETLWMCIPMV 655

Query: 157 TLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           T  G+  A+R + + +   G  E IA + +EY +  IRLG D 
Sbjct: 656 TRVGQQFAARNSYTMMMNTGITEGIAWSDEEYVEWGIRLGKDE 698


>gi|428162823|gb|EKX31932.1| hypothetical protein GUITHDRAFT_121883 [Guillardia theta CCMP2712]
          Length = 672

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 55/184 (29%), Positives = 87/184 (47%), Gaps = 33/184 (17%)

Query: 18  NGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKI------DPSTLQ 71
           N L+ N+   +  + +E P      +R++ G  +D I+   FNQ+         D  T  
Sbjct: 444 NILSKNEKVRRITSDDEQP------TRRELGFDDDDIILACFNQVNAHAVVVCPDNRTQI 497

Query: 72  MWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLAD 131
           M + + K  P                     +  G+D  RI+      +E   R   LAD
Sbjct: 498 MPLQLYKIEP---------------------EKRGVDPSRIVSHEQFPREREFRVKGLAD 536

Query: 132 VCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDI 191
           V LDTPL N HTT+ DVLW+G P++TLPGE+   RVA+  + + G   ++ R   +Y+++
Sbjct: 537 VFLDTPLFNAHTTAGDVLWSGVPILTLPGESFVQRVASGMVKSAGMQWMVTRNLLDYEEL 596

Query: 192 AIRL 195
           A+ L
Sbjct: 597 AVAL 600


>gi|427707007|ref|YP_007049384.1| hypothetical protein Nos7107_1593 [Nostoc sp. PCC 7107]
 gi|427359512|gb|AFY42234.1| Tetratricopeptide TPR_1 repeat-containing protein [Nostoc sp. PCC
           7107]
          Length = 739

 Score = 85.1 bits (209), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/162 (36%), Positives = 85/162 (52%), Gaps = 7/162 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R +  +P DA+VY +  + +K  P   ++ + +LK VPNS   L+K  A  E+  +  AQ
Sbjct: 538 RDELDIPNDAVVYFSGQRGFKRHPDITRLQLKILKEVPNSYF-LIKGLADEESTHKFFAQ 596

Query: 104 ALG---LDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
                 +D  R+ F    A  E V R  L  AD+ LDT   NG TT+++ LW   P+VT 
Sbjct: 597 IAKEEEVDYARLRFL-ADAPTEGVHRANLDIADIVLDTYPYNGATTTLETLWMCIPMVTR 655

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            GE  A+R + + +   G  E IA T +EY +  IRLG D D
Sbjct: 656 VGEQFAARNSYTMMMNAGISEGIAWTDEEYVEWGIRLGKDAD 697


>gi|196232945|ref|ZP_03131794.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
 gi|196222923|gb|EDY17444.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
          Length = 808

 Score = 85.1 bits (209), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 49/142 (34%), Positives = 70/142 (49%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           + +   N   K++   L++W  VL AV ++ L L               +  G+   R+ 
Sbjct: 615 LTFGCLNNFCKVNEPLLRLWARVLVAVKDARLVLSSPRGSHRQRTLDFLERAGVAPRRVE 674

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F     +  ++      DV LDT   NGHTTS+D LW G PVV+L G+   SR   SQL+
Sbjct: 675 FVEPCPRRAYLELYHQLDVVLDTFPYNGHTTSLDALWMGVPVVSLAGKPAVSRAGLSQLS 734

Query: 174 TLGCPELIARTHKEYQDIAIRL 195
            LG PEL+A +  EY  +A RL
Sbjct: 735 NLGLPELVAHSEDEYVAVAARL 756


>gi|242081821|ref|XP_002445679.1| hypothetical protein SORBIDRAFT_07g024110 [Sorghum bicolor]
 gi|241942029|gb|EES15174.1| hypothetical protein SORBIDRAFT_07g024110 [Sorghum bicolor]
          Length = 910

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 54/150 (36%), Positives = 81/150 (54%), Gaps = 6/150 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
           + + +FN L KI P  LQ+W  +L AVPNS L +   P   E NI+    +T + LGL+ 
Sbjct: 646 VTFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCE-NIRQKFLSTLEELGLES 704

Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
            R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V 
Sbjct: 705 LRVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGAVHAHNVG 764

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            S L+ +G   L+A+T  EY  +A+ L +D
Sbjct: 765 VSLLSKVGLGRLVAKTEDEYVSLALGLASD 794


>gi|212274891|ref|NP_001130413.1| uncharacterized protein LOC100191509 [Zea mays]
 gi|194689056|gb|ACF78612.1| unknown [Zea mays]
          Length = 567

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 54/150 (36%), Positives = 81/150 (54%), Gaps = 6/150 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
           + + +FN L KI P  LQ+W  +L +VPNS L +   P   E NI+    AT + LGL+ 
Sbjct: 302 VTFGSFNNLAKITPKVLQVWARILCSVPNSRLVVKCKPFCCE-NIRQKFLATLEELGLES 360

Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
            R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V 
Sbjct: 361 LRVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGAVHAHNVG 420

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            S L+ +G   L+A+T  EY  +A+ L +D
Sbjct: 421 VSLLSKVGLGRLVAKTEDEYVSLALDLASD 450


>gi|288963063|ref|YP_003453342.1| TPR repeat protein [Azospirillum sp. B510]
 gi|288915315|dbj|BAI76798.1| TPR repeat protein [Azospirillum sp. B510]
          Length = 1000

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 56/164 (34%), Positives = 85/164 (51%), Gaps = 10/164 (6%)

Query: 35  VPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL-LKFPAV 93
           VPQ +   SR   G+PED  V+  ++Q+ KI+  TL +W+ VL+  PNSIL L    P +
Sbjct: 789 VPQPL---SRSALGIPEDCFVFLGYHQVMKINKKTLDLWMEVLRNCPNSILILNWTNPWI 845

Query: 94  GEANIQATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDT-PLCNGHTTSMDVLWT 151
               I+  A++ G+D  RI+F      E EH+ R  +AD+ LDT P  +   T  D L+ 
Sbjct: 846 ----IRNLARSHGIDASRIMFFRFVRTEVEHLMRLGVADLFLDTNPFTSAALTGADALYM 901

Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
             P +TL    L SR     +  +G  + +    KE+ + A+ L
Sbjct: 902 KVPRITLVEANLYSRFGQIMMNGIGLNDWVCHDRKEFIERAVAL 945


>gi|334119237|ref|ZP_08493324.1| methyltransferase FkbM family [Microcoleus vaginatus FGP-2]
 gi|333458708|gb|EGK87325.1| methyltransferase FkbM family [Microcoleus vaginatus FGP-2]
          Length = 2742

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 56/169 (33%), Positives = 93/169 (55%), Gaps = 16/169 (9%)

Query: 43   SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF-PAV-----GEA 96
            +R   G+ E+++V+ +   LYK+ P   + W  ++ AVPNS+L+L+ F P+      G A
Sbjct: 2076 TRNSIGISEESVVFISSANLYKLVPELRETWAKIMAAVPNSVLFLMPFGPSWTNHYPGGA 2135

Query: 97   ---NIQATAQALGLDQHRIL----FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVL 149
               N++A     G++ +R+     F N A  +E ++   L DV LD+    G T+ +D L
Sbjct: 2136 FVNNMKAVFAKYGIESNRLRVLKSFPNRADVKEVLK---LGDVYLDSYPYAGTTSLVDPL 2192

Query: 150  WTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
              G P V   G+TL SR+ A+ + +L  P+LIA +   Y  +A+ LGT+
Sbjct: 2193 EVGLPTVVRDGDTLRSRMGAAVVRSLSMPDLIANSEASYIQLAVTLGTN 2241


>gi|356505394|ref|XP_003521476.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY-like [Glycine
           max]
          Length = 919

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 80/149 (53%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           + + +FN L KI P  LQ+W  +L A+PNS L +   P   ++  Q   +T + LGL+  
Sbjct: 652 VTFGSFNNLAKITPKVLQVWARILCAIPNSRLVVKCKPFCSDSVRQRFLSTLEQLGLEPL 711

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 712 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 771

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L+ +G   LIAR   EY ++A +L +D
Sbjct: 772 SLLSKVGLEHLIARNEDEYVELAQQLASD 800


>gi|186477743|ref|YP_001859213.1| TPR repeat-containing protein [Burkholderia phymatum STM815]
 gi|184194202|gb|ACC72167.1| TPR repeat-containing protein [Burkholderia phymatum STM815]
          Length = 708

 Score = 84.3 bits (207), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 54/152 (35%), Positives = 78/152 (51%), Gaps = 5/152 (3%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLD 108
           AI +  FN L K++ + + +W  VL A+P S L L    L  PAV  A + A   A G+D
Sbjct: 488 AITFGCFNHLVKLNEAVVALWARVLDAMPGSRLLLKTRQLDDPAVQHATL-ARFAAHGID 546

Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
             R++    + + E +      D+ LD     G TTS++ LW G PV+T  GE   S V 
Sbjct: 547 GARLMLEGQSPRAELLAAYNRVDIALDPFPYAGGTTSIEALWMGVPVLTRRGERFLSHVG 606

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            S + T+G P+ IA    +Y   A+R+G  RD
Sbjct: 607 ESIVDTVGLPDWIASDDADYIAKAVRVGVQRD 638


>gi|428227088|ref|YP_007111185.1| family 2 glycosyl transferase [Geitlerinema sp. PCC 7407]
 gi|427986989|gb|AFY68133.1| glycosyl transferase family 2 [Geitlerinema sp. PCC 7407]
          Length = 1249

 Score = 84.3 bits (207), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 56/181 (30%), Positives = 90/181 (49%), Gaps = 12/181 (6%)

Query: 30   ATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK 89
            + G E+  S ++ +RQ  G+ ++ +V+C+    +KI P+ LQ W+ +L+ VPNS+L L+ 
Sbjct: 891  SYGTEIASSSLVVTRQDLGIKDEQVVFCSGANFFKITPALLQSWLEILQNVPNSLLLLMP 950

Query: 90   F-----PAVGEANIQATAQ------ALGLDQHRILFSNVAAKEEHVRR-GQLADVCLDTP 137
            F      A  +   Q   Q       L  DQ  +L  +       V++  +LAD+ LD+ 
Sbjct: 951  FGPNWSNAYPKGYFQKYLQNAFVEAGLSKDQFLVLDPDPVPNRADVKQYFKLADIYLDSY 1010

Query: 138  LCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
               G T+ ++ L    P+VT  GE     + A  L  LG  +++A    EY   AI LGT
Sbjct: 1011 PFAGTTSLIEPLEVNLPIVTRVGEYFRGAMGAGILRALGLVQMVAADEAEYVRKAIALGT 1070

Query: 198  D 198
            D
Sbjct: 1071 D 1071


>gi|75318818|sp|O82039.1|SPY_PETHY RecName: Full=Probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY; AltName:
           Full=PhSPY
 gi|3319682|emb|CAA76834.1| SPINDLY protein [Petunia x hybrida]
          Length = 932

 Score = 84.3 bits (207), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 52/150 (34%), Positives = 83/150 (55%), Gaps = 6/150 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
           + + +FN L KI P  LQ+W  +L AVP+S L ++K    G  +++    +  + LGL+ 
Sbjct: 660 VTFGSFNNLAKITPKVLQVWARILCAVPHSRL-IVKCKPFGCDSVRQRFLSILEQLGLEP 718

Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
            R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V 
Sbjct: 719 QRVDLVPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMGGSVHAHNVG 778

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            S L T+G  +L+AR   EY ++AI+L +D
Sbjct: 779 VSLLKTVGLRKLVARNEDEYVELAIQLASD 808


>gi|356500631|ref|XP_003519135.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY-like [Glycine
           max]
 gi|401721241|gb|AFP99901.1| putative UDP-N-acetylglucosamin [Glycine max]
          Length = 928

 Score = 84.3 bits (207), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 51/149 (34%), Positives = 80/149 (53%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           + + +FN L KI P  LQ+W  +L A+PNS L +   P   ++  Q   +T + LGL+  
Sbjct: 658 VTFGSFNNLAKITPKVLQVWAKILCAIPNSRLVVKCKPFCCDSVRQRFLSTLEKLGLEPL 717

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 718 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 777

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L+ +G   LIA+   EY  +A++L +D
Sbjct: 778 SLLSKVGLGNLIAKNEDEYVKLAVKLASD 806


>gi|434386708|ref|YP_007097319.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Chamaesiphon minutus PCC 6605]
 gi|428017698|gb|AFY93792.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Chamaesiphon minutus PCC 6605]
          Length = 738

 Score = 84.3 bits (207), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 54/162 (33%), Positives = 87/162 (53%), Gaps = 5/162 (3%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK---FPAVGEANIQ 99
            R+   +P+DAI+Y +    +K  P T+++ + +LK+VPNS   L+K    P +      
Sbjct: 524 KRRDLNIPDDAIIYWSGQVGHKRHPDTVRLQLRILKSVPNSYF-LIKGDTDPDIIREFFG 582

Query: 100 ATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
             A   G++  R+ F  NV  +  H     +ADV LDT   NG TT++++LW G P+VT 
Sbjct: 583 KIAAEEGVEFDRLRFLGNVPDEYTHRANLGIADVALDTYPYNGATTTLEILWMGIPLVTR 642

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            G+   +R + + +   G  E IA   +EY +  I+LG DR+
Sbjct: 643 VGQQFIARNSYTFMLNAGIEEGIAWNAEEYVEWGIKLGLDRE 684


>gi|391232225|ref|ZP_10268431.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Opitutaceae bacterium TAV1]
 gi|391221886|gb|EIQ00307.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Opitutaceae bacterium TAV1]
          Length = 637

 Score = 84.3 bits (207), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 56/150 (37%), Positives = 77/150 (51%), Gaps = 6/150 (4%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLD 108
           ++ +  FN   K   S   +W  +L AVP S L LLK   + EA ++    A  Q  GL 
Sbjct: 439 SVTFGCFNSPTKFTDSQFALWARILAAVPASRL-LLKGAGLEEAPVRELLLARLQRAGLP 497

Query: 109 QHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
             R+ L         H+   +  D+ LDT   NG TT+ + LW G PV+TL G   A+RV
Sbjct: 498 ADRVELLPRTDDTASHLALYRRVDIALDTFPYNGTTTTCEALWMGRPVITLAGNRHAARV 557

Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGT 197
           +AS LA +GCPE IA   ++Y   AI L +
Sbjct: 558 SASLLAAIGCPEWIASCQEDYVARAIGLAS 587


>gi|428306910|ref|YP_007143735.1| hypothetical protein Cri9333_3397 [Crinalium epipsammum PCC 9333]
 gi|428248445|gb|AFZ14225.1| hypothetical protein Cri9333_3397 [Crinalium epipsammum PCC 9333]
          Length = 719

 Score = 84.3 bits (207), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 56/161 (34%), Positives = 88/161 (54%), Gaps = 9/161 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---- 99
           R   G+P DA++Y +    +K  P T ++ + ++K VPNS   L+K  +  EA I+    
Sbjct: 517 RDDLGIPADAVIYFSAQHGHKRHPDTTRLQMQIIKQVPNSYF-LIKGRS-DEATIKQFFT 574

Query: 100 ATAQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
             A++ G+D  R+++      E +V R  L  AD+ LDT   NG TT+++ LW G P+VT
Sbjct: 575 KIAESEGVDPSRLIYLPRDINE-YVHRANLGIADIVLDTYPYNGATTTLETLWMGVPLVT 633

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             G+  A+R + + L   G  E IA T  EY +  ++LG D
Sbjct: 634 RVGKQFAARNSYTFLMNAGVTEGIAWTDAEYVEWGVKLGLD 674


>gi|254413207|ref|ZP_05026978.1| hypothetical protein MC7420_694 [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196179827|gb|EDX74820.1| hypothetical protein MC7420_694 [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 745

 Score = 84.0 bits (206), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 57/160 (35%), Positives = 86/160 (53%), Gaps = 7/160 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA----VGEANIQ 99
           R    +P DA++Y +    +K  P T ++ + +LK VPNS   L+K  A    V +  IQ
Sbjct: 542 RDHLDIPNDAVIYLSAQMGHKRHPDTARLQMQILKQVPNSYF-LIKGIANQESVKKFFIQ 600

Query: 100 ATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
             A+  G+D  R+ F   VA +  H     +ADV LDT   NG TT+++ LW G P+VT 
Sbjct: 601 -IAEEEGVDCQRLRFLPEVALEATHRANLGIADVVLDTYPYNGATTTLETLWMGIPLVTR 659

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            G+  ++R + + +   G  E IA T +EY +  +RLG D
Sbjct: 660 VGQQFSARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGKD 699


>gi|332712009|ref|ZP_08431939.1| hypothetical protein LYNGBM3L_68530 [Moorea producens 3L]
 gi|332349337|gb|EGJ28947.1| hypothetical protein LYNGBM3L_68530 [Moorea producens 3L]
          Length = 344

 Score = 84.0 bits (206), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 83/159 (52%), Gaps = 5/159 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQA 100
           R    +P DA +Y +  + YK +P T ++ + ++KAVPNS   L+K  +  ++       
Sbjct: 147 RDHLDIPNDATIYLSSQKGYKRNPDTTRLQMKIIKAVPNSYF-LIKGKSDQDSITRFFNQ 205

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
            A+  G+D  R+ F   V  +  H     +ADV LDT   NG TT+++ LW G P+VT  
Sbjct: 206 IAEEEGVDCDRLRFLPEVPTEAVHRANLGIADVVLDTYPYNGATTTLETLWMGIPLVTRV 265

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           GE   +R + + +   G  E IA T  +Y +  IRLG D
Sbjct: 266 GEQFVARNSYTMMMNAGITEGIAWTDDQYIEWGIRLGKD 304


>gi|356534448|ref|XP_003535766.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY-like [Glycine
           max]
          Length = 929

 Score = 84.0 bits (206), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 51/149 (34%), Positives = 80/149 (53%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           + + +FN L KI P  LQ+W  +L A+PNS L +   P   ++  Q   +T + LGL+  
Sbjct: 659 VTFGSFNNLAKITPKVLQVWAKILCAIPNSRLVVKCKPFCCDSVRQRFLSTLEKLGLEPL 718

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 719 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 778

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L+ +G   LIA+   EY  +A++L +D
Sbjct: 779 SLLSKVGLGNLIAKNEDEYVKLALKLASD 807


>gi|411117158|ref|ZP_11389645.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Oscillatoriales cyanobacterium JSC-12]
 gi|410713261|gb|EKQ70762.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Oscillatoriales cyanobacterium JSC-12]
          Length = 741

 Score = 84.0 bits (206), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 55/160 (34%), Positives = 84/160 (52%), Gaps = 7/160 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA--- 100
           R+   +P DAIVY +  + YK    T ++ + ++K VPNS   +  F    + +IQA   
Sbjct: 537 REDLDIPSDAIVYLSTQKGYKRHRETAKLQMRIIKNVPNSYFLIKGF--ADQNSIQAFFY 594

Query: 101 -TAQALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
             A+  G++  ++ F   A  E  H     +ADV LDT   NG TT+++ LW G P+VT 
Sbjct: 595 QIAEEEGVNPEQLRFLPDAPSEAVHRANMMIADVVLDTFPYNGATTTLETLWMGVPLVTR 654

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            GE  A+R + + L  +G    IA T +EY +  +R G D
Sbjct: 655 VGEQFAARNSYTMLKNVGVEAGIAWTDEEYVEWGVRFGKD 694


>gi|16330340|ref|NP_441068.1| hypothetical protein slr1816 [Synechocystis sp. PCC 6803]
 gi|383322081|ref|YP_005382934.1| hypothetical protein SYNGTI_1172 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383325250|ref|YP_005386103.1| hypothetical protein SYNPCCP_1171 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383491134|ref|YP_005408810.1| hypothetical protein SYNPCCN_1171 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384436401|ref|YP_005651125.1| hypothetical protein SYNGTS_1172 [Synechocystis sp. PCC 6803]
 gi|451814498|ref|YP_007450950.1| hypothetical protein MYO_111820 [Synechocystis sp. PCC 6803]
 gi|1652829|dbj|BAA17748.1| slr1816 [Synechocystis sp. PCC 6803]
 gi|339273433|dbj|BAK49920.1| hypothetical protein SYNGTS_1172 [Synechocystis sp. PCC 6803]
 gi|359271400|dbj|BAL28919.1| hypothetical protein SYNGTI_1172 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359274570|dbj|BAL32088.1| hypothetical protein SYNPCCN_1171 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359277740|dbj|BAL35257.1| hypothetical protein SYNPCCP_1171 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407958259|dbj|BAM51499.1| hypothetical protein BEST7613_2568 [Synechocystis sp. PCC 6803]
 gi|451780467|gb|AGF51436.1| hypothetical protein MYO_111820 [Synechocystis sp. PCC 6803]
          Length = 717

 Score = 83.6 bits (205), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 53/161 (32%), Positives = 87/161 (54%), Gaps = 5/161 (3%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE---ANIQ 99
           SR+Q G+  + ++Y +     K +P+ L++ + +L++VP+S+ +L+K     E   A ++
Sbjct: 516 SREQLGIKPNTVIYLSVQNKIKSNPNILKLQLQILQSVPDSV-FLIKGGGDNELIRAMVE 574

Query: 100 ATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
             A+ +GLD HRI F      E  H     +ADV LDT   NG TT+++ LW   P+VT 
Sbjct: 575 QLAKEMGLDLHRIRFLPKDGDEMTHRANLSIADVVLDTYPYNGATTTLETLWMEVPLVTR 634

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            G+  ASR + + L  L   E +A   +EY     R G ++
Sbjct: 635 VGKQFASRNSYTFLTQLNIREGLAWNDQEYVYWGTRFGLEK 675


>gi|269839835|ref|YP_003324528.1| hypothetical protein Tter_2822 [Thermobaculum terrenum ATCC
           BAA-798]
 gi|269791565|gb|ACZ43705.1| Tetratricopeptide TPR_2 repeat protein [Thermobaculum terrenum ATCC
           BAA-798]
          Length = 529

 Score = 83.6 bits (205), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 54/148 (36%), Positives = 79/148 (53%), Gaps = 5/148 (3%)

Query: 55  VYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQH 110
           V+  FN+L KI P  L +W  +L A+P + L L    L+ P V  + +    +  G+   
Sbjct: 332 VFGCFNRLSKIGPEVLDLWAKILLALPRARLRLIATGLQDP-VTSSRLMRALEGRGVAGE 390

Query: 111 RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS 170
           R+   +   + + +      DV LDT   +G TTS++ LW G PVVTL G  +A R  +S
Sbjct: 391 RLELLSPMPRTDLLATYNDIDVALDTLPYSGCTTSLEALWMGVPVVTLEGADMAGRATSS 450

Query: 171 QLATLGCPELIARTHKEYQDIAIRLGTD 198
            L   G  EL++RT +EY DIA+ LG D
Sbjct: 451 LLRWAGLQELVSRTQEEYIDIALGLGRD 478


>gi|357148919|ref|XP_003574938.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY-like
           [Brachypodium distachyon]
          Length = 959

 Score = 83.6 bits (205), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 53/150 (35%), Positives = 81/150 (54%), Gaps = 6/150 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
           I + +FN L KI P  LQ+W  +L AVPNS L ++K    G  +I+    +T + LGL+ 
Sbjct: 651 ITFGSFNNLAKITPKVLQVWARILCAVPNSRL-VVKCKPFGCDSIRQKFLSTLEELGLES 709

Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
            R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V 
Sbjct: 710 IRVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVG 769

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            S L  +G   L+A++  EY  +A+ L +D
Sbjct: 770 VSLLTKVGLGRLVAKSEDEYVSLALDLASD 799


>gi|449526080|ref|XP_004170042.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY-like [Cucumis
           sativus]
          Length = 925

 Score = 83.6 bits (205), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 81/149 (54%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           I + +FN L KI P  L++W  +L A+PNS L +   P   ++  Q   +T + LGL+  
Sbjct: 660 ITFGSFNNLAKITPKVLEVWARILCAIPNSRLVVKCKPFCCDSVRQRFLSTLEQLGLESQ 719

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 720 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 779

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L+ +G   L+A+  +EY  +A++L +D
Sbjct: 780 SLLSKVGLGHLVAKNEEEYVKLALQLASD 808


>gi|375152092|gb|AFA36504.1| putative O-linked N-acetyl glucosamine transferase, partial [Lolium
           perenne]
          Length = 225

 Score = 83.6 bits (205), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 32/58 (55%), Positives = 41/58 (70%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
            R  YGLPED  ++  FNQLYK+DP     W N++K VPNS+LWLL+FPA GE  ++A
Sbjct: 168 KRSDYGLPEDKFIFACFNQLYKMDPEIFDTWCNIVKRVPNSVLWLLRFPATGEMRVKA 225


>gi|75318847|sp|O82422.1|SPY_HORVU RecName: Full=Probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY; AltName:
           Full=HvSPY
 gi|3617837|gb|AAC36055.1| gibberellin action negative regulator SPY [Hordeum vulgare subsp.
           vulgare]
 gi|326515544|dbj|BAK07018.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 944

 Score = 83.6 bits (205), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 79/149 (53%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           I + +FN L KI P  +Q+W  +L AVPNS L +   P   ++  Q   +T + LGL+  
Sbjct: 646 ITFGSFNNLAKITPKVMQVWARILCAVPNSRLVVKCKPFCCDSIRQKFLSTLEELGLESL 705

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 706 RVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 765

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  +G   L+A+T  EY  +A+ L +D
Sbjct: 766 SLLTKVGLGRLVAKTEDEYVSLALDLASD 794


>gi|334120476|ref|ZP_08494556.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
           vaginatus FGP-2]
 gi|333456454|gb|EGK85086.1| Tetratricopeptide TPR_1 repeat-containing protein [Microcoleus
           vaginatus FGP-2]
          Length = 726

 Score = 83.6 bits (205), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 62/181 (34%), Positives = 89/181 (49%), Gaps = 15/181 (8%)

Query: 35  VPQSIV-ITSRQQYGLPE-----------DAIVYCNFNQLYKIDPSTLQMWVNVLKAVPN 82
           +P S V ++  Q+YG+             D IVY       K +   ++  + +LK VPN
Sbjct: 504 MPDSFVAVSGFQRYGVDPVALRRSNRIGLDQIVYLCVAPGRKFNHELVKAQIAILKQVPN 563

Query: 83  SILW--LLKFPAVGEANIQATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLC 139
           SIL    L   AV EA      +A  + +HRI F    A +EEH     LADV LD+   
Sbjct: 564 SILIHKALGDAAVFEAAYHQACEAEKVSKHRIKFLPRFATEEEHRTIYLLADVLLDSYPY 623

Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           NG T +++ LW   PVVT  GE   SR+  S L  LG    +A +  EY ++ ++LG +R
Sbjct: 624 NGGTHTLEALWCNLPVVTRTGEQFLSRMGYSFLQALGIETGVALSWSEYVEVGVKLGKER 683

Query: 200 D 200
           +
Sbjct: 684 E 684


>gi|253702410|ref|YP_003023599.1| hypothetical protein GM21_3824 [Geobacter sp. M21]
 gi|251777260|gb|ACT19841.1| Tetratricopeptide TPR_2 repeat protein [Geobacter sp. M21]
          Length = 1106

 Score = 83.6 bits (205), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 51/149 (34%), Positives = 76/149 (51%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQATAQALGLDQH 110
           + + +FN   KI P T+ +W  VL+AVP S L L  +    A     ++      G+++ 
Sbjct: 429 VTFGSFNNPAKITPETVLLWSGVLRAVPGSHLLLKGYSLACAETRLRLEEAFAGHGIERE 488

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L  N  +  +H+      D+ LD+   NG TTS + LW G PVVTL G +  SRV A
Sbjct: 489 RLELMGNTPSYRDHLALYDRVDIALDSYPYNGTTTSCEALWMGVPVVTLAGSSHRSRVGA 548

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  LG   L+A   +++  +A  L  D
Sbjct: 549 SLLQALGLEGLVAHEARKFVALAAALAGD 577


>gi|449451161|ref|XP_004143330.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY-like [Cucumis
           sativus]
          Length = 920

 Score = 83.6 bits (205), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 81/149 (54%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           I + +FN L KI P  L++W  +L A+PNS L +   P   ++  Q   +T + LGL+  
Sbjct: 655 ITFGSFNNLAKITPKVLEVWARILCAIPNSRLVVKCKPFCCDSVRQRFLSTLEQLGLESQ 714

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 715 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 774

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L+ +G   L+A+  +EY  +A++L +D
Sbjct: 775 SLLSKVGLGHLVAKNEEEYVKLALQLASD 803


>gi|326512858|dbj|BAK03336.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 644

 Score = 83.6 bits (205), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 79/149 (53%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           I + +FN L KI P  +Q+W  +L AVPNS L +   P   ++  Q   +T + LGL+  
Sbjct: 346 ITFGSFNNLAKITPKVMQVWARILCAVPNSRLVVKCKPFCCDSIRQKFLSTLEELGLESL 405

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 406 RVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 465

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  +G   L+A+T  EY  +A+ L +D
Sbjct: 466 SLLTKVGLGRLVAKTEDEYVSLALDLASD 494


>gi|393770118|ref|ZP_10358624.1| tpr domain protein [Methylobacterium sp. GXF4]
 gi|392724408|gb|EIZ81767.1| tpr domain protein [Methylobacterium sp. GXF4]
          Length = 713

 Score = 83.6 bits (205), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 55/151 (36%), Positives = 78/151 (51%), Gaps = 6/151 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQ 109
           I +  FN+L K+  + L  W ++L AVP S L++    +  P V EA ++    A GL  
Sbjct: 520 ITFGCFNRLTKVCDAVLTTWASILDAVPESRLFMVVGDIGTPEVREA-VETRLTAAGLPL 578

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R++F    +   H    ++ D+ LD    NG TTS D L  G P V L G+  A+RV +
Sbjct: 579 DRVIFQPRVSSGYHELYHRV-DIALDPYPYNGGTTSFDTLSMGVPFVALSGDHAAARVGS 637

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           S L  +G PEL+A +   Y  IA  L  DRD
Sbjct: 638 SILRVIGLPELVADSQDAYVAIARDLALDRD 668


>gi|225440809|ref|XP_002281883.1| PREDICTED: probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY [Vitis vinifera]
 gi|297740152|emb|CBI30334.3| unnamed protein product [Vitis vinifera]
          Length = 914

 Score = 83.2 bits (204), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 53/149 (35%), Positives = 80/149 (53%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           I + +FN L KI P  LQ+W  +L AVPNS L +   P   ++  Q   +T + LGL+  
Sbjct: 658 ITFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSVRQRFLSTLEQLGLESL 717

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 718 RVDLLPLILLNHDHMQAYALMDISLDTFPYAGTTTTCESLFMGVPCVTMAGSVHAHNVGV 777

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  +G   L+A+T  EY  +A++L +D
Sbjct: 778 SLLNKVGLGRLVAKTEDEYVQLALQLASD 806


>gi|195423963|gb|ACF96937.1| SPINDLY [Sinningia speciosa]
          Length = 934

 Score = 83.2 bits (204), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 79/149 (53%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQH 110
           I + +FN L KI P  LQ+W  +L AVPNS L +   P   ++      +T + LGL+  
Sbjct: 659 ITFGSFNNLAKITPRVLQVWARILCAVPNSRLIVKCKPFCSDSVRLQFLSTLEKLGLESL 718

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P +++ G   A  V  
Sbjct: 719 RVDLLPLILLNRDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCISMGGHVHAHNVGV 778

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L T+G   L+A+   EY  +A++L +D
Sbjct: 779 SLLNTVGLSNLVAKNEDEYVQLALQLASD 807


>gi|196232229|ref|ZP_03131083.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
 gi|196223597|gb|EDY18113.1| TPR repeat-containing protein [Chthoniobacter flavus Ellin428]
          Length = 701

 Score = 83.2 bits (204), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 5/149 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQ 109
           + + ++N + K    T+++W  +L+ +P + L L       PA  E  ++       +D 
Sbjct: 506 VTFASYNIVPKTTGQTIELWSRILRELPEARLTLGNRGFGTPAAVE-RVRKRFAERAIDP 564

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            RI+F     + E +     AD+ LDT   NG  T+ + +W G PVVT PGET ASR   
Sbjct: 565 SRIIFQGWVPRAELLAAYNQADIALDTLPYNGGLTTCEAMWMGVPVVTCPGETFASRHGL 624

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           + L   G PE IAR   +Y  IA+ L  D
Sbjct: 625 AHLTAAGVPETIARDPDDYVKIAVDLARD 653


>gi|425445185|ref|ZP_18825221.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
 gi|389734919|emb|CCI01526.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
          Length = 713

 Score = 83.2 bits (204), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 62/172 (36%), Positives = 86/172 (50%), Gaps = 30/172 (17%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R   G+PEDAI Y       K +P T+++ + VLKAVPNS L           +IQ  + 
Sbjct: 513 RDDLGIPEDAINYLTVQTGVKRNPETIRLQLQVLKAVPNSYL-----------SIQGLSD 561

Query: 104 ALGLDQHRILFSNVAAKE----EHVR----------RGQL--ADVCLDTPLCNGHTTSMD 147
           A  L++   LF  VA +E    E ++          R  L  ADV LDT    G  T++D
Sbjct: 562 AKSLEK---LFFKVAEEEGINYERLKIFPLYPTGIYRANLRIADVVLDTYPFTGGMTTLD 618

Query: 148 VLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           VLW G P+VT  G+  +SR + + +   G  E IA + +EY D  I+LG D 
Sbjct: 619 VLWMGIPLVTKVGQQWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDE 670


>gi|220906369|ref|YP_002481680.1| hypothetical protein Cyan7425_0933 [Cyanothece sp. PCC 7425]
 gi|219862980|gb|ACL43319.1| TPR repeat-containing protein [Cyanothece sp. PCC 7425]
          Length = 738

 Score = 82.8 bits (203), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 58/161 (36%), Positives = 90/161 (55%), Gaps = 7/161 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF----PAVGEANIQ 99
           R+  G+P DA VY +  +  K +P  +++ + +LKAVP S   L+K      AV E  +Q
Sbjct: 534 REDLGIPADAPVYFSSQRGNKRNPHLIRLQMQILKAVPGSYF-LIKGQSDQAAVQEVFLQ 592

Query: 100 ATAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
             A+A G+   R+ F   V   EEH    ++A++ LDT   NG TT+++ LW   PVVTL
Sbjct: 593 -MAEAEGISVDRLKFPPLVNTCEEHRANLRIANIILDTYPYNGATTTLEALWMEVPVVTL 651

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            G+  ++R + + L+  G  E IA +  EY +  +RLG D+
Sbjct: 652 VGQQFSARNSYTFLSNAGVTEGIAWSDAEYVEWGVRLGQDQ 692


>gi|381160368|ref|ZP_09869600.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
 gi|380878432|gb|EIC20524.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
          Length = 717

 Score = 82.8 bits (203), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 77/151 (50%), Gaps = 5/151 (3%)

Query: 52  DAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGL 107
           D++ + +FN + K+   T+++W  VLK VP S L +    L  P V  A +       G 
Sbjct: 515 DSVTFASFNNILKLSERTIRLWSRVLKEVPGSNLLIRDKVLADPGV-RAKVLGEFVKSGT 573

Query: 108 DQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
              +++  +  ++EE++R     D+ L      G TT+ + LW G PVV L G T A R+
Sbjct: 574 CPEQLVMESSVSREEYLREYCDVDISLSPTPFGGGTTTAEALWMGVPVVFLSGGTWAGRI 633

Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             S L T+G PEL+A+  + Y  IA  L  D
Sbjct: 634 GESILRTVGLPELVAQDEERYVGIATALAGD 664


>gi|425464283|ref|ZP_18843605.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
 gi|389833753|emb|CCI21475.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
          Length = 716

 Score = 82.4 bits (202), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 82/159 (51%), Gaps = 2/159 (1%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
           R   G+PEDAI Y       K +P T+++ + +LKAVPNS L +  L      E      
Sbjct: 516 RDDLGIPEDAINYLTAQTGVKRNPETIRLQLQILKAVPNSYLSIQGLSDAKSVEKLFFKV 575

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           A+  G++  R+    +     +    ++ADV LDT    G  T++DVLW G P+VT  G+
Sbjct: 576 AEEEGINYERLKIFPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 635

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             +SR + + +   G  E IA + +EY D  I+LG D +
Sbjct: 636 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDEN 674


>gi|354566941|ref|ZP_08986112.1| TPR repeat-containing protein [Fischerella sp. JSC-11]
 gi|353544600|gb|EHC14054.1| TPR repeat-containing protein [Fischerella sp. JSC-11]
          Length = 740

 Score = 82.4 bits (202), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 57/167 (34%), Positives = 86/167 (51%), Gaps = 7/167 (4%)

Query: 38  SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE-- 95
           S+   +R+Q  +P DA+VY    + +K      ++ + +LK VPNS   L+K  +  E  
Sbjct: 532 SVPTITREQLDIPHDAVVYLCAQRGFKRHLDITRLQLRILKEVPNSYF-LIKGISDEEST 590

Query: 96  -ANIQATAQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTG 152
            A  +  A+  G+D  +  F  +   E  V R  L  ADV LDT   NG TT+++ LW  
Sbjct: 591 KAFFEKLAEEEGVDTSKFRFLPIVVAES-VHRANLGIADVVLDTYPYNGATTTLETLWMC 649

Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            P+VT  GE  A+R + + +   G  E IA T +EY +  +RLG D 
Sbjct: 650 IPMVTRVGEQFAARNSYTMMINAGITEGIAWTDEEYVEWGVRLGKDE 696


>gi|350535410|ref|NP_001233937.1| probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY [Solanum
           lycopersicum]
 gi|75330646|sp|Q8RVB2.1|SPY_SOLLC RecName: Full=Probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY; Short=LeSPY
 gi|19913115|emb|CAC85168.1| SPY protein [Solanum lycopersicum]
 gi|19913117|emb|CAC85169.1| SPY protein [Solanum lycopersicum]
          Length = 931

 Score = 82.4 bits (202), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 51/149 (34%), Positives = 79/149 (53%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           + + +FN L KI P  L++W  +L AVP+S L +   P   ++  Q   +  + LGL+  
Sbjct: 660 VTFGSFNNLAKITPKVLKVWARILSAVPHSRLIVKCKPFCCDSVRQRFLSILEQLGLEPQ 719

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 720 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMGGSVHAHNVGV 779

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L T+G   L+AR   EY + AI+L +D
Sbjct: 780 SLLKTVGLENLVARNEDEYVESAIQLASD 808


>gi|428182321|gb|EKX51182.1| hypothetical protein GUITHDRAFT_134689 [Guillardia theta CCMP2712]
          Length = 726

 Score = 82.4 bits (202), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 49/127 (38%), Positives = 76/127 (59%), Gaps = 5/127 (3%)

Query: 73  WVNVLKAVP--NSILWLLKFPAVGEANIQATAQA-LGLDQH-RILFSNVAAKEEHVRRGQ 128
           W+++L+     ++ LWL  FP V +  +   A+A LG +   RI+++N+  + EH++   
Sbjct: 533 WMDILRRTTEHHTTLWLPLFPDVAKPQLMRIARAKLGKNATTRIVWTNLFDESEHLQVKS 592

Query: 129 LADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCP-ELIARTHKE 187
           LA + LDT L  GHT+  D+LW G P VTLPG   ++RV AS L  L     L+ART ++
Sbjct: 593 LATMQLDTFLYCGHTSGADILWAGVPTVTLPGIMQSARVGASLLKGLSLHGHLVARTAED 652

Query: 188 YQDIAIR 194
           Y D+ +R
Sbjct: 653 YADVVVR 659


>gi|425468294|ref|ZP_18847324.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9701]
 gi|389885054|emb|CCI34705.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9701]
          Length = 716

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 55/158 (34%), Positives = 81/158 (51%), Gaps = 2/158 (1%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
           R   G+PEDAI Y       K +P T+++ + VLKAVPNS L +  L      E      
Sbjct: 516 RDDLGIPEDAINYLTVQTGVKRNPETIRLQLQVLKAVPNSYLSIQGLSDAKSVEKLFFKI 575

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           A+  G++  R+    +     +    ++ADV LDT    G  T++DVLW G P+VT  G+
Sbjct: 576 AEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 635

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
             +SR + + +   G  E IA + +EY D  I+LG D 
Sbjct: 636 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDE 673


>gi|428318600|ref|YP_007116482.1| glycosyl transferase family 2 [Oscillatoria nigro-viridis PCC 7112]
 gi|428242280|gb|AFZ08066.1| glycosyl transferase family 2 [Oscillatoria nigro-viridis PCC 7112]
          Length = 2237

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 59/179 (32%), Positives = 86/179 (48%), Gaps = 12/179 (6%)

Query: 30   ATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL---- 85
            AT  EV  + V  +RQ  G+ ++  V+ +    YKI P   + WV +L AVP+SIL    
Sbjct: 1560 ATESEV--ATVKPTRQSLGISDNTTVFISGANFYKILPELRESWVKILAAVPDSILILYP 1617

Query: 86   ----WLLKFPAVGEAN-IQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLC 139
                W   +PA    N + A     G++ +R+ F         V+   QLADV LD+   
Sbjct: 1618 FGPAWTRTYPATPFVNNLNAVCAKYGVNNNRLRFVKQLPSRADVKEFLQLADVYLDSYPY 1677

Query: 140  NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             G T+ +D L  G P V + G  L  R  ++ L  L  P+LIA     Y  +A+ LGT+
Sbjct: 1678 AGATSLIDPLQVGLPAVVVEGNALRFRQGSAMLRELQMPDLIAYDEASYIQLAVTLGTN 1736


>gi|358639499|dbj|BAL26796.1| hypothetical protein AZKH_4523 [Azoarcus sp. KH32C]
          Length = 585

 Score = 82.0 bits (201), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 45/150 (30%), Positives = 80/150 (53%), Gaps = 5/150 (3%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLD 108
           A+ + +FN L K++ + + +W  VL+A+P S L L++  A+ +   ++   A     G+ 
Sbjct: 390 ALTFGSFNALGKVNDAVIDLWAEVLRALPASRL-LMQARALSDPGSRSCVAARFGERGIA 448

Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
             R+         EH+      DVCLDT   NGH T++D LW G PV+TL G+  A+R+ 
Sbjct: 449 SERVEMHGFMPLAEHLALFHRTDVCLDTFPWNGHMTTLDSLWMGVPVLTLKGDRRAARMG 508

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            + ++ +G    +A + +++   A+ L  D
Sbjct: 509 TAIMSAIGLEGFVAESREDFVAKAVGLAGD 538


>gi|428166307|gb|EKX35285.1| hypothetical protein GUITHDRAFT_146593 [Guillardia theta CCMP2712]
          Length = 757

 Score = 82.0 bits (201), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 66/123 (53%), Gaps = 6/123 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAV------PNSILWLLKFPAVGEAN 97
           R++ GLP +  V C FNQ +K+DP T ++W + ++ +       +  LWLL+   +   N
Sbjct: 604 RKEEGLPPEGPVICYFNQHFKLDPPTFKIWTDAIQKLVSFNSSAHPTLWLLQGSPISHRN 663

Query: 98  IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
           ++   Q+ GL    ++ +     + H+RR  L D+ +DTP+ N   T  D L+   P++ 
Sbjct: 664 LKREFQSAGLPGENLVIAKRVLVKTHIRRASLCDLAVDTPMYNSGATGADTLYAAVPLLH 723

Query: 158 LPG 160
           +PG
Sbjct: 724 IPG 726


>gi|374293478|ref|YP_005040513.1| hypothetical protein AZOLI_3149 [Azospirillum lipoferum 4B]
 gi|357425417|emb|CBS88304.1| protein of unknown function; putative TPR repeat domains
           [Azospirillum lipoferum 4B]
          Length = 896

 Score = 82.0 bits (201), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 53/158 (33%), Positives = 80/158 (50%), Gaps = 9/158 (5%)

Query: 48  GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQ 103
           GL +    + +FN   K++ ST+ +W  VL AVP + L +LK+  +  +++     A   
Sbjct: 689 GLADGTFTFGSFNNPTKLNASTIALWARVLHAVPRARL-MLKWRGLTSSSLGGRLLAQFA 747

Query: 104 ALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET 162
           ALG+   R++ S       E  RR    D  LD    NG TT+ D LW G PV+++ G  
Sbjct: 748 ALGIAGDRLILSGTTPDPYESYRR---IDCGLDPVFANGGTTTCDSLWMGVPVLSIAGTA 804

Query: 163 LASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             SR   S L+ +G P+L+A     Y  +A+RL  D D
Sbjct: 805 AISRWGISLLSAVGLPDLVADDDDSYVALAVRLAGDPD 842


>gi|220906370|ref|YP_002481681.1| hypothetical protein Cyan7425_0934 [Cyanothece sp. PCC 7425]
 gi|219862981|gb|ACL43320.1| TPR repeat-containing protein [Cyanothece sp. PCC 7425]
          Length = 739

 Score = 82.0 bits (201), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 85/159 (53%), Gaps = 5/159 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI---QA 100
           R+  G+P DA++Y +  +  K +P   Q+ + +L+ +PNS L +  F       +   Q 
Sbjct: 537 REDLGIPADAVIYLSSQRAAKRNPHLTQLQMQILQEMPNSYLLIKGFANQKSLQVWFEQI 596

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
            A+A G+   R+ F  +  + E+H    ++ADV LDT   NG TT+++ LW   P+VT  
Sbjct: 597 AAEA-GVSSDRLRFVPSCPSDEQHRANLRIADVVLDTYPYNGATTTLETLWMEIPLVTRV 655

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           GE  A+R + + L   G  E IA +  EY +  +RLG +
Sbjct: 656 GEQFAARNSYTFLVNAGVEEGIAWSDAEYVEWGVRLGKE 694


>gi|189425152|ref|YP_001952329.1| hypothetical protein Glov_2093 [Geobacter lovleyi SZ]
 gi|189421411|gb|ACD95809.1| TPR repeat-containing protein [Geobacter lovleyi SZ]
          Length = 1714

 Score = 82.0 bits (201), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 49/150 (32%), Positives = 74/150 (49%), Gaps = 6/150 (4%)

Query: 54   IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL----WLLKFPAVGEANIQATAQALGLDQ 109
            + + +FN + K  P  ++ W  +L  V  S++    W     ++ +      A A G+  
Sbjct: 1521 VTFGSFNNISKTSPYVIETWATILNRVEGSVIMLKNWFFSDQSIQQKFYDGFA-AFGITS 1579

Query: 110  HRILFSNVAAK-EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
             RI+    A   +EH+   Q  D+ LDT   NG TT+ + LW G PVVT  G+  A RV 
Sbjct: 1580 DRIILLPAAPDIKEHLCMYQHVDIALDTFPYNGTTTTCEALWMGVPVVTFTGDRHAGRVG 1639

Query: 169  ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            AS L  +G  EL+AR   +Y D A+ L  +
Sbjct: 1640 ASILNRIGLSELVARDRNDYIDRAVMLAEN 1669



 Score = 76.3 bits (186), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 47/154 (30%), Positives = 77/154 (50%), Gaps = 5/154 (3%)

Query: 49   LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL--- 105
            L  + I + +FN   KI  + +++W +++K VP S L +LKF    + +++   + L   
Sbjct: 991  LDSEFITFGSFNNPLKISENVVEVWSDIMKRVPKSRL-VLKFSTFKDVSVRRRFRELFST 1049

Query: 106  -GLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
             G+   RI F   ++    +      D+ LDT    G  TS++ LW G P++TL G T  
Sbjct: 1050 YGVSPRRIEFRTFSSPYLMLMEHGDIDIMLDTFPFTGGMTSLNALWMGVPIITLAGTTPI 1109

Query: 165  SRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            SR   + L  LG  EL+     EY + A++L  D
Sbjct: 1110 SRQTKTFLDLLGLYELVTTNKDEYIETAVKLSFD 1143



 Score = 64.7 bits (156), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 73/149 (48%), Gaps = 5/149 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALGLDQ 109
           I + +FN ++K+    +  W  VL AV +S L LLK  +  +     +     +  G+  
Sbjct: 404 ITFGSFNSIHKLTAEVVDAWSRVLHAVSHSKL-LLKSKSFSDDVVVDDFIRRFKIHGISS 462

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           +RI+F  +++  E ++     D+ LD    NG  ++ + LW G PVVTL   T  SR + 
Sbjct: 463 NRIIFKKLSSHYEMLQEYGEIDIALDPFPYNGGASTCEALWMGVPVVTLEMGTPISRQSK 522

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           + L  +   EL+A T  +Y  IA  L  D
Sbjct: 523 AFLYAIDHHELVASTLDDYVQIAQNLALD 551


>gi|302878329|ref|YP_003846893.1| hypothetical protein Galf_1101 [Gallionella capsiferriformans ES-2]
 gi|302581118|gb|ADL55129.1| Tetratricopeptide TPR_1 repeat-containing protein [Gallionella
            capsiferriformans ES-2]
          Length = 1646

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 53/149 (35%), Positives = 78/149 (52%), Gaps = 5/149 (3%)

Query: 54   IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA----LGLDQ 109
            I + +FN+L KI   T   W  +L+A+P+S L +LK  A+ +A  +          G+D 
Sbjct: 1444 ITFGSFNRLVKISEQTYLAWAKILRALPDSRL-ILKTHALDDAGTRERVSEHFIRAGIDP 1502

Query: 110  HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
             RI+     ++E  +      D+ LDT    G  T+++ L  G PVVTL   TL  RV+A
Sbjct: 1503 ARIILLGKTSREAQLAAFNRVDIALDTFPHGGGMTTLEGLVMGVPVVTLRWPTLTGRVSA 1562

Query: 170  SQLATLGCPELIARTHKEYQDIAIRLGTD 198
            S L TLG P+ IA +  EY  +AI+   D
Sbjct: 1563 SILTTLGMPDWIAESQDEYVKLAIQKAAD 1591



 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 81/149 (54%), Gaps = 5/149 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLDQ 109
           + + ++N+L K   +  + W  +L+AVP+S + + K P + +A +Q   +A     G+  
Sbjct: 759 VTFGSYNRLAKNSEAAYRAWAKILRAVPDSRM-IFKTPVLDDAAVQDQVRAYFTSAGVAA 817

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            RI+    +  +EH+      D+ LD     G  ++++ L  G PVVTL   +LA R++A
Sbjct: 818 ERIILLGKSPHDEHMAAFNQIDISLDPFPHGGGVSALEGLLMGVPVVTLNWPSLAGRISA 877

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S + TL  P+ IA + +EY  +AI+  TD
Sbjct: 878 SIMTTLAMPDWIAGSEEEYVQLAIQKATD 906


>gi|340516226|gb|EGR46476.1| glycosyltransferase family 41 [Trichoderma reesei QM6a]
          Length = 1738

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 57/161 (35%), Positives = 80/161 (49%), Gaps = 47/161 (29%)

Query: 51   EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
            +DAI+  NFNQLYK                              E N++ TA+A    + 
Sbjct: 1542 DDAIILANFNQLYK----------------------------ARETNLRQTAEAWAGAEV 1573

Query: 110  -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET--LASR 166
              R++F++VA K +H+ R ++ D+ LDT  CN HTT+ DVLW+ TP++TLP  +  + SR
Sbjct: 1574 ASRLVFTDVAPKNQHINRARVCDLFLDTAECNAHTTAADVLWSSTPLLTLPRYSYKMCSR 1633

Query: 167  VAAS------------QLATLGCPELIARTHKEYQDIAIRL 195
            +AAS            Q A L   ELIA    EY+D A  L
Sbjct: 1634 MAASILRGALPKSAEGQQAAL---ELIADGETEYEDQAAEL 1671


>gi|428207291|ref|YP_007091644.1| hypothetical protein Chro_2283 [Chroococcidiopsis thermalis PCC
           7203]
 gi|428009212|gb|AFY87775.1| hypothetical protein Chro_2283 [Chroococcidiopsis thermalis PCC
           7203]
          Length = 725

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 55/164 (33%), Positives = 86/164 (52%), Gaps = 15/164 (9%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN------ 97
           R+   +P +A+VY +    +K  P T+++ + +L  VPNS      F   G+A+      
Sbjct: 523 REHLEIPSNAVVYLSSQVGFKRHPDTVRLQMQILSQVPNSY-----FLVKGKADENTIRE 577

Query: 98  -IQATAQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTP 154
                A+  G++  R+ F +     E V R  L  ADV LDT   NG TT+++ LW G P
Sbjct: 578 FFNKIAEDEGVNPDRLRFLS-RDPSEFVHRANLGIADVVLDTYPYNGATTTLETLWMGIP 636

Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +VT  G+  ASR + + +  +G  E IA T +EY +  ++LGTD
Sbjct: 637 LVTKVGQQFASRNSYTFMMNVGVTEGIAWTDEEYVEWGVKLGTD 680


>gi|253996282|ref|YP_003048346.1| hypothetical protein Mmol_0910 [Methylotenera mobilis JLW8]
 gi|253982961|gb|ACT47819.1| TPR repeat-containing protein [Methylotenera mobilis JLW8]
          Length = 1673

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 50/150 (33%), Positives = 77/150 (51%), Gaps = 7/150 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL-WLLKFPAVGEANIQATAQA----LGLD 108
           I + +F  L K     L +W  V++A+P+S L W  K  + G+AN+ A   A    LG+D
Sbjct: 638 ITFASFQTLAKAGDDVLALWAEVMRALPDSRLRWQCK--SFGDANVVANMAAKFAKLGID 695

Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
             R+      ++E +       D+ LDT    G TT+ + LW G P +TL G++L +R  
Sbjct: 696 VARLSLLGAVSRENYFASHHDVDMILDTFPYPGGTTTCEALWMGVPTLTLAGDSLIARQG 755

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           AS L   G  + +A T  EY D A++  +D
Sbjct: 756 ASMLTAAGLADWVAETKAEYLDKALKHCSD 785


>gi|75906571|ref|YP_320867.1| hypothetical protein Ava_0346 [Anabaena variabilis ATCC 29413]
 gi|75700296|gb|ABA19972.1| TPR repeat protein [Anabaena variabilis ATCC 29413]
          Length = 739

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 53/166 (31%), Positives = 84/166 (50%), Gaps = 7/166 (4%)

Query: 38  SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
           S+   +R++  +P DA+VY    + +K  P   ++ + ++K VPNS  + L      E +
Sbjct: 532 SVPTVTREELDIPHDAVVYLCGQRGFKRHPDITRLQLKIIKEVPNS--YFLIKGISDEDS 589

Query: 98  IQATAQAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTG 152
           I+     L    G+D  R+ F  +   E  H     +AD+ LDT   NG TT+++ LW  
Sbjct: 590 IKLFFDGLADEEGVDTSRLRFLPIVMSESIHRANLDIADIVLDTYPYNGATTTLETLWMC 649

Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            P+VT  GE  A+R + + +   G  E IA T  EY +  +RLG D
Sbjct: 650 IPMVTRVGEQFAARNSYTMMMNAGITEGIAWTDDEYIEWGVRLGKD 695


>gi|312282561|dbj|BAJ34146.1| unnamed protein product [Thellungiella halophila]
          Length = 762

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 78/149 (52%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           + + +FN L KI P  LQ+W  +L AVPNS L +   P   ++  Q    T + LGL+  
Sbjct: 517 VTFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSIRQRFLTTLEQLGLESK 576

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 577 RVDLLPLILFNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 636

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L+ +G   L+A+   EY  +++ L +D
Sbjct: 637 SLLSKVGLGHLVAKNEDEYVQLSVDLASD 665


>gi|166363521|ref|YP_001655794.1| hypothetical protein MAE_07800 [Microcystis aeruginosa NIES-843]
 gi|166085894|dbj|BAG00602.1| hypothetical protein MAE_07800 [Microcystis aeruginosa NIES-843]
          Length = 713

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 82/159 (51%), Gaps = 2/159 (1%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
           R   G+PEDAI Y       K +P T+++ + VLKAVP+S L +  L      E      
Sbjct: 513 RDDLGIPEDAINYLTVQTGVKRNPETIRLQLQVLKAVPHSYLSIQGLSDAKSVEKLFFKV 572

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           A+  G++  R+    +     +    ++ADV LDT    G  T++DVLW G P+VT  G+
Sbjct: 573 AEEEGINYERLKIFPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 632

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             +SR + + +   G  E IA + +EY D  I+LG D +
Sbjct: 633 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDEN 671


>gi|427720511|ref|YP_007068505.1| hypothetical protein Cal7507_5336 [Calothrix sp. PCC 7507]
 gi|427352947|gb|AFY35671.1| TPR repeat-containing protein [Calothrix sp. PCC 7507]
          Length = 739

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 55/159 (34%), Positives = 83/159 (52%), Gaps = 5/159 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQA 100
           R+   +P +A+VY +  + YK  P T +  + +LK VP+S   L+K  +  EA       
Sbjct: 537 REDLDIPTNAVVYLSAQRGYKRHPETTRWQMQILKQVPDSYF-LIKGLSEEEAIKRFFYQ 595

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
            A+  G+D  R+ F   V  +  H     +ADV LDT   NG TT+++ LW G P+VT  
Sbjct: 596 IAEEEGVDCSRLRFLPQVYLESVHRANLGIADVVLDTFPYNGATTTLETLWMGIPLVTRV 655

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           GE  A+R + + +   G  E +A T  EY +  +RLG D
Sbjct: 656 GEQFAARNSYTMMINAGITEGVAWTDAEYIEWGVRLGKD 694


>gi|358400410|gb|EHK49741.1| glycosyltransferase family 41 protein [Trichoderma atroviride IMI
            206040]
          Length = 1702

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 55/160 (34%), Positives = 79/160 (49%), Gaps = 44/160 (27%)

Query: 51   EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQ- 109
            +DAI+  NFNQLYK                              + N++ TA+A    + 
Sbjct: 1507 DDAIILANFNQLYK-----------------------------AKVNLRRTAEAWAGAEV 1537

Query: 110  -HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGET--LASR 166
              R++F++VA K +H+ R ++ D+ LDT  CN HTT+ DVLW+ TP++TLP  +  + SR
Sbjct: 1538 ASRLIFTDVAPKSQHITRARVCDLFLDTAECNAHTTAADVLWSSTPLLTLPRYSYKMCSR 1597

Query: 167  VAASQLATLGCP----------ELIARTHKEYQDIAIRLG 196
            +AAS L     P          ELIA    EY+D A  L 
Sbjct: 1598 MAASILRG-ALPKTAEGEQVAVELIAENETEYEDSATELA 1636


>gi|320166199|gb|EFW43098.1| hypothetical protein CAOG_08230 [Capsaspora owczarzaki ATCC 30864]
          Length = 875

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 45/159 (28%), Positives = 79/159 (49%), Gaps = 1/159 (0%)

Query: 40  VITSRQQYGLPED-AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI 98
           V+  + + GLP D A  +C+   ++K + ST+  W+ +LK  P + LW++  P    AN 
Sbjct: 609 VVAKKTELGLPNDGAFTFCSLGPIFKANKSTVHSWIRILKQAPEARLWVIAHPNDAAANF 668

Query: 99  QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
           +       ++   IL   +  ++       + D+ LDT   N HTT+ ++L  G PV+T 
Sbjct: 669 RRLVAEAQVEAQVILTPRLEREDFLEYAPSMCDLFLDTLPYNAHTTASEMLTGGLPVLTQ 728

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGT 197
            G+T   R AA  L++    + IA +   Y+ +A+   T
Sbjct: 729 AGQTRCGRSAAGMLSSASLDQFIATSAGLYEAMAVEFAT 767


>gi|428167107|gb|EKX36071.1| hypothetical protein GUITHDRAFT_165818 [Guillardia theta CCMP2712]
          Length = 299

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 77/161 (47%), Gaps = 6/161 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ 103
           R +Y +P  A VY     ++K+ P    +   +L   PNSI+   + P   +   Q   +
Sbjct: 88  RSRYNIPPRATVYACLQTIFKVSPKMDFVAKRILNEDPNSIIMFKELPMTDQCGKQVLER 147

Query: 104 -----ALGLDQH-RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
                 L  D+  RI F       ++     + DV LD+    GHTTSM+ L  G P+VT
Sbjct: 148 MRNKSKLTEDEMARIKFLPAMHDADYSDMFAIVDVILDSYPFGGHTTSMEALAVGRPIVT 207

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           LP + ++ R     LA LG  ELIA   +++  IA++LG D
Sbjct: 208 LPTDFMSGRCTQGFLAFLGLQELIASDLEDFIRIAVKLGKD 248


>gi|297829706|ref|XP_002882735.1| hypothetical protein ARALYDRAFT_478494 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328575|gb|EFH58994.1| hypothetical protein ARALYDRAFT_478494 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 897

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           + + +FN L KI P  LQ+W  +L AVPNS L +   P   ++  Q    T + LGL+  
Sbjct: 638 VTFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSIRQRFLTTLEQLGLESK 697

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 698 RVDLLPLILFNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 757

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  +G   L+A+   EY  +++ L +D
Sbjct: 758 SLLTKVGLGHLVAKNEDEYVQLSVDLASD 786


>gi|115477811|ref|NP_001062501.1| Os08g0559300 [Oryza sativa Japonica Group]
 gi|75325353|sp|Q6YZI0.1|SPY_ORYSJ RecName: Full=Probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY
 gi|45736106|dbj|BAD13137.1| putative gibberellin action negative regulator SPY [Oryza sativa
           Japonica Group]
 gi|113624470|dbj|BAF24415.1| Os08g0559300 [Oryza sativa Japonica Group]
 gi|215686860|dbj|BAG89710.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 927

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 77/149 (51%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           I + +FN L KI P  LQ+W  +L AVPNS L +   P   ++  Q   +T   LGL+  
Sbjct: 646 ITFGSFNNLAKITPKVLQVWAKILCAVPNSRLVVKCKPFCCDSIRQKFLSTLAELGLEPL 705

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 706 RVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 765

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  +G   L+A++  EY  +A+ L  D
Sbjct: 766 SLLTKVGLGRLVAKSENEYVSLALDLAAD 794


>gi|218201606|gb|EEC84033.1| hypothetical protein OsI_30264 [Oryza sativa Indica Group]
          Length = 947

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 77/149 (51%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           I + +FN L KI P  LQ+W  +L AVPNS L +   P   ++  Q   +T   LGL+  
Sbjct: 666 ITFGSFNNLAKITPKVLQVWAKILCAVPNSRLVVKCKPFCCDSIRQKFLSTLAELGLEPL 725

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 726 RVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 785

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  +G   L+A++  EY  +A+ L  D
Sbjct: 786 SLLTKVGLGRLVAKSENEYVSLALDLAAD 814


>gi|222641017|gb|EEE69149.1| hypothetical protein OsJ_28275 [Oryza sativa Japonica Group]
          Length = 947

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 77/149 (51%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           I + +FN L KI P  LQ+W  +L AVPNS L +   P   ++  Q   +T   LGL+  
Sbjct: 666 ITFGSFNNLAKITPKVLQVWAKILCAVPNSRLVVKCKPFCCDSIRQKFLSTLAELGLEPL 725

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 726 RVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 785

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  +G   L+A++  EY  +A+ L  D
Sbjct: 786 SLLTKVGLGRLVAKSENEYVSLALDLAAD 814


>gi|42572381|ref|NP_974286.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY [Arabidopsis
           thaliana]
 gi|332641540|gb|AEE75061.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY [Arabidopsis
           thaliana]
          Length = 732

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           + + +FN L KI P  LQ+W  +L AVPNS L +   P   ++  Q    T + LGL+  
Sbjct: 473 VTFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSIRQRFLTTLEQLGLESK 532

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 533 RVDLLPLILFNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 592

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  +G   L+A+   EY  +++ L +D
Sbjct: 593 SLLTKVGLGHLVAKNEDEYVQLSVDLASD 621


>gi|323451078|gb|EGB06956.1| hypothetical protein AURANDRAFT_65153 [Aureococcus anophagefferens]
          Length = 1100

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 53/145 (36%), Positives = 70/145 (48%), Gaps = 23/145 (15%)

Query: 55  VYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG-------------------E 95
           VY +FN+  K+ P  ++ W  +L   P S+LWLL+FP  G                   E
Sbjct: 494 VYASFNKFEKLSPGVVEAWATILLDAPGSVLWLLEFPRSGVDGARRALEGAADRVRQKRE 553

Query: 96  ANIQATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
            +  A A A  LD  R++F   A  + E+  R  L D CLDT     HT+S+D L+ G P
Sbjct: 554 GSRDAPAGA--LDPARLIFGAFAPTDRENQARLPLIDACLDTWPYGAHTSSLDCLFAGVP 611

Query: 155 VVT-LPGETLASRVAASQLATLGCP 178
           V+T   G  LA RVA S L  L  P
Sbjct: 612 VITRRDGGHLAGRVALSHLVALSDP 636


>gi|167620865|ref|ZP_02389496.1| TPR domain protein [Burkholderia thailandensis Bt4]
          Length = 781

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 78/148 (52%), Gaps = 1/148 (0%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-IQATAQALGLDQHRI 112
           + Y +FN+L K+  + + +W  VL A P++ + +      G+   I     + G+D HR+
Sbjct: 570 VTYGSFNRLNKLRQNVIALWSQVLLATPSARMVIGSIEQEGDRETIIEWFVSEGIDAHRL 629

Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
            F   AA   ++++    DVCLD     G TT+++ LW G P VTL G+++ SR +AS L
Sbjct: 630 TFQPRAAMPVYLQQHHHVDVCLDAFPYAGSTTTLNALWMGVPTVTLSGKSIPSRGSASWL 689

Query: 173 ATLGCPELIARTHKEYQDIAIRLGTDRD 200
           + +G  E IA   K +   +I    D D
Sbjct: 690 SHVGLAEYIAHDQKSFVRTSIAAAQDLD 717


>gi|443657012|ref|ZP_21131811.1| hypothetical protein C789_2351 [Microcystis aeruginosa DIANCHI905]
 gi|159029888|emb|CAO90942.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443333289|gb|ELS47856.1| hypothetical protein C789_2351 [Microcystis aeruginosa DIANCHI905]
          Length = 716

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 81/159 (50%), Gaps = 2/159 (1%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
           R   G+PEDAI Y       K +P  +++ + VLKAVPNS L +  L      E      
Sbjct: 516 RDDLGIPEDAINYLTVQTGVKRNPEIIRLQLQVLKAVPNSYLSIQGLSDAKSVEKLFFKI 575

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           A+  G++  R+    +     +    ++ADV LDT    G  T++DVLW G P+VT  G+
Sbjct: 576 AEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 635

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             +SR + + +   G  E IA + +EY D  I+LG D +
Sbjct: 636 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDEN 674


>gi|83721418|ref|YP_443704.1| TPR domain-containing protein [Burkholderia thailandensis E264]
 gi|83655243|gb|ABC39306.1| TPR domain protein [Burkholderia thailandensis E264]
          Length = 781

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 78/148 (52%), Gaps = 1/148 (0%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-IQATAQALGLDQHRI 112
           + Y +FN+L K+  + + +W  VL A P++ + +      G+   I     + G+D HR+
Sbjct: 570 VTYGSFNRLNKLRQNVIALWSQVLLATPSARMVIGSIEQEGDRETIIEWFVSEGIDAHRL 629

Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
            F   AA   ++++    DVCLD     G TT+++ LW G P VTL G+++ SR +AS L
Sbjct: 630 TFQPRAAMPVYLQQHHHVDVCLDAFPYAGSTTTLNALWMGVPTVTLSGKSIPSRGSASWL 689

Query: 173 ATLGCPELIARTHKEYQDIAIRLGTDRD 200
           + +G  E IA   K +   +I    D D
Sbjct: 690 SHVGLAEYIAHDQKSFVRTSIAAAQDLD 717


>gi|328875280|gb|EGG23645.1| glycosyltransferase [Dictyostelium fasciculatum]
          Length = 687

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 56/170 (32%), Positives = 82/170 (48%), Gaps = 12/170 (7%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           P S+   ++  Y      + +  +N L K+   T + W  +L  +P + L ++K P   E
Sbjct: 484 PVSVAPCAKNGY------VTFGCYNTLSKVQDPTWKCWKQILDRLPQARL-IIKAPLFIE 536

Query: 96  ----ANIQATAQALGLDQHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLW 150
                + +   Q LG+D  R+ L +     + H       DV LD    NG TTSMD LW
Sbjct: 537 ESAAQHYRDRLQKLGVDTSRVSLRAYSMDTQNHYVSYDEMDVSLDPFPYNGTTTSMDSLW 596

Query: 151 TGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            G P VT  G T   RV AS L  +G  +L+  + +EY DIA++LG D D
Sbjct: 597 MGVPFVTYAGTTHVHRVGASILNNVGLGDLVGYSTQEYVDIAVKLGQDLD 646


>gi|425436659|ref|ZP_18817093.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9432]
 gi|389678582|emb|CCH92567.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9432]
          Length = 716

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 81/159 (50%), Gaps = 2/159 (1%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
           R   G+PEDAI Y       K +P  +++ + VLKAVPNS L +  L      E      
Sbjct: 516 RDDLGIPEDAINYLTVQTGVKRNPEIIRLQLQVLKAVPNSYLSIQGLSDAKSVEKLFFKI 575

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           A+  G++  R+    +     +    ++ADV LDT    G  T++DVLW G P+VT  G+
Sbjct: 576 AEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 635

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             +SR + + +   G  E IA + +EY D  I+LG D +
Sbjct: 636 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDEN 674


>gi|334117819|ref|ZP_08491910.1| glycosyl transferase family 2 [Microcoleus vaginatus FGP-2]
 gi|333460928|gb|EGK89536.1| glycosyl transferase family 2 [Microcoleus vaginatus FGP-2]
          Length = 1283

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 52/179 (29%), Positives = 93/179 (51%), Gaps = 10/179 (5%)

Query: 29   TATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLL 88
            + T E  P  +  T R  +G  ED++V+ +  Q + I P   + W  ++ AVPNSIL L 
Sbjct: 1065 SYTVESAPAQVEPT-RSSWGATEDSVVFMSGAQAFTIIPELRETWAKIIAAVPNSILVLY 1123

Query: 89   KFPAVGE--------ANIQATAQALGLDQHR-ILFSNVAAKEEHVRRGQLADVCLDTPLC 139
             F +  +          I++     G+D+ R ++   + ++ + ++  +LAD+ LD+   
Sbjct: 1124 PFSSHNKNYPTVPFYKQIRSLFAEFGVDKKRLVVIKALPSRADCIKCLELADIYLDSYPY 1183

Query: 140  NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            +    + + L  G P+V   G+T  SR +A+ L  L  PEL+A + K Y +++I LGT+
Sbjct: 1184 SSACYAAEPLLVGLPIVVRKGQTTRSRQSAAVLEELLIPELVADSEKSYLELSIALGTN 1242


>gi|75908605|ref|YP_322901.1| glycosyl transferase family protein [Anabaena variabilis ATCC
           29413]
 gi|75702330|gb|ABA22006.1| Glycosyl transferase, family 2 [Anabaena variabilis ATCC 29413]
          Length = 1007

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 51/181 (28%), Positives = 92/181 (50%), Gaps = 12/181 (6%)

Query: 32  GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF- 90
           G E  +  ++  R   G+PE+A+V+ +    +KI P  +  W N++  VPNS+L LL F 
Sbjct: 645 GTEEGKLTILVKRNSLGIPENAVVFISGANYFKIVPELVATWANIISRVPNSVLVLLPFG 704

Query: 91  ----PAVGEAN----IQATAQALGLDQHRILFSNVAA---KEEHVRRGQLADVCLDTPLC 139
                A  +AN    + +     GL   R++  ++     +E+     ++ADV LD+   
Sbjct: 705 PNWSNAYPKANFIDHLNSIFSQHGLATERLIVLDIQPIPDREDMKEYYKIADVYLDSYPF 764

Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            G T+ ++ L    PV+   G    S + A+ + TL  P+L+A + + Y ++A+ LGT+ 
Sbjct: 765 AGTTSLIEPLQVNLPVIARQGNCFRSAMGAAIIQTLNIPDLVADSEESYIELAVALGTNS 824

Query: 200 D 200
           +
Sbjct: 825 E 825


>gi|425458863|ref|ZP_18838349.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9808]
 gi|389823717|emb|CCI27874.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9808]
          Length = 716

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 81/159 (50%), Gaps = 2/159 (1%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
           R   G+PEDAI Y       K +P  +++ + VLKAVPNS L +  L      E      
Sbjct: 516 RDDLGIPEDAINYLTVQTGVKRNPEIIRLQLQVLKAVPNSYLSIQGLSDAKSVEKLFFKI 575

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           A+  G++  R+    +     +    ++ADV LDT    G  T++DVLW G P+VT  G+
Sbjct: 576 AEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 635

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             +SR + + +   G  E IA + +EY D  I+LG D +
Sbjct: 636 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDEN 674


>gi|357407018|ref|YP_004918942.1| TPR domain/SEC-C motif domain protein (fragment), partial
           [Methylomicrobium alcaliphilum 20Z]
 gi|351719683|emb|CCE25359.1| TPR domain/SEC-C motif domain protein (fragment) [Methylomicrobium
           alcaliphilum 20Z]
          Length = 488

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 50/153 (32%), Positives = 85/153 (55%), Gaps = 7/153 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK---FPAVGEAN-IQATAQALGLDQ 109
           + +  FN++ KI   T+ +WV +L+AVP+S  ++LK   F    E N I+    + GL+Q
Sbjct: 293 VTFGTFNKISKITDRTIALWVQILQAVPDS-KFVLKGRGFEKKVEQNRIKKRFASAGLEQ 351

Query: 110 --HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
               +L+     + EH+      D+ LDT   NG TT+M  ++ G PV+ L G++  +RV
Sbjct: 352 IDRLVLYGFSPKRSEHLLLYNELDIALDTFPYNGTTTTMQAIFMGVPVIALDGKSHVARV 411

Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           + S L+ +G  EL+A   ++Y   A+ L  D++
Sbjct: 412 SQSILSAIGANELVAMDEEDYFLKAVHLANDKE 444


>gi|60598530|gb|AAX25814.1| unknown [Schistosoma japonicum]
          Length = 114

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/55 (67%), Positives = 44/55 (80%)

Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           MDVLW G PVVT P ETLASRVAASQL TLGCPEL+A + ++Y  IA +LG +R+
Sbjct: 1   MDVLWAGCPVVTFPLETLASRVAASQLHTLGCPELVANSQEDYVRIATKLGNNRE 55


>gi|15229778|ref|NP_187761.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY [Arabidopsis
           thaliana]
 gi|75332921|sp|Q96301.1|SPY_ARATH RecName: Full=Probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY
 gi|12322895|gb|AAG51433.1|AC008153_6 spindly (gibberellin signal transduction protein); 75377-80082
           [Arabidopsis thaliana]
 gi|1589778|gb|AAC49446.1| SPINDLY [Arabidopsis thaliana]
 gi|62319977|dbj|BAD94086.1| spindly [Arabidopsis thaliana]
 gi|332641539|gb|AEE75060.1| putative UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY [Arabidopsis
           thaliana]
          Length = 914

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           + + +FN L KI P  LQ+W  +L AVPNS L +   P   ++  Q    T + LGL+  
Sbjct: 655 VTFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSIRQRFLTTLEQLGLESK 714

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 715 RVDLLPLILFNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGV 774

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  +G   L+A+   EY  +++ L +D
Sbjct: 775 SLLTKVGLGHLVAKNEDEYVQLSVDLASD 803


>gi|75330266|sp|Q8LP10.1|SPY_EUSGR RecName: Full=Probable UDP-N-acetylglucosamine--peptide
           N-acetylglucosaminyltransferase SPINDLY; Short=EgSPY
 gi|22773823|dbj|BAC11808.1| SPINDLY [Eustoma grandiflorum]
          Length = 918

 Score = 80.5 bits (197), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/149 (34%), Positives = 80/149 (53%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           I + +FN L KI P  LQ+W  +L AV NS L +   P   E+  Q   +T + LGL+  
Sbjct: 629 ITFGSFNNLAKITPKVLQVWARILCAVSNSRLIVKCKPFCCESVRQTFLSTLEQLGLEST 688

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P +T+ G   A  V  
Sbjct: 689 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCITMRGLVHAHNVGV 748

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L+T+G   L+A+   +Y  +A++L +D
Sbjct: 749 SLLSTVGLGHLVAKNEDDYVRLAVQLASD 777


>gi|425454003|ref|ZP_18833752.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
 gi|389799814|emb|CCI20674.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
          Length = 713

 Score = 80.5 bits (197), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 54/158 (34%), Positives = 81/158 (51%), Gaps = 2/158 (1%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
           R   G+PEDAI Y       K +P T+++ + VLKAVP+S L +  L      E      
Sbjct: 513 RDDLGIPEDAINYLTVQTGVKRNPETIRLQLQVLKAVPHSYLSIQGLSDAKSVEKLFFKV 572

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           A+  G++  R+    +     +    ++ADV LDT    G  T++DVLW G P+VT  G+
Sbjct: 573 AEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 632

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
             +SR + + +   G  E IA + +EY D  I+LG D 
Sbjct: 633 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDE 670


>gi|170731575|ref|YP_001763522.1| hypothetical protein Bcenmc03_0220 [Burkholderia cenocepacia MC0-3]
 gi|169814817|gb|ACA89400.1| TPR repeat-containing protein [Burkholderia cenocepacia MC0-3]
          Length = 779

 Score = 80.5 bits (197), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 75/144 (52%), Gaps = 1/144 (0%)

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-IQATAQALGLDQHRILF 114
           Y +FN+L K+ P  +++W  VL A P + + L    + G+   +     A G+D  R+ F
Sbjct: 569 YASFNRLNKLSPHVIEVWARVLHADPTARMALGAIGSEGDKQTLTEWFAAAGIDASRLSF 628

Query: 115 SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLAT 174
              +    ++++    D+CLD     G TT+++ LW G P VT+PG T+A R +A  L  
Sbjct: 629 HRRSNIPVYMQQHHGVDLCLDAFPYTGSTTTLNALWMGVPTVTIPGATMAGRGSAGWLQH 688

Query: 175 LGCPELIARTHKEYQDIAIRLGTD 198
           +G    IA+   ++   A+ LG D
Sbjct: 689 VGLDAYIAKDEDDFVARALALGRD 712


>gi|434386709|ref|YP_007097320.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Chamaesiphon minutus PCC 6605]
 gi|428017699|gb|AFY93793.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Chamaesiphon minutus PCC 6605]
          Length = 735

 Score = 80.5 bits (197), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 88/160 (55%), Gaps = 7/160 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK---FPAVGEANIQA 100
           RQ   +P+DA++Y +    YK +P+ ++  + ++K+VPNS   L+K    P + +     
Sbjct: 522 RQDLDIPDDAVIYFSGQSGYKRNPNCIRAQMEIVKSVPNSYF-LIKGSSDPEIIKNLFGN 580

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLA--DVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
            A  +G+   R+ F +   ++E   R  LA  D+ LDT   NG TT+++ LW G P+VT 
Sbjct: 581 LAAEVGISFDRLKFID-RVEDEPTHRANLAIADIVLDTFPYNGATTTLETLWMGIPMVTQ 639

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            G+  ++R + + +   G  E IA + +EY +  I+LG D
Sbjct: 640 VGKQFSARNSYTFMLNAGIEEGIAWSQQEYIEWGIKLGLD 679


>gi|427732274|ref|YP_007078511.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Nostoc sp. PCC 7524]
 gi|427368193|gb|AFY50914.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Nostoc sp. PCC 7524]
          Length = 739

 Score = 80.5 bits (197), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 56/179 (31%), Positives = 85/179 (47%), Gaps = 31/179 (17%)

Query: 38  SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN 97
           S+    R++  +P DAIVY    + +K  P   ++ + ++K VPNS      F   G ++
Sbjct: 532 SVPTVRREELDIPNDAIVYFCGQRGFKRHPDITRLQLQIIKEVPNSY-----FLIKGMSD 586

Query: 98  IQATAQALGLDQHRILFSNVAAKE---------------EHVRRGQL--ADVCLDTPLCN 140
            ++T         R+ F  +A KE               E V R  L  AD+ LDT   N
Sbjct: 587 EEST---------RVFFEELADKEGVEISQLRFLPGVISESVHRANLGIADIVLDTYPYN 637

Query: 141 GHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           G TT+++ LW   P+VT  GE  A+R + + +   G  E IA T +EY +  +RLG D 
Sbjct: 638 GATTTLETLWMCIPMVTKVGEQFAARNSYTMMMNAGITEGIAWTDEEYVEWGVRLGKDE 696


>gi|425449355|ref|ZP_18829195.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 7941]
 gi|389764012|emb|CCI09573.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 7941]
          Length = 716

 Score = 80.5 bits (197), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 54/158 (34%), Positives = 80/158 (50%), Gaps = 2/158 (1%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
           R   G+PEDAI Y       K +P  +++ + VLKAVPNS L +  L      E      
Sbjct: 516 RDDLGIPEDAINYLTVQTGVKRNPEIIRLQLQVLKAVPNSYLSIQGLSDAKSVEKLFFKI 575

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           A+  G++  R+    +     +    ++ADV LDT    G  T++DVLW G P+VT  G+
Sbjct: 576 AEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 635

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
             +SR + + +   G  E IA + +EY D  I+LG D 
Sbjct: 636 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDE 673


>gi|421866704|ref|ZP_16298367.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Burkholderia cenocepacia H111]
 gi|358073189|emb|CCE49245.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Burkholderia cenocepacia H111]
          Length = 782

 Score = 80.5 bits (197), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 51/147 (34%), Positives = 76/147 (51%), Gaps = 7/147 (4%)

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT----AQALGLDQHR 111
           Y +FN+L K+    + +W NVL A P + + L    A+G    Q T      A G+D  R
Sbjct: 572 YGSFNRLNKLSRHVISVWANVLHANPTARMAL---GAIGNDGDQHTLTEWFAAAGIDVER 628

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    D+CLDT    G TT+++ LW G P VTLPG TLA R +A+ 
Sbjct: 629 LSFHRRSNIPVYMQQHHGVDLCLDTFPYTGSTTTLNALWMGVPTVTLPGTTLAGRGSATW 688

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           L  +G    IA    ++   A+ LG D
Sbjct: 689 LQHVGLDNYIATNEDDFVAKALALGRD 715


>gi|196231119|ref|ZP_03129979.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
 gi|196224949|gb|EDY19459.1| Tetratricopeptide TPR_2 repeat protein [Chthoniobacter flavus
           Ellin428]
          Length = 600

 Score = 80.5 bits (197), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 74/151 (49%), Gaps = 5/151 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLDQ 109
           I + +FN + K    T+ +W  +L+ +P S L ++K      A +QA    L     +D 
Sbjct: 397 ITFASFNIVSKTSGPTIALWSRILRRLPTSRL-IIKNKGFEGARLQADIHQLFAQESVDP 455

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            RI F   + + E +     AD+ LDT   NG  T+ + LW G PVV+  GET A R   
Sbjct: 456 ARIEFRGPSPQAEFLASYGDADIALDTFPYNGGLTTCEALWMGLPVVSCAGETFAGRHGL 515

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           + L   G PE +A     Y D+A+ L +D D
Sbjct: 516 AHLTAAGLPEWVAADFDAYVDLAVALASDLD 546


>gi|119488020|ref|ZP_01621464.1| hypothetical protein L8106_11502 [Lyngbya sp. PCC 8106]
 gi|119455309|gb|EAW36448.1| hypothetical protein L8106_11502 [Lyngbya sp. PCC 8106]
          Length = 727

 Score = 80.1 bits (196), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 82/159 (51%), Gaps = 5/159 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQA 100
           R+ Y +  D ++Y +     K +   ++  V +LK VPNSIL + K     AV +   + 
Sbjct: 525 RKAYRIGLDQVIYLSVAPGRKFNRELVEAQVAILKQVPNSIL-IHKAAGDVAVFQGAYEQ 583

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
             QA+G+  HR+ F      +EEH +   LADV LD+   NG T +++ LW   PVVT  
Sbjct: 584 ACQAVGVSLHRVKFIPRFPTEEEHRQVYALADVLLDSYPYNGGTHTLEALWFEVPVVTRK 643

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           GE   SR+  S L  +G    IA + ++Y +  +R G D
Sbjct: 644 GEQFLSRMGYSFLQGVGVEAGIAESWEDYINWGVRFGED 682


>gi|374999506|ref|YP_004975594.1| hypothetical protein AZOLI_p60009 [Azospirillum lipoferum 4B]
 gi|357428477|emb|CBS91434.1| conserved protein of unknown function; putative TPR domains
           [Azospirillum lipoferum 4B]
          Length = 652

 Score = 80.1 bits (196), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 52/152 (34%), Positives = 77/152 (50%), Gaps = 6/152 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA----TAQALGLDQ 109
           + + +FN   K+    +++W  +LK VPNS L ++K   +G+   +     T    G+D 
Sbjct: 456 VTFGSFNNTSKVTGEVVRVWSEILKRVPNSRL-IVKSAQMGDEETRRRYLNTFVKFGIDA 514

Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
            RI L S + A + H+R     D+ LD    NG TT+ + LW G PVVT+ G    +RV 
Sbjct: 515 GRIELLSRIDAADGHLRAYDRIDIGLDPFPYNGTTTTCEALWMGVPVVTVAGANHVARVG 574

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           AS L   G  EL+A    +Y   A+ L  D D
Sbjct: 575 ASLLTHCGLNELVASDEADYIATAVALAGDPD 606


>gi|323527755|ref|YP_004229908.1| tetratricopeptide repeat-containing protein [Burkholderia sp.
           CCGE1001]
 gi|323384757|gb|ADX56848.1| Tetratricopeptide repeat [Burkholderia sp. CCGE1001]
          Length = 810

 Score = 80.1 bits (196), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 80/160 (50%), Gaps = 10/160 (6%)

Query: 48  GLP---EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA-- 102
           GLP   +  I Y +FN+L K+ P  + +W  +L+  P S + +    A+G    + T   
Sbjct: 578 GLPALHKGYITYGSFNRLNKLSPEVIALWSTILREQPTSRMVI---GAIGSKLDEETYLE 634

Query: 103 --QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             +  G+D  R+ F        ++++    D+CLDT    G TT+++ LW G P VT+PG
Sbjct: 635 WFRNEGIDTSRLTFCPRTTLPVYMQQHHHVDLCLDTFPYVGSTTTLNALWMGVPTVTMPG 694

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            ++ SR  A  L  +G  E IAR   E+   +I L  D D
Sbjct: 695 ISMPSRCGAGWLEQVGLHEFIARDKDEFVKKSIELTRDLD 734


>gi|224088480|ref|XP_002308458.1| predicted protein [Populus trichocarpa]
 gi|222854434|gb|EEE91981.1| predicted protein [Populus trichocarpa]
          Length = 934

 Score = 80.1 bits (196), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 50/150 (33%), Positives = 81/150 (54%), Gaps = 6/150 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
           I + +FN L KI P  LQ+W  +L AVPNS L ++K    G  +++       + LGL+ 
Sbjct: 660 ITFGSFNNLAKITPKVLQVWARILCAVPNSRL-VVKCKPFGCDSVRQRFLTVLEQLGLEP 718

Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
            R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P +T+ G   A  V 
Sbjct: 719 LRVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCITMAGAVHAHNVG 778

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            S L+ +G   L+A+  +EY  +A++L +D
Sbjct: 779 VSLLSKVGLGHLVAKNEEEYVQLALQLASD 808


>gi|430747186|ref|YP_007206315.1| O-linked N-acetylglucosamine transferase, SPINDLY family
           [Singulisphaera acidiphila DSM 18658]
 gi|430018906|gb|AGA30620.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Singulisphaera acidiphila DSM 18658]
          Length = 827

 Score = 80.1 bits (196), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 88/171 (51%), Gaps = 8/171 (4%)

Query: 33  EEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA 92
           EE P+   + + +  G     + + + ++L K++   L++W  +L+++P + L L +   
Sbjct: 617 EEAPEVNTLPALRAKG-----VTFGSLHKLPKLNARVLELWAELLRSIPKARLLLYRNNL 671

Query: 93  VGE--ANIQATAQALGLDQHRILFSNV-AAKEEHVRRGQLADVCLDTPLCNGHTTSMDVL 149
            G+    I A  +A G+   ++   N+      H +  Q  D+ LD    +GHTT+ + L
Sbjct: 672 RGQRKEEILAFFKAQGVAADQLELRNIIEGGGSHYQVYQDIDISLDVFPWSGHTTACESL 731

Query: 150 WTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           W G P+VT+ G   A R+ AS L  LG  + IA T ++Y DIA R  TD D
Sbjct: 732 WMGVPIVTIRGNRHAGRMTASVLTCLGLSDWIAETPEQYLDIAQRFATDLD 782


>gi|422303123|ref|ZP_16390477.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
 gi|389791934|emb|CCI12280.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
          Length = 713

 Score = 80.1 bits (196), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 53/161 (32%), Positives = 81/161 (50%), Gaps = 8/161 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
           R   G+PEDAI Y       K +P T+++ + VLK+VPNS    L    + +A       
Sbjct: 513 RDDLGIPEDAINYLTVQTGVKRNPETIRLQLQVLKSVPNS---YLSIQGLSDAKSVKKLF 569

Query: 99  QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
              A+  G++  R+    +     +    ++ADV LDT    G  T++DVLW G P+VT 
Sbjct: 570 FKVAEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTK 629

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            G+  +SR + + +   G  E IA + +EY D  I+LG D 
Sbjct: 630 VGQQWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDE 670


>gi|182412285|ref|YP_001817351.1| hypothetical protein Oter_0461 [Opitutus terrae PB90-1]
 gi|177839499|gb|ACB73751.1| Tetratricopeptide TPR_2 repeat protein [Opitutus terrae PB90-1]
          Length = 627

 Score = 80.1 bits (196), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 59/168 (35%), Positives = 84/168 (50%), Gaps = 8/168 (4%)

Query: 36  PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE 95
           P S+   +R+     + ++V+ +FN L KI  S L +W  +LK VPN  L LLK   + E
Sbjct: 413 PDSLAPNTREPAA--DGSVVFGSFNNLSKITDSMLAVWGRLLKQVPNGRL-LLKGVGLSE 469

Query: 96  ANIQ----ATAQALGLDQHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLW 150
              +    A  +  GL   R+ L         H+      DV LDT   +G TT+ + LW
Sbjct: 470 PESRGRYVARMETAGLPVDRVELLERTPDTAAHLALYHRVDVALDTFPYHGTTTTCEALW 529

Query: 151 TGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            G PVVTL G+  +SRV  S L+ +G  E IA +  +Y  IA+ L  D
Sbjct: 530 MGVPVVTLAGDRHSSRVGVSLLSAVGHSEWIAHSADDYVAIALALAAD 577


>gi|421480197|ref|ZP_15927838.1| bacterial transcriptional activator domain protein [Burkholderia
           multivorans CF2]
 gi|400221527|gb|EJO51976.1| bacterial transcriptional activator domain protein [Burkholderia
           multivorans CF2]
          Length = 779

 Score = 80.1 bits (196), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 49/147 (33%), Positives = 76/147 (51%), Gaps = 7/147 (4%)

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT----AQALGLDQHR 111
           Y +FN+L K+ P  + +W  VL A P + + L    A+G A  Q T      A G+D  R
Sbjct: 569 YGSFNRLNKLSPQVIAVWARVLHADPTARMAL---GAIGNAGDQRTLTEWFAAAGIDASR 625

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    D+CLD     G TT+++ LW G P VT+PG T+A R +A  
Sbjct: 626 LTFHPRSNIPVYMQQHHGVDLCLDAFPYTGSTTTLNALWMGVPTVTIPGSTVAGRGSAGW 685

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           L  +   E IA+   ++   A+ LG D
Sbjct: 686 LQHVRLDEYIAKDEDDFVAKAVALGRD 712


>gi|428217091|ref|YP_007101556.1| hypothetical protein Pse7367_0828 [Pseudanabaena sp. PCC 7367]
 gi|427988873|gb|AFY69128.1| hypothetical protein Pse7367_0828 [Pseudanabaena sp. PCC 7367]
          Length = 809

 Score = 79.7 bits (195), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 54/160 (33%), Positives = 84/160 (52%), Gaps = 5/160 (3%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN---IQ 99
           +R +  +P DA+V+       K  P T +M + ++  VPNS L L+K  A  E+     +
Sbjct: 608 NRDRLDIPADAVVFLTAQTGQKRHPDTTRMQLQIISQVPNSYL-LIKGIADQESMQNLFK 666

Query: 100 ATAQALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
             A   G+ ++++ F  +   EE H     +ADV LDT   NG TT+M+ LW G P+VT 
Sbjct: 667 NLAIEAGVSENQLKFLPLVKNEETHRANLAIADVVLDTYPYNGATTTMETLWMGIPIVTR 726

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            GE  ++R +   +   G  E IA + +EY +  +RLG D
Sbjct: 727 VGEQFSARNSYGMMINAGINEGIAWSDQEYIEWGVRLGKD 766


>gi|221202121|ref|ZP_03575156.1| TPR repeat protein [Burkholderia multivorans CGD2M]
 gi|221209071|ref|ZP_03582066.1| TPR repeat protein [Burkholderia multivorans CGD2]
 gi|221171066|gb|EEE03518.1| TPR repeat protein [Burkholderia multivorans CGD2]
 gi|221177915|gb|EEE10327.1| TPR repeat protein [Burkholderia multivorans CGD2M]
          Length = 779

 Score = 79.7 bits (195), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 49/147 (33%), Positives = 76/147 (51%), Gaps = 7/147 (4%)

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT----AQALGLDQHR 111
           Y +FN+L K+ P  + +W  VL A P + + L    A+G A  Q T      A G+D  R
Sbjct: 569 YGSFNRLNKLSPQVIAVWARVLHADPTARMAL---GAIGNAGDQRTLTEWFAAAGIDASR 625

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    D+CLD     G TT+++ LW G P VT+PG T+A R +A  
Sbjct: 626 LTFHPRSNIPVYMQQHHGVDLCLDAFPYTGSTTTLNALWMGVPTVTIPGSTVAGRGSAGW 685

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           L  +   E IA+   ++   A+ LG D
Sbjct: 686 LQHVRLDEYIAKDEDDFVAKAVALGRD 712


>gi|300869150|ref|ZP_07113747.1| tetratricopeptide TPR_2 [Oscillatoria sp. PCC 6506]
 gi|300332849|emb|CBN58945.1| tetratricopeptide TPR_2 [Oscillatoria sp. PCC 6506]
          Length = 746

 Score = 79.7 bits (195), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 58/161 (36%), Positives = 86/161 (53%), Gaps = 9/161 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT-- 101
           R +  +P +A++Y +    YK    T ++ V +LK VPNS   L+K  A  + +I+    
Sbjct: 545 RDRLNIPSNAVIYLSSQTGYKRHLDTARLQVKILKEVPNSYF-LIKGGA-DQNSIKKLFN 602

Query: 102 --AQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
             A+  GL+  R+ F    + EE V R  L  ADV LDT   NG TT+++ LW   P+VT
Sbjct: 603 QLAEEEGLEGDRLRFLPAVSSEE-VHRANLGIADVVLDTYPYNGATTTLETLWMCIPLVT 661

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             G+  A+R + + +   G  E IA T +EY +  IRLG D
Sbjct: 662 RVGQQFAARNSYTMMMNAGVTEGIAWTDEEYIEWGIRLGKD 702


>gi|421471789|ref|ZP_15920043.1| bacterial transcriptional activator domain protein [Burkholderia
           multivorans ATCC BAA-247]
 gi|400224616|gb|EJO54837.1| bacterial transcriptional activator domain protein [Burkholderia
           multivorans ATCC BAA-247]
          Length = 779

 Score = 79.7 bits (195), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 49/147 (33%), Positives = 76/147 (51%), Gaps = 7/147 (4%)

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT----AQALGLDQHR 111
           Y +FN+L K+ P  + +W  VL A P + + L    A+G A  Q T      A G+D  R
Sbjct: 569 YGSFNRLNKLSPRVVAVWARVLHADPTARMAL---GAIGNAGDQRTLTEWFAAAGIDASR 625

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    D+CLD     G TT+++ LW G P VT+PG T+A R +A  
Sbjct: 626 LTFHPRSNIPVYMQQHHGVDLCLDAFPYTGSTTTLNALWMGVPTVTIPGSTVAGRGSAGW 685

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           L  +   E IA+   ++   A+ LG D
Sbjct: 686 LQHVRLDEYIAKDEDDFVAKAVALGRD 712


>gi|224140075|ref|XP_002323412.1| predicted protein [Populus trichocarpa]
 gi|222868042|gb|EEF05173.1| predicted protein [Populus trichocarpa]
          Length = 917

 Score = 79.7 bits (195), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 78/149 (52%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQ- 109
           I + +FN L KI P  LQ+W  +L AVPNS L +   P   ++  Q      + LGL+  
Sbjct: 664 ITFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSVRQRFLTMLEQLGLEPL 723

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           H  L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V A
Sbjct: 724 HVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGAVHAHNVGA 783

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L+ +G   L+A+  +EY   A++L +D
Sbjct: 784 SLLSNVGLGHLVAKNEEEYVQSALQLASD 812


>gi|392409193|ref|YP_006445800.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Desulfomonile tiedjei DSM 6799]
 gi|390622329|gb|AFM23536.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Desulfomonile tiedjei DSM 6799]
          Length = 583

 Score = 79.3 bits (194), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 48/151 (31%), Positives = 78/151 (51%), Gaps = 5/151 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLDQ 109
           + +C+FNQL KI    L+ W  +L  +P S L +++  A+ +A I+ +      +LG+ +
Sbjct: 395 VTFCSFNQLAKISDKALETWAEILGQLPRSRL-VMRGKALNDATIRDSFAEKLSSLGISR 453

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+         +++R     D+ LDT    G TT+ D LW G PVVT  G+    R +A
Sbjct: 454 DRMDLLPKTTFHDYLRTYNDVDIALDTFPFAGGTTTCDSLWMGVPVVTFTGDRFCHRHSA 513

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           S +   G  +L+A   + Y   AI+L +D D
Sbjct: 514 SHIINSGHEDLVAHDIEGYIHKAIQLASDLD 544


>gi|126728515|ref|ZP_01744331.1| TPR domain/SEC-C motif domain protein [Sagittula stellata E-37]
 gi|126711480|gb|EBA10530.1| TPR domain/SEC-C motif domain protein [Sagittula stellata E-37]
          Length = 555

 Score = 79.3 bits (194), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 74/139 (53%), Gaps = 3/139 (2%)

Query: 63  YKIDPSTLQMWVNVLKAVPNSILWLL-KFPAVGEANIQATAQ--ALGLDQHRILFSNVAA 119
           YK+ P+ +  W  +LK VP + L+L  +    GE  +  T +    G+D  R++F   A 
Sbjct: 369 YKLTPTVIDAWSEILKGVPGATLFLANRALESGENRVWLTERFAERGIDTDRLIFDGGAE 428

Query: 120 KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPE 179
             ++++     D+ LD    NG TT+++ +W G PV+T  G+  ASR +AS +      +
Sbjct: 429 HYDYLKNYDRMDIALDAFPYNGGTTTVEAIWQGVPVLTFDGDRWASRTSASLICYTHLKD 488

Query: 180 LIARTHKEYQDIAIRLGTD 198
            +A + +   D+AIR+G D
Sbjct: 489 FVAPSRQAMIDMAIRMGND 507


>gi|288963192|ref|YP_003453471.1| hypothetical protein AZL_f01670 [Azospirillum sp. B510]
 gi|288915444|dbj|BAI76927.1| hypothetical protein AZL_f01670 [Azospirillum sp. B510]
          Length = 1587

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 48/166 (28%), Positives = 87/166 (52%), Gaps = 11/166 (6%)

Query: 44   RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---------PAVG 94
            R + GLP D +V+ +   + KI P   ++W  +L  VP S+L L+ +          A+ 
Sbjct: 1375 RMRLGLPADVVVFGSTANVNKIIPDLQEVWAAILARVPGSVLLLMPYGPHWNHSYPEALF 1434

Query: 95   EANIQATAQALGLDQHRILFSNVAAKEEH-VRRG-QLADVCLDTPLCNGHTTSMDVLWTG 152
              + +   +A G+ + R++  +    + H VRR    AD+ LD+   +G T+ ++ +  G
Sbjct: 1435 HDSFRQRLRAAGVAEDRLVILSERGWDRHDVRRALGAADIYLDSFPFSGSTSLIEPMEAG 1494

Query: 153  TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             P++   G T   ++ A+ L  LGCPEL+A +  +Y + A+RL +D
Sbjct: 1495 LPIICRAGATFRGQMGAAMLTDLGCPELVAGSDTDYIEKAVRLASD 1540


>gi|414870070|tpg|DAA48627.1| TPA: hypothetical protein ZEAMMB73_519907, partial [Zea mays]
 gi|414870071|tpg|DAA48628.1| TPA: hypothetical protein ZEAMMB73_519907, partial [Zea mays]
 gi|414870072|tpg|DAA48629.1| TPA: hypothetical protein ZEAMMB73_519907, partial [Zea mays]
          Length = 786

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 51/140 (36%), Positives = 75/140 (53%), Gaps = 6/140 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
           + + +FN L KI P  LQ+W  +L +VPNS L +   P   E NI+    AT + LGL+ 
Sbjct: 646 VTFGSFNNLAKITPKVLQVWARILCSVPNSRLVVKCKPFCCE-NIRQKFLATLEELGLES 704

Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
            R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V 
Sbjct: 705 LRVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGAVHAHNVG 764

Query: 169 ASQLATLGCPELIARTHKEY 188
            S L+ +G   L+A+T  EY
Sbjct: 765 VSLLSKVGLGRLVAKTEDEY 784


>gi|354598043|ref|ZP_09016060.1| hypothetical protein BrE312_2631 [Brenneria sp. EniD312]
 gi|353675978|gb|EHD22011.1| hypothetical protein BrE312_2631 [Brenneria sp. EniD312]
          Length = 1133

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 52/157 (33%), Positives = 78/157 (49%), Gaps = 6/157 (3%)

Query: 48  GLP---EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQAT 101
           GLP   +  I +  FN   KI+   L  W  +L AVPNS L+L  F    ++    I AT
Sbjct: 416 GLPATSKGYITFGCFNNASKINDILLAQWAIILHAVPNSRLFLKSFNYKNQSLSERIYAT 475

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
            +  G+ + R+     +  +E +      D+ LD    +G  T+ + +  G PVVTLPG 
Sbjct: 476 LEKHGIVRERVRIEGSSPHKELLASYNDVDIALDPWPYSGGLTTCEAMAMGVPVVTLPGP 535

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           T A R +AS L   G PEL+A   ++Y +I + +  D
Sbjct: 536 TFAGRHSASHLVNAGMPELLANDWEQYINITVGIAND 572


>gi|443657013|ref|ZP_21131812.1| hypothetical protein C789_2352 [Microcystis aeruginosa DIANCHI905]
 gi|159029887|emb|CAO90941.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443333290|gb|ELS47857.1| hypothetical protein C789_2352 [Microcystis aeruginosa DIANCHI905]
          Length = 726

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
           R    +P+DAI+Y       K   + + + + +L+ VPNS L +  F     + E  ++ 
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMIYLQLQILQQVPNSYLLIKGFADKETIRELFLK- 578

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           +A  LG+ Q R+ F  N   +E H     +AD+ LDT   +G TT+++ LW G P+VT  
Sbjct: 579 SADELGISQDRLRFLPNDLHEETHRANLGIADIILDTYPYSGATTTLETLWMGIPLVTRV 638

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           GE  A+R + + +   G  + IA + +EY    I+LG D++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSEEEYVQWGIKLGLDKN 679


>gi|322421642|ref|YP_004200865.1| hypothetical protein GM18_4175 [Geobacter sp. M18]
 gi|320128029|gb|ADW15589.1| Tetratricopeptide TPR_1 repeat-containing protein [Geobacter sp.
           M18]
          Length = 1104

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 52/152 (34%), Positives = 76/152 (50%), Gaps = 6/152 (3%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL----WLLKFPAVGEANIQATAQALGLD 108
           AI + +FN   KI P T+ +W  VL++VP S L    + L  P      ++ +  + G+ 
Sbjct: 428 AITFGSFNNPAKITPETVALWAGVLRSVPGSRLLVKGYSLACPG-SRKRLEESFASHGIG 486

Query: 109 QHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
             R+ L  N  +  +H+      D+ LDT   NG TT+ + LW G PVVTL G    SRV
Sbjct: 487 PERLELTGNTPSYRDHLALYGRVDIALDTFPYNGTTTTCEALWMGVPVVTLAGSAHRSRV 546

Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            AS L  LG   L+A     + ++A  L  D+
Sbjct: 547 GASLLHALGLDALVAHDSGAFVELARALALDQ 578


>gi|167582750|ref|ZP_02375624.1| TPR domain protein [Burkholderia thailandensis TXDOH]
          Length = 781

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 78/148 (52%), Gaps = 1/148 (0%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-IQATAQALGLDQHRI 112
           + Y +FN+L K+  + + +W  VL A P++ + +      G+   I     + G+D HR+
Sbjct: 570 VTYGSFNRLNKLRQNVIALWSQVLLATPSARMVIGSIEQEGDRETIIEWFVSEGIDPHRL 629

Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
            F   AA   ++++    DVCLD     G TT+++ LW G P VTL G+++ SR +AS L
Sbjct: 630 TFQPRAAMPVYLQQHHHVDVCLDAFPYAGSTTTLNALWMGVPTVTLSGKSIPSRGSASWL 689

Query: 173 ATLGCPELIARTHKEYQDIAIRLGTDRD 200
           + +G  E IA   + +   +I    D D
Sbjct: 690 SHVGLTEYIAHDQESFVRTSIAAAQDLD 717


>gi|206558517|ref|YP_002229277.1| putative TPR repeat protein [Burkholderia cenocepacia J2315]
 gi|444358767|ref|ZP_21160141.1| tetratricopeptide repeat protein [Burkholderia cenocepacia BC7]
 gi|444367389|ref|ZP_21167339.1| tetratricopeptide repeat protein [Burkholderia cenocepacia
           K56-2Valvano]
 gi|198034554|emb|CAR50419.1| putative TPR repeat protein [Burkholderia cenocepacia J2315]
 gi|443602945|gb|ELT70984.1| tetratricopeptide repeat protein [Burkholderia cenocepacia
           K56-2Valvano]
 gi|443603327|gb|ELT71348.1| tetratricopeptide repeat protein [Burkholderia cenocepacia BC7]
          Length = 780

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 48/147 (32%), Positives = 76/147 (51%), Gaps = 7/147 (4%)

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ----ALGLDQHR 111
           Y +FN+L K+    + +W  VL A P + + L    A+G    Q T      A G+D  R
Sbjct: 570 YGSFNRLNKLSSHVIAVWARVLHADPTARMAL---GAIGNEGDQQTLTEWFVAAGIDASR 626

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    D+CLDT    G TT+++ LW G P +T+PG T+A R +AS 
Sbjct: 627 LSFHRRSNIPVYMQQHHGVDLCLDTFPYTGSTTTLNGLWMGVPTLTIPGSTMAGRGSASW 686

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           L  +GC   IA    ++   A+ L +D
Sbjct: 687 LQHVGCDAYIANDEDDFVAKALALSSD 713


>gi|92114109|ref|YP_574037.1| sulfotransferase [Chromohalobacter salexigens DSM 3043]
 gi|91797199|gb|ABE59338.1| sulfotransferase [Chromohalobacter salexigens DSM 3043]
          Length = 1415

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 72/143 (50%), Gaps = 3/143 (2%)

Query: 59  FNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQHRILFS 115
           FN   K++  TL+ W  ++  +P+S L L   P   E+    + AT +A G+ + R++  
Sbjct: 427 FNNPTKLNDVTLKQWAGIMHELPDSRLMLKGRPYTSESFCERLYATLEAAGIARERLIIE 486

Query: 116 NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATL 175
              +  E +     AD+ LD    +G  T+ +    G PVVTLPG T A R +A+ L   
Sbjct: 487 GPGSNYEMLDAYNRADIALDPWPYSGGLTTCEAFIMGVPVVTLPGPTFAGRHSATHLVHA 546

Query: 176 GCPELIARTHKEYQDIAIRLGTD 198
           G PEL+  +  EY+   I L +D
Sbjct: 547 GMPELVVNSWDEYRARVIELASD 569


>gi|428227087|ref|YP_007111184.1| hypothetical protein GEI7407_3665 [Geitlerinema sp. PCC 7407]
 gi|427986988|gb|AFY68132.1| TPR repeat-containing protein [Geitlerinema sp. PCC 7407]
          Length = 746

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 82/159 (51%), Gaps = 5/159 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQA 100
           R+  G+  DA++Y +     K +P T+++ + VL  VPNS L L+K  A   A   + + 
Sbjct: 538 REDLGISTDAVIYLSAQTGIKRNPDTVRLQMKVLAQVPNSYL-LIKGWADESALQQSFKQ 596

Query: 101 TAQALGLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
            A   G+D  R+ F  +   E  H     +ADV LDT   NG TT+M+ LW   P+VT  
Sbjct: 597 IASEEGVDSERLRFLPIVPSEATHRANMGIADVILDTYPYNGATTTMEALWMCVPIVTKV 656

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           G+  ++R + S +   G  E IA + +EY    + LG D
Sbjct: 657 GQQFSARNSYSMMMNAGIQEGIAWSDEEYIRWGVTLGRD 695


>gi|397628033|gb|EJK68713.1| hypothetical protein THAOC_10083, partial [Thalassiosira oceanica]
          Length = 670

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 52/156 (33%), Positives = 77/156 (49%), Gaps = 3/156 (1%)

Query: 47  YGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL- 105
           + LP  A +Y    + +K  P+   M   V +A PN+IL L +    GE N       L 
Sbjct: 453 HSLPGTATLYLCMQKPHKYSPAFDGMLAGVHRADPNAILVLHRVDQDGEHNHARFTDRLV 512

Query: 106 --GLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL 163
             G+D  R+ F         +    LADV LD+    G TT+ +    G  VVTLP + L
Sbjct: 513 RAGVDLDRVAFLPTQPHSLLMALYSLADVVLDSYHAGGCTTTREAFEVGALVVTLPAQYL 572

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            SR + +  + +G  +L+A T +EY DIA+R+ TD+
Sbjct: 573 GSRWSLAYYSIMGVTDLVAETREEYVDIAVRMATDK 608


>gi|425445188|ref|ZP_18825224.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9443]
 gi|389734922|emb|CCI01529.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9443]
          Length = 717

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
           R    +P+DAI+Y       K   + + + + +L+ VPNS L +  F     + E  ++ 
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMIYLQLQILQQVPNSYLLIKGFADKETIRELFLK- 578

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           +A  LG+ Q R+ F  N   +E H     +AD+ LDT   +G TT+++ LW G P+VT  
Sbjct: 579 SADELGISQDRLRFLPNDFNEETHRANLGIADIVLDTYPYSGATTTLETLWMGIPLVTRV 638

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           GE  A+R + + +   G  + IA + +EY    I+LG D++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSDEEYVQWGIKLGLDKN 679


>gi|357633166|ref|ZP_09131044.1| Tetratricopeptide TPR_1 repeat-containing protein [Desulfovibrio
           sp. FW1012B]
 gi|357581720|gb|EHJ47053.1| Tetratricopeptide TPR_1 repeat-containing protein [Desulfovibrio
           sp. FW1012B]
          Length = 741

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 54/151 (35%), Positives = 80/151 (52%), Gaps = 7/151 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALGLDQ 109
           I + +FN L KI  +TL +W  +L A+P   L LLK   +  A     + A  +  G+ +
Sbjct: 552 IWFGSFNHLSKIQDATLDLWSRILSALPEGKL-LLKSGQLDHAANRERMLAAFEGRGIAR 610

Query: 110 HR-ILFSNVAAKEEHVRRGQLADVCLD-TPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
            R +L    A    H+    L DV LD     +G TT+ D LW G PVVT+ G+ LA+R+
Sbjct: 611 DRLVLMGQTANWPAHMAAYDLVDVALDPVDAVSGVTTTCDALWMGVPVVTMAGDRLATRM 670

Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           AAS ++ LG  + +A T + Y  +A+ L  D
Sbjct: 671 AASLVSGLGHGDWVAATPEAYAGLAVALARD 701


>gi|238927671|ref|ZP_04659431.1| possible TPR repeat-containing protein [Selenomonas flueggei ATCC
           43531]
 gi|238884387|gb|EEQ48025.1| possible TPR repeat-containing protein [Selenomonas flueggei ATCC
           43531]
          Length = 1077

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 52/150 (34%), Positives = 75/150 (50%), Gaps = 7/150 (4%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLD 108
           +IV+ +FN   KI+   L++W  +L+ VP S L+L K     E + +A       A G+ 
Sbjct: 359 SIVFGSFNNFTKINDHVLRVWAEILRRVPESRLFL-KAAVFSEEDARAEMLTRIAAAGIS 417

Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
             R+         +++      D+ LD     G  T+ D L+ G PVVTL GETL SR  
Sbjct: 418 LSRV--DTEGNSPDYLAAYNRMDIALDPFPYPGGGTTCDALYMGVPVVTLSGETLGSRFG 475

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           AS L  +G   LIA T +EY  +A+ L  D
Sbjct: 476 ASLLTNIGAEALIAHTEEEYIALAVSLAGD 505



 Score = 65.1 bits (157), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 54/177 (30%), Positives = 77/177 (43%), Gaps = 14/177 (7%)

Query: 35   VPQSIVITSRQQYGLP-------EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL 87
            +P   V   R +  +P         AI++  FNQ  K     L +W  +L+ VP S L L
Sbjct: 861  LPSQFVYVPRAELPVPTGTPARQRGAILFGVFNQYRKWTDEMLCVWREILERVPQSKL-L 919

Query: 88   LK----FPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHT 143
            LK    F A      +     LG D  RIL     A  +++ R    D+ LDT    G  
Sbjct: 920  LKSQVFFAAAMRRAAEERLARLGFDLDRILLE--PATTDYMERYLDVDIALDTYPWTGGG 977

Query: 144  TSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            T+ D L+ G PVVT+ G   ++R + + L  +G  EL   +   Y   A+ L  D D
Sbjct: 978  TTCDALYMGVPVVTMYGTRRSTRFSYAMLHLVGAEELAVASASAYISCAVSLAEDLD 1034


>gi|425458862|ref|ZP_18838348.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9808]
 gi|389823716|emb|CCI27872.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9808]
          Length = 721

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
           R    +P+DAI+Y       K   + + + + +L+ VPNS L +  F     + E  ++ 
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMIYLQLQILQQVPNSYLLIKGFADKETIRELFLK- 578

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           +A  LG+ Q R+ F  N   +E H     +AD+ LDT   +G TT+++ LW G P+VT  
Sbjct: 579 SADELGISQDRLRFLPNDLHEETHRANLGIADIILDTYPYSGATTTLETLWMGIPLVTRV 638

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           GE  A+R + + +   G  + IA + +EY    I+LG D++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSEEEYVQWGIKLGLDQN 679


>gi|386392026|ref|ZP_10076807.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Desulfovibrio sp. U5L]
 gi|385732904|gb|EIG53102.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Desulfovibrio sp. U5L]
          Length = 741

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 53/151 (35%), Positives = 80/151 (52%), Gaps = 7/151 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF----PAVGEANIQATAQALGLDQ 109
           I + +FN L KI  +T+ +W  +L A+P   L LLK      A     + A  +  G+ +
Sbjct: 552 IWFGSFNHLSKIQNATVDLWSRILSALPEGKL-LLKSGQLDHAANRERMLAAFEGRGISR 610

Query: 110 HR-ILFSNVAAKEEHVRRGQLADVCLD-TPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
            R +L    A  + H+    L DV LD     +G TT+ D LW G PVVT+ G+ LA+R+
Sbjct: 611 DRLVLMGQTANWQAHMAAYDLIDVALDPVDAVSGVTTTCDALWMGVPVVTMAGDRLATRM 670

Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           AAS ++ LG  + +A T + Y  +A+ L  D
Sbjct: 671 AASLISGLGHGDWVAATPQAYAGLAVALARD 701


>gi|440755492|ref|ZP_20934694.1| O-linked N-acetylglucosamine transferase SPINDLY family-like
           protein [Microcystis aeruginosa TAIHU98]
 gi|440175698|gb|ELP55067.1| O-linked N-acetylglucosamine transferase SPINDLY family-like
           protein [Microcystis aeruginosa TAIHU98]
          Length = 721

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
           R    +P+DAI+Y       K   + + + + +L+ VPNS L +  F     + E  ++ 
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMIYLQLQILQQVPNSYLLIKGFADKETIRELFLK- 578

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           +A  LG+ Q R+ F  N   +E H     +AD+ LDT   +G TT+++ LW G P+VT  
Sbjct: 579 SADELGISQDRLRFLPNDLHEETHRANLGIADIILDTYPYSGATTTLETLWMGIPLVTRV 638

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           GE  A+R + + +   G  + IA + +EY    I+LG D++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSEEEYVQWGIKLGLDQN 679


>gi|425436658|ref|ZP_18817092.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9432]
 gi|389678581|emb|CCH92566.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9432]
          Length = 721

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
           R    +P+DAI+Y       K   + + + + +L+ VPNS L +  F     + E  ++ 
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMIYLQLQILQQVPNSYLLIKGFADKETIRELFLK- 578

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           +A  LG+ Q R+ F  N   +E H     +AD+ LDT   +G TT+++ LW G P+VT  
Sbjct: 579 SADELGISQDRLRFLPNDLHEETHRANLGIADIILDTYPYSGATTTLETLWMGIPLVTRV 638

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           GE  A+R + + +   G  + IA + +EY    I+LG D++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSEEEYVQWGIKLGLDQN 679


>gi|425449354|ref|ZP_18829194.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 7941]
 gi|389764011|emb|CCI09572.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 7941]
          Length = 726

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
           R    +P+DAI+Y       K   + + + + +L+ VPNS L +  F     + E  ++ 
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMIYLQLQILQQVPNSYLLIKGFADKETIRELFLK- 578

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           +A  LG+ Q R+ F  N   +E H     +AD+ LDT   +G TT+++ LW G P+VT  
Sbjct: 579 SADELGISQDRLRFLPNDLHEETHRANLGIADIILDTYPYSGATTTLETLWMGIPLVTRV 638

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           GE  A+R + + +   G  + IA + +EY    I+LG D++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSEEEYVQWGIKLGLDQN 679


>gi|381160353|ref|ZP_09869585.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
 gi|380878417|gb|EIC20509.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
          Length = 629

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 53/145 (36%), Positives = 74/145 (51%), Gaps = 4/145 (2%)

Query: 58  NFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG---EANIQATAQALGLDQHRILF 114
           +FN L K+   +L +W  VL+A+P + L L    A+    +A I    +  G+   R+ F
Sbjct: 431 SFNNLTKVRHESLVLWSQVLRALPQARLILKDKRALDHSVQARILGPLREQGIHPDRVEF 490

Query: 115 SNV-AAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
               AA  +H+      D+ LD+   N  TT  D LW GTP++TL G+ LA R AAS L+
Sbjct: 491 RGASAAWSDHMDAYNDTDIALDSLPFNSATTGFDALWMGTPLITLTGDRLAGRQAASLLS 550

Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
            LG  E IAR   E+  I   L  D
Sbjct: 551 GLGRTEWIARDADEFVAIVTGLAQD 575


>gi|390440787|ref|ZP_10228990.1| conserved hypothetical protein [Microcystis sp. T1-4]
 gi|389835918|emb|CCI33116.1| conserved hypothetical protein [Microcystis sp. T1-4]
          Length = 256

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 54/162 (33%), Positives = 82/162 (50%), Gaps = 9/162 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
           R+   +P DA +Y +     K  P T+++ + ++  VPNS   L+K   VG+   IQ   
Sbjct: 58  REDLEIPPDATIYFSVQSGMKRHPDTIKLQMKIIAQVPNSYF-LIK--GVGKTEKIQELF 114

Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
             +    G++  R+ F      E  H    Q+ADV LDT   NG TT+++VLW G P+VT
Sbjct: 115 TEIAIREGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 174

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           L GE  ++R + + +   G    I     EY D  ++LG DR
Sbjct: 175 LVGEQFSARNSYTFMINAGIEAGIGWNEAEYIDWGVKLGLDR 216


>gi|63095209|gb|AAY32335.1| SPY [Phyllostachys praecox]
          Length = 781

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 76/149 (51%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           I + +FN L KI P  LQ+W  +L AVPNS L +   P   ++  Q   +T + LG +  
Sbjct: 502 ITFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSIRQKFLSTLEELGSESL 561

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G      V  
Sbjct: 562 RVDLLPLIHLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHGHNVGV 621

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  +G   L+A+   EY  +A+ L +D
Sbjct: 622 SLLTKVGLGRLVAKPEDEYISLALDLASD 650


>gi|295678098|ref|YP_003606622.1| hypothetical protein BC1002_3066 [Burkholderia sp. CCGE1002]
 gi|295437941|gb|ADG17111.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia sp. CCGE1002]
          Length = 1634

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 45/137 (32%), Positives = 72/137 (52%), Gaps = 2/137 (1%)

Query: 54   IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE-ANIQATAQALGLDQHRI 112
            + + +F+++ KI P  + +W  VL+ VP+S + +    A  + + + A     G+D  RI
Sbjct: 1161 MTFGSFSKMLKIGPEVIALWARVLRKVPHSRMVIGAIVAPEQISKVSALFANEGIDGSRI 1220

Query: 113  LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
             F   A    ++ +    DVCLD     G TT+M  LW G P VTLPG ++ASR +   L
Sbjct: 1221 SFLPRAGMAAYLEQHHQIDVCLDAFPFAGSTTTMHALWMGVPTVTLPGVSMASRGSTGWL 1280

Query: 173  ATLGCPE-LIARTHKEY 188
            + LG  E  +AR   ++
Sbjct: 1281 SQLGIDEAFVARDEDDF 1297



 Score = 68.6 bits (166), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 74/151 (49%), Gaps = 10/151 (6%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-IQATAQALGLDQHRI 112
           + + +F+++ KI P+ + +W  VL+ VP++ L +       E + +     A G+   R+
Sbjct: 533 LTFGSFSRMSKIGPAVVALWARVLREVPDARLVIGSISGQDEMDKLTERFAAEGIGAARL 592

Query: 113 LF----SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
            F      +  +E+H       DVCLD     G TT++  LW G P VTLPG+++ASR +
Sbjct: 593 GFLPRKGMLVYQEQH----HQIDVCLDAFPFGGSTTTLQALWMGVPTVTLPGDSMASRSS 648

Query: 169 ASQLATLGC-PELIARTHKEYQDIAIRLGTD 198
              L+ LG     +A    +Y    + L  D
Sbjct: 649 TGWLSLLGLDAPFVAHDKDDYVRKCVTLAAD 679


>gi|425440157|ref|ZP_18820465.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
 gi|389719465|emb|CCH96700.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
          Length = 508

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 9/162 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
           R+   +P DA +Y +     K  P T+++ + +L  VPNS   L+K   VG+   IQ   
Sbjct: 310 REDLEIPPDATIYFSVQSGMKRHPDTIKLQMKILAQVPNSYF-LIK--GVGKTEKIQELF 366

Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
             +    G++  R+ F      E  H    Q+ADV LDT   NG TT+++VLW G P+VT
Sbjct: 367 TEIAIREGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 426

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           L GE  ++R + + +   G    I     EY D  ++LG DR
Sbjct: 427 LVGEQFSARNSYTFMINAGIEAGIGWNEAEYIDWGVKLGLDR 468


>gi|427713486|ref|YP_007062110.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
            [Synechococcus sp. PCC 6312]
 gi|427377615|gb|AFY61567.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
            [Synechococcus sp. PCC 6312]
          Length = 1128

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 52/164 (31%), Positives = 82/164 (50%), Gaps = 9/164 (5%)

Query: 43   SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT- 101
             R    +P D I++ +     K  P T+ + + ++K VPNS   +  F    + +IQ + 
Sbjct: 915  KRHDLDIPNDVIIFLSSQGGQKRHPDTMSLQLQIIKNVPNSFFLIKGF--ADQVSIQESF 972

Query: 102  ---AQALGLDQHRILFSNVAAKEEHVRRGQL--ADVCLDTPLCNGHTTSMDVLWTGTPVV 156
               A  +GL   ++ F    +    + R  L  ADV LDT   NG TT+++ LW G P+V
Sbjct: 973  YSIASEIGLSASQLRFLKPDSTS-MIHRANLGIADVILDTFPYNGATTTLEALWMGIPLV 1031

Query: 157  TLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            T  G+  A+R + + L   G  E IA T +EY +  I+ GTD +
Sbjct: 1032 TRVGQQFAARNSYTFLKNAGVEEGIAWTDEEYVEWGIKFGTDHE 1075


>gi|307731403|ref|YP_003908627.1| hypothetical protein [Burkholderia sp. CCGE1003]
 gi|307585938|gb|ADN59336.1| Tetratricopeptide TPR_1 repeat-containing protein [Burkholderia sp.
           CCGE1003]
          Length = 820

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 80/160 (50%), Gaps = 10/160 (6%)

Query: 48  GLP---EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA-- 102
           GLP   +  I Y +FN+L K+    + +W  +L+A P S + +    A+G    + T   
Sbjct: 580 GLPALHKGYITYGSFNRLNKLSTEVIALWSTILRAQPTSRMVI---GAIGSKLDEETYLE 636

Query: 103 --QALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             +  G+D  R+ F        ++++    D+CLDT    G TT+++ LW G P VT+PG
Sbjct: 637 WFRNEGIDTSRLTFCPRTTLPVYMQQHHHVDLCLDTFPYVGSTTTLNALWMGVPTVTMPG 696

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            ++ SR  A  L  +G  + +AR   E+   +I L  D D
Sbjct: 697 TSMPSRCGAGWLEQVGLHDFVARDKDEFVKKSIELTRDLD 736


>gi|255579381|ref|XP_002530535.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
           communis]
 gi|223529939|gb|EEF31867.1| o-linked n-acetylglucosamine transferase, ogt, putative [Ricinus
           communis]
          Length = 930

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           I + +FN L KI P  LQ+W  +L AVPNS L +   P   ++  Q      + LGL+  
Sbjct: 661 ITFGSFNNLAKITPKVLQVWARILCAVPNSRLVVKCKPFCCDSVRQRFLTMLEELGLESL 720

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G   A  V  
Sbjct: 721 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGAIHAHNVGV 780

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L+ +G   L+A+    Y  +A++L +D
Sbjct: 781 SLLSKVGLGHLVAQNEDNYVQLALQLASD 809


>gi|440754653|ref|ZP_20933855.1| hypothetical protein O53_3044 [Microcystis aeruginosa TAIHU98]
 gi|440174859|gb|ELP54228.1| hypothetical protein O53_3044 [Microcystis aeruginosa TAIHU98]
          Length = 705

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 53/158 (33%), Positives = 79/158 (50%), Gaps = 2/158 (1%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL--LKFPAVGEANIQAT 101
           R   G+P DAI Y       K +P  +++ + VLKAVPNS L +  L      E      
Sbjct: 505 RDDLGIPGDAINYLTVQTGVKRNPEIIRLQLQVLKAVPNSYLSIQGLSDAKSVEKLFFKI 564

Query: 102 AQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
           A+  G++  R+    +     +    ++ADV LDT    G  T++DVLW G P+VT  G+
Sbjct: 565 AEEEGINYERLKILPLYPTGIYRANLRIADVVLDTYPFTGGMTTLDVLWMGIPLVTKVGQ 624

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
             +SR + + +   G  E IA + +EY D  I+LG D 
Sbjct: 625 QWSSRNSYTLMVNAGISEGIAWSDEEYIDWGIKLGKDE 662


>gi|332530455|ref|ZP_08406398.1| hypothetical protein HGR_11012 [Hylemonella gracilis ATCC 19624]
 gi|332040115|gb|EGI76498.1| hypothetical protein HGR_11012 [Hylemonella gracilis ATCC 19624]
          Length = 755

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 53/152 (34%), Positives = 83/152 (54%), Gaps = 8/152 (5%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA----LGLDQ 109
           + + + N L K+    L +W  +L  VPNS L L++   +G+    A+ +A    LG+D 
Sbjct: 414 VTFGSCNNLGKLTDDVLALWGRILATVPNSRL-LIEGKNLGQPEFSASYRARCERLGIDV 472

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDT-PLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
            R+    +   ++++   ++ DV LD  PL  G TTS DV+W G P++T+ G+ L SR+ 
Sbjct: 473 SRLDLVPLDYAKQYLVYHRI-DVALDPFPLVGG-TTSNDVIWMGVPLITMNGDALKSRMG 530

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
              LA +G  + IAR   EY  IA RL +D D
Sbjct: 531 VGMLAHMGRHDWIARNPDEYVAIASRLASDVD 562


>gi|427406429|ref|ZP_18896634.1| hypothetical protein HMPREF9161_00994 [Selenomonas sp. F0473]
 gi|425708248|gb|EKU71288.1| hypothetical protein HMPREF9161_00994 [Selenomonas sp. F0473]
          Length = 559

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 54/149 (36%), Positives = 77/149 (51%), Gaps = 7/149 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLDQ 109
           IV+ + N   K++ + L++W  +L  VP S L LLK      A+  A A     A G+  
Sbjct: 364 IVFGSLNNFTKVNDTVLRVWAKILHRVPTSRL-LLKAEIFSVADGAAEAWRRIAAAGIPA 422

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+      A  +++   +  D+ LDT    G  T+ + L+ G PVVTL GETL SR  A
Sbjct: 423 ERV--ETEGASADYLTAYRRIDIALDTFPYPGGGTTCEALYMGVPVVTLAGETLGSRFGA 480

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  +G   LIART +EY + A+ L  D
Sbjct: 481 SLLQNIGAQALIARTTEEYIERAVLLAQD 509


>gi|166364311|ref|YP_001656584.1| hypothetical protein MAE_15700 [Microcystis aeruginosa NIES-843]
 gi|166086684|dbj|BAG01392.1| hypothetical protein MAE_15700 [Microcystis aeruginosa NIES-843]
          Length = 711

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 9/162 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
           R+   +P DA +Y +     K  P T+++ + +L  VPNS   L+K   VG+   IQ   
Sbjct: 513 REDLEIPPDATIYFSVQSGMKRHPDTIKLQMKILAQVPNSYF-LIK--GVGKTEKIQELF 569

Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
             +    G++  R+ F      E  H    Q+ADV LDT   NG TT+++VLW G P+VT
Sbjct: 570 TEIAILEGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 629

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           L GE  ++R + + +   G    I     EY D  ++LG DR
Sbjct: 630 LVGEQFSARNSYTFMINAGIEAGIGWNEAEYIDWGVKLGLDR 671


>gi|425467182|ref|ZP_18846466.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
 gi|389830109|emb|CCI28120.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
          Length = 711

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 9/162 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
           R+   +P DA +Y +     K  P T+++ + +L  VPNS   L+K   VG+   IQ   
Sbjct: 513 REDLEIPPDATIYFSVQSGMKRHPDTIKLQMKILAQVPNSYF-LIK--GVGKTEKIQELF 569

Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
             +    G++  R+ F      E  H    Q+ADV LDT   NG TT+++VLW G P+VT
Sbjct: 570 TEIAILEGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 629

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           L GE  ++R + + +   G    I     EY D  ++LG DR
Sbjct: 630 LVGEQFSARNSYTFMINAGIEAGIGWNEAEYIDWGVKLGLDR 671


>gi|330820193|ref|YP_004349055.1| TPR repeat-containing protein [Burkholderia gladioli BSR3]
 gi|327372188|gb|AEA63543.1| TPR repeat-containing protein [Burkholderia gladioli BSR3]
          Length = 786

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 47/154 (30%), Positives = 73/154 (47%), Gaps = 3/154 (1%)

Query: 48  GLPEDAIVYCNF---NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA 104
            LP  A  Y  F   N   K+  ++ ++W  V++    + L L+         +    + 
Sbjct: 588 ALPALANGYPTFGCLNNPCKLGDASFRLWAEVMRGFDEARLILMAPEGSARRLLLDRLRG 647

Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
            G+D  R+ F+    + E++R     D+ LDT   NGHTTS+D  W G PVVT   +T+ 
Sbjct: 648 HGIDDERVAFTAFRPRAEYLRTYHEIDIGLDTLPYNGHTTSLDSYWMGVPVVTRITDTVV 707

Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            R   SQ A LG  E++A +   + +IA  L  D
Sbjct: 708 GRAGLSQAANLGLSEIVADSDARFVEIATGLARD 741


>gi|425454289|ref|ZP_18834035.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
 gi|389805097|emb|CCI15371.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
          Length = 711

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 9/162 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
           R+   +P DA +Y +     K  P T+++ + +L  VPNS   L+K   VG+   IQ   
Sbjct: 513 REDLEIPPDATIYFSVQSGMKRHPDTIKLQMKILAQVPNSYF-LIK--GVGKTEKIQELF 569

Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
             +    G++  R+ F      E  H    Q+ADV LDT   NG TT+++VLW G P+VT
Sbjct: 570 TEIAILEGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 629

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           L GE  ++R + + +   G    I     EY D  ++LG DR
Sbjct: 630 LVGEQFSARNSYTFMINAGIEAGIGWNEAEYIDWGVKLGLDR 671


>gi|418542280|ref|ZP_13107723.1| TPR domain-containing protein [Burkholderia pseudomallei 1258a]
 gi|418548903|ref|ZP_13113999.1| TPR domain-containing protein [Burkholderia pseudomallei 1258b]
 gi|385355974|gb|EIF62122.1| TPR domain-containing protein [Burkholderia pseudomallei 1258a]
 gi|385356826|gb|EIF62911.1| TPR domain-containing protein [Burkholderia pseudomallei 1258b]
          Length = 775

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 74/145 (51%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           + + +FN+L K+    + +W  +++AVP S + L   P  G A +    +  G+ + R+ 
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGAAMIEWFEQEGIARERLS 624

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + ++
Sbjct: 625 FQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTWMS 684

Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
            +G    IA    ++    I L +D
Sbjct: 685 HVGLESFIADDIDDFVAKGIALASD 709


>gi|328856695|gb|EGG05815.1| family 41 glycosyltransferase [Melampsora larici-populina 98AG31]
          Length = 313

 Score = 77.0 bits (188), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 32/71 (45%), Positives = 50/71 (70%), Gaps = 2/71 (2%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
           L +D +++ NFNQLYK+DP   + W+ +L +VP SILWLL+FPA GE +++ TA+     
Sbjct: 240 LSDDTVIFANFNQLYKVDPVIFKYWLEILASVPKSILWLLRFPAPGEPHLKLTAEKWAGP 299

Query: 109 Q--HRILFSNV 117
           +   RI+F++V
Sbjct: 300 EVASRIMFTDV 310


>gi|217423849|ref|ZP_03455349.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 576]
 gi|217392912|gb|EEC32934.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 576]
          Length = 775

 Score = 77.0 bits (188), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 74/145 (51%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           + + +FN+L K+    + +W  +++AVP S + L   P  G A +    +  G+ + R+ 
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGAAMIEWFEQEGIARERLS 624

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + ++
Sbjct: 625 FQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTWMS 684

Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
            +G    IA    ++    I L +D
Sbjct: 685 HVGLESFIADDIDDFVAKGIALASD 709


>gi|148238461|ref|YP_001223848.1| glycosyl transferase family protein [Synechococcus sp. WH 7803]
 gi|147847000|emb|CAK22551.1| Glycosyltransferase of family GT41 [Synechococcus sp. WH 7803]
          Length = 721

 Score = 77.0 bits (188), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 50/147 (34%), Positives = 79/147 (53%), Gaps = 4/147 (2%)

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL---GLDQHRI 112
           + +FN   K++P T++++  VL+ VPNS L L     V +A      QAL   G+ + RI
Sbjct: 531 FGSFNHSRKLNPGTIRLYAGVLRDVPNSTLVLKSVSFVEQAEKTRVMQALTAAGIGEDRI 590

Query: 113 LF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           L   + A  +EH+   +  DV LD     G T++ D L  G PVV+L    +  R+++S 
Sbjct: 591 LLLPSTANHKEHLALYREVDVVLDPFPYGGATSTCDALMMGVPVVSLASAGMVGRLSSSI 650

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           LA+ G  + IAR+  +Y+ IA  L  +
Sbjct: 651 LASAGLEKWIARSQTDYRRIARELAAE 677


>gi|422302254|ref|ZP_16389617.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
 gi|389788584|emb|CCI15664.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
          Length = 711

 Score = 77.0 bits (188), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 9/162 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
           R+   +P DA +Y +     K  P T+++ + +L  VPNS   L+K   VG+   IQ   
Sbjct: 513 REDLEIPPDATIYFSAQSGMKRHPDTIKLQMKILAQVPNSYF-LIK--GVGKTEKIQELF 569

Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
             +    G++  R+ F      E  H    Q+ADV LDT   NG TT+++VLW G P+VT
Sbjct: 570 TEIAIREGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 629

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           L GE  ++R + + +   G    I     EY D  ++LG DR
Sbjct: 630 LVGEEFSARNSYTFMINAGIEAGIGWNEAEYIDWGVKLGLDR 671


>gi|374293909|ref|YP_005040932.1| hypothetical protein AZOLI_p30436 [Azospirillum lipoferum 4B]
 gi|357427312|emb|CBS90256.1| conserved protein of unknown function; putative TPR domains
           [Azospirillum lipoferum 4B]
          Length = 732

 Score = 77.0 bits (188), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 49/147 (33%), Positives = 75/147 (51%), Gaps = 5/147 (3%)

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQHR 111
           + +FN L K++   L  W  +L  +P S L +    L  PA   A  +    A G++  R
Sbjct: 545 FGSFNILTKLNDGVLAAWAAILGRMPGSRLLMKTKALSCPATA-ALWRDRLAAAGIEPDR 603

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           +     ++  +H+RR    DV LD    +G TT+++ LW G PVVTLPGET +SR + + 
Sbjct: 604 VTMVGASSSLDHMRRCASVDVALDPFPFSGSTTTLETLWMGVPVVTLPGETFSSRHSLAF 663

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           L   G    IA    +Y + A+ L +D
Sbjct: 664 LTVAGVEGCIATDPADYVERAVALASD 690


>gi|386390730|ref|ZP_10075512.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Desulfovibrio sp. U5L]
 gi|385735611|gb|EIG55808.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Desulfovibrio sp. U5L]
          Length = 609

 Score = 77.0 bits (188), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 54/150 (36%), Positives = 74/150 (49%), Gaps = 9/150 (6%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATAQALGLDQHRI 112
           I + +FN + KI P+++ +W  VL +VP S L LLK  A  +A  +     A G     +
Sbjct: 342 ITFGSFNNVAKISPASIALWREVLASVPGSRL-LLKGKAFADAPTVDRFRDAFGPQAPSV 400

Query: 113 LF----SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
            F    S+  +  +  RR    D+ LDT   NG  T+ + LW G PV+TL G+   SRV 
Sbjct: 401 EFLPWSSDAVSHLDVYRR---IDIALDTIPYNGTMTTCEALWMGVPVITLLGDRFTSRVG 457

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           AS L   G  E IA     Y  +A RL  D
Sbjct: 458 ASLLTQAGLSEWIADDQAAYIRLAQRLAAD 487


>gi|428211055|ref|YP_007084199.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
            [Oscillatoria acuminata PCC 6304]
 gi|427999436|gb|AFY80279.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
            [Oscillatoria acuminata PCC 6304]
          Length = 2093

 Score = 77.0 bits (188), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 53/173 (30%), Positives = 83/173 (47%), Gaps = 10/173 (5%)

Query: 36   PQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF-PAVG 94
            P S V  +R  +G  E  +V+ +    YKI P   + W  +L AVPNSIL L  F PA  
Sbjct: 1733 PASTVNPTRASWGATEQTVVFMSGANFYKIIPELTETWAKILAAVPNSILVLYPFGPAWN 1792

Query: 95   EA--------NIQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLCNGHTTS 145
             +         +       G+D+ R++  N       ++   +LAD+ LD+   +G T+ 
Sbjct: 1793 SSYPAMPFWERLHRIFAQYGVDKGRLVLINTLPSPADIKECVKLADLYLDSYPYSGATSL 1852

Query: 146  MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            +D L  G P +   G +L SR AA+ L  +   +LIA+    Y  +A +L T+
Sbjct: 1853 LDPLQLGIPAIAWEGSSLRSRQAAALLREIHLADLIAQDESSYIQLATKLATN 1905


>gi|425444850|ref|ZP_18824891.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
 gi|389735306|emb|CCI01162.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
          Length = 712

 Score = 77.0 bits (188), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 9/162 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
           R+   +P DA +Y +     K  P T+++ + +L  VPNS   L+K   VG+   IQ   
Sbjct: 514 REDLEIPPDATIYFSAQSGMKRHPDTIKLQMKILARVPNSYF-LIK--GVGKTEKIQELF 570

Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
             +    G++  R+ F      E  H    Q+ADV LDT   NG TT+++VLW G P+VT
Sbjct: 571 TEIAIREGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 630

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           L GE  ++R + + +   G    I     EY D  ++LG DR
Sbjct: 631 LVGEQFSARNSYTFMINAGIEAGIGWNEAEYIDWGVKLGLDR 672


>gi|118581731|ref|YP_902981.1| hypothetical protein Ppro_3331 [Pelobacter propionicus DSM 2379]
 gi|118504441|gb|ABL00924.1| Tetratricopeptide TPR_2 repeat protein [Pelobacter propionicus DSM
           2379]
          Length = 633

 Score = 77.0 bits (188), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 80/159 (50%), Gaps = 9/159 (5%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLD 108
           L   A+ +  +N   K++   L +W  VL+ +P S L  LK  A+   +++   + L L 
Sbjct: 432 LERGAVTFVAYNNFAKLNCPLLDLWCRVLRMIPGSRL-ALKDRALAN-DVEFRMEMLDLF 489

Query: 109 QHR-------ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE 161
             R       I+         H+      D+ LD+   +G TT+ + LW G PVVTL G 
Sbjct: 490 AQRGITADRLIIRDRTETVHAHLDALAEGDIALDSHPYSGTTTTCESLWMGVPVVTLAGG 549

Query: 162 TLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           T ASRV+AS L ++G PEL+A +  ++  IA+ L  D D
Sbjct: 550 THASRVSASLLTSVGVPELVAESDDDFVAIAVALAKDTD 588


>gi|254186382|ref|ZP_04892899.1| tetratricopeptide repeat protein [Burkholderia pseudomallei Pasteur
           52237]
 gi|157934067|gb|EDO89737.1| tetratricopeptide repeat protein [Burkholderia pseudomallei Pasteur
           52237]
          Length = 836

 Score = 76.6 bits (187), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 42/145 (28%), Positives = 74/145 (51%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           + + +FN+L K+    + +W  +++AVP S + L   P  G A +    +  G+ + R+ 
Sbjct: 626 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGAAMIEWFEQEGIARERLS 685

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLA 173
           F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + ++
Sbjct: 686 FQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTWMS 745

Query: 174 TLGCPELIARTHKEYQDIAIRLGTD 198
            +G    IA    ++    I L +D
Sbjct: 746 HVGLESFIADDIDDFVAKGIALASD 770


>gi|301113846|ref|XP_002998693.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase,
           putative [Phytophthora infestans T30-4]
 gi|262111994|gb|EEY70046.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase,
           putative [Phytophthora infestans T30-4]
          Length = 482

 Score = 76.6 bits (187), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 46/166 (27%), Positives = 79/166 (47%), Gaps = 16/166 (9%)

Query: 14  IVLQNGLATNQTNTKTATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMW 73
           ++L N    N  +   A+ E + +   +    ++   ++  V+ NFN++ K+  S    W
Sbjct: 256 VILPNCYQVNYYDQILASSETISRRSDLWDIAKHEGGDEGFVFVNFNKIDKLKASLFSTW 315

Query: 74  VNVLKAVPNSILWLLKFPAVGE---------------ANIQATAQALGLDQHRILFSNVA 118
           ++ L+ VP S L LL  P   E                N+   AQA G+ + R+ F +  
Sbjct: 316 MSTLRRVPRSSLLLLD-PGRNELGDDVTESVTSREIKKNLWKEAQAQGISRGRVRFVSRI 374

Query: 119 AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
            K EH+ R  +  + LDT +   H+T+ D L+ G PV+TL G++ A
Sbjct: 375 PKAEHLLRHHVGGLFLDTFIYGAHSTATDALYAGLPVITLAGDSFA 420


>gi|425445187|ref|ZP_18825223.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
 gi|389734921|emb|CCI01528.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
          Length = 719

 Score = 76.6 bits (187), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 47/162 (29%), Positives = 83/162 (51%), Gaps = 8/162 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
           R+  G+  DA++Y +     K +P+ +++ + ++K+VPNS   +     V + N      
Sbjct: 519 REDLGINNDAVIYLSSQTAIKRNPAMIRLQMQIIKSVPNSYFLI---QGVADDNSLWDLF 575

Query: 99  QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
              A  +G++ +RI    +   E +     +ADV LDT   NG TT+++ LW G P+V  
Sbjct: 576 CQIAAEVGVETNRIKMLPIYQTETYRANLAIADVVLDTYPFNGGTTTLETLWMGIPLVVK 635

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            G+  +SR   + +   G  E IA + +EY    I+LG D++
Sbjct: 636 VGQQWSSRNGYTLMMNAGITEGIAWSDEEYVQWGIKLGLDKN 677


>gi|238025842|ref|YP_002910073.1| TPR domain-containing protein [Burkholderia glumae BGR1]
 gi|237875036|gb|ACR27369.1| TPR domain protein [Burkholderia glumae BGR1]
          Length = 790

 Score = 76.6 bits (187), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 75/146 (51%), Gaps = 1/146 (0%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT-AQALGLDQHRI 112
           + + +FN+L K+    + +W  +L AVPNS + +   P VG  ++      A G+   R+
Sbjct: 570 VTFGSFNRLNKLRREVIAVWARILHAVPNSRMRVGSVPRVGGVDMLLEWFTAEGIAHQRL 629

Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
                A    ++++    D CLDT    G TT+++ LW G P +T+ G TLASR  A  +
Sbjct: 630 DLQPRAPAAVYLQQHHHVDFCLDTFPYTGSTTALNALWMGVPTLTIRGGTLASRAGAVWM 689

Query: 173 ATLGCPELIARTHKEYQDIAIRLGTD 198
           +++G  + +A +  ++    I L  D
Sbjct: 690 SSVGLEQFVADSPDDFVARGIALAND 715


>gi|425464281|ref|ZP_18843603.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9809]
 gi|389833751|emb|CCI21471.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9809]
          Length = 721

 Score = 76.6 bits (187), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
           R    +P+DAI+Y       K   + + + + +L+ VPNS L +  F     + E  +++
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMIYLQLQILQQVPNSYLLIKGFADKETIRELFLKS 579

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
            A  LG+ Q R+ F  N   +E H     +AD+ LDT   +G TT+++ LW G P+VT  
Sbjct: 580 -ADELGISQDRLRFLPNDFNEETHRANLGIADIVLDTYPYSGATTTLETLWMGIPLVTRV 638

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           GE  A+R + + +   G  + IA + +EY    I+LG +++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSDEEYVQWGIKLGLNKN 679


>gi|425468292|ref|ZP_18847322.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9701]
 gi|389885052|emb|CCI34703.1| Similar to tr|Q8YZB7|Q8YZB7 [Microcystis aeruginosa PCC 9701]
          Length = 721

 Score = 76.6 bits (187), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 85/161 (52%), Gaps = 5/161 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
           R    +P+DAI+Y       K   + + + + +L+ VPNS L +  F     + E  ++ 
Sbjct: 520 RTDLNIPDDAIIYLTVQSGLKRTLNMICLQLQILQQVPNSYLLIKGFADKETIRELFLK- 578

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           +A  LG+ Q R+ F  N   +E H     +AD+ LDT   +G TT+++ LW G P+VT  
Sbjct: 579 SADELGISQDRLRFLPNDFNEETHRANLGIADIVLDTYPYSGATTTLETLWMGIPLVTRV 638

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           GE  A+R + + +   G  + IA + +EY    I+LG D++
Sbjct: 639 GEQFAARNSYTFMKNAGISQGIAWSDEEYVQWGIKLGLDQN 679


>gi|322421643|ref|YP_004200866.1| hypothetical protein GM18_4176 [Geobacter sp. M18]
 gi|320128030|gb|ADW15590.1| Tetratricopeptide TPR_1 repeat-containing protein [Geobacter sp.
           M18]
          Length = 598

 Score = 76.3 bits (186), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 76/149 (51%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQH 110
           I + +FN L K+ P  + +W  VL+AVP S+L L   P   +     I     A G+   
Sbjct: 401 ITFGSFNNLAKVTPEVIALWSRVLQAVPGSLLLLKAKPLADQGVRRRILDLFAAGGVAGD 460

Query: 111 RILFS-NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           RI F        EH+ + +  DV LDT   NG TT+ + LW G P++TL G+  ASR  A
Sbjct: 461 RIEFDPGQPGIREHLAQYRRVDVALDTFPYNGTTTTCEALWMGVPIITLAGDRHASRTGA 520

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L   G  +L+ R+  +Y + A ++  D
Sbjct: 521 SLLTNCGLADLVTRSEAQYLETARQVAQD 549


>gi|107024411|ref|YP_622738.1| hypothetical protein Bcen_2869 [Burkholderia cenocepacia AU 1054]
 gi|116688262|ref|YP_833885.1| TPR repeat-containing protein [Burkholderia cenocepacia HI2424]
 gi|105894600|gb|ABF77765.1| Tetratricopeptide TPR_2 [Burkholderia cenocepacia AU 1054]
 gi|116646351|gb|ABK06992.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia cenocepacia
           HI2424]
          Length = 780

 Score = 76.3 bits (186), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 46/144 (31%), Positives = 74/144 (51%), Gaps = 1/144 (0%)

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT-AQALGLDQHRILF 114
           Y +FN+L K+ P  + +W  VL A P + + L      G+  +      A G+D  R+ F
Sbjct: 570 YGSFNRLNKLSPHVIGVWARVLHADPTARMALGAIGNEGDQRVLIDWFAAAGIDASRLSF 629

Query: 115 SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLAT 174
              +    ++++    D+CLDT    G TT+++ LW G P +T+ G TLA R +A+ L  
Sbjct: 630 HRRSNIPVYLQQHHGVDLCLDTFPYTGSTTTLNALWMGVPTLTIAGVTLAGRGSANWLMH 689

Query: 175 LGCPELIARTHKEYQDIAIRLGTD 198
           +G    IA+   E+   A+ LG D
Sbjct: 690 VGLDTYIAKNEDEFVAKALALGKD 713


>gi|53804390|ref|YP_113965.1| hypothetical protein MCA1509 [Methylococcus capsulatus str. Bath]
 gi|53758151|gb|AAU92442.1| TPR domain protein [Methylococcus capsulatus str. Bath]
          Length = 699

 Score = 76.3 bits (186), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 51/149 (34%), Positives = 67/149 (44%), Gaps = 5/149 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ---ATAQALGLDQH 110
           I +   N L KI+   L +W  +L A P S L +         N++   A     GLD  
Sbjct: 482 ITFGCLNNLAKINTEVLDLWAQILAASPTSRLLIQTVALCDRLNLERFRALCVERGLDPA 541

Query: 111 RILFSNVAAKEEHVRRGQLADVCLDT-PLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+      + E+  +     D+ LD  P C G  TS D LW G PV+TL  + LA R   
Sbjct: 542 RLELRPATSLEKFAQTYHQIDIALDPFPFCGG-FTSFDALWMGVPVITLEQQRLAGRQTL 600

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  LG P LIA    +Y  IA+ L  D
Sbjct: 601 SMLMNLGLPHLIASNKTDYVSIALELSRD 629


>gi|254246744|ref|ZP_04940065.1| TPR repeat [Burkholderia cenocepacia PC184]
 gi|124871520|gb|EAY63236.1| TPR repeat [Burkholderia cenocepacia PC184]
          Length = 781

 Score = 76.3 bits (186), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 76/146 (52%), Gaps = 5/146 (3%)

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ---ALGLDQHRI 112
           Y +FN+L K+ P  + +W  VL     + + L      GE + Q   +   A G+D  R+
Sbjct: 571 YGSFNRLNKLSPQVVAVWARVLHGAQTARMVLGAIG--GEEDRQTLTEWFSAAGIDAGRL 628

Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
            F   ++   ++++    D+CLDT    G TT+++ LW GTP +TL G T+A R +++ L
Sbjct: 629 SFHRRSSIPVYMQQHHGVDLCLDTFPYTGSTTTLNALWMGTPTLTLSGATMAGRGSSTWL 688

Query: 173 ATLGCPELIARTHKEYQDIAIRLGTD 198
             +G    IA+   E+   A+ LG D
Sbjct: 689 QHVGLDAYIAKNEDEFVAKALALGKD 714


>gi|300867986|ref|ZP_07112625.1| hypothetical protein OSCI_3520023 [Oscillatoria sp. PCC 6506]
 gi|300334007|emb|CBN57803.1| hypothetical protein OSCI_3520023 [Oscillatoria sp. PCC 6506]
          Length = 2187

 Score = 76.3 bits (186), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 51/166 (30%), Positives = 79/166 (47%), Gaps = 10/166 (6%)

Query: 43   SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL--------WLLKFPAVG 94
            +R   G+ E   V+ +    YKI P   ++W  +L AVPNS+L        W   +PA  
Sbjct: 1512 TRNSLGISEQTNVFISGANFYKILPELREIWAKILAAVPNSMLILYPFGPAWTRSYPATP 1571

Query: 95   EAN-IQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLCNGHTTSMDVLWTG 152
              N + A     G++ +R+L          V+    LADV LD+    G T+ +D L  G
Sbjct: 1572 FVNNLYAVLAKYGVESNRLLIVKQLPSRGDVKEFLGLADVYLDSYPYAGATSLIDPLQVG 1631

Query: 153  TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             P V + G  L  R  ++ L  +  PELIA + + Y  +A+ LG++
Sbjct: 1632 LPTVVMEGNALRFRQGSAMLREIEIPELIADSQQSYIRLAVALGSN 1677


>gi|302878310|ref|YP_003846874.1| hypothetical protein Galf_1080 [Gallionella capsiferriformans ES-2]
 gi|302581099|gb|ADL55110.1| Tetratricopeptide TPR_1 repeat-containing protein [Gallionella
           capsiferriformans ES-2]
          Length = 963

 Score = 76.3 bits (186), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 50/145 (34%), Positives = 77/145 (53%), Gaps = 5/145 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQ 109
           I + +FN+L K   +T   W  +L+A+P S + +    L  P   E  I    +A G+  
Sbjct: 771 ITFGSFNRLVKNSDATYAAWAKILRALPESRMIIKTEALDDPVTRERVIACFVRA-GIAA 829

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            RIL     A+++H+      D+ LD     G  T+M+ L  G PVVTL   T+A R++A
Sbjct: 830 DRILLQGKTARDQHLAAFNQIDIALDPFPHGGGVTAMEGLMMGVPVVTLRWPTIAGRLSA 889

Query: 170 SQLATLGCPELIARTHKEYQDIAIR 194
           S L TLG  + IA T ++Y ++AI+
Sbjct: 890 SILTTLGLTDWIAETQQQYIELAIQ 914


>gi|222053752|ref|YP_002536114.1| hypothetical protein Geob_0648 [Geobacter daltonii FRC-32]
 gi|221563041|gb|ACM19013.1| TPR repeat-containing protein [Geobacter daltonii FRC-32]
          Length = 621

 Score = 75.9 bits (185), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 52/144 (36%), Positives = 72/144 (50%), Gaps = 5/144 (3%)

Query: 59  FNQLYKIDPSTLQMWVNVLKAVPNSILWLLK---FPAVGEAN-IQATAQALGLDQHRILF 114
           FN + KI+   + +W  +L  + +S L LLK   F   G A   Q      G+D  RI+ 
Sbjct: 433 FNNVAKINQRVITLWAKILHTLRDSRL-LLKNRLFDDPGCARTFQDRFTHAGIDPGRIML 491

Query: 115 SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLAT 174
              ++  E +R   L D+ LDT    G TT+ + LW G PVV + G T A R A S LA 
Sbjct: 492 RGHSSHMETLREYSLVDIALDTFPFPGGTTTCEALWMGVPVVAMSGVTFAERQAISLLAN 551

Query: 175 LGCPELIARTHKEYQDIAIRLGTD 198
            G  E IA + +EY +I + L  D
Sbjct: 552 AGHREWIATSEEEYMEIVLDLAAD 575


>gi|166363525|ref|YP_001655798.1| hypothetical protein MAE_07840 [Microcystis aeruginosa NIES-843]
 gi|166085898|dbj|BAG00606.1| hypothetical protein MAE_07840 [Microcystis aeruginosa NIES-843]
          Length = 264

 Score = 75.9 bits (185), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 84/161 (52%), Gaps = 5/161 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
           R    +P+DAI+Y       K   + + + + +L+ VPNS + +  F     + E  ++ 
Sbjct: 67  RTDLNIPDDAIIYLTVQSGLKRTLNMICLQLQILQQVPNSYILIKGFADKETIRELFLK- 125

Query: 101 TAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP 159
           +A  LG+ Q R+ F  N   +E H     +AD+ LDT   +G TT+++ LW G P+VT  
Sbjct: 126 SADELGISQDRLRFLPNDFNEETHRANLGIADIVLDTYPYSGATTTLETLWMGIPLVTRV 185

Query: 160 GETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           GE  A+R + + +   G  + IA + +EY    I+LG D +
Sbjct: 186 GEQFAARNSYTFMKNAGISQGIAWSDEEYVQWGIKLGLDEN 226


>gi|117924934|ref|YP_865551.1| hypothetical protein Mmc1_1636 [Magnetococcus marinus MC-1]
 gi|117608690|gb|ABK44145.1| Tetratricopeptide TPR_2 repeat protein [Magnetococcus marinus MC-1]
          Length = 968

 Score = 75.9 bits (185), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 50/147 (34%), Positives = 75/147 (51%), Gaps = 6/147 (4%)

Query: 57  CNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALGLDQHRI 112
            +FN L K+    L++W  +L  +P + L LLK    G      +     ++LG+D  RI
Sbjct: 500 ASFNTLTKMGHEVLRLWAEILTRLPQATL-LLKNGGFGSQAVVDDYLQRFESLGIDSKRI 558

Query: 113 LFSNVAAK-EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
               V    EEH+      D+ LDT   +G TT+ + LW G PVVTL G+   +RV  S 
Sbjct: 559 RCMGVTDTLEEHLNLYNQVDIALDTFPYHGTTTTCEALWMGVPVVTLQGDHHVNRVGGSL 618

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           L  +G  +L+A T K++ D+ + L  D
Sbjct: 619 LHQVGLHKLVASTPKQFVDLVVGLAQD 645


>gi|39997695|ref|NP_953646.1| TPR domain/SEC-C motif domain-containing protein [Geobacter
           sulfurreducens PCA]
 gi|39984587|gb|AAR35973.1| TPR domain/SEC-C motif domain protein [Geobacter sulfurreducens
           PCA]
          Length = 585

 Score = 75.5 bits (184), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 69/141 (48%), Gaps = 6/141 (4%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALGLD 108
            + + +FN L K+ P  +  W  +L  VP S L +LKF    +      +     A G+ 
Sbjct: 392 GVTFGSFNNLAKVSPQVIDAWSAILGRVPGSRL-MLKFKGFADETCRRRVSDAFGARGVS 450

Query: 109 QHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
             R+ L S V    +H+      D+ LDT   NG  T+ D LW G PV+T  G    +RV
Sbjct: 451 PDRLELVSFVENFNDHLALYNRLDLALDTFPYNGTATTCDALWMGVPVLTCCGTRHVARV 510

Query: 168 AASQLATLGCPELIARTHKEY 188
            AS L +LG PELI  + +EY
Sbjct: 511 GASILTSLGLPELITHSPEEY 531


>gi|409913047|ref|YP_006891512.1| TPR domain/SEC-C motif domain-containing protein [Geobacter
           sulfurreducens KN400]
 gi|307635063|gb|ADI85350.2| TPR domain/SEC-C motif domain protein [Geobacter sulfurreducens
           KN400]
          Length = 585

 Score = 75.5 bits (184), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 69/141 (48%), Gaps = 6/141 (4%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALGLD 108
            + + +FN L K+ P  +  W  +L  VP S L +LKF    +      +     A G+ 
Sbjct: 392 GVTFGSFNNLAKVSPQVIDAWSAILGRVPGSRL-MLKFKGFADETCRRRVSDAFGARGVS 450

Query: 109 QHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
             R+ L S V    +H+      D+ LDT   NG  T+ D LW G PV+T  G    +RV
Sbjct: 451 PDRLELVSFVEDFNDHLALYNRLDLALDTFPYNGTATTCDALWMGVPVLTCCGTRHVARV 510

Query: 168 AASQLATLGCPELIARTHKEY 188
            AS L +LG PELI  + +EY
Sbjct: 511 GASILTSLGLPELITHSPEEY 531


>gi|428179442|gb|EKX48313.1| hypothetical protein GUITHDRAFT_105920 [Guillardia theta CCMP2712]
          Length = 698

 Score = 75.5 bits (184), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 50/152 (32%), Positives = 79/152 (51%), Gaps = 15/152 (9%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           + +CN + LYK+DP+    W+ +L+ V  S L LL      E+++   ++A    + RI+
Sbjct: 479 LTFCNHDSLYKLDPTRFTSWMKLLEEVEGSTLLLLAQGNSTESSL--LSRAPPHLRSRIV 536

Query: 114 FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS--- 170
           F       +H+ R +  D+ LDT   +GHTTS + LW G P+V+LP    A RV +S   
Sbjct: 537 FQPFLDVRKHLERIRACDLALDTHGISGHTTSANYLWGGVPLVSLPSFMWAGRVTSSLLS 596

Query: 171 ------QLATLGCPELIARTHKEYQDIAIRLG 196
                  LA LG    IAR+  ++  ++ RL 
Sbjct: 597 ALSASCHLALLG----IARSEDDFVQLSKRLA 624


>gi|425464282|ref|ZP_18843604.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
 gi|389833752|emb|CCI21473.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
          Length = 719

 Score = 75.1 bits (183), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 47/162 (29%), Positives = 83/162 (51%), Gaps = 8/162 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
           R+  G+  DA++Y +     K +P+ +++ + ++K+VPNS   +     V + N      
Sbjct: 519 REDLGINNDAVIYLSSQTAIKRNPAMIRLQMQIIKSVPNSYFLI---QGVADDNSLLDLF 575

Query: 99  QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
              A  +G++ +RI    +   E +     +ADV LDT   NG TT+++ LW G P+V  
Sbjct: 576 CQIAAEVGVETNRIKMLPLYQTETYRANLAIADVVLDTYPFNGGTTTLETLWMGIPLVVK 635

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            G+  +SR   + +   G  E IA + +EY    I+LG D++
Sbjct: 636 VGQQWSSRNGYTLMMNAGITEGIAWSDEEYVQWGIKLGLDKN 677


>gi|428181283|gb|EKX50147.1| hypothetical protein GUITHDRAFT_135331 [Guillardia theta CCMP2712]
          Length = 511

 Score = 75.1 bits (183), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 49/150 (32%), Positives = 84/150 (56%), Gaps = 11/150 (7%)

Query: 56  YCNFNQLYKID--PSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           Y ++N L K+   P   + +   L+ +P S      F   G +  +   +AL  +++   
Sbjct: 325 YMHYNILDKVQAPPEHAEFYSERLQLLPVS-----AFKGEGSSEEEVMTEALPAERNLKN 379

Query: 114 FSNVAA---KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG-ETLASRVAA 169
           F+       ++E ++  ++AD+ LDTP  N H T+ +VL +G PV+TLPG +  +SRVAA
Sbjct: 380 FAKEFKNFLEQERIKGKRVADLFLDTPEYNAHGTATEVLSSGVPVLTLPGSKAHSSRVAA 439

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           S +   G PE+IAR  ++Y++IA +L + R
Sbjct: 440 SVVIAAGLPEIIARDLEDYEEIATKLISRR 469


>gi|71906424|ref|YP_284011.1| hypothetical protein Daro_0785 [Dechloromonas aromatica RCB]
 gi|71846045|gb|AAZ45541.1| TPR repeat [Dechloromonas aromatica RCB]
          Length = 796

 Score = 75.1 bits (183), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 46/144 (31%), Positives = 69/144 (47%), Gaps = 3/144 (2%)

Query: 60  NQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA---QALGLDQHRILFSN 116
           N+  K+    L +W ++L  VPNS L L+      E  ++A     Q  G+   R+    
Sbjct: 607 NRFSKVSMRALDLWGDLLLKVPNSRLLLIGRGGSDEQTVRALRKRFQEKGVPAERLEILE 666

Query: 117 VAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLG 176
                E++      D+CLD    NG TT  D +W G P VTL G++L SR  ++ L  + 
Sbjct: 667 SKPPHEYLDTYNQVDLCLDPFPFNGGTTGFDSIWMGVPYVTLRGDSLHSRAGSNILKYVD 726

Query: 177 CPELIARTHKEYQDIAIRLGTDRD 200
             +L+A +  EY   A+ L  DRD
Sbjct: 727 LADLVADSESEYVAKAVALAADRD 750


>gi|302878313|ref|YP_003846877.1| hypothetical protein Galf_1083 [Gallionella capsiferriformans ES-2]
 gi|302581102|gb|ADL55113.1| Tetratricopeptide TPR_1 repeat-containing protein [Gallionella
           capsiferriformans ES-2]
          Length = 750

 Score = 75.1 bits (183), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 78/146 (53%), Gaps = 5/146 (3%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN----IQATAQALGLD 108
           +I + +FN+L KI   T ++W  +L  VP S L +LK P + E +    I    +  GL 
Sbjct: 553 SITFGSFNRLAKISGQTYRIWAQILLQVPGSCL-ILKTPELNERSNRERILRYFENAGLT 611

Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
             RI+    ++  EH+R  Q  D+ LD     G  T+M+ L  G PV+TL    L  R++
Sbjct: 612 ADRIVMLGRSSWFEHMRAYQQVDIALDPFPHGGGMTAMEGLMMGVPVITLNWPILTGRLS 671

Query: 169 ASQLATLGCPELIARTHKEYQDIAIR 194
           +S + TLG  + I ++  +Y ++AI+
Sbjct: 672 SSLMTTLGLQDWIVQSEDQYLELAIK 697


>gi|428309702|ref|YP_007120679.1| O-linked N-acetylglucosamine transferase, SPINDLY family
           [Microcoleus sp. PCC 7113]
 gi|428251314|gb|AFZ17273.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Microcoleus sp. PCC 7113]
          Length = 506

 Score = 75.1 bits (183), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 53/166 (31%), Positives = 82/166 (49%), Gaps = 4/166 (2%)

Query: 34  EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV 93
           EVP   +  +R  +GL +DA+VY      YK  P    ++V + K VP +    ++   V
Sbjct: 301 EVPS--LTKTRSDFGLRDDAVVYLCCQAPYKYLPQHDFIFVEIAKRVPQAQFVFVR-ADV 357

Query: 94  GEANIQATAQALGLD-QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTG 152
            +  ++AT  A+G   +   +F +    E ++R   L+DV LDT    G  T++  L   
Sbjct: 358 LKPRLKATFAAVGKSSEDYCVFLSFLNHENYLRLNLLSDVFLDTLGFTGGNTTLAALACN 417

Query: 153 TPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            PVVT PGE +   ++   L  L   E IA    EY +IA+RLG +
Sbjct: 418 LPVVTCPGECMRGYMSTGMLKMLRLSETIAENEAEYIEIAVRLGLE 463


>gi|422343493|ref|ZP_16424421.1| hypothetical protein HMPREF9432_00481 [Selenomonas noxia F0398]
 gi|355378800|gb|EHG25980.1| hypothetical protein HMPREF9432_00481 [Selenomonas noxia F0398]
          Length = 1079

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 51/150 (34%), Positives = 76/150 (50%), Gaps = 7/150 (4%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLD 108
           +IV+ +FN   K++   L +W  +L+ VP S L LLK      A+ +  A    +A G+ 
Sbjct: 361 SIVFGSFNNFTKLNDRVLSVWAEILRRVPESRL-LLKTDVFSYADSRREALRRIEAAGIP 419

Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
             R+      A  +++      D+ LD     G  T+ D L+ G PVVTL GE+L SR  
Sbjct: 420 LVRV--DAEGASADYLAAYARVDIALDPFPYPGGGTTCDALYMGVPVVTLAGESLGSRFG 477

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           AS L  +G  EL A T ++Y  +A+ L  D
Sbjct: 478 ASLLENIGAKELAAHTEEDYIALAVSLAQD 507



 Score = 63.5 bits (153), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 52/177 (29%), Positives = 83/177 (46%), Gaps = 18/177 (10%)

Query: 35   VPQSIVITSRQQYGLPEDA---------IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
            +P   V   R   GLP  A         I +  FNQ  K   + L +W  +++ VP + L
Sbjct: 862  LPSQFVYVPRA--GLPAAAETPARSRGYITFGVFNQYRKYTDAMLTVWREIMERVPTARL 919

Query: 86   WLLKFPAVGEANIQATAQA----LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNG 141
             L+K      + +  TA+A    LG D  R+      A  ++++R    D+ LDT    G
Sbjct: 920  -LIKSQVFFSSAMTETARARMKRLGFDLRRVALE--PATTDYMQRYLNVDIALDTYPWPG 976

Query: 142  HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
              T+ D L+ G PVVT+ G   ++R + + LA +G  +L  +T  +Y + A+ L  D
Sbjct: 977  GGTTCDALYMGVPVVTMYGARRSTRFSYALLAHIGRTDLAVQTPADYIERAVSLAGD 1033


>gi|288963191|ref|YP_003453470.1| TPR repeat-containing protein [Azospirillum sp. B510]
 gi|288915443|dbj|BAI76926.1| TPR repeat-containing protein [Azospirillum sp. B510]
          Length = 1246

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 52/152 (34%), Positives = 74/152 (48%), Gaps = 5/152 (3%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLD 108
           A+ + +FN + K++P  +++W  VL+AVP S L L    L  P V     +A A   G+ 
Sbjct: 424 AVTFGSFNLIAKLNPRVIRVWSRVLQAVPASRLLLKSINLSDPVVCHRLEEAFA-VHGIG 482

Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
           + R+     ++           D+ LD     G  TS + LW G P+VT+PGE   SR  
Sbjct: 483 RDRLELRGPSSHAAMFEEYGDVDIALDPFPFTGGITSCEALWMGLPIVTMPGERPVSRQT 542

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           A  L  LG   LIAR   EY  IA  L +D +
Sbjct: 543 AGLLDILGLNGLIARDEDEYVRIAAALASDSE 574


>gi|381160367|ref|ZP_09869599.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
 gi|380878431|gb|EIC20523.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
          Length = 697

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/151 (31%), Positives = 77/151 (50%), Gaps = 5/151 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT----AQALGLDQ 109
           I + +FN + K+   T+ +W  ++  VPNS L +++  A+G+A ++          G+ Q
Sbjct: 498 IKFASFNNIAKLSDQTIWLWAQIILRVPNSQL-VIRDKALGDATVRQRIIDRFAIQGVVQ 556

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+       +EE++      D+ L      G TT+ + LW G PVV L G T   R++ 
Sbjct: 557 ERLDIKPSLRREEYLESYNDVDISLSPTPFGGGTTTAEALWMGVPVVCLRGGTWVGRISE 616

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           S + T+G  +L+A T +EY  IA  L T  D
Sbjct: 617 SIIKTVGLRDLVAETEEEYIHIATSLATRAD 647


>gi|67640477|ref|ZP_00439282.1| TPR domain protein [Burkholderia mallei GB8 horse 4]
 gi|121600690|ref|YP_994729.1| hypothetical protein BMASAVP1_A3451 [Burkholderia mallei SAVP1]
 gi|124383342|ref|YP_001027648.1| hypothetical protein BMA10229_A1668 [Burkholderia mallei NCTC
           10229]
 gi|126449970|ref|YP_001082618.1| hypothetical protein BMA10247_3099 [Burkholderia mallei NCTC 10247]
 gi|167000422|ref|ZP_02266236.1| TPR domain protein [Burkholderia mallei PRL-20]
 gi|254174654|ref|ZP_04881315.1| TPR domain protein [Burkholderia mallei ATCC 10399]
 gi|254201458|ref|ZP_04907822.1| TPR domain protein [Burkholderia mallei FMH]
 gi|254206800|ref|ZP_04913151.1| TPR domain protein [Burkholderia mallei JHU]
 gi|254357176|ref|ZP_04973450.1| TPR domain protein [Burkholderia mallei 2002721280]
 gi|121229500|gb|ABM52018.1| TPR domain protein [Burkholderia mallei SAVP1]
 gi|124291362|gb|ABN00631.1| TPR domain protein [Burkholderia mallei NCTC 10229]
 gi|126242840|gb|ABO05933.1| TPR domain protein [Burkholderia mallei NCTC 10247]
 gi|147747352|gb|EDK54428.1| TPR domain protein [Burkholderia mallei FMH]
 gi|147752342|gb|EDK59408.1| TPR domain protein [Burkholderia mallei JHU]
 gi|148026240|gb|EDK84325.1| TPR domain protein [Burkholderia mallei 2002721280]
 gi|160695699|gb|EDP85669.1| TPR domain protein [Burkholderia mallei ATCC 10399]
 gi|238521193|gb|EEP84646.1| TPR domain protein [Burkholderia mallei GB8 horse 4]
 gi|243063614|gb|EES45800.1| TPR domain protein [Burkholderia mallei PRL-20]
          Length = 776

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 76/147 (51%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT+++ LW G P +T+ GETL SR   + 
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTALNALWMGVPTLTMRGETLPSRAGLTW 683

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA    ++    I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710


>gi|428315272|ref|YP_007113154.1| glycosyl transferase family 2 [Oscillatoria nigro-viridis PCC 7112]
 gi|428238952|gb|AFZ04738.1| glycosyl transferase family 2 [Oscillatoria nigro-viridis PCC 7112]
          Length = 1301

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/165 (28%), Positives = 85/165 (51%), Gaps = 9/165 (5%)

Query: 43   SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA------ 96
            +R  +G  ++++V+ +  Q   I P   + W  ++ AVPNSIL L  F +  E       
Sbjct: 1077 TRSSWGATDESVVFVSGAQALTITPELRETWAKIIAAVPNSILVLYPFSSRSENYPTVPF 1136

Query: 97   --NIQATAQALGLDQHR-ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGT 153
               I+A     G+D+ R ++   + ++ + ++  +LAD+ LD+   +   +  + L  G 
Sbjct: 1137 YNQIRALFAEYGIDKKRLVVIKALPSRADCIKCLELADIYLDSYPYSSACSLAEPLLVGL 1196

Query: 154  PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            PVV   G+T  SR +A+ L  L  PE++A +   Y D++  LGT+
Sbjct: 1197 PVVARKGQTTRSRQSAAVLEELLLPEMVADSENSYLDLSTALGTN 1241


>gi|395650965|ref|ZP_10438815.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           protein [Pseudomonas extremaustralis 14-3 substr. 14-3b]
          Length = 1104

 Score = 74.7 bits (182), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/151 (30%), Positives = 72/151 (47%), Gaps = 5/151 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG----EANIQATAQALGLDQ 109
           + Y  FN   K++   L  W  +++A P+S L LLK  A G      ++     A G+D+
Sbjct: 423 VTYGCFNNPTKVNDVLLAHWAELMRATPDSRL-LLKGGAFGTRALRDHVHGVMAAHGIDR 481

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R++       +  +      D+ LD    +G  T+ + L  G PVVTLPG T A R +A
Sbjct: 482 ERVMIEGPVGHKHLLETYNRVDIALDPWPYSGGLTTCEALLMGVPVVTLPGPTFAGRHSA 541

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           + L   G PEL+  +  +Y+   + L  D D
Sbjct: 542 THLVNAGLPELVVHSWAQYRQRVMELAGDLD 572


>gi|304438234|ref|ZP_07398176.1| TPR domain/SEC-C domain protein [Selenomonas sp. oral taxon 149
           str. 67H29BP]
 gi|304368841|gb|EFM22524.1| TPR domain/SEC-C domain protein [Selenomonas sp. oral taxon 149
           str. 67H29BP]
          Length = 1077

 Score = 74.7 bits (182), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 50/154 (32%), Positives = 74/154 (48%), Gaps = 15/154 (9%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRI 112
            IV+ +FN   K++   L++W  +L+ VP S L+L       +A+I +   A      RI
Sbjct: 359 GIVFGSFNNFTKVNDRVLRVWAEILRRVPESRLFL-------KADIFSEDDAHAEVLERI 411

Query: 113 LFSNVAAK--------EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
             + +            +++      D+ LD     G  T+ D L+ G PVVTL GETL 
Sbjct: 412 AAAGIPTSCVDTEGNSPDYLAAYNRMDIALDPFPYPGGGTTCDALYMGVPVVTLSGETLG 471

Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           SR  AS L  +G   LIA T +EY  +A+ L  D
Sbjct: 472 SRFGASLLTNIGAEALIAHTEEEYISLAVSLAGD 505



 Score = 66.6 bits (161), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 54/171 (31%), Positives = 77/171 (45%), Gaps = 12/171 (7%)

Query: 34   EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK---- 89
            E+P      +RQ+      AI++  FNQ  K     L +W  +L+ VP S L LLK    
Sbjct: 872  ELPAPTGTPARQR-----GAILFGVFNQYRKWTDEMLCVWREILERVPQSKL-LLKSQVF 925

Query: 90   FPAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVL 149
            F A      +     LG D  RIL     A  +++ R    D+ LDT    G  T+ D L
Sbjct: 926  FAAAMRREAEERLARLGFDLDRILLE--PATTDYMERYLDVDIALDTYPWPGGGTTCDAL 983

Query: 150  WTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            + G PVVT+ G   ++R + + L  +G  EL   +   Y   A+ L  D D
Sbjct: 984  YMGVPVVTMYGTRRSTRFSYAMLHLVGAEELAVTSASAYISCAVSLAEDPD 1034


>gi|425468293|ref|ZP_18847323.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
 gi|389885053|emb|CCI34704.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
          Length = 719

 Score = 74.7 bits (182), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 83/162 (51%), Gaps = 8/162 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
           R+  G+  DA++Y +     K +P+ +++ + +LK+VPNS   +     V + N      
Sbjct: 519 REDLGINNDAVIYLSSQTAIKRNPAMIRLQMQILKSVPNSYFLI---QGVADDNSLLDLF 575

Query: 99  QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
              A  +G++ +RI    +   E +     +ADV LDT   NG TT+++ LW G P+V  
Sbjct: 576 CQIAAEVGVETNRIKMLPLYQTETYRANLAIADVVLDTYPFNGGTTTLETLWMGIPLVVK 635

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            G+  +SR   + +   G  E IA + +EY    I+LG D++
Sbjct: 636 VGQQWSSRNGYTLMMNAGITEGIAWSDEEYVQWGIKLGLDKN 677


>gi|401408423|ref|XP_003883660.1| hypothetical protein NCLIV_034150 [Neospora caninum Liverpool]
 gi|325118077|emb|CBZ53628.1| hypothetical protein NCLIV_034150 [Neospora caninum Liverpool]
          Length = 987

 Score = 74.7 bits (182), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/150 (34%), Positives = 78/150 (52%), Gaps = 5/150 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQH 110
           I + +FN L K+    +++W  +LKAVP S L L   P   E      +A  +A G+   
Sbjct: 792 ITFGSFNNLAKLGKQVIELWSRILKAVPTSRLLLKARPFANEEMQRKFKAKFEAHGVSGD 851

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETL-ASRVA 168
           RI   + V A  +H+    L D+ LD+    G TT+ + L  G PVV+L  + + A  V 
Sbjct: 852 RIDAMALVPACMDHLMVYSLVDIALDSFPYAGTTTTCEALVMGVPVVSLRRKNIHAHNVG 911

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           A+ LA  G PEL+A T ++Y  +A+ L  D
Sbjct: 912 ATLLAHYGLPELVADTPEQYVRLAVELAGD 941


>gi|357059079|ref|ZP_09119924.1| hypothetical protein HMPREF9334_01641 [Selenomonas infelix ATCC
           43532]
 gi|355372987|gb|EHG20325.1| hypothetical protein HMPREF9334_01641 [Selenomonas infelix ATCC
           43532]
          Length = 561

 Score = 74.7 bits (182), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/154 (33%), Positives = 73/154 (47%), Gaps = 17/154 (11%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQ 109
           +V+ +FN   K+  + L+ W  +L+ VP S L L       P   E  ++    A G+  
Sbjct: 365 VVFGSFNNFTKLSDTVLRTWAEILRRVPESRLLLKTDVFSCPDAREEALRRI-MAAGIPS 423

Query: 110 HRILFSNVAAKEEHVRRGQLA-----DVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
            R+       + E   +G LA     D+ LD     G  T+ D L+ G PVVTL GE+L 
Sbjct: 424 ERV-------ETEGTSQGYLAAYNRVDIALDPFPYPGGGTTCDALYMGVPVVTLAGESLG 476

Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           SR  AS L  +G   LIA T +EY  +A  L  D
Sbjct: 477 SRFGASLLENIGAGALIAHTEEEYTALAAALAGD 510


>gi|425443193|ref|ZP_18823419.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
 gi|389715556|emb|CCI00089.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
          Length = 569

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 83/162 (51%), Gaps = 8/162 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
           R+  G+  DA++Y +     K +P+ +++ + +LK+VPNS   +     V + N      
Sbjct: 369 REDLGINNDAVIYLSSQTAIKRNPAMIRLQMQILKSVPNSYFLI---QGVADDNSLWDLF 425

Query: 99  QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
              A  +G++ +RI    +   E +     +ADV LDT   NG TT+++ LW G P+V  
Sbjct: 426 CQIAAEVGVETNRIKMLPLYQTETYRANLAIADVVLDTYPFNGGTTTLETLWMGIPLVVK 485

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            G+  +SR   + +   G  E IA + +EY    I+LG D++
Sbjct: 486 VGQQWSSRNGYTLMMNAGITEGIAWSDEEYVQWGIKLGLDKN 527


>gi|94971753|ref|YP_593801.1| hypothetical protein Acid345_4728 [Candidatus Koribacter versatilis
           Ellin345]
 gi|94553803|gb|ABF43727.1| TPR repeat protein [Candidatus Koribacter versatilis Ellin345]
          Length = 718

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 48/137 (35%), Positives = 65/137 (47%), Gaps = 4/137 (2%)

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV---GEANIQATAQALGLDQHRI 112
           + + N+  KI   +L +W  V++AVP S L L    A        I A     G+   RI
Sbjct: 525 FGSLNRYSKIGQESLDLWAKVMEAVPKSKLLLEDRTADDTDAHQRISAELATHGIGADRI 584

Query: 113 LFS-NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
            F   +   E H+R     DV LDT   N  TT+ D LW G P+V + G   ASR+AA  
Sbjct: 585 EFEPYIPGHERHMRLYDRVDVALDTIPLNSGTTACDALWMGVPLVAMEGNRTASRIAAGF 644

Query: 172 LATLGCPELIARTHKEY 188
           L  +G  E IA + + Y
Sbjct: 645 LRAIGRTEWIADSEQNY 661


>gi|53724345|ref|YP_104386.1| hypothetical protein BMA2876 [Burkholderia mallei ATCC 23344]
 gi|52427768|gb|AAU48361.1| TPR domain protein [Burkholderia mallei ATCC 23344]
          Length = 821

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 44/147 (29%), Positives = 76/147 (51%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 610 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 668

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT+++ LW G P +T+ GETL SR   + 
Sbjct: 669 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTALNALWMGVPTLTMRGETLPSRAGLTW 728

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA    ++    I L +D
Sbjct: 729 MSHVGLESFIADDIDDFVAKGIALASD 755


>gi|384253147|gb|EIE26622.1| O-linked N-acetylglucosamine transferase [Coccomyxa subellipsoidea
           C-169]
          Length = 937

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 50/150 (33%), Positives = 74/150 (49%), Gaps = 5/150 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQATAQALGLDQH 110
           + + +FN L KI P  +  W  +L A+P+S L L   P   A   +++    QA G+D  
Sbjct: 700 VTFGSFNNLAKITPRVMAAWARILTALPSSRLVLKNKPFACASARSHVLGQLQAAGVDAS 759

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L    AA  +H+    L D+ LD     G TT+ + L+ G PV+TL G   A  V A
Sbjct: 760 RVDLLPLAAANSDHLATYALMDISLDPFPYAGTTTTCESLYMGVPVITLAGGCHAHNVGA 819

Query: 170 SQLATLGCPE-LIARTHKEYQDIAIRLGTD 198
           S L  +G  E  +A +  EY  +A+    D
Sbjct: 820 SLLDAMGMREGWVAHSEDEYVRLAVAAAAD 849


>gi|428180905|gb|EKX49771.1| hypothetical protein GUITHDRAFT_162101 [Guillardia theta CCMP2712]
          Length = 758

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 47/166 (28%), Positives = 75/166 (45%), Gaps = 28/166 (16%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSI--LWLLKFPAVGEANIQ------------ 99
           I++ NFN LYKIDP+ + +W + L +  +    LW+   P     N++            
Sbjct: 532 IIFANFNSLYKIDPALVNVWKSALFSSNDKSWQLWITNRPKKAAENLKNHLTDSKNELRQ 591

Query: 100 --------------ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTS 145
                          + +        IL +++ + ++H+R    AD+ LD+     H+T+
Sbjct: 592 WSESEDESLSHQFFTSQEKEKFSDELILVTDLQSAKDHIRVKSCADIFLDSFRLGSHSTA 651

Query: 146 MDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDI 191
            D L  G  +VTLP     SRVA+S LA  G    I RT +EY D+
Sbjct: 652 ADALAAGLVLVTLPHIGALSRVASSLLAASGASATIVRTSQEYHDL 697


>gi|428181273|gb|EKX50137.1| hypothetical protein GUITHDRAFT_103951 [Guillardia theta CCMP2712]
          Length = 865

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 34/93 (36%), Positives = 56/93 (60%)

Query: 47  YGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALG 106
           + +P D+IV   FN++ KID  T + W+ +L+    ++LWL++FP   +  I   A   G
Sbjct: 618 HDVPLDSIVLACFNRIQKIDEVTFETWMRILEQESRTVLWLIEFPKEAQRRINQHASRRG 677

Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLC 139
           +D  R+++S +AA+EE +     AD+ LDTP C
Sbjct: 678 VDPRRLIWSPMAAEEEFMHMAAAADLFLDTPRC 710


>gi|409992536|ref|ZP_11275720.1| type 11 methyltransferase [Arthrospira platensis str. Paraca]
 gi|291569835|dbj|BAI92107.1| probable glycosyl transferase [Arthrospira platensis NIES-39]
 gi|409936606|gb|EKN78086.1| type 11 methyltransferase [Arthrospira platensis str. Paraca]
          Length = 2556

 Score = 74.3 bits (181), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 52/172 (30%), Positives = 84/172 (48%), Gaps = 11/172 (6%)

Query: 36   PQ-SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
            PQ + V   R Q+G  +  IV+ +    YKI P   + WV +L  VPNSIL L  F    
Sbjct: 2023 PQPATVKPHRSQWGANDQTIVFMSGANFYKIIPEQGETWVKILAQVPNSILVLYPFAPSW 2082

Query: 95   EAN---------IQATAQALGLDQHR-ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTT 144
             +N         + AT    G++  R +L  ++ ++ +     +  D+ LD+   +G  +
Sbjct: 2083 SSNYASSQFIRRMYATLSKYGVEDKRLVLIKSLPSRTDVKACLEQGDIYLDSYPHSGSNS 2142

Query: 145  SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
             +D L  G PVV   G+TL S+  A+ + +LG P+LI  +   Y  +A+ L 
Sbjct: 2143 LVDPLEVGLPVVVRDGDTLRSKHGAAMMRSLGIPDLITDSESSYISLAVTLA 2194


>gi|425454004|ref|ZP_18833753.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
 gi|389799815|emb|CCI20675.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
          Length = 724

 Score = 74.3 bits (181), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 85/162 (52%), Gaps = 8/162 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
           R+  G+  DA++Y +     K +P+ +++ + ++K+VPNS   L++   V + N      
Sbjct: 519 REDLGINNDAVIYLSSQTAIKRNPAMIRLQMQIIKSVPNSYF-LIQ--GVADDNSLWDLF 575

Query: 99  QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
              A  +G++ +RI    +   E +     +ADV LDT   NG TT+++ LW G P+V  
Sbjct: 576 CQIAAEVGVETNRIKMLPIYQTETYRANLAIADVVLDTYPFNGGTTTLETLWMGIPLVVK 635

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            G+  +SR   + +   G  E IA + +EY    I+LG D++
Sbjct: 636 VGQQWSSRNGYTLMMNAGITEGIAWSDEEYVQWGIKLGLDKN 677


>gi|334126612|ref|ZP_08500561.1| TPR domain/SEC-C domain protein [Centipeda periodontii DSM 2778]
 gi|333391283|gb|EGK62401.1| TPR domain/SEC-C domain protein [Centipeda periodontii DSM 2778]
          Length = 548

 Score = 73.9 bits (180), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 46/155 (29%), Positives = 76/155 (49%), Gaps = 15/155 (9%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           +V+ +FN   K++   L+ W  +L+ VP S L+L       + ++ + A A     HR+ 
Sbjct: 360 VVFGSFNNFTKLNDRVLRTWAEILRRVPESRLFL-------KTDVFSYADARAEVLHRMG 412

Query: 114 FSNV--------AAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLAS 165
            + +         + ++++      D+ LD     G  T+ D L+ G PVVTL G +L S
Sbjct: 413 EAGIPLDRVDTEGSSKDYLAAYNRVDIALDPFPYPGGGTTCDALYMGVPVVTLAGASLGS 472

Query: 166 RVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           R  AS L  +G   L+AR  +EY  +A+ L  D D
Sbjct: 473 RFGASLLENVGAGALVARAEEEYIALAVSLAGDAD 507


>gi|166363522|ref|YP_001655795.1| hypothetical protein MAE_07810 [Microcystis aeruginosa NIES-843]
 gi|166085895|dbj|BAG00603.1| hypothetical protein MAE_07810 [Microcystis aeruginosa NIES-843]
          Length = 719

 Score = 73.9 bits (180), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 47/162 (29%), Positives = 83/162 (51%), Gaps = 8/162 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
           R+  G+  DA++Y +     K +P+ +++ + ++K+VPNS   +     V + N      
Sbjct: 519 REDLGINNDAVIYLSSQTAIKRNPAMIRLQMQIIKSVPNSYFLI---QGVADDNSLWDLF 575

Query: 99  QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
              A  +G++ +RI    +   E +     +ADV LDT   NG TT+++ LW G P+V  
Sbjct: 576 CQIAAEVGVETNRIKMLPLYQTETYRANLAIADVVLDTYPFNGGTTTLETLWMGIPLVVK 635

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            G+  +SR   + +   G  E I+ + +EY    I+LG DR+
Sbjct: 636 VGQQWSSRNGYTLMMNAGITEGISWSDEEYVQWGIKLGLDRN 677


>gi|357635838|ref|ZP_09133715.1| transcriptional activator domain protein [Desulfovibrio sp.
           FW1012B]
 gi|357580678|gb|EHJ46012.1| transcriptional activator domain protein [Desulfovibrio sp.
           FW1012B]
          Length = 788

 Score = 73.9 bits (180), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/151 (33%), Positives = 78/151 (51%), Gaps = 6/151 (3%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLD 108
           A+ + +FN+L K+  +TL +W  +L A+P + L +    L  P    A  +  A+A GL 
Sbjct: 591 AVTFGSFNKLAKLSDTTLALWKEILAALPRATLRIKDIALSDPGCRAALAERCARA-GLP 649

Query: 109 QHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
             R+ L      ++EH+ +    D+ LDT   +G TT+ + LW G PVVTL GE   SRV
Sbjct: 650 MDRLALLPGSPTRQEHLSQYGTIDIALDTFPYHGTTTTCEALWMGVPVVTLIGEQHHSRV 709

Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             S L++L      A    +Y   A+ L +D
Sbjct: 710 GLSVLSSLDATAWAATDKADYIGKAVALASD 740


>gi|381160359|ref|ZP_09869591.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
 gi|380878423|gb|EIC20515.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Thiorhodovibrio sp. 970]
          Length = 955

 Score = 73.9 bits (180), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 55/155 (35%), Positives = 79/155 (50%), Gaps = 8/155 (5%)

Query: 50  PEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF----PAVGEANIQATAQAL 105
           P   +   +FNQ  K+   +L++W  +L+AVP + L+L       P V E  I  T    
Sbjct: 749 PAGRLWLASFNQFAKVREESLELWSEILRAVPQAHLFLKDKKAADPMVRE-RILGTLARH 807

Query: 106 GLDQHRI-LFSNVAAKEEHVRRGQLADVCLD-TPLCNGHTTSMDVLWTGTPVVTLPGETL 163
           G+   R+ + S   +  +H+      D+ LD TP  +  TT  D LW GTP+VTL G+  
Sbjct: 808 GIHADRVKMASRTNSWSDHMGLYNYVDLALDPTPFTSA-TTGFDALWMGTPLVTLLGDQP 866

Query: 164 ASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           A R AAS L  LG  + IA T ++Y  IA+ L  D
Sbjct: 867 AGRQAASLLTGLGRTDWIACTQEDYVRIALDLLDD 901


>gi|167740689|ref|ZP_02413463.1| putative TPR domain protein [Burkholderia pseudomallei 14]
          Length = 629

 Score = 73.9 bits (180), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 418 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 476

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + 
Sbjct: 477 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 536

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA    ++    I L +D
Sbjct: 537 MSHVGLESFIADDIDDFVAKGIALASD 563


>gi|167721711|ref|ZP_02404947.1| putative TPR domain protein [Burkholderia pseudomallei DM98]
          Length = 622

 Score = 73.9 bits (180), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 411 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 469

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + 
Sbjct: 470 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 529

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA    ++    I L +D
Sbjct: 530 MSHVGLESFIADDIDDFVAKGIALASD 556


>gi|167896374|ref|ZP_02483776.1| putative TPR domain protein [Burkholderia pseudomallei 7894]
 gi|167920964|ref|ZP_02508055.1| putative TPR domain protein [Burkholderia pseudomallei BCC215]
 gi|254261993|ref|ZP_04953047.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 1710a]
 gi|254220682|gb|EET10066.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 1710a]
          Length = 776

 Score = 73.9 bits (180), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + 
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA    ++    I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710


>gi|254194528|ref|ZP_04900959.1| tetratricopeptide repeat protein [Burkholderia pseudomallei S13]
 gi|169651278|gb|EDS83971.1| tetratricopeptide repeat protein [Burkholderia pseudomallei S13]
          Length = 776

 Score = 73.9 bits (180), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + 
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA    ++    I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710


>gi|126441749|ref|YP_001060867.1| hypothetical protein BURPS668_3873 [Burkholderia pseudomallei 668]
 gi|126221242|gb|ABN84748.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 668]
          Length = 776

 Score = 73.6 bits (179), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + 
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA    ++    I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710


>gi|300868250|ref|ZP_07112881.1| TPR repeat-containing protein [Oscillatoria sp. PCC 6506]
 gi|300333774|emb|CBN58065.1| TPR repeat-containing protein [Oscillatoria sp. PCC 6506]
          Length = 728

 Score = 73.6 bits (179), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 53/166 (31%), Positives = 85/166 (51%), Gaps = 9/166 (5%)

Query: 41  ITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA 100
           I  R+ + +  + +VY   +   K +   ++  V +LK VPNSIL      A+G+  +  
Sbjct: 522 IDLRKSHRIGLEQVVYLCVSPGRKFNRELVKAQVAILKQVPNSIL---IHKAIGDIEVFQ 578

Query: 101 TA-----QALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTP 154
           +A     +A G+ +HRI F   V  +EEH     LADV LD+   NG T +++ LW   P
Sbjct: 579 SAYHQACEAEGIGKHRIKFLPRVPTEEEHRLIYLLADVFLDSYPYNGGTHTLEALWFNLP 638

Query: 155 VVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           VVT  GE   SR+  S +  L     ++ + +EY +  ++LG + D
Sbjct: 639 VVTRVGEQFLSRMGYSFMQALKMETGVSWSWEEYTEWGVKLGNNLD 684


>gi|167913028|ref|ZP_02500119.1| putative TPR domain protein [Burkholderia pseudomallei 112]
          Length = 776

 Score = 73.6 bits (179), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + 
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA    ++    I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710


>gi|134281637|ref|ZP_01768345.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 305]
 gi|167826278|ref|ZP_02457749.1| putative TPR domain protein [Burkholderia pseudomallei 9]
 gi|226193032|ref|ZP_03788642.1| tetratricopeptide repeat protein [Burkholderia pseudomallei
           Pakistan 9]
 gi|134247304|gb|EBA47390.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 305]
 gi|225934632|gb|EEH30609.1| tetratricopeptide repeat protein [Burkholderia pseudomallei
           Pakistan 9]
          Length = 776

 Score = 73.6 bits (179), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + 
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA    ++    I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710


>gi|53720932|ref|YP_109918.1| TPR domain-containing protein [Burkholderia pseudomallei K96243]
 gi|167817901|ref|ZP_02449581.1| putative TPR domain protein [Burkholderia pseudomallei 91]
 gi|167847785|ref|ZP_02473293.1| putative TPR domain protein [Burkholderia pseudomallei B7210]
 gi|167904747|ref|ZP_02491952.1| putative TPR domain protein [Burkholderia pseudomallei NCTC 13177]
 gi|254298777|ref|ZP_04966228.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 406e]
 gi|386863576|ref|YP_006276525.1| hypothetical protein BP1026B_I3558 [Burkholderia pseudomallei
           1026b]
 gi|418534702|ref|ZP_13100540.1| TPR domain-containing protein [Burkholderia pseudomallei 1026a]
 gi|52211346|emb|CAH37335.1| putative TPR domain protein [Burkholderia pseudomallei K96243]
 gi|157808480|gb|EDO85650.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 406e]
 gi|385358823|gb|EIF64806.1| TPR domain-containing protein [Burkholderia pseudomallei 1026a]
 gi|385660704|gb|AFI68127.1| TPR domain-containing protein [Burkholderia pseudomallei 1026b]
          Length = 776

 Score = 73.6 bits (179), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + 
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA    ++    I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710


>gi|320530500|ref|ZP_08031558.1| conserved domain protein [Selenomonas artemidis F0399]
 gi|320137333|gb|EFW29257.1| conserved domain protein [Selenomonas artemidis F0399]
          Length = 565

 Score = 73.6 bits (179), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 54/151 (35%), Positives = 74/151 (49%), Gaps = 7/151 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV----GEANIQATAQALGLDQ 109
           IV+ + N   KI+   LQ+W  +L+ VP + L LLK        G A  +    A G+  
Sbjct: 363 IVFGSLNNFTKINDRVLQVWAEILRRVPTARL-LLKTEIFSVSDGAAEARRRIAAAGIPP 421

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+      A  +++      D+ LDT    G  T+ D L+ G PVVT  GETL SR  A
Sbjct: 422 ERV--ETEGASADYLAAYGRIDIALDTFPYPGGGTTCDALYMGVPVVTRAGETLGSRFGA 479

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           S L  +G   LIA T +EY + A+ L  D D
Sbjct: 480 SLLRNIGADALIAYTEEEYIERAVFLAQDFD 510


>gi|428769585|ref|YP_007161375.1| hypothetical protein Cyan10605_1212 [Cyanobacterium aponinum PCC
           10605]
 gi|428683864|gb|AFZ53331.1| hypothetical protein Cyan10605_1212 [Cyanobacterium aponinum PCC
           10605]
          Length = 746

 Score = 73.6 bits (179), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 50/158 (31%), Positives = 80/158 (50%), Gaps = 6/158 (3%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILW--LLKFPAVGEANIQA 100
           +R+Q  +P+DA++Y N     K++P  +QM + +++ V NS L   + K   V +  + +
Sbjct: 543 TRKQLNIPDDAVIYLNMQNAAKLNPHIVQMQMEIIQNVKNSYLLFNVRKDKNVLKKYLYS 602

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLA--DVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
            A      + RI F  + +    + R  LA  D+ LDT   NG  T +D LW   P+VT 
Sbjct: 603 LADKFQGVKDRIRF--IPSYPPAIYRANLAIADIILDTYPFNGAVTVLDSLWQNIPLVTR 660

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
            G+   +R   S L  LG  E IA T ++Y +  I+ G
Sbjct: 661 VGQQYHARQGYSFLKNLGIEEGIAWTDRKYIEWGIKFG 698


>gi|398805152|ref|ZP_10564133.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Polaromonas sp. CF318]
 gi|398092314|gb|EJL82729.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Polaromonas sp. CF318]
          Length = 759

 Score = 73.6 bits (179), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 72/148 (48%), Gaps = 3/148 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQATAQALGLDQH 110
           + +  FN L K++ + +++W  +L  VP S L+L       A  +  + A   A G++  
Sbjct: 555 VTFGCFNNLTKMNDAVVKLWAQILNTVPASRLFLKARQLQQASAQREVMARFAAHGIEPG 614

Query: 111 RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS 170
           R++  +   +E ++   Q  D+ LD     G TT+++ LW G PV+TL GE   SR    
Sbjct: 615 RLILEDYVPRENYLAAYQRVDIALDPFPYPGGTTTVEALWMGVPVLTLAGERFLSRQGVG 674

Query: 171 QLATLGCPELIARTHKEYQDIAIRLGTD 198
            +   G PE +A   ++Y   A+    D
Sbjct: 675 LMMNAGLPEWVASDPRDYLSRAVAHAGD 702


>gi|374292328|ref|YP_005039363.1| hypothetical protein AZOLI_1877 [Azospirillum lipoferum 4B]
 gi|357424267|emb|CBS87134.1| conserved protein of unknown function; glycosyltransferase and TPR
           domains [Azospirillum lipoferum 4B]
          Length = 631

 Score = 73.6 bits (179), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 72/149 (48%), Gaps = 5/149 (3%)

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLDQHR 111
           + + N L K++     +W  +L AVP S L LL+ P + + +++A   AL    G D  R
Sbjct: 431 FGSLNALPKLNAQVAALWTRLLAAVPESRL-LLRTPGLDDPDLRARTLALFTAAGADPER 489

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           I     A   E +      DV LD    +G  T+++ LW G PVVTL G+   +R A + 
Sbjct: 490 IELRGAAPHAEFLATYGEIDVALDPFPYSGGLTTLEALWMGVPVVTLGGDRFCARHAVTH 549

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTDRD 200
           L + G P L     + Y  +A  L +D D
Sbjct: 550 LTSAGLPALAVEGEEAYLAMAAALVSDSD 578


>gi|254183905|ref|ZP_04890496.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 1655]
 gi|184214437|gb|EDU11480.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 1655]
          Length = 837

 Score = 73.6 bits (179), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 626 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 684

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + 
Sbjct: 685 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 744

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA    ++    I L +D
Sbjct: 745 MSHVGLESFIADDIDDFVAKGIALASD 771


>gi|402302461|ref|ZP_10821573.1| glycosyltransferases group 1 [Selenomonas sp. FOBRC9]
 gi|400380528|gb|EJP33345.1| glycosyltransferases group 1 [Selenomonas sp. FOBRC9]
          Length = 565

 Score = 73.6 bits (179), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 54/151 (35%), Positives = 74/151 (49%), Gaps = 7/151 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV----GEANIQATAQALGLDQ 109
           IV+ + N   KI+   LQ+W  +L+ VP + L LLK        G A  +    A G+  
Sbjct: 363 IVFGSLNNFTKINDRVLQVWAEILRRVPTARL-LLKTEIFSVSDGAAEARRRIAAAGIPP 421

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+      A  +++      D+ LDT    G  T+ D L+ G PVVT  GETL SR  A
Sbjct: 422 ERV--ETEGASADYLAAYGRIDIALDTFPYPGGGTTCDALYMGVPVVTRAGETLGSRFGA 479

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           S L  +G   LIA T +EY + A+ L  D D
Sbjct: 480 SLLRNIGADALIAYTEEEYIERAVFLAQDFD 510


>gi|76812180|ref|YP_331511.1| hypothetical protein BURPS1710b_0094 [Burkholderia pseudomallei
           1710b]
 gi|76581633|gb|ABA51108.1| TPR domain protein [Burkholderia pseudomallei 1710b]
          Length = 821

 Score = 73.6 bits (179), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 610 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 668

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + 
Sbjct: 669 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 728

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA    ++    I L +D
Sbjct: 729 MSHVGLESFIADDIDDFVAKGIALASD 755


>gi|197103965|ref|YP_002129342.1| TPR domain-containing protein [Phenylobacterium zucineum HLK1]
 gi|196477385|gb|ACG76913.1| TPR domain protein [Phenylobacterium zucineum HLK1]
          Length = 761

 Score = 73.6 bits (179), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 48/148 (32%), Positives = 72/148 (48%), Gaps = 5/148 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQH 110
           + +   N  YK +P  L+ W  +L   PNS    ++     E+   ++ A   A G+   
Sbjct: 573 VTFGTANNPYKYNPDVLRTWGRILARTPNSRFLFIRPEGGSESFRRHMTAAFAAEGVAPE 632

Query: 111 RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS 170
           RILF  V  + +H+      D+ LDT    G TT+ + LW G PVV+L GE L  R++ S
Sbjct: 633 RILFEPV--RGQHLPHYNRLDMSLDTFPQTGGTTTCESLWMGAPVVSLVGEALYERLSYS 690

Query: 171 QLATLGCPELIARTHKEYQDIAIRLGTD 198
            L  +G  +L A T   Y++ A  L  D
Sbjct: 691 VLNNIGLGDLCAETVAGYEETACALAAD 718


>gi|186477739|ref|YP_001859209.1| TPR repeat-containing protein [Burkholderia phymatum STM815]
 gi|184194198|gb|ACC72163.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia phymatum
           STM815]
          Length = 789

 Score = 73.6 bits (179), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 71/139 (51%), Gaps = 5/139 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ----ALGLDQ 109
           + +  F +L KI P+ +  W  +L+AVP + L ++K   +G A+ + +A     A G+D 
Sbjct: 596 VTFGYFGKLVKITPNVIAAWSRLLRAVPGAKL-MMKSHELGAAHARRSAAEQFAAQGIDA 654

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R++    + + E+++     D+ L      G TT+ + LW GTPVV L G+   + +  
Sbjct: 655 LRLILEGGSPRHEYLKAYNRVDIMLSPFPYPGGTTTAEALWMGTPVVALQGDRFVTHICE 714

Query: 170 SQLATLGCPELIARTHKEY 188
           S L   G  E IAR   EY
Sbjct: 715 SVLNAAGLGEWIARDEDEY 733


>gi|126452788|ref|YP_001068175.1| hypothetical protein BURPS1106A_3955 [Burkholderia pseudomallei
           1106a]
 gi|242316562|ref|ZP_04815578.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 1106b]
 gi|403520597|ref|YP_006654731.1| hypothetical protein BPC006_I3996 [Burkholderia pseudomallei
           BPC006]
 gi|126226430|gb|ABN89970.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 1106a]
 gi|242139801|gb|EES26203.1| tetratricopeptide repeat protein [Burkholderia pseudomallei 1106b]
 gi|403076239|gb|AFR17819.1| TPR repeat-containing protein [Burkholderia pseudomallei BPC006]
          Length = 776

 Score = 73.6 bits (179), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + 
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA    ++    I L +D
Sbjct: 684 MSHVGLESFIAGDIDDFVAKGIALASD 710


>gi|428219241|ref|YP_007103706.1| hypothetical protein Pse7367_3029 [Pseudanabaena sp. PCC 7367]
 gi|427991023|gb|AFY71278.1| Tetratricopeptide TPR_2 repeat-containing protein [Pseudanabaena
           sp. PCC 7367]
          Length = 1639

 Score = 73.6 bits (179), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 56/165 (33%), Positives = 84/165 (50%), Gaps = 10/165 (6%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA----VGEANIQ 99
           RQ+ G+ +  +VY       K +P+++     +L  VPNSI+  L+  A      +A   
Sbjct: 645 RQKLGITDQQVVYLYAAPAQKSNPASIAAQAQILAQVPNSII--LRKGAQDLDTAKALYA 702

Query: 100 ATAQALGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
                L +   RI+F      +EEH    Q+ADV LD+   NG T +++ LW   PVVTL
Sbjct: 703 EYCDPLAVSSDRIIFLPRTNTEEEHRIIYQIADVGLDSYPYNGGTHNLESLWFNLPVVTL 762

Query: 159 PGETLASRVAASQLATLGCPEL---IARTHKEYQDIAIRLGTDRD 200
            G+   +R+  S +AT+G  +    IA+   EY   A+RLG DRD
Sbjct: 763 VGDQSFARMGYSFVATIGLSDKKAGIAQNWHEYIAEAVRLGGDRD 807



 Score = 70.1 bits (170), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 52/165 (31%), Positives = 77/165 (46%), Gaps = 10/165 (6%)

Query: 44   RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL--------WLLKFP-AVG 94
            R  +G   +AIV+ +    YKI P     W  VL AVPNSIL        W  K+P    
Sbjct: 1403 RASWGANPEAIVFISGANFYKILPELRHAWAKVLAAVPNSILVLYPFGGNWRRKYPIQTF 1462

Query: 95   EANIQATAQALGLDQHRIL-FSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGT 153
             +++       G+   R++   N+  + +     Q AD+ LD+    G  +++D L  G 
Sbjct: 1463 VSHLLEVLAWYGVKPERLVTLRNIPTRADVKVCLQQADIYLDSFPYGGANSTIDPLEVGL 1522

Query: 154  PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            P V + GE L ++  A+ L  LG   LI     EY D A++L TD
Sbjct: 1523 PTVAMDGECLRNQQGAALLRQLGLDGLIVNNEAEYIDRAVQLATD 1567


>gi|326402277|ref|YP_004282358.1| hypothetical protein ACMV_01290 [Acidiphilium multivorum AIU301]
 gi|325049138|dbj|BAJ79476.1| hypothetical protein ACMV_01290 [Acidiphilium multivorum AIU301]
          Length = 733

 Score = 73.6 bits (179), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 77/149 (51%), Gaps = 5/149 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
           + +  FN L K+   TL +W  +L+ V N+ L LL+ P   E  I     + A ALG+D 
Sbjct: 538 VTFGCFNNLTKLTDETLSLWGRILEQVDNAHL-LLRCPQFSEDGIPERFISRAHALGVDT 596

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+     A   E +   +  D+ LD    +G  T+ + ++ G PV+TL G+  A+R + 
Sbjct: 597 SRVELRGRAPHPEFIEGYKDVDIALDPFPYSGGLTTCESMFMGVPVITLAGDFFAARHSV 656

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L+ +G  + +  + ++Y D A+ + +D
Sbjct: 657 SHLSNVGLTDCVTESPEQYIDRAVAMSSD 685


>gi|348670339|gb|EGZ10161.1| hypothetical protein PHYSODRAFT_523651 [Phytophthora sojae]
          Length = 502

 Score = 73.2 bits (178), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 64/128 (50%), Gaps = 16/128 (12%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG---------------EAN 97
             ++ NFN++ K++ S    W+ +L+ VP S L LL  P                    N
Sbjct: 296 GFLFVNFNKIDKLEASVFSTWMGILRRVPRSSLLLLD-PGRHVLDGNMTQSVTSHEIRKN 354

Query: 98  IQATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
           +   AQA G+   RI F +   K EH++R +   + LDT +   H+T+ D L+ G PV+T
Sbjct: 355 LWREAQAQGISTRRIRFVSRTPKLEHLQRHRTGGLFLDTFIYGAHSTATDALYAGLPVLT 414

Query: 158 LPGETLAS 165
           L G++ AS
Sbjct: 415 LAGDSFAS 422


>gi|148259123|ref|YP_001233250.1| hypothetical protein Acry_0103 [Acidiphilium cryptum JF-5]
 gi|146400804|gb|ABQ29331.1| TPR repeat-containing protein [Acidiphilium cryptum JF-5]
          Length = 733

 Score = 73.2 bits (178), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 77/149 (51%), Gaps = 5/149 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
           + +  FN L K+   TL +W  +L+ V N+ L LL+ P   E  I     + A ALG+D 
Sbjct: 538 VTFGCFNNLTKLTDETLSLWGRILEQVDNAHL-LLRCPQFSEDGIPERFISRAHALGVDT 596

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+     A   E +   +  D+ LD    +G  T+ + ++ G PV+TL G+  A+R + 
Sbjct: 597 SRVELRGRAPHPEFIEGYKDVDIALDPFPYSGGLTTCESMFMGVPVITLAGDFFAARHSV 656

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L+ +G  + +  + ++Y D A+ + +D
Sbjct: 657 SHLSNVGLTDCVTESPEQYIDRAVAMSSD 685


>gi|425472063|ref|ZP_18850914.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
 gi|389881954|emb|CCI37543.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
          Length = 712

 Score = 73.2 bits (178), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 54/162 (33%), Positives = 80/162 (49%), Gaps = 9/162 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA-NIQATA 102
           R+   +P DA +Y +     K  P T+++ + +L  VPNS   L+K   VG+   IQ   
Sbjct: 514 REDLEIPPDATIYFSVQNGMKRHPDTIKLQMKILAQVPNSYF-LIK--GVGKTEKIQELF 570

Query: 103 QAL----GLDQHRILFSNVAAKEE-HVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
             +    G++  R+ F      E  H    Q+ADV LDT   NG TT+++VLW G P+VT
Sbjct: 571 TEIAIREGVNPQRLRFLPRDIDEYTHRANLQIADVVLDTYPYNGATTTLEVLWQGIPLVT 630

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           L GE   +R + + +   G    I     EY    ++LG DR
Sbjct: 631 LVGEQFYARNSYTFMINAGIEAGIGWNEVEYIHWGVKLGLDR 672


>gi|113476143|ref|YP_722204.1| hypothetical protein Tery_2526 [Trichodesmium erythraeum IMS101]
 gi|110167191|gb|ABG51731.1| Tetratricopeptide TPR_2 [Trichodesmium erythraeum IMS101]
          Length = 1154

 Score = 73.2 bits (178), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 48/165 (29%), Positives = 82/165 (49%), Gaps = 13/165 (7%)

Query: 42   TSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQAT 101
            ++R   G+  D +VY       K +P  ++  + +L+  P SIL        G+ ++   
Sbjct: 948  STRNSLGIDSDQMVYLCIAPGRKTNPKMVRAQLKILQHTPKSILI-----RKGQGDVDVI 1002

Query: 102  AQA-------LGLDQHRILF-SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGT 153
             Q+       LG+D +RI+F      +EEH    Q+ADV LD+   NG T +++ LW+  
Sbjct: 1003 YQSYLEESENLGIDFNRIMFLGQTQTEEEHRAIYQIADVLLDSYPYNGGTHNLEALWSNL 1062

Query: 154  PVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            P++T  G+   SR+  S L  +     +A + +EY +  I+LG D
Sbjct: 1063 PIITRAGKQYLSRMGYSFLQNVNLDVGVAWSWEEYTEWGIKLGQD 1107


>gi|338988961|ref|ZP_08633852.1| TPR repeat-containing protein [Acidiphilium sp. PM]
 gi|338206109|gb|EGO94354.1| TPR repeat-containing protein [Acidiphilium sp. PM]
          Length = 719

 Score = 73.2 bits (178), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 77/149 (51%), Gaps = 5/149 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ----ATAQALGLDQ 109
           + +  FN L K+   TL +W  +L+ V N+ L LL+ P   E  I     + A ALG+D 
Sbjct: 524 VTFGCFNNLTKLTDETLSLWGRILEQVDNAHL-LLRCPQFSEDGIPERFISRAHALGVDT 582

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+     A   E +   +  D+ LD    +G  T+ + ++ G PV+TL G+  A+R + 
Sbjct: 583 SRVELRGRAPHPEFIEGYKDVDIALDPFPYSGGLTTCESMFMGVPVITLAGDFFAARHSV 642

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L+ +G  + +  + ++Y D A+ + +D
Sbjct: 643 SHLSNVGLTDCVTESPEQYIDRAVAMSSD 671


>gi|209527307|ref|ZP_03275816.1| Methyltransferase type 11 [Arthrospira maxima CS-328]
 gi|209492233|gb|EDZ92579.1| Methyltransferase type 11 [Arthrospira maxima CS-328]
          Length = 924

 Score = 73.2 bits (178), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 49/184 (26%), Positives = 90/184 (48%), Gaps = 13/184 (7%)

Query: 30  ATGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK 89
           + G E+PQ      RQ  G+  D +++ +    +KI P  ++ W  ++  VPNS+L L+ 
Sbjct: 565 SYGPELPQVRTTVDRQSLGIGSDRVIFISGANFFKIIPDLIETWAKIIAGVPNSVLVLMP 624

Query: 90  F-----------PAVGEANIQATAQALGLDQHRILFSNVAAKEEHVRR-GQLADVCLDTP 137
           F             + + N   T   +  ++  +L  N     + V+   ++AD+ LD+ 
Sbjct: 625 FGPNWSNTYPKKSFIKQLNHHFTQHGIQTERLVVLDPNPTPNRDDVKEYFKIADIYLDSY 684

Query: 138 LCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIART-HKEYQDIAIRLG 196
             +G T+ M+ L  G P+V+  G +  S +  + L +L  P+L+  T  K Y D+A+ LG
Sbjct: 685 PFSGTTSLMEPLQVGLPIVSRRGSSFRSSMGVAILQSLNIPDLVTLTSEKSYIDLAVALG 744

Query: 197 TDRD 200
           T+ +
Sbjct: 745 TNSE 748


>gi|418394627|ref|ZP_12968742.1| TPR domain-containing protein [Burkholderia pseudomallei 354a]
 gi|418554742|ref|ZP_13119513.1| TPR domain-containing protein [Burkholderia pseudomallei 354e]
 gi|385369925|gb|EIF75216.1| TPR domain-containing protein [Burkholderia pseudomallei 354e]
 gi|385374781|gb|EIF79601.1| TPR domain-containing protein [Burkholderia pseudomallei 354a]
          Length = 776

 Score = 73.2 bits (178), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 75/147 (51%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARAR 623

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + 
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA    ++    I L +D
Sbjct: 684 MSHVGLESFIADDIDDFVAKGIALASD 710


>gi|237814265|ref|YP_002898716.1| TPR domain protein [Burkholderia pseudomallei MSHR346]
 gi|237505900|gb|ACQ98218.1| TPR domain protein [Burkholderia pseudomallei MSHR346]
          Length = 776

 Score = 72.8 bits (177), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 74/147 (50%), Gaps = 3/147 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQATAQALGLDQHR 111
           + + +FN+L K+    + +W  +++AVP S + L   P    G A I+   Q  G+ + R
Sbjct: 565 VSFGSFNRLNKLRGDVIAVWARIMRAVPGSRIVLGSIPKDGGGAAMIEWFEQE-GIARER 623

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           + F   +    ++++    DVCLDT    G TT ++ LW G P +T+ GETL SR   + 
Sbjct: 624 LSFQPRSVTAVYLQQHHHVDVCLDTFPYTGSTTVLNALWMGVPTLTMRGETLPSRAGLTW 683

Query: 172 LATLGCPELIARTHKEYQDIAIRLGTD 198
           ++ +G    IA     +    I L +D
Sbjct: 684 MSHVGLESFIADDIDNFVAKGIALASD 710


>gi|427712404|ref|YP_007061028.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Synechococcus sp. PCC 6312]
 gi|427376533|gb|AFY60485.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Synechococcus sp. PCC 6312]
          Length = 738

 Score = 72.8 bits (177), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 54/161 (33%), Positives = 79/161 (49%), Gaps = 9/161 (5%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI----- 98
           R    +P DA++Y       K  P TL+    +L  VPNS   L+K    GE  I     
Sbjct: 539 RADLDIPIDAVIYFTAQNGLKQHPETLKQQCQILAQVPNSYF-LIKL--RGEKGILRELF 595

Query: 99  QATAQALGLDQHRILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
           ++ AQ  G+   R+ F +  + E  H    QLAD+ LD+   NG TT+++ LW   P+VT
Sbjct: 596 ESIAQEEGVSLSRLRFVDFDSDELTHRANLQLADIVLDSYPYNGATTTLETLWLEIPLVT 655

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             G+  A+R + + +   G  E IA +  EY    I+LG D
Sbjct: 656 QVGQQFAARNSYTFMVNAGITEGIAWSAPEYVAWGIKLGQD 696


>gi|117925099|ref|YP_865716.1| hypothetical protein Mmc1_1802 [Magnetococcus marinus MC-1]
 gi|117608855|gb|ABK44310.1| TPR repeat-containing protein [Magnetococcus marinus MC-1]
          Length = 822

 Score = 72.8 bits (177), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 72/151 (47%), Gaps = 5/151 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLDQ 109
           I Y  FN L K+ P TL++W  +LK +PNS L  LK  +V +A  +        ALG+  
Sbjct: 632 ITYGCFNNLAKLSPRTLRLWGEILKRLPNSRL-ALKSASVCDAETREALVKQLAALGVSA 690

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R++F   +    ++   Q  D+ LD    NG  T++  LW G P+VT  G     RV  
Sbjct: 691 QRLIFIQPSDYGRYLEDYQFVDIALDPLPFNGGITTLQGLWQGVPIVTRWGLGQRDRVGG 750

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           S L  L     IA + + Y + A+    D D
Sbjct: 751 SILTDLDLVSWIADSDQAYIECAVAKAADLD 781


>gi|422303122|ref|ZP_16390476.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
 gi|389791933|emb|CCI12279.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
          Length = 719

 Score = 72.8 bits (177), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 46/162 (28%), Positives = 83/162 (51%), Gaps = 8/162 (4%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-----I 98
           R+  G+  DA++Y +     K +P+ +++ + ++K+VPNS   +     V + N      
Sbjct: 519 REDLGINNDAVIYLSSQTAIKRNPAMIRLQMQIIKSVPNSYFLI---QGVADDNSLLDLF 575

Query: 99  QATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
              A  +G++ +RI    +   E +     +ADV LDT   NG TT+++ LW G P+V  
Sbjct: 576 CQIAAEVGVETNRIKMLPLYQTETYRANLAIADVVLDTYPFNGGTTTLETLWMGIPLVVK 635

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            G+  +SR   + +   G  E I+ + +EY    I+LG D++
Sbjct: 636 VGQQWSSRNGYTLMMNAGITEGISWSDEEYVQWGIKLGLDKN 677


>gi|83311709|ref|YP_421973.1| O-linked N-acetylglucosamine transferase [Magnetospirillum
           magneticum AMB-1]
 gi|82946550|dbj|BAE51414.1| Predicted O-linked N-acetylglucosamine transferase
           [Magnetospirillum magneticum AMB-1]
          Length = 639

 Score = 72.8 bits (177), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 49/156 (31%), Positives = 77/156 (49%), Gaps = 8/156 (5%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG----EANIQATAQA 104
           L   ++ + +FN + K++   +  W  +L  VPNS LWL K+P +      + ++    A
Sbjct: 438 LERGSVTFGSFNNIAKLNEQVIAAWARILNRVPNSRLWL-KWPHLALDEVASPLRQAFAA 496

Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDT-PLCNGHTTSMDVLWTGTPVVTLPGETL 163
            G+D  R+     +   + +      D+ LD  P C G  TS + LW G PVVTLPG   
Sbjct: 497 HGVDPGRLELRGNSPPAQLLAEYAEIDIALDPFPYC-GAFTSCEALWMGVPVVTLPGPRP 555

Query: 164 ASRVAASQLATLGCPELIARTH-KEYQDIAIRLGTD 198
            SR   + LA +G  + +AR     Y+D+A+ L  D
Sbjct: 556 FSRQTLALLAAMGLEDELARPDLAAYEDLAVALAAD 591


>gi|452821043|gb|EME28078.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
            SPINDLY-lie protein [Galdieria sulphuraria]
          Length = 1154

 Score = 72.4 bits (176), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 48/153 (31%), Positives = 77/153 (50%), Gaps = 6/153 (3%)

Query: 54   IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQATAQALGLDQH 110
            I + +FN L K   +T+ +W  +L  VP S L L   P   +      +   +A+G+   
Sbjct: 958  ITFGSFNVLAKTQANTIALWSKILHLVPKSRLLLKAKPFASSFARRRFEYVFEAVGITPD 1017

Query: 111  RILFSNVAAK-EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLP--GETLASRV 167
            R+    +  +   H+    L D+CLD     G TT+ + L+ G PVV+L   G+  A  V
Sbjct: 1018 RLDLLPLLPETRNHLETYSLVDICLDPFPYAGTTTTCEALYMGVPVVSLSAAGQNHAHSV 1077

Query: 168  AASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
              + L ++G  EL+A + +EY DIA+ L +D D
Sbjct: 1078 GETLLKSIGHSELVAHSEEEYVDIAVSLASDLD 1110


>gi|423065081|ref|ZP_17053871.1| glycosyl transferase family 2 [Arthrospira platensis C1]
 gi|406714324|gb|EKD09492.1| glycosyl transferase family 2 [Arthrospira platensis C1]
          Length = 2280

 Score = 72.4 bits (176), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 52/172 (30%), Positives = 79/172 (45%), Gaps = 11/172 (6%)

Query: 36   PQ-SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
            PQ + V   R Q+G  +  IV+ +    YKI P   + WV +L  VPNSIL L  F    
Sbjct: 1746 PQPATVKPHRSQWGANDQTIVFMSGANFYKIIPEQGETWVKILAQVPNSILVLYPFAPSW 1805

Query: 95   EAN---------IQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLCNGHTT 144
              N         + AT    G+ + R++          V+   +  D+ LD+   +G  +
Sbjct: 1806 SNNYASSQFIRRMHATLSKYGVAEKRLVLIKTLPSRTDVKACLEQGDIYLDSYPHSGSNS 1865

Query: 145  SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
             +D L  G PVV   G+TL S+  A+ + +LG P+LI      Y  +A+ L 
Sbjct: 1866 LVDPLEVGLPVVVRDGDTLRSKHGAAMMRSLGIPDLITDDESSYISLAVTLA 1917


>gi|378948679|ref|YP_005206167.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Pseudomonas fluorescens F113]
 gi|359758693|gb|AEV60772.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Pseudomonas fluorescens F113]
          Length = 1142

 Score = 72.4 bits (176), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 71/149 (47%), Gaps = 5/149 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLDQ 109
           I Y  FN   K++   L  W  +++A  +S L LLK  A G + ++   QA     G+ +
Sbjct: 432 ITYGCFNNPTKVNDVLLAQWAELMRATADSRL-LLKGGAFGNSELREHVQAFMAAQGIAR 490

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+L       +  +      D+ LD    +G  T+ + L  G PVVTLPG T A R +A
Sbjct: 491 ERVLVEGPVGHKHLLETYNRIDIALDPWPYSGGLTTCEALLMGVPVVTLPGPTFAGRHSA 550

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           + L   G PEL+  +  EY+   I L  D
Sbjct: 551 THLVNAGLPELVVDSWDEYRQRVIGLAGD 579


>gi|209515740|ref|ZP_03264603.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia sp. H160]
 gi|209503767|gb|EEA03760.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia sp. H160]
          Length = 1004

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/146 (26%), Positives = 74/146 (50%), Gaps = 1/146 (0%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE-ANIQATAQALGLDQHRI 112
           + + +FN++ K+    + +W  ++ AVPNS + +   P  G    +    +  G+ + R+
Sbjct: 779 VTFGSFNRMDKLRRDVVALWSRLMHAVPNSRMVIGAMPRDGSLGKLPDWFEEEGIARDRL 838

Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
            F   A+   ++++    D CLD+   +G TT++  LW G P +TLPGET+  R   + +
Sbjct: 839 DFMPRASVPVYLQQHHRVDFCLDSFPFSGLTTALHSLWMGVPTLTLPGETVPGRSGLTAM 898

Query: 173 ATLGCPELIARTHKEYQDIAIRLGTD 198
             +G    +AR   ++    + L +D
Sbjct: 899 THVGLANFVARDKDDFVRKGVALASD 924


>gi|113478271|ref|YP_724332.1| hypothetical protein Tery_4945 [Trichodesmium erythraeum IMS101]
 gi|110169319|gb|ABG53859.1| TPR repeat [Trichodesmium erythraeum IMS101]
          Length = 750

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 55/174 (31%), Positives = 87/174 (50%), Gaps = 9/174 (5%)

Query: 31  TGEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF 90
           +G E  ++   T R+   +  D IVY       K +   +   V +LK VPNSIL     
Sbjct: 512 SGFERIETSKTTLRKSQRIGLDQIVYLCLASGRKFNRDLVNGQVAILKQVPNSIL---VH 568

Query: 91  PAVGEANIQATA-----QALGLDQHRILFSNV-AAKEEHVRRGQLADVCLDTPLCNGHTT 144
            A+G+  +  +A     +A G+  HR+ F  + + +EEH     LAD+ LD+   NG T 
Sbjct: 569 KALGDIEVFQSAYHEACKAEGVSIHRVKFMPIFSTEEEHRSIYSLADILLDSYPYNGGTH 628

Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +++ LW   PVVT  GE   SR+  S L +L     ++ + +EY +  ++LG D
Sbjct: 629 TLEALWFNLPVVTYAGEQFLSRMGYSFLESLDIKLGVSWSWEEYINWGVKLGQD 682


>gi|376006070|ref|ZP_09783407.1| putative glycosyl transferase [Arthrospira sp. PCC 8005]
 gi|375325526|emb|CCE19160.1| putative glycosyl transferase [Arthrospira sp. PCC 8005]
          Length = 2557

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/172 (30%), Positives = 79/172 (45%), Gaps = 11/172 (6%)

Query: 36   PQ-SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
            PQ + V   R Q+G  +  IV+ +    YKI P   + WV +L  VPNSIL L  F    
Sbjct: 2023 PQPATVKPHRSQWGANDQTIVFMSGANFYKIIPEQGETWVKILAQVPNSILVLYPFAPSW 2082

Query: 95   EAN---------IQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLCNGHTT 144
              N         + AT    G+ + R++          V+   +  D+ LD+   +G  +
Sbjct: 2083 SNNYASSQFIRRMHATLSKYGVAEKRLVLIKTLPSRTDVKACLEQGDIYLDSYPHSGSNS 2142

Query: 145  SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
             +D L  G PVV   G+TL S+  A+ + +LG P+LI      Y  +A+ L 
Sbjct: 2143 LVDPLEVGLPVVVRDGDTLRSKHGAAMMRSLGIPDLITDDESSYISLAVTLA 2194


>gi|313897099|ref|ZP_07830646.1| tetratricopeptide repeat family protein [Selenomonas sp. oral taxon
           137 str. F0430]
 gi|312974546|gb|EFR40014.1| tetratricopeptide repeat family protein [Selenomonas sp. oral taxon
           137 str. F0430]
          Length = 565

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 54/149 (36%), Positives = 75/149 (50%), Gaps = 7/149 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK---FPAV-GEANIQATAQALGLDQ 109
           +V+ + N   KI+   L++W  +L+ VP + L LLK   F    G A  +    A GL  
Sbjct: 363 VVFGSLNNFTKINDRVLKVWAEILRRVPTARL-LLKTEIFSVTDGAAEARRRIAAAGLPL 421

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+      A  +++      D+ LDT    G  T+ D L+ G PVVTL GETL SR  A
Sbjct: 422 ARV--ETEGASADYLAAYGRIDIALDTFPYPGGGTTCDALYMGVPVVTLAGETLGSRFGA 479

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  +G   LIA T +EY + A+ L  D
Sbjct: 480 SLLRNIGADALIAHTAEEYIERAVFLAQD 508


>gi|16124374|ref|NP_418938.1| hypothetical protein CC_0119 [Caulobacter crescentus CB15]
 gi|221233057|ref|YP_002515493.1| porphyrin biosynthesis protein [Caulobacter crescentus NA1000]
 gi|13421226|gb|AAK22106.1| TPR domain protein [Caulobacter crescentus CB15]
 gi|220962229|gb|ACL93585.1| porphyrin biosynthesis protein [Caulobacter crescentus NA1000]
          Length = 747

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 45/150 (30%), Positives = 71/150 (47%), Gaps = 3/150 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE---ANIQATAQALGLDQH 110
           + +     L K++   + +W  +L+AVP S L L   P V     A+IQA     G+   
Sbjct: 547 MTFGCLGNLSKMNGEVVALWSQILRAVPESKLLLKAKPFVDAQVVADIQARFAREGVSSD 606

Query: 111 RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS 170
           R++    +++ E+       D+ LDT    G TTS+D LW G PV+TL G+   +R+  S
Sbjct: 607 RLILEGPSSRAEYFETYNRIDLVLDTFPYPGGTTSVDALWMGVPVLTLAGDRFLARLGES 666

Query: 171 QLATLGCPELIARTHKEYQDIAIRLGTDRD 200
                G    +AR   +Y   A+   + RD
Sbjct: 667 IARNAGQSAWVARDRADYLAKAVAFASVRD 696


>gi|209525307|ref|ZP_03273849.1| glycosyl transferase family 2 [Arthrospira maxima CS-328]
 gi|209494322|gb|EDZ94635.1| glycosyl transferase family 2 [Arthrospira maxima CS-328]
          Length = 2681

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/172 (30%), Positives = 82/172 (47%), Gaps = 11/172 (6%)

Query: 36   PQ-SIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG 94
            PQ + V   R Q+G  +  IV+ +    YKI P   + WV +L  VPNSIL L  F    
Sbjct: 2147 PQPATVKPHRSQWGANDQTIVFMSGANFYKIIPEQGETWVKILAQVPNSILVLYPFAPSW 2206

Query: 95   EAN---------IQATAQALGLDQHR-ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTT 144
              N         + AT    G+ + R +L  ++ ++ +     +  D+ LD+   +G  +
Sbjct: 2207 SNNYASSQFIRRMHATLSKYGVAEKRLVLIKSLPSRTDVKACLEQGDIYLDSYPHSGSNS 2266

Query: 145  SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
             +D L  G PVV   G+TL S+  A+ + +LG P+LI      Y  +A+ L 
Sbjct: 2267 LVDPLEVGLPVVVRDGDTLRSKHGAAMMRSLGIPDLITDDESSYISLAVTLA 2318


>gi|422343491|ref|ZP_16424419.1| hypothetical protein HMPREF9432_00479 [Selenomonas noxia F0398]
 gi|355378798|gb|EHG25978.1| hypothetical protein HMPREF9432_00479 [Selenomonas noxia F0398]
          Length = 497

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 50/154 (32%), Positives = 75/154 (48%), Gaps = 4/154 (2%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA--NIQATAQALG 106
           +P D+ V+ +FN   KI    LQ+W  +L+AVP + L L       E    +Q  A+  G
Sbjct: 302 VPHDSPVFASFNNFAKITDEMLQIWGGILRAVPGARLLLKNVHPQRETLHRMQRRAEQAG 361

Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
           +D  R+        ++++R    AD+ LDT    G  T+ + L  G PVVTL G    +R
Sbjct: 362 IDPVRLELR--PGSKDYLRDYLDADIILDTYPYQGGGTTCEALCMGLPVVTLAGTRHGAR 419

Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
              S L   G  EL A + + Y + A+ L  DR+
Sbjct: 420 FGVSLLQNAGLGELAADSVQAYAERAVLLAQDRE 453


>gi|428221793|ref|YP_007105963.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Synechococcus sp. PCC 7502]
 gi|427995133|gb|AFY73828.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Synechococcus sp. PCC 7502]
          Length = 773

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 50/175 (28%), Positives = 87/175 (49%), Gaps = 13/175 (7%)

Query: 34  EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV 93
           E PQ ++   R Q+ + EDAIVY     LYK  P    ++  + K VP +    + +   
Sbjct: 561 ESPQYLL--ERSQFQISEDAIVYLCCQSLYKYLPKYDYIFAAIAKRVPKAKFAFI-YSNN 617

Query: 94  GEANIQATAQALGL---------DQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTT 144
           G+   +   Q L +         ++H ++   +  ++E++   ++AD+ LDT   +G  T
Sbjct: 618 GDYVTRQFKQRLDIAFARQNQNSEEHCLIMPRLN-RDEYLSLNRIADIFLDTIDWSGGNT 676

Query: 145 SMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           +++ +    PVVTLP + +  R + + L  L   E IA +  EY DIA++LG D 
Sbjct: 677 TLEAISCNLPVVTLPTQFMRGRHSYAMLQMLEVTETIASSESEYIDIAVKLGLDH 731


>gi|372273166|ref|ZP_09509214.1| sulfotransferase [Marinobacterium stanieri S30]
          Length = 1017

 Score = 72.0 bits (175), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/154 (28%), Positives = 73/154 (47%), Gaps = 3/154 (1%)

Query: 48  GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQ---A 104
            L  + I +  FN   KI+   L  W +++   PNS L L  F       ++   Q   A
Sbjct: 424 ALNNEYITFGCFNNPTKINKVILSCWADIMHLTPNSRLLLKSFQFNSPELVKRITQEMEA 483

Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
           +G+D+ R++    +  +E +R     D+ LDT   +G  T+ + +  G PVVT  G T A
Sbjct: 484 MGIDKDRLILEGPSPHQELLRSYNRVDIALDTWPYSGGLTTCEAMLMGVPVVTCTGPTFA 543

Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            R +AS +   G  EL+  +  +Y+   + L +D
Sbjct: 544 GRHSASHIKNAGMYELVTDSFSQYKTRVLELTSD 577


>gi|430759455|ref|YP_007215312.1| TPR domain protein, putative component of TonB system
           [Thioalkalivibrio nitratireducens DSM 14787]
 gi|430009079|gb|AGA31831.1| TPR domain protein, putative component of TonB system
           [Thioalkalivibrio nitratireducens DSM 14787]
          Length = 739

 Score = 72.0 bits (175), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 50/152 (32%), Positives = 79/152 (51%), Gaps = 8/152 (5%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN----IQATAQALGLDQ 109
           + +  FN   KI    + +W  +L AVP+S L L+K   +  +     + +  +A GL  
Sbjct: 358 VTFGTFNNYAKISREVIAVWARILAAVPDSRL-LIKSTGLDSSETRTRLMSLFEAAGLSN 416

Query: 110 --HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
              RI L + V + EEH+R     D+ LDT   +G TT+   LW G PV+TL G +   R
Sbjct: 417 PGARIDLMNRVRSTEEHLRIYDRVDLALDTFPYSGTTTTCQALWMGVPVLTLYGRSHVGR 476

Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           V+AS L  +   +L+A + ++Y   A +LG +
Sbjct: 477 VSASVLRQMALDDLVAGSEEDYIARAGQLGRE 508


>gi|22298553|ref|NP_681800.1| hypothetical protein tll1010 [Thermosynechococcus elongatus BP-1]
 gi|22294733|dbj|BAC08562.1| tll1010 [Thermosynechococcus elongatus BP-1]
          Length = 1126

 Score = 72.0 bits (175), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 48/158 (30%), Positives = 77/158 (48%), Gaps = 4/158 (2%)

Query: 44   RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF---PAVGEANIQA 100
            R Q GL  +A+V+    +  KI P  L++   +++AVPN++L ++K+     + E   ++
Sbjct: 914  RDQLGLSAEAVVFLCAQRTSKIQPEILKLQFAIVEAVPNAVL-VVKYHRGSPIFEEWCRS 972

Query: 101  TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             A+  G D  ++ F      E H       DV LDT    G   S++ LW   P+VT  G
Sbjct: 973  VAEVEGFDLDKLYFLTPNLPEIHRANLYDVDVVLDTYPYAGGAMSLEALWLEVPIVTKVG 1032

Query: 161  ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            +   +R   + L  +G  E IA   +EY    IR GT+
Sbjct: 1033 QQFVARHTYTFLKNVGVEEGIAFNDEEYVHWGIRFGTE 1070


>gi|75906573|ref|YP_320869.1| hypothetical protein Ava_0348 [Anabaena variabilis ATCC 29413]
 gi|75700298|gb|ABA19974.1| TPR repeat protein [Anabaena variabilis ATCC 29413]
          Length = 563

 Score = 72.0 bits (175), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 45/161 (27%), Positives = 82/161 (50%), Gaps = 2/161 (1%)

Query: 39  IVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI 98
           +V+ +R  +GL +D I+Y      +K  P    ++  +   +P +    L+   + E  +
Sbjct: 364 LVLKTRADFGLSDDDILYLCCQAPFKYLPQYDFIFAEIASRIPQAKFIFLRGTLLQE-RL 422

Query: 99  QATAQALGLD-QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
           Q     LGL  +   +F N+  + +++    L+D+ LDT   +G  T+++ +    P+VT
Sbjct: 423 QKAFGNLGLKFEDYCVFLNIPERLDYLMINLLSDIYLDTFTWSGGNTTLEAIACNLPIVT 482

Query: 158 LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            PGE +  R + S L  LG  + IA+   EY DIA++LG +
Sbjct: 483 CPGEFMRGRHSDSFLKMLGVTDTIAQNEGEYIDIAVKLGQN 523


>gi|440791074|gb|ELR12328.1| SPY protein [Acanthamoeba castellanii str. Neff]
          Length = 378

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 52/176 (29%), Positives = 83/176 (47%), Gaps = 18/176 (10%)

Query: 35  VPQSIVITSRQQYGLP--EDA-IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
           +P  + +  R+  GLP  ED   ++ NF + YK+D      W+ +L    N++L LL   
Sbjct: 182 IPFELQLPDRKALGLPPKEDGTFIFFNFAEPYKVDRQVFATWMQILNVTQNTVL-LLPSG 240

Query: 92  AVGEAN--IQATAQALGLDQHRILFSNV---------AAKEEHVRRGQLADVCLDT-PLC 139
           A  E N  ++    A G+D  R++F+ +              H+      D+ LDT P  
Sbjct: 241 ATNERNQNMKKYMSARGIDPKRLIFTTINGVPWQLTSMTDTRHIL--SFVDLYLDTFPFG 298

Query: 140 NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRL 195
               + ++ L+   PV+ L G + +  V  S L TLG PELIA++  +Y   A RL
Sbjct: 299 TDVLSLVETLYHEVPVLALQGRSFSEMVGTSFLKTLGLPELIAQSKTKYVATAARL 354


>gi|428316233|ref|YP_007114115.1| hypothetical protein Osc7112_1150 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428239913|gb|AFZ05699.1| hypothetical protein Osc7112_1150 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 899

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/167 (29%), Positives = 82/167 (49%), Gaps = 14/167 (8%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF-----------PA 92
           R+  G+ + A+V+ +    +KI P   + W  ++ AVPNS+L L  F           P 
Sbjct: 691 RKSLGISDTAVVFISGANTFKIIPELRETWAKIIAAVPNSVLVLYPFGNTWSGDYVKQPF 750

Query: 93  VGEANIQATAQALGLDQHRILFSNVAAKEEHVRRG-QLADVCLDTPLCNGHTTSMDVLWT 151
           + +  + A      +D+ R++  N  A  E V+   QLADV LD+    G  +++D L  
Sbjct: 751 INK--MSAIFDKYSIDRTRLILLNTLANREDVKALLQLADVYLDSYPYAGANSTVDPLEV 808

Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           G P V   G  L SR  A+ L  +   +LIA + + Y ++++ LG +
Sbjct: 809 GLPTVVREGNNLRSRQGAAILRDIQLFDLIADSEESYINLSVALGIN 855


>gi|219113085|ref|XP_002186126.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209582976|gb|ACI65596.1| UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase
           [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 812

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 60/190 (31%), Positives = 88/190 (46%), Gaps = 25/190 (13%)

Query: 32  GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
           G   PQ     +    G  + A VYCNFN+  K +P T++ W+ +L+ VP+SIL LL  P
Sbjct: 534 GTGSPQENRCLAPPDVGPTDVAFVYCNFNKFLKNNPETVRGWIQILRQVPDSILCLLDNP 593

Query: 92  AVG---------------EANIQATAQALGLDQ--HRILFSNVAAKE-EHVRRGQLADVC 133
             G               + N   + Q    D   +R+ F        +H +R +  D C
Sbjct: 594 RDGIPYLHKFIHEAAGTSDGNSPDSFQPGDGDDLVNRVHFLPWEPNPFDHQQRNR--DFC 651

Query: 134 ---LDTPLCNGHTTSMDVLWTGTPVVTLP-GETLASRVAASQLATLGCPELIA-RTHKEY 188
              LD+   NGHT + D L+ G P+VT   G+ +++RV  S    LG   L A     +Y
Sbjct: 652 NAMLDSHPYNGHTVAQDALYAGVPIVTRSDGDDMSARVTTSANLVLGLSHLNAVHGPAQY 711

Query: 189 QDIAIRLGTD 198
             IA+ LGT+
Sbjct: 712 VAIAVALGTN 721


>gi|91785677|ref|YP_560883.1| TPR domain-containing protein [Burkholderia xenovorans LB400]
 gi|91689631|gb|ABE32831.1| putative TPR domain protein [Burkholderia xenovorans LB400]
          Length = 790

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 41/152 (26%), Positives = 77/152 (50%), Gaps = 13/152 (8%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL-------G 106
           I + +FN+L K+    + +W  +L+A P + + L      G        Q L       G
Sbjct: 571 ITFGSFNRLNKLRRDVIALWAELLRAQPTARMLL------GAIATDEDEQLLIDWFAGEG 624

Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
           +++ R++F   ++   ++++    D+CLDT    G TT ++ LW G P +T+ G+TLASR
Sbjct: 625 IERGRLMFRRRSSIPVYLQQHFHVDICLDTFPYTGSTTVLNSLWMGVPTLTIAGDTLASR 684

Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
              + ++ +G  E +A   +E+    + L +D
Sbjct: 685 AGTTWMSHVGLEEFVAANKEEFVARGVALASD 716


>gi|407715135|ref|YP_006835700.1| hypothetical protein BUPH_03787 [Burkholderia phenoliruptrix
           BR3459a]
 gi|407237319|gb|AFT87518.1| TPR domain-containing protein [Burkholderia phenoliruptrix BR3459a]
          Length = 787

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 39/146 (26%), Positives = 76/146 (52%), Gaps = 1/146 (0%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA-VGEANIQATAQALGLDQHRI 112
           I + +FN+L K+    + +W  +L+A+P++ + L        E  +     A G+D+ R+
Sbjct: 568 ITFGSFNRLNKLRRDVIALWAELLRALPDAKMLLGAIATDQDEQMLIDWFTAEGIDRERL 627

Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
            F   ++   ++++    D+CLDT    G TT ++ LW G P +T+ G TLASR   + +
Sbjct: 628 TFRRRSSIPVYLQQHFHVDICLDTFPYTGSTTVLNSLWMGVPTLTIAGSTLASRAGTTWM 687

Query: 173 ATLGCPELIARTHKEYQDIAIRLGTD 198
           + +G  + +    +E+    + L +D
Sbjct: 688 SHVGLEQFVVANKEEFVARGVALASD 713


>gi|288962056|ref|YP_003452366.1| tetratricopeptide TPR_2 repeat protein [Azospirillum sp. B510]
 gi|288914336|dbj|BAI75822.1| tetratricopeptide TPR_2 repeat protein [Azospirillum sp. B510]
          Length = 721

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 45/144 (31%), Positives = 73/144 (50%), Gaps = 5/144 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA----VGEANIQATAQALGLDQ 109
           + + +FN L K++   L  W  +L  +P++ L L+K  A    V  A  +    A G+ +
Sbjct: 533 VTFGSFNILTKLNDDVLAAWAAILNRMPDARL-LMKTKALSCPVTAALWRCRLTAAGIAE 591

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+         +H+R     DV LD    +G TT+++ LW G PV+TLPGET +SR + 
Sbjct: 592 DRVTMVGATNSLDHMRWCASVDVALDPFPFSGSTTTLETLWMGVPVITLPGETFSSRHSL 651

Query: 170 SQLATLGCPELIARTHKEYQDIAI 193
           + L   G  + +A    +Y D A+
Sbjct: 652 AFLTVAGVADCVATDPADYVDRAV 675


>gi|119487585|ref|ZP_01621195.1| TPR repeat protein [Lyngbya sp. PCC 8106]
 gi|119455754|gb|EAW36890.1| TPR repeat protein [Lyngbya sp. PCC 8106]
          Length = 1107

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 48/163 (29%), Positives = 84/163 (51%), Gaps = 13/163 (7%)

Query: 44   RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN------ 97
            R+Q  + +D +VY       K +   ++  V +LKAVP+S++        G+ +      
Sbjct: 905  RKQLKIDQDKLVYLCVAPGRKTNFEMIEAQVKILKAVPDSLM-----IRKGQGDHLLIRE 959

Query: 98   -IQATAQALGLDQHRILFSNVAA-KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
                + +  G+D +RILF  +   +EEH    Q+ DV LD+   NG T +++ LW+  P+
Sbjct: 960  MYHQSCEEQGVDFNRILFIGLTKTEEEHRAIYQVVDVLLDSYPYNGGTHNLEALWSNLPI 1019

Query: 156  VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            +T  GE   SR+  + L  +     +A + +EY ++ I+LGTD
Sbjct: 1020 LTRAGEQYLSRMGYAFLQAVNLDVGVAWSWEEYTELGIKLGTD 1062


>gi|94314154|ref|YP_587363.1| hypothetical protein Rmet_5235 [Cupriavidus metallidurans CH34]
 gi|93358006|gb|ABF12094.1| conserved hypothetical protein; TPR domain protein [Cupriavidus
           metallidurans CH34]
          Length = 797

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 45/150 (30%), Positives = 74/150 (49%), Gaps = 5/150 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL---GLDQH 110
           I Y +FN+L K+    + +W  +L A P++ + +      G+A+ Q         G+D  
Sbjct: 579 ITYGSFNRLNKVSRQVVALWARILHADPSARMIIGAIE--GDADQQTCLTRFADEGIDAG 636

Query: 111 RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAAS 170
           R+ F        ++++    D+CLDT    G TT+++ LW G P VT+ G +  SR +AS
Sbjct: 637 RLSFRTRTETPVYLQQHHHVDICLDTFPYTGATTTINALWMGVPTVTIAGNSPLSRGSAS 696

Query: 171 QLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            L  LG  + IA    ++   A+ L  D D
Sbjct: 697 WLGQLGLHQYIANDADDFVQRALALSKDLD 726


>gi|377819539|ref|YP_004975910.1| hypothetical protein BYI23_A000950 [Burkholderia sp. YI23]
 gi|357934374|gb|AET87933.1| TPR repeat-containing protein [Burkholderia sp. YI23]
          Length = 767

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 36/149 (24%), Positives = 77/149 (51%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG----EANIQATAQALGLDQ 109
           + + +FN++ K+ P  + +W  +L A+P+S + +      G    EA+        G+ +
Sbjct: 561 VTFGSFNRINKVRPEVIALWAKILHAMPSSKMLIGAIRREGDRSDEAHYADQFAKQGISR 620

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R++F   +A   ++++    D+CLD    +  TT+++ LW G P +T+ G +L SR A 
Sbjct: 621 DRLMFVPRSATAVYLQQHHQVDLCLDAFPFSASTTTLNALWMGVPTLTMAGNSLVSRGAT 680

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           + L+ +G    + +   ++   A+ + +D
Sbjct: 681 TWLSHVGLESFVTKDKDDFVQSALSIVSD 709


>gi|302878311|ref|YP_003846875.1| hypothetical protein Galf_1081 [Gallionella capsiferriformans ES-2]
 gi|302581100|gb|ADL55111.1| Tetratricopeptide TPR_1 repeat-containing protein [Gallionella
           capsiferriformans ES-2]
          Length = 961

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/154 (29%), Positives = 73/154 (47%), Gaps = 3/154 (1%)

Query: 48  GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP---AVGEANIQATAQA 104
            L +  I +  F++L K+   T Q W +V+ +VP+S + +       AV  A +    Q 
Sbjct: 759 ALTKKTITFGTFSRLEKVTEQTWQTWADVVLSVPDSCMLIKNAEMDHAVARARVAGYFQR 818

Query: 105 LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
            G+   R++F    A  +H+      D+CLDT    G  T+++ L  G PV+TL   T  
Sbjct: 819 AGVALDRLIFHGRTAWNDHMAVFNQVDICLDTFPQGGGVTTLEGLMMGVPVITLHSPTFV 878

Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            R   S L  LG  + +A T ++Y  IA +   D
Sbjct: 879 GRTGVSILTALGLVDWVAETPEQYVKIAKQKAQD 912


>gi|357059078|ref|ZP_09119923.1| hypothetical protein HMPREF9334_01640 [Selenomonas infelix ATCC
           43532]
 gi|355372986|gb|EHG20324.1| hypothetical protein HMPREF9334_01640 [Selenomonas infelix ATCC
           43532]
          Length = 906

 Score = 71.2 bits (173), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 50/151 (33%), Positives = 76/151 (50%), Gaps = 7/151 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA----LGLDQ 109
           I + +FNQ  K+    L  W  +++ VPN+ L LLK  A G   + A  Q     LG D 
Sbjct: 713 IQFASFNQHAKLRDDMLLAWREIMERVPNAWL-LLKNKAYGRRGVVAALQERLRRLGFDM 771

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           +R++F    A  ++++R    D+ LDT    G  TS D L+ G PVV+   E  ++R   
Sbjct: 772 NRVMFE--VATRDYMQRYLDVDIALDTFPWPGGGTSCDALYMGVPVVSYYTERHSTRFTY 829

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           S LA +G  +L +   ++Y   A+ L  D D
Sbjct: 830 SLLANMGLADLASTKLEDYIATAVALAGDID 860


>gi|425444417|ref|ZP_18824468.1| TPR repeat protein [Microcystis aeruginosa PCC 9443]
 gi|389735843|emb|CCI00716.1| TPR repeat protein [Microcystis aeruginosa PCC 9443]
          Length = 873

 Score = 71.2 bits (173), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 51/163 (31%), Positives = 79/163 (48%), Gaps = 8/163 (4%)

Query: 43  SRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNS--ILWLLKFPAVGEANIQA 100
           +R + GL ED I+Y N   L+K  P    ++  + + VPNS  I    +   V       
Sbjct: 672 TRLEMGLTEDKIIYLNCQSLFKYLPENDDIFPRIAQQVPNSQFIFICHRSEFVTHCFQSR 731

Query: 101 TAQA-----LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
            +QA     L    +R++   +  ++++ R   LAD+ LD    +G  T+++ +    PV
Sbjct: 732 LSQAFNKYGLNWQDYRVMMPQLE-QDDYFRLNLLADIYLDNLSWSGGNTTLEAIACQLPV 790

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           VT PGE +  R + + L  LG  E IA     Y +IAIRLG D
Sbjct: 791 VTCPGEFMRGRHSYAILKKLGITETIATDKNHYIEIAIRLGLD 833


>gi|357059077|ref|ZP_09119922.1| hypothetical protein HMPREF9334_01639 [Selenomonas infelix ATCC
           43532]
 gi|355372985|gb|EHG20323.1| hypothetical protein HMPREF9334_01639 [Selenomonas infelix ATCC
           43532]
          Length = 488

 Score = 71.2 bits (173), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 47/154 (30%), Positives = 72/154 (46%), Gaps = 4/154 (2%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA--NIQATAQALG 106
           +P D  V+ +FN   KI    L++W  +L AVP + L L       EA   ++  A+  G
Sbjct: 293 VPHDPPVFGSFNNFSKITDEMLRLWGEILAAVPRARLLLKNVHPSREALCRMRTRAERAG 352

Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
           +D  R+         E++R     D+ LDT    G  T+ + L+ G PVVT+ G    +R
Sbjct: 353 IDMERLDLR--PGTREYLRDYLEVDIILDTYPYQGGGTTCEALFMGLPVVTMAGTRHGAR 410

Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             A  L  +G  EL+      Y   A+ L +DR+
Sbjct: 411 FGAGLLHNVGLAELVTEDSAAYTACAVGLASDRE 444


>gi|168066691|ref|XP_001785267.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663137|gb|EDQ49919.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 910

 Score = 71.2 bits (173), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 48/150 (32%), Positives = 79/150 (52%), Gaps = 6/150 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP----AVGEANIQATAQALGLDQ 109
           + + +FN L KI P  L++W  +L  +P++ L +   P    +V EA + A  + LG++ 
Sbjct: 635 VTFGSFNNLAKITPRVLRVWARILIKMPSARLVVKCKPFCCDSVREAFL-AKLEELGVES 693

Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
            RI L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G+  A  V 
Sbjct: 694 LRIDLLPLILLNHDHMQAYCLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGKVHAHNVG 753

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            + L  +G   L+ +T  EY D AI L ++
Sbjct: 754 VTLLHQVGLENLVVKTEDEYVDKAIELASN 783


>gi|86606504|ref|YP_475267.1| hypothetical protein CYA_1854 [Synechococcus sp. JA-3-3Ab]
 gi|86555046|gb|ABD00004.1| TPR repeat protein [Synechococcus sp. JA-3-3Ab]
          Length = 749

 Score = 71.2 bits (173), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 51/163 (31%), Positives = 83/163 (50%), Gaps = 10/163 (6%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP--AVGEANIQAT 101
           RQ +GLP++A+VY     L K  P    ++  + + VP +    L  P  A+G+   Q  
Sbjct: 539 RQDFGLPQEAVVYLTCQYLGKYLPQYDYLFPAIARQVPRAQFVFLALPNAAIGQRFWQRL 598

Query: 102 AQ-----ALGLDQH-RILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPV 155
           A       L ++ H RIL       ++++   + ADV LD+   +G  T+++ +  G PV
Sbjct: 599 ASIFSSYGLAVEDHCRIL--PALDHQDYLDLNRCADVMLDSYGWSGGITTLEAIAAGLPV 656

Query: 156 VTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           VT PG  L  R   + L  +G   LIA   + Y+++A++LGTD
Sbjct: 657 VTCPGRFLRGRHTYAILKRMGLELLIAPDLQRYEELAVQLGTD 699


>gi|428306904|ref|YP_007143729.1| hypothetical protein Cri9333_3391 [Crinalium epipsammum PCC 9333]
 gi|428248439|gb|AFZ14219.1| Tetratricopeptide TPR_1 repeat-containing protein [Crinalium
           epipsammum PCC 9333]
          Length = 1016

 Score = 70.9 bits (172), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 53/184 (28%), Positives = 87/184 (47%), Gaps = 28/184 (15%)

Query: 36  PQSIVITS---RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA 92
           PQS++ TS   R  + L +DA+VY +   L+K  P    ++  + K VP S    +K   
Sbjct: 800 PQSVLPTSLKKRADFSLEDDALVYLSCQSLFKYLPQHDYIFAAIAKCVPQSQFVFIK--- 856

Query: 93  VGEANIQATAQALGLDQHRIL---FSNVAAKEE-------------HVRRGQLADVCLDT 136
                    ++A+    HR L   F++   K E             ++    ++DV LDT
Sbjct: 857 ------SDISEAITRKLHRRLQKAFASFGLKSEEYCVMVPPLNQVDYLNLNLVSDVFLDT 910

Query: 137 PLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLG 196
              +G  T+++ +    P+VT PGE +  R +   L  LG  E IA +  EY +IA+RLG
Sbjct: 911 LGWSGGNTTLEAISCNLPIVTCPGEFMRGRHSYGILQMLGVTETIASSEAEYIEIAVRLG 970

Query: 197 TDRD 200
            +++
Sbjct: 971 LEQE 974


>gi|288549794|ref|ZP_05968179.2| sulfotransferase [Enterobacter cancerogenus ATCC 35316]
 gi|288317412|gb|EFC56350.1| sulfotransferase [Enterobacter cancerogenus ATCC 35316]
          Length = 1390

 Score = 70.9 bits (172), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 3/143 (2%)

Query: 59  FNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE---ANIQATAQALGLDQHRILFS 115
           FN   KI+   L  W  +L +VP+S L+L  F    E     +  T +  G+ + R+   
Sbjct: 416 FNNASKINDFLLNQWAVLLHSVPDSRLFLKSFNFDNENLKERVLTTLEGHGIARERVRIE 475

Query: 116 NVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATL 175
            +A  +E +      D+ LD    +G  T+ + L  G PVVTLPG T A R +AS L   
Sbjct: 476 GMAPHQELLACYNEVDIALDPWPYSGGLTTCEALAMGVPVVTLPGPTFAGRHSASHLVNA 535

Query: 176 GCPELIARTHKEYQDIAIRLGTD 198
           G  EL+A   + + DI + L  D
Sbjct: 536 GLQELVASDWQNFIDITVGLTQD 558


>gi|440753447|ref|ZP_20932650.1| methyltransferase, FkbM family domain protein [Microcystis
           aeruginosa TAIHU98]
 gi|440177940|gb|ELP57213.1| methyltransferase, FkbM family domain protein [Microcystis
           aeruginosa TAIHU98]
          Length = 1246

 Score = 70.9 bits (172), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/158 (31%), Positives = 76/158 (48%), Gaps = 8/158 (5%)

Query: 48  GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL------LKFPAVGEA-NIQA 100
            L  + I +   N   K++P  L +W  +L+A+P S L L      L  P   ++  +  
Sbjct: 505 ALSSEYITFGCLNNFSKLNPFLLSLWAKILQALPQSRLILKSHYHNLDDPEEKQSVELFL 564

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             Q L L+Q  ++ S   A E++       D+ LDT   NG TT+ D LW G PV+TL G
Sbjct: 565 QEQGLNLEQVELIDSPTLA-EDYFALYHRIDIHLDTFPYNGCTTTCDALWMGVPVLTLAG 623

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +    R+  S L  +G  + IA + +EY + AI    D
Sbjct: 624 DRKIQRMGNSLLQAIGLGDWIAHSPEEYVNKAITFAQD 661


>gi|395010255|ref|ZP_10393652.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Acidovorax sp. CF316]
 gi|394311671|gb|EJE48985.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Acidovorax sp. CF316]
          Length = 726

 Score = 70.5 bits (171), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 74/149 (49%), Gaps = 6/149 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQ 109
           I + + N L K+    L +W  +L+AVP S L +    L  P    A  Q   + LGLD 
Sbjct: 401 ITFGSCNNLGKLTDEVLALWGQILEAVPGSRLLIEGKNLDQPDFANAYRQ-RCRGLGLDP 459

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+    +    +++   ++ D+ LD     G TT+ DVLW G P+V++ G +  SR+  
Sbjct: 460 DRLELVALNGDNQYLTYHRI-DIALDPFPLTGGTTTFDVLWMGLPIVSMVGSSFKSRMGV 518

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
             L+ LG  E +A T ++Y  IA  L  D
Sbjct: 519 GLLSYLGRTEWLAETSEDYVRIASGLAAD 547


>gi|159028890|emb|CAO90695.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
          Length = 1261

 Score = 70.5 bits (171), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/158 (31%), Positives = 76/158 (48%), Gaps = 8/158 (5%)

Query: 48  GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL------LKFPAVGEA-NIQA 100
            L  + I +   N   K++P  L +W  +L+A+P S L L      L  P   ++  +  
Sbjct: 520 ALSSEYITFGCLNNFSKLNPFLLSLWAKILQALPQSRLILKSHYHNLDDPEEKQSVELFL 579

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             Q L L+Q  ++ S   A E++       D+ LDT   NG TT+ D LW G PV+TL G
Sbjct: 580 QEQGLNLEQVELIDSPTLA-EDYFALYHRIDIHLDTFPYNGCTTTCDALWMGVPVLTLAG 638

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +    R+  S L  +G  + IA + +EY + AI    D
Sbjct: 639 DRKIQRMGNSLLQAIGLGDWIAHSPEEYVNKAITFAQD 676


>gi|376007446|ref|ZP_09784641.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375324082|emb|CCE20394.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 1103

 Score = 70.5 bits (171), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 83/161 (51%), Gaps = 9/161 (5%)

Query: 44   RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI----- 98
            RQQ  +P +A+ Y       K +   ++  VN+L++VPNS+L        G+A +     
Sbjct: 901  RQQLNIPPNAVAYLCVAPGRKTNGEMIKAQVNILRSVPNSVLIR---KGQGDAQLLREMY 957

Query: 99   QATAQALGLDQHRILFSNVAA-KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
                + LG+D +R++F  +   +EEH    ++ADV LD+   NG T +++ LW+  PVVT
Sbjct: 958  NQACEDLGVDINRLIFLGLTKTEEEHRAIYKVADVMLDSYPYNGGTHNLEALWSELPVVT 1017

Query: 158  LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
              G    SR+  + L ++     +A + +EY  + I  G +
Sbjct: 1018 RSGRQYLSRMGYAFLKSVNLDIGVAWSWEEYTQLGIEFGQN 1058


>gi|209522682|ref|ZP_03271240.1| TPR repeat-containing protein [Arthrospira maxima CS-328]
 gi|209496731|gb|EDZ97028.1| TPR repeat-containing protein [Arthrospira maxima CS-328]
          Length = 1103

 Score = 70.5 bits (171), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 83/161 (51%), Gaps = 9/161 (5%)

Query: 44   RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI----- 98
            RQQ  +P +A+ Y       K +   ++  VN+L++VPNS+L        G+A +     
Sbjct: 901  RQQLNIPPNAVAYLCVAPGRKTNGEMIKAQVNILRSVPNSVLIR---KGQGDAQLLREMY 957

Query: 99   QATAQALGLDQHRILFSNVAA-KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
                + LG+D +R++F  +   +EEH    ++ADV LD+   NG T +++ LW+  PVVT
Sbjct: 958  NQACEDLGVDINRLIFLGLTKTEEEHRAIYKVADVMLDSYPYNGGTHNLEALWSELPVVT 1017

Query: 158  LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
              G    SR+  + L ++     +A + +EY  + I  G +
Sbjct: 1018 RSGRQYLSRMGYAFLKSVNLDIGVAWSWEEYTQLGIEFGQN 1058


>gi|423063206|ref|ZP_17051996.1| TPR repeat-containing protein [Arthrospira platensis C1]
 gi|406715328|gb|EKD10484.1| TPR repeat-containing protein [Arthrospira platensis C1]
          Length = 1103

 Score = 70.5 bits (171), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 83/161 (51%), Gaps = 9/161 (5%)

Query: 44   RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI----- 98
            RQQ  +P +A+ Y       K +   ++  VN+L++VPNS+L        G+A +     
Sbjct: 901  RQQLNIPPNAVAYLCVAPGRKTNGEMIKAQVNILRSVPNSVLIR---KGQGDAQLLREMY 957

Query: 99   QATAQALGLDQHRILFSNVAA-KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
                + LG+D +R++F  +   +EEH    ++ADV LD+   NG T +++ LW+  PVVT
Sbjct: 958  NQACEDLGVDINRLIFLGLTKTEEEHRAIYKVADVMLDSYPYNGGTHNLEALWSELPVVT 1017

Query: 158  LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
              G    SR+  + L ++     +A + +EY  + I  G +
Sbjct: 1018 RSGRQYLSRMGYAFLKSVNLDIGVAWSWEEYTQLGIEFGQN 1058


>gi|443646868|ref|ZP_21129546.1| methyltransferase, FkbM family domain protein [Microcystis
           aeruginosa DIANCHI905]
 gi|443335697|gb|ELS50161.1| methyltransferase, FkbM family domain protein [Microcystis
           aeruginosa DIANCHI905]
          Length = 1246

 Score = 70.5 bits (171), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/158 (31%), Positives = 76/158 (48%), Gaps = 8/158 (5%)

Query: 48  GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL------LKFPAVGEA-NIQA 100
            L  + I +   N   K++P  L +W  +L+A+P S L L      L  P   ++  +  
Sbjct: 505 ALSSEYITFGCLNNFSKLNPFLLSLWAKILQALPQSRLILKSHYHNLDDPEEKQSVELFL 564

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             Q L L+Q  ++ S   A E++       D+ LDT   NG TT+ D LW G PV+TL G
Sbjct: 565 QEQGLNLEQVELIDSPTLA-EDYFALYHRIDIHLDTFPYNGCTTTCDALWMGVPVLTLAG 623

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +    R+  S L  +G  + IA + +EY + AI    D
Sbjct: 624 DRKIQRMGNSLLQAIGLGDWIAHSPEEYVNKAITFAQD 661


>gi|427732279|ref|YP_007078516.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
            [Nostoc sp. PCC 7524]
 gi|427368198|gb|AFY50919.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
            [Nostoc sp. PCC 7524]
          Length = 1323

 Score = 70.5 bits (171), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 48/181 (26%), Positives = 84/181 (46%), Gaps = 12/181 (6%)

Query: 32   GEEVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP 91
            G E  +      R   G+PEDA+V+ +    +K  P  +  W  ++  V NS+L LL F 
Sbjct: 955  GTEEGKLTTPVERNSLGVPEDAVVFISGANYFKTVPELMGTWAKIISKVSNSVLVLLPFG 1014

Query: 92   AVGEAN---------IQATAQALGLDQHRILFSN---VAAKEEHVRRGQLADVCLDTPLC 139
                 N         +++     GL   R++  +   V  +E+     ++ADV LD+   
Sbjct: 1015 LNWSNNYPKIEFIHHLKSIFSKHGLATERLIVLDPQPVPDREDMKEYYKIADVYLDSYPF 1074

Query: 140  NGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
             G T+ ++ L    PV+   G    S + A+ + TL  P L+A + + Y ++A+ LGT+ 
Sbjct: 1075 AGTTSLIEPLQVNLPVIARQGNCFRSAMGAAIIQTLDIPNLVADSEESYIELAVALGTNS 1134

Query: 200  D 200
            +
Sbjct: 1135 E 1135


>gi|330827422|ref|XP_003291797.1| hypothetical protein DICPUDRAFT_39606 [Dictyostelium purpureum]
 gi|325077989|gb|EGC31666.1| hypothetical protein DICPUDRAFT_39606 [Dictyostelium purpureum]
          Length = 717

 Score = 70.5 bits (171), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 53/153 (34%), Positives = 74/153 (48%), Gaps = 6/153 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF-PAVGEANIQATAQAL---GLDQ 109
           I + +FN + K     L  W  +L+  P S   LLK  P V E   Q+  + L   G + 
Sbjct: 532 ITFGSFNIMAKYSDKCLNNWRIILEKSPPSTRLLLKSKPFVCEKTKQSFLKKLAKMGFNT 591

Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR-V 167
            +I L      +++H++  ++ D+ LDT    G TT+ + LW G PVVTL      S  V
Sbjct: 592 SQIDLIGLFPQQKDHLQYYKMMDISLDTFPYAGTTTTCEALWMGVPVVTLYSPNCHSHNV 651

Query: 168 AASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             S L  L  P LIA +  EY DIA+ L  D D
Sbjct: 652 GKSILTNLDIPSLIAHSESEYVDIALSLSKDID 684


>gi|429736729|ref|ZP_19270617.1| hypothetical protein HMPREF9163_01491 [Selenomonas sp. oral taxon
           138 str. F0429]
 gi|429154357|gb|EKX97091.1| hypothetical protein HMPREF9163_01491 [Selenomonas sp. oral taxon
           138 str. F0429]
          Length = 905

 Score = 70.5 bits (171), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 46/151 (30%), Positives = 77/151 (50%), Gaps = 7/151 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLDQ 109
           I++ +FNQ  KI    L  W  +++ +PNS L LLK  A  +  +  TA    + +G D 
Sbjct: 712 ILFASFNQYRKIRDDMLVAWRQIMERIPNSRL-LLKNSAYQQPGVAMTAYKRLKEMGFDM 770

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+ F      ++++ R    D+ LDT    G  T+ D L+ G PV++   E  ++R + 
Sbjct: 771 SRVTFE--PGTKDYMMRYLDVDIALDTFPWPGGGTTCDALYMGVPVISYYTERHSTRFSY 828

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           S LA +G  EL +   ++Y + A+ L  + D
Sbjct: 829 SILANMGLAELASERMEDYIETAVALAGNLD 859


>gi|395762104|ref|ZP_10442773.1| TPR domain-containing protein [Janthinobacterium lividum PAMC
           25724]
          Length = 1070

 Score = 70.5 bits (171), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 65/136 (47%), Gaps = 1/136 (0%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP-AVGEANIQATAQALGLDQHRI 112
           + + +FN++ KI+   +  W  +L+A+P++ + L   P +     I    +  G+   R+
Sbjct: 861 LTFGSFNRITKINRKVIARWSMLLRAIPDARMLLAAMPTSAPHPLIMRWLEEEGIAPERL 920

Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
            F       + +      D+CLDT    G TT+   LW G P +T  G TL SR+  S L
Sbjct: 921 TFEGRTGITQFMEMHHRVDICLDTFPYGGGTTTFHALWMGVPTLTTAGTTLPSRIGTSIL 980

Query: 173 ATLGCPELIARTHKEY 188
             +G  E IA+   +Y
Sbjct: 981 LQVGLEEFIAKDDDDY 996


>gi|302878312|ref|YP_003846876.1| hypothetical protein Galf_1082 [Gallionella capsiferriformans ES-2]
 gi|302581101|gb|ADL55112.1| Tetratricopeptide TPR_1 repeat-containing protein [Gallionella
           capsiferriformans ES-2]
          Length = 793

 Score = 70.5 bits (171), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 76/145 (52%), Gaps = 5/145 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLDQ 109
           + + +FN+L K   +    W  +L AVP S + +LK   + +A + A         G++ 
Sbjct: 597 VTFGSFNRLVKSSEAVYAAWAKILLAVPGSRM-ILKTHELDDAVVSARVAGYFVRAGVEV 655

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            RI+     ++E H+      D+ LD     G  T+++ L  G PV+TL   T+  RV+A
Sbjct: 656 GRIVLQGKTSREAHLAAFNQIDIALDPFPHGGGMTAIEGLMMGVPVITLRWPTMVGRVSA 715

Query: 170 SQLATLGCPELIARTHKEYQDIAIR 194
           S + TLG  + IA+T +EY D+A++
Sbjct: 716 SIMTTLGLTDWIAQTPQEYVDMAVQ 740


>gi|392377847|ref|YP_004985006.1| conserved protein of unknown function; putative TPR domains
           [Azospirillum brasilense Sp245]
 gi|356879328|emb|CCD00240.1| conserved protein of unknown function; putative TPR domains
           [Azospirillum brasilense Sp245]
          Length = 634

 Score = 70.5 bits (171), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 47/150 (31%), Positives = 72/150 (48%), Gaps = 6/150 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQA----TAQALGLDQ 109
           + + +FN   K+ P  +++W  +L  VP++ L  LK  A G+A  +          G+D 
Sbjct: 434 VTFGSFNNAAKVTPEVMRVWSAILARVPSARL-CLKSRAFGDAPTRERYLRQFAGNGVDP 492

Query: 110 HRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
            R+ L   +   + H++     D+ LD    NG TT+ + LW G PV+TL G    +RV 
Sbjct: 493 GRVDLLPPMDVIDHHLQAYDRIDIGLDPFPYNGTTTTCEALWMGVPVITLAGRHHVARVG 552

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           AS L   G  E IA     Y + A+ L  D
Sbjct: 553 ASLLTQCGLAEFIATDEAGYMETAVALAGD 582


>gi|83593683|ref|YP_427435.1| glycosyl transferase family protein [Rhodospirillum rubrum ATCC
            11170]
 gi|386350429|ref|YP_006048677.1| glycosyl transferase family protein [Rhodospirillum rubrum F11]
 gi|83576597|gb|ABC23148.1| Glycosyl transferase [Rhodospirillum rubrum ATCC 11170]
 gi|346718865|gb|AEO48880.1| glycosyl transferase [Rhodospirillum rubrum F11]
          Length = 1837

 Score = 70.5 bits (171), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 68/141 (48%), Gaps = 4/141 (2%)

Query: 63   YKIDPSTLQMWVNVLKAVPNSILWLLKFPAVG---EANIQATAQALGLDQHRILFSNVAA 119
            ++I  S L+ W ++L  + N+ L +L   A      A      QA G+   R+++    A
Sbjct: 982  FRISRSCLETWADLLAGMDNARLMILGRQAHDWEFRAGFSRLLQACGIAPERVVYRFQQA 1041

Query: 120  KEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGE-TLASRVAASQLATLGCP 178
              +H+      D+ LDT   NG     D LW G PVVTL G   LA R  AS LA   C 
Sbjct: 1042 YADHLHSFAGIDIALDTFPANGGLCLADALWMGVPVVTLAGTGLLAERQGASLLAAADCG 1101

Query: 179  ELIARTHKEYQDIAIRLGTDR 199
            + IAR+  +Y  IA +L  DR
Sbjct: 1102 DWIARSPADYLAIARKLAGDR 1122


>gi|167589230|ref|ZP_02381618.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia ubonensis Bu]
          Length = 760

 Score = 70.5 bits (171), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 49/150 (32%), Positives = 72/150 (48%), Gaps = 5/150 (3%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALGLD 108
           ++ + + N   K+  + + +W  VL AVP S L LLK P + EA    ++ A   A G+ 
Sbjct: 559 SVTFGSLNNANKLGDAVIALWARVLHAVPGSRL-LLKSPQLDEAALRVSLAARFAAHGIT 617

Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
             R+     +A+  H+      D+ LD     G TTS++ LW G P+VT  G    S + 
Sbjct: 618 AERLPMRGGSARLAHLSAFNDIDIALDPFPYPGGTTSLEGLWMGVPLVTRRGGRFLSHIG 677

Query: 169 ASQLATLGCPELIARTHKEYQDIAIRLGTD 198
            S L TL  PE IAR   +Y   A    +D
Sbjct: 678 ESILHTLDMPEWIARDDDDYVAKAAAFASD 707


>gi|292670017|ref|ZP_06603443.1| TPR domain/SEC-C domain protein [Selenomonas noxia ATCC 43541]
 gi|292648318|gb|EFF66290.1| TPR domain/SEC-C domain protein [Selenomonas noxia ATCC 43541]
          Length = 468

 Score = 70.5 bits (171), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 50/154 (32%), Positives = 74/154 (48%), Gaps = 4/154 (2%)

Query: 49  LPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA--NIQATAQALG 106
           +P D+ V+ +FN   KI    LQ+W  +L+AVP + L L       E    +Q  A+  G
Sbjct: 273 VPHDSPVFASFNNFAKITDEMLQIWGGILRAVPGAHLLLKNVHPQRETLRRMQRRAERAG 332

Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
           +D  R+        ++++R    AD+ LDT    G  T+ + L  G PVVTL G    +R
Sbjct: 333 IDPVRLELR--PGSKDYLRDYLDADIILDTYPYQGGGTTCEALCMGLPVVTLAGTRHGAR 390

Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
              S L   G  EL A + + Y + A  L  DR+
Sbjct: 391 FGVSLLQNAGLGELAADSVQAYAERAALLAQDRE 424


>gi|409993375|ref|ZP_11276518.1| hypothetical protein APPUASWS_19737 [Arthrospira platensis str.
            Paraca]
 gi|291568213|dbj|BAI90485.1| TPR domain protein [Arthrospira platensis NIES-39]
 gi|409935756|gb|EKN77277.1| hypothetical protein APPUASWS_19737 [Arthrospira platensis str.
            Paraca]
          Length = 1104

 Score = 70.1 bits (170), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 82/161 (50%), Gaps = 9/161 (5%)

Query: 44   RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANI----- 98
            RQQ  +P +A+ Y       K +   ++  VN+L++VPNS+L        G+A +     
Sbjct: 902  RQQLNIPPNAVAYLCVAPGRKTNGEMIKAQVNILRSVPNSVLIR---KGQGDAQLLREMY 958

Query: 99   QATAQALGLDQHRILFSNVA-AKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVT 157
                Q +G+D +R++F  +   +EEH    ++ADV LD+   NG T +++ LW+  PVVT
Sbjct: 959  NQACQEVGVDLNRLIFLGLTQTEEEHRAIYKVADVMLDSYPYNGGTHNLEALWSELPVVT 1018

Query: 158  LPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
              G    SR+  + L  +     +A + +EY  + I  G +
Sbjct: 1019 RSGRQYLSRMGYAFLKAVNLDIGVAWSWEEYTQLGIEFGHN 1059


>gi|392381948|ref|YP_005031145.1| conserved protein of unknown function; glycosyltransferase and TPR
           domains [Azospirillum brasilense Sp245]
 gi|356876913|emb|CCC97706.1| conserved protein of unknown function; glycosyltransferase and TPR
           domains [Azospirillum brasilense Sp245]
          Length = 636

 Score = 70.1 bits (170), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 5/131 (3%)

Query: 56  YCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL----GLDQHR 111
           + +FN L K++     +W  VL AVP S L LL+ P + +A   A   AL    GLD  R
Sbjct: 436 FGSFNALPKLNAEVAALWARVLGAVPGSRL-LLRTPGLDDAGTAARTLALFERAGLDPAR 494

Query: 112 ILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQ 171
           +     A   E +      DV LD    +G  T+++ LW G PVVTL G+   +R + + 
Sbjct: 495 VDLRGGAPHREFLAGYGEVDVALDPFPYSGGLTTLEALWMGVPVVTLDGDRFCARHSVTH 554

Query: 172 LATLGCPELIA 182
           LA+ G  +L+A
Sbjct: 555 LASAGLSDLVA 565


>gi|288960866|ref|YP_003451206.1| TPR repeat-containing protein [Azospirillum sp. B510]
 gi|288913174|dbj|BAI74662.1| TPR repeat-containing protein [Azospirillum sp. B510]
          Length = 974

 Score = 70.1 bits (170), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 48/153 (31%), Positives = 72/153 (47%), Gaps = 12/153 (7%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRIL 113
           I + +FN + K++ +T+ +W  +L AVP + L +LK+  +G+             QH I 
Sbjct: 769 ITFGSFNNITKVNAATIALWARLLAAVPTARL-VLKWRGLGDGGGAGLRLLAAFAQHGIE 827

Query: 114 FSNVAAK-------EEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
              +  +       +++V      DV LD    NG TT  D LW G PV+   G T   R
Sbjct: 828 AERLDLRGITPDPYQDYV----TIDVALDPAFANGGTTICDALWMGVPVLNQSGPTKIGR 883

Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
             A+ L  +G  EL+ R   EY    +RL TDR
Sbjct: 884 WGATMLDAVGLGELVTRDDGEYLARGVRLATDR 916


>gi|425460198|ref|ZP_18839680.1| Genome sequencing data, contig C315 [Microcystis aeruginosa PCC
           9808]
 gi|389827193|emb|CCI21793.1| Genome sequencing data, contig C315 [Microcystis aeruginosa PCC
           9808]
          Length = 1246

 Score = 70.1 bits (170), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 49/158 (31%), Positives = 75/158 (47%), Gaps = 8/158 (5%)

Query: 48  GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL------LKFPAVGEA-NIQA 100
            L  + I +   N   K++P  L +W  +L+A+P S L L      L  P   ++  +  
Sbjct: 505 ALSSEYITFGCLNNFSKLNPFLLSLWAKILQALPQSRLILKSHYHNLDDPEEKQSVELFL 564

Query: 101 TAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPG 160
             Q   L+Q  ++ S   A E++       D+ LDT   NG TT+ D LW G PV+TL G
Sbjct: 565 QEQGFNLEQVELIDSPTLA-EDYFALYHRIDIHLDTFPYNGCTTTCDALWMGVPVLTLAG 623

Query: 161 ETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +    R+  S L  +G  + IA + +EY + AI    D
Sbjct: 624 DRKIQRMGNSLLQAIGLEDWIAHSPEEYVNKAITFAQD 661


>gi|170694349|ref|ZP_02885503.1| TPR repeat-containing protein [Burkholderia graminis C4D1M]
 gi|170140772|gb|EDT08946.1| TPR repeat-containing protein [Burkholderia graminis C4D1M]
          Length = 791

 Score = 70.1 bits (170), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 40/148 (27%), Positives = 75/148 (50%), Gaps = 1/148 (0%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPA-VGEANIQATAQALGLDQHRI 112
           + Y +FN+L K+    + +W  +L+A P+S + +    + + E        A G+   R+
Sbjct: 575 VTYGSFNRLNKLSQEVIALWSVILRAEPSSRMVIGAISSKLDEPTYLEWFAAEGITADRL 634

Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
            F    +   ++++    D+CLDT    G TT+++ LW G P +T+PG ++ SR  A  L
Sbjct: 635 TFCPRGSLPLYMQQHHQVDLCLDTFPYTGSTTTLNALWMGVPTITMPGISMPSRGGACWL 694

Query: 173 ATLGCPELIARTHKEYQDIAIRLGTDRD 200
             +G  + I R  +++   ++ L  D D
Sbjct: 695 EHVGLEQFIVRDKEDFVRKSLELTRDLD 722


>gi|281200878|gb|EFA75092.1| hypothetical protein PPL_11166 [Polysphondylium pallidum PN500]
          Length = 667

 Score = 69.7 bits (169), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 45/151 (29%), Positives = 73/151 (48%), Gaps = 4/151 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQAL---GLDQH 110
           I +  +N L K+   T  +W  +L  +P + + +     + E+  QA  + L   G+D  
Sbjct: 477 ITFGCYNTLSKVQKGTWAVWKLILDRLPTARIAIKAPLFIVESAAQAYLKKLADIGVDTT 536

Query: 111 RILFSNVAAKE-EHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+L    + +   H       D+ LD    NG TTS+D LW G P V++ G T    V  
Sbjct: 537 RVLLKTYSMETGNHYDSYNEMDISLDPFPYNGTTTSLDSLWMGVPFVSMSGVTHVHNVGR 596

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           S L  +G  EL+    +EY +IA++L +D +
Sbjct: 597 SILTNVGLKELVGSNPEEYVNIAVQLASDTE 627


>gi|17228054|ref|NP_484602.1| hypothetical protein alr0558 [Nostoc sp. PCC 7120]
 gi|17129903|dbj|BAB72516.1| alr0558 [Nostoc sp. PCC 7120]
          Length = 1013

 Score = 69.7 bits (169), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 45/167 (26%), Positives = 79/167 (47%), Gaps = 12/167 (7%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKF-----------PA 92
           R   G+P +A+V+ +    +K  P  +  W  ++  VPNS+L LL F             
Sbjct: 656 RNSLGIPGNAVVFISGANYFKTVPELMDTWAKIISIVPNSVLVLLPFGPNWSSNYPKAEF 715

Query: 93  VGEANIQATAQALGLDQHRILFSN-VAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWT 151
           +   N   +   LG ++ R+L    V  +E+     ++ADV LD+    G T+ ++ L  
Sbjct: 716 INHLNSIFSRHGLGTERLRVLDPQPVPDREDMKEYYKIADVYLDSYPFAGTTSLIEPLQV 775

Query: 152 GTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
             PV+   G    S + A+ + TL  P+L+A +   Y ++A+ LG +
Sbjct: 776 NLPVIARQGSCFRSAMGAAIIQTLNIPDLVADSEDSYIELAVALGNN 822


>gi|186681177|ref|YP_001864373.1| group 1 glycosyl transferase [Nostoc punctiforme PCC 73102]
 gi|186463629|gb|ACC79430.1| glycosyl transferase, group 1 [Nostoc punctiforme PCC 73102]
          Length = 2289

 Score = 69.7 bits (169), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 44/161 (27%), Positives = 84/161 (52%), Gaps = 2/161 (1%)

Query: 40   VITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQ 99
            VI +R  + L +DA++Y      +K  P    ++  +   +P +    L+   + E  ++
Sbjct: 2092 VIKTRSDFRLEDDAVIYLCCQAPFKYLPQYDFIFAEIACRLPQAKFVFLR-GTLLEPRLK 2150

Query: 100  ATAQALGLD-QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
                A+GL+ +   +F ++  + +++    L+DV LDT   +G  T+++ +    P+VT 
Sbjct: 2151 RAFAAVGLNSEDYCVFLSIPERLDYLMINLLSDVYLDTFTWSGGNTTLEAIACNLPIVTC 2210

Query: 159  PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
            PGE +  R + S L  LG  + IA+   EY +IA++LG D+
Sbjct: 2211 PGEFMRGRHSDSFLKMLGVTDTIAQNEAEYIEIAVKLGLDQ 2251


>gi|401565692|ref|ZP_10806514.1| glycosyltransferase family 41 domain protein [Selenomonas sp.
           FOBRC6]
 gi|400185544|gb|EJO19772.1| glycosyltransferase family 41 domain protein [Selenomonas sp.
           FOBRC6]
          Length = 906

 Score = 69.7 bits (169), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 46/151 (30%), Positives = 77/151 (50%), Gaps = 7/151 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLDQ 109
           I++ +FNQ  KI    L  W  +++ +PNS L LLK  A  +  +  TA    + +G D 
Sbjct: 713 ILFASFNQYRKIRDDMLVAWHQIMERIPNSRL-LLKNSAYQQPGVAMTAYKRLKEMGFDM 771

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+ F      ++++ R    D+ LDT    G  T+ D L+ G PV++   E  ++R + 
Sbjct: 772 SRVTFE--PGTKDYMMRYLDVDIALDTFPWPGGGTTCDALYMGVPVISYYTERHSTRFSY 829

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           S LA +G  EL +   ++Y + A+ L  + D
Sbjct: 830 SILANMGLAELASERMEDYIETAVVLAGNLD 860


>gi|429736728|ref|ZP_19270616.1| tetratricopeptide repeat protein [Selenomonas sp. oral taxon 138
           str. F0429]
 gi|429154356|gb|EKX97090.1| tetratricopeptide repeat protein [Selenomonas sp. oral taxon 138
           str. F0429]
          Length = 1079

 Score = 69.7 bits (169), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 51/156 (32%), Positives = 75/156 (48%), Gaps = 15/156 (9%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQALGLDQHRI 112
           +I++ +FN   K++   L+ W  +L  V  S L L       +A++ + A A      RI
Sbjct: 359 SIIFGSFNNFTKLNDEVLRAWAEILHRVEGSRLLL-------KADVFSYADARAEVMQRI 411

Query: 113 LFSN---VAAKEEHVRRGQLA-----DVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLA 164
             +    V    E   R  LA     D+ LD     G  T+ D L+ G PVVT+ GE+L 
Sbjct: 412 EAAGIPLVCVDTEGASRDYLAAYNRMDIALDPFPYPGGGTTCDALYMGVPVVTMRGESLG 471

Query: 165 SRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
           SR  AS +  +G   LIA+T +EY D A+ L  D +
Sbjct: 472 SRFGASLVENIGAGALIAQTTEEYIDRAVSLARDTE 507



 Score = 65.9 bits (159), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 49/152 (32%), Positives = 76/152 (50%), Gaps = 9/152 (5%)

Query: 54   IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLK-----FPAVGEANIQATAQALGLD 108
            I++  FNQ  K     L +W  +++ +P S L LLK      PA+ EA  +   + LG D
Sbjct: 887  ILFGVFNQYRKFTDEMLLLWREIMERMPTSKL-LLKSQIFFAPAMVEA-ARMRLERLGFD 944

Query: 109  QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
              R++     A  +++ R    D+ LDT    G  T+ D L+ G PVVTL GE  ++R +
Sbjct: 945  LTRVILE--PATTDYMERYLDVDIALDTYPWPGGGTTCDALYMGVPVVTLYGERRSTRFS 1002

Query: 169  ASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
             + L  +G  EL  ++  +Y   A+ L  D D
Sbjct: 1003 YALLTHVGLEELAVQSSADYVAYAVTLAGDLD 1034


>gi|119488074|ref|ZP_01621518.1| TPR repeat protein [Lyngbya sp. PCC 8106]
 gi|119455363|gb|EAW36502.1| TPR repeat protein [Lyngbya sp. PCC 8106]
          Length = 759

 Score = 69.7 bits (169), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 50/179 (27%), Positives = 86/179 (48%), Gaps = 22/179 (12%)

Query: 34  EVPQSIVITSRQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAV 93
           E+P+  +  +R  +GL  D ++Y +   L+K  P   Q++V + + VP     L +F  V
Sbjct: 551 EIPE--LEKTRSNFGLKNDTVIYLSCQSLFKYLPQYDQIFVEIAQQVP-----LAQFVFV 603

Query: 94  GEANIQATAQALGLDQHRILFSNVA-------------AKEEHVRRGQLADVCLDTPLCN 140
              N  A ++     + ++ F++                K ++++   +AD+ LDT    
Sbjct: 604 SHWN-SAISEKF-RKRLKLAFADFGLNSEDYCTILPRLEKSDYLQLNLIADIGLDTIQFT 661

Query: 141 GHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDR 199
           G  T++D +    P+VT  GE + SR  A  L  +G  E I R  KEY  IA++LG +R
Sbjct: 662 GFLTTLDSIACNLPIVTCEGELMRSRQTAGILKRIGVTETIVRNEKEYIKIAVQLGLNR 720


>gi|172059231|ref|YP_001806883.1| hypothetical protein BamMC406_0166 [Burkholderia ambifaria MC40-6]
 gi|171991748|gb|ACB62667.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia ambifaria
           MC40-6]
          Length = 754

 Score = 69.7 bits (169), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 49/142 (34%), Positives = 67/142 (47%), Gaps = 5/142 (3%)

Query: 51  EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALG 106
           E  + +   N   KI  + + +W  VL AVP S L LLK   + EA    +I A   A G
Sbjct: 558 EGVVTFGCLNNANKIGDAVVALWSRVLHAVPGSRL-LLKSAQLDEAALRTSIAARFAAHG 616

Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
           +   R+L    + +  H+      D+ LD     G TTSM+ LW G P +T  G+   S 
Sbjct: 617 IPVERLLLRGGSKRLAHIATYNDIDIVLDPFPYPGGTTSMEGLWMGAPFITRRGDRFLSH 676

Query: 167 VAASQLATLGCPELIARTHKEY 188
           +  S L TLG PE IA    +Y
Sbjct: 677 IGESILHTLGMPEWIADNDDDY 698


>gi|330815133|ref|YP_004358838.1| hypothetical protein bgla_1g01810 [Burkholderia gladioli BSR3]
 gi|327367526|gb|AEA58882.1| TPR domain protein [Burkholderia gladioli BSR3]
          Length = 814

 Score = 69.7 bits (169), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 73/146 (50%), Gaps = 1/146 (0%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEAN-IQATAQALGLDQHRI 112
           + + +FN+L K+    + +W  +L AVP S + +   P  G  + +     A G+   R+
Sbjct: 592 VTFGSFNRLNKLRHDVIAVWARILHAVPGSRMRIGSIPRDGGVDMLLGWFSAEGISHDRL 651

Query: 113 LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQL 172
                A    ++++    D  LDT    G TT+++ LW G P +T+ G+TLASR  A  +
Sbjct: 652 DLQPRAPAAVYLQQHHHVDFGLDTFPYTGSTTALNALWMGVPTLTIRGDTLASRAGAVWM 711

Query: 173 ATLGCPELIARTHKEYQDIAIRLGTD 198
           +++G  + +A    ++   AI L  D
Sbjct: 712 SSVGLEQFVADDVDDFVARAIALAGD 737


>gi|115524525|ref|YP_781436.1| hypothetical protein RPE_2518 [Rhodopseudomonas palustris BisA53]
 gi|115518472|gb|ABJ06456.1| Tetratricopeptide TPR_2 repeat protein [Rhodopseudomonas palustris
           BisA53]
          Length = 676

 Score = 69.7 bits (169), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 45/134 (33%), Positives = 68/134 (50%), Gaps = 5/134 (3%)

Query: 59  FNQLYKIDPSTLQMWVNVLKAVPNSIL----WLLKFPAVGEANIQATAQALGLDQHRILF 114
           FN++ KI  ++ Q W  +L  VPN+      W L   A+   N+ A    LG+   RI  
Sbjct: 487 FNRIGKISDASAQAWAQILHLVPNARFVVKHWGLD-DALTRDNLLARFARLGVPPERIAL 545

Query: 115 SNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLAT 174
               ++ +H+      D+CLDT   NG  ++ + L  G PVV L G T +SRVAA+ L  
Sbjct: 546 RGTTSRPDHLAALNGVDICLDTFPQNGGVSTWEALQMGVPVVALIGATASSRVAAAILTA 605

Query: 175 LGCPELIARTHKEY 188
           +G  + IA + + Y
Sbjct: 606 VGMTDWIADSPESY 619


>gi|428769583|ref|YP_007161373.1| hypothetical protein Cyan10605_1210 [Cyanobacterium aponinum PCC
           10605]
 gi|428683862|gb|AFZ53329.1| hypothetical protein Cyan10605_1210 [Cyanobacterium aponinum PCC
           10605]
          Length = 725

 Score = 69.7 bits (169), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 46/162 (28%), Positives = 78/162 (48%), Gaps = 5/162 (3%)

Query: 44  RQQYGLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFP-AVGEANIQ--- 99
           ++  G+   AI+Y +     K  P  +++ + +L+ V +S   +  +   +   NI+   
Sbjct: 520 KESLGISNSAIIYLSLQTGLKRHPEYIRLQMKILRQVADSYFLVSGYKNELSMNNIKNLF 579

Query: 100 -ATAQALGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTL 158
              AQ  G++  RI F      +++     + DV LDT   NG TT++D LW   P+VT 
Sbjct: 580 TQIAQQEGVNPDRIKFLPYMPLQDYRANLFIGDVVLDTYPFNGATTTLDALWLNIPLVTR 639

Query: 159 PGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTDRD 200
            G+   SR   + L  LG  E +A T +EY    ++ GTD +
Sbjct: 640 VGQQFHSRQGYTFLQNLGITEGMAYTDEEYIQWGVKFGTDEE 681


>gi|225163601|ref|ZP_03725909.1| tetratricopeptide TPR_2 repeat protein [Diplosphaera colitermitum
           TAV2]
 gi|224801803|gb|EEG20091.1| tetratricopeptide TPR_2 repeat protein [Diplosphaera colitermitum
           TAV2]
          Length = 652

 Score = 69.3 bits (168), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 71/141 (50%), Gaps = 6/141 (4%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATAQA----LGLD 108
           ++ +  FN   K   +    W  +L AVP S L LLK   + +A ++A   A     GL 
Sbjct: 455 SVTFGCFNSPTKFTDTLFATWARLLAAVPGSSL-LLKGRDLEDAGVRAHLFARMNRAGLP 513

Query: 109 QHRI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRV 167
           + R  L    A    H+   +  D+ LDT   NG TT+ + LW G PV+T+ G+  A+RV
Sbjct: 514 EERTELLPRTADTASHLALYKRVDIALDTFPYNGTTTTCESLWMGRPVITIGGDRHAARV 573

Query: 168 AASQLATLGCPELIARTHKEY 188
           +AS L  +G PE IA    +Y
Sbjct: 574 SASLLTAIGRPEWIASCPDDY 594


>gi|448747261|ref|ZP_21728922.1| Methyltransferase FkbM [Halomonas titanicae BH1]
 gi|445565173|gb|ELY21285.1| Methyltransferase FkbM [Halomonas titanicae BH1]
          Length = 1072

 Score = 69.3 bits (168), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 41/134 (30%), Positives = 65/134 (48%), Gaps = 3/134 (2%)

Query: 70  LQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQHRILFSNVAAKEEHVRR 126
           L+ W  ++ A+PNS L L       E     I+ T    G++  R+L    +   E +  
Sbjct: 3   LEQWAKIMLALPNSKLMLKSMQYKSEERCQKIKDTMALYGIESDRLLIEGPSPHAELLDA 62

Query: 127 GQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHK 186
               D+ LD    +G  T+ +    G PVV+LPG T A R +A+ L   G PEL+  + +
Sbjct: 63  YNKVDISLDPWPYSGGLTTCESFLMGVPVVSLPGPTFAGRHSATHLINAGMPELVVSSWE 122

Query: 187 EYQDIAIRLGTDRD 200
           EY++  + L +D D
Sbjct: 123 EYRERVLELASDLD 136


>gi|347821315|ref|ZP_08874749.1| hypothetical protein VeAt4_19630, partial [Verminephrobacter
           aporrectodeae subsp. tuberculatae At4]
          Length = 728

 Score = 69.3 bits (168), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 73/149 (48%), Gaps = 6/149 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWL----LKFPAVGEANIQATAQALGLDQ 109
           + + + N L K+    L +W  VL A P S L +    L  P   +A  Q   Q LGL+ 
Sbjct: 412 VTFGSCNNLGKLSDEVLALWGRVLAATPGSRLLIEGKNLDRPDFADAYRQ-RCQGLGLEP 470

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+    +    +++   ++ D+ LD    +G TT+ D LW G PVV++ G +   R+  
Sbjct: 471 QRLDLVALRNANQYLTYHRI-DIALDPFPLSGGTTTFDALWMGLPVVSMVGASFKGRMGV 529

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           S L  LG  E +A T ++Y  IA RL  D
Sbjct: 530 SLLTHLGRSEWLAETPQDYVRIAQRLAAD 558


>gi|159472004|ref|XP_001694146.1| O-linked N-acetylglucosamine transferase [Chlamydomonas
           reinhardtii]
 gi|158277313|gb|EDP03082.1| O-linked N-acetylglucosamine transferase [Chlamydomonas
           reinhardtii]
          Length = 837

 Score = 69.3 bits (168), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 48/145 (33%), Positives = 72/145 (49%), Gaps = 7/145 (4%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQH 110
           I + +FN L KI P  L++W  +L AVP S L L   P   EA   ++     ALG++  
Sbjct: 644 ITFGSFNNLAKITPQVLRVWAAILAAVPRSRLVLKNKPFACEAARSHLLRQLAALGVEGW 703

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L        +H+ +  L D+ LD     G TT+ + L+ G P +TL G   A  V  
Sbjct: 704 RVDLLPLAPGNSQHLSQYALMDISLDPFPYAGTTTTTESLYMGVPTLTLAGRCHAHNVGV 763

Query: 170 SQLATLG---CPELIARTHKEYQDI 191
           S L  +G    P  +AR+ +EY ++
Sbjct: 764 SLLTAVGLHPLPPWVARSEEEYVEL 788


>gi|390960058|ref|YP_006423815.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Terriglobus roseus DSM 18391]
 gi|390414976|gb|AFL90480.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Terriglobus roseus DSM 18391]
          Length = 598

 Score = 69.3 bits (168), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 43/147 (29%), Positives = 70/147 (47%), Gaps = 1/147 (0%)

Query: 48  GLPEDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGE-ANIQATAQALG 106
            L      +C   ++ K++   + +W  +L+  P S + L   P   E   ++A     G
Sbjct: 397 ALASGVFTFCCVARINKLNRPHVALWARMLQCAPASRMLLATLPGGAEPERVKAWFAEAG 456

Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
           +   +++F++    EE +R   +AD+CLDT   NG TT    L  G P +TL G  L SR
Sbjct: 457 VGAEQLIFAHARTVEEQLRLHHMADLCLDTFPYNGATTISHALCLGVPTLTLRGAVLGSR 516

Query: 167 VAASQLATLGCPELIARTHKEYQDIAI 193
           + AS  + +G  E IA   + Y   A+
Sbjct: 517 LGASINSHVGITEFIAEDGEAYVRKAV 543


>gi|115350209|ref|YP_772048.1| hypothetical protein Bamb_0153 [Burkholderia ambifaria AMMD]
 gi|115280197|gb|ABI85714.1| Tetratricopeptide TPR_2 repeat protein [Burkholderia ambifaria
           AMMD]
          Length = 754

 Score = 69.3 bits (168), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 50/152 (32%), Positives = 69/152 (45%), Gaps = 5/152 (3%)

Query: 51  EDAIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA----NIQATAQALG 106
           E  + +   N   KI  + + +W  VL AVP S L LLK   + EA    ++ A   A G
Sbjct: 558 EGVVTFGCLNNANKIGDAVVALWSRVLHAVPGSRL-LLKSAQLDEAALRTSLAARFAAHG 616

Query: 107 LDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASR 166
           +   R+L    + +  H+      D+ LD     G TTSM+ LW G P +T  G+   S 
Sbjct: 617 IPAERLLLRGGSKRLAHIATYNDIDIVLDPFPYPGGTTSMEGLWMGAPFITRRGDRFLSH 676

Query: 167 VAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
           +  S L TLG PE IA    +Y   A     D
Sbjct: 677 IGESILHTLGMPEWIADNDDDYVAKAAAFAQD 708


>gi|292670019|ref|ZP_06603445.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541]
 gi|292648320|gb|EFF66292.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541]
          Length = 1079

 Score = 69.3 bits (168), Expect = 9e-10,   Method: Composition-based stats.
 Identities = 48/145 (33%), Positives = 73/145 (50%), Gaps = 7/145 (4%)

Query: 53  AIVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLD 108
           +IV+ +FN   K++   L +W  +L+ VP S L LLK      A+ +  A    +A G+ 
Sbjct: 361 SIVFGSFNNFTKLNDRVLSVWAEILRRVPESRL-LLKTDVFSYADSRREALRRIEAAGIP 419

Query: 109 QHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVA 168
             R+      A  +++      D+ LD     G  T+ D L+ G PVVTL G +L SR  
Sbjct: 420 LVRV--DAEGASADYLAAYARVDIALDPFPYPGGGTTCDALYMGVPVVTLAGASLGSRFG 477

Query: 169 ASQLATLGCPELIARTHKEYQDIAI 193
           AS L  +G  EL A T ++Y  +A+
Sbjct: 478 ASLLENIGAKELAAHTEEDYIALAV 502



 Score = 62.8 bits (151), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 52/177 (29%), Positives = 82/177 (46%), Gaps = 18/177 (10%)

Query: 35   VPQSIVITSRQQYGLPEDA---------IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSIL 85
            +P   V   R   GLP  A         I +  FNQ  K   + L +W  +++ VP + L
Sbjct: 862  LPSQFVYVPRA--GLPAAAETPARSRGYITFGVFNQYRKYTDAMLTVWREIMERVPTARL 919

Query: 86   WLLKFPAVGEANIQATAQA----LGLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNG 141
             L+K        +  TA+A    LG D  R+      A  ++++R    D+ LDT    G
Sbjct: 920  -LIKSQVFFSPAMTETARARMKRLGFDLRRVALE--PATTDYMQRYLDVDIALDTYPWPG 976

Query: 142  HTTSMDVLWTGTPVVTLPGETLASRVAASQLATLGCPELIARTHKEYQDIAIRLGTD 198
              T+ D L+ G PVVT+ G   ++R + + LA +G  +L  +T  +Y + A+ L  D
Sbjct: 977  GGTTCDALYMGVPVVTMYGARRSTRFSYALLAHIGQTDLAVQTPADYIERAVSLAGD 1033


>gi|168061951|ref|XP_001782948.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162665566|gb|EDQ52246.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 899

 Score = 69.3 bits (168), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 43/149 (28%), Positives = 75/149 (50%), Gaps = 4/149 (2%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEA---NIQATAQALGLDQH 110
           + + +FN L KI P  L++W  +L  VP + L +   P   ++      +  + LG+   
Sbjct: 635 VTFGSFNNLAKITPRVLRVWARILIKVPTARLVVKCKPFCCDSVRDTFLSKLEELGVKSL 694

Query: 111 RI-LFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
           R+ L   +    +H++   L D+ LDT    G TT+ + L+ G P VT+ G+  A  V  
Sbjct: 695 RVDLLPLILLNHDHMQAYSLMDISLDTFPYAGTTTTCESLYMGVPCVTMAGKVHAHNVGV 754

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
           + L  +G   L+ RT  +Y D A++L ++
Sbjct: 755 TLLHQVGLENLVVRTEDDYVDKAVQLASN 783


>gi|399155209|ref|ZP_10755276.1| hypothetical protein gproSAA_05251, partial [gamma proteobacterium
           SCGC AAA007-O20]
          Length = 163

 Score = 68.9 bits (167), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/91 (40%), Positives = 54/91 (59%)

Query: 106 GLDQHRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLAS 165
           G+   R++F       E++ R ++AD+ LDT   N  TT+ D L  G PV+T  G + AS
Sbjct: 17  GIHPSRLIFGKRLPTPEYLARYRVADLFLDTLPYNAGTTASDALRMGLPVLTCIGNSFAS 76

Query: 166 RVAASQLATLGCPELIARTHKEYQDIAIRLG 196
           RVAAS L  +  PELI  T ++Y+ +AI+L 
Sbjct: 77  RVAASILNAVNLPELITTTQEQYESLAIQLA 107


>gi|398805153|ref|ZP_10564134.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Polaromonas sp. CF318]
 gi|398092315|gb|EJL82730.1| putative O-linked N-acetylglucosamine transferase, SPINDLY family
           [Polaromonas sp. CF318]
          Length = 733

 Score = 68.9 bits (167), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 71/149 (47%), Gaps = 5/149 (3%)

Query: 54  IVYCNFNQLYKIDPSTLQMWVNVLKAVPNSILWLLKFPAVGEANIQATA----QALGLDQ 109
           + +  FN L K+  + + +W  +L+AVP S L L+    + EA+ + T      A G+D 
Sbjct: 524 VTFACFNALPKMGDAVVALWARILQAVPGSRLHLMA-SQLSEASTRQTTLERFMAHGIDA 582

Query: 110 HRILFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDVLWTGTPVVTLPGETLASRVAA 169
            R+L      + ++++  Q  D+ LD     G TT+ + LW G PV+TL G T  SR   
Sbjct: 583 TRLLIQGPMPRIKYLQTYQRVDIALDPFPYTGGTTTAEALWMGVPVLTLAGNTFLSRQGV 642

Query: 170 SQLATLGCPELIARTHKEYQDIAIRLGTD 198
             L   G  E IA    +Y   A+    D
Sbjct: 643 GLLMNAGLSEWIASDADDYVKRAVAHAGD 671


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.131    0.385 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,964,749,775
Number of Sequences: 23463169
Number of extensions: 113467777
Number of successful extensions: 288947
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1446
Number of HSP's successfully gapped in prelim test: 51
Number of HSP's that attempted gapping in prelim test: 286377
Number of HSP's gapped (non-prelim): 1593
length of query: 200
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 65
effective length of database: 9,191,667,552
effective search space: 597458390880
effective search space used: 597458390880
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)