BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>026712
MSSISIPSCLTLARLNANGVINSVPHLQLPITVATPSHLSKRLRFFTVSREVKAFAHNGV
GITNSVPPRNGTYTVGDFMTKKEDLHAVKTTTTVDEALERLVEKRITGFPVIDDDWKLVG
VVSDYDLLALDSISGGNQNDTSLFPNVNSTWKTFNELQRLLSKTNGKVVGDLMTPAPLVV
HENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRAALQIKRDGERST

High Scoring Gene Products

Symbol | Full name | Information | P value
LEJ2 | AT4G36910 | protein from Arabidopsis thaliana | 5.3e-72
LEJ1 | AT4G34120 | protein from Arabidopsis thaliana | 1.4e-71
CPS_2970 | CBS domain protein | protein from Colwellia psychrerythraea 34H | 9.7e-12
CPS_2970 | CBS domain protein | protein from Colwellia psychrerythraea 34H | 9.7e-12
acuB | Acetoin utilization protein AcuB | protein from Carboxydothermus hydrogenoformans Z-2901 | 1.4e-11
CHY_0173 | acetoin utilization protein AcuB | protein from Carboxydothermus hydrogenoformans Z-2901 | 1.4e-11
GSU2010 | CBS domain pair-containing protein | protein from Geobacter sulfurreducens PCA | 1.6e-11
GSU_2010 | CBS domain protein | protein from Geobacter sulfurreducens PCA | 1.6e-11
GSU1801 | CBS domain pair-containing protein | protein from Geobacter sulfurreducens PCA | 5.8e-11
GSU_1801 | CBS domain protein | protein from Geobacter sulfurreducens PCA | 5.8e-11
DDB_G0289609 | CBS (cystathionine-beta-synthase) domain-containing protein | gene from Dictyostelium discoideum | 8.5e-09
CHY_0137 | CBS/GGDEF domain protein | protein from Carboxydothermus hydrogenoformans Z-2901 | 3.1e-07
CHY_0137 | CBS/GGDEF domain protein | protein from Carboxydothermus hydrogenoformans Z-2901 | 3.1e-07
guaB | Inosine-5'-monophosphate dehydrogenase | protein from Mycobacterium tuberculosis | 5.6e-07
SPO2201 | CBS domain protein | protein from Ruegeria pomeroyi DSS-3 | 3.7e-06
SPO_2201 | CBS domain protein | protein from Ruegeria pomeroyi DSS-3 | 3.7e-06
SO_2228 | CBS domain containing protein | protein from Shewanella oneidensis MR-1 | 2.7e-05
SO_2228 | CBS domain protein | protein from Shewanella oneidensis MR-1 | 2.7e-05
VC_1031 | Inosine monophosphate dehydrogenase-related protein | protein from Vibrio cholerae O1 biovar El Tor str. N16961 | 4.2e-05
VC_1031 | CBS domain protein | protein from Vibrio cholerae O1 biovar El Tor | 4.2e-05
HNE_2660 | CBS domain protein | protein from Hyphomonas neptunium ATCC 15444 | 4.5e-05
CHY_1949 | polyA polymerase family protein | protein from Carboxydothermus hydrogenoformans Z-2901 | 7.0e-05
PSPPH_2494 | CBS domain protein | protein from Pseudomonas syringae pv. phaseolicola 1448A | 0.00016
APH_0088 | inosine-5'-monophosphate dehydrogenase | protein from Anaplasma phagocytophilum HZ | 0.00067
BAS4505 | Thioesterase family protein | protein from Bacillus anthracis | 0.00081
BA_4858 | thioesterase family protein | protein from Bacillus anthracis str. Ames | 0.00081
MCA2402 | CBS domain protein | protein from Methylococcus capsulatus str. Bath | 0.00095

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  026712
        (234 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2114970 - symbol:LEJ2 "AT4G36910" species:3702...   728  5.3e-72   1
TAIR|locus:2124276 - symbol:LEJ1 "AT4G34120" species:3702...   724  1.4e-71   1
UNIPROTKB|Q47ZU9 - symbol:CPS_2970 "CBS domain protein" s...   113  9.7e-12   2
TIGR_CMR|CPS_2970 - symbol:CPS_2970 "CBS domain protein" ...   113  9.7e-12   2
UNIPROTKB|Q3AFN9 - symbol:acuB "Acetoin utilization prote...   115  1.4e-11   2
TIGR_CMR|CHY_0173 - symbol:CHY_0173 "acetoin utilization ...   115  1.4e-11   2
UNIPROTKB|Q74BN1 - symbol:GSU2010 "CBS domain pair-contai...   113  1.6e-11   2
TIGR_CMR|GSU_2010 - symbol:GSU_2010 "CBS domain protein" ...   113  1.6e-11   2
UNIPROTKB|Q74C73 - symbol:GSU1801 "CBS domain pair-contai...   152  5.8e-11   1
TIGR_CMR|GSU_1801 - symbol:GSU_1801 "CBS domain protein" ...   152  5.8e-11   1
DICTYBASE|DDB_G0289609 - symbol:DDB_G0289609 "CBS (cystat...    90  8.5e-09   2
UNIPROTKB|Q3AFS5 - symbol:CHY_0137 "CBS/GGDEF domain prot...   110  3.1e-07   2
TIGR_CMR|CHY_0137 - symbol:CHY_0137 "CBS/GGDEF domain pro...   110  3.1e-07   2
UNIPROTKB|P65167 - symbol:guaB "Inosine-5'-monophosphate ...   105  5.6e-07   3
UNIPROTKB|Q5LRC8 - symbol:SPO2201 "CBS domain protein" sp...    91  3.7e-06   2
TIGR_CMR|SPO_2201 - symbol:SPO_2201 "CBS domain protein" ...    91  3.7e-06   2
UNIPROTKB|Q8EEY2 - symbol:SO_2228 "CBS domain containing ...    79  2.7e-05   2
TIGR_CMR|SO_2228 - symbol:SO_2228 "CBS domain protein" sp...    79  2.7e-05   2
UNIPROTKB|Q9KT74 - symbol:VC_1031 "Inosine monophosphate ...    79  4.2e-05   2
TIGR_CMR|VC_1031 - symbol:VC_1031 "CBS domain protein" sp...    79  4.2e-05   2
UNIPROTKB|Q0BYV1 - symbol:HNE_2660 "CBS domain protein" s...    80  4.5e-05   2
TIGR_CMR|CHY_1949 - symbol:CHY_1949 "polyA polymerase fam...   100  7.0e-05   2
UNIPROTKB|Q48IU0 - symbol:PSPPH_2494 "CBS domain protein"...    78  0.00016   2
TIGR_CMR|APH_0088 - symbol:APH_0088 "inosine-5'-monophosp...    88  0.00067   2
UNIPROTKB|Q81KX8 - symbol:BAS4505 "Thioesterase family pr...    88  0.00081   2
TIGR_CMR|BA_4858 - symbol:BA_4858 "thioesterase family pr...    88  0.00081   2
UNIPROTKB|Q604Y3 - symbol:MCA2402 "CBS domain protein" sp...   111  0.00095   1
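
The one-line hit summaries above follow a regular layout: database identifier, symbol, a truncated description, then the high score, the smallest sum probability P(N), and N. The following is a minimal Python sketch of how such lines might be read programmatically. The regular expression, the function name parse_hit, and the field names are illustrative assumptions and are not part of the BLAST output or of any BLAST tool. Treating the "Threshold: 0.001" parameter as a cutoff on P(N) is likewise an assumption.

```python
import re

# Hypothetical pattern for one summary line; the field names are our own.
HIT_LINE = re.compile(
    r'^(?P<db_id>\S+) - symbol:(?P<symbol>\S+) .*?'
    r'\s+(?P<score>\d+)\s+(?P<p_value>\d\S*)\s+(?P<n>\d+)\s*$'
)

def parse_hit(line):
    """Return a dict for one summary line, or None if the line does not match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    return {
        "db_id": m.group("db_id"),
        "symbol": m.group("symbol"),
        "score": int(m.group("score")),
        "p_value": float(m.group("p_value")),
        "n": int(m.group("n")),
    }

# Three lines copied from the summary table above.
SAMPLE = '''TAIR|locus:2114970 - symbol:LEJ2 "AT4G36910" species:3702...   728  5.3e-72   1
UNIPROTKB|Q74C73 - symbol:GSU1801 "CBS domain pair-contai...   152  5.8e-11   1
UNIPROTKB|Q604Y3 - symbol:MCA2402 "CBS domain protein" sp...   111  0.00095   1'''

hits = [parse_hit(line) for line in SAMPLE.splitlines()]

# Assumption: the reported threshold (0.001) is an upper bound on P(N).
THRESHOLD = 0.001
below_threshold = [h["symbol"] for h in hits if h["p_value"] <= THRESHOLD]
```

Lines that do not match the pattern (headers, blank lines, alignment text) yield None, so the same function can be mapped over the whole raw report and the None entries dropped.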


>TAIR|locus:2114970 [details] [associations]
            symbol:LEJ2 "AT4G36910" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0009507
            "chloroplast" evidence=ISM;IDA] [GO:0045454 "cell redox
            homeostasis" evidence=IDA] InterPro:IPR000644 Pfam:PF00571
            PROSITE:PS51371 SMART:SM00116 GO:GO:0009507 EMBL:CP002687
            GenomeReviews:CT486007_GR eggNOG:COG0517 GO:GO:0045454 EMBL:Z99707
            EMBL:AL161590 EMBL:AY085164 EMBL:BT003835 EMBL:BT005190
            IPI:IPI00526612 PIR:H85435 RefSeq:NP_195409.1 UniGene:At.31287
            UniGene:At.65827 PDB:4GQV PDB:4GQW PDBsum:4GQV PDBsum:4GQW
            ProteinModelPortal:O23193 SMR:O23193 IntAct:O23193 PaxDb:O23193
            PRIDE:O23193 EnsemblPlants:AT4G36910.1 GeneID:829844
            KEGG:ath:AT4G36910 TAIR:At4g36910 HOGENOM:HOG000187907
            InParanoid:O23193 OMA:GRTENSM PhylomeDB:O23193
            ProtClustDB:CLSN2685590 Genevestigator:O23193 Uniprot:O23193
        Length = 236

 Score = 728 (261.3 bits), Expect = 5.3e-72, P = 5.3e-72
 Identities = 152/233 (65%), Positives = 179/233 (76%)

Query:     5 SIPSCLTLARLNANGVINSVPHLQLPITVAT-PSHLSKRLRFFTVSREVKAFAHNGVG-- 61
             S+P   T  R +++    S P+L LP  ++  P H     R F     + + A +  G  
Sbjct:     7 SVPLSFTPLRASSSP---SSPYLLLPRFLSVQPCHKFTFSRSFPSKSRIPS-ASSAAGST 62

Query:    62 -ITNSVPPRNGTYTVGDFMTKKEDLHAVKTTTTVDEALERLVEKRITGFPVIDDDWKLVG 120
              +TNS  PR+G YTVG+FMTKKEDLH VK TTTVDEALE LVE RITGFPVID+DWKLVG
Sbjct:    63 LMTNSSSPRSGVYTVGEFMTKKEDLHVVKPTTTVDEALELLVENRITGFPVIDEDWKLVG 122

Query:   121 VVSDYDLLALDSISGGNQNDTSLFPNVNSTWKTFNELQRLLSKTNGKVVGDLMTPAPLVV 180
             +VSDYDLLALDSISG  + + S+FP V+STWKTFN +Q+LLSKTNGK+VGDLMTPAPLVV
Sbjct:   123 LVSDYDLLALDSISGSGRTENSMFPEVDSTWKTFNAVQKLLSKTNGKLVGDLMTPAPLVV 182

Query:   181 HENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRAALQIKRDGERS 233
              E TNLEDAA++LLETKYRRLPVVD  GKLVG+ITRGNVVRAALQIKR G+R+
Sbjct:   183 EEKTNLEDAAKILLETKYRRLPVVDSDGKLVGIITRGNVVRAALQIKRSGDRN 235


>TAIR|locus:2124276 [details] [associations]
            symbol:LEJ1 "AT4G34120" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0009507
            "chloroplast" evidence=ISM;IDA] [GO:0009570 "chloroplast stroma"
            evidence=IDA] [GO:0045454 "cell redox homeostasis" evidence=IDA]
            [GO:0042744 "hydrogen peroxide catabolic process" evidence=RCA]
            InterPro:IPR000644 Pfam:PF00571 PROSITE:PS51371 SMART:SM00116
            GO:GO:0009570 EMBL:CP002687 GenomeReviews:CT486007_GR
            eggNOG:COG0517 GO:GO:0045454 EMBL:AL021961 EMBL:AL161584
            HOGENOM:HOG000187907 ProtClustDB:CLSN2685590 EMBL:AF360331
            EMBL:AY056339 IPI:IPI00529341 PIR:T05424 RefSeq:NP_567952.1
            UniGene:At.27243 UniGene:At.49487 UniGene:At.68482 PDB:3SL7
            PDBsum:3SL7 ProteinModelPortal:Q9C5D0 SMR:Q9C5D0 IntAct:Q9C5D0
            STRING:Q9C5D0 PaxDb:Q9C5D0 PRIDE:Q9C5D0 ProMEX:Q9C5D0
            EnsemblPlants:AT4G34120.1 GeneID:829558 KEGG:ath:AT4G34120
            TAIR:At4g34120 InParanoid:Q9C5D0 OMA:KRETENS PhylomeDB:Q9C5D0
            Genevestigator:Q9C5D0 Uniprot:Q9C5D0
        Length = 238

 Score = 724 (259.9 bits), Expect = 1.4e-71, P = 1.4e-71
 Identities = 155/241 (64%), Positives = 190/241 (78%)

Query:     1 MSSISIPSCLTLARLNANGVINSVPHLQ-LPITVATPS--HLSKRLRF--FTVSREVKAF 55
             M SIS+ + + + RL    ++ S+ H   LPI+ ++ S   LS R R   F+ S  V AF
Sbjct:     1 MGSISLSNSMPITRLP---LLTSLYHQSFLPISSSSFSLLPLSNRRRSSTFSPSITVSAF 57

Query:    56 --AHNGVGITNSVPPRNGTYTVGDFMTKKEDLHAVKTTTTVDEALERLVEKRITGFPVID 113
               A   V   NSVP +NG YTVGDFMT +++LH VK +T+VD+ALE LVEK++TG PVID
Sbjct:    58 FAAPASVNNNNSVPAKNGGYTVGDFMTPRQNLHVVKPSTSVDDALELLVEKKVTGLPVID 117

Query:   114 DDWKLVGVVSDYDLLALDSISGGNQNDTSLFPNVNSTWKTFNELQRLLSKTNGKVVGDLM 173
             D+W LVGVVSDYDLLALDSISG +QNDT+LFP+V+STWKTFNELQ+L+SKT GKVVGDLM
Sbjct:   118 DNWTLVGVVSDYDLLALDSISGRSQNDTNLFPDVDSTWKTFNELQKLISKTYGKVVGDLM 177

Query:   174 TPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRAALQIKRDGERS 233
             TP+PLVV ++TNLEDAARLLLETK+RRLPVVD  GKL+G++TRGNVVRAALQIKR+ E S
Sbjct:   178 TPSPLVVRDSTNLEDAARLLLETKFRRLPVVDADGKLIGILTRGNVVRAALQIKRETENS 237

Query:   234 T 234
             T
Sbjct:   238 T 238


>UNIPROTKB|Q47ZU9 [details] [associations]
            symbol:CPS_2970 "CBS domain protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000644 Pfam:PF00571 PROSITE:PS51371 SMART:SM00116
            eggNOG:COG0517 EMBL:CP000083 GenomeReviews:CP000083_GR
            HOGENOM:HOG000187907 RefSeq:YP_269670.1 ProteinModelPortal:Q47ZU9
            STRING:Q47ZU9 GeneID:3522254 KEGG:cps:CPS_2970 PATRIC:21468959
            OMA:LEHRISG BioCyc:CPSY167879:GI48-3019-MONOMER Uniprot:Q47ZU9
        Length = 147

 Score = 113 (44.8 bits), Expect = 9.7e-12, Sum P(2) = 9.7e-12
 Identities = 31/70 (44%), Positives = 39/70 (55%)

Query:   164 TNGKVVGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRAA 223
             T G++V D MT     +  NTNL D A   L    R LPVV+G GK+VGL+ R  V+   
Sbjct:    68 TIGELVRDRMTTEVESIGTNTNLVDVAEKFLHNNRRVLPVVEG-GKVVGLVQRRGVLEQL 126

Query:   224 L-QIKRDGER 232
             L QI   GE+
Sbjct:   127 LEQIDSKGEQ 136

 Score = 66 (28.3 bits), Expect = 9.7e-12, Sum P(2) = 9.7e-12
 Identities = 14/40 (35%), Positives = 25/40 (62%)

Query:    97 ALERLVEKRITGFPVIDDDWKLVGVVSDYDLLALDSISGG 136
             AL +L+   ++G PV+D D  L+G +S+ D +   ++ GG
Sbjct:    26 ALRQLLTASLSGAPVVDGDNNLIGFLSEADCMR-GALMGG 64


>TIGR_CMR|CPS_2970 [details] [associations]
            symbol:CPS_2970 "CBS domain protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000644 Pfam:PF00571 PROSITE:PS51371 SMART:SM00116
            eggNOG:COG0517 EMBL:CP000083 GenomeReviews:CP000083_GR
            HOGENOM:HOG000187907 RefSeq:YP_269670.1 ProteinModelPortal:Q47ZU9
            STRING:Q47ZU9 GeneID:3522254 KEGG:cps:CPS_2970 PATRIC:21468959
            OMA:LEHRISG BioCyc:CPSY167879:GI48-3019-MONOMER Uniprot:Q47ZU9
        Length = 147

 Score = 113 (44.8 bits), Expect = 9.7e-12, Sum P(2) = 9.7e-12
 Identities = 31/70 (44%), Positives = 39/70 (55%)

Query:   164 TNGKVVGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRAA 223
             T G++V D MT     +  NTNL D A   L    R LPVV+G GK+VGL+ R  V+   
Sbjct:    68 TIGELVRDRMTTEVESIGTNTNLVDVAEKFLHNNRRVLPVVEG-GKVVGLVQRRGVLEQL 126

Query:   224 L-QIKRDGER 232
             L QI   GE+
Sbjct:   127 LEQIDSKGEQ 136

 Score = 66 (28.3 bits), Expect = 9.7e-12, Sum P(2) = 9.7e-12
 Identities = 14/40 (35%), Positives = 25/40 (62%)

Query:    97 ALERLVEKRITGFPVIDDDWKLVGVVSDYDLLALDSISGG 136
             AL +L+   ++G PV+D D  L+G +S+ D +   ++ GG
Sbjct:    26 ALRQLLTASLSGAPVVDGDNNLIGFLSEADCMR-GALMGG 64


>UNIPROTKB|Q3AFN9 [details] [associations]
            symbol:acuB "Acetoin utilization protein AcuB"
            species:246194 "Carboxydothermus hydrogenoformans Z-2901"
            [GO:0003674 "molecular_function" evidence=ND] [GO:0006091
            "generation of precursor metabolites and energy" evidence=ISS]
            [GO:0045150 "acetoin catabolic process" evidence=ISS]
            InterPro:IPR000644 InterPro:IPR002912 Pfam:PF00571 Pfam:PF01842
            PROSITE:PS51371 SMART:SM00116 EMBL:CP000141
            GenomeReviews:CP000141_GR eggNOG:COG0517 GO:GO:0006091
            GO:GO:0016597 GO:GO:0045150 RefSeq:YP_359045.1
            ProteinModelPortal:Q3AFN9 STRING:Q3AFN9 GeneID:3726273
            KEGG:chy:CHY_0173 PATRIC:21273531 HOGENOM:HOG000187908 KO:K04767
            OMA:VQTMNPI BioCyc:CHYD246194:GJCN-174-MONOMER Uniprot:Q3AFN9
        Length = 210

 Score = 115 (45.5 bits), Expect = 1.4e-11, Sum P(2) = 1.4e-11
 Identities = 32/66 (48%), Positives = 41/66 (62%)

Query:    75 VGDFMTKKEDLHAVKTTTTVDEALERLVEKRITGFPVIDDDWKLVGVVSDYDL-LALDSI 133
             V D MT+  +L  VK+T T+ EA+ +  EKRI   PV+DD  KLVG+VSD DL  A  S 
Sbjct:     3 VKDIMTR--ELITVKSTDTIREAMAKGHEKRIRHLPVVDDG-KLVGIVSDRDLRYACPSP 59

Query:   134 SGGNQN 139
               G +N
Sbjct:    60 FTGEKN 65

 Score = 95 (38.5 bits), Expect = 1.4e-11, Sum P(2) = 1.4e-11
 Identities = 23/66 (34%), Positives = 41/66 (62%)

Query:   169 VGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRAALQIKR 228
             VGD+M    +  H    +E+AA+++LE +   LPV+    +LVG+IT+G++V A  ++  
Sbjct:    73 VGDIMQKRVVTAHPLDPVEEAAKMMLENRVGCLPVLLD-DELVGIITQGDIVMAFAELMG 131

Query:   229 DGERST 234
               +RS+
Sbjct:   132 VYKRSS 137


>TIGR_CMR|CHY_0173 [details] [associations]
            symbol:CHY_0173 "acetoin utilization protein AcuB"
            species:246194 "Carboxydothermus hydrogenoformans Z-2901"
            [GO:0003674 "molecular_function" evidence=ND] [GO:0006091
            "generation of precursor metabolites and energy" evidence=ISS]
            [GO:0045150 "acetoin catabolic process" evidence=ISS]
            InterPro:IPR000644 InterPro:IPR002912 Pfam:PF00571 Pfam:PF01842
            PROSITE:PS51371 SMART:SM00116 EMBL:CP000141
            GenomeReviews:CP000141_GR eggNOG:COG0517 GO:GO:0006091
            GO:GO:0016597 GO:GO:0045150 RefSeq:YP_359045.1
            ProteinModelPortal:Q3AFN9 STRING:Q3AFN9 GeneID:3726273
            KEGG:chy:CHY_0173 PATRIC:21273531 HOGENOM:HOG000187908 KO:K04767
            OMA:VQTMNPI BioCyc:CHYD246194:GJCN-174-MONOMER Uniprot:Q3AFN9
        Length = 210

 Score = 115 (45.5 bits), Expect = 1.4e-11, Sum P(2) = 1.4e-11
 Identities = 32/66 (48%), Positives = 41/66 (62%)

Query:    75 VGDFMTKKEDLHAVKTTTTVDEALERLVEKRITGFPVIDDDWKLVGVVSDYDL-LALDSI 133
             V D MT+  +L  VK+T T+ EA+ +  EKRI   PV+DD  KLVG+VSD DL  A  S 
Sbjct:     3 VKDIMTR--ELITVKSTDTIREAMAKGHEKRIRHLPVVDDG-KLVGIVSDRDLRYACPSP 59

Query:   134 SGGNQN 139
               G +N
Sbjct:    60 FTGEKN 65

 Score = 95 (38.5 bits), Expect = 1.4e-11, Sum P(2) = 1.4e-11
 Identities = 23/66 (34%), Positives = 41/66 (62%)

Query:   169 VGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRAALQIKR 228
             VGD+M    +  H    +E+AA+++LE +   LPV+    +LVG+IT+G++V A  ++  
Sbjct:    73 VGDIMQKRVVTAHPLDPVEEAAKMMLENRVGCLPVLLD-DELVGIITQGDIVMAFAELMG 131

Query:   229 DGERST 234
               +RS+
Sbjct:   132 VYKRSS 137


>UNIPROTKB|Q74BN1 [details] [associations]
            symbol:GSU2010 "CBS domain pair-containing protein"
            species:243231 "Geobacter sulfurreducens PCA" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR000644 InterPro:IPR013785 Pfam:PF00571
            PROSITE:PS51371 SMART:SM00116 GO:GO:0003824 Gene3D:3.20.20.70
            GO:GO:0008152 EMBL:AE017180 GenomeReviews:AE017180_GR
            HOGENOM:HOG000187907 KO:K04767 RefSeq:NP_953059.1
            ProteinModelPortal:Q74BN1 GeneID:2688077 KEGG:gsu:GSU2010
            PATRIC:22026881 OMA:TAREDSS ProtClustDB:CLSK924560
            BioCyc:GSUL243231:GH27-2044-MONOMER Uniprot:Q74BN1
        Length = 144

 Score = 113 (44.8 bits), Expect = 1.6e-11, Sum P(2) = 1.6e-11
 Identities = 30/80 (37%), Positives = 47/80 (58%)

Query:   150 TWKTFNELQRLLSKTNGKVVGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGK 209
             TW    E+  +LSKT+   V + M P P  V  +T+L +AA+LL + K   + VVD   +
Sbjct:    66 TW----EVHYILSKTS---VTEAMNPKPYKVKPDTDLTEAAQLLHDRKLNGVLVVDDNDR 118

Query:   210 LVGLITRGNVVRAALQIKRD 229
             LVG++T  N + A ++I +D
Sbjct:   119 LVGILTVTNALEALIEICKD 138

 Score = 64 (27.6 bits), Expect = 1.6e-11, Sum P(2) = 1.6e-11
 Identities = 17/50 (34%), Positives = 29/50 (58%)

Query:    75 VGDFMTKKEDLHAVKTTTTVDEALERLVEKRITGFPVIDDDWKLVGVVSD 124
             VG +MTK      ++   TV EA+  + EK I   PV+D +  +VG++++
Sbjct:     4 VGTWMTKNPV--TIEKDATVIEAVHLMKEKSIRRLPVMDKE-TIVGILTE 50


>TIGR_CMR|GSU_2010 [details] [associations]
            symbol:GSU_2010 "CBS domain protein" species:243231
            "Geobacter sulfurreducens PCA" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000644 InterPro:IPR013785 Pfam:PF00571 PROSITE:PS51371
            SMART:SM00116 GO:GO:0003824 Gene3D:3.20.20.70 GO:GO:0008152
            EMBL:AE017180 GenomeReviews:AE017180_GR HOGENOM:HOG000187907
            KO:K04767 RefSeq:NP_953059.1 ProteinModelPortal:Q74BN1
            GeneID:2688077 KEGG:gsu:GSU2010 PATRIC:22026881 OMA:TAREDSS
            ProtClustDB:CLSK924560 BioCyc:GSUL243231:GH27-2044-MONOMER
            Uniprot:Q74BN1
        Length = 144

 Score = 113 (44.8 bits), Expect = 1.6e-11, Sum P(2) = 1.6e-11
 Identities = 30/80 (37%), Positives = 47/80 (58%)

Query:   150 TWKTFNELQRLLSKTNGKVVGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGK 209
             TW    E+  +LSKT+   V + M P P  V  +T+L +AA+LL + K   + VVD   +
Sbjct:    66 TW----EVHYILSKTS---VTEAMNPKPYKVKPDTDLTEAAQLLHDRKLNGVLVVDDNDR 118

Query:   210 LVGLITRGNVVRAALQIKRD 229
             LVG++T  N + A ++I +D
Sbjct:   119 LVGILTVTNALEALIEICKD 138

 Score = 64 (27.6 bits), Expect = 1.6e-11, Sum P(2) = 1.6e-11
 Identities = 17/50 (34%), Positives = 29/50 (58%)

Query:    75 VGDFMTKKEDLHAVKTTTTVDEALERLVEKRITGFPVIDDDWKLVGVVSD 124
             VG +MTK      ++   TV EA+  + EK I   PV+D +  +VG++++
Sbjct:     4 VGTWMTKNPV--TIEKDATVIEAVHLMKEKSIRRLPVMDKE-TIVGILTE 50


>UNIPROTKB|Q74C73 [details] [associations]
            symbol:GSU1801 "CBS domain pair-containing protein"
            species:243231 "Geobacter sulfurreducens PCA" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR000644 InterPro:IPR013785 Pfam:PF00571
            PROSITE:PS51371 SMART:SM00116 GO:GO:0003824 Gene3D:3.20.20.70
            GO:GO:0008152 EMBL:AE017180 GenomeReviews:AE017180_GR
            HOGENOM:HOG000187907 RefSeq:NP_952851.1 ProteinModelPortal:Q74C73
            GeneID:2688718 KEGG:gsu:GSU1801 PATRIC:22026457 OMA:NGWCVAD
            ProtClustDB:CLSK828567 BioCyc:GSUL243231:GH27-1844-MONOMER
            Uniprot:Q74C73
        Length = 149

 Score = 152 (58.6 bits), Expect = 5.8e-11, P = 5.8e-11
 Identities = 46/148 (31%), Positives = 74/148 (50%)

Query:    74 TVGDFMTKKEDLHAVKTTTTVDEALERLVEKRITGFPVIDDDWKLVGVVSDYDLLALDSI 133
             TV D MT   ++   +  TT+ E  E   + RI   PV+D+   L G+V++ DL+  D  
Sbjct:     3 TVRDIMTT--NVVTARRETTIRELAELFSKYRIGSIPVVDEAGNLAGIVTESDLIEQDK- 59

Query:   134 SGGNQNDTSLFPNVNSTWKTFNELQRLLSKTNGKVVGDLMTPAPLVVHENTNLEDAARLL 193
             S       SLF  V    ++  + +R + K  G+  GD+ T     V   T +   A ++
Sbjct:    60 SLHIPTVISLFDWVIYL-ESAKKFEREIQKVTGQTAGDIYTTEVESVTPETPVSTVADIM 118

Query:   194 LETKYRRLPVVDGYGKLVGLITRGNVVR 221
                K   LPVV+G  KLVG+++R +++R
Sbjct:   119 ANKKLHTLPVVEGK-KLVGIVSRIDLIR 145


>TIGR_CMR|GSU_1801 [details] [associations]
            symbol:GSU_1801 "CBS domain protein" species:243231
            "Geobacter sulfurreducens PCA" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000644 InterPro:IPR013785 Pfam:PF00571 PROSITE:PS51371
            SMART:SM00116 GO:GO:0003824 Gene3D:3.20.20.70 GO:GO:0008152
            EMBL:AE017180 GenomeReviews:AE017180_GR HOGENOM:HOG000187907
            RefSeq:NP_952851.1 ProteinModelPortal:Q74C73 GeneID:2688718
            KEGG:gsu:GSU1801 PATRIC:22026457 OMA:NGWCVAD ProtClustDB:CLSK828567
            BioCyc:GSUL243231:GH27-1844-MONOMER Uniprot:Q74C73
        Length = 149

 Score = 152 (58.6 bits), Expect = 5.8e-11, P = 5.8e-11
 Identities = 46/148 (31%), Positives = 74/148 (50%)

Query:    74 TVGDFMTKKEDLHAVKTTTTVDEALERLVEKRITGFPVIDDDWKLVGVVSDYDLLALDSI 133
             TV D MT   ++   +  TT+ E  E   + RI   PV+D+   L G+V++ DL+  D  
Sbjct:     3 TVRDIMTT--NVVTARRETTIRELAELFSKYRIGSIPVVDEAGNLAGIVTESDLIEQDK- 59

Query:   134 SGGNQNDTSLFPNVNSTWKTFNELQRLLSKTNGKVVGDLMTPAPLVVHENTNLEDAARLL 193
             S       SLF  V    ++  + +R + K  G+  GD+ T     V   T +   A ++
Sbjct:    60 SLHIPTVISLFDWVIYL-ESAKKFEREIQKVTGQTAGDIYTTEVESVTPETPVSTVADIM 118

Query:   194 LETKYRRLPVVDGYGKLVGLITRGNVVR 221
                K   LPVV+G  KLVG+++R +++R
Sbjct:   119 ANKKLHTLPVVEGK-KLVGIVSRIDLIR 145


>DICTYBASE|DDB_G0289609 [details] [associations]
            symbol:DDB_G0289609 "CBS
            (cystathionine-beta-synthase) domain-containing protein"
            species:44689 "Dictyostelium discoideum" [GO:0008150
            "biological_process" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
            [GO:0044351 "macropinocytosis" evidence=RCA] InterPro:IPR000644
            Pfam:PF00571 PROSITE:PS51371 SMART:SM00116 dictyBase:DDB_G0289609
            eggNOG:COG0517 EMBL:AAFI02000147 RefSeq:XP_636127.1
            ProteinModelPortal:Q54H97 STRING:Q54H97 PRIDE:Q54H97
            EnsemblProtists:DDB0230005 GeneID:8627230 KEGG:ddi:DDB_G0289609
            InParanoid:Q54H97 OMA:NANSIHR ProtClustDB:CLSZ2429757
            Uniprot:Q54H97
        Length = 145

 Score = 90 (36.7 bits), Expect = 8.5e-09, Sum P(2) = 8.5e-09
 Identities = 24/59 (40%), Positives = 33/59 (55%)

Query:    75 VGDFMTKKEDLHAVKTTTTVDEALERLVEKRITGFPVIDDDWKLVGVVSDYDL-LALDS 132
             V   M+K   L  +   TT+D AL+ L    I   PV+D+D  L G+++D DL LA DS
Sbjct:     5 VKQLMSKS--LFTINLDTTLDVALKSLNANSIHRLPVVDNDGNLKGIITDRDLRLATDS 61

 Score = 89 (36.4 bits), Expect = 8.5e-09, Sum P(2) = 8.5e-09
 Identities = 18/66 (27%), Positives = 38/66 (57%)

Query:   161 LSKTNGKVVGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVV 220
             L K     V  +M   P+ + + + + +AA+L+  T    LPV+D  G+L+G++TR +++
Sbjct:    72 LEKLRLHKVSSIMKQNPVTIEDFSPVVEAAKLMRVTNVGGLPVLDKKGRLIGMVTRSDLL 131

Query:   221 RAALQI 226
                +++
Sbjct:   132 DLLIKV 137


>UNIPROTKB|Q3AFS5 [details] [associations]
            symbol:CHY_0137 "CBS/GGDEF domain protein" species:246194
            "Carboxydothermus hydrogenoformans Z-2901" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR000644 InterPro:IPR001054
            InterPro:IPR013785 Pfam:PF00571 PROSITE:PS51371 SMART:SM00116
            Pfam:PF00990 Gene3D:3.20.20.70 GO:GO:0035556 EMBL:CP000141
            GenomeReviews:CP000141_GR eggNOG:COG0517 GO:GO:0005622
            SUPFAM:SSF55073 GO:GO:0016849 GO:GO:0009190 InterPro:IPR000160
            TIGRFAMs:TIGR00254 PROSITE:PS50887 RefSeq:YP_359009.1
            ProteinModelPortal:Q3AFS5 STRING:Q3AFS5 GeneID:3727538
            KEGG:chy:CHY_0137 PATRIC:21273467 HOGENOM:HOG000262568 OMA:EIAIIFL
            BioCyc:CHYD246194:GJCN-137-MONOMER Uniprot:Q3AFS5
        Length = 271

 Score = 110 (43.8 bits), Expect = 3.1e-07, Sum P(2) = 3.1e-07
 Identities = 25/62 (40%), Positives = 35/62 (56%)

Query:   160 LLSKTNGKVVGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNV 219
             L+     ++V D MT  P+V+ E   L DA + +LE    RLPV+D  G L G+ITR  +
Sbjct:    54 LIFYPENRLVIDAMTEEPVVIEEKAYLFDAYQKMLENNIERLPVIDESGALTGIITRKVI 113

Query:   220 VR 221
              R
Sbjct:   114 ER 115

 Score = 65 (27.9 bits), Expect = 3.1e-07, Sum P(2) = 3.1e-07
 Identities = 12/33 (36%), Positives = 24/33 (72%)

Query:    96 EALERLVEKRITGFPVIDDDWKLVGVVSDYDLL 128
             +A E +  +RI G PV++++ KL+G+++  DL+
Sbjct:    24 DAKELMRSQRIGGLPVVENE-KLIGIITSKDLI 55


>TIGR_CMR|CHY_0137 [details] [associations]
            symbol:CHY_0137 "CBS/GGDEF domain protein" species:246194
            "Carboxydothermus hydrogenoformans Z-2901" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR000644 InterPro:IPR001054
            InterPro:IPR013785 Pfam:PF00571 PROSITE:PS51371 SMART:SM00116
            Pfam:PF00990 Gene3D:3.20.20.70 GO:GO:0035556 EMBL:CP000141
            GenomeReviews:CP000141_GR eggNOG:COG0517 GO:GO:0005622
            SUPFAM:SSF55073 GO:GO:0016849 GO:GO:0009190 InterPro:IPR000160
            TIGRFAMs:TIGR00254 PROSITE:PS50887 RefSeq:YP_359009.1
            ProteinModelPortal:Q3AFS5 STRING:Q3AFS5 GeneID:3727538
            KEGG:chy:CHY_0137 PATRIC:21273467 HOGENOM:HOG000262568 OMA:EIAIIFL
            BioCyc:CHYD246194:GJCN-137-MONOMER Uniprot:Q3AFS5
        Length = 271

 Score = 110 (43.8 bits), Expect = 3.1e-07, Sum P(2) = 3.1e-07
 Identities = 25/62 (40%), Positives = 35/62 (56%)

Query:   160 LLSKTNGKVVGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNV 219
             L+     ++V D MT  P+V+ E   L DA + +LE    RLPV+D  G L G+ITR  +
Sbjct:    54 LIFYPENRLVIDAMTEEPVVIEEKAYLFDAYQKMLENNIERLPVIDESGALTGIITRKVI 113

Query:   220 VR 221
              R
Sbjct:   114 ER 115

 Score = 65 (27.9 bits), Expect = 3.1e-07, Sum P(2) = 3.1e-07
 Identities = 12/33 (36%), Positives = 24/33 (72%)

Query:    96 EALERLVEKRITGFPVIDDDWKLVGVVSDYDLL 128
             +A E +  +RI G PV++++ KL+G+++  DL+
Sbjct:    24 DAKELMRSQRIGGLPVVENE-KLIGIITSKDLI 55


>UNIPROTKB|P65167 [details] [associations]
            symbol:guaB "Inosine-5'-monophosphate dehydrogenase"
            species:1773 "Mycobacterium tuberculosis" [GO:0003938 "IMP
            dehydrogenase activity" evidence=IDA] [GO:0005618 "cell wall"
            evidence=IDA] [GO:0005886 "plasma membrane" evidence=IDA]
            [GO:0006177 "GMP biosynthetic process" evidence=IDA] [GO:0006204
            "IMP catabolic process" evidence=IDA] [GO:0040007 "growth"
            evidence=IMP] [GO:0097293 "XMP biosynthetic process" evidence=IDA]
            UniPathway:UPA00601 HAMAP:MF_01964 InterPro:IPR000644
            InterPro:IPR001093 InterPro:IPR005990 InterPro:IPR013785
            InterPro:IPR015875 Pfam:PF00478 Pfam:PF00571 PIRSF:PIRSF000130
            PROSITE:PS00487 PROSITE:PS51371 SMART:SM00116 GO:GO:0005886
            GO:GO:0040007 GO:GO:0005618 Gene3D:3.20.20.70 EMBL:AE000516
            GenomeReviews:AE000516_GR GenomeReviews:AL123456_GR GO:GO:0046872
            EMBL:BX842583 eggNOG:COG0517 GO:GO:0006177 KO:K00088 GO:GO:0003938
            PANTHER:PTHR11911:SF6 TIGRFAMs:TIGR01302 OMA:SAGLKES
            HOGENOM:HOG000165755 ProtClustDB:PRK05567 PIR:H70736
            RefSeq:NP_217928.1 RefSeq:NP_338043.1 RefSeq:YP_006516897.1
            ProteinModelPortal:P65167 SMR:P65167 PRIDE:P65167
            EnsemblBacteria:EBMYCT00000000064 EnsemblBacteria:EBMYCT00000069182
            GeneID:13317015 GeneID:887498 GeneID:922953 KEGG:mtc:MT3519
            KEGG:mtu:Rv3411c KEGG:mtv:RVBD_3411c PATRIC:18129477
            TubercuList:Rv3411c GO:GO:0006204 GO:GO:0097293 Uniprot:P65167
        Length = 529

 Score = 105 (42.0 bits), Expect = 5.6e-07, Sum P(3) = 5.6e-07
 Identities = 25/56 (44%), Positives = 33/56 (58%)

Query:   167 KVVGDLMTPAPLVV-HENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVR 221
             K V ++MT APL+   E  +   A  LL   K  +LPVVDG G+L GLIT  + V+
Sbjct:   183 KQVAEVMTKAPLITAQEGVSASAALGLLRRNKIEKLPVVDGRGRLTGLITVKDFVK 238

 Score = 75 (31.5 bits), Expect = 5.6e-07, Sum P(3) = 5.6e-07
 Identities = 28/97 (28%), Positives = 46/97 (47%)

Query:    39 LSKRLRFFTVSREVKAFAH-NGVGITN-SVPPRNGTYTVGDFMTKKEDLHAVKTTTTV-- 94
             +S  +   T SR   A A   G+G+ + ++P       V   M K+ +   V    T   
Sbjct:    81 VSSAMDTVTESRMAIAMARAGGMGVLHRNLPVAEQAGQVE--MVKRSEAGMVTDPVTCRP 138

Query:    95 DEALERL----VEKRITGFPVIDDDWKLVGVVSDYDL 127
             D  L ++       RI+G PV+DDD  LVG++++ D+
Sbjct:   139 DNTLAQVDALCARFRISGLPVVDDDGALVGIITNRDM 175

 Score = 37 (18.1 bits), Expect = 5.6e-07, Sum P(3) = 5.6e-07
 Identities = 7/16 (43%), Positives = 11/16 (68%)

Query:    29 LPITVATPSHLSKRLR 44
             +P T  T S L+K++R
Sbjct:    60 VPATADTSSQLTKKIR 75


>UNIPROTKB|Q5LRC8 [details] [associations]
            symbol:SPO2201 "CBS domain protein" species:246200
            "Ruegeria pomeroyi DSS-3" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000644 Pfam:PF00571 PROSITE:PS51371 SMART:SM00116
            EMBL:CP000031 GenomeReviews:CP000031_GR HOGENOM:HOG000187909
            OMA:ERMTDRR RefSeq:YP_167427.1 ProteinModelPortal:Q5LRC8
            GeneID:3192666 KEGG:sil:SPO2201 PATRIC:23377753
            ProtClustDB:CLSK933793 Uniprot:Q5LRC8
        Length = 144

 Score = 91 (37.1 bits), Expect = 3.7e-06, Sum P(2) = 3.7e-06
 Identities = 20/56 (35%), Positives = 33/56 (58%)

Query:   169 VGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRAAL 224
             V   MT   +    +  ++D    + E ++R +PV++  GKLVGL+T G+VV+A L
Sbjct:    73 VSTYMTKDLITCGRDAKVQDVLSQMTEGRFRHMPVIEE-GKLVGLVTLGDVVKAQL 127

 Score = 61 (26.5 bits), Expect = 3.7e-06, Sum P(2) = 3.7e-06
 Identities = 18/49 (36%), Positives = 27/49 (55%)

Query:    88 VKTTTTVDEALERLVEKRITGFPVIDDDWKLV-GVVSDYDLLALDSISG 135
             V    TV +A   L EKRI G  V+ DD +   G++S+ D++   + SG
Sbjct:    18 VSAAATVAQAAAILAEKRI-GTVVVSDDGETAQGILSERDIVRELAASG 65


>TIGR_CMR|SPO_2201 [details] [associations]
            symbol:SPO_2201 "CBS domain protein" species:246200
            "Ruegeria pomeroyi DSS-3" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000644 Pfam:PF00571 PROSITE:PS51371 SMART:SM00116
            EMBL:CP000031 GenomeReviews:CP000031_GR HOGENOM:HOG000187909
            OMA:ERMTDRR RefSeq:YP_167427.1 ProteinModelPortal:Q5LRC8
            GeneID:3192666 KEGG:sil:SPO2201 PATRIC:23377753
            ProtClustDB:CLSK933793 Uniprot:Q5LRC8
        Length = 144

 Score = 91 (37.1 bits), Expect = 3.7e-06, Sum P(2) = 3.7e-06
 Identities = 20/56 (35%), Positives = 33/56 (58%)

Query:   169 VGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRAAL 224
             V   MT   +    +  ++D    + E ++R +PV++  GKLVGL+T G+VV+A L
Sbjct:    73 VSTYMTKDLITCGRDAKVQDVLSQMTEGRFRHMPVIEE-GKLVGLVTLGDVVKAQL 127

 Score = 61 (26.5 bits), Expect = 3.7e-06, Sum P(2) = 3.7e-06
 Identities = 18/49 (36%), Positives = 27/49 (55%)

Query:    88 VKTTTTVDEALERLVEKRITGFPVIDDDWKLV-GVVSDYDLLALDSISG 135
             V    TV +A   L EKRI G  V+ DD +   G++S+ D++   + SG
Sbjct:    18 VSAAATVAQAAAILAEKRI-GTVVVSDDGETAQGILSERDIVRELAASG 65


>UNIPROTKB|Q8EEY2
            symbol:SO_2228 "CBS domain containing protein"
            species:211586 "Shewanella oneidensis MR-1" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR000644 Pfam:PF00571 PROSITE:PS51371
            SMART:SM00116 EMBL:AE014299 GenomeReviews:AE014299_GR
            HOGENOM:HOG000187907 RefSeq:NP_717825.1 ProteinModelPortal:Q8EEY2
            GeneID:1169964 KEGG:son:SO_2228 PATRIC:23524087 OMA:YHCQDTH
            ProtClustDB:CLSK906643 Uniprot:Q8EEY2
        Length = 138

 Score = 79 (32.9 bits), Expect = 2.7e-05, Sum P(2) = 2.7e-05
 Identities = 22/55 (40%), Positives = 30/55 (54%)

Query:   168 VVGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRA 222
             VV D M    L V  + ++   A  +L  K +  PVVD  GK++G I R NV+RA
Sbjct:    72 VVKDCMRTEVLYVGPDESILQLAEQMLGAKPKIYPVVDN-GKVIGTINRTNVLRA 125

 Score = 69 (29.3 bits), Expect = 2.7e-05, Sum P(2) = 2.7e-05
 Identities = 17/56 (30%), Positives = 31/56 (55%)

Query:    75 VGDFMTKKEDLHAVKTTTTVDEALERLVEKRITGFPVIDDDWKLVGVVSDYDLLAL 130
             + D M ++  L  ++   ++  A+E+L++    G  V+DD   LVG +S  D LA+
Sbjct:     6 IRDHMDRQPVL--LRANMSLATAVEKLLDNNKMGAAVVDDSGNLVGFLSQQDCLAV 59


>TIGR_CMR|SO_2228
            symbol:SO_2228 "CBS domain protein" species:211586
            "Shewanella oneidensis MR-1" [GO:0008150 "biological_process"
            evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
            InterPro:IPR000644 Pfam:PF00571 PROSITE:PS51371 SMART:SM00116
            EMBL:AE014299 GenomeReviews:AE014299_GR HOGENOM:HOG000187907
            RefSeq:NP_717825.1 ProteinModelPortal:Q8EEY2 GeneID:1169964
            KEGG:son:SO_2228 PATRIC:23524087 OMA:YHCQDTH ProtClustDB:CLSK906643
            Uniprot:Q8EEY2
        Length = 138

 Score = 79 (32.9 bits), Expect = 2.7e-05, Sum P(2) = 2.7e-05
 Identities = 22/55 (40%), Positives = 30/55 (54%)

Query:   168 VVGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRA 222
             VV D M    L V  + ++   A  +L  K +  PVVD  GK++G I R NV+RA
Sbjct:    72 VVKDCMRTEVLYVGPDESILQLAEQMLGAKPKIYPVVDN-GKVIGTINRTNVLRA 125

 Score = 69 (29.3 bits), Expect = 2.7e-05, Sum P(2) = 2.7e-05
 Identities = 17/56 (30%), Positives = 31/56 (55%)

Query:    75 VGDFMTKKEDLHAVKTTTTVDEALERLVEKRITGFPVIDDDWKLVGVVSDYDLLAL 130
             + D M ++  L  ++   ++  A+E+L++    G  V+DD   LVG +S  D LA+
Sbjct:     6 IRDHMDRQPVL--LRANMSLATAVEKLLDNNKMGAAVVDDSGNLVGFLSQQDCLAV 59


>UNIPROTKB|Q9KT74
            symbol:VC_1031 "Inosine monophosphate dehydrogenase-related
            protein" species:243277 "Vibrio cholerae O1 biovar El Tor str.
            N16961" [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000644 Pfam:PF00571 PROSITE:PS51371 SMART:SM00116
            EMBL:AE003852 GenomeReviews:AE003852_GR OMA:YHCQDTH PIR:E82251
            RefSeq:NP_230676.1 ProteinModelPortal:Q9KT74 DNASU:2614301
            GeneID:2614301 KEGG:vch:VC1031 PATRIC:20081162
            ProtClustDB:CLSK874186 Uniprot:Q9KT74
        Length = 146

 Score = 79 (32.9 bits), Expect = 4.2e-05, Sum P(2) = 4.2e-05
 Identities = 22/55 (40%), Positives = 32/55 (58%)

Query:   168 VVGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRA 222
             +V D M    L V   T++ + A ++   K +  PVV   GKLVG+ITR +V+RA
Sbjct:    80 LVSDCMHKEVLSVAPETSIIELAAMMKVGKPKMYPVVSD-GKLVGVITRRDVLRA 133

 Score = 69 (29.3 bits), Expect = 4.2e-05, Sum P(2) = 4.2e-05
 Identities = 22/62 (35%), Positives = 33/62 (53%)

Query:    69 RNGTYTVGDFMTKKEDLHAVKTTT--TVDEALERLVEKRITGFPVIDDDWKLVGVVSDYD 126
             R  +  V D+MT    L AV  +   ++  ALE++V     G PVID    ++G +S+ D
Sbjct:     8 RMDSLKVSDYMT----LQAVTFSPDMSLSAALEKVVRSDNMGGPVIDAQRHVIGFLSEQD 63

Query:   127 LL 128
             LL
Sbjct:    64 LL 65


>TIGR_CMR|VC_1031
            symbol:VC_1031 "CBS domain protein" species:686 "Vibrio
            cholerae O1 biovar El Tor" [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR000644 Pfam:PF00571 PROSITE:PS51371
            SMART:SM00116 EMBL:AE003852 GenomeReviews:AE003852_GR OMA:YHCQDTH
            PIR:E82251 RefSeq:NP_230676.1 ProteinModelPortal:Q9KT74
            DNASU:2614301 GeneID:2614301 KEGG:vch:VC1031 PATRIC:20081162
            ProtClustDB:CLSK874186 Uniprot:Q9KT74
        Length = 146

 Score = 79 (32.9 bits), Expect = 4.2e-05, Sum P(2) = 4.2e-05
 Identities = 22/55 (40%), Positives = 32/55 (58%)

Query:   168 VVGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRA 222
             +V D M    L V   T++ + A ++   K +  PVV   GKLVG+ITR +V+RA
Sbjct:    80 LVSDCMHKEVLSVAPETSIIELAAMMKVGKPKMYPVVSD-GKLVGVITRRDVLRA 133

 Score = 69 (29.3 bits), Expect = 4.2e-05, Sum P(2) = 4.2e-05
 Identities = 22/62 (35%), Positives = 33/62 (53%)

Query:    69 RNGTYTVGDFMTKKEDLHAVKTTT--TVDEALERLVEKRITGFPVIDDDWKLVGVVSDYD 126
             R  +  V D+MT    L AV  +   ++  ALE++V     G PVID    ++G +S+ D
Sbjct:     8 RMDSLKVSDYMT----LQAVTFSPDMSLSAALEKVVRSDNMGGPVIDAQRHVIGFLSEQD 63

Query:   127 LL 128
             LL
Sbjct:    64 LL 65


>UNIPROTKB|Q0BYV1
            symbol:HNE_2660 "CBS domain protein" species:228405
            "Hyphomonas neptunium ATCC 15444" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000644 Pfam:PF00571 PROSITE:PS51371 SMART:SM00116
            eggNOG:COG0517 EMBL:CP000158 GenomeReviews:CP000158_GR
            HOGENOM:HOG000187909 OMA:ERMTDRR RefSeq:YP_761342.1
            ProteinModelPortal:Q0BYV1 STRING:Q0BYV1 GeneID:4287219
            KEGG:hne:HNE_2660 PATRIC:32218185
            BioCyc:HNEP228405:GI69-2670-MONOMER Uniprot:Q0BYV1
        Length = 144

 Score = 80 (33.2 bits), Expect = 4.5e-05, Sum P(2) = 4.5e-05
 Identities = 16/53 (30%), Positives = 33/53 (62%)

Query:   169 VGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVR 221
             VG+ MT A + +  +  +++A +L+ + + R LPVV    +L G ++ G++V+
Sbjct:    72 VGNAMTRAVITISADAEVDEALQLMTDRRIRHLPVVRN-SRLTGFVSIGDLVK 123

 Score = 67 (28.6 bits), Expect = 4.5e-05, Sum P(2) = 4.5e-05
 Identities = 15/48 (31%), Positives = 28/48 (58%)

Query:    81 KKEDLHAVKTTTTVDEALERLVEKRITGFPVIDDDWKLVGVVSDYDLL 128
             K  ++  ++   T+ EA   L E+RI     +D D ++VGV+S+ D++
Sbjct:    10 KGREVITLRADDTLREAARLLDERRIGAVVTLDADGEIVGVLSERDIV 57


>TIGR_CMR|CHY_1949
            symbol:CHY_1949 "polyA polymerase family protein"
            species:246194 "Carboxydothermus hydrogenoformans Z-2901"
            [GO:0006396 "RNA processing" evidence=ISS] [GO:0016779
            "nucleotidyltransferase activity" evidence=ISS] InterPro:IPR000644
            InterPro:IPR001667 InterPro:IPR002646 InterPro:IPR013785
            Pfam:PF00571 Pfam:PF01368 Pfam:PF01743 PROSITE:PS51371
            SMART:SM00116 GO:GO:0005524 Gene3D:3.20.20.70 EMBL:CP000141
            GenomeReviews:CP000141_GR GO:GO:0003723 GO:GO:0006396 GO:GO:0016787
            GO:GO:0030145 eggNOG:COG0617 KO:K00974 GO:GO:0016779
            RefSeq:YP_360768.1 ProteinModelPortal:Q3AAR6 STRING:Q3AAR6
            GeneID:3728062 KEGG:chy:CHY_1949 PATRIC:21276985
            HOGENOM:HOG000011569 OMA:FRAIRFE
            BioCyc:CHYD246194:GJCN-1948-MONOMER Uniprot:Q3AAR6
        Length = 864

 Score = 100 (40.3 bits), Expect = 7.0e-05, Sum P(2) = 7.0e-05
 Identities = 20/49 (40%), Positives = 35/49 (71%)

Query:   173 MTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVR 221
             M+  P+ +    +LE+A RLL++    RLPVV+G GKL+G+I+R ++++
Sbjct:   375 MSKNPVTIEPEASLEEALRLLIKHDIGRLPVVEG-GKLIGIISRTDLLK 422

 Score = 68 (29.0 bits), Expect = 7.0e-05, Sum P(2) = 7.0e-05
 Identities = 16/40 (40%), Positives = 26/40 (65%)

Query:    88 VKTTTTVDEALERLVEKRITGFPVIDDDWKLVGVVSDYDL 127
             V   +TV+EA + +V    +G PV++ D KLVG++S  D+
Sbjct:   320 VTVDSTVEEARKIMVRYGHSGLPVLEGD-KLVGIISRRDV 358


>UNIPROTKB|Q48IU0
            symbol:PSPPH_2494 "CBS domain protein" species:264730
            "Pseudomonas syringae pv. phaseolicola 1448A" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR000644 Pfam:PF00571 PROSITE:PS51371
            SMART:SM00116 eggNOG:COG0517 EMBL:CP000058
            GenomeReviews:CP000058_GR HOGENOM:HOG000187909 RefSeq:YP_274691.1
            ProteinModelPortal:Q48IU0 STRING:Q48IU0 GeneID:3557460
            KEGG:psp:PSPPH_2494 PATRIC:19974275 OMA:YARKVIL
            ProtClustDB:CLSK865716 Uniprot:Q48IU0
        Length = 146

 Score = 78 (32.5 bits), Expect = 0.00016, Sum P(2) = 0.00016
 Identities = 17/59 (28%), Positives = 36/59 (61%)

Query:   166 GKVVGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRAAL 224
             G  V ++M+   + V    ++E    ++ ++  R LPVV+  G+L+GL++ G++V+ A+
Sbjct:    72 GTPVREIMSNKVITVDSQQSVEACMGIMTDSHLRHLPVVED-GRLLGLLSIGDLVKEAI 129

 Score = 65 (27.9 bits), Expect = 0.00016, Sum P(2) = 0.00016
 Identities = 15/46 (32%), Positives = 26/46 (56%)

Query:    81 KKEDLHAVKTTTTVDEALERLVEKRITGFPVIDDDWKLVGVVSDYD 126
             + + +H +     V +AL  + EK I   PV++ +  +VGVVS+ D
Sbjct:    14 QNQQVHTIGPDQMVLDALRLMAEKNIGALPVVEGN-VVVGVVSERD 58


>TIGR_CMR|APH_0088
            symbol:APH_0088 "inosine-5'-monophosphate dehydrogenase"
            species:212042 "Anaplasma phagocytophilum HZ" [GO:0003938 "IMP
            dehydrogenase activity" evidence=ISS] [GO:0009152 "purine
            ribonucleotide biosynthetic process" evidence=ISS]
            UniPathway:UPA00601 HAMAP:MF_01964 InterPro:IPR000644
            InterPro:IPR001093 InterPro:IPR005990 InterPro:IPR013785
            InterPro:IPR015875 Pfam:PF00478 Pfam:PF00571 PIRSF:PIRSF000130
            PROSITE:PS00487 PROSITE:PS51371 SMART:SM00116 Gene3D:3.20.20.70
            GO:GO:0000166 GO:GO:0046872 eggNOG:COG0517 EMBL:CP000235
            GenomeReviews:CP000235_GR GO:GO:0006177 KO:K00088 GO:GO:0003938
            PANTHER:PTHR11911:SF6 TIGRFAMs:TIGR01302 OMA:SAGLKES
            HOGENOM:HOG000165755 ProtClustDB:CLSK749478 RefSeq:YP_504716.1
            ProteinModelPortal:Q2GLN5 SMR:Q2GLN5 STRING:Q2GLN5 GeneID:3930597
            KEGG:aph:APH_0088 PATRIC:20948738
            BioCyc:APHA212042:GHPM-123-MONOMER Uniprot:Q2GLN5
        Length = 486

 Score = 88 (36.0 bits), Expect = 0.00067, Sum P(2) = 0.00067
 Identities = 25/58 (43%), Positives = 32/58 (55%)

Query:   159 RLLSKTNGKVVGDLMTPAPLV-VHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLIT 215
             R +   N +V  D+MT   LV V E  +  +A RLL + K  RL V D YG  +GLIT
Sbjct:   136 RFVEDMNCRVC-DIMTKENLVTVREGVSQSEATRLLHKHKIERLIVTDEYGCCIGLIT 192

 Score = 66 (28.3 bits), Expect = 0.00067, Sum P(2) = 0.00067
 Identities = 12/47 (25%), Positives = 29/47 (61%)

Query:    88 VKTTTTVDEALERLVEKRITGFPVIDDDWKLVGVVSDYDLLALDSIS 134
             V    T+  AL  + +   +G PV++++ KLVG++++ D+  ++ ++
Sbjct:    96 VSPDATLKTALSIMQQHSYSGIPVVEENKKLVGIITNRDVRFVEDMN 142


>UNIPROTKB|Q81KX8
            symbol:BAS4505 "Thioesterase family protein" species:1392
            "Bacillus anthracis" [GO:0003674 "molecular_function" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] InterPro:IPR000524
            InterPro:IPR000644 Pfam:PF00392 Pfam:PF00571 PROSITE:PS51371
            SMART:SM00116 Pfam:PF03061 EMBL:AE016879 EMBL:AE017334
            EMBL:AE017225 GenomeReviews:AE016879_GR GenomeReviews:AE017225_GR
            GenomeReviews:AE017334_GR InterPro:IPR006683 GO:GO:0003700
            Gene3D:1.10.10.10 InterPro:IPR011991 GO:GO:0005622
            InterPro:IPR010766 Pfam:PF07085 HOGENOM:HOG000022009 OMA:YFIKPVQ
            ProtClustDB:CLSK873528 RefSeq:NP_847059.1 RefSeq:YP_021501.1
            RefSeq:YP_030753.1 ProteinModelPortal:Q81KX8 DNASU:1084014
            EnsemblBacteria:EBBACT00000012784 EnsemblBacteria:EBBACT00000018567
            EnsemblBacteria:EBBACT00000019988 GeneID:1084014 GeneID:2818548
            GeneID:2850363 KEGG:ban:BA_4858 KEGG:bar:GBAA_4858 KEGG:bat:BAS4505
            BioCyc:BANT260799:GJAJ-4563-MONOMER
            BioCyc:BANT261594:GJ7F-4718-MONOMER Uniprot:Q81KX8
        Length = 437

 Score = 88 (36.0 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 21/60 (35%), Positives = 36/60 (60%)

Query:   169 VGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRAALQIKR 228
             +  +MT  P+ V+   ++  AAR+++      LPVVD   KL G+I+R +V++A   I+R
Sbjct:   252 IDKVMTRHPITVNGKMSVAAAARMMVWEGIELLPVVDEGNKLQGIISRQDVLQALQMIQR 311

 Score = 64 (27.6 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 15/57 (26%), Positives = 33/57 (57%)

Query:    75 VGDFMTKKEDLHAVKTTTTVDEALERLVEKRITG-FPVIDDDWKLVGVVSDYDLLAL 130
             V D +T  E+   +K   TV +      E+ + G +P++D++ K++G+V+  D++ +
Sbjct:   191 VEDILTPIEETLYLKPNDTVQQ-WHAYNEETMHGRYPIVDENKKVLGIVTSKDMIGV 246


>TIGR_CMR|BA_4858
            symbol:BA_4858 "thioesterase family protein" species:198094
            "Bacillus anthracis str. Ames" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000524 InterPro:IPR000644 Pfam:PF00392 Pfam:PF00571
            PROSITE:PS51371 SMART:SM00116 Pfam:PF03061 EMBL:AE016879
            EMBL:AE017334 EMBL:AE017225 GenomeReviews:AE016879_GR
            GenomeReviews:AE017225_GR GenomeReviews:AE017334_GR
            InterPro:IPR006683 GO:GO:0003700 Gene3D:1.10.10.10
            InterPro:IPR011991 GO:GO:0005622 InterPro:IPR010766 Pfam:PF07085
            HOGENOM:HOG000022009 OMA:YFIKPVQ ProtClustDB:CLSK873528
            RefSeq:NP_847059.1 RefSeq:YP_021501.1 RefSeq:YP_030753.1
            ProteinModelPortal:Q81KX8 DNASU:1084014
            EnsemblBacteria:EBBACT00000012784 EnsemblBacteria:EBBACT00000018567
            EnsemblBacteria:EBBACT00000019988 GeneID:1084014 GeneID:2818548
            GeneID:2850363 KEGG:ban:BA_4858 KEGG:bar:GBAA_4858 KEGG:bat:BAS4505
            BioCyc:BANT260799:GJAJ-4563-MONOMER
            BioCyc:BANT261594:GJ7F-4718-MONOMER Uniprot:Q81KX8
        Length = 437

 Score = 88 (36.0 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 21/60 (35%), Positives = 36/60 (60%)

Query:   169 VGDLMTPAPLVVHENTNLEDAARLLLETKYRRLPVVDGYGKLVGLITRGNVVRAALQIKR 228
             +  +MT  P+ V+   ++  AAR+++      LPVVD   KL G+I+R +V++A   I+R
Sbjct:   252 IDKVMTRHPITVNGKMSVAAAARMMVWEGIELLPVVDEGNKLQGIISRQDVLQALQMIQR 311

 Score = 64 (27.6 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 15/57 (26%), Positives = 33/57 (57%)

Query:    75 VGDFMTKKEDLHAVKTTTTVDEALERLVEKRITG-FPVIDDDWKLVGVVSDYDLLAL 130
             V D +T  E+   +K   TV +      E+ + G +P++D++ K++G+V+  D++ +
Sbjct:   191 VEDILTPIEETLYLKPNDTVQQ-WHAYNEETMHGRYPIVDENKKVLGIVTSKDMIGV 246


>UNIPROTKB|Q604Y3
            symbol:MCA2402 "CBS domain protein" species:243233
            "Methylococcus capsulatus str. Bath" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR000644 InterPro:IPR013785 Pfam:PF00571
            PROSITE:PS51371 SMART:SM00116 Pfam:PF04982 GO:GO:0003824
            Gene3D:3.20.20.70 GO:GO:0008152 EMBL:AE017282
            GenomeReviews:AE017282_GR InterPro:IPR007065 HOGENOM:HOG000198060
            KO:K07168 OMA:LAQPWPV RefSeq:YP_114817.1 ProteinModelPortal:Q604Y3
            GeneID:3104432 KEGG:mca:MCA2402 PATRIC:22608668 Uniprot:Q604Y3
        Length = 449

 Score = 111 (44.1 bits), Expect = 0.00095, P = 0.00095
 Identities = 39/152 (25%), Positives = 67/152 (44%)

Query:    71 GTYTVGDFMTKKEDLHAVKTTTTVDEALERLVEKRITGFPVIDDDWKLVGVVSDYDLLAL 130
             G  T    MT  E L A +    ++    R+    I   PV+D    ++G+V+  D    
Sbjct:   290 GKTTCAGIMTP-EPLTA-EFGDDLESVWRRMQRHGIRALPVVDRGRHVIGIVTFKDFFRH 347

Query:   131 DSISGGNQNDTSLFPNVNSTWKTFNELQRLLSKTNGKVVGDLMTPAPLVVHENTNLEDAA 190
                 G        F ++ +  K        ++ T  +VVG +MT   +    +  + + A
Sbjct:   348 APADG--------FGSLKARLKALLLPSPRVTSTKPEVVGQIMTAPAITARHDAPIVELA 399

Query:   191 RLLLETKYRRLPVVDGYGKLVGLITRGNVVRA 222
             RLL E    ++P+VD   KLVGL+T+ +++ A
Sbjct:   400 RLLSEHGIHQVPIVDERRKLVGLVTQTDLIAA 431


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.317   0.134   0.383    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      234       234   0.00087  113 3  11 22  0.43    33
                                                     32  0.43    36
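
Note on bit scores: each alignment above reports a raw score followed by a normalized score in bits, e.g. "Score = 91 (37.1 bits)". The sketch below shows how the two appear to be related through the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2. It assumes that the "Q=9,R=2" row of the table above (Lambda = 0.244, K = 0.0300) holds the gapped parameters used for that conversion; the report does not say so explicitly. The function name bit_score is illustrative only and is not part of any BLAST tool.

```python
import math

# Values copied from the "Q=9,R=2" row of the parameter table above.
# Assumption: this row holds the gapped Karlin-Altschul parameters that
# were used to compute the bit scores printed in this report.
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from alignments listed in this report.
    for raw in (91, 61, 79, 111):
        print(f"Score = {raw} ({bit_score(raw):.1f} bits)")
```

Under that assumption, raw scores of 91, 61, 79 and 111 come out at 37.1, 26.5, 32.9 and 44.1 bits, which agrees with the values printed alongside the corresponding alignments.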


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  27
  No. of states in DFA:  603 (64 KB)
  Total size of DFA:  166 KB (2098 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  21.51u 0.09s 21.60t   Elapsed:  00:00:01
  Total cpu time:  21.51u 0.09s 21.60t   Elapsed:  00:00:01
  Start:  Sat May 11 02:51:38 2013   End:  Sat May 11 02:51:39 2013
