Query 537021.9.peg.1064_1 Match_columns 238 No_of_seqs 20 out of 22 Neff 3.3 Searched_HMMs 39220 Date Wed May 25 06:19:27 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_1064.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam09352 DUF1994 Domain of un 45.1 11 0.00028 17.7 1.4 14 52-65 59-72 (162) 2 pfam09003 Phage_integ_N Bacter 43.9 13 0.00034 17.1 1.6 25 44-68 15-44 (81) 3 KOG4778 consensus 41.7 25 0.00064 15.5 5.9 47 161-208 100-146 (186) 4 pfam11169 DUF2956 Protein of u 38.4 28 0.00072 15.1 2.9 41 168-208 18-58 (103) 5 pfam09917 DUF2147 Uncharacteri 38.1 20 0.0005 16.1 1.7 34 33-70 52-85 (110) 6 TIGR02412 pepN_strep_liv amino 37.6 13 0.00032 17.3 0.7 23 42-64 98-122 (884) 7 pfam01350 Flavi_NS4A Flaviviru 36.7 30 0.00076 15.0 6.3 57 167-223 16-72 (145) 8 COG4731 Uncharacterized protei 36.6 16 0.0004 16.7 1.0 17 54-70 119-135 (162) 9 COG4257 Vgb Streptogramin lyas 30.5 38 0.00096 14.3 2.4 33 39-71 196-230 (353) 10 TIGR02103 pullul_strch alpha-1 25.0 24 0.00062 15.5 0.4 43 86-128 571-621 (945) 11 pfam11082 DUF2880 Protein of u 23.4 36 0.00091 14.5 1.0 11 2-12 25-35 (79) 12 cd01792 ISG15_repeat1 ISG15 is 22.9 32 0.00082 14.8 0.7 13 175-187 58-70 (80) 13 COG2309 AmpS Leucyl aminopepti 22.3 50 0.0013 13.6 1.6 50 34-83 189-257 (385) 14 TIGR02025 BchH magnesium chela 21.9 38 0.00096 14.3 0.9 38 42-82 658-700 (1384) 15 TIGR02608 delta_60_rpt delta-6 21.3 40 0.001 14.2 0.9 21 36-56 34-54 (58) 16 TIGR01857 FGAM-synthase phosph 20.9 31 0.0008 14.8 0.3 11 54-64 353-363 (1279) 17 pfam07481 DUF1521 Domain of Un 20.2 38 0.00098 14.3 0.6 27 36-64 126-153 (171) No 1 >pfam09352 DUF1994 Domain of unknown function (DUF1994). This family of proteins are functionally uncharacterized. Probab=45.15 E-value=11 Score=17.68 Aligned_cols=14 Identities=21% Similarity=0.575 Sum_probs=12.0 Q ss_pred CCEEECCCCCCEEE Q ss_conf 84032566773221 Q 537021.9.peg.1 52 VDKIIDSFIGREIS 65 (238) Q Consensus 52 ~~~~~Dp~tGk~y~ 65 (238) ..+|.|||+|++|. T Consensus 59 sG~w~dpYtg~~~~ 72 (162) T pfam09352 59 SGTLHDPYTGKTIT 72 (162) T ss_pred CCEEECCCCCCEEC T ss_conf 05077578998750 No 2 >pfam09003 Phage_integ_N Bacteriophage lambda integrase, N-terminal domain. The amino terminal domain of bacteriophage lambda integrase folds into a three-stranded, antiparallel beta-sheet that packs against a C-terminal alpha-helix, adopting a fold that is structurally related to the three-stranded beta-sheet family of DNA-binding domains (which includes the GCC-box DNA-binding domain and the N-terminal domain of Tn916 integrase). This domain is responsible for high-affinity binding to each of the five DNA arm-type sites and is also a context-sensitive modulator of DNA cleavage. Probab=43.90 E-value=13 Score=17.14 Aligned_cols=25 Identities=4% Similarity=-0.008 Sum_probs=18.1 Q ss_pred CEEEEECCC-----CEEECCCCCCEEECCC Q ss_conf 136886598-----4032566773221354 Q 537021.9.peg.1 44 IWQYKTSGV-----DKIIDSFIGREISIPH 68 (238) Q Consensus 44 sltYsq~g~-----~~~~Dp~tGk~y~~P~ 68 (238) +|.++..+- =.|.||.|||+|.+-. T Consensus 15 NLy~r~~~r~~k~Yy~yr~P~tGk~~~LG~ 44 (81) T pfam09003 15 NLYCRRDKRNGKVYYQYRNPLTGKFHGLGR 44 (81) T ss_pred CCCEECCCCCCCEEEEEECCCCCCEEECCC T ss_conf 513321576772589974688886531674 No 3 >KOG4778 consensus Probab=41.72 E-value=25 Score=15.46 Aligned_cols=47 Identities=15% Similarity=0.160 Sum_probs=39.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 620798757999999987606765758999999999987679999999 Q 537021.9.peg.1 161 RLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAM 208 (238) Q Consensus 161 rl~pq~~~~r~aLetrLaNQGl~~GS~Ay~~AM~~~~q~~nDar~qa~ 208 (238) -|.+.|..-+.+.+.. -+.||++-++|+..||.++..-+-..+-.|+ T Consensus 100 v~~k~W~~y~~~~rq~-~d~~Ir~~~kaq~~al~eLk~es~~Ly~aA~ 146 (186) T KOG4778 100 VILKTWSIYKQQQRQL-EDVGIRPQSKAQFKALNELKSESEELYEAAI 146 (186) T ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999998-9887788899999999999985589999982 No 4 >pfam11169 DUF2956 Protein of unknown function (DUF2956). This family of proteins with unknown function appears to be restricted to Gammaproteobacteria. Probab=38.35 E-value=28 Score=15.13 Aligned_cols=41 Identities=20% Similarity=0.324 Sum_probs=33.9 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 57999999987606765758999999999987679999999 Q 537021.9.peg.1 168 QDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAM 208 (238) Q Consensus 168 ~~r~aLetrLaNQGl~~GS~Ay~~AM~~~~q~~nDar~qa~ 208 (238) -.+...+|+|+-|||.-|-+-|..-+.+-.+...-+|-+.. T Consensus 18 PGQTKEQTKLIAqGIeKGIa~YKKqqK~KaRe~DK~rKk~~ 58 (103) T pfam11169 18 PGQTKEQTKLIAQGIEKGIAEYKKQQKAKAREADKARKQQL 58 (103) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98888999999999998999999998999898889999999 No 5 >pfam09917 DUF2147 Uncharacterized protein conserved in bacteria (DUF2147). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=38.06 E-value=20 Score=16.11 Aligned_cols=34 Identities=26% Similarity=0.215 Sum_probs=23.5 Q ss_pred HCCCCCCCCCCCEEEEECCCCEEECCCCCCEEECCCCC Q ss_conf 35677658865136886598403256677322135421 Q 537021.9.peg.1 33 RENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYL 70 (238) Q Consensus 33 lnnvNq~TP~GsltYsq~g~~~~~Dp~tGk~y~~P~~t 70 (238) |..+.-..|.+.-.|+ .+++||-+||+|...... T Consensus 52 ~~il~~l~~~~~~~w~----G~iydP~~GktY~~~~~l 85 (110) T pfam09917 52 LTILWDLKPDGGGKWS----GKIYDPENGKTYSGKITL 85 (110) T ss_pred CEEEEEEEECCCCEEE----EEEEECCCCCEEEEEEEE T ss_conf 0435733668998888----899907999999999999 No 6 >TIGR02412 pepN_strep_liv aminopeptidase N; InterPro: IPR012778 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family is a subset of the members of the zinc metallopeptidases belonging to MEROPS peptidase family M1 (aminopeptidase N, clan MA), with a single member characterised in Streptomyces lividans 66: aminopeptidase G . The rest of the members of this family are identified as aminopeptidase N of the actinomycete-type. The spectrum of activity may differ somewhat from the aminopeptidase N clade of Escherichia coli and most other proteobacteria, which are well separated phylogenetically within the M1 family. . Probab=37.59 E-value=13 Score=17.35 Aligned_cols=23 Identities=17% Similarity=0.400 Sum_probs=11.8 Q ss_pred CCCEEEEECCCC--EEECCCCCCEE Q ss_conf 651368865984--03256677322 Q 537021.9.peg.1 42 DGIWQYKTSGVD--KIIDSFIGREI 64 (238) Q Consensus 42 ~GsltYsq~g~~--~~~Dp~tGk~y 64 (238) .+.+.|+.+|-. +++||..||+| T Consensus 98 ~~~~~y~~tGEGlHrF~DPvDg~vY 122 (884) T TIGR02412 98 EAKLAYTNTGEGLHRFVDPVDGEVY 122 (884) T ss_pred EEEECCCCCCCCCEEEECCCCCCEE T ss_conf 8740756458713375448898367 No 7 >pfam01350 Flavi_NS4A Flavivirus non-structural protein NS4A. Flaviviruses encode a single polyprotein. This is cleaved into three structural and seven non-structural proteins. The NS4A protein is small and poorly conserved among the Flaviviruses. NS4A contains multiple hydrophobic potential membrane spanning regions. NS4A has only been found in cells infected by Kunjin virus. Probab=36.69 E-value=30 Score=14.97 Aligned_cols=57 Identities=25% Similarity=0.217 Sum_probs=36.8 Q ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 757999999987606765758999999999987679999999810138999999875 Q 537021.9.peg.1 167 QQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQE 223 (238) Q Consensus 167 ~~~r~aLetrLaNQGl~~GS~Ay~~AM~~~~q~~nDar~qa~L~g~qe~~r~~~l~~ 223 (238) +|-++|++|=-.=--=.|||+||+.|+.|.-++---...-.++.-....-....+.+ T Consensus 16 ~r~~~A~Dt~~~l~~ae~gsRA~r~A~~elPEAl~till~~ll~~~T~Gv~~~lm~~ 72 (145) T pfam01350 16 QRTQEALDTLYTLLTAEPGSRAYRMALEELPEALETILLVALLGLATLGVFLLLMAR 72 (145) T ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 989999989986650798867999999868299999999999999999999987627 No 8 >COG4731 Uncharacterized protein conserved in bacteria [Function unknown] Probab=36.56 E-value=16 Score=16.73 Aligned_cols=17 Identities=24% Similarity=0.180 Sum_probs=12.9 Q ss_pred EEECCCCCCEEECCCCC Q ss_conf 03256677322135421 Q 537021.9.peg.1 54 KIIDSFIGREISIPHYL 70 (238) Q Consensus 54 ~~~Dp~tGk~y~~P~~t 70 (238) +++||..||+|....-. T Consensus 119 ~v~dP~~gKtYs~k~~v 135 (162) T COG4731 119 KVYDPANGKTYSGKVTV 135 (162) T ss_pred EEECCCCCCEEEEEEEE T ss_conf 77526789889899998 No 9 >COG4257 Vgb Streptogramin lyase [Defense mechanisms] Probab=30.54 E-value=38 Score=14.34 Aligned_cols=33 Identities=30% Similarity=0.408 Sum_probs=15.2 Q ss_pred CCCCCCEEEE--ECCCCEEECCCCCCEEECCCCCC Q ss_conf 5886513688--65984032566773221354210 Q 537021.9.peg.1 39 MTPDGIWQYK--TSGVDKIIDSFIGREISIPHYLQ 71 (238) Q Consensus 39 ~TP~GsltYs--q~g~~~~~Dp~tGk~y~~P~~ta 71 (238) +||+||+=|+ +...---+||++|-.-.+|..++ T Consensus 196 atpdGsvwyaslagnaiaridp~~~~aev~p~P~~ 230 (353) T COG4257 196 ATPDGSVWYASLAGNAIARIDPFAGHAEVVPQPNA 230 (353) T ss_pred ECCCCCEEEEECCCCCEEECCCCCCCCCEECCCCC T ss_conf 78998588873146603880644578524259971 No 10 >TIGR02103 pullul_strch alpha-1,6-glucosidases, pullulanase-type; InterPro: IPR011839 Members of this protein family include secreted (or membrane-anchored) pullulanases of Gram-negative bacteria and pullulanase-type starch debranching enzymes of plants. Both enzymes hydrolyze alpha-1,6 glycosidic linkages. Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. This family is closely homologous to, but architecturally different from the pullulanases of Gram-positive bacteria (IPR011838 from INTERPRO).; GO: 0051060 pullulanase activity, 0005975 carbohydrate metabolic process. Probab=25.04 E-value=24 Score=15.52 Aligned_cols=43 Identities=16% Similarity=-0.043 Sum_probs=29.7 Q ss_pred HHHHHCCHHHHHH-HHHHHHHHHH--CCCHHC-----CCCCCCHHCCCCCC Q ss_conf 5553121788765-4434444530--110000-----01332000011343 Q 537021.9.peg.1 86 QNINNLLLSDLLT-QRIQDLLPHH--HTHVNT-----TKDFPPQQLRDNDV 128 (238) Q Consensus 86 ~~~t~lnLa~La~-~qsg~l~~~~--~t~~~~-----~~~~~~q~~~~~~~ 128 (238) .++||+||+|-.- ++|.+|+|+. |.||+- -++.+++.+.|-.| T Consensus 571 ~~AtQlNL~GTGIGtFsDRlRDA~RGGsPFd~GvdseG~d~Lr~~QGfg~G 621 (945) T TIGR02103 571 INATQLNLAGTGIGTFSDRLRDAVRGGSPFDSGVDSEGKDALRQNQGFGSG 621 (945) T ss_pred CHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCHHHCCEEEE T ss_conf 103555413676441114688764078888877367777644010020241 No 11 >pfam11082 DUF2880 Protein of unknown function (DUF2880). This bacterial family of proteins has no known function. Probab=23.38 E-value=36 Score=14.48 Aligned_cols=11 Identities=45% Similarity=0.561 Sum_probs=8.2 Q ss_pred CCCCCCCCCCH Q ss_conf 89999987997 Q 537021.9.peg.1 2 GKQRASLAPDP 12 (238) Q Consensus 2 Gk~kAP~aPDP 12 (238) ||.+||++|-- T Consensus 25 GKdEAPeAPva 35 (79) T pfam11082 25 GKDEAPEAPVA 35 (79) T ss_pred CCCCCCCCCHH T ss_conf 98889887489 No 12 >cd01792 ISG15_repeat1 ISG15 is a ubiquitin-like protein containing two ubiquitin homology domains that becomes conjugated to a variety of proteins when cells are treated with type I interferon or lipopolysaccharide. Although ISG15 has properties similar to those of other ubiquitin-like molecules, it is a unique member of the ubiquitin-like superfamily, whose expression and conjugation to target proteins are tightly regulated by specific signaling pathways, indicating it may have specialized functions in the immune system. Probab=22.92 E-value=32 Score=14.78 Aligned_cols=13 Identities=46% Similarity=0.552 Sum_probs=10.5 Q ss_pred HHHHHHCCCCCHH Q ss_conf 9987606765758 Q 537021.9.peg.1 175 TKLHNQGLVSGSV 187 (238) Q Consensus 175 trLaNQGl~~GS~ 187 (238) -.|.+|||.|||. T Consensus 58 ~pL~~~gl~pgst 70 (80) T cd01792 58 VPLVSQGLGPGST 70 (80) T ss_pred CCHHHCCCCCCCE T ss_conf 6765527799989 No 13 >COG2309 AmpS Leucyl aminopeptidase (aminopeptidase T) [Amino acid transport and metabolism] Probab=22.30 E-value=50 Score=13.57 Aligned_cols=50 Identities=20% Similarity=0.231 Sum_probs=36.1 Q ss_pred CCCCCCCCCC-CEEEEECCCCEEECCC-----------CCCEEECCC-------CCCCCCCCHHHHHHH Q ss_conf 5677658865-1368865984032566-----------773221354-------210122487787731 Q 537021.9.peg.1 34 ENIDRMTPDG-IWQYKTSGVDKIIDSF-----------IGREISIPH-------YLQSYSLHPIQQQIH 83 (238) Q Consensus 34 nnvNq~TP~G-sltYsq~g~~~~~Dp~-----------tGk~y~~P~-------~tatqtLsp~qQaI~ 83 (238) .-|-.+||.| .||.+--|...|.|.- +|-.+..|+ .+.+..|++...-|. T Consensus 189 ~~Vh~~tp~gTDLt~~l~~~~~~~~~g~~~~~~~~NmPtgEVFtaP~~~sv~G~V~~~~Pl~~~G~lie 257 (385) T COG2309 189 SAVHITTPGGTDLTFGLAGRHAWNDAGSVNKGGNANMPTGEVFTAPHEDSVNGVVVFDKPLSYQGTLIE 257 (385) T ss_pred CEEEEECCCCCEEEEECCCCEEECCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCEEEE T ss_conf 579995899750699758960651677668787788997535766665764359997867885877660 No 14 >TIGR02025 BchH magnesium chelatase, H subunit; InterPro: IPR011771 This entry represents the H subunit of the magnesium chelatase complex responsible for magnesium insertion into the protoporphyrin IX ring in the biosynthesis of both chlorophyll and bacteriochlorophyll. In chlorophyll-utilizing species, this gene is known as ChlH, while in bacteriochlorophyll-utilizing spoecies it is called BchH. Subunit H is the largest (~140kDa) of the three subunits (the others being BchD/ChlD and BchI/ChlI), and is known to bind protoporphyrin IX. Subunit H is homologous to the CobN subunit of cobaltochelatase and by anology with that enzyme, subunit H is believed to also bind the magnesium ion which is inserted into the ring. In conjunction with the hydrolysis of ATP by subunits I and D, a conformation change is believed to happen in subunit H causing the magnesium ion insertion into the distorted protoporphyrin ring .; GO: 0016851 magnesium chelatase activity, 0015995 chlorophyll biosynthetic process. Probab=21.89 E-value=38 Score=14.33 Aligned_cols=38 Identities=21% Similarity=0.445 Sum_probs=25.1 Q ss_pred CCCEEE---EECCCC--EEECCCCCCEEECCCCCCCCCCCHHHHHH Q ss_conf 651368---865984--03256677322135421012248778773 Q 537021.9.peg.1 42 DGIWQY---KTSGVD--KIIDSFIGREISIPHYLQSYSLHPIQQQI 82 (238) Q Consensus 42 ~GsltY---sq~g~~--~~~Dp~tGk~y~~P~~tatqtLsp~qQaI 82 (238) -|||+| +|+|++ .|.|-+.| +||+|=-|-.==|=.-+| T Consensus 658 HGslEfMPG~Q~Gltg~CWPD~LlG---~lPnfY~YavNNPSEa~I 700 (1384) T TIGR02025 658 HGSLEFMPGKQVGLTGECWPDRLLG---DLPNFYLYAVNNPSEATI 700 (1384) T ss_pred CCCEEECCCCCCCCCCCCCCCHHCC---CCCCCEECCCCCHHHHHH T ss_conf 6532226778758887777720103---675100027888037899 No 15 >TIGR02608 delta_60_rpt delta-60 repeat domain; InterPro: IPR013431 This repeat often occurs in tandem, up to as many as 13 times, in proteins from Bdellovibrio bacteriovorus, Azotobacter vinelandii, Geobacter sulfurreducens, Pirellula sp. 1, Myxococcus xanthus, and others, many of which are Deltaproteobacteria. The length of the repeat ranges from about 57 to 61 amino acids - this entry represents a core region of approximately 54 amino acids.. Probab=21.32 E-value=40 Score=14.21 Aligned_cols=21 Identities=24% Similarity=0.420 Sum_probs=16.3 Q ss_pred CCCCCCCCCEEEEECCCCEEE Q ss_conf 776588651368865984032 Q 537021.9.peg.1 36 IDRMTPDGIWQYKTSGVDKII 56 (238) Q Consensus 36 vNq~TP~GsltYsq~g~~~~~ 56 (238) |=++||||||.=++.+..|.+ T Consensus 34 ~~RlN~DGsLDt~FG~~G~~~ 54 (58) T TIGR02608 34 LARLNADGSLDTSFGGDGKVT 54 (58) T ss_pred EEEECCCCCCCCCCCCCCEEE T ss_conf 998679998107736984589 No 16 >TIGR01857 FGAM-synthase phosphoribosylformylglycinamidine synthase; InterPro: IPR010141 This entry represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis, which represent a second clade of the enzymes found in Clostridia, Bifidobacteria and Streptococcus species. This enzyme performs the fourth step in IMP biosynthesis (the precursor of all purines) from PRPP.. Probab=20.93 E-value=31 Score=14.84 Aligned_cols=11 Identities=36% Similarity=0.525 Sum_probs=5.2 Q ss_pred EEECCCCCCEE Q ss_conf 03256677322 Q 537021.9.peg.1 54 KIIDSFIGREI 64 (238) Q Consensus 54 ~~~Dp~tGk~y 64 (238) -|+||++|+.| T Consensus 353 AIRDPLSGRSY 363 (1279) T TIGR01857 353 AIRDPLSGRSY 363 (1279) T ss_pred CCCCCCCCHHH T ss_conf 41377763003 No 17 >pfam07481 DUF1521 Domain of Unknown Function (DUF1521). This family of unknown function is found in a limited set of Bradyrhizobium proteins. There appears to be a periodic -DG- motif in it. Probab=20.20 E-value=38 Score=14.30 Aligned_cols=27 Identities=33% Similarity=0.678 Sum_probs=14.6 Q ss_pred CCCCCCCCCE-EEEECCCCEEECCCCCCEE Q ss_conf 7765886513-6886598403256677322 Q 537021.9.peg.1 36 IDRMTPDGIW-QYKTSGVDKIIDSFIGREI 64 (238) Q Consensus 36 vNq~TP~Gsl-tYsq~g~~~~~Dp~tGk~y 64 (238) .+++||||-. .+-|+| +-|.|+ +|+.- T Consensus 126 LD~ltpdG~~~i~~q~G-~gw~d~-tG~~~ 153 (171) T pfam07481 126 LDQLTPDGAQTIHEQTG-QGWVDP-TGRAV 153 (171) T ss_pred CHHHCCCHHHHHHCCCC-CCCCCC-CCCCC T ss_conf 01027543433330688-611388-77645 Done!