Query psy10899
Match_columns 218
No_of_seqs 126 out of 607
Neff 5.4
Searched_HMMs 46136
Date Fri Aug 16 21:23:41 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10899.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10899hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3182|consensus 100.0 8.9E-48 1.9E-52 325.7 16.0 152 4-171 39-196 (212)
2 PF04752 ChaC: ChaC-like prote 100.0 2.8E-45 6.1E-50 309.0 16.3 142 4-158 30-177 (178)
3 COG3703 ChaC Uncharacterized p 100.0 3.1E-37 6.7E-42 260.5 15.0 139 4-163 41-186 (190)
4 PHA03014 hypothetical protein; 99.0 4.8E-09 1E-13 87.9 12.3 109 15-159 47-161 (163)
5 PF13772 AIG2_2: AIG2-like fam 98.9 4.2E-09 9E-14 78.2 7.7 82 30-152 1-83 (83)
6 cd06661 GGCT_like GGCT-like do 98.6 1.5E-07 3.2E-12 69.8 7.6 60 19-94 40-99 (99)
7 KOG4059|consensus 97.7 9.4E-05 2E-09 62.1 6.7 117 16-152 67-184 (193)
8 PF06094 AIG2: AIG2-like famil 97.5 0.00047 1E-08 51.6 7.0 62 18-95 40-102 (102)
9 COG2105 Uncharacterized conser 95.0 0.1 2.2E-06 42.0 7.0 61 20-99 44-104 (120)
10 PF06570 DUF1129: Protein of u 72.4 3.7 8.1E-05 35.1 3.1 51 121-176 3-56 (206)
11 PRK07668 hypothetical protein; 58.6 12 0.00025 33.8 3.6 52 121-176 3-57 (254)
12 TIGR00290 MJ0570_dom MJ0570-re 49.0 1.1E+02 0.0024 26.8 8.1 79 87-167 26-114 (223)
13 KOG4311|consensus 39.7 30 0.00064 32.0 3.1 73 13-103 145-231 (359)
14 PF09164 VitD-bind_III: Vitami 37.7 53 0.0012 24.0 3.5 30 125-159 12-41 (68)
15 PF10264 Stork_head: Winged he 29.9 1.2E+02 0.0027 22.7 4.6 38 128-167 32-69 (80)
16 PF05269 Phage_CII: Bacterioph 29.7 79 0.0017 24.2 3.5 53 112-166 27-84 (91)
17 PF11784 DUF3320: Protein of u 25.6 56 0.0012 22.1 1.9 21 102-122 24-44 (52)
18 PF09286 Pro-kuma_activ: Pro-k 25.1 1.1E+02 0.0023 24.2 3.8 44 94-148 37-80 (143)
19 COG0177 Nth Predicted EndoIII- 24.6 1.3E+02 0.0029 26.3 4.6 50 102-154 64-113 (211)
20 PRK04456 acetyl-CoA decarbonyl 23.6 1.4E+02 0.0031 29.2 4.8 47 83-129 279-338 (463)
21 PF07454 SpoIIP: Stage II spor 21.5 1.2E+02 0.0025 27.6 3.7 40 83-142 182-221 (268)
22 COG4759 Uncharacterized protei 20.3 56 0.0012 30.4 1.4 28 3-34 168-197 (316)
No 1
>KOG3182|consensus
Probab=100.00 E-value=8.9e-48 Score=325.70 Aligned_cols=152 Identities=40% Similarity=0.682 Sum_probs=140.3
Q ss_pred eecccCCC---CCCCCCCeeEEEeeCCCCeEEEEEEEeCCC---ChhhHHHhhccCCCCCcceEEEEEeCCCCCCCCCCC
Q psy10899 4 LLAWSGGN---SSTRTRVRYFGHILSTRSIVWGRAFLVSGK---SAVPYLHTREGTLGGYKTSFEQFYPKDHSDQSSSSE 77 (218)
Q Consensus 4 ~~~wq~s~---~T~e~PGrV~tL~~~~gg~cwGvAYrI~~~---~~l~yLd~RE~~~ggY~~~~v~~~~~~~~~~~~~~~ 77 (218)
+||||++. |||+.||||+||+++.+++|||+||+|+++ ++++||+.|| +|||..+.|+||+.+..
T Consensus 39 RrF~q~s~dHRGtp~~PGRv~TLi~~~e~~~wGvay~V~g~~~~~~l~yl~~RE--~nGY~~~~v~f~~e~~~------- 109 (212)
T KOG3182|consen 39 RRFWQGSTDHRGTPEHPGRVATLIPYEEAITWGVAYRVRGKQASEVLEYLNVRE--LNGYTTHEVEFYPEDAA------- 109 (212)
T ss_pred hheeccccccCCCCCCCceeEEeecCCcceEeeEEEEecchhHHHHHHHHHHHh--hcCcceeeeeeeccCCC-------
Confidence 69999999 699999999999999999999999999996 5899999999 79999999999997543
Q ss_pred CCCCCCCCcEEEEEEEecCCCCCCCCCCCHHHHHHHHHcccCCCCCcHHHHHHHHHHHHHhCCCCCCCCccHHHHHHHHH
Q psy10899 78 DGLSETSQPIRTMMFIACPTNSMWLGEAPLHEIANQIVSCKGPSGHNAEYVLRLAIFMRENIPDAHVHDPELFTLEIMIR 157 (218)
Q Consensus 78 ~~~~~~g~~v~AlvYiat~~n~~y~G~~~le~iA~~Ia~a~GpSG~N~EYL~~La~~Lr~~~pg~gi~D~hL~~Le~~Vr 157 (218)
....++.+||||||++|++|+||+++++||+||++|+||||+|+||||+|+++|++++|++ .|+||++|++.||
T Consensus 110 ----~~p~v~~vlvyvaTp~N~~ylGp~ple~iArqI~t~~GpsG~N~eYLf~La~am~~l~p~~--~D~hl~eL~~~Vr 183 (212)
T KOG3182|consen 110 ----ELPEVLGVLVYVATPDNEYYLGPAPLEEIARQIVTARGPSGPNREYLFNLAKAMRQLFPGA--EDEHLFELENEVR 183 (212)
T ss_pred ----CCCceEEEEEEEecCCCccccCCccHHHHHHHHHhccCCCCCcHHHHHHHHHHHHHcCCCc--hhHHHHHHHHHHH
Confidence 2468899999999999999999999999999999999999999999999999999999988 9999999999999
Q ss_pred HHhhhcCCCccccc
Q psy10899 158 SRLKEKKIPLHVIM 171 (218)
Q Consensus 158 ~~~~~~~~~~~~~~ 171 (218)
+.+.+.. .+.+.|
T Consensus 184 k~l~~~~-~~~~al 196 (212)
T KOG3182|consen 184 KYLVESR-PLVHAL 196 (212)
T ss_pred HHHhccc-hhhhhh
Confidence 9988665 554444
No 2
>PF04752 ChaC: ChaC-like protein; InterPro: IPR006840 The ChaC protein is thought to be associated with the putative ChaA Ca2+/H+ cation transport protein in Escherichia coli. Its function is not known. This family also includes homologues regions from several other bacterial and eukaryotic proteins.
Probab=100.00 E-value=2.8e-45 Score=309.04 Aligned_cols=142 Identities=33% Similarity=0.575 Sum_probs=127.8
Q ss_pred eecccCCC---CCCCCCCeeEEEeeCCCCeEEEEEEEeCCC---ChhhHHHhhccCCCCCcceEEEEEeCCCCCCCCCCC
Q psy10899 4 LLAWSGGN---SSTRTRVRYFGHILSTRSIVWGRAFLVSGK---SAVPYLHTREGTLGGYKTSFEQFYPKDHSDQSSSSE 77 (218)
Q Consensus 4 ~~~wq~s~---~T~e~PGrV~tL~~~~gg~cwGvAYrI~~~---~~l~yLd~RE~~~ggY~~~~v~~~~~~~~~~~~~~~ 77 (218)
++|||.|. |||++||||+||++.++|.|||+||+|+++ ++++|||.||| .+||.+++++|++.....
T Consensus 30 R~F~~~s~~hRGTpe~PGrvltL~~~~~~~c~Gvayrv~~~~~~~~l~~L~~RE~-~~Gy~~~~v~~~~~~~~~------ 102 (178)
T PF04752_consen 30 RRFCQGSTDHRGTPEQPGRVLTLDPGEEGSCWGVAYRVPEEDAEEVLEYLDEREM-IGGYTRHWVPFYPEVDTD------ 102 (178)
T ss_pred cceEeeccccCCCcCCCcceeeeeeCCCCEEEEEEEEecCcCHHHHHHHHhhccc-ccccceEEEEEEEeccCC------
Confidence 57888886 799999999999988889999999999997 47999999997 699999999999832210
Q ss_pred CCCCCCCCcEEEEEEEecCCCCCCCCCCCHHHHHHHHHcccCCCCCcHHHHHHHHHHHHHhCCCCCCCCccHHHHHHHHH
Q psy10899 78 DGLSETSQPIRTMMFIACPTNSMWLGEAPLHEIANQIVSCKGPSGHNAEYVLRLAIFMRENIPDAHVHDPELFTLEIMIR 157 (218)
Q Consensus 78 ~~~~~~g~~v~AlvYiat~~n~~y~G~~~le~iA~~Ia~a~GpSG~N~EYL~~La~~Lr~~~pg~gi~D~hL~~Le~~Vr 157 (218)
..+..++||+||++++||+|+|+++++++|++|++|+||||+|+|||++|+++|+++.|++ .|+||++|+++|+
T Consensus 103 ----~~~~~~~al~yv~~~~n~~y~g~~~~~~~A~~Ia~a~G~~G~N~eYL~~l~~~L~~~gp~i--~D~~l~~L~~~V~ 176 (178)
T PF04752_consen 103 ----SGPVIVEALVYVADPDNPQYLGPLPLEEIARIIATASGPSGSNREYLFNLAEALRELGPGI--RDPHLFALERRVR 176 (178)
T ss_pred ----CCceEEEEEEEEecCCCccccCCCCHHHHHHHHhheECcCcCCHHHHHHHHHHHHHhCCCC--CChHHHHHHHHHh
Confidence 1334569999999999999999999999999999999999999999999999999996656 9999999999998
Q ss_pred H
Q psy10899 158 S 158 (218)
Q Consensus 158 ~ 158 (218)
+
T Consensus 177 ~ 177 (178)
T PF04752_consen 177 E 177 (178)
T ss_pred h
Confidence 6
No 3
>COG3703 ChaC Uncharacterized protein involved in cation transport [Inorganic ion transport and metabolism]
Probab=100.00 E-value=3.1e-37 Score=260.47 Aligned_cols=139 Identities=19% Similarity=0.224 Sum_probs=129.1
Q ss_pred eecccCCC---CCCCCCCeeEEEeeCCCCeEEEEEEEeCCC---ChhhHHHhhcc-CCCCCcceEEEEEeCCCCCCCCCC
Q psy10899 4 LLAWSGGN---SSTRTRVRYFGHILSTRSIVWGRAFLVSGK---SAVPYLHTREG-TLGGYKTSFEQFYPKDHSDQSSSS 76 (218)
Q Consensus 4 ~~~wq~s~---~T~e~PGrV~tL~~~~gg~cwGvAYrI~~~---~~l~yLd~RE~-~~ggY~~~~v~~~~~~~~~~~~~~ 76 (218)
+.||.+|+ ||+++||+|++|+ +||+|+|||||||++ ++++||++||| .+..|.++|+++.++
T Consensus 41 Rsfc~~s~~~RGT~~~PGlvl~L~--~GGsc~GvafRip~~~~~~v~~yL~~RE~~~t~~y~p~~l~v~~~--------- 109 (190)
T COG3703 41 RSFCLRSTDHRGTAEQPGLVLGLD--RGGSCEGVAYRIPEAHAEEVLEYLREREMNYTLVYVPRWLPVELE--------- 109 (190)
T ss_pred eEEEEEEeeecCCcCCCceEEEee--CCCcEEEEEEEcCchhhHHHHHHHHHhhccccceeeeEEEEEecC---------
Confidence 34666665 7999999999999 999999999999987 48999999999 889999999999986
Q ss_pred CCCCCCCCCcEEEEEEEecCCCCCCCCCCCHHHHHHHHHcccCCCCCcHHHHHHHHHHHHHhCCCCCCCCccHHHHHHHH
Q psy10899 77 EDGLSETSQPIRTMMFIACPTNSMWLGEAPLHEIANQIVSCKGPSGHNAEYVLRLAIFMRENIPDAHVHDPELFTLEIMI 156 (218)
Q Consensus 77 ~~~~~~~g~~v~AlvYiat~~n~~y~G~~~le~iA~~Ia~a~GpSG~N~EYL~~La~~Lr~~~pg~gi~D~hL~~Le~~V 156 (218)
+|..+.|++||++++|++|.|+++.+++|.+|+.|+|+||+|.|||++|.++|+++ ||+|+.|+.|+..|
T Consensus 110 ------~g~~~~al~~v~~~~h~qyag~l~~~~~A~~ia~a~G~sG~n~eYL~~t~~hL~~~----gi~d~~l~~l~~~v 179 (190)
T COG3703 110 ------GGRRVNALVFVGDRKHPQYAGDLDAEQIAAIIAAAVGLSGPNAEYLFNTLQHLRKL----GIRDHNLEDLLELV 179 (190)
T ss_pred ------CCcEEEEEEEEecCCccccCCCCcHHHHHHHHHHHhCCCCCcHHHHHHHHHHHHhc----CCcchhHHHHHHHH
Confidence 68999999999999999999999999999999999999999999999999999999 77999999999999
Q ss_pred HHHhhhc
Q psy10899 157 RSRLKEK 163 (218)
Q Consensus 157 r~~~~~~ 163 (218)
.+..++.
T Consensus 180 ~~~~~~~ 186 (190)
T COG3703 180 AALLAEQ 186 (190)
T ss_pred HHHHHHh
Confidence 9877643
No 4
>PHA03014 hypothetical protein; Provisional
Probab=99.02 E-value=4.8e-09 Score=87.85 Aligned_cols=109 Identities=13% Similarity=0.057 Sum_probs=89.8
Q ss_pred CCCCeeEEEeeCCCCeEEEEEEEeCCCChhhHHHhhccCC-CCCcceEEEEEeCCCCCCCCCCCCCCCCCCCcEEEEEE-
Q psy10899 15 RTRVRYFGHILSTRSIVWGRAFLVSGKSAVPYLHTREGTL-GGYKTSFEQFYPKDHSDQSSSSEDGLSETSQPIRTMMF- 92 (218)
Q Consensus 15 e~PGrV~tL~~~~gg~cwGvAYrI~~~~~l~yLd~RE~~~-ggY~~~~v~~~~~~~~~~~~~~~~~~~~~g~~v~AlvY- 92 (218)
.-.|-++|+++++|+.|||+.|+|+. .-++-||+-|.+. +-|.+.+++|.+.+ ++..++|++|
T Consensus 47 ~~~Ga~ATIvp~~g~~V~Gvlw~i~~-~dl~~LD~~EGvp~~~Y~~~~v~V~~~~--------------~~~~~~a~~Y~ 111 (163)
T PHA03014 47 EIDSVIASIKKDDNGIVFGILYEFNE-SIMKKFDKQEFIDKNIYKLAKMNVLDLE--------------DEKIIEAQAYK 111 (163)
T ss_pred CcCCceEEEEECCCCEEEEEEEEeCH-HHHHHHhhhcCCCcCceEEEEEEEEeCC--------------CCcEEEEEEEe
Confidence 34788999999999999999999998 6788999999765 44999999998763 3578999999
Q ss_pred -EecCCCCCCCCCCCHHHHHHHHHcccCCCCCcHHHHHHHHHHHHHhCCCCCC---CCccHHHHHHHHHHH
Q psy10899 93 -IACPTNSMWLGEAPLHEIANQIVSCKGPSGHNAEYVLRLAIFMRENIPDAHV---HDPELFTLEIMIRSR 159 (218)
Q Consensus 93 -iat~~n~~y~G~~~le~iA~~Ia~a~GpSG~N~EYL~~La~~Lr~~~pg~gi---~D~hL~~Le~~Vr~~ 159 (218)
+.++.++.+. +.=|...||--+.+..++. |+ ..+|+..|++..++.
T Consensus 112 ~~~~~~~~~~~-----------------~~~Ps~~Yl~~I~~Ga~e~----Gl~~~P~~Y~~~l~~~~~~~ 161 (163)
T PHA03014 112 AILDDDNNMFY-----------------DAPNFNIYKDIIIDALIEN----NILDYPLWYIKHINNIFKEY 161 (163)
T ss_pred hhcCCCccccc-----------------CCCChHHHHHHHHHHHHHh----CCCCCcHHHHHHHHHHHHHh
Confidence 6666665331 1135679999999999999 99 999999999887653
No 5
>PF13772 AIG2_2: AIG2-like family; PDB: 2QIK_A 2RBH_B 2I5T_B 2Q53_A 2PN7_B 3CRY_A.
Probab=98.93 E-value=4.2e-09 Score=78.19 Aligned_cols=82 Identities=20% Similarity=0.289 Sum_probs=66.6
Q ss_pred eEEEEEEEeCCCChhhHHHhhccCCC-CCcceEEEEEeCCCCCCCCCCCCCCCCCCCcEEEEEEEecCCCCCCCCCCCHH
Q psy10899 30 IVWGRAFLVSGKSAVPYLHTREGTLG-GYKTSFEQFYPKDHSDQSSSSEDGLSETSQPIRTMMFIACPTNSMWLGEAPLH 108 (218)
Q Consensus 30 ~cwGvAYrI~~~~~l~yLd~RE~~~g-gY~~~~v~~~~~~~~~~~~~~~~~~~~~g~~v~AlvYiat~~n~~y~G~~~le 108 (218)
.|||+.|+|++ +.++.||.+|.... .|.+..++|... ++..+.|++|++++... ++
T Consensus 1 ~V~Gvly~l~~-~d~~~LD~~Eg~~~g~Y~~~~v~V~~~---------------~g~~~~a~tY~~~~~~~---~~---- 57 (83)
T PF13772_consen 1 RVWGVLYELSE-EDLESLDRYEGVPIGAYRRIEVTVSTA---------------DGKPVEAFTYVANPKPE---GP---- 57 (83)
T ss_dssp EEEEEEEEEEG-GGHHHHHHHTTTTTTSEEEEEEEEEET---------------TCEEEEEEEEEESSEEE---------
T ss_pred CEEEEEEEECH-HHHHHHHHhcCCCCCCEEEEEEEEEcC---------------CCCEEEEEEEEcCCCCC---CC----
Confidence 58999999999 78999999996544 699999999885 57889999999999633 21
Q ss_pred HHHHHHHcccCCCCCcHHHHHHHHHHHHHhCCCCCCCCccHHHH
Q psy10899 109 EIANQIVSCKGPSGHNAEYVLRLAIFMRENIPDAHVHDPELFTL 152 (218)
Q Consensus 109 ~iA~~Ia~a~GpSG~N~EYL~~La~~Lr~~~pg~gi~D~hL~~L 152 (218)
|..+||-.+.+..++. |+...|+..|
T Consensus 58 --------------Ps~~Yl~~i~~GA~e~----gLp~~Yv~~L 83 (83)
T PF13772_consen 58 --------------PSDRYLDLILRGAREH----GLPAEYVEKL 83 (83)
T ss_dssp ----------------HHHHHHHHHHHHHC----T--HHHHHHH
T ss_pred --------------CCHHHHHHHHHHHHHc----CCCHHHHhhC
Confidence 8899999999999999 7799988876
No 6
>cd06661 GGCT_like GGCT-like domains, also called AIG2-like family. Gamma-glutamyl cyclotransferase (GGCT) catalyzes the formation of pyroglutamic acid (5-oxoproline) from dipeptides containing gamma-glutamyl, and is a dimeric protein. In Homo sapiens, the protein is encoded by the gene C7orf24, and the enzyme participates in the gamma-glutamyl cycle. Hereditary defects in the gamma-glutamyl cycle have been described for some of the genes involved, but not for C7orf24. The synthesis and metabolism of glutathione (L-gamma-glutamyl-L-cysteinylglycine) ties the gamma-glutamyl cycle to numerous cellular processes; glutathione acts as a ubiquitous reducing agent in reductive mechanisms involved in protein and DNA synthesis, transport processes, enzyme activity, and metabolism. AIG2 (avrRpt2-induced gene) is an Arabidopsis protein that exhibits RPS2- and avrRpt2-dependent induction early after infection with Pseudomonas syringae pv maculicola strain ES4326 carrying avrRpt2. avrRpt2 is an avir
Probab=98.63 E-value=1.5e-07 Score=69.81 Aligned_cols=60 Identities=22% Similarity=0.431 Sum_probs=51.7
Q ss_pred eeEEEeeCCCCeEEEEEEEeCCCChhhHHHhhccCCCCCcceEEEEEeCCCCCCCCCCCCCCCCCCCcEEEEEEEe
Q psy10899 19 RYFGHILSTRSIVWGRAFLVSGKSAVPYLHTREGTLGGYKTSFEQFYPKDHSDQSSSSEDGLSETSQPIRTMMFIA 94 (218)
Q Consensus 19 rV~tL~~~~gg~cwGvAYrI~~~~~l~yLd~RE~~~ggY~~~~v~~~~~~~~~~~~~~~~~~~~~g~~v~AlvYia 94 (218)
.+.+|++++++.|||++|+|+ ++.++.||.+|...++|.+..++++.. ++..++|++|+.
T Consensus 40 ~~p~~~~~~~~~v~G~v~~i~-~~~l~~LD~~E~~~~~Y~r~~v~v~~~---------------~~~~~~a~~Y~~ 99 (99)
T cd06661 40 GYPGLVPGPGARVWGELYEVD-PEDLARLDAFEGVPGGYRREEVEVELE---------------DGEGVEAWVYVA 99 (99)
T ss_pred ccCEEEeCCCCEEEEEEEEEC-HHHHHhhhhhcCCCCCeEEEEEEEEeC---------------CCCEEEEEEEeC
Confidence 456788889999999999999 588999999995446999999999986 457899999984
No 7
>KOG4059|consensus
Probab=97.73 E-value=9.4e-05 Score=62.11 Aligned_cols=117 Identities=15% Similarity=0.181 Sum_probs=82.2
Q ss_pred CCCeeEEEeeCCCCeEEEEEEEeCCCChhhHHHhhccCCCC-CcceEEEEEeCCCCCCCCCCCCCCCCCCCcEEEEEEEe
Q psy10899 16 TRVRYFGHILSTRSIVWGRAFLVSGKSAVPYLHTREGTLGG-YKTSFEQFYPKDHSDQSSSSEDGLSETSQPIRTMMFIA 94 (218)
Q Consensus 16 ~PGrV~tL~~~~gg~cwGvAYrI~~~~~l~yLd~RE~~~gg-Y~~~~v~~~~~~~~~~~~~~~~~~~~~g~~v~AlvYia 94 (218)
--|-|+|+++.+|..|||++++++- .-+.-||.-|.++-| |.+..|.|.+. .|+.+.|=.|..
T Consensus 67 W~G~vATI~~t~GdeVWG~vWKm~~-snl~slDeQEgv~~G~Y~~~~V~V~t~---------------eg~~itcR~Yl~ 130 (193)
T KOG4059|consen 67 WSGSVATIVPTQGDEVWGTVWKMDL-SNLPSLDEQEGVSQGIYEPRTVYVKTH---------------EGESITCRAYLL 130 (193)
T ss_pred cccceeEEecCCCCeEEEEEEEccc-ccCccchhhhcccccceEEEEEEEecC---------------CCceeehhHhhh
Confidence 4578999999999999999999987 457778999987766 99999999987 356677777776
Q ss_pred cCCCCCCCCCCCHHHHHHHHHcccCCCCCcHHHHHHHHHHHHHhCCCCCCCCccHHHH
Q psy10899 95 CPTNSMWLGEAPLHEIANQIVSCKGPSGHNAEYVLRLAIFMRENIPDAHVHDPELFTL 152 (218)
Q Consensus 95 t~~n~~y~G~~~le~iA~~Ia~a~GpSG~N~EYL~~La~~Lr~~~pg~gi~D~hL~~L 152 (218)
+. -.=+...|-..--..|....=.++--.||+-+|-.-=+.-+-|. .|.++..+
T Consensus 131 sn--l~~~P~~PSp~Yk~~i~~GAkEn~lP~dY~qkL~aIe~NgfaG~--V~~~ie~~ 184 (193)
T KOG4059|consen 131 SN--LYELPKQPSPTYKQCIVKGAKENSLPEDYVQKLRAIEHNGFAGQ--VNSYIERK 184 (193)
T ss_pred hh--hhhccCCCCchHHhhhhhcccccCCcHHHHHHHhccccCCcccc--hhhHHHHH
Confidence 42 11111134445567788888888888999877643322222233 55444433
No 8
>PF06094 AIG2: AIG2-like family; InterPro: IPR009288 AIG2 is an Arabidopsis protein that exhibit RPS2- and avrRpt2-dependent induction early after infection with Pseudomonas syringae pv maculicola strain ES4326 carrying avrRpt2 []. Its structure consists of a five-stranded beta-barrel surrounded by two alpha-helices and a small beta-sheet. A long flexible alpha-helix protrudes from the structure at the C-terminal end. Conserved residues in a hydrophilic cavity, which are able to bind small ligands, may act as an active site in AIG2-like proteins [].; PDB: 1XHS_A 2KL2_A 1VKB_A 3JUD_A 3JUB_A 3JUC_A 2JQV_A 2QIK_A 2G0Q_A 1V30_A.
Probab=97.48 E-value=0.00047 Score=51.64 Aligned_cols=62 Identities=15% Similarity=0.122 Sum_probs=49.2
Q ss_pred CeeEEEeeCCCC-eEEEEEEEeCCCChhhHHHhhccCCCCCcceEEEEEeCCCCCCCCCCCCCCCCCCCcEEEEEEEec
Q psy10899 18 VRYFGHILSTRS-IVWGRAFLVSGKSAVPYLHTREGTLGGYKTSFEQFYPKDHSDQSSSSEDGLSETSQPIRTMMFIAC 95 (218)
Q Consensus 18 GrV~tL~~~~gg-~cwGvAYrI~~~~~l~yLd~RE~~~ggY~~~~v~~~~~~~~~~~~~~~~~~~~~g~~v~AlvYiat 95 (218)
|-.-+|++.+++ .|+|..|.|.++ .++.||.-|...+.|.+..+++... ++..++|.+|+..
T Consensus 40 ~~yP~l~~~~~~~~V~G~l~~v~~~-~l~~LD~~E~~~~~Y~R~~v~v~~~---------------~g~~~~a~vYv~~ 102 (102)
T PF06094_consen 40 GGYPALVPGEGSGRVEGELYEVDDE-ELARLDEYEGEGSLYRRVRVPVELG---------------DGEEVEAWVYVWN 102 (102)
T ss_dssp SSCEEEESCTTSSEEEEEEEEE-HH-HHHHHHHHTTTTTSEEEEEEEEECC---------------TSSEEEEEEEEE-
T ss_pred CCCCEEEEcCCCCEEEEEEEEECHH-HHHhhHhhcCCCCceEEEEEEEEeC---------------CCCEeEEEEEEEC
Confidence 444467767776 999999999774 4999999975578999999999976 5677899999863
No 9
>COG2105 Uncharacterized conserved protein [Function unknown]
Probab=95.05 E-value=0.1 Score=41.97 Aligned_cols=61 Identities=15% Similarity=0.232 Sum_probs=46.9
Q ss_pred eEEEeeCCCCeEEEEEEEeCCCChhhHHHhhccCCCCCcceEEEEEeCCCCCCCCCCCCCCCCCCCcEEEEEEEecCCCC
Q psy10899 20 YFGHILSTRSIVWGRAFLVSGKSAVPYLHTREGTLGGYKTSFEQFYPKDHSDQSSSSEDGLSETSQPIRTMMFIACPTNS 99 (218)
Q Consensus 20 V~tL~~~~gg~cwGvAYrI~~~~~l~yLd~RE~~~ggY~~~~v~~~~~~~~~~~~~~~~~~~~~g~~v~AlvYiat~~n~ 99 (218)
..+++++++ .|||=+|+|+ +..++.||.=|.. .+|.+.++.+.+.. |.. .|.+|+.++...
T Consensus 44 YP~~~~g~~-~V~Gevy~~d-~~~l~~LDelE~~-~~y~r~~v~v~~~~---------------G~~-~aw~Y~y~~~~~ 104 (120)
T COG2105 44 YPGLVPGEG-KVHGEVYRID-EETLEALDELEDY-GGYYRREVEVTTPL---------------GSK-EAWLYVYAERVR 104 (120)
T ss_pred CcEEcCCCC-EEEEEEEEEC-HHHHhhhhhhhcc-CceEEEEEEEEcCC---------------CCE-EEEEEEEcCCCC
Confidence 556664444 9999999999 4788889988953 47999999888762 333 899999987644
No 10
>PF06570 DUF1129: Protein of unknown function (DUF1129); InterPro: IPR009214 There are currently no experimental data for members of this group or their homologues. However, these proteins contain predicted integral membrane proteins (with several transmembrane segments).
Probab=72.42 E-value=3.7 Score=35.06 Aligned_cols=51 Identities=24% Similarity=0.391 Sum_probs=40.2
Q ss_pred CCCcHHHHHHHHHHHHHhCCCCCCCCccHHHHHHHHHHHh---hhcCCCcccccCCCCC
Q psy10899 121 SGHNAEYVLRLAIFMRENIPDAHVHDPELFTLEIMIRSRL---KEKKIPLHVIMGEEAP 176 (218)
Q Consensus 121 SG~N~EYL~~La~~Lr~~~pg~gi~D~hL~~Le~~Vr~~~---~~~~~~~~~~~g~~~~ 176 (218)
+-.|.||++++-..|++- ++.|...+++-..+...+ +++|.....+.| .+.
T Consensus 3 TkkN~~y~~~l~~~L~~~----~~~e~~~e~~L~eil~~LleaQk~G~tA~~lfG-~P~ 56 (206)
T PF06570_consen 3 TKKNQEYIFDLRKYLRSS----GVSEEEIEELLEEILPHLLEAQKKGKTARQLFG-DPK 56 (206)
T ss_pred chHHHHHHHHHHHHHHHc----CCCHHHHHHHHHHHHHHHHHHHhCCCcHHHHcC-CHH
Confidence 457999999999999888 669988877766555544 577999999999 543
No 11
>PRK07668 hypothetical protein; Validated
Probab=58.64 E-value=12 Score=33.76 Aligned_cols=52 Identities=8% Similarity=0.192 Sum_probs=42.0
Q ss_pred CCCcHHHHHHHHHHHHHhCCCCCCCCccHHHHHHHHHHHh---hhcCCCcccccCCCCC
Q psy10899 121 SGHNAEYVLRLAIFMRENIPDAHVHDPELFTLEIMIRSRL---KEKKIPLHVIMGEEAP 176 (218)
Q Consensus 121 SG~N~EYL~~La~~Lr~~~pg~gi~D~hL~~Le~~Vr~~~---~~~~~~~~~~~g~~~~ 176 (218)
|-.|.||+.++-..|+.. |+.|....++...+...+ +++|.+.+++.|..+.
T Consensus 3 TkeNeefl~~L~~yL~~~----glseeeieeiL~Ei~~hLlEgQk~GkTA~~IfG~sPk 57 (254)
T PRK07668 3 SKEGRKFLDDTRVYLIAK----GIKEEDIESFLEDAELHLIEGEKDGKTVEDIFGDSPK 57 (254)
T ss_pred CHHHHHHHHHHHHHHHHC----CCCHHHHHHHHHHHHHHHHHHHHcCCcHHHHhCCCHH
Confidence 346999999999999988 669988888777666654 6789999999997543
No 12
>TIGR00290 MJ0570_dom MJ0570-related uncharacterized domain. Proteins with this uncharacterized domain include two apparent ortholog families in the Archaea, one of which is universal among the first four completed archaeal genomes, and YLR143W, a much longer protein from Saccharomyces cerevisiae. The domain comprises the full length of the archaeal proteins and the first third of the yeast protein.
Probab=49.02 E-value=1.1e+02 Score=26.81 Aligned_cols=79 Identities=10% Similarity=0.027 Sum_probs=48.9
Q ss_pred EEEEEEEecCCCCCCCCCCCHHHHHHHHHcccC-------CCCCcHHHHHHHHHHHHHhCCCC---CCCCccHHHHHHHH
Q psy10899 87 IRTMMFIACPTNSMWLGEAPLHEIANQIVSCKG-------PSGHNAEYVLRLAIFMRENIPDA---HVHDPELFTLEIMI 156 (218)
Q Consensus 87 v~AlvYiat~~n~~y~G~~~le~iA~~Ia~a~G-------pSG~N~EYL~~La~~Lr~~~pg~---gi~D~hL~~Le~~V 156 (218)
|.+|+-+...+...|.-..+..++.+..|.|-| ..|++-+|+-.|.+.|++. ++ -..|=.+...-..+
T Consensus 26 v~~L~t~~~~~~~s~~~H~~~~~~~~~qA~algipl~~~~~~~~~e~~~e~l~~~l~~~--gv~~vv~GdI~s~~qr~~~ 103 (223)
T TIGR00290 26 VISLVNIMPENEESYMFHGVNAHLTDLQAESIGIPLIKLYTEGTEEDEVEELKGILHTL--DVEAVVFGAIYSEYQKTRI 103 (223)
T ss_pred eEEEEEEecCCCCcccccccCHHHHHHHHHHcCCCeEEeecCCCccHHHHHHHHHHHHc--CCCEEEECCcccHHHHHHH
Confidence 344443333333345555555688888898888 3678899999999999997 21 01344555555555
Q ss_pred HHHhhhcCCCc
Q psy10899 157 RSRLKEKKIPL 167 (218)
Q Consensus 157 r~~~~~~~~~~ 167 (218)
...+.+.++..
T Consensus 104 e~v~~~lgl~~ 114 (223)
T TIGR00290 104 ERVCRELGLKS 114 (223)
T ss_pred HHHHHhcCCEE
Confidence 55555555544
No 13
>KOG4311|consensus
Probab=39.75 E-value=30 Score=31.99 Aligned_cols=73 Identities=10% Similarity=0.147 Sum_probs=46.2
Q ss_pred CCCCCCeeEEEeeCCCCeEEEEEEEeCCCChhhHHHhhccC--------------CCCCcceEEEEEeCCCCCCCCCCCC
Q psy10899 13 STRTRVRYFGHILSTRSIVWGRAFLVSGKSAVPYLHTREGT--------------LGGYKTSFEQFYPKDHSDQSSSSED 78 (218)
Q Consensus 13 T~e~PGrV~tL~~~~gg~cwGvAYrI~~~~~l~yLd~RE~~--------------~ggY~~~~v~~~~~~~~~~~~~~~~ 78 (218)
|...-|+|.||+-++-|.|-|..|-=.+. +-.-++-+-.+ +-|--.+.++++.+
T Consensus 145 tDr~dgl~~tlvv~~~g~~Lglvysskes-~a~ti~~g~gvy~SRsR~~lW~KGetSgn~q~ll~i~vD----------- 212 (359)
T KOG4311|consen 145 TDRKDGLVATLVVVDTGAVLGLVYSSKES-LATTISSGKGVYFSRSRSTLWTKGETSGNFQNLLDIYVD----------- 212 (359)
T ss_pred cCCCCCeEEEEEehhhhhhhhhhcccHHH-HHHHHhcCcceEEecccceeeeccccCcCceeeEEEeec-----------
Confidence 55677999999999999999999975542 22333333310 22222333344433
Q ss_pred CCCCCCCcEEEEEEEecCCCCCCCC
Q psy10899 79 GLSETSQPIRTMMFIACPTNSMWLG 103 (218)
Q Consensus 79 ~~~~~g~~v~AlvYiat~~n~~y~G 103 (218)
---++|.|++++++|-|.-
T Consensus 213 ------CD~D~l~f~v~q~g~gfCH 231 (359)
T KOG4311|consen 213 ------CDRDSLIFLVTQDGPGFCH 231 (359)
T ss_pred ------cCccceEEEEecCCCcccc
Confidence 1137999999999997753
No 14
>PF09164 VitD-bind_III: Vitamin D binding protein, domain III; InterPro: IPR015247 This domain is predominantly found in Vitamin D binding proteins, and adopts a multihelical structure. It is required for formation of an actin 'clamp', allowing the protein to bind to actin []. ; PDB: 1MA9_A 1KW2_A 1KXP_D 1J7E_A 1J78_A 1LOT_A.
Probab=37.68 E-value=53 Score=23.96 Aligned_cols=30 Identities=37% Similarity=0.551 Sum_probs=22.5
Q ss_pred HHHHHHHHHHHHHhCCCCCCCCccHHHHHHHHHHH
Q psy10899 125 AEYVLRLAIFMRENIPDAHVHDPELFTLEIMIRSR 159 (218)
Q Consensus 125 ~EYL~~La~~Lr~~~pg~gi~D~hL~~Le~~Vr~~ 159 (218)
.||=-+|++.|+.-+|.+ .- ..|+.+|.++
T Consensus 12 tEyKKrL~e~l~~k~P~a--t~---~~l~~lve~R 41 (68)
T PF09164_consen 12 TEYKKRLAERLRAKLPDA--TP---TELKELVEKR 41 (68)
T ss_dssp HHHHHHHHHHHHHH-TTS---H---HHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHCCCC--CH---HHHHHHHHHH
Confidence 589999999999987766 54 6777777654
No 15
>PF10264 Stork_head: Winged helix Storkhead-box1 domain; InterPro: IPR019391 In humans the Storkhead-box protein controls polyploidization of extravillus trophoblast and is implicated in pre-eclampsia []. This entry represents the conserved N-terminal winged-helix domain, which is likely to bind DNA.
Probab=29.86 E-value=1.2e+02 Score=22.69 Aligned_cols=38 Identities=24% Similarity=0.207 Sum_probs=33.6
Q ss_pred HHHHHHHHHHhCCCCCCCCccHHHHHHHHHHHhhhcCCCc
Q psy10899 128 VLRLAIFMRENIPDAHVHDPELFTLEIMIRSRLKEKKIPL 167 (218)
Q Consensus 128 L~~La~~Lr~~~pg~gi~D~hL~~Le~~Vr~~~~~~~~~~ 167 (218)
+-.|-++|++.+|++ .=|--.-|.+.+-.++++++|..
T Consensus 32 ~E~l~~~L~~~yp~i--~~Ps~e~l~~~L~~Li~erkIY~ 69 (80)
T PF10264_consen 32 QETLREHLRKHYPGI--AIPSQEVLYNTLGTLIKERKIYH 69 (80)
T ss_pred HHHHHHHHHHhCCCC--CCCCHHHHHHHHHHHHHcCceee
Confidence 567889999999988 88999999999999999998865
No 16
>PF05269 Phage_CII: Bacteriophage CII protein; InterPro: IPR007933 The CII protein is a transcription activator, conserved in bacteriophage lambda and related phages, that plays a key role in the decision between lytic or lysogenic phage development. CII is regulated at multiple levels including transcription, translation initiation, mRNA stability, and proteolysis []. Conditions that stabilise cII favour lysogenic development. The lambda CII protein activates three specific promoters, binding to direct repeat sequences rather than the more usual inverted repeats. Structurally, CII is a homotetramer where each monomer is composed of four alpha helices and a disordered C terminus [, ]. The alpha helical region is responsible for DNA binding and multimerisation. The homotetramer has an unusual spatial arrangement that allows recognition of the direct repeat sequences.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1ZS4_C 1ZPQ_C 1XWR_A.
Probab=29.65 E-value=79 Score=24.25 Aligned_cols=53 Identities=17% Similarity=0.340 Sum_probs=39.3
Q ss_pred HHHHcccCCCCCc-----HHHHHHHHHHHHHhCCCCCCCCccHHHHHHHHHHHhhhcCCC
Q psy10899 112 NQIVSCKGPSGHN-----AEYVLRLAIFMRENIPDAHVHDPELFTLEIMIRSRLKEKKIP 166 (218)
Q Consensus 112 ~~Ia~a~GpSG~N-----~EYL~~La~~Lr~~~pg~gi~D~hL~~Le~~Vr~~~~~~~~~ 166 (218)
+.+|.+.|-+=+. .+|+.+.+..|.-+ +.++.|..+..+++.+.+.+..++.+
T Consensus 27 ~~vA~~~Gv~eStISR~k~~~~~~~a~lLa~L--~~~v~~~~i~~~~~~~~~~l~~~K~p 84 (91)
T PF05269_consen 27 KKVAEAMGVDESTISRWKNDFIEKMAMLLAAL--ELGVEDSEIARVAKQAAEILTKKKRP 84 (91)
T ss_dssp HHHHHHHTSSTTTHHHHHHHHHHHHHHHHHHT--TTTHHHHHHHHHHHHHHHHHHT----
T ss_pred HHHHHHhCCCHHHHHHHHhhHHHHHHHHHHHH--HhcccccHHHHHHHHHHHHHHhccCC
Confidence 4567777776664 48899999999977 55779999999999998887655443
No 17
>PF11784 DUF3320: Protein of unknown function (DUF3320); InterPro: IPR021754 This family is conserved in Proteobacteria and Chlorobi families. Many members are annotated as being putative DNA helicase-related proteins.
Probab=25.62 E-value=56 Score=22.09 Aligned_cols=21 Identities=29% Similarity=0.256 Sum_probs=15.4
Q ss_pred CCCCCHHHHHHHHHcccCCCC
Q psy10899 102 LGEAPLHEIANQIVSCKGPSG 122 (218)
Q Consensus 102 ~G~~~le~iA~~Ia~a~GpSG 122 (218)
.||...+.+++.|+++.|-+=
T Consensus 24 EgPI~~~~L~~Ri~~a~G~~R 44 (52)
T PF11784_consen 24 EGPIHEDELARRIARAWGLSR 44 (52)
T ss_pred cCCccHHHHHHHHHHHcCccc
Confidence 467777888888888877653
No 18
>PF09286 Pro-kuma_activ: Pro-kumamolisin, activation domain ; InterPro: IPR015366 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found at the N terminus of peptidases belonging to MEROPS peptidase family S53 (sedolisin, clan SB). The domain adopts a ferredoxin-like fold, with an alpha+beta sandwich. Cleavage of the domain results in activation of the peptidase []. ; GO: 0008236 serine-type peptidase activity; PDB: 1T1E_A 3EDY_A 3EE6_A.
Probab=25.13 E-value=1.1e+02 Score=24.24 Aligned_cols=44 Identities=14% Similarity=0.113 Sum_probs=27.6
Q ss_pred ecCCCCCCCCCCCHHHHHHHHHcccCCCCCcHHHHHHHHHHHHHhCCCCCCCCcc
Q psy10899 94 ACPTNSMWLGEAPLHEIANQIVSCKGPSGHNAEYVLRLAIFMRENIPDAHVHDPE 148 (218)
Q Consensus 94 at~~n~~y~G~~~le~iA~~Ia~a~GpSG~N~EYL~~La~~Lr~~~pg~gi~D~h 148 (218)
.+|.+|+|--.++.+++++.. |+..+.+-....+|++. |+....
T Consensus 37 sdP~s~~Ygk~Lt~~e~~~~~-------~p~~~~v~~V~~wL~~~----G~~~~~ 80 (143)
T PF09286_consen 37 SDPGSPNYGKYLTPEEFAALF-------APSPEDVAAVKSWLKSH----GLTVVE 80 (143)
T ss_dssp HTTTSTTTT----HHHHHHHH-------S--HHHHHHHHHHHHHC----T-EEEE
T ss_pred cCCCCcccccCCCHHHHHHHH-------CCCHHHHHHHHHHHHHc----CCceeE
Confidence 457788887778888888876 57788888888888887 445544
No 19
>COG0177 Nth Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]
Probab=24.59 E-value=1.3e+02 Score=26.32 Aligned_cols=50 Identities=20% Similarity=0.082 Sum_probs=39.7
Q ss_pred CCCCCHHHHHHHHHcccCCCCCcHHHHHHHHHHHHHhCCCCCCCCccHHHHHH
Q psy10899 102 LGEAPLHEIANQIVSCKGPSGHNAEYVLRLAIFMRENIPDAHVHDPELFTLEI 154 (218)
Q Consensus 102 ~G~~~le~iA~~Ia~a~GpSG~N~EYL~~La~~Lr~~~pg~gi~D~hL~~Le~ 154 (218)
+-.++.++++..|. ++|-.=.=+.||..+++.|-+.+.|- ...-+.+|+.
T Consensus 64 l~~a~~~~l~~~I~-~iGlyr~KAk~I~~~~~~l~e~~~g~--vP~~~~eL~~ 113 (211)
T COG0177 64 LLNADEEELEELIK-SIGLYRNKAKNIKELARILLEKFGGE--VPDTREELLS 113 (211)
T ss_pred HHcCCHHHHHHHHH-hcCCcHHHHHHHHHHHHHHHHHcCCC--CCchHHHHHh
Confidence 34578899999998 88999999999999999999997664 4544555543
No 20
>PRK04456 acetyl-CoA decarbonylase/synthase complex subunit beta; Reviewed
Probab=23.57 E-value=1.4e+02 Score=29.23 Aligned_cols=47 Identities=21% Similarity=0.323 Sum_probs=38.8
Q ss_pred CCCcEEEEEEEecC------CCCCCCCC----CCHHHHHHHHH---cccCCCCCcHHHHH
Q psy10899 83 TSQPIRTMMFIACP------TNSMWLGE----APLHEIANQIV---SCKGPSGHNAEYVL 129 (218)
Q Consensus 83 ~g~~v~AlvYiat~------~n~~y~G~----~~le~iA~~Ia---~a~GpSG~N~EYL~ 129 (218)
+...++|++|+.-. -|..|.|. .+..++|.++. +..|..|....|+.
T Consensus 279 sCGCFE~i~~~iPe~~G~~iV~R~y~g~TP~Gm~FstlAg~~gGG~Q~~GF~Gi~~~y~~ 338 (463)
T PRK04456 279 SCGCFEAVAFYIPEVDGIGIVHREYKGETPNGLPFSTMAGQTSGGKQVEGFLGISIEYMR 338 (463)
T ss_pred ccchhhhheeeeecCceeEEEecCcCCCCCCCCchHHhhhccCCccccCCceeeehhhcc
Confidence 56789999987643 47789886 46899999997 78899999999985
No 21
>PF07454 SpoIIP: Stage II sporulation protein P (SpoIIP); InterPro: IPR010897 This family contains the bacterial stage II sporulation protein P (SpoIIP) (approximately 350 residues long). It has been shown that a block in polar cytokinesis in Bacillus subtilis is mediated partly by transcription of spoIID, spoIIM and spoIIP. This inhibition of polar division is involved in the locking in of asymmetry after the formation of a polar septum during sporulation []. SpoIIP is one of the three genes (spoIID, spoIIM and spoIIP, [, , ]), under the control of sigma E, that have been shown to be essential for the engulfment of the forespore by the mother cell. Their products are involved in degradation of the septal peptidoglycan and mutations in spoIID, spoIIM or spoIIP block sporulation at morphological stage II, prior to the stage of engulfment. These three genes are absolutely conserved (sometimes even duplicated) in all endospore formers [].
Probab=21.53 E-value=1.2e+02 Score=27.56 Aligned_cols=40 Identities=18% Similarity=0.399 Sum_probs=36.0
Q ss_pred CCCcEEEEEEEecCCCCCCCCCCCHHHHHHHHHcccCCCCCcHHHHHHHHHHHHHhCCCC
Q psy10899 83 TSQPIRTMMFIACPTNSMWLGEAPLHEIANQIVSCKGPSGHNAEYVLRLAIFMRENIPDA 142 (218)
Q Consensus 83 ~g~~v~AlvYiat~~n~~y~G~~~le~iA~~Ia~a~GpSG~N~EYL~~La~~Lr~~~pg~ 142 (218)
.|..+-=+.||...+||.| -.|..+-.+|...+.+++||+
T Consensus 182 nG~~~Aki~fVvG~~np~~--------------------~~N~~fA~~l~~~~~~~yPGl 221 (268)
T PF07454_consen 182 NGKNYAKIMFVVGRDNPNW--------------------EKNLAFAKQLHAKLEKKYPGL 221 (268)
T ss_pred CCEEeeEEEEEEcCCCCCH--------------------HHHHHHHHHHHHHHHhHCCCc
Confidence 5677888889999999999 689999999999999999987
No 22
>COG4759 Uncharacterized protein conserved in bacteria containing thioredoxin-like domain [Posttranslational modification, protein turnover, chaperones]
Probab=20.28 E-value=56 Score=30.38 Aligned_cols=28 Identities=14% Similarity=0.219 Sum_probs=20.9
Q ss_pred eeecccCCC--CCCCCCCeeEEEeeCCCCeEEEE
Q psy10899 3 CLLAWSGGN--SSTRTRVRYFGHILSTRSIVWGR 34 (218)
Q Consensus 3 ~~~~wq~s~--~T~e~PGrV~tL~~~~gg~cwGv 34 (218)
-||+||.|+ |+.=.| ||+.-+.|..||.
T Consensus 168 ~lRvWq~SHfgGHrFAP----TlidlP~GqyyG~ 197 (316)
T COG4759 168 NLRVWQSSHFGGHRFAP----TLIDLPQGQYYGH 197 (316)
T ss_pred ceEEEEecccCccccCc----hhhcCCCCceeee
Confidence 478999999 566677 5565577888875
Done!