Query psy5105
Match_columns 100
No_of_seqs 112 out of 364
Neff 5.0
Searched_HMMs 46136
Date Fri Aug 16 17:24:23 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5105.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5105hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF05971 Methyltransf_10: Prot 100.0 7.7E-46 1.7E-50 292.7 4.0 97 1-97 1-98 (299)
2 PRK11727 23S rRNA mA1618 methy 100.0 5.8E-42 1.2E-46 272.1 9.6 95 1-95 10-105 (321)
3 KOG2912|consensus 100.0 9.6E-42 2.1E-46 274.6 9.4 99 1-99 1-100 (419)
4 COG3129 Predicted SAM-dependen 99.9 4.1E-27 8.9E-32 183.7 7.2 71 26-96 1-72 (292)
5 KOG3357|consensus 81.4 0.7 1.5E-05 34.0 0.9 53 20-73 30-90 (167)
6 PF12368 DUF3650: Protein of u 69.0 1.8 3.8E-05 23.7 0.2 10 6-15 3-12 (28)
7 cd02873 GH18_IDGF The IDGF's ( 56.2 29 0.00063 28.5 5.2 76 19-94 65-181 (413)
8 smart00636 Glyco_18 Glycosyl h 54.6 45 0.00098 25.7 5.8 75 19-94 57-138 (334)
9 PF14492 EFG_II: Elongation Fa 49.0 34 0.00075 21.4 3.7 48 20-69 26-73 (75)
10 PF01113 DapB_N: Dihydrodipico 47.8 23 0.00049 24.1 2.9 26 40-67 69-94 (124)
11 cd06544 GH18_narbonin Narbonin 47.0 62 0.0013 25.0 5.5 71 18-95 60-142 (253)
12 cd06591 GH31_xylosidase_XylS X 46.6 44 0.00095 26.3 4.7 46 16-66 94-151 (319)
13 KOG4513|consensus 46.2 34 0.00074 29.5 4.2 66 27-97 248-339 (531)
14 cd00598 GH18_chitinase-like Th 45.2 99 0.0021 21.8 6.0 75 18-95 54-137 (210)
15 cd06599 GH31_glycosidase_Aec37 43.4 56 0.0012 25.7 4.8 25 42-66 134-160 (317)
16 cd02872 GH18_chitolectin_chito 41.5 77 0.0017 24.9 5.3 76 19-95 61-147 (362)
17 cd06594 GH31_glucosidase_YihQ 39.4 63 0.0014 25.6 4.5 27 41-67 131-160 (317)
18 cd06597 GH31_transferase_CtsY 39.2 43 0.00093 26.8 3.6 26 41-66 152-179 (340)
19 cd06548 GH18_chitinase The GH1 39.0 80 0.0017 24.6 5.1 76 18-94 74-164 (322)
20 PF07624 PSD2: Protein of unkn 38.9 17 0.00037 23.1 1.0 54 12-66 5-63 (76)
21 PF13708 Methyltransf_27: Meth 37.6 44 0.00095 24.8 3.2 26 37-62 165-190 (194)
22 PF08984 DUF1858: Domain of un 35.0 18 0.00038 21.8 0.6 28 19-50 7-34 (59)
23 COG4536 CorB Putative Mg2+ and 34.3 25 0.00055 29.8 1.6 39 25-72 330-368 (423)
24 PF04847 Calcipressin: Calcipr 33.8 21 0.00046 26.6 1.0 65 17-82 10-90 (184)
25 KOG0942|consensus 33.1 14 0.00031 34.4 -0.0 68 17-91 771-844 (1001)
26 PF13893 RRM_5: RNA recognitio 32.7 80 0.0017 17.8 3.2 38 21-58 3-41 (56)
27 TIGR03595 Obg_CgtA_exten Obg f 31.5 45 0.00097 20.9 2.1 17 42-58 24-40 (69)
28 cd06592 GH31_glucosidase_KIAA1 30.1 71 0.0015 24.9 3.5 26 41-66 130-157 (303)
29 cd06602 GH31_MGAM_SI_GAA This 29.4 80 0.0017 25.2 3.7 28 42-69 131-161 (339)
30 cd02879 GH18_plant_chitinase_c 28.9 1.4E+02 0.003 23.2 4.9 72 18-94 55-138 (299)
31 PRK11104 hemG protoporphyrinog 27.5 59 0.0013 23.4 2.4 18 43-60 155-172 (177)
32 TIGR00043 metalloprotein, YbeY 27.3 78 0.0017 21.5 2.9 34 39-72 1-35 (110)
33 TIGR02130 dapB_plant dihydrodi 26.9 72 0.0016 25.4 3.0 21 41-61 72-92 (275)
34 PF02244 Propep_M14: Carboxype 26.4 1.3E+02 0.0027 18.2 3.5 49 43-98 3-58 (74)
35 PF13979 SopA_C: SopA-like cat 26.2 1E+02 0.0022 23.2 3.6 44 44-96 96-139 (172)
36 COG2170 Uncharacterized conser 26.1 86 0.0019 26.3 3.4 32 68-99 179-210 (369)
37 COG0289 DapB Dihydrodipicolina 26.1 66 0.0014 25.7 2.7 26 40-65 71-96 (266)
38 PF12563 Hemolysin_N: Hemolyti 24.4 1.2E+02 0.0026 23.2 3.7 40 27-67 66-107 (187)
39 PF12018 DUF3508: Domain of un 24.0 1.3E+02 0.0028 23.6 3.9 42 16-57 194-250 (281)
40 smart00362 RRM_2 RNA recogniti 22.8 1.4E+02 0.003 16.1 3.2 37 18-54 15-55 (72)
41 PRK14434 acylphosphatase; Prov 22.7 1.2E+02 0.0027 19.9 3.1 42 20-61 22-65 (92)
42 PLN02775 Probable dihydrodipic 22.7 96 0.0021 24.9 3.0 21 41-61 83-103 (286)
43 PF14330 DUF4387: Domain of un 22.4 1E+02 0.0023 21.2 2.7 38 29-66 4-51 (99)
44 cd06545 GH18_3CO4_chitinase Th 22.3 2E+02 0.0044 21.5 4.6 42 46-94 79-127 (253)
45 COG1713 Predicted HD superfami 22.1 1.5E+02 0.0034 22.5 3.9 73 20-95 50-132 (187)
46 KOG0799|consensus 21.5 1.6E+02 0.0035 24.8 4.2 16 6-21 125-146 (439)
47 COG3228 Uncharacterized protei 21.0 63 0.0014 25.9 1.6 22 72-93 166-187 (266)
48 TIGR02691 arsC_pI258_fam arsen 20.8 20 0.00043 24.7 -1.1 35 52-87 60-96 (129)
49 PRK10658 putative alpha-glucos 20.5 2.7E+02 0.0058 24.7 5.5 29 41-69 384-414 (665)
50 PRK14432 acylphosphatase; Prov 20.3 1.5E+02 0.0033 19.5 3.1 39 21-60 23-63 (93)
No 1
>PF05971 Methyltransf_10: Protein of unknown function (DUF890); InterPro: IPR010286 This family consists of several conserved hypothetical proteins from both eukaryotes and prokaryotes. The function of members of this family are unknown but are predicted to be SAM-dependent methyltransferases.; GO: 0008168 methyltransferase activity; PDB: 2H00_A.
Probab=100.00 E-value=7.7e-46 Score=292.74 Aligned_cols=97 Identities=43% Similarity=0.809 Sum_probs=39.7
Q ss_pred CCCCCCCCCCCCCCCCccHHHHHHhCCchhhchhhcCCCcEEeeCCCHHHHHHHHHHHHhhhhCce-EecCCCCcCCCCc
Q psy5105 1 MSFNKFMHPRNKYRVPPDFKQLAIEYPEFRKHLHQELSGKLKFNFQNPDALRIFTTTVLRKDFGLN-VEIPPMRLVPTLP 79 (100)
Q Consensus 1 m~~~~~mHprN~y~~~~DF~~La~~~p~l~~~v~~~~~G~~~iDF~~~~Av~~LtkaLL~~dfgL~-~~lP~~~LcP~iP 79 (100)
|+.++.|||||+|+++|||++||++||+|++||..+++|+.+|||+|++||++||||||++||||+ ||||+|+||||||
T Consensus 1 ~~~~~~mHprN~~~~~~dF~~L~~~~p~l~~~v~~~~~g~~~idF~~~~Av~~Ln~aLLk~dfgl~~wdiP~~~LcP~iP 80 (299)
T PF05971_consen 1 MAQKKSMHPRNPYKDRYDFAALAKKYPELKKFVIINKKGRVSIDFSDPEAVRELNKALLKHDFGLDVWDIPEGRLCPPIP 80 (299)
T ss_dssp ------------------------------------------S-TTSHHHHHHHHHHHHHHHH--------TTS----HH
T ss_pred CCCcCCCCCCCCCCCCCCHHHHHHhCcchhHhhEECCCCcEEEecCCHHHHHHHHHHHHHHhcCCccccCCCCCcCCCCc
Confidence 999999999999999999999999999999999999999999999999999999999999999999 7999999999999
Q ss_pred chHHHHHHHHHHHhhcCC
Q psy5105 80 LRLNYILWIEDLLEANQI 97 (100)
Q Consensus 80 ~R~nYI~wi~DLL~~~~~ 97 (100)
||+||||||+|||..+.-
T Consensus 81 ~R~nYi~~i~DlL~~~~~ 98 (299)
T PF05971_consen 81 NRLNYIHWIADLLASSNP 98 (299)
T ss_dssp HHHHHHHHHHHHHT--TC
T ss_pred hhHHHHHHHHHHhhcccc
Confidence 999999999999987643
No 2
>PRK11727 23S rRNA mA1618 methyltransferase; Provisional
Probab=100.00 E-value=5.8e-42 Score=272.09 Aligned_cols=95 Identities=29% Similarity=0.491 Sum_probs=90.9
Q ss_pred CCCCCCCCCCCCCCCCccHHHHHHhCCchhhchhhcCCCcEEeeCCCHHHHHHHHHHHHhhhhCce-EecCCCCcCCCCc
Q psy5105 1 MSFNKFMHPRNKYRVPPDFKQLAIEYPEFRKHLHQELSGKLKFNFQNPDALRIFTTTVLRKDFGLN-VEIPPMRLVPTLP 79 (100)
Q Consensus 1 m~~~~~mHprN~y~~~~DF~~La~~~p~l~~~v~~~~~G~~~iDF~~~~Av~~LtkaLL~~dfgL~-~~lP~~~LcP~iP 79 (100)
|..++.|||||+|+++|||++|+++||+|++||..|+.|+.+|||+|++||++||||||++||||+ ||||+++||||||
T Consensus 10 ~~~~~~~h~rn~~~~~~df~~L~~~~p~l~~~v~~~~~g~~~idF~~~~Av~~LnkalL~~~ygl~~wdip~~~LcPpiP 89 (321)
T PRK11727 10 SAQKPGLHPRNRHRGRYDFAALIQSHPELKPFVILNPYGEQSIDFANPLAVKALNKALLAHFYGVAHWDIPAGYLCPPIP 89 (321)
T ss_pred CccccCCCCCCcCCCCCCHHHHHHhChhHHHHhccCCCCCeeeeCCCHHHHHHHHHHHHHHhcCCCcccCCCCCcCCCCC
Confidence 345788999999999999999999999999999999999999999999999999999999999999 8999999999999
Q ss_pred chHHHHHHHHHHHhhc
Q psy5105 80 LRLNYILWIEDLLEAN 95 (100)
Q Consensus 80 ~R~nYI~wi~DLL~~~ 95 (100)
+|+||||||+|||..+
T Consensus 90 ~R~~Yi~~l~dll~~~ 105 (321)
T PRK11727 90 GRADYIHHLADLLAED 105 (321)
T ss_pred cHHHHHHHHHHHhccc
Confidence 9999999999999764
No 3
>KOG2912|consensus
Probab=100.00 E-value=9.6e-42 Score=274.58 Aligned_cols=99 Identities=54% Similarity=0.997 Sum_probs=95.8
Q ss_pred CCCCCCCCCCCCCCC-CccHHHHHHhCCchhhchhhcCCCcEEeeCCCHHHHHHHHHHHHhhhhCceEecCCCCcCCCCc
Q psy5105 1 MSFNKFMHPRNKYRV-PPDFKQLAIEYPEFRKHLHQELSGKLKFNFQNPDALRIFTTTVLRKDFGLNVEIPPMRLVPTLP 79 (100)
Q Consensus 1 m~~~~~mHprN~y~~-~~DF~~La~~~p~l~~~v~~~~~G~~~iDF~~~~Av~~LtkaLL~~dfgL~~~lP~~~LcP~iP 79 (100)
|++++.|||||+|++ ||||+.||.+||+|++||..+.+|++++||+|++|||+||++||++||||.++||+++|||+||
T Consensus 1 M~~~k~mhpRn~Y~dkPPDfa~LaseyPsfK~fvq~~~ngRv~~Dfkd~~AvR~Lt~tLL~~Dfgl~veiP~grLcPtVP 80 (419)
T KOG2912|consen 1 MALSKSMHPRNRYKDKPPDFAYLASEYPSFKQFVQINLNGRVSLDFKDPEAVRALTCTLLREDFGLSVEIPLGRLCPTVP 80 (419)
T ss_pred CCcccccCCcccccCCCccHHHHHHhCccchhheEeccCCeEEeecCCHHHHHHHHHHHHhhccCceEecCccccCCCCc
Confidence 999999999999999 9999999999999999999999999999999999999999999999999999999999999999
Q ss_pred chHHHHHHHHHHHhhcCCCC
Q psy5105 80 LRLNYILWIEDLLEANQISS 99 (100)
Q Consensus 80 ~R~nYI~wi~DLL~~~~~~~ 99 (100)
||+||||||+|||.+..-++
T Consensus 81 nR~nYihwI~DLLss~q~~k 100 (419)
T KOG2912|consen 81 NRLNYIHWIEDLLSSQQSDK 100 (419)
T ss_pred cchhhHHHHHHHhhcccCCC
Confidence 99999999999999876543
No 4
>COG3129 Predicted SAM-dependent methyltransferase [General function prediction only]
Probab=99.94 E-value=4.1e-27 Score=183.70 Aligned_cols=71 Identities=28% Similarity=0.503 Sum_probs=68.2
Q ss_pred CCchhhchhhcCCCcEEeeCCCHHHHHHHHHHHHhhhhCce-EecCCCCcCCCCcchHHHHHHHHHHHhhcC
Q psy5105 26 YPEFRKHLHQELSGKLKFNFQNPDALRIFTTTVLRKDFGLN-VEIPPMRLVPTLPLRLNYILWIEDLLEANQ 96 (100)
Q Consensus 26 ~p~l~~~v~~~~~G~~~iDF~~~~Av~~LtkaLL~~dfgL~-~~lP~~~LcP~iP~R~nYI~wi~DLL~~~~ 96 (100)
.|+|..|+..++.|+.+|||.||.||+.||||||.++|+++ ||||+|+||||||||++|||+|+|||.+..
T Consensus 1 ~Pel~~f~~~~p~G~~siDFanp~AVk~LnKAlL~~fY~v~~wdiPeg~LCPpvPgRAdYih~laDLL~s~~ 72 (292)
T COG3129 1 MPELILFLRLTPAGRQSIDFANPLAVKALNKALLAHFYAVRYWDIPEGFLCPPVPGRADYIHHLADLLASTS 72 (292)
T ss_pred CcceeeeeeccCCCceeeccCCHHHHHHHHHHHHHHhcceeEecCCCCCcCCCCCChhHHHHHHHHHHHhcC
Confidence 38999999999999999999999999999999999999998 999999999999999999999999998654
No 5
>KOG3357|consensus
Probab=81.37 E-value=0.7 Score=34.02 Aligned_cols=53 Identities=21% Similarity=0.427 Sum_probs=42.5
Q ss_pred HHHHHhCCchhhchhhcCCC-c--EEeeCCCHHHHHHHHHH-----HHhhhhCceEecCCCC
Q psy5105 20 KQLAIEYPEFRKHLHQELSG-K--LKFNFQNPDALRIFTTT-----VLRKDFGLNVEIPPMR 73 (100)
Q Consensus 20 ~~La~~~p~l~~~v~~~~~G-~--~~iDF~~~~Av~~Ltka-----LL~~dfgL~~~lP~~~ 73 (100)
+.|.+.|..|-.||..|++. + ..+. +|+|.+|-..|+ |||..|++++|||-.+
T Consensus 30 qrlkeey~sli~yvqnnk~~d~dwfrle-sn~egtrwfgkcwy~hnllkyefdvefdipity 90 (167)
T KOG3357|consen 30 QRLKEEYQSLIAYVQNNKSNDNDWFRLE-SNKEGTRWFGKCWYVHNLLKYEFDVEFDIPITY 90 (167)
T ss_pred HHHHHHHHHHHHHHHhCcccCCcceEec-cCccccceehhhhHhhhhhhheeeeeecccccc
Confidence 57899999999999887753 2 2344 788988877775 8999999999999765
No 6
>PF12368 DUF3650: Protein of unknown function (DUF3650) ; InterPro: IPR022111 This domain family is found in bacteria, and is approximately 30 amino acids in length. The family is found in association with PF00581 from PFAM. There is a single completely conserved residue N that may be functionally important.
Probab=68.97 E-value=1.8 Score=23.66 Aligned_cols=10 Identities=50% Similarity=0.909 Sum_probs=7.7
Q ss_pred CCCCCCCCCC
Q psy5105 6 FMHPRNKYRV 15 (100)
Q Consensus 6 ~mHprN~y~~ 15 (100)
+-||||.|-.
T Consensus 3 r~hprNrYV~ 12 (28)
T PF12368_consen 3 RVHPRNRYVK 12 (28)
T ss_pred ccCcchhhHH
Confidence 4699999953
No 7
>cd02873 GH18_IDGF The IDGF's (imaginal disc growth factors) are a family of growth factors identified in insects that include at least five members, some of which are encoded by genes in a tight cluster. The IDGF's have an eight-stranded alpha/beta barrel fold and are related to the glycosyl hydrolase family 18 (GH18) chitinases, but they have an amino acid substitution known to abolish chitinase catalytic activity. IDGFs may have evolved from chitinases to gain new functions as growth factors, interacting with cell surface glycoproteins involved in growth-promoting processes.
Probab=56.20 E-value=29 Score=28.48 Aligned_cols=76 Identities=16% Similarity=0.291 Sum_probs=45.0
Q ss_pred HHHHHHhCCchhhchhhcCCCc---------EEeeCCCHHHHHHHHHHHH----hhhh-Cc--eEecCCCC---------
Q psy5105 19 FKQLAIEYPEFRKHLHQELSGK---------LKFNFQNPDALRIFTTTVL----RKDF-GL--NVEIPPMR--------- 73 (100)
Q Consensus 19 F~~La~~~p~l~~~v~~~~~G~---------~~iDF~~~~Av~~LtkaLL----~~df-gL--~~~lP~~~--------- 73 (100)
|.+|.+++|.++-.+.....+. .+.-.+|++.-+.+.++++ +..| || +|+.|...
T Consensus 65 ~~~lk~~~p~lKvllSiGGw~~~~~~~~s~~fs~~~~~~~~R~~Fi~siv~~l~~~~fDGidiDWEyP~~~~~~~~g~~~ 144 (413)
T cd02873 65 ITSLKRKYPHLKVLLSVGGDRDTDEEGENEKYLLLLESSESRNAFINSAHSLLKTYGFDGLDLAWQFPKNKPKKVRGTFG 144 (413)
T ss_pred HHHHHhhCCCCeEEEeecCCCCCCCcccchhhHHHhCCHHHHHHHHHHHHHHHHHcCCCCeEeeeeCCCCcccccccccc
Confidence 5678899999998887543221 1111345555444554444 4445 34 48888632
Q ss_pred --------------cCC--CCcchHHHHHHHHHHHhh
Q psy5105 74 --------------LVP--TLPLRLNYILWIEDLLEA 94 (100)
Q Consensus 74 --------------LcP--~iP~R~nYI~wi~DLL~~ 94 (100)
.++ ..-.|.||+.-|++|=..
T Consensus 145 ~~~~~~~~~~~g~~~~~~~~~~d~~nf~~Ll~elr~~ 181 (413)
T cd02873 145 SAWHSFKKLFTGDSVVDEKAAEHKEQFTALVRELKNA 181 (413)
T ss_pred hhhhhhhcccccccccCCCChhHHHHHHHHHHHHHHH
Confidence 122 224699999999988443
No 8
>smart00636 Glyco_18 Glycosyl hydrolase family 18.
Probab=54.60 E-value=45 Score=25.69 Aligned_cols=75 Identities=23% Similarity=0.289 Sum_probs=42.0
Q ss_pred HHHHHHhCCchhhchhhcCCCcEEeeC----CCHHHHHHHHHHHH--hhhhCce-EecCCCCcCCCCcchHHHHHHHHHH
Q psy5105 19 FKQLAIEYPEFRKHLHQELSGKLKFNF----QNPDALRIFTTTVL--RKDFGLN-VEIPPMRLVPTLPLRLNYILWIEDL 91 (100)
Q Consensus 19 F~~La~~~p~l~~~v~~~~~G~~~iDF----~~~~Av~~LtkaLL--~~dfgL~-~~lP~~~LcP~iP~R~nYI~wi~DL 91 (100)
|.+|++++|.++-.+.....+. +-.| +|+++...+.+.+. ...||++ ++|-=...-+.-..|.+|+..|.+|
T Consensus 57 ~~~l~~~~~~~kvl~svgg~~~-s~~f~~~~~~~~~r~~fi~~i~~~~~~~~~DGidiDwE~~~~~~~d~~~~~~ll~~l 135 (334)
T smart00636 57 LKALKKKNPGLKVLLSIGGWTE-SDNFSSMLSDPASRKKFIDSIVSFLKKYGFDGIDIDWEYPGARGDDRENYTALLKEL 135 (334)
T ss_pred HHHHHHhCCCCEEEEEEeCCCC-CcchhHHHCCHHHHHHHHHHHHHHHHHcCCCeEEECCcCCCCCccHHHHHHHHHHHH
Confidence 6788899998887775543221 1123 45666666665553 3344444 3332222111114688999998887
Q ss_pred Hhh
Q psy5105 92 LEA 94 (100)
Q Consensus 92 L~~ 94 (100)
=..
T Consensus 136 r~~ 138 (334)
T smart00636 136 REA 138 (334)
T ss_pred HHH
Confidence 543
No 9
>PF14492 EFG_II: Elongation Factor G, domain II; PDB: 1WDT_A 2DY1_A 2XEX_A 1ELO_A 2XSY_Y 2WRK_Y 1DAR_A 2WRI_Y 2XUY_Y 3J0E_H ....
Probab=48.96 E-value=34 Score=21.43 Aligned_cols=48 Identities=25% Similarity=0.336 Sum_probs=32.0
Q ss_pred HHHHHhCCchhhchhhcCCCcEEeeCCCHHHHHHHHHHHHhhhhCceEec
Q psy5105 20 KQLAIEYPEFRKHLHQELSGKLKFNFQNPDALRIFTTTVLRKDFGLNVEI 69 (100)
Q Consensus 20 ~~La~~~p~l~~~v~~~~~G~~~iDF~~~~Av~~LtkaLL~~dfgL~~~l 69 (100)
.+|++..|.|+-... ...|...|-.- =+--.+.-...|++.||+++++
T Consensus 26 ~~l~~eDP~l~~~~d-~et~e~~l~g~-Gelhlev~~~~L~~~~~v~v~~ 73 (75)
T PF14492_consen 26 QKLSEEDPSLRVERD-EETGELILSGM-GELHLEVLLERLKRRFGVEVEF 73 (75)
T ss_dssp HHHHHH-TTSEEEEE-TTTSEEEEEES-SHHHHHHHHHHHHHTTCEBEEE
T ss_pred HHHHhcCCeEEEEEc-chhceEEEEEC-CHHHHHHHHHHHHHHHCCeeEe
Confidence 578888998875442 23566555543 3455677788899999999875
No 10
>PF01113 DapB_N: Dihydrodipicolinate reductase, N-terminus; InterPro: IPR000846 Dihydrodipicolinate reductase catalyzes the second step in the biosynthesis of diaminopimelic acid and lysine, the NAD or NADP-dependent reduction of 2,3-dihydrodipicolinate into 2,3,4,5-tetrahydrodipicolinate [, , ]. In Escherichia coli and Mycobacterium tuberculosis, dihydrodipicolinate reductase has equal specificity for NADH and NADPH, however in Thermotoga maritima there it has a greater affinity for NADPH []. In addition, the enzyme is inhibited by high concentrations of its substrate, which consequently acts as a feedback control on the lysine biosynthesis pathway. In T. maritima, the enzyme also lacks N-terminal and C-terminal loops which are present in enzyme of the former two organisms. This entry represents the N-terminal domain of dihydrodipicolinate reductase which binds the dinucleotide NAD(P)H.; GO: 0008839 dihydrodipicolinate reductase activity, 0009089 lysine biosynthetic process via diaminopimelate, 0055114 oxidation-reduction process; PDB: 3QY9_D 1VM6_C 1ARZ_A 1DIH_A 1DRW_A 1DRV_A 1DRU_A 2DAP_A 1DAP_B 3DAP_A ....
Probab=47.79 E-value=23 Score=24.06 Aligned_cols=26 Identities=27% Similarity=0.460 Sum_probs=20.4
Q ss_pred cEEeeCCCHHHHHHHHHHHHhhhhCceE
Q psy5105 40 KLKFNFQNPDALRIFTTTVLRKDFGLNV 67 (100)
Q Consensus 40 ~~~iDF~~~~Av~~LtkaLL~~dfgL~~ 67 (100)
.+-|||++|+++...-+..+++ |+.+
T Consensus 69 DVvIDfT~p~~~~~~~~~~~~~--g~~~ 94 (124)
T PF01113_consen 69 DVVIDFTNPDAVYDNLEYALKH--GVPL 94 (124)
T ss_dssp SEEEEES-HHHHHHHHHHHHHH--T-EE
T ss_pred CEEEEcCChHHhHHHHHHHHhC--CCCE
Confidence 3789999999999999999887 6653
No 11
>cd06544 GH18_narbonin Narbonin is a plant 2S protein from the globulin fraction of narbon bean (Vicia narbonensis L.) cotyledons with unknown function. Narbonin has a glycosyl hydrolase family 18 (GH18) domain without the conserved catalytic residues and with no known enzymatic activity. Narbonin amounts to up to 3% of the total seed globulins of mature seeds and was thought to be a storage protein but was found to degrade too slowly during germination. This family also includes the VfNOD32 nodulin from Vicia faba.
Probab=47.04 E-value=62 Score=25.02 Aligned_cols=71 Identities=14% Similarity=0.188 Sum_probs=41.7
Q ss_pred cHHHHHHhCCchhhchhhcCCCcE-EeeCCCHHHHHHHH----H--HHHhhhhCce-----EecCCCCcCCCCcchHHHH
Q psy5105 18 DFKQLAIEYPEFRKHLHQELSGKL-KFNFQNPDALRIFT----T--TVLRKDFGLN-----VEIPPMRLVPTLPLRLNYI 85 (100)
Q Consensus 18 DF~~La~~~p~l~~~v~~~~~G~~-~iDF~~~~Av~~Lt----k--aLL~~dfgL~-----~~lP~~~LcP~iP~R~nYI 85 (100)
.+.+|.+++|.+|-.+.....+.. +.-+.++.+....- + .-+.+.||++ ||-|. ..|.||.
T Consensus 60 ~~~~lK~~~p~lKvllSiGG~~~~~~~~~~~~~~~~~~~~~fv~S~~~~l~~~~fDGiDiDwE~~~-------~d~~~f~ 132 (253)
T cd06544 60 AVKSIKAQHPNVKVVISIGGRGVQNNPTPFDPSNVDSWVSNAVSSLTSIIQTYNLDGIDIDYEHFP-------ADPDTFV 132 (253)
T ss_pred HHHHHHHhCCCcEEEEEeCCCCCCCCccccCchhhhhHHHHHHHHHHHHHHHhCCCceeeecccCC-------cCHHHHH
Confidence 678999999999888765433211 11233443322221 1 1234555554 45452 2489999
Q ss_pred HHHHHHHhhc
Q psy5105 86 LWIEDLLEAN 95 (100)
Q Consensus 86 ~wi~DLL~~~ 95 (100)
..|.+|....
T Consensus 133 ~ll~~l~~~l 142 (253)
T cd06544 133 ECIGQLITEL 142 (253)
T ss_pred HHHHHHHHHh
Confidence 9999998765
No 12
>cd06591 GH31_xylosidase_XylS XylS is a glycosyl hydrolase family 31 (GH31) alpha-xylosidase found in prokaryotes, eukaryotes, and archaea, that catalyzes the release of alpha-xylose from the non-reducing terminal side of the alpha-xyloside substrate. XylS has been characterized in Sulfolobus solfataricus where it hydrolyzes isoprimeverose, the p-nitrophenyl-beta derivative of isoprimeverose, and xyloglucan oligosaccharides, and has transxylosidic activity. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The XylS family corresponds to subgroup 3 in the Ernst et al classification of GH31 enzymes.
Probab=46.57 E-value=44 Score=26.31 Aligned_cols=46 Identities=17% Similarity=0.278 Sum_probs=28.7
Q ss_pred CccHHHHHHhCCchhhchhhcCCC----------cEEeeCCCHHHHHHHHHHHH--hhhhCce
Q psy5105 16 PPDFKQLAIEYPEFRKHLHQELSG----------KLKFNFQNPDALRIFTTTVL--RKDFGLN 66 (100)
Q Consensus 16 ~~DF~~La~~~p~l~~~v~~~~~G----------~~~iDF~~~~Av~~LtkaLL--~~dfgL~ 66 (100)
...|++..++ .|++.+..| ..-+||+||+|..-..+.+- ..+.|++
T Consensus 94 ~~~y~e~~~~-----g~~v~~~~g~~~~~~w~g~~~~~Dftnp~a~~w~~~~~~~~~~~~Gvd 151 (319)
T cd06591 94 TENYKEMDEK-----GYLIKTDRGPRVTMQFGGNTRFYDATNPEAREYYWKQLKKNYYDKGVD 151 (319)
T ss_pred ChhHHHHHHC-----CEEEEcCCCCeeeeeCCCCccccCCCCHHHHHHHHHHHHHHhhcCCCc
Confidence 4567777765 233333333 25799999999876554432 2357887
No 13
>KOG4513|consensus
Probab=46.21 E-value=34 Score=29.45 Aligned_cols=66 Identities=21% Similarity=0.320 Sum_probs=45.8
Q ss_pred CchhhchhhcCCCcE-----EeeCCC--HHHHHHHHHHHHhhhhC---------ce--------Eec-CCCCcC-CCCcc
Q psy5105 27 PEFRKHLHQELSGKL-----KFNFQN--PDALRIFTTTVLRKDFG---------LN--------VEI-PPMRLV-PTLPL 80 (100)
Q Consensus 27 p~l~~~v~~~~~G~~-----~iDF~~--~~Av~~LtkaLL~~dfg---------L~--------~~l-P~~~Lc-P~iP~ 80 (100)
..|++|++..+.|++ +|-|-| .+-+++.++||=+.+|+ |+ =++ |..+|+ ||--+
T Consensus 248 e~L~p~vi~ge~Gr~~~~DdtiifFnfRADRMr~ia~a~~~~~~d~~~r~~~p~i~~~gMtqYk~el~psk~Lf~pp~i~ 327 (531)
T KOG4513|consen 248 EYLPPFVIVGERGRAVGPDDTIIFFNFRADRMRMIAKALEYNDFDKFDRVRHPNIRYAGMTQYKGELKPSKYLFSPPCID 327 (531)
T ss_pred cccCCeEEECCCCcccCCCCeEEEEecchHHHHHHHHHhccccccccccccCCcceEeehhhhccccCcceeeecccccc
Confidence 358899998877753 344445 44568999999999992 22 256 889999 88888
Q ss_pred hHHHHHHHHHHHhhcCC
Q psy5105 81 RLNYILWIEDLLEANQI 97 (100)
Q Consensus 81 R~nYI~wi~DLL~~~~~ 97 (100)
| -+++.|+..|+
T Consensus 328 ~-----tlaE~La~~gv 339 (531)
T KOG4513|consen 328 R-----TLAEYLAHNGV 339 (531)
T ss_pred c-----hHHHHHHhcCc
Confidence 8 34555555554
No 14
>cd00598 GH18_chitinase-like The GH18 (glycosyl hydrolase, family 18) type II chitinases hydrolyze chitin, an abundant polymer of beta-1,4-linked N-acetylglucosamine (GlcNAc) which is a major component of the cell wall of fungi and the exoskeleton of arthropods. Chitinases have been identified in viruses, bacteria, fungi, protozoan parasites, insects, and plants. The structure of the GH18 domain is an eight-stranded beta/alpha barrel with a pronounced active-site cleft at the C-terminal end of the beta-barrel. The GH18 family includes chitotriosidase, chitobiase, hevamine, zymocin-alpha, narbonin, SI-CLP (stabilin-1 interacting chitinase-like protein), IDGF (imaginal disc growth factor), CFLE (cortical fragment-lytic enzyme) spore hydrolase, the type III and type V plant chitinases, the endo-beta-N-acetylglucosaminidases, and the chitolectins. The GH85 (glycosyl hydrolase, family 85) ENGases (endo-beta-N-acetylglucosaminidases) are closely related to the GH18 chitinases and are inclu
Probab=45.17 E-value=99 Score=21.76 Aligned_cols=75 Identities=19% Similarity=0.259 Sum_probs=42.7
Q ss_pred cHHHHHHhCCchhhchhhcCCCcEEe--eCCCHHHHHHHHHHHHh--hhhCc---e--EecCCCCcCCCCcchHHHHHHH
Q psy5105 18 DFKQLAIEYPEFRKHLHQELSGKLKF--NFQNPDALRIFTTTVLR--KDFGL---N--VEIPPMRLVPTLPLRLNYILWI 88 (100)
Q Consensus 18 DF~~La~~~p~l~~~v~~~~~G~~~i--DF~~~~Av~~LtkaLL~--~dfgL---~--~~lP~~~LcP~iP~R~nYI~wi 88 (100)
.+.+|++++|.++-.+.....+.... -.+++++.+.+.+++.. +.||+ + |+-|.. ... +.|-+|+.-|
T Consensus 54 ~i~~l~~~~~g~kv~~sigg~~~~~~~~~~~~~~~~~~f~~~~~~~v~~~~~DGidiD~E~~~~--~~~-~~~~~~~~ll 130 (210)
T cd00598 54 ALEELASKKPGLKVLISIGGWTDSSPFTLASDPASRAAFANSLVSFLKTYGFDGVDIDWEYPGA--ADN-SDRENFITLL 130 (210)
T ss_pred HHHHHHHhCCCCEEEEEEcCCCCCCCchhhcCHHHHHHHHHHHHHHHHHcCCCceEEeeeCCCC--cCc-cHHHHHHHHH
Confidence 46788888887777665533221111 13677777767666652 33544 3 333332 111 4588999988
Q ss_pred HHHHhhc
Q psy5105 89 EDLLEAN 95 (100)
Q Consensus 89 ~DLL~~~ 95 (100)
++|=...
T Consensus 131 ~~lr~~l 137 (210)
T cd00598 131 RELRSAL 137 (210)
T ss_pred HHHHHHh
Confidence 8876543
No 15
>cd06599 GH31_glycosidase_Aec37 Glycosyl hydrolase family 31 (GH31) domain of a bacterial protein family represented by Escherichia coli protein Aec37. The gene encoding Aec37 (aec-37) is located within a genomic island (AGI-3) isolated from the extraintestinal avian pathogenic Escherichia coli strain BEN2908. The function of Aec37 and its orthologs is unknown; however, deletion of a region of the genome that includes aec-37 affects the assimilation of seven carbohydrates, decreases growth rate of the strain in minimal medium containing galacturonate or trehalose, and attenuates the virulence of E. coli BEN2908 in chickens. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=43.39 E-value=56 Score=25.70 Aligned_cols=25 Identities=24% Similarity=0.477 Sum_probs=19.1
Q ss_pred EeeCCCHHHHHHHHHHH--HhhhhCce
Q psy5105 42 KFNFQNPDALRIFTTTV--LRKDFGLN 66 (100)
Q Consensus 42 ~iDF~~~~Av~~LtkaL--L~~dfgL~ 66 (100)
-+||+||+|..-..+.+ ...+.|++
T Consensus 134 ~~Dftnp~a~~ww~~~~~~~~~~~Gvd 160 (317)
T cd06599 134 FVDFTNPEGREWWKEGVKEALLDLGID 160 (317)
T ss_pred eecCCChHHHHHHHHHHHHHHhcCCCc
Confidence 69999999998777655 33456887
No 16
>cd02872 GH18_chitolectin_chitotriosidase This conserved domain family includes a large number of catalytically inactive chitinase-like lectins (chitolectins) including YKL-39, YKL-40 (HCGP39), YM1, oviductin, and AMCase (acidic mammalian chitinase), as well as catalytically active chitotriosidases. The conserved domain is an eight-stranded alpha/beta barrel fold belonging to the family 18 glycosyl hydrolases. The fold has a pronounced active-site cleft at the C-terminal end of the beta-barrel. The chitolectins lack a key active site glutamate (the proton donor required for hydrolytic activity) but retain highly conserved residues involved in oligosaccharide binding. Chitotriosidase is a chitinolytic enzyme expressed in maturing macrophages, which suggests that it plays a part in antimicrobial defense. Chitotriosidase hydrolyzes chitotriose, as well as colloidal chitin to yield chitobiose and is therefore considered an exochitinase. Chitotriosidase occurs in two major forms, the la
Probab=41.47 E-value=77 Score=24.87 Aligned_cols=76 Identities=22% Similarity=0.315 Sum_probs=43.5
Q ss_pred HHHHHHhCCchhhchhhcCCCcEEeeC----CCHHHHHHHHHHHH----hhhh-Cce--EecCCCCcCCCCcchHHHHHH
Q psy5105 19 FKQLAIEYPEFRKHLHQELSGKLKFNF----QNPDALRIFTTTVL----RKDF-GLN--VEIPPMRLVPTLPLRLNYILW 87 (100)
Q Consensus 19 F~~La~~~p~l~~~v~~~~~G~~~iDF----~~~~Av~~LtkaLL----~~df-gL~--~~lP~~~LcP~iP~R~nYI~w 87 (100)
|.+|.+++|.++-.+.....+..+-.| +|+++...+.+.++ +..| ||+ |+-|..+= =+.-.|.+|+..
T Consensus 61 ~~~lk~~~p~lkvlisiGG~~~~~~~f~~~~~~~~~r~~fi~~iv~~l~~~~~DGidiDwE~p~~~~-~~~~d~~~~~~l 139 (362)
T cd02872 61 FNALKEKNPNLKTLLAIGGWNFGSAKFSAMAASPENRKTFIKSAIAFLRKYGFDGLDLDWEYPGQRG-GPPEDKENFVTL 139 (362)
T ss_pred HHHHHhhCCCceEEEEEcCCCCCcchhHHHhCCHHHHHHHHHHHHHHHHHcCCCCeeeeeeccccCC-CCHHHHHHHHHH
Confidence 567788899999887653221111133 46666555555544 3333 343 66665321 011358999999
Q ss_pred HHHHHhhc
Q psy5105 88 IEDLLEAN 95 (100)
Q Consensus 88 i~DLL~~~ 95 (100)
|++|-+..
T Consensus 140 l~~lr~~l 147 (362)
T cd02872 140 LKELREAF 147 (362)
T ss_pred HHHHHHHH
Confidence 98886543
No 17
>cd06594 GH31_glucosidase_YihQ YihQ is a bacterial alpha-glucosidase with a conserved glycosyl hydrolase family 31 (GH31) domain that catalyzes the release of an alpha-glucosyl residue from the non-reducing end of alpha-glucoside substrates such as alpha-glucosyl fluoride. Orthologs of YihQ that have not yet been functionally characterized are present in plants and fungi. YihQ has sequence similarity to other GH31 enzymes such as CtsZ, a 6-alpha-glucosyltransferase from Bacillus globisporus, and YicI, an alpha-xylosidase from Echerichia coli. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation.
Probab=39.35 E-value=63 Score=25.55 Aligned_cols=27 Identities=26% Similarity=0.356 Sum_probs=20.2
Q ss_pred EEeeCCCHHHHHHHHHHHH--hhhhCce-E
Q psy5105 41 LKFNFQNPDALRIFTTTVL--RKDFGLN-V 67 (100)
Q Consensus 41 ~~iDF~~~~Av~~LtkaLL--~~dfgL~-~ 67 (100)
.-+||+||+|..-..+-+- ..+.|++ |
T Consensus 131 ~~~Dftnp~a~~ww~~~~~~~~~~~Gvdg~ 160 (317)
T cd06594 131 GVLDLTNPAARDWFKQVIKEMLLDLGLSGW 160 (317)
T ss_pred eeeecCCHHHHHHHHHHHHHHhhhcCCcEE
Confidence 4599999999987766653 2567887 5
No 18
>cd06597 GH31_transferase_CtsY CtsY (cyclic tetrasaccharide-synthesizing enzyme Y) is a bacterial 3-alpha-isomaltosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsZ. CtsY and CtsZ both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=39.18 E-value=43 Score=26.79 Aligned_cols=26 Identities=27% Similarity=0.623 Sum_probs=19.2
Q ss_pred EEeeCCCHHHHHHHHHHH--HhhhhCce
Q psy5105 41 LKFNFQNPDALRIFTTTV--LRKDFGLN 66 (100)
Q Consensus 41 ~~iDF~~~~Av~~LtkaL--L~~dfgL~ 66 (100)
.-+||+||+|..-....+ +..+.|++
T Consensus 152 ~~~Dftnp~a~~Ww~~~~~~~~~~~Gid 179 (340)
T cd06597 152 LMLDFTNPEAAQWWMEKRRYLVDELGID 179 (340)
T ss_pred eeecCCCHHHHHHHHHHHHHHHHhcCCc
Confidence 469999999998665544 33468887
No 19
>cd06548 GH18_chitinase The GH18 (glycosyl hydrolases, family 18) type II chitinases hydrolyze chitin, an abundant polymer of N-acetylglucosamine and have been identified in bacteria, fungi, insects, plants, viruses, and protozoan parasites. The structure of this domain is an eight-stranded alpha/beta barrel with a pronounced active-site cleft at the C-terminal end of the beta-barrel.
Probab=38.96 E-value=80 Score=24.61 Aligned_cols=76 Identities=18% Similarity=0.246 Sum_probs=45.2
Q ss_pred cHHHHHHhCCchhhchhhcCCCcEEeeCC----CHHHHHHHHHHHH--hhhhCce-----EecCCCC----cCCCCcchH
Q psy5105 18 DFKQLAIEYPEFRKHLHQELSGKLKFNFQ----NPDALRIFTTTVL--RKDFGLN-----VEIPPMR----LVPTLPLRL 82 (100)
Q Consensus 18 DF~~La~~~p~l~~~v~~~~~G~~~iDF~----~~~Av~~LtkaLL--~~dfgL~-----~~lP~~~----LcP~iP~R~ 82 (100)
.|.+|.+++|.++-.+.....+. +-.|+ |++..+.+.++++ .+.||++ |+-|... -+=..-.|.
T Consensus 74 ~~~~lk~~~p~lkvl~siGG~~~-s~~f~~~~~~~~~r~~Fi~siv~~l~~~~fDGidiDwE~p~~~~~~~~~~~~~d~~ 152 (322)
T cd06548 74 QLRKLKQKNPHLKILLSIGGWTW-SGGFSDAAATEASRAKFADSAVDFIRKYGFDGIDIDWEYPGSGGAPGNVARPEDKE 152 (322)
T ss_pred HHHHHHHhCCCCEEEEEEeCCCC-CCCchhHhCCHHHHHHHHHHHHHHHHhcCCCeEEECCcCCCCCCCCCCCCChhHHH
Confidence 36788889999988776543222 22453 5666666666664 3345443 5556532 111124689
Q ss_pred HHHHHHHHHHhh
Q psy5105 83 NYILWIEDLLEA 94 (100)
Q Consensus 83 nYI~wi~DLL~~ 94 (100)
+|+.-|++|=+.
T Consensus 153 ~~~~ll~~Lr~~ 164 (322)
T cd06548 153 NFTLLLKELREA 164 (322)
T ss_pred HHHHHHHHHHHH
Confidence 999988888543
No 20
>PF07624 PSD2: Protein of unknown function (DUF1585); InterPro: IPR011478 This entry represents a conserved region at the C terminus of a family of cytochrome-like proteins found in bacteria such as Rhodopirellula baltica and Solibacter usitatus. These proteins also contain IPR013036 from INTERPRO, IPR013039 from INTERPRO, IPR013042 from INTERPRO and IPR013043 from INTERPRO.
Probab=38.86 E-value=17 Score=23.07 Aligned_cols=54 Identities=13% Similarity=0.298 Sum_probs=39.7
Q ss_pred CCCCCccHHHHHHhC-Cchhhchhhc----CCCcEEeeCCCHHHHHHHHHHHHhhhhCce
Q psy5105 12 KYRVPPDFKQLAIEY-PEFRKHLHQE----LSGKLKFNFQNPDALRIFTTTVLRKDFGLN 66 (100)
Q Consensus 12 ~y~~~~DF~~La~~~-p~l~~~v~~~----~~G~~~iDF~~~~Av~~LtkaLL~~dfgL~ 66 (100)
.+.+..+|+++-.+. ++|..-+... ..|+ .+.|.|..+|.++..++-..+|++.
T Consensus 5 ~f~~~~eLk~~L~~~~~~~~~~~~~kl~~YAlGR-~~~~~D~~~i~~i~~~~~~~~y~~~ 63 (76)
T PF07624_consen 5 SFEGAAELKQYLAERKDQFARCFAEKLLTYALGR-PLEFSDRCEIDRIVEAFKANGYRLR 63 (76)
T ss_pred ccCCHHHHHHHHHHCHHHHHHHHHHHHHHHHcCC-CCCcchHHHHHHHHHHHHHcCCCHH
Confidence 355677887665555 6676655432 3476 5789999999999999999998874
No 21
>PF13708 Methyltransf_27: Methyltransferase domain
Probab=37.61 E-value=44 Score=24.78 Aligned_cols=26 Identities=15% Similarity=0.356 Sum_probs=22.2
Q ss_pred CCCcEEeeCCCHHHHHHHHHHHHhhh
Q psy5105 37 LSGKLKFNFQNPDALRIFTTTVLRKD 62 (100)
Q Consensus 37 ~~G~~~iDF~~~~Av~~LtkaLL~~d 62 (100)
++|...|-|++++-+..||..|=+++
T Consensus 165 kkGt~Hi~fk~~dl~~klN~ila~~y 190 (194)
T PF13708_consen 165 KKGTAHITFKRPDLVDKLNRILAKHY 190 (194)
T ss_pred cCCcEEEEECCHHHHHHHHHHHHHHc
Confidence 45899999999999999999876553
No 22
>PF08984 DUF1858: Domain of unknown function (DUF1858); InterPro: IPR015077 This protein has no known function. It is found in various hypothetical bacterial proteins. ; PDB: 2K53_A 2K5E_A 2FI0_A.
Probab=34.96 E-value=18 Score=21.79 Aligned_cols=28 Identities=29% Similarity=0.355 Sum_probs=18.8
Q ss_pred HHHHHHhCCchhhchhhcCCCcEEeeCCCHHH
Q psy5105 19 FKQLAIEYPEFRKHLHQELSGKLKFNFQNPDA 50 (100)
Q Consensus 19 F~~La~~~p~l~~~v~~~~~G~~~iDF~~~~A 50 (100)
-.+|.++||++.+++... |= =.+++|..
T Consensus 7 I~el~~~yP~~~~il~~~--gf--~~l~~p~~ 34 (59)
T PF08984_consen 7 IYELLEQYPELIEILVSY--GF--HCLGNPVM 34 (59)
T ss_dssp HHHHHHH-GGGHHHHHHT--TG--GGGGSCCH
T ss_pred HHHHHHHCHHHHHHHHHc--CC--cccCCcHH
Confidence 468999999999988764 41 13556643
No 23
>COG4536 CorB Putative Mg2+ and Co2+ transporter CorB [Inorganic ion transport and metabolism]
Probab=34.31 E-value=25 Score=29.84 Aligned_cols=39 Identities=15% Similarity=0.283 Sum_probs=29.2
Q ss_pred hCCchhhchhhcCCCcEEeeCCCHHHHHHHHHHHHhhhhCceEecCCC
Q psy5105 25 EYPEFRKHLHQELSGKLKFNFQNPDALRIFTTTVLRKDFGLNVEIPPM 72 (100)
Q Consensus 25 ~~p~l~~~v~~~~~G~~~iDF~~~~Av~~LtkaLL~~dfgL~~~lP~~ 72 (100)
..|...+=+.+..+|...|| -...+|.|||+ ++|.+|++
T Consensus 330 e~d~~~~ev~~q~dgs~iid--Gs~~iRdlNr~-------l~W~Lp~e 368 (423)
T COG4536 330 EHDTLAKEVIPQSDGSFIID--GSANVRDLNRA-------LDWNLPDE 368 (423)
T ss_pred cCcccchhhcccCCCcEEEe--CCCcHHHHHHh-------cCCCCCCC
Confidence 44555555777788988887 44579999999 68999865
No 24
>PF04847 Calcipressin: Calcipressin; InterPro: IPR006931 Calcipressin 1 negatively regulates calcineurin (IPR015757 from INTERPRO) by direct binding and is essential for the survival of T helper type 1 cells. Calcipressin 1 is a phosphoprotein that increases its capacity to inhibit calcineurin when phosphorylated at the FLISPP motif, and this phosphorylation also controls the half-life of calcipressin 1 by accelerating its degradation []. Calcineurin is a calcium-responsive enzyme that dephosphorylates the nuclear factor of activated T cells (NFAT). In so doing it promotes its nuclear translocation and uniquely links calcium signalling to transcriptional regulation []. Calcipressins are a family of proteins derived from three genes. Calcipressin 1 is also known as modulatory calcineurin-interacting protein 1 (MCIP1), Adapt78 and Down syndrome critical region 1 (DSCR1). Calcipressin 2 is variously known as MCIP2, ZAKI-4 and DSCR1-like 1. Calcipressin 3 is also called MCIP3 and DSCR1-like 2 []. DSCR1 (Adapt78) is associated with successful adaptation to oxidative stress and calcium stress as well as with diseases like Alzheimer's and Down syndrome. The DSCR1 (Adapt78) isoform 1 protein, calcipressin 1, inhibits calcineurin and protects against acute calcium-mediated stress damage, including transient oxidative stress []. Calcipressin 1 is encoded by DSCR1, a gene on human chromosome 21. Calcipressin 1 isoform 1 has an N-terminal coding region, which generates a new polypeptide of 252 amino acids. Endogenous calcipressin 1 exists as a complex together with the calcineurin A and B heterodimer []. ; GO: 0019722 calcium-mediated signaling; PDB: 1WEY_A.
Probab=33.77 E-value=21 Score=26.65 Aligned_cols=65 Identities=17% Similarity=0.272 Sum_probs=32.6
Q ss_pred ccHHHHHHhCCchhhchhhcCCCcEEeeCCCHHHHHHHHHHHH------------hhhhCceE----ecCCCCcCCCCcc
Q psy5105 17 PDFKQLAIEYPEFRKHLHQELSGKLKFNFQNPDALRIFTTTVL------------RKDFGLNV----EIPPMRLVPTLPL 80 (100)
Q Consensus 17 ~DF~~La~~~p~l~~~v~~~~~G~~~iDF~~~~Av~~LtkaLL------------~~dfgL~~----~lP~~~LcP~iP~ 80 (100)
-.|.+|..++...-.|+...+.+|+.+.|.+.++.. ..|.+| +.+||-.. +....+|-||-+.
T Consensus 10 ~~l~~l~~~~~~~~~~~~L~sFrRi~v~f~~~~~A~-~~r~~l~~~~~~~~g~~l~~yf~~~~~~~~~~~~~~L~~P~~~ 88 (184)
T PF04847_consen 10 AELEELFSTYDPPVQFSPLKSFRRIRVVFESPESAQ-RARQLLHWDGTSFNGKRLRVYFGQPTPIEDDSEKQYLEPPKPE 88 (184)
T ss_dssp HHHHHHHHTT-SS-EEEEETTTTEEEEE-SSTTHHH-HHHHTST--TSEETTEE-EEE----SS------S--S------
T ss_pred HHHHHHHHhcCCceEEEEcCCCCEEEEEeCCHHHHH-HHHHHhcccccccCCCceEEEEcccccccccccccccCCCCcc
Confidence 357888899999999998889999999999876543 333333 34555333 3366788888777
Q ss_pred hH
Q psy5105 81 RL 82 (100)
Q Consensus 81 R~ 82 (100)
|.
T Consensus 89 K~ 90 (184)
T PF04847_consen 89 KQ 90 (184)
T ss_dssp --
T ss_pred ce
Confidence 64
No 25
>KOG0942|consensus
Probab=33.06 E-value=14 Score=34.36 Aligned_cols=68 Identities=25% Similarity=0.281 Sum_probs=50.1
Q ss_pred ccHHHHHHhCCchhhchhhcC--CCcEEeeCCCHHHHHHHHHHHHhhhhCce--Eec-CCCCcCC-CCcchHHHHHHHHH
Q psy5105 17 PDFKQLAIEYPEFRKHLHQEL--SGKLKFNFQNPDALRIFTTTVLRKDFGLN--VEI-PPMRLVP-TLPLRLNYILWIED 90 (100)
Q Consensus 17 ~DF~~La~~~p~l~~~v~~~~--~G~~~iDF~~~~Av~~LtkaLL~~dfgL~--~~l-P~~~LcP-~iP~R~nYI~wi~D 90 (100)
+||..|+.-.|+|-++++.=+ +| |...-.+|+=++...++|.. +|+ |-|.=+| |.-||..|||-+.|
T Consensus 771 vd~~dL~SlDPeLY~nLifLk~y~g-------ddi~eL~L~FtVv~~e~G~~~vVeLkPnGs~i~VTneNvi~YihLVsn 843 (1001)
T KOG0942|consen 771 VDLHDLASLDPELYKNLIFLKNYNG-------DDISELQLDFTVVNSELGERQVVELKPNGSKIRVTNENVIEYIHLVSN 843 (1001)
T ss_pred CChhhhcccCHHHHHHHHHHHhcCC-------CchhhccceEEEeccccccceeEEeccCCccceeechhhhhhhHHhhh
Confidence 899999999999998886422 33 12333455566778899985 776 6666665 67789999998876
Q ss_pred H
Q psy5105 91 L 91 (100)
Q Consensus 91 L 91 (100)
-
T Consensus 844 Y 844 (1001)
T KOG0942|consen 844 Y 844 (1001)
T ss_pred h
Confidence 3
No 26
>PF13893 RRM_5: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); PDB: 2PE8_A 2PEH_A 2D9O_A 1A9N_D 2DIT_A 3S6E_A 2LQ5_A 1SJQ_A 2AD9_A 1X4D_A ....
Probab=32.72 E-value=80 Score=17.79 Aligned_cols=38 Identities=13% Similarity=0.098 Sum_probs=25.2
Q ss_pred HHHHhCCchhhchhhcCC-CcEEeeCCCHHHHHHHHHHH
Q psy5105 21 QLAIEYPEFRKHLHQELS-GKLKFNFQNPDALRIFTTTV 58 (100)
Q Consensus 21 ~La~~~p~l~~~v~~~~~-G~~~iDF~~~~Av~~LtkaL 58 (100)
++-++|-+........+. |..-|.|.+.++.+..=+.|
T Consensus 3 ~~f~~fG~V~~i~~~~~~~~~a~V~f~~~~~A~~a~~~l 41 (56)
T PF13893_consen 3 KLFSKFGEVKKIKIFKKKRGFAFVEFASVEDAQKAIEQL 41 (56)
T ss_dssp HHHTTTS-EEEEEEETTSTTEEEEEESSHHHHHHHHHHH
T ss_pred HHhCCcccEEEEEEEeCCCCEEEEEECCHHHHHHHHHHh
Confidence 445566666666655555 88899999988776555444
No 27
>TIGR03595 Obg_CgtA_exten Obg family GTPase CgtA, C-terminal extension. CgtA (see model TIGR02729) is a broadly conserved member of the obg family of GTPases associated with ribosome maturation. This model represents a unique C-terminal domain found in some but not all sequences of CgtA. This region is preceded, and may be followed, by a region of low-complexity sequence.
Probab=31.54 E-value=45 Score=20.94 Aligned_cols=17 Identities=35% Similarity=0.567 Sum_probs=14.6
Q ss_pred EeeCCCHHHHHHHHHHH
Q psy5105 42 KFNFQNPDALRIFTTTV 58 (100)
Q Consensus 42 ~iDF~~~~Av~~LtkaL 58 (100)
.+||.+++|++.+.+.|
T Consensus 24 ~~~~~~~e~~~~f~~~L 40 (69)
T TIGR03595 24 KTPFNNDENLRRFARKL 40 (69)
T ss_pred HcCCCCHHHHHHHHHHH
Confidence 47999999999887765
No 28
>cd06592 GH31_glucosidase_KIAA1161 KIAA1161 is an uncharacterized Homo sapiens protein with a glycosyl hydrolase family 31 (GH31) domain that is homologous to the Escherichia coli YihQ glucosidase. Orthologs of KIA1161 are found in eukaryotes and prokaryotes. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=30.06 E-value=71 Score=24.94 Aligned_cols=26 Identities=27% Similarity=0.733 Sum_probs=20.0
Q ss_pred EEeeCCCHHHHHHHHHHH--HhhhhCce
Q psy5105 41 LKFNFQNPDALRIFTTTV--LRKDFGLN 66 (100)
Q Consensus 41 ~~iDF~~~~Av~~LtkaL--L~~dfgL~ 66 (100)
..+||+||+|..-..+.+ +..+.|++
T Consensus 130 ~~~Dftnp~a~~w~~~~~~~~~~~~Gvd 157 (303)
T cd06592 130 AVLDFTNPEAVDWFLSRLKSLQEKYGID 157 (303)
T ss_pred ceEeCCCHHHHHHHHHHHHHHHHHhCCc
Confidence 469999999998887655 44467887
No 29
>cd06602 GH31_MGAM_SI_GAA This family includes the following three closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), and lysosomal acid alpha-glucosidase (GAA), also known as acid-maltase. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end, and a C-terminal luminal domain. Both of
Probab=29.36 E-value=80 Score=25.19 Aligned_cols=28 Identities=18% Similarity=0.347 Sum_probs=18.2
Q ss_pred EeeCCCHHHHHHHHHHH--HhhhhCce-Eec
Q psy5105 42 KFNFQNPDALRIFTTTV--LRKDFGLN-VEI 69 (100)
Q Consensus 42 ~iDF~~~~Av~~LtkaL--L~~dfgL~-~~l 69 (100)
-+||+||+|+.-....+ +..+.|++ |.+
T Consensus 131 ~~Dftnp~a~~ww~~~~~~~~~~~Gvdg~w~ 161 (339)
T cd06602 131 FPDFLNPNTQEWWTDEIKDFHDQVPFDGLWI 161 (339)
T ss_pred CcCCCCHHHHHHHHHHHHHHHhcCCCcEEEe
Confidence 48999999987554333 12347887 533
No 30
>cd02879 GH18_plant_chitinase_class_V The class V plant chitinases have a glycosyl hydrolase family 18 (GH18) domain, but lack the chitin-binding domain present in other GH18 enzymes. The GH18 domain of the class V chitinases has endochitinase activity in some cases and no catalytic activity in others. Included in this family is a lectin found in black locust (Robinia pseudoacacia) bark, which binds chitin but lacks chitinase activity. Also included is a chitinase-related receptor-like kinase (CHRK1) from tobacco (Nicotiana tabacum), with an N-terminal GH18 domain and a C-terminal kinase domain, which is thought to be part of a plant signaling pathway. The GH18 domain of CHRK1 is expressed extracellularly where it binds chitin but lacks chitinase activity.
Probab=28.89 E-value=1.4e+02 Score=23.19 Aligned_cols=72 Identities=19% Similarity=0.329 Sum_probs=42.6
Q ss_pred cHH-HHHHhCCchhhchhhcCCCcEEeeC----CCHHHHHHHHHHHHh--hhhCc-----eEecCCCCcCCCCcchHHHH
Q psy5105 18 DFK-QLAIEYPEFRKHLHQELSGKLKFNF----QNPDALRIFTTTVLR--KDFGL-----NVEIPPMRLVPTLPLRLNYI 85 (100)
Q Consensus 18 DF~-~La~~~p~l~~~v~~~~~G~~~iDF----~~~~Av~~LtkaLL~--~dfgL-----~~~lP~~~LcP~iP~R~nYI 85 (100)
+|. .|.+++|.++-.+.....+..+-.| +|+++.+.+.+.++. +.||+ +||-|.. -..|.||+
T Consensus 55 ~~~~~~k~~~~~lkvlisiGG~~~~s~~fs~~~~~~~~R~~fi~siv~~l~~~~fDGidiDWE~P~~-----~~d~~n~~ 129 (299)
T cd02879 55 TFTETVKRKNPSVKTLLSIGGGGSDSSAFAAMASDPTARKAFINSSIKVARKYGFDGLDLDWEFPSS-----QVEMENFG 129 (299)
T ss_pred HHHHHHHHhCCCCeEEEEEeCCCCCCchhhHHhCCHHHHHHHHHHHHHHHHHhCCCceeecccCCCC-----hhHHHHHH
Confidence 454 5677899999877653322111124 466666666655432 23433 3666631 24699999
Q ss_pred HHHHHHHhh
Q psy5105 86 LWIEDLLEA 94 (100)
Q Consensus 86 ~wi~DLL~~ 94 (100)
..|++|-..
T Consensus 130 ~ll~elr~~ 138 (299)
T cd02879 130 KLLEEWRAA 138 (299)
T ss_pred HHHHHHHHH
Confidence 999998654
No 31
>PRK11104 hemG protoporphyrinogen oxidase; Provisional
Probab=27.47 E-value=59 Score=23.44 Aligned_cols=18 Identities=6% Similarity=0.222 Sum_probs=16.5
Q ss_pred eeCCCHHHHHHHHHHHHh
Q psy5105 43 FNFQNPDALRIFTTTVLR 60 (100)
Q Consensus 43 iDF~~~~Av~~LtkaLL~ 60 (100)
+||+|+++|++.++.+..
T Consensus 155 ~~~~dw~~v~~fa~~~~~ 172 (177)
T PRK11104 155 VEYTDWEQVANFAREFAQ 172 (177)
T ss_pred eeeCCHHHHHHHHHHHHh
Confidence 899999999999998765
No 32
>TIGR00043 metalloprotein, YbeY/UPF0054 family. This minimally protein family is represented by a single member sequence only in nearly every bacterium. Crystallography demonstrates metal-binding activity, possibly to nickel. It is a predicted to be a metallohydrolase, and somewhat more weakly to be a metalloproteinase.
Probab=27.34 E-value=78 Score=21.53 Aligned_cols=34 Identities=18% Similarity=0.380 Sum_probs=29.0
Q ss_pred CcEEeeCCCHHHHHHHHHHHHhhhhCce-EecCCC
Q psy5105 39 GKLKFNFQNPDALRIFTTTVLRKDFGLN-VEIPPM 72 (100)
Q Consensus 39 G~~~iDF~~~~Av~~LtkaLL~~dfgL~-~~lP~~ 72 (100)
|.++|-|-|.+.+++||+.-+.++.--+ +++|-+
T Consensus 1 ~~vsi~~v~~~~i~~lN~~yr~~d~~TDVLSF~~~ 35 (110)
T TIGR00043 1 GELSLLFVDDEEIQELNREYRDKDYPTDVLSFPYE 35 (110)
T ss_pred CEEEEEEECHHHHHHHHHHHhCCCCCCceEeeccc
Confidence 3568889999999999999999998888 588754
No 33
>TIGR02130 dapB_plant dihydrodipicolinate reductase. This narrow family includes genes from Arabidopsis and Fibrobacter succinogenes (which probably recieved the gene from a plant via lateral gene transfer). The sequences are distantly related to the dihydrodipicolinate reductases from archaea. In Fibrobacter this gene is the only candidate DHPR in the genome.
Probab=26.86 E-value=72 Score=25.43 Aligned_cols=21 Identities=10% Similarity=0.192 Sum_probs=17.5
Q ss_pred EEeeCCCHHHHHHHHHHHHhh
Q psy5105 41 LKFNFQNPDALRIFTTTVLRK 61 (100)
Q Consensus 41 ~~iDF~~~~Av~~LtkaLL~~ 61 (100)
+-|||+.|+++.+..+..++.
T Consensus 72 VvIDFT~P~~~~~n~~~~~~~ 92 (275)
T TIGR02130 72 ICIDYTHPSAVNDNAAFYGKH 92 (275)
T ss_pred EEEECCChHHHHHHHHHHHHC
Confidence 679999999999987776654
No 34
>PF02244 Propep_M14: Carboxypeptidase activation peptide; InterPro: IPR003146 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. The peptidases are synthesised as inactive molecules, zymogens, with propeptides that must be removed by proteolytic cleavage to activate the enzyme. Structural studies of carboxypeptidases A and B reveal the propeptide to exist as a globular domain, followed by an extended alpha-helix; this shields the catalytic site, without specifically binding to it, while the substrate-binding site is blocked by making specific contacts [, ]. Members of this propeptide family are found in the metallocarboxypeptidases: A1, A2 [], A3, A4, A5, A6, U, insect gut carboxypeptidase and B [], and and are associated with peptidases belonging to MEROPS peptidase family M14A. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase.; GO: 0004180 carboxypeptidase activity, 0006508 proteolysis; PDB: 3D68_C 3D67_B 3D66_A 1PBA_A 3GLJ_A 1NSA_A 1KWM_A 2BOA_B 1PYT_A 1JQG_A ....
Probab=26.39 E-value=1.3e+02 Score=18.16 Aligned_cols=49 Identities=18% Similarity=0.268 Sum_probs=32.8
Q ss_pred eeCCCHHHHHHHHHHHHhhhhCce-EecCC------CCcCCCCcchHHHHHHHHHHHhhcCCC
Q psy5105 43 FNFQNPDALRIFTTTVLRKDFGLN-VEIPP------MRLVPTLPLRLNYILWIEDLLEANQIS 98 (100)
Q Consensus 43 iDF~~~~Av~~LtkaLL~~dfgL~-~~lP~------~~LcP~iP~R~nYI~wi~DLL~~~~~~ 98 (100)
|.-++.+-+..|.+-- ...+++ |.-|. +-++|| .++..+.++|...+|+
T Consensus 3 v~p~t~~q~~~L~~L~--~~~~~dfW~~~~~~~~~~dv~V~p-----~~~~~f~~~L~~~~i~ 58 (74)
T PF02244_consen 3 VTPKTEEQLELLQELE--QSNELDFWKEPSSVGRPVDVMVPP-----EKLEEFEELLKEHGIE 58 (74)
T ss_dssp EEESSHHHHHHHHHHH--HHSTEEEEESSSSTTSEEEEEEEG-----GGHHHHHHHHHHTT-E
T ss_pred EEeCCHHHHHHHHHHh--cccceeeecCCCCCCCeEEEEECH-----HHHHHHHHHHHHCCCc
Confidence 4456777777776655 888888 77766 224444 5777778888777763
No 35
>PF13979 SopA_C: SopA-like catalytic domain; PDB: 3NB2_B 3NAW_B 3SQV_B 2QZA_B 3SY2_B 2QYU_A.
Probab=26.24 E-value=1e+02 Score=23.21 Aligned_cols=44 Identities=25% Similarity=0.523 Sum_probs=32.8
Q ss_pred eCCCHHHHHHHHHHHHhhhhCceEecCCCCcCCCCcchHHHHHHHHHHHhhcC
Q psy5105 44 NFQNPDALRIFTTTVLRKDFGLNVEIPPMRLVPTLPLRLNYILWIEDLLEANQ 96 (100)
Q Consensus 44 DF~~~~Av~~LtkaLL~~dfgL~~~lP~~~LcP~iP~R~nYI~wi~DLL~~~~ 96 (100)
...+|+++|.++.|||..-+.| +|.|=..-.+=.|...|+...+
T Consensus 96 E~dSP~~LR~YA~ALl~kA~eL---------dP~i~s~~~~~~w~~rLlg~~~ 139 (172)
T PF13979_consen 96 EYDSPKPLRRYANALLEKAYEL---------DPSIFSSQTFDDWKNRLLGLNN 139 (172)
T ss_dssp SSC--HHHHHHHHHHHHHHHHH----------GGGCSHHHHHHHHHHHCTTTC
T ss_pred CCCChHHHHHHHHHHHHHHHHh---------CcchhhHHHHHHHHHHHcCCCC
Confidence 5789999999999999998875 4555556788888888886543
No 36
>COG2170 Uncharacterized conserved protein [Function unknown]
Probab=26.12 E-value=86 Score=26.33 Aligned_cols=32 Identities=25% Similarity=0.424 Sum_probs=29.9
Q ss_pred ecCCCCcCCCCcchHHHHHHHHHHHhhcCCCC
Q psy5105 68 EIPPMRLVPTLPLRLNYILWIEDLLEANQISS 99 (100)
Q Consensus 68 ~lP~~~LcP~iP~R~nYI~wi~DLL~~~~~~~ 99 (100)
.+|...|.|-+++=..|+.|.++++....|++
T Consensus 179 ~~P~~g~pp~f~sw~~f~~~~~~~~~tG~I~~ 210 (369)
T COG2170 179 QLPTNGLPPAFQSWAAFEAFFRDQLETGTIDS 210 (369)
T ss_pred hCCcCCCCccccCHHHHHHHHHHHHHhccccc
Confidence 67999999999999999999999999999874
No 37
>COG0289 DapB Dihydrodipicolinate reductase [Amino acid transport and metabolism]
Probab=26.11 E-value=66 Score=25.75 Aligned_cols=26 Identities=19% Similarity=0.194 Sum_probs=21.4
Q ss_pred cEEeeCCCHHHHHHHHHHHHhhhhCc
Q psy5105 40 KLKFNFQNPDALRIFTTTVLRKDFGL 65 (100)
Q Consensus 40 ~~~iDF~~~~Av~~LtkaLL~~dfgL 65 (100)
.+-|||+.|+++.++.+..++.-..+
T Consensus 71 DV~IDFT~P~~~~~~l~~~~~~~~~l 96 (266)
T COG0289 71 DVLIDFTTPEATLENLEFALEHGKPL 96 (266)
T ss_pred CEEEECCCchhhHHHHHHHHHcCCCe
Confidence 46899999999999999888765443
No 38
>PF12563 Hemolysin_N: Hemolytic toxin N terminal; InterPro: IPR022220 This domain family is found in bacteria, and is approximately 190 amino acids in length. The family is found in association with PF07968 from PFAM, PF00652 from PFAM. This family is a bacterial virulence factor - hemolysin - which forms pores in erythrocytes and causes them to lyse. ; PDB: 1XEZ_A 3O44_I.
Probab=24.37 E-value=1.2e+02 Score=23.16 Aligned_cols=40 Identities=15% Similarity=0.351 Sum_probs=26.0
Q ss_pred CchhhchhhcCCCcEEeeCCCH--HHHHHHHHHHHhhhhCceE
Q psy5105 27 PEFRKHLHQELSGKLKFNFQNP--DALRIFTTTVLRKDFGLNV 67 (100)
Q Consensus 27 p~l~~~v~~~~~G~~~iDF~~~--~Av~~LtkaLL~~dfgL~~ 67 (100)
++++.-|. +.+.+.-|||+.- +.-++-.|+.++.-+|+.+
T Consensus 66 ~~ird~Vl-nq~krylvDFS~iede~~k~~aq~~~r~~~G~sF 107 (187)
T PF12563_consen 66 AQIRDDVL-NQNKRYLVDFSQIEDEEEKAQAQAKFRKQYGLSF 107 (187)
T ss_dssp HHHHHHHT-TS--EEEEE-TT--SHHHHHHHHHHHHHHHS-B-
T ss_pred HHHHHHHH-hcCCeEEEEccccCChHHHHHHHHHHHHHhCcCc
Confidence 34555554 4567789999985 4578889999999999964
No 39
>PF12018 DUF3508: Domain of unknown function (DUF3508); InterPro: IPR021897 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 280 amino acids in length. This domain has two conserved sequence motifs: GFC and GLL. This family is also known as UPF0704.
Probab=24.01 E-value=1.3e+02 Score=23.58 Aligned_cols=42 Identities=24% Similarity=0.411 Sum_probs=30.0
Q ss_pred CccHHHHHHhCCchhhchhhcCCC---------------cEEeeCCCHHHHHHHHHH
Q psy5105 16 PPDFKQLAIEYPEFRKHLHQELSG---------------KLKFNFQNPDALRIFTTT 57 (100)
Q Consensus 16 ~~DF~~La~~~p~l~~~v~~~~~G---------------~~~iDF~~~~Av~~Ltka 57 (100)
..||.+|--+|--|=++-....+| ...+-|++++|++...+.
T Consensus 194 ~~~~~~l~l~~~GFC~~tl~~~~GLLlPG~p~~Gv~~~~~k~y~F~s~~aa~~F~~~ 250 (281)
T PF12018_consen 194 TANFDQLPLQYNGFCPWTLAERDGLLLPGNPSIGVLKYKDKYYAFSSREAAYRFAED 250 (281)
T ss_pred cccHHhccHHHcCcCeEEEeccCceeecCCCccceeEEcCEEEEeCCHHHHHHHHHC
Confidence 788988766666655554444444 335789999999988877
No 40
>smart00362 RRM_2 RNA recognition motif.
Probab=22.84 E-value=1.4e+02 Score=16.06 Aligned_cols=37 Identities=14% Similarity=0.167 Sum_probs=20.8
Q ss_pred cHHHHHHhCCchhhchhh-cC---CCcEEeeCCCHHHHHHH
Q psy5105 18 DFKQLAIEYPEFRKHLHQ-EL---SGKLKFNFQNPDALRIF 54 (100)
Q Consensus 18 DF~~La~~~p~l~~~v~~-~~---~G~~~iDF~~~~Av~~L 54 (100)
|..++-+++-+....... +. .|...+.|+++++....
T Consensus 15 ~l~~~~~~~g~v~~~~~~~~~~~~~~~~~v~f~~~~~a~~a 55 (72)
T smart00362 15 DLKELFSKFGPIESVKIPKDTGKSKGFAFVEFESEEDAEKA 55 (72)
T ss_pred HHHHHHHhcCCEEEEEEecCCCCCCceEEEEeCCHHHHHHH
Confidence 444555555544332222 22 36778999998776554
No 41
>PRK14434 acylphosphatase; Provisional
Probab=22.71 E-value=1.2e+02 Score=19.87 Aligned_cols=42 Identities=17% Similarity=0.214 Sum_probs=34.4
Q ss_pred HHHHHhCCchhhchhhcCCCcEEeeCCCH--HHHHHHHHHHHhh
Q psy5105 20 KQLAIEYPEFRKHLHQELSGKLKFNFQNP--DALRIFTTTVLRK 61 (100)
Q Consensus 20 ~~La~~~p~l~~~v~~~~~G~~~iDF~~~--~Av~~LtkaLL~~ 61 (100)
..+|.++-+|.-+|....+|++.|...-+ +++.++-+.|.+.
T Consensus 22 ~~~A~~lg~l~G~V~N~~dGsVei~~qG~~~~~l~~f~~~l~~g 65 (92)
T PRK14434 22 YSLALEIGDIYGRVWNNDDGTVEILAQSDDSAKLAKFIQEIRKG 65 (92)
T ss_pred HHHHHHcCCcEEEEEECCCCCEEEEEEcCCHHHHHHHHHHHhcC
Confidence 58899999899999988899888886653 5898888887664
No 42
>PLN02775 Probable dihydrodipicolinate reductase
Probab=22.71 E-value=96 Score=24.90 Aligned_cols=21 Identities=14% Similarity=0.248 Sum_probs=18.0
Q ss_pred EEeeCCCHHHHHHHHHHHHhh
Q psy5105 41 LKFNFQNPDALRIFTTTVLRK 61 (100)
Q Consensus 41 ~~iDF~~~~Av~~LtkaLL~~ 61 (100)
+-|||+.|+++.+..+..++.
T Consensus 83 VvIDFT~P~a~~~~~~~~~~~ 103 (286)
T PLN02775 83 IVVDYTLPDAVNDNAELYCKN 103 (286)
T ss_pred EEEECCChHHHHHHHHHHHHC
Confidence 579999999999988877664
No 43
>PF14330 DUF4387: Domain of unknown function (DUF4387)
Probab=22.38 E-value=1e+02 Score=21.15 Aligned_cols=38 Identities=21% Similarity=0.607 Sum_probs=26.2
Q ss_pred hhhchhhcCCC--cEEee--CCCHHHHH------HHHHHHHhhhhCce
Q psy5105 29 FRKHLHQELSG--KLKFN--FQNPDALR------IFTTTVLRKDFGLN 66 (100)
Q Consensus 29 l~~~v~~~~~G--~~~iD--F~~~~Av~------~LtkaLL~~dfgL~ 66 (100)
+.+.+..+..| ..|+| |+++|+-. .||+.+..+-||++
T Consensus 4 la~vIRSKNAGPf~lT~DI~F~~~e~Y~~vk~s~~l~~~~ia~Ly~i~ 51 (99)
T PF14330_consen 4 LAKVIRSKNAGPFELTFDIIFKDKEAYERVKASGVLTPELIARLYGID 51 (99)
T ss_pred HHHHHhccCCCCcEEEEEEEECCHHHHHHHHHcCccCHHHHHHHHCCC
Confidence 44445544446 46777 88877652 48999999999974
No 44
>cd06545 GH18_3CO4_chitinase The Bacteroides thetaiotaomicron protein represented by pdb structure 3CO4 is an uncharacterized bacterial member of the family 18 glycosyl hydrolases with homologs found in Flavobacterium, Stigmatella, and Pseudomonas.
Probab=22.34 E-value=2e+02 Score=21.48 Aligned_cols=42 Identities=19% Similarity=0.450 Sum_probs=24.4
Q ss_pred CCHHHHHHHHHHHH--hhhhCce-----EecCCCCcCCCCcchHHHHHHHHHHHhh
Q psy5105 46 QNPDALRIFTTTVL--RKDFGLN-----VEIPPMRLVPTLPLRLNYILWIEDLLEA 94 (100)
Q Consensus 46 ~~~~Av~~LtkaLL--~~dfgL~-----~~lP~~~LcP~iP~R~nYI~wi~DLL~~ 94 (100)
+|+++...+.+.+. .+.||++ ||-|.. . |.+|+..+++|-+.
T Consensus 79 ~~~~~r~~fi~~lv~~~~~~~~DGIdiDwE~~~~------~-~~~~~~fv~~Lr~~ 127 (253)
T cd06545 79 NDPAKRKALVDKIINYVVSYNLDGIDVDLEGPDV------T-FGDYLVFIRALYAA 127 (253)
T ss_pred cCHHHHHHHHHHHHHHHHHhCCCceeEEeeccCc------c-HhHHHHHHHHHHHH
Confidence 45665555555543 2345443 444432 1 88999998888654
No 45
>COG1713 Predicted HD superfamily hydrolase involved in NAD metabolism [Coenzyme metabolism]
Probab=22.10 E-value=1.5e+02 Score=22.46 Aligned_cols=73 Identities=25% Similarity=0.345 Sum_probs=42.4
Q ss_pred HHHHHhCCc--hhhchhhcCCCc-EEeeCCCHHHHHH-HHHHHHhhhhCce-Eec-----CCCCcCCCCcchHHHHHHHH
Q psy5105 20 KQLAIEYPE--FRKHLHQELSGK-LKFNFQNPDALRI-FTTTVLRKDFGLN-VEI-----PPMRLVPTLPLRLNYILWIE 89 (100)
Q Consensus 20 ~~La~~~p~--l~~~v~~~~~G~-~~iDF~~~~Av~~-LtkaLL~~dfgL~-~~l-----P~~~LcP~iP~R~nYI~wi~ 89 (100)
..+++.+|+ +.+.+.. .|. ..+++.++.-... ....+++++||++ =++ =.+..+ .--+.++=|.+|+
T Consensus 50 HD~aK~~p~~~~~~~~~~--~~~~~~~~~~~~~llH~~vgay~~~~~fGi~De~VL~AI~~HTtg~-~~mt~ldkIiyiA 126 (187)
T COG1713 50 HDIAKELPEQKLLKIAKK--YGLELDLERESPLLLHGKVGAYLLKEEFGIKDEEVLSAIEYHTTGR-KQMTLLDKILYVA 126 (187)
T ss_pred HHHHhhCCHHHHHHHHHH--hCCCchhhccChHHHHHHHHHHHHHHHhCCCcHHHHHHHHHhccCC-Cccchhhheeeee
Confidence 456777775 5555543 332 2256888888865 5677889999995 111 112222 2234566666666
Q ss_pred HHHhhc
Q psy5105 90 DLLEAN 95 (100)
Q Consensus 90 DLL~~~ 95 (100)
|-++..
T Consensus 127 D~iEp~ 132 (187)
T COG1713 127 DKIEPG 132 (187)
T ss_pred ccccCC
Confidence 665543
No 46
>KOG0799|consensus
Probab=21.51 E-value=1.6e+02 Score=24.76 Aligned_cols=16 Identities=44% Similarity=1.115 Sum_probs=11.3
Q ss_pred CCCCCCCC------CCCccHHH
Q psy5105 6 FMHPRNKY------RVPPDFKQ 21 (100)
Q Consensus 6 ~mHprN~y------~~~~DF~~ 21 (100)
.-||+|.| +.++.|+.
T Consensus 125 iYhPqN~ycihvD~~s~~~fk~ 146 (439)
T KOG0799|consen 125 IYHPQNVYCIHVDAKSPPEFRV 146 (439)
T ss_pred HhCCcCcceEEECCCCCHHHHH
Confidence 35899998 23788873
No 47
>COG3228 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.02 E-value=63 Score=25.90 Aligned_cols=22 Identities=41% Similarity=0.558 Sum_probs=18.9
Q ss_pred CCcCCCCcchHHHHHHHHHHHh
Q psy5105 72 MRLVPTLPLRLNYILWIEDLLE 93 (100)
Q Consensus 72 ~~LcP~iP~R~nYI~wi~DLL~ 93 (100)
-+=+|++|+|.+|--|-.|+-+
T Consensus 166 anGvP~l~~r~d~~~W~~~l~a 187 (266)
T COG3228 166 ANGVPFLPLRSDVAGWEHDLHA 187 (266)
T ss_pred cCCCCcCcccchHHHHHHHHHH
Confidence 5668999999999999888754
No 48
>TIGR02691 arsC_pI258_fam arsenate reductase (thioredoxin). This family describes the well-studied thioredoxin-dependent arsenate reductase of Staphylococcus aureaus plasmid pI258 and other mechanistically similar arsenate reductases. The mechanism involves an intramolecular disulfide bond cascade, and aligned members of this family have four absolutely conserved Cys residues. This group of arsenate reductases belongs to the low-molecular weight protein-tyrosine phosphatase family (pfam01451), as does a group of glutathione/glutaredoxin type arsenate reductases (TIGR02689). At least two other, non-homologous groups of arsenate reductases involved in arsenical resistance are also known. This enzyme reduces arsenate to arsenite, which may be more toxic but which is more easily exported.
Probab=20.77 E-value=20 Score=24.74 Aligned_cols=35 Identities=17% Similarity=0.312 Sum_probs=22.7
Q ss_pred HHHHHHHHhhhhCceEecC--CCCcCCCCcchHHHHHH
Q psy5105 52 RIFTTTVLRKDFGLNVEIP--PMRLVPTLPLRLNYILW 87 (100)
Q Consensus 52 ~~LtkaLL~~dfgL~~~lP--~~~LcP~iP~R~nYI~w 87 (100)
++|+...+ ..|++-+.+- ...+||.+|+-..+++|
T Consensus 60 ~~l~~~~~-~~~D~vitm~~~~~~~~~~~p~~~~~~~w 96 (129)
T TIGR02691 60 DLIDLDIL-NKADLVVTLCGDARDKCPATPPHVKREHW 96 (129)
T ss_pred ccCChhhc-ccCCEEEEeCchhccCCCccCCCCeEEEC
Confidence 34555444 3567666553 34589999986667788
No 49
>PRK10658 putative alpha-glucosidase; Provisional
Probab=20.48 E-value=2.7e+02 Score=24.74 Aligned_cols=29 Identities=24% Similarity=0.492 Sum_probs=18.1
Q ss_pred EEeeCCCHHHHHHHHHHHH-hhhhCce-Eec
Q psy5105 41 LKFNFQNPDALRIFTTTVL-RKDFGLN-VEI 69 (100)
Q Consensus 41 ~~iDF~~~~Av~~LtkaLL-~~dfgL~-~~l 69 (100)
.-+||+||+|..-....+= ..+.|++ |..
T Consensus 384 ~~~Dftnp~ar~W~~~~~~~l~d~Gvdgfw~ 414 (665)
T PRK10658 384 AIVDFTNPDACKWYADKLKGLLDMGVDCFKT 414 (665)
T ss_pred eeecCCCHHHHHHHHHHHHHHHhcCCcEEEe
Confidence 4699999999875543220 1256887 433
No 50
>PRK14432 acylphosphatase; Provisional
Probab=20.25 E-value=1.5e+02 Score=19.50 Aligned_cols=39 Identities=10% Similarity=0.237 Sum_probs=29.8
Q ss_pred HHHHhCCchhhchhhcCCCcEEeeC--CCHHHHHHHHHHHHh
Q psy5105 21 QLAIEYPEFRKHLHQELSGKLKFNF--QNPDALRIFTTTVLR 60 (100)
Q Consensus 21 ~La~~~p~l~~~v~~~~~G~~~iDF--~~~~Av~~LtkaLL~ 60 (100)
.+|.+.- |.-+|....+|.+.|-. -+++++.++-+.|-.
T Consensus 23 ~~A~~lg-l~G~V~N~~dG~Vei~~~~G~~~~v~~f~~~l~~ 63 (93)
T PRK14432 23 QIANNMK-LKGFVKNLNDGRVEIVAFFNTKEQMKKFEKLLKN 63 (93)
T ss_pred HHHHHhC-CEEEEEECCCCCEEEEEEECCHHHHHHHHHHHHh
Confidence 4555554 78888888899888874 699999999886654
Done!