Query psy16754
Match_columns 127
No_of_seqs 67 out of 69
Neff 4.5
Searched_HMMs 46136
Date Fri Aug 16 21:09:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16754.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16754hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4352|consensus 100.0 1.9E-61 4.1E-66 372.7 13.0 123 1-125 45-170 (187)
2 PF06905 FAIM1: Fas apoptotic 100.0 1.9E-59 4.1E-64 365.2 15.2 125 1-125 46-173 (177)
3 PF06905 FAIM1: Fas apoptotic 97.7 0.00029 6.3E-09 55.5 9.1 73 50-126 6-86 (177)
4 KOG4352|consensus 97.2 0.00045 9.8E-09 54.3 4.5 75 49-126 4-83 (187)
5 COG3443 Predicted periplasmic 87.2 2.5 5.4E-05 33.9 6.4 64 37-122 63-128 (193)
6 PF13464 DUF4115: Domain of un 78.6 14 0.0003 24.4 6.5 59 36-97 16-76 (77)
7 PF08244 Glyco_hydro_32C: Glyc 77.7 2.6 5.6E-05 27.9 2.7 28 50-77 56-85 (86)
8 PF13385 Laminin_G_3: Concanav 74.9 17 0.00038 24.5 6.4 41 61-101 89-131 (157)
9 PRK14081 triple tyrosine motif 67.6 1.1E+02 0.0025 28.9 12.0 104 16-125 306-427 (667)
10 TIGR01322 scrB_fam sucrose-6-p 65.6 8.4 0.00018 33.3 3.8 28 60-87 416-444 (445)
11 PF06439 DUF1080: Domain of Un 63.3 11 0.00024 27.4 3.6 20 61-80 131-150 (185)
12 PF02872 5_nucleotid_C: 5'-nuc 60.2 6.1 0.00013 28.9 1.7 60 27-86 65-136 (156)
13 TIGR01997 sufA_proteo FeS asse 60.1 34 0.00073 24.0 5.5 67 18-99 26-92 (107)
14 PRK09420 cpdB bifunctional 2', 50.9 13 0.00028 34.3 2.6 58 29-86 444-539 (649)
15 PF03633 Glyco_hydro_65C: Glyc 50.0 58 0.0013 19.9 5.6 27 55-81 23-51 (54)
16 COG0316 sufA Fe-S cluster asse 49.8 78 0.0017 23.1 6.1 65 16-99 27-95 (110)
17 TIGR01390 CycNucDiestase 2',3' 49.4 13 0.00029 34.0 2.4 59 28-86 420-516 (626)
18 PLN03082 Iron-sulfur cluster a 46.7 40 0.00086 26.1 4.4 64 18-97 80-144 (163)
19 PRK09418 bifunctional 2',3'-cy 46.1 21 0.00045 34.0 3.2 58 29-86 459-553 (780)
20 PF08531 Bac_rhamnosid_N: Alph 44.8 26 0.00055 26.6 3.0 37 16-56 4-49 (172)
21 PRK14081 triple tyrosine motif 43.9 2.9E+02 0.0063 26.2 12.3 104 7-123 491-614 (667)
22 PF14099 Polysacc_lyase: Polys 43.0 95 0.0021 23.7 6.0 42 49-90 141-191 (224)
23 PF14344 DUF4397: Domain of un 39.2 75 0.0016 22.0 4.5 20 68-87 13-32 (122)
24 PRK13623 iron-sulfur cluster i 38.3 83 0.0018 22.2 4.7 67 18-99 34-100 (115)
25 PRK09419 bifunctional 2',3'-cy 37.9 29 0.00063 33.9 2.9 58 29-86 453-547 (1163)
26 PF13364 BetaGal_dom4_5: Beta- 35.5 62 0.0014 22.9 3.7 40 18-57 50-91 (111)
27 KOG4767|consensus 35.5 24 0.00052 29.1 1.7 52 18-70 96-151 (231)
28 PF02837 Glyco_hydro_2_N: Glyc 35.1 44 0.00096 24.1 2.9 22 61-82 84-109 (167)
29 PF04313 HSDR_N: Type I restri 34.3 44 0.00095 25.3 2.9 37 2-42 100-136 (194)
30 PRK09455 rseB anti-sigma E fac 33.5 50 0.0011 28.0 3.3 33 2-38 129-163 (319)
31 smart00640 Glyco_32 Glycosyl h 33.4 38 0.00082 29.3 2.6 18 60-77 419-436 (437)
32 smart00776 NPCBM This novel pu 33.1 2.1E+02 0.0045 21.4 6.7 42 70-111 86-130 (145)
33 TIGR03000 plancto_dom_1 Planct 32.5 43 0.00094 23.2 2.3 16 26-41 38-57 (75)
34 PRK05641 putative acetyl-CoA c 32.3 76 0.0016 24.2 3.9 33 9-45 5-38 (153)
35 PF14453 ThiS-like: ThiS-like 31.8 33 0.00073 22.5 1.6 13 71-83 1-13 (57)
36 PF00054 Laminin_G_1: Laminin 31.6 82 0.0018 22.3 3.8 91 35-125 26-128 (131)
37 PF07908 D-aminoacyl_C: D-amin 31.3 34 0.00074 21.1 1.5 15 73-87 23-37 (48)
38 PF07691 PA14: PA14 domain; I 31.0 1.3E+02 0.0029 20.8 4.7 36 32-67 72-117 (145)
39 PF14958 DUF4506: Domain of un 30.7 2.4E+02 0.0051 21.3 7.8 64 63-126 32-108 (138)
40 PRK15367 type III secretion sy 30.6 59 0.0013 28.9 3.4 47 61-107 51-101 (395)
41 COG3026 RseB Negative regulato 30.5 54 0.0012 28.4 3.0 30 16-45 139-184 (320)
42 KOG1119|consensus 29.7 98 0.0021 25.1 4.2 60 17-87 114-173 (199)
43 PTZ00334 trans-sialidase; Prov 29.6 76 0.0016 30.5 4.1 47 51-101 638-691 (780)
44 smart00560 LamGL LamG-like jel 29.2 85 0.0018 22.2 3.5 22 62-83 66-89 (133)
45 PF04780 DUF629: Protein of un 27.7 26 0.00057 31.6 0.7 46 61-106 251-297 (466)
46 PRK10348 ribosome-associated h 27.6 1.6E+02 0.0034 22.3 4.8 40 73-113 36-75 (133)
47 smart00210 TSPN Thrombospondin 27.5 2.2E+02 0.0047 21.5 5.7 64 57-122 115-180 (184)
48 PRK11907 bifunctional 2',3'-cy 26.8 55 0.0012 31.4 2.7 58 29-86 532-626 (814)
49 PRK06944 sulfur carrier protei 26.8 46 0.001 20.8 1.6 12 71-82 1-12 (65)
50 cd07999 GH7_CBH_EG Glycosyl hy 26.8 28 0.00061 30.8 0.7 21 1-21 112-134 (386)
51 PRK10856 cytoskeletal protein 26.6 4E+02 0.0087 22.6 8.0 74 19-98 253-329 (331)
52 COG4766 EutQ Ethanolamine util 25.9 1.3E+02 0.0028 23.9 4.2 81 16-100 72-161 (176)
53 cd02859 AMPKbeta_GBD_like AMP- 25.7 1.2E+02 0.0025 20.1 3.5 20 19-39 38-57 (79)
54 PRK05659 sulfur carrier protei 25.6 49 0.0011 20.8 1.6 12 71-82 1-12 (66)
55 PF14280 DUF4365: Domain of un 25.6 71 0.0015 22.5 2.6 58 37-95 39-98 (138)
56 PF08141 SspH: Small acid-solu 25.2 2E+02 0.0044 18.8 4.8 30 68-99 13-44 (58)
57 PRK06488 sulfur carrier protei 24.6 53 0.0011 20.8 1.6 12 71-82 1-12 (65)
58 cd02858 Esterase_N_term Estera 24.5 1.1E+02 0.0023 20.4 3.2 22 19-40 43-64 (85)
59 cd01179 IPT_plexin_repeat2 Sec 24.4 1.7E+02 0.0037 19.3 4.1 31 77-107 16-46 (85)
60 PRK05863 sulfur carrier protei 24.3 54 0.0012 21.0 1.6 13 71-83 1-13 (65)
61 KOG0275|consensus 24.2 1.3E+02 0.0027 27.1 4.2 86 30-116 326-439 (508)
62 PRK09504 sufA iron-sulfur clus 24.0 2.8E+02 0.0061 20.0 5.5 68 17-99 40-107 (122)
63 PF03099 BPL_LplA_LipB: Biotin 23.3 1.4E+02 0.003 20.3 3.6 76 19-97 33-120 (125)
64 cd00110 LamG Laminin G domain; 23.1 2E+02 0.0044 19.6 4.5 66 58-123 79-151 (151)
65 PF06832 BiPBP_C: Penicillin-B 23.0 2.2E+02 0.0047 18.8 4.4 50 17-68 31-80 (89)
66 KOG2372|consensus 22.9 68 0.0015 26.8 2.2 65 2-75 174-239 (241)
67 PF00380 Ribosomal_S9: Ribosom 22.3 1.3E+02 0.0027 22.3 3.4 25 16-45 6-30 (121)
68 PRK04313 30S ribosomal protein 22.2 1.2E+02 0.0025 25.2 3.5 37 33-69 65-102 (237)
69 PF14438 SM-ATX: Ataxin 2 SM d 22.1 89 0.0019 20.5 2.3 24 99-123 12-35 (77)
70 PF13954 PapC_N: PapC N-termin 22.1 92 0.002 22.8 2.6 23 70-92 33-56 (146)
71 COG0071 IbpA Molecular chapero 21.9 3.1E+02 0.0068 19.9 5.5 29 58-86 48-81 (146)
72 cd03829 Sina Seven in absentia 21.7 60 0.0013 24.7 1.6 21 28-56 65-85 (127)
73 PF02210 Laminin_G_2: Laminin 21.6 1.4E+02 0.0031 19.6 3.3 68 58-125 53-128 (128)
74 PRK07696 sulfur carrier protei 21.4 66 0.0014 20.9 1.6 11 71-81 1-11 (67)
75 PF05508 Ran-binding: RanGTP-b 21.4 1.2E+02 0.0027 26.0 3.6 22 16-37 223-246 (302)
76 cd03775 MATH_Ubp21p Ubiquitin- 21.2 1.4E+02 0.003 21.2 3.4 34 32-69 3-36 (134)
77 PF14275 DUF4362: Domain of un 21.2 1.5E+02 0.0032 21.3 3.5 28 99-126 28-55 (98)
78 CHL00079 rps9 ribosomal protei 21.1 1.1E+02 0.0024 23.0 2.9 25 16-45 14-38 (130)
79 cd02861 E_set_proteins_like E 20.9 1.7E+02 0.0037 19.1 3.6 10 30-39 49-58 (82)
80 PF09865 DUF2092: Predicted pe 20.8 2.8E+02 0.0061 22.2 5.3 43 73-126 30-72 (214)
81 PF14651 Lipocalin_7: Lipocali 20.7 3.6E+02 0.0078 20.0 7.3 83 16-103 35-121 (128)
82 TIGR02011 IscA iron-sulfur clu 20.6 3E+02 0.0065 19.0 5.5 67 18-99 24-90 (105)
83 smart00363 S4 S4 RNA-binding d 20.6 1.4E+02 0.0029 16.8 2.7 22 72-93 27-48 (60)
84 PF10262 Rdx: Rdx family; Int 20.5 58 0.0012 21.4 1.1 15 66-80 40-54 (76)
85 TIGR00731 ctc_TL5 ribosomal pr 20.5 4E+02 0.0088 20.5 8.1 83 32-118 35-118 (176)
No 1
>KOG4352|consensus
Probab=100.00 E-value=1.9e-61 Score=372.68 Aligned_cols=123 Identities=40% Similarity=0.826 Sum_probs=121.9
Q ss_pred CccccceeeEEe-CCcceEEEEEEccCCceEEEEEEECCeehHhhh--eeeeeeeeEEeeCCeeEEEEEEccceeEEEcC
Q psy16754 1 MFRLVGEESFQL-DSKVPCLIKIEPSGPFGYAYSITVDGKTLEKFS--QSKANCTWIVTVEDETYRVVLERGTLDIWVNG 77 (127)
Q Consensus 1 mFkLVGke~F~i-~~~~k~~I~I~a~~gf~YeYsL~V~GKsl~kf~--~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~nG 77 (127)
||||||||+|.| + +||+|+|||+|||+|||||+|||||++||+ ++|++++|+.+.||++|||||+|++|+|||||
T Consensus 45 MFKLVGketF~vg~--tkciInvda~~gFaYEYsL~inGKs~~ky~Ed~~k~~~~W~~t~dg~~~RivL~kdtm~~w~NG 122 (187)
T KOG4352|consen 45 MFKLVGKETFHVGQ--TKCIINVDAAPGFAYEYSLYINGKSHDKYTEDMTKQYRLWLYTDDGQEYRIVLKKDTMSLWVNG 122 (187)
T ss_pred HHHhhchheEeecc--eEEEEEeccCCCceEEEEEEEcCcchHHHHHHhhhheeEEEEecCCceEEEEEeccceeeEEcC
Confidence 899999999999 7 899999999999999999999999999999 99999999999999999999999999999999
Q ss_pred EEeceeeeeeeCCeeEEEEeCCccEEEEEEeecCceeceEEEEEecCc
Q psy16754 78 DRVDVTGEFVENGTETHFTIGDTPAYIKAVSSCNKKEGLLYSLISKLK 125 (127)
Q Consensus 78 ~~iet~geFvd~Gtet~F~lg~~~c~I~a~ssG~k~~Gi~H~L~vn~~ 125 (127)
.+++|+|||||+||+|||+||+++|+|+|+|||+|++||+|+|+|||+
T Consensus 123 ~~l~TageFVd~GT~ThF~lg~~~c~i~A~SSGkkk~GivHTL~vng~ 170 (187)
T KOG4352|consen 123 DELRTAGEFVDGGTDTHFLLGDTECVIQARSSGKKKDGIVHTLLVNGV 170 (187)
T ss_pred ccccccceeecCCeeEEEEecCCcEEEEEeccCCCcCcEEEEEEECCE
Confidence 999999999999999999999999999999999999999999999996
No 2
>PF06905 FAIM1: Fas apoptotic inhibitory molecule (FAIM1); InterPro: IPR010695 This family consists of several fas apoptotic inhibitory molecule (FAIM) proteins. FAIM expression is upregulated in B cells by anti-Ig treatment that induces Fas-resistance, and overexpression of FAIM diminishes sensitivity to Fas-mediated apoptosis of B and non-B cell lines. FAIM is highly evolutionarily conserved and is widely expressed in murine tissues, suggesting that FAIM plays an important role in cellular physiology [].; GO: 0043066 negative regulation of apoptosis; PDB: 3MX7_A 2KW1_A 2KD2_A.
Probab=100.00 E-value=1.9e-59 Score=365.22 Aligned_cols=125 Identities=47% Similarity=0.926 Sum_probs=115.4
Q ss_pred CccccceeeEEe-CCcceEEEEEEccCCceEEEEEEECCeehHhhh--eeeeeeeeEEeeCCeeEEEEEEccceeEEEcC
Q psy16754 1 MFRLVGEESFQL-DSKVPCLIKIEPSGPFGYAYSITVDGKTLEKFS--QSKANCTWIVTVEDETYRVVLERGTLDIWVNG 77 (127)
Q Consensus 1 mFkLVGke~F~i-~~~~k~~I~I~a~~gf~YeYsL~V~GKsl~kf~--~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~nG 77 (127)
||||||+|+|+| +.+.+|.|+|+|++||+|+|||+||||||++|+ |+|++++|+++++|+++||||||+||+|||||
T Consensus 46 mfklvg~e~F~ig~~~~k~~I~I~~~~g~~YeYsL~VdGksl~ky~e~~~k~~~tW~~~i~G~~~RIvLdk~t~~vwvnG 125 (177)
T PF06905_consen 46 MFKLVGKETFTIGGKNTKCEINIEAVSGFAYEYSLEVDGKSLKKYKEEQSKKFNTWELNIDGQEYRIVLDKDTMDVWVNG 125 (177)
T ss_dssp -S---EEEEEEETTTTEEEEEEEEEETTTEEEEEEEETTEEEEE--SSTTTTEEEEEEEETTEEEEEEEETTTTEEEETT
T ss_pred eeeeCcccEEEECCCceEEEEEEEecCCceEEEEEEECCEEHHHHHHHHhhhheeEEEecCCCEEEEEEEcceEEEEECC
Confidence 899999999999 777899999999999999999999999999999 99999999999999999999999999999999
Q ss_pred EEeceeeeeeeCCeeEEEEeCCccEEEEEEeecCceeceEEEEEecCc
Q psy16754 78 DRVDVTGEFVENGTETHFTIGDTPAYIKAVSSCNKKEGLLYSLISKLK 125 (127)
Q Consensus 78 ~~iet~geFvd~Gtet~F~lg~~~c~I~a~ssG~k~~Gi~H~L~vn~~ 125 (127)
++||++|||+|+||+|||+++++||+|+|+|||+|++||+|+|||||+
T Consensus 126 ~~iet~~eFvd~Gtet~F~l~~~~c~I~~~ssG~k~~gi~h~L~v~~~ 173 (177)
T PF06905_consen 126 EKIETEGEFVDDGTETHFELGGHPCYIKAVSSGKKREGIIHTLYVDGE 173 (177)
T ss_dssp CEE--EEEEETTCEEEEEEETTEEEEEEEEEESSSSSEEEEEEEETTE
T ss_pred EEccccceecCCCcEEEEEECCEeEEEEEEecCCcCcCeEEEEEECCE
Confidence 999999999999999999999999999999999999999999999996
No 3
>PF06905 FAIM1: Fas apoptotic inhibitory molecule (FAIM1); InterPro: IPR010695 This family consists of several fas apoptotic inhibitory molecule (FAIM) proteins. FAIM expression is upregulated in B cells by anti-Ig treatment that induces Fas-resistance, and overexpression of FAIM diminishes sensitivity to Fas-mediated apoptosis of B and non-B cell lines. FAIM is highly evolutionarily conserved and is widely expressed in murine tissues, suggesting that FAIM plays an important role in cellular physiology [].; GO: 0043066 negative regulation of apoptosis; PDB: 3MX7_A 2KW1_A 2KD2_A.
Probab=97.73 E-value=0.00029 Score=55.50 Aligned_cols=73 Identities=33% Similarity=0.384 Sum_probs=54.3
Q ss_pred eeeeEEeeCCeeEEEEEEccc----eeEEEcCEEeceee-eeeeCCeeEEEEeC--CccEEEEEEe-ecCceeceEEEEE
Q psy16754 50 NCTWIVTVEDETYRVVLERGT----LDIWVNGDRVDVTG-EFVENGTETHFTIG--DTPAYIKAVS-SCNKKEGLLYSLI 121 (127)
Q Consensus 50 ~~tW~~~i~g~~~RIvLdK~t----mdvw~nG~~iet~g-eFvd~Gtet~F~lg--~~~c~I~a~s-sG~k~~Gi~H~L~ 121 (127)
--+|.+.+++.-++|-|+=.| +=|+|||+.+--.. -|---|.++ |.+| +..|.|.... +| ..-.|+|+
T Consensus 6 va~W~Vpl~d~~h~IefeHgtttGkrvI~VDGkei~r~~wmfklvg~e~-F~ig~~~~k~~I~I~~~~g---~~YeYsL~ 81 (177)
T PF06905_consen 6 VARWNVPLSDGVHKIEFEHGTTTGKRVIKVDGKEIVRRDWMFKLVGKET-FTIGGKNTKCEINIEAVSG---FAYEYSLE 81 (177)
T ss_dssp EEEEEEEETTEEEEEEEEE-TTT--EEEEETTEEEEEE---S---EEEE-EEETTTTEEEEEEEEEETT---TEEEEEEE
T ss_pred EEEEEEecCCCEEEEEEEeCCccCeEEEEECCcEEEEecceeeeCcccE-EEECCCceEEEEEEEecCC---ceEEEEEE
Confidence 357999999999999999664 55999999774444 366666666 9999 7799998664 44 36899999
Q ss_pred ecCcC
Q psy16754 122 SKLKT 126 (127)
Q Consensus 122 vn~~~ 126 (127)
|||++
T Consensus 82 VdGks 86 (177)
T PF06905_consen 82 VDGKS 86 (177)
T ss_dssp ETTEE
T ss_pred ECCEE
Confidence 99975
No 4
>KOG4352|consensus
Probab=97.25 E-value=0.00045 Score=54.29 Aligned_cols=75 Identities=36% Similarity=0.441 Sum_probs=58.1
Q ss_pred eeeeeEEeeCCeeEEEEEEcccee----EEEcCEEe-ceeeeeeeCCeeEEEEeCCccEEEEEEeecCceeceEEEEEec
Q psy16754 49 ANCTWIVTVEDETYRVVLERGTLD----IWVNGDRV-DVTGEFVENGTETHFTIGDTPAYIKAVSSCNKKEGLLYSLISK 123 (127)
Q Consensus 49 ~~~tW~~~i~g~~~RIvLdK~tmd----vw~nG~~i-et~geFvd~Gtet~F~lg~~~c~I~a~ssG~k~~Gi~H~L~vn 123 (127)
-.-+|.....++-|||-||-.|-. |||||+.+ .-.=-|-==|.|| |.+|++.|-|..--.+. -.--++|++|
T Consensus 4 ~~~~~d~~~~d~~~rie~ehgttsgrr~i~v~gre~lrr~wMFKLVGket-F~vg~tkciInvda~~g--FaYEYsL~in 80 (187)
T KOG4352|consen 4 PTMTQDHVPEDQMYRIELEHGTTSGRRMIWVNGREVLRRDWMFKLVGKET-FHVGQTKCIINVDAAPG--FAYEYSLYIN 80 (187)
T ss_pred ceeeeecCcccceEEEEEecCcccccEEEEEcCHHHHHHHHHHHhhchhe-EeecceEEEEEeccCCC--ceEEEEEEEc
Confidence 456899999999999999988765 89999754 3333366678887 99999999998743332 2467899999
Q ss_pred CcC
Q psy16754 124 LKT 126 (127)
Q Consensus 124 ~~~ 126 (127)
|++
T Consensus 81 GKs 83 (187)
T KOG4352|consen 81 GKS 83 (187)
T ss_pred Ccc
Confidence 986
No 5
>COG3443 Predicted periplasmic or secreted protein [General function prediction only]
Probab=87.19 E-value=2.5 Score=33.90 Aligned_cols=64 Identities=22% Similarity=0.207 Sum_probs=46.7
Q ss_pred CCeehHhhh--eeeeeeeeEEeeCCeeEEEEEEccceeEEEcCEEeceeeeeeeCCeeEEEEeCCccEEEEEEeecCcee
Q psy16754 37 DGKTLEKFS--QSKANCTWIVTVEDETYRVVLERGTLDIWVNGDRVDVTGEFVENGTETHFTIGDTPAYIKAVSSCNKKE 114 (127)
Q Consensus 37 ~GKsl~kf~--~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~nG~~iet~geFvd~Gtet~F~lg~~~c~I~a~ssG~k~~ 114 (127)
..|+.++|+ ..|-+.|= --||+++.+|| +|+.||..+.+...-.-|+|.+..+|||
T Consensus 63 ~~ktaaeyk~YY~kGY~Td-------v~~I~I~~~tv-------------tF~k~g~~~s~~Y~y~G~kILtY~kGnR-- 120 (193)
T COG3443 63 GKKTAAEYKAYYRKGYKTD-------VERIGIDGDTV-------------TFFKNGKPTSGKYAYDGYKILTYKKGNR-- 120 (193)
T ss_pred ccccHHHHHHHHhcccccc-------eeEEEEcCCEE-------------EEEECCceeeEEEccCCEEEEeecCCCC--
Confidence 567888888 44545441 24677777765 5777788888888888899999999997
Q ss_pred ceEEEEEe
Q psy16754 115 GLLYSLIS 122 (127)
Q Consensus 115 Gi~H~L~v 122 (127)
|+-+.+-.
T Consensus 121 GVRylFe~ 128 (193)
T COG3443 121 GVRYLFEC 128 (193)
T ss_pred ceEEEEEe
Confidence 77766543
No 6
>PF13464 DUF4115: Domain of unknown function (DUF4115)
Probab=78.65 E-value=14 Score=24.38 Aligned_cols=59 Identities=27% Similarity=0.399 Sum_probs=37.6
Q ss_pred ECCeehHhhh-eeeeeeeeEEeeCCeeEEEEEE-ccceeEEEcCEEeceeeeeeeCCeeEEEEe
Q psy16754 36 VDGKTLEKFS-QSKANCTWIVTVEDETYRVVLE-RGTLDIWVNGDRVDVTGEFVENGTETHFTI 97 (127)
Q Consensus 36 V~GKsl~kf~-~~k~~~tW~~~i~g~~~RIvLd-K~tmdvw~nG~~iet~geFvd~Gtet~F~l 97 (127)
.||+.+-... +.-...+| -.+..++|.+- -...+|.+||+.++..+....-.....|.+
T Consensus 16 ~dG~~~~~~~l~~G~~~~~---~~~~~~~i~iGna~~v~v~~nG~~~~~~~~~~~v~~~~~~~~ 76 (77)
T PF13464_consen 16 ADGKVLFSGTLKAGETKTF---EGKEPFRIRIGNAGAVEVTVNGKPVDLLGPPGQVVKVARFTL 76 (77)
T ss_pred CCCcEeeeeeeCCCcEEEE---eCCCCEEEEEeCCCcEEEEECCEECCCCCCCCccceEEEEcC
Confidence 3454444444 33334444 45678888886 678999999999998776555344444543
No 7
>PF08244 Glyco_hydro_32C: Glycosyl hydrolases family 32 C terminal; InterPro: IPR013189 This domain corresponds to the C-terminal domain of glycosyl hydrolase family 32. It forms a beta sandwich module [].; PDB: 2AEZ_A 2ADE_A 2ADD_A 1ST8_A 2AEY_A 1W2T_C 1UYP_A 3UGG_A 3UGH_B 3UGF_B ....
Probab=77.73 E-value=2.6 Score=27.94 Aligned_cols=28 Identities=18% Similarity=0.400 Sum_probs=21.2
Q ss_pred eeeeEEee-CCe-eEEEEEEccceeEEEcC
Q psy16754 50 NCTWIVTV-EDE-TYRVVLERGTLDIWVNG 77 (127)
Q Consensus 50 ~~tW~~~i-~g~-~~RIvLdK~tmdvw~nG 77 (127)
.+.+.+.. ++. ..||.+|...+||++|+
T Consensus 56 ~~~~~~~~~~~~l~L~i~vD~SsvEiFvNd 85 (86)
T PF08244_consen 56 VRSAPLDLGDKILKLRIFVDRSSVEIFVND 85 (86)
T ss_dssp EEEEETTTTESEEEEEEEEETTEEEEEETT
T ss_pred eEEeeccCCCCcEEEEEEEeCCEEEEEECC
Confidence 34444444 344 89999999999999997
No 8
>PF13385 Laminin_G_3: Concanavalin A-like lectin/glucanases superfamily; PDB: 4DQA_A 1N1Y_A 1MZ6_A 1MZ5_A 1N1S_A 2A75_A 1WCS_A 1N1T_A 1N1V_A 2FHR_A ....
Probab=74.87 E-value=17 Score=24.49 Aligned_cols=41 Identities=17% Similarity=0.263 Sum_probs=28.5
Q ss_pred eEEEEEEccceeEEEcCEEeceeeeee--eCCeeEEEEeCCcc
Q psy16754 61 TYRVVLERGTLDIWVNGDRVDVTGEFV--ENGTETHFTIGDTP 101 (127)
Q Consensus 61 ~~RIvLdK~tmdvw~nG~~iet~geFv--d~Gtet~F~lg~~~ 101 (127)
...++.+..++.+|+||+.+.+...-. .......|.+|...
T Consensus 89 ~l~~~~~~~~~~lyvnG~~~~~~~~~~~~~~~~~~~~~iG~~~ 131 (157)
T PF13385_consen 89 HLALTYDGSTVTLYVNGELVGSSTIPSNISLNSNGPLFIGGSG 131 (157)
T ss_dssp EEEEEEETTEEEEEETTEEETTCTEESSSSTTSCCEEEESS-S
T ss_pred EEEEEEECCeEEEEECCEEEEeEeccCCcCCCCcceEEEeecC
Confidence 556777888999999999985554422 23455678888655
No 9
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=67.57 E-value=1.1e+02 Score=28.88 Aligned_cols=104 Identities=17% Similarity=0.303 Sum_probs=63.3
Q ss_pred ceEEEEEEccCCceEEEEEEECCeehHhhh-eeeeeeeeEEeeCCeeEEEEE-Eccc-----------eeEEEcCE---E
Q psy16754 16 VPCLIKIEPSGPFGYAYSITVDGKTLEKFS-QSKANCTWIVTVEDETYRVVL-ERGT-----------LDIWVNGD---R 79 (127)
Q Consensus 16 ~k~~I~I~a~~gf~YeYsL~V~GKsl~kf~-~~k~~~tW~~~i~g~~~RIvL-dK~t-----------mdvw~nG~---~ 79 (127)
....|+..|.||..+.|.-.|+|...+.=- -.-..-+|.|+.+|. |+|-| =||. ++.-+.-. +
T Consensus 306 ~~I~ika~a~GG~~llYrf~I~G~~~e~~~Y~~~n~~~w~P~~~G~-Y~I~v~VKDk~S~~eyD~~~~v~f~i~~~~~~p 384 (667)
T PRK14081 306 TDIELKAVAEGGKELLYRFIIKGKESEDSGYIRNNIYTWKPKIAGK-YSITLWVKDISSKGEYEDKSSIDYTIEEKSKEP 384 (667)
T ss_pred ceEEEEEEecCCCceEEEEEECCcEEeeccccccceEEEeeCCCce-EEEEEEEEcCcCcccccceEEEEEEEcccCCCC
Confidence 567888999999999999999998876544 445566899998885 44433 2332 22222221 1
Q ss_pred eceeeeeeeCCeeEEEEeCCccEEEEEEeecCceeceEEEEEe--cCc
Q psy16754 80 VDVTGEFVENGTETHFTIGDTPAYIKAVSSCNKKEGLLYSLIS--KLK 125 (127)
Q Consensus 80 iet~geFvd~Gtet~F~lg~~~c~I~a~ssG~k~~Gi~H~L~v--n~~ 125 (127)
+.-.--.+|.+. ..+-|.+-.|.+..+|.. ++.++..+ ||.
T Consensus 385 ~~I~~vl~d~~~---~~lvG~~i~i~v~a~gg~--~~lY~f~ik~ng~ 427 (667)
T PRK14081 385 IKIEDVILDKGK---HILKGEEIKIRVIAEGGT--NLRYSFIIKKDGK 427 (667)
T ss_pred eEEEEEEECCCC---ceEeCCeEEEEEEecCCC--eEEEEEEEEECCE
Confidence 111111233221 233366777777777764 67777766 553
No 10
>TIGR01322 scrB_fam sucrose-6-phosphate hydrolase.
Probab=65.58 E-value=8.4 Score=33.30 Aligned_cols=28 Identities=14% Similarity=0.466 Sum_probs=22.4
Q ss_pred eeEEEEEEccceeEEEcC-EEeceeeeee
Q psy16754 60 ETYRVVLERGTLDIWVNG-DRVDVTGEFV 87 (127)
Q Consensus 60 ~~~RIvLdK~tmdvw~nG-~~iet~geFv 87 (127)
...||.+|...+||+||+ +..=|.+-|.
T Consensus 416 ~~l~i~vD~s~vEvFvn~G~~~~t~riyp 444 (445)
T TIGR01322 416 VSLHIFIDKSSVEIFINDGEEVMTSRIFP 444 (445)
T ss_pred EEEEEEEECCEEEEEECCCEEEEEEeccC
Confidence 379999999999999997 6666666553
No 11
>PF06439 DUF1080: Domain of Unknown Function (DUF1080); InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=63.32 E-value=11 Score=27.45 Aligned_cols=20 Identities=40% Similarity=0.861 Sum_probs=18.7
Q ss_pred eEEEEEEccceeEEEcCEEe
Q psy16754 61 TYRVVLERGTLDIWVNGDRV 80 (127)
Q Consensus 61 ~~RIvLdK~tmdvw~nG~~i 80 (127)
.++|+..-+++.+|+||+++
T Consensus 131 ~~~I~~~g~~i~v~vnG~~v 150 (185)
T PF06439_consen 131 TVRIVVKGNRITVWVNGKPV 150 (185)
T ss_dssp EEEEEEETTEEEEEETTEEE
T ss_pred EEEEEEECCEEEEEECCEEE
Confidence 78999999999999999987
No 12
>PF02872 5_nucleotid_C: 5'-nucleotidase, C-terminal domain; InterPro: IPR008334 5'-nucleotidases 3.1.3.5 from EC [] are enzymes that catalyze the hydrolysis of phosphate esterified at carbon 5' of the ribose and deoxyribose portions of nucleotide molecules. 5'-nucleotidase is a ubiquitous enzyme found in a wide variety of species and which occurs in different cellular locations. The extracellular 5'-nucleotidase from mammals and Discopyge ommata (Electric ray) isozyme is a homodimeric disulphide-bonded glycoprotein attached to the membrane by a GPI-anchor, and requires zinc for its activity. Vibrio parahaemolyticus 5'-nucleotidase (gene nutA) is bound to the membrane by a lipid chain, and requires chloride and magnesium ions for its activity. It is involved in degrading extracellular 5'-nucleotides for nutritional needs. Periplasmic bacterial 5'-nucleotidase (gene ushA), also known as UDP-sugar hydrolase 3.6.1.45 from EC, can degrade UDP-glucose and other nucleotide diphosphate sugars. It produces sugar-1-phosphate which can then be used by the cell. UshA seems to require cobalt for its activity. 5'-Nucleotidases are evolutionary related to the periplasmic bacterial 2',3'-cyclic-nucleotide 2'-phosphodiesterase 3.1.4.16 from EC (gene cpdB), which catalyzes two consecutive reactions: it first converts 2',3'-cyclic-nucleotide to 3'-nucleotide and then acts as a 3'-nucleotidase; and mosquito apyrase 3.6.1.5 from EC (ATP-diphosphohydrolase) [], which catalyzes the hydrolysis of ATP into AMP and facilitates hematophagy by preventing ADP-dependent platelet aggregation in the host. CD73 (also called ecto-5'-nucleotidase) possesses the enzymatic activity of a 5'-nucleotidase and catalyses the dephosphorylation of purine and pyrimidine ribo- and deoxyribonucleoside monophosphates to their corresponding nucleosides. Triggering of lymphocyte CD73 with mAb causes phosphorylation and dephosphorylation of certain, yet unknown protein substrates []. A possible function for CD73 is to regulate the availability of adenosine for interaction with cell surface adenosine receptor by converting AMP to adenosine. In common with other GPI anchored surface proteins CD73 can mediate costimulatory signals in T cell activation []. This entry is the C-terminal domain of 5'-nucleotidases. ; GO: 0016787 hydrolase activity, 0009166 nucleotide catabolic process; PDB: 3ZTV_A 3ZU0_B 2WDD_A 2WDF_A 2WDE_A 2WDC_A 3QFK_A 1OID_B 1OI8_B 1USH_A ....
Probab=60.24 E-value=6.1 Score=28.95 Aligned_cols=60 Identities=15% Similarity=0.163 Sum_probs=34.5
Q ss_pred CceEEEEEEECCeehHhhh-e-----eeeeeeeEEeeCCeeEEEEEEccc----eeE--EEcCEEeceeeee
Q psy16754 27 PFGYAYSITVDGKTLEKFS-Q-----SKANCTWIVTVEDETYRVVLERGT----LDI--WVNGDRVDVTGEF 86 (127)
Q Consensus 27 gf~YeYsL~V~GKsl~kf~-~-----~k~~~tW~~~i~g~~~RIvLdK~t----mdv--w~nG~~iet~geF 86 (127)
...--+++++.|+.|+++. + ......+.+.+.|-.+.+..++.. -++ -+||++|+-...|
T Consensus 65 f~~~~~~~~itG~~Lk~~le~~~~~~~~~~~~~~~~~sGi~y~~d~~~~~~~ri~~l~~~~~G~~i~~~~~Y 136 (156)
T PF02872_consen 65 FPNKLAVVEITGAQLKELLEYDSQGDYPGQDGGFLQVSGIRYTYDPDRPPGQRITDLTYTINGKPIDDDKTY 136 (156)
T ss_dssp SS-EEEEEEEEHHHHHHHHHHCHHTTCCSCSCCCEEEESEEEEEETTS-TTSCEEEEEEEETTEE--TTSEE
T ss_pred cCCEEEEEEEEHHHHHHHHHhhccccccccccccccccceeEEEeecCCCCCcEEEEEEccCCeEcCCCCEE
Confidence 3345789999999999998 4 122223333555777777655321 334 5688777555443
No 13
>TIGR01997 sufA_proteo FeS assembly scaffold SufA. This model represents the SufA protein of the SUF system of iron-sulfur cluster biosynthesis. This system performs FeS biosynthesis even during oxidative stress and tends to be absent in obligate anaerobic and microaerophilic bacteria.
Probab=60.13 E-value=34 Score=23.98 Aligned_cols=67 Identities=15% Similarity=0.311 Sum_probs=39.9
Q ss_pred EEEEEEccCCceEEEEEEECCeehHhhheeeeeeeeEEeeCCeeEEEEEEccceeEEEcCEEeceeeeeeeCCeeEEEEe
Q psy16754 18 CLIKIEPSGPFGYAYSITVDGKTLEKFSQSKANCTWIVTVEDETYRVVLERGTLDIWVNGDRVDVTGEFVENGTETHFTI 97 (127)
Q Consensus 18 ~~I~I~a~~gf~YeYsL~V~GKsl~kf~~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~nG~~iet~geFvd~Gtet~F~l 97 (127)
..|.+++.|-..|.|.|....++=+. -..+..+ ..+|++|+..++ +.+|..|| |++++.-..|.+
T Consensus 26 lRi~v~~~GC~G~~y~~~l~~~~~~~--------D~v~~~~--g~~v~id~~s~~-~l~g~~ID----y~~~~~~~~F~~ 90 (107)
T TIGR01997 26 IRLGVKKTGCAGMEYVLDLVSEPKKD--------DDLIEHD--GAKVFVAPEAVL-FILGTQVD----FVRTTLRQGFKF 90 (107)
T ss_pred EEEEEECCCCCCcEEEeeecCCCCCC--------CEEEecC--CEEEEEcHHHHh-hhCCCEEE----EEEcCCcceEEE
Confidence 55666665546677777753332111 1122233 478888988755 66776664 677777777888
Q ss_pred CC
Q psy16754 98 GD 99 (127)
Q Consensus 98 g~ 99 (127)
.+
T Consensus 91 ~N 92 (107)
T TIGR01997 91 NN 92 (107)
T ss_pred EC
Confidence 53
No 14
>PRK09420 cpdB bifunctional 2',3'-cyclic nucleotide 2'-phosphodiesterase/3'-nucleotidase periplasmic precursor protein; Reviewed
Probab=50.86 E-value=13 Score=34.27 Aligned_cols=58 Identities=14% Similarity=0.292 Sum_probs=40.2
Q ss_pred eEEEEEEECCeehHhhh-ee-eeeee-------------eE-------EeeCCeeEEEEEEccc----------------
Q psy16754 29 GYAYSITVDGKTLEKFS-QS-KANCT-------------WI-------VTVEDETYRVVLERGT---------------- 70 (127)
Q Consensus 29 ~YeYsL~V~GKsl~kf~-~~-k~~~t-------------W~-------~~i~g~~~RIvLdK~t---------------- 70 (127)
---+.++|.|+.|++.. ++ ..++. |. ..++|..|.|.+.+..
T Consensus 444 Ntl~~v~vtG~~Lk~~LE~sa~~f~qi~~~~~~~~~l~~~~~f~~~nfd~isGl~y~iD~s~P~~~~~~~~~~~~~g~RI 523 (649)
T PRK09420 444 NTLVVVKATGAEVKEWLECSAGQFNQIDPNSTKPQSLINWDGFRTYNFDVIDGVNYQIDVTQPARYDGECKLINPNANRI 523 (649)
T ss_pred CEEEEEEccHHHHHHHHHHHHHhhcccCCCCCCchhccCccccCccCcceECCEEEEEECCCCccccccccccCCCCCce
Confidence 34567899999999988 32 11211 21 1378999999887665
Q ss_pred eeEEEcCEEeceeeee
Q psy16754 71 LDIWVNGDRVDVTGEF 86 (127)
Q Consensus 71 mdvw~nG~~iet~geF 86 (127)
.++.+||+.|+-..+|
T Consensus 524 ~~l~~~G~pid~~~~Y 539 (649)
T PRK09420 524 KNLTFNGKPIDPKATF 539 (649)
T ss_pred EEEEECCEECCCCCEE
Confidence 5788899998766554
No 15
>PF03633 Glyco_hydro_65C: Glycosyl hydrolase family 65, C-terminal domain ; InterPro: IPR005194 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This family of glycosyl hydrolases (GH65 from CAZY) contains this domain and includes vacuolar acid trehalase and maltose phosphorylases. Maltose phosphorylase (MP) is a dimeric enzyme that catalyzes the conversion of maltose and inorganic phosphate into beta-D-glucose-1-phosphate and glucose. The C-terminal domain forms a two layered jelly roll motif. This domain is situated at the base of the catalytic domain, however its function remains unknown [].; PDB: 1H54_A.
Probab=49.97 E-value=58 Score=19.92 Aligned_cols=27 Identities=37% Similarity=0.492 Sum_probs=14.6
Q ss_pred EeeCCeeEEEEEEcc--ceeEEEcCEEec
Q psy16754 55 VTVEDETYRVVLERG--TLDIWVNGDRVD 81 (127)
Q Consensus 55 ~~i~g~~~RIvLdK~--tmdvw~nG~~ie 81 (127)
+.++.....|.+... .++|+++|+.+.
T Consensus 23 v~i~~~~v~v~~~~g~~~l~i~v~g~~~~ 51 (54)
T PF03633_consen 23 VEITHEKVTVTLLSGDAPLTIKVYGEEVT 51 (54)
T ss_dssp EEEETTEEEEEEEESS--EEEEETT----
T ss_pred EEEECCEEEEEEccCCccEEEEECCCccc
Confidence 445666666666644 577888887654
No 16
>COG0316 sufA Fe-S cluster assembly scaffold protein [Posttranslational modification, protein turnover, chaperones]
Probab=49.75 E-value=78 Score=23.08 Aligned_cols=65 Identities=17% Similarity=0.386 Sum_probs=45.2
Q ss_pred ceEEEEEEccCCceEEEEEEECCeehHhhheeeeeeeeEEeeCCeeEEEEEEccceeEEEcCEEe----ceeeeeeeCCe
Q psy16754 16 VPCLIKIEPSGPFGYAYSITVDGKTLEKFSQSKANCTWIVTVEDETYRVVLERGTLDIWVNGDRV----DVTGEFVENGT 91 (127)
Q Consensus 16 ~k~~I~I~a~~gf~YeYsL~V~GKsl~kf~~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~nG~~i----et~geFvd~Gt 91 (127)
....|-|+..|--.|+|.++.+- .++..+ .+++.+...|||+-..+ .++-.|+++..
T Consensus 27 ~~lRv~V~~gGCsG~~Y~~~~~~-----------------~~~~~D--~v~e~~g~~v~vD~~S~~~L~G~~IDyv~~~~ 87 (110)
T COG0316 27 LGLRVGVKGGGCSGFQYGLEFDD-----------------EINEDD--TVFEQDGVKVVVDPKSLPYLEGTEIDYVEDLL 87 (110)
T ss_pred ceEEEEEeCCCCCCcEeEEEEcC-----------------CCCCCC--EEEEeCCEEEEEChhhhhhhcCCEEEEEEcCc
Confidence 45778888877778888887762 222222 26677777777776543 66777999998
Q ss_pred eEEEEeCC
Q psy16754 92 ETHFTIGD 99 (127)
Q Consensus 92 et~F~lg~ 99 (127)
-.-|.+.+
T Consensus 88 g~~F~~~N 95 (110)
T COG0316 88 GSGFTFKN 95 (110)
T ss_pred CCceEEEC
Confidence 88888864
No 17
>TIGR01390 CycNucDiestase 2',3'-cyclic-nucleotide 2'-phosphodiesterase. 2',3'-cyclic-nucleotide 2'-phosphodiesterase is a bifunctional enzyme localized to the periplasm of Gram-negative bacteria. 2',3'-cyclic-nucleotide 2'-phosphodiesters are intermediates formed during the hydrolysis of RNA by the ribonuclease I, which is also found to the periplasm, and other enzymes of the RNAse T2 family. Bacteria are unable to transport 2',3'-cyclic-nucleotides into the cytoplasm. 2',3'-cyclic-nucleotide 2'-phosphodiesterase contains 2 active sites which catalyze the reactions that convert the 2',3'-cyclic-nucleotide into a 3'-nucleotide, which is then converted into nucleic acid and phosphate. Both final products can be transported into the cytoplasm. Thus, it has been suggested that 2',3'-cyclic-nucleotide 2'-phosphodiesterase has a 'scavenging' function. Experimental evidence indicates that 2',3'-cyclic-nucleotide 2'-phosphodiesterase enables Yersinia enterocolitica O:8 to grow on 2'3'-cAMP as a
Probab=49.40 E-value=13 Score=33.99 Aligned_cols=59 Identities=12% Similarity=0.199 Sum_probs=41.3
Q ss_pred ceEEEEEEECCeehHhhh-ee-eeeeee--------------------EEeeCCeeEEEEEEccc---------------
Q psy16754 28 FGYAYSITVDGKTLEKFS-QS-KANCTW--------------------IVTVEDETYRVVLERGT--------------- 70 (127)
Q Consensus 28 f~YeYsL~V~GKsl~kf~-~~-k~~~tW--------------------~~~i~g~~~RIvLdK~t--------------- 70 (127)
--.-+.++|.|+.|+++. ++ ..++.| -..++|..|.|-+.+..
T Consensus 420 ~Ntl~~v~vtGa~Lk~~LE~sa~~f~~~~~~~~~~~~l~~~~~~~~~~fd~i~Gl~y~iD~t~P~~~~~~~~~~~~~g~R 499 (626)
T TIGR01390 420 PNTLVVVKVTGAQVKEWLECSAGQFKQIDPTSTKPQSLIDWDGFRTYNFDVIDGVNYEIDVTQPARYDGDCKLINPNAHR 499 (626)
T ss_pred CCEEEEEECcHHHHHHHHHHHHHhhcccCCCccChhhhccCCCCCccccceECCeEEEEeCCCCccccccccccCCCCCc
Confidence 345677899999999998 32 122222 12367889999887665
Q ss_pred -eeEEEcCEEeceeeee
Q psy16754 71 -LDIWVNGDRVDVTGEF 86 (127)
Q Consensus 71 -mdvw~nG~~iet~geF 86 (127)
.++.+||+.|+-..+|
T Consensus 500 I~~l~~~G~pid~~~~Y 516 (626)
T TIGR01390 500 IKNLTYQGKPIDPAAQF 516 (626)
T ss_pred eEEEEECCEECCCCCEE
Confidence 6788899998766665
No 18
>PLN03082 Iron-sulfur cluster assembly; Provisional
Probab=46.72 E-value=40 Score=26.11 Aligned_cols=64 Identities=25% Similarity=0.461 Sum_probs=39.5
Q ss_pred EEEEEEccCCceEEEEEEECCeehH-hhheeeeeeeeEEeeCCeeEEEEEEccceeEEEcCEEeceeeeeeeCCeeEEEE
Q psy16754 18 CLIKIEPSGPFGYAYSITVDGKTLE-KFSQSKANCTWIVTVEDETYRVVLERGTLDIWVNGDRVDVTGEFVENGTETHFT 96 (127)
Q Consensus 18 ~~I~I~a~~gf~YeYsL~V~GKsl~-kf~~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~nG~~iet~geFvd~Gtet~F~ 96 (127)
..|.|++-|=..|.|.|..+.++-+ +. .++ . ...+|++|+..++ +++|-.|| |+|++.-..|.
T Consensus 80 LRl~V~~gGCSG~~Y~~~ld~~~~~~D~-------v~e--~--~Gv~vvVD~~s~~-~L~Gs~ID----Yve~l~~~gF~ 143 (163)
T PLN03082 80 LRLSVETGGCSGFQYVFELDDKTNSDDR-------VFE--K--DGVKLVVDNISYD-FVKGATVD----YVEELIRSAFV 143 (163)
T ss_pred EEEEEecCCCCCceeeeEEccCCCCCCE-------EEe--c--CCeEEEECHHHHH-HhCCCEEE----eecCCCCCeeE
Confidence 5666666554455666665543211 11 122 2 2678999998766 66776665 78888888898
Q ss_pred e
Q psy16754 97 I 97 (127)
Q Consensus 97 l 97 (127)
+
T Consensus 144 f 144 (163)
T PLN03082 144 V 144 (163)
T ss_pred E
Confidence 8
No 19
>PRK09418 bifunctional 2',3'-cyclic nucleotide 2'-phosphodiesterase/3'-nucleotidase precursor protein; Reviewed
Probab=46.06 E-value=21 Score=33.95 Aligned_cols=58 Identities=16% Similarity=0.295 Sum_probs=40.4
Q ss_pred eEEEEEEECCeehHhhh-eee-eeeee-------------------EEeeCCeeEEEEEEccc----------------e
Q psy16754 29 GYAYSITVDGKTLEKFS-QSK-ANCTW-------------------IVTVEDETYRVVLERGT----------------L 71 (127)
Q Consensus 29 ~YeYsL~V~GKsl~kf~-~~k-~~~tW-------------------~~~i~g~~~RIvLdK~t----------------m 71 (127)
-.-+.++|.|+-|++.. ++- .++.+ -..++|..|.|.+.+.. .
T Consensus 459 Ntl~vv~vTGaqLke~LE~sa~~f~qi~~~~~~~q~l~~~~f~~ynFd~isGl~Y~iD~sqP~~~~~~~~~~~~~g~RI~ 538 (780)
T PRK09418 459 NTLYAVKVNGAQVKEWLEMSAGQFNQIDPKKTEEQPLVNIGYPTYNFDILDGLKYEIDVTQPAKYDKDGKVVNANTNRII 538 (780)
T ss_pred CEEEEEEEcHHHHHHHHHHHHhhhcccCcccccccccccCCCCccccceEccEEEEEeCCCCccccccccccccCcceEE
Confidence 34567899999999998 322 23222 23568889998887654 6
Q ss_pred eEEEcCEEeceeeee
Q psy16754 72 DIWVNGDRVDVTGEF 86 (127)
Q Consensus 72 dvw~nG~~iet~geF 86 (127)
++.+||+.|+-..+|
T Consensus 539 ~L~~nGkPVd~~q~Y 553 (780)
T PRK09418 539 NMTYEGKPVADNQEF 553 (780)
T ss_pred EEEECCEECCCCCEE
Confidence 678899988766554
No 20
>PF08531 Bac_rhamnosid_N: Alpha-L-rhamnosidase N-terminal domain; InterPro: IPR013737 This domain is found in bacterial rhamnosidase A and B enzymes and is probably involved in substrate recognition. ; PDB: 2OKX_B.
Probab=44.81 E-value=26 Score=26.56 Aligned_cols=37 Identities=22% Similarity=0.337 Sum_probs=24.0
Q ss_pred ceEEEEEEccCCceEEEEEEECCeehHh---------hheeeeeeeeEEe
Q psy16754 16 VPCLIKIEPSGPFGYAYSITVDGKTLEK---------FSQSKANCTWIVT 56 (127)
Q Consensus 16 ~k~~I~I~a~~gf~YeYsL~V~GKsl~k---------f~~~k~~~tW~~~ 56 (127)
.+|.|.|-|.| .|.|+|||+..-. |.+...++|..++
T Consensus 4 ~~A~l~isa~g----~Y~l~vNG~~V~~~~l~P~~t~y~~~~~Y~tyDVt 49 (172)
T PF08531_consen 4 RSARLYISALG----RYELYVNGERVGDGPLAPGWTDYDKRVYYQTYDVT 49 (172)
T ss_dssp ---EEEEEEES----EEEEEETTEEEEEE--------BTTEEEEEEEE-T
T ss_pred eEEEEEEEeCe----eEEEEECCEEeeCCccccccccCCCceEEEEEeCh
Confidence 46899999987 7999999998642 2233455666665
No 21
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=43.94 E-value=2.9e+02 Score=26.25 Aligned_cols=104 Identities=13% Similarity=0.245 Sum_probs=62.3
Q ss_pred eeeEEeCCcceEEEEEEccCCceEEEEEEECCeeh--HhhheeeeeeeeEEeeCCeeEEEEEEcc-----------ceeE
Q psy16754 7 EESFQLDSKVPCLIKIEPSGPFGYAYSITVDGKTL--EKFSQSKANCTWIVTVEDETYRVVLERG-----------TLDI 73 (127)
Q Consensus 7 ke~F~i~~~~k~~I~I~a~~gf~YeYsL~V~GKsl--~kf~~~k~~~tW~~~i~g~~~RIvLdK~-----------tmdv 73 (127)
++.|.++.....+...+......|+|-|.+||... .+|..++ .-+|.|...|.=.--||-|+ ...+
T Consensus 491 ~~~~~vg~~i~~~~~~~~~k~v~y~y~~~~NG~~v~~t~Ys~~~-~ysf~P~~~GkY~I~V~aKn~~s~~~~D~~k~v~~ 569 (667)
T PRK14081 491 KEYYLVGDDIEIEVIIQNTKDVLIKYILKINGHKVEETDYIKNK-KYKFIPKCSGKYTIEVLAKNIKSTEEYDSKKEVKF 569 (667)
T ss_pred cccEEeCCEEEEEEEEeCCCeEEEEEEEEECCEEEEEeeccccc-eEEEeecCCceEEEEEEEcccccccccccceEEEE
Confidence 46666652344444555556778999999999887 5566433 34699999998666677544 4556
Q ss_pred EEcCE-Eec------eeeeeeeCCeeEEEEeCCccEEEEEEeecCceeceEEEEEec
Q psy16754 74 WVNGD-RVD------VTGEFVENGTETHFTIGDTPAYIKAVSSCNKKEGLLYSLISK 123 (127)
Q Consensus 74 w~nG~-~ie------t~geFvd~Gtet~F~lg~~~c~I~a~ssG~k~~Gi~H~L~vn 123 (127)
.|+.. +|. ..++|-.| .+....|.|.|++ ++.+.++|=
T Consensus 570 ~V~e~~pi~nt~~~~~~~~~~~n----------~~~t~~~~~~gg~--~v~Yef~v~ 614 (667)
T PRK14081 570 YVREALPITNTKIKTSKKKFKCN----------EEVTFSVKSEGGK--DVCYEFYIM 614 (667)
T ss_pred EEcCCCCceeeEEEeecceEEcC----------CeEEEEEEccCCC--cEEEEEEEE
Confidence 67653 221 12233332 3333344455554 458887763
No 22
>PF14099 Polysacc_lyase: Polysaccharide lyase; PDB: 3ILR_A 3IKW_A 3INA_A 3IMN_A 3IN9_A 2ZZJ_A.
Probab=43.05 E-value=95 Score=23.66 Aligned_cols=42 Identities=31% Similarity=0.414 Sum_probs=24.7
Q ss_pred eeeeeEEeeC-CeeEEEEEE-------ccceeEEEcCEEe-ceeeeeeeCC
Q psy16754 49 ANCTWIVTVE-DETYRVVLE-------RGTLDIWVNGDRV-DVTGEFVENG 90 (127)
Q Consensus 49 ~~~tW~~~i~-g~~~RIvLd-------K~tmdvw~nG~~i-et~geFvd~G 90 (127)
....+...+. |+=++|+++ ...++||+||+++ +..|.-..++
T Consensus 141 ~~~~~~~~~~~G~W~~~~i~~~~s~~~~G~~~vw~nG~~v~~~~g~~~~~~ 191 (224)
T PF14099_consen 141 AYSVDLGPVERGKWHDFVIHVKWSPDSDGFLEVWLNGKLVVDYKGPTGYND 191 (224)
T ss_dssp EEEEECCCS-TTSEEEEEEEEEE-CCCTEEEEEEECCEECCEEEEEECECC
T ss_pred eEeecCCCcCCCcEEEEEEEEEECCCCCEEEEEEECCEEEEEEeCCceeCC
Confidence 3344444443 664555543 3579999999988 6666544444
No 23
>PF14344 DUF4397: Domain of unknown function (DUF4397)
Probab=39.24 E-value=75 Score=22.00 Aligned_cols=20 Identities=20% Similarity=0.400 Sum_probs=10.1
Q ss_pred ccceeEEEcCEEeceeeeee
Q psy16754 68 RGTLDIWVNGDRVDVTGEFV 87 (127)
Q Consensus 68 K~tmdvw~nG~~iet~geFv 87 (127)
-..+||++||+.+-..=.|.
T Consensus 13 ~~~vdv~~dg~~~~~~v~y~ 32 (122)
T PF14344_consen 13 APAVDVYVDGTKVFSNVAYG 32 (122)
T ss_pred CccEEEEECCEEEEccCCCC
Confidence 34566666666543333333
No 24
>PRK13623 iron-sulfur cluster insertion protein ErpA; Provisional
Probab=38.31 E-value=83 Score=22.22 Aligned_cols=67 Identities=21% Similarity=0.347 Sum_probs=40.5
Q ss_pred EEEEEEccCCceEEEEEEECCeehHhhheeeeeeeeEEeeCCeeEEEEEEccceeEEEcCEEeceeeeeeeCCeeEEEEe
Q psy16754 18 CLIKIEPSGPFGYAYSITVDGKTLEKFSQSKANCTWIVTVEDETYRVVLERGTLDIWVNGDRVDVTGEFVENGTETHFTI 97 (127)
Q Consensus 18 ~~I~I~a~~gf~YeYsL~V~GKsl~kf~~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~nG~~iet~geFvd~Gtet~F~l 97 (127)
..|.|++.|-..|.|.|..+..+-+ .-..... ...+|++|+..+. +.+|..| .|+++.....|.+
T Consensus 34 LRi~v~~~GCsG~~y~l~l~~~~~~--------~D~v~e~--~gv~v~id~~s~~-~l~g~~I----Dy~~~~~~~~F~f 98 (115)
T PRK13623 34 LRVYITGGGCSGFQYGFTFDEQVNE--------DDTTIEK--QGVTLVVDPMSLQ-YLVGAEV----DYTEGLEGSRFVI 98 (115)
T ss_pred EEEEEeCCCCCCcEEEEEECCCCCC--------CCEEEEc--CCEEEEEcHHHHH-HhCCCEE----EeecCCCcceEEE
Confidence 5666676665667777766632211 1122222 3577888888666 6777555 4677777777888
Q ss_pred CC
Q psy16754 98 GD 99 (127)
Q Consensus 98 g~ 99 (127)
.+
T Consensus 99 ~N 100 (115)
T PRK13623 99 KN 100 (115)
T ss_pred EC
Confidence 53
No 25
>PRK09419 bifunctional 2',3'-cyclic nucleotide 2'-phosphodiesterase/3'-nucleotidase precursor protein; Reviewed
Probab=37.90 E-value=29 Score=33.93 Aligned_cols=58 Identities=16% Similarity=0.320 Sum_probs=39.5
Q ss_pred eEEEEEEECCeehHhhh-ee-eeeeeeE-------------------EeeCCeeEEEEEEccc----------------e
Q psy16754 29 GYAYSITVDGKTLEKFS-QS-KANCTWI-------------------VTVEDETYRVVLERGT----------------L 71 (127)
Q Consensus 29 ~YeYsL~V~GKsl~kf~-~~-k~~~tW~-------------------~~i~g~~~RIvLdK~t----------------m 71 (127)
-.-+.++|.|+.|+++. ++ ..++.+. ..++|-.|.|.+.+.. .
T Consensus 453 N~l~~v~vtG~~L~~~LE~sa~~~~~~~~~~~~~~~~~~~~~~~~~fd~~~Gl~y~~D~s~p~~~~~~g~~~~~~g~RI~ 532 (1163)
T PRK09419 453 NTLYIVKLNGSQVKDWMEMSAGQFNQIKPNDGDLQALLNENFRSYNFDVIDGVTYQIDVTKPAKYNENGNVINADGSRIV 532 (1163)
T ss_pred CEEEEEEecHHHHHHHHHHhHHHhCccCCCCCCcccccCCCCCccccceeCCeEEEEeCCCCCcccccccccCCCCCcee
Confidence 34677899999999999 32 2233222 1368889999887665 5
Q ss_pred eEEEcCEEeceeeee
Q psy16754 72 DIWVNGDRVDVTGEF 86 (127)
Q Consensus 72 dvw~nG~~iet~geF 86 (127)
++.+||+.|+-..+|
T Consensus 533 ~l~~~G~pi~~~~~y 547 (1163)
T PRK09419 533 NLKYDGKPVEDSQEF 547 (1163)
T ss_pred eeeECCEECCCCCEE
Confidence 677788887655544
No 26
>PF13364 BetaGal_dom4_5: Beta-galactosidase jelly roll domain; PDB: 1TG7_A 1XC6_A 3OGS_A 3OGV_A 3OGR_A 3OG2_A.
Probab=35.51 E-value=62 Score=22.86 Aligned_cols=40 Identities=18% Similarity=0.204 Sum_probs=26.1
Q ss_pred EEEE-EEccCCceEEEEEEECCeehHhhh-eeeeeeeeEEee
Q psy16754 18 CLIK-IEPSGPFGYAYSITVDGKTLEKFS-QSKANCTWIVTV 57 (127)
Q Consensus 18 ~~I~-I~a~~gf~YeYsL~V~GKsl~kf~-~~k~~~tW~~~i 57 (127)
..|. +...++.+|.+++.|||+-+-+|. ....-.+..+..
T Consensus 50 ~~~~~l~~~~g~~~~~~vwVNG~~~G~~~~~~g~q~tf~~p~ 91 (111)
T PF13364_consen 50 TSLTPLNIQGGNAFRASVWVNGWFLGSYWPGIGPQTTFSVPA 91 (111)
T ss_dssp EEEE-EEECSSTTEEEEEEETTEEEEEEETTTECCEEEEE-B
T ss_pred eeEEEEeccCCCceEEEEEECCEEeeeecCCCCccEEEEeCc
Confidence 4444 555668889999999998888877 333335555553
No 27
>KOG4767|consensus
Probab=35.49 E-value=24 Score=29.08 Aligned_cols=52 Identities=21% Similarity=0.252 Sum_probs=36.3
Q ss_pred EEEEEEccCCc---eEEEEEEECCeehHhhh-eeeeeeeeEEeeCCeeEEEEEEccc
Q psy16754 18 CLIKIEPSGPF---GYAYSITVDGKTLEKFS-QSKANCTWIVTVEDETYRVVLERGT 70 (127)
Q Consensus 18 ~~I~I~a~~gf---~YeYsL~V~GKsl~kf~-~~k~~~tW~~~i~g~~~RIvLdK~t 70 (127)
-.++|+|+|.. .|||| +.+|-.|+-|+ --..+..=.+.+-+.+-|+||--+|
T Consensus 96 ps~TiKaiGhQWywtYEys-D~~~l~FdSYMip~e~Le~g~~RlLevDnr~VlP~dt 151 (231)
T KOG4767|consen 96 PSLTIKAIGHQWYWTYEYS-DFNGLEFDSYMIPEEDLELGQLRLLEVDNRVVLPIDT 151 (231)
T ss_pred CceEeeeccceEEEEEEec-cCCceeeeeccCCHHHcCCCCceEEeecceEEEecCC
Confidence 35788898764 67887 56679999999 4344444445555668899987775
No 28
>PF02837 Glyco_hydro_2_N: Glycosyl hydrolases family 2, sugar binding domain; InterPro: IPR006104 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities; beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme. This domain has a jelly-roll fold [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3DEC_A 3OB8_A 3OBA_A 3CMG_A 3FN9_C 2VZU_A 2X09_A 2VZO_A 2X05_A 2VZV_B ....
Probab=35.06 E-value=44 Score=24.10 Aligned_cols=22 Identities=32% Similarity=0.445 Sum_probs=15.4
Q ss_pred eEEEEEEcc----ceeEEEcCEEece
Q psy16754 61 TYRVVLERG----TLDIWVNGDRVDV 82 (127)
Q Consensus 61 ~~RIvLdK~----tmdvw~nG~~iet 82 (127)
.-|++|.-+ ..+||+||+.+-.
T Consensus 84 ~~~~~L~f~gv~~~a~v~vNG~~vg~ 109 (167)
T PF02837_consen 84 GKRVFLRFEGVDYAAEVYVNGKLVGS 109 (167)
T ss_dssp TSEEEEEESEEESEEEEEETTEEEEE
T ss_pred CceEEEEeccceEeeEEEeCCeEEee
Confidence 456666655 4569999998844
No 29
>PF04313 HSDR_N: Type I restriction enzyme R protein N terminus (HSDR_N); InterPro: IPR007409 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type I restriction endonucleases are components of prokaryotic DNA restriction-modification mechanisms that protects the organism against invading foreign DNA. Type I enzymes have three different subunits subunits - M (modification), S (specificity) and R (restriction) - that form multifunctional enzymes with restriction (3.1.21.3 from EC), methylase (2.1.1.72 from EC) and ATPase activities [, ]. The S subunit is required for both restriction and modification and is responsible for recognition of the DNA sequence specific for the system. The M subunit is necessary for modification, and the R subunit is required for restriction. These enzymes use S-Adenosyl-L-methionine (AdoMet) as the methyl group donor in the methylation reaction, and have a requirement for ATP. They recognise asymmetric DNA sequences split into two domains of specific sequence, one 3-4 bp long and another 4-5 bp long, separated by a nonspecific spacer 6-8 bp in length. Cleavage occurs a considerable distance from the recognition sites, rarely less than 400 bp away and up to 7000 bp away. Adenosyl residues are methylated, one on each strand of the recognition sequence. These enzymes are widespread in eubacteria and archaea. In enteric bacteria they have been subdivide into four families: types IA, IB, IC and ID. Type III restriction endonucleases (3.1.21.5 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. Type III enzymes are hetero-oligomeric, multifunctional proteins composed of two subunits, Res and Mod. The Mod subunit recognises the DNA sequence specific for the system and is a modification methyltransferase; as such it is functionally equivalent to the M and S subunits of type I restriction endonuclease. Res is required for restriction, although it has no enzymatic activity on its own. Type III enzymes recognise short 5-6 bp long asymmetric DNA sequences and cleave 25-27 bp downstream to leave short, single-stranded 5' protrusions. They require the presence of two inversely oriented unmethylated recognition sites for restriction to occur. These enzymes methylate only one strand of the DNA, at the N-6 position of adenosyl residues, so newly replicated DNA will have only one strand methylated, which is sufficient to protect against restriction. Type III enzymes belong to the beta-subfamily of N6 adenine methyltransferases, containing the nine motifs that characterise this family, including motif I, the AdoMet binding pocket (FXGXG), and motif IV, the catalytic region (S/D/N (PP) Y/F) [, ]. This entry represents the N-terminal domain found in both the R subunit (HsdR) of type I enzymes and the Res subunit of type III enzymes. The type I enzyme represented is EcoRI, which recognises the DNA sequence 5'-GAATTC; the R protein (HsdR) is required for both nuclease and ATPase activity [, , ]. This domain is often found adjacent to a methylase domain (IPR002052 from INTERPRO) in restriction endonucleases or methylases. In one of the proteins, Q97RD0 from SWISSPROT, it is adjacent to a helicase domain (IPR011545 from INTERPRO) in a putative restriction endonuclease.; GO: 0003677 DNA binding, 0004519 endonuclease activity, 0006304 DNA modification; PDB: 2Y3T_B 2W74_B 2W00_B.
Probab=34.34 E-value=44 Score=25.31 Aligned_cols=37 Identities=14% Similarity=0.146 Sum_probs=18.2
Q ss_pred ccccceeeEEeCCcceEEEEEEccCCceEEEEEEECCeehH
Q psy16754 2 FRLVGEESFQLDSKVPCLIKIEPSGPFGYAYSITVDGKTLE 42 (127)
Q Consensus 2 FkLVGke~F~i~~~~k~~I~I~a~~gf~YeYsL~V~GKsl~ 42 (127)
|+.++...|.|. ..+.+.+......-|.|.|||.||-
T Consensus 100 ~~~~~~N~f~v~----~q~~~~~~~~~r~D~vLfvNGlPl~ 136 (194)
T PF04313_consen 100 FKNPENNIFSVV----RQVPVSGGDKRRPDIVLFVNGLPLA 136 (194)
T ss_dssp SSSGGGS-EEEE----E-----------EEEEEEETTEEEE
T ss_pred cCccccCcEEEE----EEEEEeeccCCcceEEEEECCeEEE
Confidence 455666777772 1222226677888999999999864
No 30
>PRK09455 rseB anti-sigma E factor; Provisional
Probab=33.51 E-value=50 Score=27.96 Aligned_cols=33 Identities=33% Similarity=0.631 Sum_probs=24.3
Q ss_pred ccccceeeEEe-CCcceE-EEEEEccCCceEEEEEEECC
Q psy16754 2 FRLVGEESFQL-DSKVPC-LIKIEPSGPFGYAYSITVDG 38 (127)
Q Consensus 2 FkLVGke~F~i-~~~~k~-~I~I~a~~gf~YeYsL~V~G 38 (127)
|++.|+|. | + ..| .|.+.|-....|.|.|+||-
T Consensus 129 ~~~~g~~r--VaG--r~~~vi~~~PkD~~rY~~~lwiD~ 163 (319)
T PRK09455 129 FISVGRTR--IAD--RLCQVIRIVPKDGTRYSYIVWIDE 163 (319)
T ss_pred EEEccccE--ECC--eeEEEEEEEECCCCCcceEEEEEc
Confidence 45666653 5 5 445 67888888899999999973
No 31
>smart00640 Glyco_32 Glycosyl hydrolases family 32.
Probab=33.40 E-value=38 Score=29.27 Aligned_cols=18 Identities=22% Similarity=0.702 Sum_probs=16.3
Q ss_pred eeEEEEEEccceeEEEcC
Q psy16754 60 ETYRVVLERGTLDIWVNG 77 (127)
Q Consensus 60 ~~~RIvLdK~tmdvw~nG 77 (127)
.+.||++|...+||++|+
T Consensus 419 ~~lri~vD~ssvEvf~nd 436 (437)
T smart00640 419 LSLRILVDHSSVEIFANG 436 (437)
T ss_pred EEEEEEEeceEEEEEeCC
Confidence 369999999999999997
No 32
>smart00776 NPCBM This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins.
Probab=33.07 E-value=2.1e+02 Score=21.39 Aligned_cols=42 Identities=7% Similarity=0.271 Sum_probs=28.5
Q ss_pred ceeEEEcCEEeceeeeeeeCCeeEEEEe---CCccEEEEEEeecC
Q psy16754 70 TLDIWVNGDRVDVTGEFVENGTETHFTI---GDTPAYIKAVSSCN 111 (127)
Q Consensus 70 tmdvw~nG~~iet~geFvd~Gtet~F~l---g~~~c~I~a~ssG~ 111 (127)
.+.|++||+.+=..+....+.....+.+ |-....|++...|+
T Consensus 86 ~F~V~~Dg~~l~~s~~~~~~~~~~~~~vdv~G~~~L~L~v~~~g~ 130 (145)
T smart00776 86 VFEVYADGTKLYNSGVLRGADPAKAVDVDVSGAKELRLVVTDAGD 130 (145)
T ss_pred EEEEEeCCEeEEEcccccCCCCCeEEEEEcCCCeEEEEEEEeCCC
Confidence 5679999998866655555555667776 35666666665555
No 33
>TIGR03000 plancto_dom_1 Planctomycetes uncharacterized domain TIGR03000. Domains described by this model are found, so far, only in the Planctomycetes (Pirellula sp. strain 1 and Gemmata obscuriglobus), in up to six proteins per genome, and may be duplicated within a protein. The function is unknown.
Probab=32.47 E-value=43 Score=23.20 Aligned_cols=16 Identities=38% Similarity=0.729 Sum_probs=12.5
Q ss_pred CCceEEEEEEE----CCeeh
Q psy16754 26 GPFGYAYSITV----DGKTL 41 (127)
Q Consensus 26 ~gf~YeYsL~V----~GKsl 41 (127)
+|..|+|++.+ ||+.+
T Consensus 38 ~G~~y~Y~v~a~~~~dG~~~ 57 (75)
T TIGR03000 38 AGKEYEYTVTAEYDRDGRIL 57 (75)
T ss_pred CCCEEEEEEEEEEecCCcEE
Confidence 57899999988 77654
No 34
>PRK05641 putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated
Probab=32.35 E-value=76 Score=24.24 Aligned_cols=33 Identities=21% Similarity=0.330 Sum_probs=24.5
Q ss_pred eEEe-CCcceEEEEEEccCCceEEEEEEECCeehHhhh
Q psy16754 9 SFQL-DSKVPCLIKIEPSGPFGYAYSITVDGKTLEKFS 45 (127)
Q Consensus 9 ~F~i-~~~~k~~I~I~a~~gf~YeYsL~V~GKsl~kf~ 45 (127)
.+.| + ..+++.|+.++ .=.|.+.||||+++-..
T Consensus 5 ~~~~~g--~~~~v~v~~~~--~~~~~itvnG~~y~V~v 38 (153)
T PRK05641 5 KVIVDG--VEYEVEVEELG--PGKFRVSFEGKTYEVEA 38 (153)
T ss_pred EEEECC--EEEEEEEEeec--CccEEEEECCEEEEEEE
Confidence 4556 6 77899998886 33578899999886655
No 35
>PF14453 ThiS-like: ThiS-like ubiquitin
Probab=31.76 E-value=33 Score=22.46 Aligned_cols=13 Identities=23% Similarity=0.432 Sum_probs=11.0
Q ss_pred eeEEEcCEEecee
Q psy16754 71 LDIWVNGDRVDVT 83 (127)
Q Consensus 71 mdvw~nG~~iet~ 83 (127)
|.|++||+.+++.
T Consensus 1 M~I~vN~k~~~~~ 13 (57)
T PF14453_consen 1 MKIKVNEKEIETE 13 (57)
T ss_pred CEEEECCEEEEcC
Confidence 7899999998764
No 36
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=31.60 E-value=82 Score=22.25 Aligned_cols=91 Identities=13% Similarity=0.104 Sum_probs=53.6
Q ss_pred EECCeehHhhh---eeeeeeeeEEeeCCeeEEEEEEc--cceeEEEcCEEe-ceeeeeeeCC---eeEEEEeCCcc-EEE
Q psy16754 35 TVDGKTLEKFS---QSKANCTWIVTVEDETYRVVLER--GTLDIWVNGDRV-DVTGEFVENG---TETHFTIGDTP-AYI 104 (127)
Q Consensus 35 ~V~GKsl~kf~---~~k~~~tW~~~i~g~~~RIvLdK--~tmdvw~nG~~i-et~geFvd~G---tet~F~lg~~~-c~I 104 (127)
-+||+--=.|. .......+...-||+-++|.+.+ ....+.|||+.. ..+..-.... +....-+|+.| -..
T Consensus 26 L~~G~l~~~~~~G~~~~~~~~~~~i~dg~wh~v~~~r~~~~~~L~Vd~~~~~~~~s~~~~~~~l~~~~~lyvGG~p~~~~ 105 (131)
T PF00054_consen 26 LRDGRLEFRYNLGSGPASLRSPQKINDGKWHTVSVSRNGRNGSLSVDGEEVVTGESPSGATQSLDVDGPLYVGGLPSSSS 105 (131)
T ss_dssp EETTEEEEEEESSSEEEEEEESSETTSSSEEEEEEEEETTEEEEEETTSEEEEEEECSSSSSSCEECSEEEESSSSTTTG
T ss_pred EECCEEEEEEeCCCccceecCCCccCCCcceEEEEEEcCcEEEEEECCccceeeecCCccccccccccCEEEccCCchhh
Confidence 45666655555 44455556665588888888865 477889999876 2222212222 33457788888 000
Q ss_pred E--EEeecCceeceEEEEEecCc
Q psy16754 105 K--AVSSCNKKEGLLYSLISKLK 125 (127)
Q Consensus 105 ~--a~ssG~k~~Gi~H~L~vn~~ 125 (127)
. .......=.|-|+.|.+|++
T Consensus 106 ~~~~~~~~~~f~GCi~~~~in~~ 128 (131)
T PF00054_consen 106 RPRPLPISPGFKGCIRNLSINGK 128 (131)
T ss_dssp CGSSCSCCSB-EEEEEEEEETTE
T ss_pred cccccccCCCeeEEEEEeEECCE
Confidence 0 01122234588999999986
No 37
>PF07908 D-aminoacyl_C: D-aminoacylase, C-terminal region; InterPro: IPR012855 D-aminoacylase (Q9AGH8 from SWISSPROT, 3.5.1.81 from EC) hydrolyses a wide variety of N-acyl derivatives of neutral D-amino acids, in a zinc-dependent manner. The enzyme is composed of a small beta-barrel domain and a larger catalytic alpha/beta-barrel that contains a short alpha/beta insert. The overall structure shares significant similarity to the alpha/beta-barrel amidohydrolase superfamily, in which the beta-strands in both barrels superimpose well []. The C-terminal region featured in this entry forms part of the beta-barrel domain, together with a short N-terminal segment. This domain does not seem to contribute to the substrate-binding site or to be involved in the catalytic process.; GO: 0008270 zinc ion binding, 0016811 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in linear amides; PDB: 3GIQ_B 3GIP_B 1V4Y_A 1M7J_A 1RK5_A 1RJP_A 1RJR_A 1RJQ_A 1RK6_A 1V51_A.
Probab=31.33 E-value=34 Score=21.12 Aligned_cols=15 Identities=40% Similarity=0.620 Sum_probs=10.7
Q ss_pred EEEcCEEeceeeeee
Q psy16754 73 IWVNGDRVDVTGEFV 87 (127)
Q Consensus 73 vw~nG~~iet~geFv 87 (127)
|||||+.+=..|+++
T Consensus 23 V~VNG~~vv~~g~~t 37 (48)
T PF07908_consen 23 VFVNGQIVVEDGEVT 37 (48)
T ss_dssp EEETTEEEECTTEES
T ss_pred EEECCEEEEECCeEC
Confidence 799998875555543
No 38
>PF07691 PA14: PA14 domain; InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=30.96 E-value=1.3e+02 Score=20.76 Aligned_cols=36 Identities=17% Similarity=0.191 Sum_probs=18.9
Q ss_pred EEEEECCeehHhhh--ee--------eeeeeeEEeeCCeeEEEEEE
Q psy16754 32 YSITVDGKTLEKFS--QS--------KANCTWIVTVEDETYRVVLE 67 (127)
Q Consensus 32 YsL~V~GKsl~kf~--~~--------k~~~tW~~~i~g~~~RIvLd 67 (127)
+.|.|||+.+-+.. +. ...........|..|.|.++
T Consensus 72 ~~l~idg~~vid~~~~~~~~~~~~~~~~~~~~v~l~~g~~y~i~i~ 117 (145)
T PF07691_consen 72 ARLWIDGKLVIDNWGNQGGGFFNSGPSSTSGTVTLEAGGKYPIRIE 117 (145)
T ss_dssp EEEEETTEEEEECSCTTTSTTTTTSBCCEEEEEEE-TT-EEEEEEE
T ss_pred EEEEECCEEEEcCCccccccccccccceEEEEEEeeCCeeEEEEEE
Confidence 56788888876665 21 23333333345666655554
No 39
>PF14958 DUF4506: Domain of unknown function (DUF4506)
Probab=30.73 E-value=2.4e+02 Score=21.30 Aligned_cols=64 Identities=17% Similarity=0.241 Sum_probs=44.7
Q ss_pred EEEEEccceeEEEc--CEEe-ceeeeeeeC------Cee--EEEEe--CCccEEEEEEeecCceeceEEEEEecCcC
Q psy16754 63 RVVLERGTLDIWVN--GDRV-DVTGEFVEN------GTE--THFTI--GDTPAYIKAVSSCNKKEGLLYSLISKLKT 126 (127)
Q Consensus 63 RIvLdK~tmdvw~n--G~~i-et~geFvd~------Gte--t~F~l--g~~~c~I~a~ssG~k~~Gi~H~L~vn~~~ 126 (127)
-|+=+--+||||+. ++-. -..||++++ ... ....+ ..+.|.||-.|.|+|.-=.++.++|.-++
T Consensus 32 ~I~SeAR~~EvY~g~~~EY~~T~rGe~v~~~~~~~~~~lY~~~l~le~~~~~~~iK~lSl~~k~~vy~~~i~v~~~~ 108 (138)
T PF14958_consen 32 GIVSEARNMEVYVGQSEEYCGTSRGELVDEDSEEENIILYKKDLKLESPTSECKIKFLSLGEKQCVYGSKIVVHLRP 108 (138)
T ss_pred EEEEccCEEEEEECCCCceeeEcCcEEecCCCccccceEEEEEEEcCCCccEEEEEEEecCCccEEEEEEEEEEecC
Confidence 34556667999987 6666 445778882 222 13333 57899999999999988888888876553
No 40
>PRK15367 type III secretion system protein SsaD; Provisional
Probab=30.58 E-value=59 Score=28.88 Aligned_cols=47 Identities=15% Similarity=0.284 Sum_probs=31.4
Q ss_pred eEEEEEEccceeEEEcCEEeceeeeeeeCCee----EEEEeCCccEEEEEE
Q psy16754 61 TYRVVLERGTLDIWVNGDRVDVTGEFVENGTE----THFTIGDTPAYIKAV 107 (127)
Q Consensus 61 ~~RIvLdK~tmdvw~nG~~iet~geFvd~Gte----t~F~lg~~~c~I~a~ 107 (127)
+.-|.|+...+.|||||..-+...--.=++.- -+|.+|...+.+...
T Consensus 51 e~gi~l~~~~~~vwVnG~~~~~~~~LPl~q~Ie~aG~~~vlG~~d~~L~~~ 101 (395)
T PRK15367 51 ADSLFVDAGKARVRVNGRRFNPNKPLPSSGVLQVAGVAIAFGKQDCELADY 101 (395)
T ss_pred CCcEEEecCCceEEECCEEcCCCCCCCCcchhhhcceEEEecCcccccccC
Confidence 66788899889999999855433222222211 389999888877543
No 41
>COG3026 RseB Negative regulator of sigma E activity [Signal transduction mechanisms]
Probab=30.51 E-value=54 Score=28.41 Aligned_cols=30 Identities=37% Similarity=0.805 Sum_probs=24.9
Q ss_pred ceE-EEEEEccCCceEEEEEEE---------------CCeehHhhh
Q psy16754 16 VPC-LIKIEPSGPFGYAYSITV---------------DGKTLEKFS 45 (127)
Q Consensus 16 ~k~-~I~I~a~~gf~YeYsL~V---------------~GKsl~kf~ 45 (127)
.-| +|+|.|-.+|.|.|-+++ ||+.|++|.
T Consensus 139 r~c~virvvpkD~~RY~Y~vw~D~~s~LplrsdlLdrdG~~LEQfr 184 (320)
T COG3026 139 RLCEVIRVVPKDGFRYSYIVWLDEESGLPLRSDLLDRDGELLEQFR 184 (320)
T ss_pred CceeEEEEccCCCceEEEEEEEeccCCCchhhhhhccccceeeeEE
Confidence 334 689999999999999887 488888887
No 42
>KOG1119|consensus
Probab=29.72 E-value=98 Score=25.15 Aligned_cols=60 Identities=25% Similarity=0.468 Sum_probs=40.6
Q ss_pred eEEEEEEccCCceEEEEEEECCeehHhhheeeeeeeeEEeeCCeeEEEEEEccceeEEEcCEEeceeeeee
Q psy16754 17 PCLIKIEPSGPFGYAYSITVDGKTLEKFSQSKANCTWIVTVEDETYRVVLERGTLDIWVNGDRVDVTGEFV 87 (127)
Q Consensus 17 k~~I~I~a~~gf~YeYsL~V~GKsl~kf~~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~nG~~iet~geFv 87 (127)
-..+.||..|=-.|+|.+..|-|.=.+ =..|+ .-.-|||.|-..|+ +++|..||-..|-.
T Consensus 114 ~LRl~VegGGCsGFQYkf~LD~~in~d------D~vf~----e~~arVVvD~~SL~-~~kGatvdy~~ELI 173 (199)
T KOG1119|consen 114 FLRLTVEGGGCSGFQYKFRLDNKINND------DRVFV----ENGARVVVDNVSLN-LLKGATVDYTNELI 173 (199)
T ss_pred eEEEEEecCCccceEEEEEecCCCCCc------ceEEe----eCCcEEEEeccchh-hccCceeehHHHHh
Confidence 467888887767788888887443221 22355 33789999999888 47787777665543
No 43
>PTZ00334 trans-sialidase; Provisional
Probab=29.65 E-value=76 Score=30.54 Aligned_cols=47 Identities=23% Similarity=0.343 Sum_probs=29.4
Q ss_pred eeeEEeeCCeeEEEEEEc---cceeEEEcCEEeceeeeeeeC---Cee-EEEEeCCcc
Q psy16754 51 CTWIVTVEDETYRVVLER---GTLDIWVNGDRVDVTGEFVEN---GTE-THFTIGDTP 101 (127)
Q Consensus 51 ~tW~~~i~g~~~RIvLdK---~tmdvw~nG~~iet~geFvd~---Gte-t~F~lg~~~ 101 (127)
.+|++. ++|+|+|-. ..-.|||||+.+.. .+..-. .-+ .||-+|+-.
T Consensus 638 stWe~~---k~yqVal~L~~G~~gsvYVDG~~vg~-~~~~l~~~~~~~IshFyiGgdg 691 (780)
T PTZ00334 638 SNWEPE---TTHQVAIVLRNGKQGSAYVDGQRVGD-ASCELKNTDSKGISHFYIGGDG 691 (780)
T ss_pred ccccCC---CeEEEEEEEeCCCeEEEEECCEEecC-cccccCCCCCcccceEEECCCc
Confidence 456654 578887654 24699999999933 222222 222 699998654
No 44
>smart00560 LamGL LamG-like jellyroll fold domain.
Probab=29.24 E-value=85 Score=22.23 Aligned_cols=22 Identities=32% Similarity=0.534 Sum_probs=16.4
Q ss_pred EEEEEEc--cceeEEEcCEEecee
Q psy16754 62 YRVVLER--GTLDIWVNGDRVDVT 83 (127)
Q Consensus 62 ~RIvLdK--~tmdvw~nG~~iet~ 83 (127)
.-+|.|. ..|.+|+||+++.+.
T Consensus 66 va~v~d~~~g~~~lYvnG~~~~~~ 89 (133)
T smart00560 66 LAGVYDGGAGKLSLYVNGVEVATS 89 (133)
T ss_pred EEEEEECCCCeEEEEECCEEcccc
Confidence 3456676 689999999988544
No 45
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=27.72 E-value=26 Score=31.64 Aligned_cols=46 Identities=13% Similarity=0.117 Sum_probs=39.2
Q ss_pred eEEEEEEccceeEEEcCEEe-ceeeeeeeCCeeEEEEeCCccEEEEE
Q psy16754 61 TYRVVLERGTLDIWVNGDRV-DVTGEFVENGTETHFTIGDTPAYIKA 106 (127)
Q Consensus 61 ~~RIvLdK~tmdvw~nG~~i-et~geFvd~Gtet~F~lg~~~c~I~a 106 (127)
.=||.||-++..+-.|++.+ .....|.|+||..+|...+|...+.+
T Consensus 251 kekI~ld~~~S~LLLDkrll~~~~~~~dd~gti~~~~~~~~~~~~~~ 297 (466)
T PF04780_consen 251 KEKIDLDSDFSCLLLDKRLLKGELDPFDDEGTINVFDPNVHYADVQP 297 (466)
T ss_pred ceeeecCCccceeeechhhhccCCCccCcCCceeccCCCcccccCCC
Confidence 66899999999999999888 55568999999999999888877655
No 46
>PRK10348 ribosome-associated heat shock protein Hsp15; Provisional
Probab=27.61 E-value=1.6e+02 Score=22.29 Aligned_cols=40 Identities=15% Similarity=0.260 Sum_probs=33.0
Q ss_pred EEEcCEEeceeeeeeeCCeeEEEEeCCccEEEEEEeecCce
Q psy16754 73 IWVNGDRVDVTGEFVENGTETHFTIGDTPAYIKAVSSCNKK 113 (127)
Q Consensus 73 vw~nG~~iet~geFvd~Gtet~F~lg~~~c~I~a~ssG~k~ 113 (127)
|++||+. .-.+.=|..|.+-.+..++....|+...-+.++
T Consensus 36 V~vnG~~-~Kps~~V~~gd~l~v~~~~~~~~v~Vl~l~~~R 75 (133)
T PRK10348 36 VHYNGQR-SKPSKIVELNATLTLRQGNDERTVIVKAITEQR 75 (133)
T ss_pred EEECCEE-CCCCCccCCCCEEEEEECCEEEEEEEeECcccc
Confidence 7899999 677777888999889999998888887666654
No 47
>smart00210 TSPN Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin
Probab=27.53 E-value=2.2e+02 Score=21.49 Aligned_cols=64 Identities=11% Similarity=0.136 Sum_probs=35.3
Q ss_pred eCCeeEEEEE--EccceeEEEcCEEeceeeeeeeCCeeEEEEeCCccEEEEEEeecCceeceEEEEEe
Q psy16754 57 VEDETYRVVL--ERGTLDIWVNGDRVDVTGEFVENGTETHFTIGDTPAYIKAVSSCNKKEGLLYSLIS 122 (127)
Q Consensus 57 i~g~~~RIvL--dK~tmdvw~nG~~iet~geFvd~Gtet~F~lg~~~c~I~a~ssG~k~~Gi~H~L~v 122 (127)
.||+=+||.| +.+...+|+|++++++.. +-..+. .....++..-...-.+++.+=.|.+..|-+
T Consensus 115 ~dg~WH~lal~V~~~~v~LyvDC~~~~~~~-l~~~~~-~~~~~~g~~~~g~~~~~~~~f~G~lq~l~i 180 (184)
T smart00210 115 ADGQWHKLALSVSGSSATLYVDCNEIDSRP-LDRPGQ-PPIDTDGIEVRGAQAADRKPFQGDLQQLKI 180 (184)
T ss_pred ccCCceEEEEEEeCCEEEEEECCcccccee-cCCccc-ccccccceEEEeeccCCCCcceEEeEEEEE
Confidence 4787666665 567888999999987753 322221 111112221111112334466788888765
No 48
>PRK11907 bifunctional 2',3'-cyclic nucleotide 2'-phosphodiesterase/3'-nucleotidase precursor protein; Reviewed
Probab=26.85 E-value=55 Score=31.39 Aligned_cols=58 Identities=21% Similarity=0.308 Sum_probs=39.7
Q ss_pred eEEEEEEECCeehHhhh-eee-eeeee-------------------EEeeCCeeEEEEEEccc----------------e
Q psy16754 29 GYAYSITVDGKTLEKFS-QSK-ANCTW-------------------IVTVEDETYRVVLERGT----------------L 71 (127)
Q Consensus 29 ~YeYsL~V~GKsl~kf~-~~k-~~~tW-------------------~~~i~g~~~RIvLdK~t----------------m 71 (127)
-.-+.++|.|+-|++.. ++- .++.+ -..++|..|.|-+.+.. .
T Consensus 532 NtL~vv~vTGaqLk~~LE~sa~~f~qi~~~~~~~q~l~~~~~~~ynFd~isGl~Y~iD~tqP~~~~~~g~~~~~~G~RI~ 611 (814)
T PRK11907 532 NVTAILKVTGAQLKEWLEMSAGQFNQIDPNSKEPQNLVNTDYRTYNFDVIDGVTYKFDITQPNKYDRDGKLVNPTASRVR 611 (814)
T ss_pred CeEEEEEeeHHHHHHHHHHHHHhhcccCCcccccccccccCcCcccCceecceEEEEeCCCCccccccccccCCCCCceE
Confidence 34577899999999999 422 22221 12368889998887652 3
Q ss_pred eEEEcCEEeceeeee
Q psy16754 72 DIWVNGDRVDVTGEF 86 (127)
Q Consensus 72 dvw~nG~~iet~geF 86 (127)
++.+||+.|+-..+|
T Consensus 612 ~L~~~G~PId~~~~Y 626 (814)
T PRK11907 612 NLQYNGQPVDANQEF 626 (814)
T ss_pred EEEECCEECCCCCEE
Confidence 678899988766554
No 49
>PRK06944 sulfur carrier protein ThiS; Provisional
Probab=26.80 E-value=46 Score=20.79 Aligned_cols=12 Identities=25% Similarity=0.506 Sum_probs=9.8
Q ss_pred eeEEEcCEEece
Q psy16754 71 LDIWVNGDRVDV 82 (127)
Q Consensus 71 mdvw~nG~~iet 82 (127)
|.|++||+.++.
T Consensus 1 m~i~vNg~~~~~ 12 (65)
T PRK06944 1 MDIQLNQQTLSL 12 (65)
T ss_pred CEEEECCEEEEC
Confidence 789999998754
No 50
>cd07999 GH7_CBH_EG Glycosyl hydrolase family 7. Glycosyl hydrolase family 7 contains eukaryotic endoglucanases (EGs) and cellobiohydrolases (CBHs) that hydrolyze glycosidic bonds using a double-displacement mechanism. This leads to a net retention of the conformation at the anomeric carbon. Both enzymes work synergistically in the degradation of cellulose,which is the main component of plant cell wall, and is composed of beta-1,4 linked glycosyl units. EG cleaves the beta-1,4 linkages of cellulose and CBH cleaves off cellobiose disaccharide units from the reducing end of the chain. In general, the O-glycosyl hydrolases are a widespread group of enzymes that hydrolyze the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A glycosyl hydrolase classification system based on sequence similarity has led to the definition of more than 95 different families inlcuding glycoside hydrolase family 7.
Probab=26.75 E-value=28 Score=30.83 Aligned_cols=21 Identities=43% Similarity=0.994 Sum_probs=15.3
Q ss_pred Ccccccee-eEEe-CCcceEEEE
Q psy16754 1 MFRLVGEE-SFQL-DSKVPCLIK 21 (127)
Q Consensus 1 mFkLVGke-~F~i-~~~~k~~I~ 21 (127)
||+|+|+| +|.| -+++.|=++
T Consensus 112 mf~Llg~EftFDVD~S~LpCGlN 134 (386)
T cd07999 112 MFKLLGQEFTFDVDMSNLPCGMN 134 (386)
T ss_pred EEEecCCeeEEEeecccCCcccc
Confidence 89999997 6767 556666443
No 51
>PRK10856 cytoskeletal protein RodZ; Provisional
Probab=26.64 E-value=4e+02 Score=22.65 Aligned_cols=74 Identities=14% Similarity=0.241 Sum_probs=51.1
Q ss_pred EEEEEccCCceEEEEEEECCeehHhhh-eeeeeeeeEEeeCCe-eEEEEEE-ccceeEEEcCEEeceeeeeeeCCeeEEE
Q psy16754 19 LIKIEPSGPFGYAYSITVDGKTLEKFS-QSKANCTWIVTVEDE-TYRVVLE-RGTLDIWVNGDRVDVTGEFVENGTETHF 95 (127)
Q Consensus 19 ~I~I~a~~gf~YeYsL~V~GKsl~kf~-~~k~~~tW~~~i~g~-~~RIvLd-K~tmdvw~nG~~iet~geFvd~Gtet~F 95 (127)
.+.|.. ++..|.---+.|||.|-.-. ..-. + +.+.|+ .++|.|- ....+|.+||+.++-.+ |.-.|....|
T Consensus 253 ~L~i~f-~~d~Wv~V~da~Gk~l~~g~~~~G~--~--~~~~g~~p~~v~iG~~~~v~i~~nG~~vdl~~-~~~~g~vARf 326 (331)
T PRK10856 253 ALVMNF-TADCWLEVTDATGKKLFSGMQRKGG--N--LNLTGQAPYKLKIGAPAAVQIQYQGKPVDLSR-FIRTNQVARL 326 (331)
T ss_pred eEEEEE-cCCeEEEEEeCCCcChhhhccCCCc--e--EEecCCceEEEEEcCCCceEEEECCEEccCCc-cCcCCceEEE
Confidence 344444 34566666677899988776 2222 2 344455 7999997 45789999999998765 4667888888
Q ss_pred EeC
Q psy16754 96 TIG 98 (127)
Q Consensus 96 ~lg 98 (127)
.|+
T Consensus 327 tLp 329 (331)
T PRK10856 327 TLN 329 (331)
T ss_pred ecC
Confidence 886
No 52
>COG4766 EutQ Ethanolamine utilization protein [Amino acid transport and metabolism]
Probab=25.94 E-value=1.3e+02 Score=23.94 Aligned_cols=81 Identities=20% Similarity=0.449 Sum_probs=53.9
Q ss_pred ceEEEEEEccCCceEEEEE----EECCeehH-hhh-eeeeeeeeEEeeCCeeEEEEEEccceeEEEcCEEe-ceeee--e
Q psy16754 16 VPCLIKIEPSGPFGYAYSI----TVDGKTLE-KFS-QSKANCTWIVTVEDETYRVVLERGTLDIWVNGDRV-DVTGE--F 86 (127)
Q Consensus 16 ~k~~I~I~a~~gf~YeYsL----~V~GKsl~-kf~-~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~nG~~i-et~ge--F 86 (127)
.++.++-++. --+=.|+= +=||.++. -|+ ..-++-+|.++.|..++ |||-. |+|-.+|+.+ .-+|+ |
T Consensus 72 vk~~~r~d~g-qp~~V~~tdLvt~~~g~~l~aG~m~~~~~tf~wtl~yDe~d~--VlEGr-L~V~~~g~tv~a~aGDvif 147 (176)
T COG4766 72 VKFGLRFDTG-QPDCVYTTDLVTEQEGSRLGAGLMEMKNTTFPWTLNYDEIDY--VLEGR-LHVRIDGRTVIAGAGDVIF 147 (176)
T ss_pred ceeEeeecCC-CCCeEEeeceeecccCCccccceeeeccccCcceecccceeE--EEeee-EEEEEcCCeEecCCCcEEE
Confidence 4455555552 23444543 34666664 355 44478899999987665 56654 9999999988 55666 8
Q ss_pred eeCCeeEEEEeCCc
Q psy16754 87 VENGTETHFTIGDT 100 (127)
Q Consensus 87 vd~Gtet~F~lg~~ 100 (127)
.--|....|...+.
T Consensus 148 iPKgssIefst~ge 161 (176)
T COG4766 148 IPKGSSIEFSTTGE 161 (176)
T ss_pred ecCCCeEEEeccce
Confidence 88888887877543
No 53
>cd02859 AMPKbeta_GBD_like AMP-activated protein kinase (AMPK) beta subunit glycogen binding domain (GBD). AMPK is a metabolic stress sensing protein that senses AMP/ATP and has recently been found to act as a glycogen sensor as well. The protein functions as a alpha-beta-gamma heterotrimer. This domain is the glycogen binding domain of the beta subunit.
Probab=25.67 E-value=1.2e+02 Score=20.07 Aligned_cols=20 Identities=30% Similarity=0.494 Sum_probs=13.0
Q ss_pred EEEEEccCCceEEEEEEECCe
Q psy16754 19 LIKIEPSGPFGYAYSITVDGK 39 (127)
Q Consensus 19 ~I~I~a~~gf~YeYsL~V~GK 39 (127)
++.++- +...|+|..-|||.
T Consensus 38 ~~~~~L-~~g~y~YkF~Vdg~ 57 (79)
T cd02859 38 SATLRL-PPGKYQYKFIVDGE 57 (79)
T ss_pred EEEEEc-CCCCEEEEEEECCE
Confidence 344433 34589999989875
No 54
>PRK05659 sulfur carrier protein ThiS; Validated
Probab=25.59 E-value=49 Score=20.77 Aligned_cols=12 Identities=25% Similarity=0.634 Sum_probs=9.6
Q ss_pred eeEEEcCEEece
Q psy16754 71 LDIWVNGDRVDV 82 (127)
Q Consensus 71 mdvw~nG~~iet 82 (127)
|.|++||+..+.
T Consensus 1 m~i~vNG~~~~~ 12 (66)
T PRK05659 1 MNIQLNGEPREL 12 (66)
T ss_pred CEEEECCeEEEc
Confidence 789999997744
No 55
>PF14280 DUF4365: Domain of unknown function (DUF4365)
Probab=25.57 E-value=71 Score=22.48 Aligned_cols=58 Identities=22% Similarity=0.430 Sum_probs=44.4
Q ss_pred CCeehHhhh--eeeeeeeeEEeeCCeeEEEEEEccceeEEEcCEEeceeeeeeeCCeeEEE
Q psy16754 37 DGKTLEKFS--QSKANCTWIVTVEDETYRVVLERGTLDIWVNGDRVDVTGEFVENGTETHF 95 (127)
Q Consensus 37 ~GKsl~kf~--~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~nG~~iet~geFvd~Gtet~F 95 (127)
+|++..... |-|.+..|...-++..+.+.++++.++.|.+ ..+..---.++.-.+.-|
T Consensus 39 ~g~~tg~~i~VQlK~t~~~~~~~~~~~~~~~~~~~~l~y~~~-~~~PvilV~~~~~~~~~y 98 (138)
T PF14280_consen 39 DGKPTGRRIAVQLKSTESYFYENDDGSFSFPLDKEHLNYWLN-HPVPVILVLVDPDSDCAY 98 (138)
T ss_pred CCcccCCEEEEEEcccCcccccCCCceEEEeeCHHHHHHHhc-CCCCEEEEEEeCCCCEEE
Confidence 788777666 9999999998777889999999999999998 566655555665555433
No 56
>PF08141 SspH: Small acid-soluble spore protein H family; InterPro: IPR012610 This family consists of the small acid-soluble spore proteins (SASP) of the H type (sspH). SspH are unique to spores of Bacillus subtilis and are expressed only in the forespore compartment during sporulation of this organism. The sspH genes are monocistronic and are recognised by the forespore-specific sigma factor for RNA polymerase - sigma-G. The specific role of this protein is unclear but is thought to play a role in sporulation under conditions different from that of the common laboratory tests of spore properties [].; GO: 0030436 asexual sporulation, 0042601 endospore-forming forespore
Probab=25.20 E-value=2e+02 Score=18.77 Aligned_cols=30 Identities=27% Similarity=0.406 Sum_probs=21.6
Q ss_pred ccceeEEEcCEEe--ceeeeeeeCCeeEEEEeCC
Q psy16754 68 RGTLDIWVNGDRV--DVTGEFVENGTETHFTIGD 99 (127)
Q Consensus 68 K~tmdvw~nG~~i--et~geFvd~Gtet~F~lg~ 99 (127)
++.-+|..||..| +..-| ++||.+-+.+++
T Consensus 13 ~~~i~V~y~G~pV~Ie~vde--~~~tA~V~~l~~ 44 (58)
T PF08141_consen 13 PDMIEVTYNGVPVWIEHVDE--ENGTARVHPLDN 44 (58)
T ss_pred CceEEEEECCEEEEEEEEcC--CCCeEEEEECCC
Confidence 4556888899877 55554 888888777743
No 57
>PRK06488 sulfur carrier protein ThiS; Validated
Probab=24.62 E-value=53 Score=20.80 Aligned_cols=12 Identities=25% Similarity=0.797 Sum_probs=10.3
Q ss_pred eeEEEcCEEece
Q psy16754 71 LDIWVNGDRVDV 82 (127)
Q Consensus 71 mdvw~nG~~iet 82 (127)
|.|++||+.++.
T Consensus 1 m~i~~Ng~~~~~ 12 (65)
T PRK06488 1 MKLFVNGETLQT 12 (65)
T ss_pred CEEEECCeEEEc
Confidence 789999998876
No 58
>cd02858 Esterase_N_term Esterase N-terminal domain. Esterases catalyze the hydrolysis of organic esters to release an alcohol or thiol and acid. The term can be applied to enzymes that hydrolyze carboxylate, phosphate and sulphate esters, but is more often restricted to the first class of substrate. The N-terminus of esterase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=24.50 E-value=1.1e+02 Score=20.44 Aligned_cols=22 Identities=36% Similarity=0.523 Sum_probs=14.5
Q ss_pred EEEEEccCCceEEEEEEECCee
Q psy16754 19 LIKIEPSGPFGYAYSITVDGKT 40 (127)
Q Consensus 19 ~I~I~a~~gf~YeYsL~V~GKs 40 (127)
.+.|.++.+..|.|...|||..
T Consensus 43 ~~~v~~l~~g~Y~Y~~~vdg~~ 64 (85)
T cd02858 43 SVTTGPLAPGIYTYSFLVDGVR 64 (85)
T ss_pred EEEECCCCCcEEEEEEEECCeE
Confidence 3455555666788888888743
No 59
>cd01179 IPT_plexin_repeat2 Second repeat of the IPT domain of Plexins and Cell Surface Receptors (PCSR) . Plexins are involved in the regulation of cell proliferation and of cellular adhesion and repulsion receptors. In general, there are three copies of the IPT domain present preceeded by SEMA (semaphorin) and PSI (plexin, semaphorin, integrin) domains.
Probab=24.35 E-value=1.7e+02 Score=19.26 Aligned_cols=31 Identities=32% Similarity=0.672 Sum_probs=19.0
Q ss_pred CEEeceeeeeeeCCeeEEEEeCCccEEEEEE
Q psy16754 77 GDRVDVTGEFVENGTETHFTIGDTPAYIKAV 107 (127)
Q Consensus 77 G~~iet~geFvd~Gtet~F~lg~~~c~I~a~ 107 (127)
|..|--.|.+-+.|....=.+|+.+|.+...
T Consensus 16 GT~vtI~G~~~~~~~~~~V~ig~~~C~~~~~ 46 (85)
T cd01179 16 GTRLTITGKHLNAGSSVRVTVGGQPCKILSV 46 (85)
T ss_pred CEEEEEEEECCCCCCeEEEEECCeEeeEEEe
Confidence 4455555665555555556677777777554
No 60
>PRK05863 sulfur carrier protein ThiS; Provisional
Probab=24.25 E-value=54 Score=21.01 Aligned_cols=13 Identities=31% Similarity=0.504 Sum_probs=10.2
Q ss_pred eeEEEcCEEecee
Q psy16754 71 LDIWVNGDRVDVT 83 (127)
Q Consensus 71 mdvw~nG~~iet~ 83 (127)
|.|++||+..+..
T Consensus 1 m~i~vNG~~~~~~ 13 (65)
T PRK05863 1 MIVVVNEEQVEVD 13 (65)
T ss_pred CEEEECCEEEEcC
Confidence 7899999987553
No 61
>KOG0275|consensus
Probab=24.19 E-value=1.3e+02 Score=27.08 Aligned_cols=86 Identities=21% Similarity=0.390 Sum_probs=47.3
Q ss_pred EEEEEEE----CCeehHhhh-eeeeeeeeEEeeCCeeEEEEEEccceeEEEcCEEeceeeeeeeCCeeE-----------
Q psy16754 30 YAYSITV----DGKTLEKFS-QSKANCTWIVTVEDETYRVVLERGTLDIWVNGDRVDVTGEFVENGTET----------- 93 (127)
Q Consensus 30 YeYsL~V----~GKsl~kf~-~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~nG~~iet~geFvd~Gtet----------- 93 (127)
|.|+.-| .||.||+|+ ++---+-=..+-||...--.---.|..||- |+--|-...|--.|++.
T Consensus 326 fD~tvRiHGlKSGK~LKEfrGHsSyvn~a~ft~dG~~iisaSsDgtvkvW~-~KtteC~~Tfk~~~~d~~vnsv~~~PKn 404 (508)
T KOG0275|consen 326 FDQTVRIHGLKSGKCLKEFRGHSSYVNEATFTDDGHHIISASSDGTVKVWH-GKTTECLSTFKPLGTDYPVNSVILLPKN 404 (508)
T ss_pred ccceEEEeccccchhHHHhcCccccccceEEcCCCCeEEEecCCccEEEec-CcchhhhhhccCCCCcccceeEEEcCCC
Confidence 4455554 579999999 655444444555555433333445666773 44444444555555431
Q ss_pred --EEEeCC---------ccE-EEEEEeecCceece
Q psy16754 94 --HFTIGD---------TPA-YIKAVSSCNKKEGL 116 (127)
Q Consensus 94 --~F~lg~---------~~c-~I~a~ssG~k~~Gi 116 (127)
||.+-+ .+. .+++.|||+|.-|=
T Consensus 405 peh~iVCNrsntv~imn~qGQvVrsfsSGkREgGd 439 (508)
T KOG0275|consen 405 PEHFIVCNRSNTVYIMNMQGQVVRSFSSGKREGGD 439 (508)
T ss_pred CceEEEEcCCCeEEEEeccceEEeeeccCCccCCc
Confidence 444422 222 24567899887663
No 62
>PRK09504 sufA iron-sulfur cluster assembly scaffold protein; Provisional
Probab=24.03 E-value=2.8e+02 Score=19.99 Aligned_cols=68 Identities=13% Similarity=0.354 Sum_probs=40.1
Q ss_pred eEEEEEEccCCceEEEEEEECCeehHhhheeeeeeeeEEeeCCeeEEEEEEccceeEEEcCEEeceeeeeeeCCeeEEEE
Q psy16754 17 PCLIKIEPSGPFGYAYSITVDGKTLEKFSQSKANCTWIVTVEDETYRVVLERGTLDIWVNGDRVDVTGEFVENGTETHFT 96 (127)
Q Consensus 17 k~~I~I~a~~gf~YeYsL~V~GKsl~kf~~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~nG~~iet~geFvd~Gtet~F~ 96 (127)
...|.|++.|-..|.|.|.....+=+. -+.+..+ ..+|++|+..+ .|.+|-.|| |++++...-|.
T Consensus 40 ~LRi~v~~gGCsG~~Y~~~l~~e~~~~--------D~v~e~~--g~~v~Id~~s~-~~L~g~~ID----y~~~~~~~gF~ 104 (122)
T PRK09504 40 GVRLGVKQTGCAGFGYVLDSVSEPDKD--------DLVFEHD--GAKLFVPLQAM-PFIDGTEVD----YVREGLNQIFK 104 (122)
T ss_pred eEEEEEECCCCCceEEEeeecCCCCCC--------CEEEEeC--CEEEEEcHHHH-HhhCCcEEE----eecCCCcceEE
Confidence 366777766544566666644322111 1222233 46677888866 467775554 67777778898
Q ss_pred eCC
Q psy16754 97 IGD 99 (127)
Q Consensus 97 lg~ 99 (127)
+.+
T Consensus 105 f~N 107 (122)
T PRK09504 105 FHN 107 (122)
T ss_pred EEC
Confidence 854
No 63
>PF03099 BPL_LplA_LipB: Biotin/lipoate A/B protein ligase family This entry is just a subset of the Pfam family; InterPro: IPR004143 This domain is found in biotin protein ligase, lipoate-protein ligase A and B. Biotin is covalently attached at the active site of certain enzymes that transfer carbon dioxide from bicarbonate to organic acids to form cellular metabolites. Biotin protein ligase (BPL) is the enzyme responsible for attaching biotin to a specific lysine at the active site of biotin enzymes. Each organism probably has only one BPL. Biotin attachment is a two step reaction that results in the formation of an amide linkage between the carboxyl group of biotin and the epsilon-amino group of the modified lysine []. Lipoate-protein ligase A (LPLA) (octanoyltransferase) catalyses the formation of an amide linkage between lipoic acid and a specific lysine residue in lipoate dependent enzymes [].; GO: 0003824 catalytic activity, 0006464 protein modification process; PDB: 2ARU_A 2C8M_B 2C7I_A 3R07_A 2ARS_A 2ART_A 3FJP_A 3EFR_B 2EAY_B 3EFS_B ....
Probab=23.30 E-value=1.4e+02 Score=20.26 Aligned_cols=76 Identities=16% Similarity=0.175 Sum_probs=41.6
Q ss_pred EEEEEccCCceEEEEEEECC-----eehHhhh---eeeeeeeeE---EeeCCeeEEEEEEccceeEEEcCEEe-ceeeee
Q psy16754 19 LIKIEPSGPFGYAYSITVDG-----KTLEKFS---QSKANCTWI---VTVEDETYRVVLERGTLDIWVNGDRV-DVTGEF 86 (127)
Q Consensus 19 ~I~I~a~~gf~YeYsL~V~G-----Ksl~kf~---~~k~~~tW~---~~i~g~~~RIvLdK~tmdvw~nG~~i-et~geF 86 (127)
.++.+|.+...|...+..+. ..+..|. ....+++.. ....|..+++ |-.-||+++|+|| -..-|-
T Consensus 33 ~~w~~p~g~l~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~---kw~nDi~~~~kKi~Gil~~~ 109 (125)
T PF03099_consen 33 RVWHSPPGNLYFSLILPPDDPNFPPSDIPSYILLAALAVLEALGEFGPGEPGIDCFI---KWPNDIYVNGKKIAGILQER 109 (125)
T ss_dssp EEEEBTTTEEEEEEEEETTTTTHHGGGHHHHHHHHHHHHHHHHHHTTHTTTTSSEEE---ETTTEEEETTEEEEEEEEEE
T ss_pred ceeeeCCcEEEEEEEEccccccccchhhhHHHHHHHHHHHHHhhhhccccCCCceEE---eCCCCccCCCcEEEEEeEee
Confidence 56667655455555555443 2455554 222222221 1223556664 6889999999999 445555
Q ss_pred eeCCeeEEEEe
Q psy16754 87 VENGTETHFTI 97 (127)
Q Consensus 87 vd~Gtet~F~l 97 (127)
..++...|..+
T Consensus 110 ~~~~~~~~~~i 120 (125)
T PF03099_consen 110 RRGGILHHGSI 120 (125)
T ss_dssp ETTEEEEEEEE
T ss_pred eCCcEEEEEEE
Confidence 55555455544
No 64
>cd00110 LamG Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Probab=23.13 E-value=2e+02 Score=19.64 Aligned_cols=66 Identities=17% Similarity=0.215 Sum_probs=33.6
Q ss_pred CCeeEEEEE--EccceeEEEcCEEeceeeeeee---CCeeEEEEeCCccEEEEE--EeecCceeceEEEEEec
Q psy16754 58 EDETYRVVL--ERGTLDIWVNGDRVDVTGEFVE---NGTETHFTIGDTPAYIKA--VSSCNKKEGLLYSLISK 123 (127)
Q Consensus 58 ~g~~~RIvL--dK~tmdvw~nG~~iet~geFvd---~Gtet~F~lg~~~c~I~a--~ssG~k~~Gi~H~L~vn 123 (127)
||+-++|.+ +...+.+++||+...+...-.. --....+-+|+.|-.... ..+...=.|-+..+.+|
T Consensus 79 dg~Wh~v~i~~~~~~~~l~VD~~~~~~~~~~~~~~~~~~~~~~~iGg~~~~~~~~~~~~~~~F~Gci~~v~in 151 (151)
T cd00110 79 DGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDLKSPGLPVSPGFVGCIRDLKVN 151 (151)
T ss_pred CCCEEEEEEEECCCEEEEEECCccEEeeeCCCCceeecCCCCeEEcCCCCchhcccccccCCCceEeeEeEeC
Confidence 566555555 5667889999974322211111 112345667776643321 11223335666666654
No 65
>PF06832 BiPBP_C: Penicillin-Binding Protein C-terminus Family; InterPro: IPR009647 This conserved region of approximately 90 residues is found in a sub-group of bacterial Penicillin-Binding Proteins (PBPs). A variable length loop region separates this region from the transpeptidase unit (IPR001460 from INTERPRO). It is predicted to be a beta fold.
Probab=22.96 E-value=2.2e+02 Score=18.83 Aligned_cols=50 Identities=18% Similarity=0.202 Sum_probs=31.0
Q ss_pred eEEEEEEccCCceEEEEEEECCeehHhhheeeeeeeeEEeeCCeeEEEEEEc
Q psy16754 17 PCLIKIEPSGPFGYAYSITVDGKTLEKFSQSKANCTWIVTVEDETYRVVLER 68 (127)
Q Consensus 17 k~~I~I~a~~gf~YeYsL~V~GKsl~kf~~~k~~~tW~~~i~g~~~RIvLdK 68 (127)
.-.|.+++.++.. .+...|||+.+..=. ...--.|.+.-.|+..-.|+|.
T Consensus 31 ~~~l~l~a~~~~~-~~~W~vdg~~~g~~~-~~~~~~~~~~~~G~h~l~vvD~ 80 (89)
T PF06832_consen 31 RQPLVLKAAGGRG-PVYWFVDGEPLGTTQ-PGHQLFWQPDRPGEHTLTVVDA 80 (89)
T ss_pred cceEEEEEeCCCC-cEEEEECCEEcccCC-CCCeEEeCCCCCeeEEEEEEcC
Confidence 4667777777766 999999999993333 2334457774445443333443
No 66
>KOG2372|consensus
Probab=22.92 E-value=68 Score=26.81 Aligned_cols=65 Identities=20% Similarity=0.370 Sum_probs=47.0
Q ss_pred ccccceeeEEeCCcceEEEEEEccCCceEEEEEEECCeehHhhh-eeeeeeeeEEeeCCeeEEEEEEccceeEEE
Q psy16754 2 FRLVGEESFQLDSKVPCLIKIEPSGPFGYAYSITVDGKTLEKFS-QSKANCTWIVTVEDETYRVVLERGTLDIWV 75 (127)
Q Consensus 2 FkLVGke~F~i~~~~k~~I~I~a~~gf~YeYsL~V~GKsl~kf~-~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~ 75 (127)
||.+|...|-| .|..---|.|+..=.|.|.+||--+--.. ...++.-=.+- ++++++|+ -+|+|.
T Consensus 174 y~~TG~n~f~i----~c~~dfLa~G~GgGRfgLwLD~dL~~G~S~~ceTFgN~~Ls-~~qdF~Iq----~lElW~ 239 (241)
T KOG2372|consen 174 YRWTGDNSFFI----YCDKDFLAFGGGGGRFGLWLDGDLLHGSSHPCETFGNEPLS-DKQDFIIQ----DLELWG 239 (241)
T ss_pred eeecCCcceEE----EechhHhhhcCCCCceEEEecccccccccCCCcccCCcccC-CcCceEEE----EEEEEe
Confidence 77888888885 36666677777777899999998877666 55555443333 88899984 378886
No 67
>PF00380 Ribosomal_S9: Ribosomal protein S9/S16; InterPro: IPR000754 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein S9 is one of the proteins from the small ribosomal subunit. It belongs to the S9P family of ribosomal proteins which, on the basis of sequence similarities [, ], groups bacterial; algal chloroplast; cyanelle and archaeal S9 proteins; and mammalian; plant; and yeast mitochondrial ribosomal S9 proteins. These proteins adopt a beta-alpha-beta fold similar to that found in numerous RNA/DNA-binding proteins, as well as in kinases from the GHMP kinase family [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 2V46_I 3T1H_I 3MR8_I 3F1G_I 3D5C_I 3D5A_I 2WDG_I 3MS0_I 2WDM_I 2J02_I ....
Probab=22.30 E-value=1.3e+02 Score=22.26 Aligned_cols=25 Identities=24% Similarity=0.486 Sum_probs=18.9
Q ss_pred ceEEEEEEccCCceEEEEEEECCeehHhhh
Q psy16754 16 VPCLIKIEPSGPFGYAYSITVDGKTLEKFS 45 (127)
Q Consensus 16 ~k~~I~I~a~~gf~YeYsL~V~GKsl~kf~ 45 (127)
..|.+.+.+-+| .+.|||+|+.+|-
T Consensus 6 a~A~v~l~~G~G-----~i~INg~~l~~yf 30 (121)
T PF00380_consen 6 AIARVWLKPGSG-----KIRINGKPLEEYF 30 (121)
T ss_dssp EEEEEEEEESSS-----EEEETTSEHHHHS
T ss_pred EEEEEEEEeCce-----EEEECCEEHHHhc
Confidence 457777777433 6799999998887
No 68
>PRK04313 30S ribosomal protein S4e; Validated
Probab=22.24 E-value=1.2e+02 Score=25.19 Aligned_cols=37 Identities=19% Similarity=0.289 Sum_probs=26.7
Q ss_pred EEEECCeehHhhh-eeeeeeeeEEeeCCeeEEEEEEcc
Q psy16754 33 SITVDGKTLEKFS-QSKANCTWIVTVEDETYRVVLERG 69 (127)
Q Consensus 33 sL~V~GKsl~kf~-~~k~~~tW~~~i~g~~~RIvLdK~ 69 (127)
.+.||||.-++++ ----+..=.+.-.|+.|||++|..
T Consensus 65 ~V~VDGkvr~D~~~PvGlmDVIsI~~~~e~yRvl~d~k 102 (237)
T PRK04313 65 KVLVDGRVRKDYKFPVGLMDVISIPETGEYYRVLPDEK 102 (237)
T ss_pred cEEECCEEEcccccCcCceeEEEEccCCCeEEEEECCC
Confidence 3578888888888 445555555555788888888755
No 69
>PF14438 SM-ATX: Ataxin 2 SM domain; PDB: 1M5Q_1.
Probab=22.13 E-value=89 Score=20.45 Aligned_cols=24 Identities=13% Similarity=0.148 Sum_probs=17.8
Q ss_pred CccEEEEEEeecCceeceEEEEEec
Q psy16754 99 DTPAYIKAVSSCNKKEGLLYSLISK 123 (127)
Q Consensus 99 ~~~c~I~a~ssG~k~~Gi~H~L~vn 123 (127)
|++|.|. ..+|.+=+||.|+.-.+
T Consensus 12 G~~V~V~-~~~G~~yeGif~s~s~~ 35 (77)
T PF14438_consen 12 GQTVEVT-TKNGSVYEGIFHSASPE 35 (77)
T ss_dssp TSEEEEE-ETTS-EEEEEEEEE-T-
T ss_pred CCEEEEE-ECCCCEEEEEEEeCCCc
Confidence 6788885 58999999999997654
No 70
>PF13954 PapC_N: PapC N-terminal domain; PDB: 2VQI_B 3FIP_A 3RFZ_E 3OHN_A 1ZDV_A 1ZE3_D 3BWU_D 1ZDX_A.
Probab=22.09 E-value=92 Score=22.81 Aligned_cols=23 Identities=30% Similarity=0.603 Sum_probs=15.5
Q ss_pred ceeEEEcCEEec-eeeeeeeCCee
Q psy16754 70 TLDIWVNGDRVD-VTGEFVENGTE 92 (127)
Q Consensus 70 tmdvw~nG~~ie-t~geFvd~Gte 92 (127)
..+|++||+.+. ..=+|..++..
T Consensus 33 ~vdv~vN~~~~~~~~i~f~~~~~~ 56 (146)
T PF13954_consen 33 SVDVYVNGKFIGRYDIEFINNDDG 56 (146)
T ss_dssp EEEEEETTEEEEEEEEEEEESSST
T ss_pred EEEEEECCeeeeeEEEEEEeCCCc
Confidence 478899998884 33346666654
No 71
>COG0071 IbpA Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]
Probab=21.91 E-value=3.1e+02 Score=19.95 Aligned_cols=29 Identities=31% Similarity=0.664 Sum_probs=22.1
Q ss_pred CCeeEEEEEE-----ccceeEEEcCEEeceeeee
Q psy16754 58 EDETYRVVLE-----RGTLDIWVNGDRVDVTGEF 86 (127)
Q Consensus 58 ~g~~~RIvLd-----K~tmdvw~nG~~iet~geF 86 (127)
.+.++||++| |+.++|-+.+..+.-.|+=
T Consensus 48 ~~~~~~I~~elPG~~kedI~I~~~~~~l~I~g~~ 81 (146)
T COG0071 48 TDDEYRITAELPGVDKEDIEITVEGNTLTIRGER 81 (146)
T ss_pred cCCEEEEEEEcCCCChHHeEEEEECCEEEEEEEe
Confidence 5669999986 7889999988866555554
No 72
>cd03829 Sina Seven in absentia (Sina) protein family, C-terminal substrate binding domain; composed of the Drosophila Sina protein, the mammalian Sina homolog (Siah), the plant protein SINAT5, and similar proteins. Sina, Siah and SINAT5 are RING-containing proteins that function as E3 ubiquitin ligases, acting either as single proteins or as a part of multiprotein complexes. Sina is expressed in many cells in the developing eye but is essential specifically for R7 photoreceptor cell development. Sina cooperates with Phyllopod (Phyl), Ebi and the E2 ubiquitin-conjugating enzyme Ubcd1 to catalyze the ubiquitination and subsequent degradation of Tramtrack (Ttk88); Ttk88 is a transcriptional repressor that blocks photoreceptor differentiation. Similarly, the mammalian homologue Siah1 cooperates with SIP (Siah-interacting protein), Ebi and the adaptor protein Skp1, to target beta-catenin for ubiquitination and degradation via a p53-dependent mechanism. SINAT5 targets NAC1 for ubiquitin-medi
Probab=21.74 E-value=60 Score=24.68 Aligned_cols=21 Identities=24% Similarity=0.659 Sum_probs=16.3
Q ss_pred ceEEEEEEECCeehHhhheeeeeeeeEEe
Q psy16754 28 FGYAYSITVDGKTLEKFSQSKANCTWIVT 56 (127)
Q Consensus 28 f~YeYsL~V~GKsl~kf~~~k~~~tW~~~ 56 (127)
-.|.|+|+|+| +.+..+|+-+
T Consensus 65 ~nf~Y~Lel~~--------n~RkL~we~~ 85 (127)
T cd03829 65 ENFTYRLELNG--------NRRRLTWEAT 85 (127)
T ss_pred hcceEEEEEcC--------CCcEEEeecC
Confidence 46899999996 4677788754
No 73
>PF02210 Laminin_G_2: Laminin G domain; InterPro: IPR012680 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, including a large number of extracellular proteins. The C terminus of the laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions have been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each have five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012679 from INTERPRO).; PDB: 3POY_A 3QCW_B 3R05_B 3ASI_A 3MW4_B 3MW3_A 1QU0_D 1DYK_A 1OKQ_A 3SH4_A ....
Probab=21.64 E-value=1.4e+02 Score=19.60 Aligned_cols=68 Identities=15% Similarity=0.150 Sum_probs=41.1
Q ss_pred CCeeEEEEE--EccceeEEEcCEEeceeeeee----eCCeeEEEEeCCccEEEEEEee--cCceeceEEEEEecCc
Q psy16754 58 EDETYRVVL--ERGTLDIWVNGDRVDVTGEFV----ENGTETHFTIGDTPAYIKAVSS--CNKKEGLLYSLISKLK 125 (127)
Q Consensus 58 ~g~~~RIvL--dK~tmdvw~nG~~iet~geFv----d~Gtet~F~lg~~~c~I~a~ss--G~k~~Gi~H~L~vn~~ 125 (127)
||+-++|.+ +...+.+.||++......... .-.....+.+|+.|-....... ...=.|-+..|++||+
T Consensus 53 dg~wh~v~i~~~~~~~~l~Vd~~~~~~~~~~~~~~~~~~~~~~l~iGg~~~~~~~~~~~~~~~f~Gci~~l~vng~ 128 (128)
T PF02210_consen 53 DGQWHKVSISRDGNRVTLTVDGQSVSSESLPSSSSDSLDPDGSLYIGGLPESNQPSGSVDTPGFVGCIRDLRVNGQ 128 (128)
T ss_dssp SSSEEEEEEEEETTEEEEEETTSEEEEEESSSTTHHCBESEEEEEESSTTTTCTCTTSSTTSB-EEEEEEEEETTE
T ss_pred ccceeEEEEEEeeeeEEEEecCccceEEeccccceecccCCCCEEEecccCccccccccCCCCcEEEcCeEEECCC
Confidence 566555555 566788999998764444322 2223344788877765432111 4445688999999985
No 74
>PRK07696 sulfur carrier protein ThiS; Provisional
Probab=21.41 E-value=66 Score=20.86 Aligned_cols=11 Identities=18% Similarity=0.827 Sum_probs=9.0
Q ss_pred eeEEEcCEEec
Q psy16754 71 LDIWVNGDRVD 81 (127)
Q Consensus 71 mdvw~nG~~ie 81 (127)
|.|++||+..+
T Consensus 1 m~I~vNG~~~~ 11 (67)
T PRK07696 1 MNLKINGNQIE 11 (67)
T ss_pred CEEEECCEEEE
Confidence 78999998764
No 75
>PF05508 Ran-binding: RanGTP-binding protein; InterPro: IPR008812 The small Ras-like GTPase Ran plays an essential role in the transport of macromolecules in and out of the nucleus and has been implicated in spindle and nuclear envelope formation during mitosis in higher eukaryotes. The Saccharomyces cerevisiae ORF YGL164c encoding a novel RanGTP-binding protein, termed Yrb30p was identified. The protein competes with S. cerevisiae RanBP1 (Yrb1p) for binding to the GTP-bound form of S. cerevisiae Ran (Gsp1p) and is, like Yrb1p, able to form trimeric complexes with RanGTP and some of the karyopherins [].
Probab=21.39 E-value=1.2e+02 Score=26.00 Aligned_cols=22 Identities=23% Similarity=0.384 Sum_probs=17.8
Q ss_pred ceEEEEEE--ccCCceEEEEEEEC
Q psy16754 16 VPCLIKIE--PSGPFGYAYSITVD 37 (127)
Q Consensus 16 ~k~~I~I~--a~~gf~YeYsL~V~ 37 (127)
.||.++|- +-....|.|.|+|.
T Consensus 223 ~ka~v~l~R~~~~~~~y~Y~L~I~ 246 (302)
T PF05508_consen 223 HKARVRLCRVPSKSDEYAYQLEII 246 (302)
T ss_pred cceEEEEEEecCCCCceEEEEEEe
Confidence 67888885 35679999999997
No 76
>cd03775 MATH_Ubp21p Ubiquitin-specific protease 21 (Ubp21p) family, MATH domain; composed of fungal proteins with similarity to Ubp21p of fission yeast. Ubp21p is a deubiquitinating enzyme that may be involved in the regulation of the protein kinase Prp4p, which controls the formation of active spliceosomes. Members of this family are similar to human HAUSP (Herpesvirus-associated ubiquitin-specific protease) in that they contain an N-terminal MATH domain and a C-terminal catalytic protease (C19 family) domain. HAUSP is also an ubiquitin-specific protease that specifically catalyzes the deubiquitylation of p53 and MDM2. The MATH domain of HAUSP contains the binding site for p53 and MDM2. Similarly, the MATH domain of members in this family may be involved in substrate binding.
Probab=21.18 E-value=1.4e+02 Score=21.20 Aligned_cols=34 Identities=9% Similarity=0.037 Sum_probs=23.8
Q ss_pred EEEEECCeehHhhheeeeeeeeEEeeCCeeEEEEEEcc
Q psy16754 32 YSITVDGKTLEKFSQSKANCTWIVTVEDETYRVVLERG 69 (127)
Q Consensus 32 YsL~V~GKsl~kf~~~k~~~tW~~~i~g~~~RIvLdK~ 69 (127)
|+..|+|=|. ..+...+-...++|.+|||.+-..
T Consensus 3 f~w~I~~fS~----~~~~~~S~~F~vGG~~W~l~~yP~ 36 (134)
T cd03775 3 FTWRIKNWSE----LEKKVHSPKFKCGGFEWRILLFPQ 36 (134)
T ss_pred EEEEECCccc----CCcceeCCCEEECCeeEEEEEeCC
Confidence 5666665554 234566677788999999998754
No 77
>PF14275 DUF4362: Domain of unknown function (DUF4362)
Probab=21.15 E-value=1.5e+02 Score=21.29 Aligned_cols=28 Identities=14% Similarity=0.131 Sum_probs=21.3
Q ss_pred CccEEEEEEeecCceeceEEEEEecCcC
Q psy16754 99 DTPAYIKAVSSCNKKEGLLYSLISKLKT 126 (127)
Q Consensus 99 ~~~c~I~a~ssG~k~~Gi~H~L~vn~~~ 126 (127)
+.++.|+.+.-.....-|+|.|..||++
T Consensus 28 ~k~d~IrIv~yT~EGdPI~~~L~~~G~~ 55 (98)
T PF14275_consen 28 GKPDKIRIVQYTIEGDPIFQDLEYDGNQ 55 (98)
T ss_pred CCCCEEEEEEecCCCCCEEEEEEECCCE
Confidence 5667777777777777888888888864
No 78
>CHL00079 rps9 ribosomal protein S9
Probab=21.12 E-value=1.1e+02 Score=22.96 Aligned_cols=25 Identities=20% Similarity=0.482 Sum_probs=19.1
Q ss_pred ceEEEEEEccCCceEEEEEEECCeehHhhh
Q psy16754 16 VPCLIKIEPSGPFGYAYSITVDGKTLEKFS 45 (127)
Q Consensus 16 ~k~~I~I~a~~gf~YeYsL~V~GKsl~kf~ 45 (127)
..|.+++.+.+| .+.|||+++++|-
T Consensus 14 a~Arv~l~~G~G-----~i~INg~~~~~yf 38 (130)
T CHL00079 14 AVAQVRLVPGSG-----EIIINGKPAEEYL 38 (130)
T ss_pred eEEEEEEEcCCc-----EEEECCCcHHHHC
Confidence 457777876433 4789999999996
No 79
>cd02861 E_set_proteins_like E or "early" set-like proteins. These alpha amylase-like sugar utilizing enzymes which may be related to the immunoglobulin and/or fibronectin type III superfamilies are associated with different types of catalytic domains at either the N-terminal or C-terminal end. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=20.93 E-value=1.7e+02 Score=19.08 Aligned_cols=10 Identities=50% Similarity=1.086 Sum_probs=7.0
Q ss_pred EEEEEEECCe
Q psy16754 30 YAYSITVDGK 39 (127)
Q Consensus 30 YeYsL~V~GK 39 (127)
|+|...|||.
T Consensus 49 y~Ykf~vdg~ 58 (82)
T cd02861 49 YEYKFVVDGE 58 (82)
T ss_pred EEEEEEECCE
Confidence 7777777764
No 80
>PF09865 DUF2092: Predicted periplasmic protein (DUF2092); InterPro: IPR019207 This entry represents various hypothetical prokaryotic proteins of unknown function.
Probab=20.75 E-value=2.8e+02 Score=22.24 Aligned_cols=43 Identities=12% Similarity=0.223 Sum_probs=26.3
Q ss_pred EEEcCEEeceeeeeeeCCeeEEEEeCCccEEEEEEeecCceeceEEEEEecCcC
Q psy16754 73 IWVNGDRVDVTGEFVENGTETHFTIGDTPAYIKAVSSCNKKEGLLYSLISKLKT 126 (127)
Q Consensus 73 vw~nG~~iet~geFvd~Gtet~F~lg~~~c~I~a~ssG~k~~Gi~H~L~vn~~~ 126 (127)
|.-+|+++.- ++..++.| ..|=++++..+|+.+.. .++.||+|
T Consensus 30 v~~~gqklq~-------~~~~~v~v-~RPdklr~~~~gd~~~~---~~~yDGkt 72 (214)
T PF09865_consen 30 VTPDGQKLQF-------SSSGTVTV-QRPDKLRIDRRGDGADR---EFYYDGKT 72 (214)
T ss_pred ecCCCceEEE-------EEEEEEEE-eCCCeEEEEEEcCCcce---EEEECCCE
Confidence 4455655543 33455666 56777777777775543 57888875
No 81
>PF14651 Lipocalin_7: Lipocalin / cytosolic fatty-acid binding protein family; PDB: 2F73_A 3B2L_A 3B2J_A 3STN_A 3STK_A 3STM_X 3VG6_A 3VG4_A 3VG5_A 3B2K_A ....
Probab=20.73 E-value=3.6e+02 Score=20.00 Aligned_cols=83 Identities=22% Similarity=0.283 Sum_probs=50.6
Q ss_pred ceEEEEEEccCCceEEEEEEE-CCeeh-HhhheeeeeeeeEEeeCCeeEEEE--EEccceeEEEcCEEeceeeeeeeCCe
Q psy16754 16 VPCLIKIEPSGPFGYAYSITV-DGKTL-EKFSQSKANCTWIVTVEDETYRVV--LERGTLDIWVNGDRVDVTGEFVENGT 91 (127)
Q Consensus 16 ~k~~I~I~a~~gf~YeYsL~V-~GKsl-~kf~~~k~~~tW~~~i~g~~~RIv--LdK~tmdvw~nG~~iet~geFvd~Gt 91 (127)
.+..+.|.. .|..|.++..- +||++ ..|.--+.+. +..++|.....+ ||=+.|- ++..+++..+|+++|=-
T Consensus 35 ~k~v~Ei~q-~Gd~ft~t~~~~ggk~~tNtFtiG~E~e--~~~~~G~k~K~tvt~EG~KLv--~~~~~~~~~~Ei~gd~l 109 (128)
T PF14651_consen 35 AKPVTEISQ-NGDDFTWTTTTPGGKTVTNTFTIGKECE--EETMGGKKFKATVTMEGGKLV--VTYPNYDSTREIVGDEL 109 (128)
T ss_dssp S-EEEEEEE-ETTEEEEEEEETTTEEEEEEEETTCEEE--EEETTSCEEEEEEEEETTEEE--EEETTEEEEEEEETTEE
T ss_pred CCceEEEEE-eCCeEEEEEEecCCCEEEEEEEcCcceE--EEcCCCcEEEEEEEEECCEEE--EEcCCccEEEEEECCEE
Confidence 566777776 57888887754 46775 5666222222 445678777666 4444443 34445677888888755
Q ss_pred eEEEEeCCccEE
Q psy16754 92 ETHFTIGDTPAY 103 (127)
Q Consensus 92 et~F~lg~~~c~ 103 (127)
...+++|+..|.
T Consensus 110 v~t~T~g~vt~k 121 (128)
T PF14651_consen 110 VETMTLGGVTFK 121 (128)
T ss_dssp EEEEEETTEEEE
T ss_pred EEEEEeCCEEEE
Confidence 556788766554
No 82
>TIGR02011 IscA iron-sulfur cluster assembly protein IscA. This clade is limited to the proteobacteria.
Probab=20.59 E-value=3e+02 Score=19.02 Aligned_cols=67 Identities=19% Similarity=0.431 Sum_probs=39.8
Q ss_pred EEEEEEccCCceEEEEEEECCeehHhhheeeeeeeeEEeeCCeeEEEEEEccceeEEEcCEEeceeeeeeeCCeeEEEEe
Q psy16754 18 CLIKIEPSGPFGYAYSITVDGKTLEKFSQSKANCTWIVTVEDETYRVVLERGTLDIWVNGDRVDVTGEFVENGTETHFTI 97 (127)
Q Consensus 18 ~~I~I~a~~gf~YeYsL~V~GKsl~kf~~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~nG~~iet~geFvd~Gtet~F~l 97 (127)
..|.|++-|-..|.|.|.....+-+. -+.+..+ ..+|++|+..++ |.+|..|| |+++..-..|.+
T Consensus 24 lRi~v~~~GCsG~~y~l~l~~~~~~~--------D~v~~~~--g~~v~id~~s~~-~l~g~~ID----y~~~~~~~~F~~ 88 (105)
T TIGR02011 24 LRLGVKTSGCSGMAYVLEFVDEPTPD--------DIVFEDK--GVKIVIDGKSLQ-YLDGTQLD----FVKEGLNEGFKF 88 (105)
T ss_pred EEEEEeCCCCCCEEEEeeecCCCCCC--------CEEEEcC--CEEEEEcHHHhH-HhCCCEEE----EecCCCcceEEE
Confidence 56777776666788888754222111 1222333 477888887554 66676665 666666667877
Q ss_pred CC
Q psy16754 98 GD 99 (127)
Q Consensus 98 g~ 99 (127)
.+
T Consensus 89 ~n 90 (105)
T TIGR02011 89 TN 90 (105)
T ss_pred EC
Confidence 53
No 83
>smart00363 S4 S4 RNA-binding domain.
Probab=20.58 E-value=1.4e+02 Score=16.83 Aligned_cols=22 Identities=27% Similarity=0.374 Sum_probs=14.8
Q ss_pred eEEEcCEEeceeeeeeeCCeeE
Q psy16754 72 DIWVNGDRVDVTGEFVENGTET 93 (127)
Q Consensus 72 dvw~nG~~iet~geFvd~Gtet 93 (127)
.|++||+.+.....-+..|..-
T Consensus 27 ~i~vng~~~~~~~~~l~~gd~i 48 (60)
T smart00363 27 RVKVNGKKVTKPSYIVKPGDVI 48 (60)
T ss_pred CEEECCEEecCCCeEeCCCCEE
Confidence 5789998884455556666543
No 84
>PF10262 Rdx: Rdx family; InterPro: IPR011893 This entry represents the Rdx family of selenoproteins, which includes mammalian selenoproteins SelW, SelV, SelT and SelH, bacterial SelW-like proteins and cysteine-containing proteins of unknown function in all three domains of life. Mammalian Rdx12 and its fish selenoprotein orthologues are also members of this family []. These proteins possess a thioredoxin-like fold and a conserved CXXC or CxxU (U is selenocysteine) motif near the N terminus, suggesting a redox function. Rdx proteins can use catalytic cysteine (or selenocysteine) to form transient mixed disulphides with substrate proteins. Selenium (Se) plays an essential role in cell survival and most of the effects of Se are probably mediated by selenoproteins. Selenoprotein W (SelW) plays an important role in protection of neurons from oxidative stress during neuronal development [], []. Selenoprotein T (SelT) is conserved from plants to humans. SelT is localized to the endoplasmic reticulum through a hydrophobic domain. The protein binds to UDP-glucose:glycoprotein glucosyltransferase (UGTR), the endoplasmic reticulum (ER)-resident protein, which is known to be involved in the quality control of protein folding [, ]. The function of SelT is unknown, although it may have a role in PACAP signaling during PC12 cell differentiation [, ]. Selenoprotein H (SelH) protects neurons against UVB-induced damage by inhibiting apoptotic cell death pathways, by preventing mitochondrial depolarization, and by promoting cell survival pathways [].; GO: 0008430 selenium binding, 0045454 cell redox homeostasis; PDB: 2OJL_B 2FA8_A 2P0G_C 2NPB_A 3DEX_C 2OKA_A 2OBK_G.
Probab=20.48 E-value=58 Score=21.43 Aligned_cols=15 Identities=27% Similarity=0.545 Sum_probs=11.5
Q ss_pred EEccceeEEEcCEEe
Q psy16754 66 LERGTLDIWVNGDRV 80 (127)
Q Consensus 66 LdK~tmdvw~nG~~i 80 (127)
....+++||+||+.|
T Consensus 40 ~~~G~FEV~v~g~lI 54 (76)
T PF10262_consen 40 GSTGAFEVTVNGELI 54 (76)
T ss_dssp ESTT-EEEEETTEEE
T ss_pred ccCCEEEEEEccEEE
Confidence 346789999999977
No 85
>TIGR00731 ctc_TL5 ribosomal protein L25, Ctc-form. The C-terminal domain of sll1824, an apparent L25 of Synechocystis PCC6803, matches the N-terminal domain of this family. Examples of L25 and Ctc are not separated by a UPGMA tree built on the region of shared homology.
Probab=20.48 E-value=4e+02 Score=20.47 Aligned_cols=83 Identities=13% Similarity=0.229 Sum_probs=50.2
Q ss_pred EEEEECCeehHhhh-eeeeeeeeEEeeCCeeEEEEEEccceeEEEcCEEeceeeeeeeCCeeEEEEeCCccEEEEEEeec
Q psy16754 32 YSITVDGKTLEKFS-QSKANCTWIVTVEDETYRVVLERGTLDIWVNGDRVDVTGEFVENGTETHFTIGDTPAYIKAVSSC 110 (127)
Q Consensus 32 YsL~V~GKsl~kf~-~~k~~~tW~~~i~g~~~RIvLdK~tmdvw~nG~~iet~geFvd~Gtet~F~lg~~~c~I~a~ssG 110 (127)
-++.|+-+.|.+.. .........++++|...++++--=-++ .+.++.+-..=.-++.+.+.+..+ |..+.-.+.|
T Consensus 35 ~~i~v~~~el~k~l~~~~~~~~i~L~v~g~~~~vlikevQ~~-pv~~~i~HvDF~~v~~~~~v~v~V---Pv~~~G~~~g 110 (176)
T TIGR00731 35 VNLELKSKEFIKYLRKGATSTVLTLEIGGKEFKVLVKDYQYN-PVTNEVIHVDFLEVVEGVKLKVEV---PIKLIGTPIG 110 (176)
T ss_pred EEEEECHHHHHHHHhccCCcEEEEEEECCEEEEEEEehhhhc-cCCCCeEEEEeEEeCCCCEEEEEe---eEEEeccccc
Confidence 46789999998887 332455677888999887766433333 345555544444556777766555 4444444445
Q ss_pred CceeceEE
Q psy16754 111 NKKEGLLY 118 (127)
Q Consensus 111 ~k~~Gi~H 118 (127)
-++-|++.
T Consensus 111 vk~GG~l~ 118 (176)
T TIGR00731 111 VKNGGILT 118 (176)
T ss_pred ccCCcEEE
Confidence 55545443
Done!