Query psy10106
Match_columns 101
No_of_seqs 110 out of 550
Neff 6.7
Searched_HMMs 46136
Date Fri Aug 16 18:01:37 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10106.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10106hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PTZ00059 dynein light chain; P 100.0 8.1E-42 1.8E-46 220.4 12.4 89 12-101 2-90 (90)
2 KOG3430|consensus 100.0 6.9E-42 1.5E-46 218.4 11.6 89 12-101 1-90 (90)
3 PF01221 Dynein_light: Dynein 100.0 1.2E-40 2.6E-45 213.9 9.7 89 12-101 1-89 (89)
4 PLN03058 dynein light chain ty 100.0 3E-39 6.5E-44 219.8 12.4 86 16-101 34-122 (128)
5 PF04155 Ground-like: Ground-l 97.7 0.00064 1.4E-08 42.1 8.5 54 45-99 22-76 (76)
6 PF05075 DUF684: Protein of un 91.2 2 4.4E-05 33.5 8.4 54 47-100 188-251 (345)
7 PF15650 Tox-REase-9: Restrict 79.7 1.5 3.3E-05 28.2 1.9 17 54-70 70-86 (89)
8 PF10703 MoaF: Molybdenum cofa 78.9 3.8 8.3E-05 31.2 4.2 32 65-99 182-214 (265)
9 PF12006 DUF3500: Protein of u 75.2 22 0.00047 27.6 7.5 44 19-62 215-261 (313)
10 PF12652 CotJB: CotJB protein; 67.7 3.1 6.8E-05 26.0 1.2 16 50-65 41-56 (78)
11 COG1570 XseA Exonuclease VII, 63.8 15 0.00033 30.0 4.7 49 48-100 9-61 (440)
12 PF13742 tRNA_anti_2: OB-fold 62.0 28 0.00062 22.1 4.9 49 48-100 5-59 (99)
13 PRK00286 xseA exodeoxyribonucl 59.4 26 0.00056 27.9 5.3 47 48-100 9-61 (438)
14 PF11858 DUF3378: Domain of un 56.2 18 0.00039 22.6 3.2 20 82-101 29-48 (81)
15 KOG2451|consensus 53.0 29 0.00063 28.4 4.5 42 18-59 51-94 (503)
16 PF05184 SapB_1: Saposin-like 52.2 32 0.0007 17.6 3.9 29 29-57 6-34 (39)
17 PF06150 ChaB: ChaB; InterPro 51.7 47 0.001 19.4 5.1 44 23-67 7-54 (57)
18 PF08776 VASP_tetra: VASP tetr 47.5 31 0.00068 19.0 2.8 32 27-60 6-37 (40)
19 PF12362 DUF3646: DNA polymera 46.9 25 0.00054 23.5 2.9 29 47-75 48-76 (117)
20 PF00352 TBP: Transcription fa 46.7 44 0.00095 20.6 3.8 25 77-101 30-58 (86)
21 cd03472 Rieske_RO_Alpha_BPDO_l 46.1 41 0.00089 22.3 3.9 38 58-100 1-39 (128)
22 PF08958 DUF1871: Domain of un 45.1 23 0.00051 22.1 2.4 29 36-64 26-54 (79)
23 PRK13007 succinyl-diaminopimel 43.3 1.4E+02 0.003 22.4 6.8 37 25-61 1-40 (352)
24 TIGR00237 xseA exodeoxyribonuc 40.9 97 0.0021 24.9 5.9 47 48-100 3-55 (432)
25 PF14900 DUF4493: Domain of un 40.6 69 0.0015 23.2 4.6 37 56-92 119-160 (235)
26 TIGR02084 leud 3-isopropylmala 40.2 27 0.00059 24.4 2.3 34 46-79 25-62 (156)
27 PF06457 Ectatomin: Ectatomin; 36.9 16 0.00035 19.1 0.6 22 37-60 13-34 (34)
28 cd03538 Rieske_RO_Alpha_AntDO 36.8 91 0.002 21.1 4.5 44 52-100 9-53 (146)
29 PF12550 GCR1_C: Transcription 36.6 11 0.00024 23.2 -0.1 12 56-67 29-40 (81)
30 PF08594 UPF0300: Uncharacteri 35.6 1.2E+02 0.0026 22.6 5.1 58 25-82 37-112 (215)
31 PF10925 DUF2680: Protein of u 35.5 44 0.00095 19.6 2.4 16 47-62 34-49 (59)
32 KOG2130|consensus 34.9 33 0.00073 27.3 2.3 19 63-81 277-295 (407)
33 cd00454 Trunc_globin Truncated 34.5 1.2E+02 0.0026 19.0 4.9 37 23-59 71-107 (116)
34 PF08006 DUF1700: Protein of u 34.0 1.2E+02 0.0025 21.1 4.8 37 23-59 15-65 (181)
35 PRK14023 homoaconitate hydrata 33.8 59 0.0013 23.0 3.2 33 47-79 28-64 (166)
36 COG2101 SPT15 TATA-box binding 33.1 35 0.00077 24.7 2.0 16 86-101 45-62 (185)
37 PF08015 Pheromone: Fungal mat 32.9 25 0.00053 21.0 1.0 14 57-70 54-69 (69)
38 PF06840 DUF1241: Protein of u 32.4 29 0.00063 24.5 1.4 13 48-60 123-135 (154)
39 KOG3650|consensus 32.3 82 0.0018 20.9 3.5 39 2-43 30-68 (120)
40 PF07742 BTG: BTG family; Int 31.0 65 0.0014 21.5 2.9 19 49-67 29-47 (118)
41 PF09868 DUF2095: Uncharacteri 30.7 1E+02 0.0022 21.1 3.7 15 47-61 94-108 (128)
42 PRK10667 Hha toxicity attenuat 30.5 75 0.0016 21.5 3.1 20 43-62 65-84 (122)
43 PF06925 MGDG_synth: Monogalac 30.4 55 0.0012 22.3 2.6 27 48-74 2-28 (169)
44 cd05503 Bromo_BAZ2A_B_like Bro 29.8 1.1E+02 0.0024 19.1 3.8 36 27-62 59-97 (97)
45 cd00520 RRF Ribosome recycling 29.2 1.5E+02 0.0033 20.9 4.7 40 17-56 93-137 (179)
46 PRK10425 DNase TatD; Provision 29.2 1.5E+02 0.0033 22.0 5.0 46 27-72 103-157 (258)
47 cd05498 Bromo_Brdt_II_like Bro 28.8 1.2E+02 0.0025 19.0 3.8 36 27-62 64-102 (102)
48 PRK00083 frr ribosome recyclin 28.6 1.2E+02 0.0027 21.6 4.2 43 14-56 94-141 (185)
49 PF13798 PCYCGC: Protein of un 28.4 22 0.00047 25.3 0.3 33 30-63 114-146 (158)
50 cd03545 Rieske_RO_Alpha_OHBDO_ 28.1 1.8E+02 0.0039 19.8 4.8 44 52-100 11-56 (150)
51 PF02283 CobU: Cobinamide kina 27.8 68 0.0015 22.4 2.7 24 47-71 36-59 (167)
52 PF03701 UPF0181: Uncharacteri 27.6 1.3E+02 0.0028 17.4 4.7 33 25-57 8-44 (51)
53 TIGR03849 arch_ComA phosphosul 27.5 1.5E+02 0.0034 22.2 4.7 46 22-70 94-153 (237)
54 cd01579 AcnA_Bact_Swivel Bacte 27.2 42 0.00092 22.5 1.5 17 62-78 46-62 (121)
55 PF06884 DUF1264: Protein of u 26.9 45 0.00098 23.9 1.7 19 50-68 98-116 (171)
56 PRK00394 transcription factor; 26.9 90 0.0019 22.2 3.2 25 77-101 120-148 (179)
57 PF01743 PolyA_pol: Poly A pol 26.5 1.8E+02 0.0039 19.0 4.5 37 51-89 28-64 (126)
58 COG1352 CheR Methylase of chem 26.1 82 0.0018 23.9 3.0 49 17-68 129-178 (268)
59 KOG4194|consensus 25.7 39 0.00084 29.5 1.3 23 63-85 586-608 (873)
60 KOG1569|consensus 25.7 1.4E+02 0.0031 23.4 4.3 28 48-75 226-261 (323)
61 PF09570 RE_SinI: SinI restric 25.7 1.1E+02 0.0023 22.9 3.4 50 23-72 75-137 (221)
62 PF09477 Type_III_YscG: Bacter 25.7 1.4E+02 0.0031 20.0 3.8 32 25-56 1-34 (116)
63 PF10757 YbaJ: Biofilm formati 25.0 91 0.002 21.2 2.8 20 43-62 65-84 (122)
64 PF10440 WIYLD: Ubiquitin-bind 24.9 72 0.0016 19.3 2.1 40 34-74 11-51 (65)
65 CHL00123 rps6 ribosomal protei 24.7 1.7E+02 0.0038 18.5 4.0 35 11-45 4-38 (97)
66 cd05392 RasGAP_Neurofibromin_l 24.6 55 0.0012 25.0 1.9 31 41-71 143-174 (323)
67 cd00197 VHS_ENTH_ANTH VHS, ENT 24.3 42 0.00091 21.4 1.1 20 47-69 35-54 (115)
68 PLN00062 TATA-box-binding prot 24.0 1.1E+02 0.0024 21.8 3.2 24 78-101 29-56 (179)
69 cd01578 AcnA_Mitochon_Swivel M 23.9 46 0.00099 23.3 1.2 18 62-79 67-84 (149)
70 COG3140 Uncharacterized protei 23.8 1.7E+02 0.0037 17.3 4.3 34 24-57 7-44 (60)
71 cd04518 TBP_archaea archaeal T 23.7 84 0.0018 22.3 2.6 25 77-101 119-147 (174)
72 PF06869 DUF1258: Protein of u 23.7 1.9E+02 0.004 22.2 4.5 49 18-66 122-176 (258)
73 KOG3938|consensus 23.3 1.4E+02 0.0031 23.3 3.9 23 23-45 250-272 (334)
74 PRK13293 F420-0--gamma-glutamy 23.2 76 0.0017 24.0 2.4 29 49-77 127-155 (245)
75 cd00652 TBP_TLF TATA box bindi 23.1 1.2E+02 0.0027 21.3 3.4 26 76-101 119-148 (174)
76 PF10302 DUF2407: DUF2407 ubiq 23.1 40 0.00088 21.7 0.8 8 65-72 89-96 (97)
77 PF01726 LexA_DNA_bind: LexA D 23.0 1.6E+02 0.0036 17.3 3.4 28 27-54 5-35 (65)
78 PF01545 Cation_efflux: Cation 22.4 96 0.0021 22.6 2.8 22 49-70 255-276 (284)
79 COG1498 SIK1 Protein implicate 22.1 1.1E+02 0.0024 24.8 3.2 41 20-60 199-239 (395)
80 PF07062 Clc-like: Clc-like; 22.0 75 0.0016 23.4 2.1 19 62-80 23-41 (211)
81 PF04622 ERG2_Sigma1R: ERG2 an 21.8 1.6E+02 0.0034 21.8 3.8 34 31-64 34-68 (216)
82 PF02806 Alpha-amylase_C: Alph 21.8 1E+02 0.0022 18.7 2.4 38 63-100 46-93 (95)
83 PF08908 DUF1852: Domain of un 21.4 51 0.0011 25.6 1.1 26 54-79 94-120 (322)
84 PF14326 DUF4384: Domain of un 21.1 2E+02 0.0043 17.3 3.6 30 69-98 4-36 (83)
85 cd04516 TBP_eukaryotes eukaryo 21.1 1.4E+02 0.003 21.2 3.2 26 76-101 118-147 (174)
86 PRK13888 conjugal transfer pro 21.0 29 0.00064 25.6 -0.2 19 51-69 91-109 (206)
87 PF15571 Imm25: Immunity prote 20.7 60 0.0013 22.1 1.2 22 48-69 18-39 (124)
88 TIGR01478 STEVOR variant surfa 20.6 1.2E+02 0.0026 23.6 3.0 23 23-45 54-76 (295)
89 PF13565 HTH_32: Homeodomain-l 20.2 1.9E+02 0.0041 16.6 3.9 35 26-62 32-66 (77)
No 1
>PTZ00059 dynein light chain; Provisional
Probab=100.00 E-value=8.1e-42 Score=220.43 Aligned_cols=89 Identities=52% Similarity=0.961 Sum_probs=86.1
Q ss_pred CCCCCCceEEeCCCCHHHHHHHHHHHHHHHhhCCChHHHHHHHHHhccccCCCceEEEEeCCceeeEEecCCcEEEEEeC
Q psy10106 12 PKDDKAPLIKSTDMPNDMQQEVITVAKAAYERCNNFADLAAYIKKEFENKYGPAWHCVVGQGFGSFVTHDRSSFIYLYIG 91 (101)
Q Consensus 12 m~~~~~~~I~~sdM~~~mq~~~~~~a~~a~~~~~~~kdiA~~IK~~lD~~yg~~WhcIVG~~Fgs~vth~~~~~i~F~~~ 91 (101)
|++++ ++|+.+|||++||++|+++|.+|++++++++|||++||++||++|||+||||||++|||++||++++||||+++
T Consensus 2 m~~~~-~~i~~~dM~~emq~~a~~~~~~Al~~~~~~kdiA~~IK~~fD~~yg~~WhciVG~~Fgs~vthe~~~~i~F~~~ 80 (90)
T PTZ00059 2 MSDRK-AVVKNADMSEDMQQDAIDCANQALEKFNIEKDIAAYIKKEFDKKYNPTWHCIVGRNFGSYVTHETKHFIYFYLG 80 (90)
T ss_pred CCCCc-cEEEECCCCHHHHHHHHHHHHHHHHHcCchHHHHHHHHHHHHhhcCCCCEEEEecCeeEEEEEeCCcEEEEEEC
Confidence 66655 89999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CEEEEEEecC
Q psy10106 92 MDGILLFRHS 101 (101)
Q Consensus 92 ~~~illfKt~ 101 (101)
+++|||||+|
T Consensus 81 ~~~vLlfK~~ 90 (90)
T PTZ00059 81 QVAILLFKSG 90 (90)
T ss_pred CEEEEEEecC
Confidence 9999999997
No 2
>KOG3430|consensus
Probab=100.00 E-value=6.9e-42 Score=218.45 Aligned_cols=89 Identities=53% Similarity=1.008 Sum_probs=85.3
Q ss_pred CCCCCCceEEeCCCCHHHHHHHHHHHHHHHhhCC-ChHHHHHHHHHhccccCCCceEEEEeCCceeeEEecCCcEEEEEe
Q psy10106 12 PKDDKAPLIKSTDMPNDMQQEVITVAKAAYERCN-NFADLAAYIKKEFENKYGPAWHCVVGQGFGSFVTHDRSSFIYLYI 90 (101)
Q Consensus 12 m~~~~~~~I~~sdM~~~mq~~~~~~a~~a~~~~~-~~kdiA~~IK~~lD~~yg~~WhcIVG~~Fgs~vth~~~~~i~F~~ 90 (101)
|.+.. ++|+.+|||++||++|+++|..|+++|+ ++++||..||++||++||++||||||++|||+|||++++||||++
T Consensus 1 ~~~~~-~~vk~tDM~~~mq~~a~~~a~~al~~f~~~~k~iA~~iKkefDkkyG~~WhcivG~~FGs~vThe~g~Fiyf~~ 79 (90)
T KOG3430|consen 1 MLERK-AVVKATDMPEEMQQEAIELARQALEKFNVIEKDIAAFIKKEFDKKYGPTWHCIVGRNFGSYVTHETGHFIYFYL 79 (90)
T ss_pred CCCcc-ceEecCCCChHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHhhhcCCccEEEEcCCcceEEEeecCcEEEEEe
Confidence 45555 9999999999999999999999999999 789999999999999999999999999999999999999999999
Q ss_pred CCEEEEEEecC
Q psy10106 91 GMDGILLFRHS 101 (101)
Q Consensus 91 ~~~~illfKt~ 101 (101)
|.++|||||++
T Consensus 80 g~l~illfK~~ 90 (90)
T KOG3430|consen 80 GVLAILLFKCA 90 (90)
T ss_pred ceEEEEEEecC
Confidence 99999999985
No 3
>PF01221 Dynein_light: Dynein light chain type 1 ; InterPro: IPR001372 Dynein is a multisubunit microtubule-dependent motor enzyme that acts as the force generating protein of eukaryotic cilia and flagella. The cytoplasmic isoform of dynein acts as a motor for the intracellular retrograde motility of vesicles and organelles along microtubules. Dynein is composed of a number of ATP-binding large subunits (see IPR004273 from INTERPRO), intermediate size subunits and small subunits. Among the small subunits, there is a family of highly conserved proteins which make up this family [, ]. Both type 1 (DLC1) and 2 (DLC2) dynein light chains have a similar two-layer alpha-beta core structure consisting of beta-alpha(2)-beta-X-beta(2) [, ].; GO: 0007017 microtubule-based process, 0005875 microtubule associated complex; PDB: 1F95_A 1F96_A 1F3C_A 3P8M_B 2XQQ_C 1RE6_A 1CMI_A 1PWK_A 1PWJ_A 4DS1_C ....
Probab=100.00 E-value=1.2e-40 Score=213.91 Aligned_cols=89 Identities=44% Similarity=0.884 Sum_probs=83.9
Q ss_pred CCCCCCceEEeCCCCHHHHHHHHHHHHHHHhhCCChHHHHHHHHHhccccCCCceEEEEeCCceeeEEecCCcEEEEEeC
Q psy10106 12 PKDDKAPLIKSTDMPNDMQQEVITVAKAAYERCNNFADLAAYIKKEFENKYGPAWHCVVGQGFGSFVTHDRSSFIYLYIG 91 (101)
Q Consensus 12 m~~~~~~~I~~sdM~~~mq~~~~~~a~~a~~~~~~~kdiA~~IK~~lD~~yg~~WhcIVG~~Fgs~vth~~~~~i~F~~~ 91 (101)
|++.+ ++|+.+|||++||++|+++|.+|++++++++|+|++||++||++|||+||||||++|||++||+++++|||+++
T Consensus 1 m~~~~-~~i~~~dM~~~~~~~~~~~~~~a~~~~~~~~eiA~~iK~~lD~~yG~~Wh~IVG~~Fg~~~th~~~~~~~f~~~ 79 (89)
T PF01221_consen 1 MSENK-IVIKSSDMPEEMQEEAIELAKEALKKYQDEKEIAEFIKQELDKKYGPTWHCIVGKSFGSSVTHEPGTFLYFKIG 79 (89)
T ss_dssp SGSCS-EEEEEEES-HHHHHHHHHHHHHHHHHCSSHHHHHHHHHHHHHHHHSS-EEEEEESEEEEEEEEETTEEEEEEET
T ss_pred CCCCc-cEEEECCCCHHHHHHHHHHHHHHHHHCCcHHHHHHHHHHHHhcccCCceEEEECCcEEEEEEEcCCcEEEEEEC
Confidence 66666 99999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CEEEEEEecC
Q psy10106 92 MDGILLFRHS 101 (101)
Q Consensus 92 ~~~illfKt~ 101 (101)
+++||||||+
T Consensus 80 ~~~~li~kt~ 89 (89)
T PF01221_consen 80 NIAFLIFKTQ 89 (89)
T ss_dssp TEEEEEEEE-
T ss_pred CEEEEEEecC
Confidence 9999999996
No 4
>PLN03058 dynein light chain type 1 family protein; Provisional
Probab=100.00 E-value=3e-39 Score=219.81 Aligned_cols=86 Identities=38% Similarity=0.738 Sum_probs=82.7
Q ss_pred CCceEEeCCCCHHHHHHHHHHHHHHHhhCC---ChHHHHHHHHHhccccCCCceEEEEeCCceeeEEecCCcEEEEEeCC
Q psy10106 16 KAPLIKSTDMPNDMQQEVITVAKAAYERCN---NFADLAAYIKKEFENKYGPAWHCVVGQGFGSFVTHDRSSFIYLYIGM 92 (101)
Q Consensus 16 ~~~~I~~sdM~~~mq~~~~~~a~~a~~~~~---~~kdiA~~IK~~lD~~yg~~WhcIVG~~Fgs~vth~~~~~i~F~~~~ 92 (101)
.+++|+.+|||++||++|+++|.+|+++++ +++|||.+||++||++|||+||||||++|||+|||++++||||++|+
T Consensus 34 ~dv~Ik~sDM~~emQ~~ave~a~~Al~k~~~~~~ekdIA~~IKk~fDkkYG~tWHCIVGk~FGs~VTHe~~~fIyF~ig~ 113 (128)
T PLN03058 34 LNVRVRASDMPLVLQNRAFSCARDILDAMPGKLDSKRLALALKKEFDSAYGPAWHCIVGTSFGSYVTHSTGGFLYFSIDK 113 (128)
T ss_pred CCCEEEECCCCHHHHHHHHHHHHHHHHHccccCCHHHHHHHHHHHHhhhhCCceEEEECCcEEEEEEEcCCcEEEEEECC
Confidence 459999999999999999999999999974 67999999999999999999999999999999999999999999999
Q ss_pred EEEEEEecC
Q psy10106 93 DGILLFRHS 101 (101)
Q Consensus 93 ~~illfKt~ 101 (101)
++|||||||
T Consensus 114 ~aiLLfKt~ 122 (128)
T PLN03058 114 VYILLFKTA 122 (128)
T ss_pred EEEEEEecc
Confidence 999999996
No 5
>PF04155 Ground-like: Ground-like domain; InterPro: IPR007284 This group of proteins contain one or more copies of the ground-like domain, which are specific to Caenorhabditis elegans and Caenorhabditis briggsae. It has been proposed that the ground-like domain containing proteins may bind and modulate the activity of Patched-like membrane molecules, reminiscent of the modulating activities of neuropeptides [].
Probab=97.69 E-value=0.00064 Score=42.08 Aligned_cols=54 Identities=17% Similarity=0.218 Sum_probs=46.6
Q ss_pred CChHHHHHHHHHhccccCCCceEEEEeC-CceeeEEecCCcEEEEEeCCEEEEEEe
Q psy10106 45 NNFADLAAYIKKEFENKYGPAWHCVVGQ-GFGSFVTHDRSSFIYLYIGMDGILLFR 99 (101)
Q Consensus 45 ~~~kdiA~~IK~~lD~~yg~~WhcIVG~-~Fgs~vth~~~~~i~F~~~~~~illfK 99 (101)
.+..++++.|.+.++++||+.+-||+++ +|....... ..|--...++...++|+
T Consensus 22 ~~~~~s~~~Iq~~~e~~f~~~f~vIcs~~~Fsy~~~~~-~~~C~~~~~g~~c~af~ 76 (76)
T PF04155_consen 22 CNLSISKRAIQKAAEKRFGGSFEVICSEGDFSYSTHTD-DLYCKVEKNGVTCLAFA 76 (76)
T ss_pred CCHHHHHHHHHHHHHHHhCCCEEEEEeCCCceeEEecc-cceeeeeeCCEEEEEEC
Confidence 5567899999999999999999999999 676665555 78888899999999996
No 6
>PF05075 DUF684: Protein of unknown function (DUF684); InterPro: IPR007767 This family contains uncharacterised proteins from Caenorhabditis elegans.
Probab=91.17 E-value=2 Score=33.46 Aligned_cols=54 Identities=19% Similarity=0.273 Sum_probs=38.4
Q ss_pred hHHHHHHHHHhccccCC-CceEEEEeC--------CceeeEEecCCcEEE-EEeCCEEEEEEec
Q psy10106 47 FADLAAYIKKEFENKYG-PAWHCVVGQ--------GFGSFVTHDRSSFIY-LYIGMDGILLFRH 100 (101)
Q Consensus 47 ~kdiA~~IK~~lD~~yg-~~WhcIVG~--------~Fgs~vth~~~~~i~-F~~~~~~illfKt 100 (101)
..+.|..||+.||+-.. -.+-+||-. .|.....+..+++|. |.-|+-.++||||
T Consensus 188 n~eKAd~Ik~~Le~ilTnDsFYIiVfd~~~~~~~~~~y~~~~~~~dq~I~s~~rGgcNv~VYRS 251 (345)
T PF05075_consen 188 NEEKADEIKKKLEKILTNDSFYIIVFDDCSGYDNHYYYGFYDNNEDQYIESFNRGGCNVFVYRS 251 (345)
T ss_pred hHHHHHHHHHHHHhhCCCCcEEEEEecccccCCccceeeeccCcccCEEEEEeCCCeEEEEEee
Confidence 46899999999999653 368888822 222222456667766 5678999999998
No 7
>PF15650 Tox-REase-9: Restriction endonuclease fold toxin 9
Probab=79.65 E-value=1.5 Score=28.17 Aligned_cols=17 Identities=35% Similarity=0.770 Sum_probs=14.8
Q ss_pred HHHhccccCCCceEEEE
Q psy10106 54 IKKEFENKYGPAWHCVV 70 (101)
Q Consensus 54 IK~~lD~~yg~~WhcIV 70 (101)
-|++|+..||.+|.|||
T Consensus 70 Y~~el~~~~G~~W~~~l 86 (89)
T PF15650_consen 70 YKQELEKIYGGGWKTRL 86 (89)
T ss_pred HHHHhcCccCCCeeEEe
Confidence 35789999999999997
No 8
>PF10703 MoaF: Molybdenum cofactor biosynthesis protein F; InterPro: IPR024724 Molybdenum cofactor biosynthesis protein F (MoaF) is essential for the production of the monoamine-inducible 30kDa protein in Klebsiella []. It is necessary for reconstituting organoautotrophic growth in Ralstonia eutropha []. MoaF is conserved in proteobacteria and some lower eukaryotes. The operon regulating the Moa genes is responsible for molybdenum cofactor biosynthesis.
Probab=78.87 E-value=3.8 Score=31.25 Aligned_cols=32 Identities=28% Similarity=0.489 Sum_probs=25.2
Q ss_pred ceEEEEeCCceeeEEecCCcEEEEEeC-CEEEEEEe
Q psy10106 65 AWHCVVGQGFGSFVTHDRSSFIYLYIG-MDGILLFR 99 (101)
Q Consensus 65 ~WhcIVG~~Fgs~vth~~~~~i~F~~~-~~~illfK 99 (101)
+|||+.|-+=|-. +.+.+-++++. ++.++.||
T Consensus 182 ~W~CL~G~e~Gla---D~D~c~~~Ki~d~lYlf~Wr 214 (265)
T PF10703_consen 182 AWQCLSGVEKGLA---DTDRCHYYKIADNLYLFTWR 214 (265)
T ss_pred EEEEeeccccCCC---CccceEEEEecCCEEEEEEE
Confidence 6999999977754 45778888885 78888886
No 9
>PF12006 DUF3500: Protein of unknown function (DUF3500); InterPro: IPR021889 This family of proteins is functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 335 to 438 amino acids in length. This protein has a conserved GHH sequence motif. This protein has two completely conserved G residues that may be functionally important.
Probab=75.16 E-value=22 Score=27.57 Aligned_cols=44 Identities=18% Similarity=0.261 Sum_probs=33.5
Q ss_pred eEEeCCCCHHHHHHHHHHHHHHHhhCCCh--HHHHHHH-HHhccccC
Q psy10106 19 LIKSTDMPNDMQQEVITVAKAAYERCNNF--ADLAAYI-KKEFENKY 62 (101)
Q Consensus 19 ~I~~sdM~~~mq~~~~~~a~~a~~~~~~~--kdiA~~I-K~~lD~~y 62 (101)
=|..+||+.+.|+.+..++..-+..++.+ ++.-+.| +..||+.|
T Consensus 215 Gl~~s~Lt~~Qq~ll~~li~~y~~~~~~~~a~~~~~~i~~~~ld~t~ 261 (313)
T PF12006_consen 215 GLAVSELTADQQELLLALIKEYLGRLPEEDAAERMAEIEEAGLDETY 261 (313)
T ss_pred CcChhhCCHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHhccccceE
Confidence 37889999999999999999998888864 2223344 67777765
No 10
>PF12652 CotJB: CotJB protein; InterPro: IPR024207 The cotJ operon proteins affect spore coat composition, and is controlled by sigma E. The genes, which include CotJB, are either required for the normal formation of the inner layers of the coat or are themselves structural components of the coat []. CotJB has been identified as a spore coat protein [].
Probab=67.71 E-value=3.1 Score=26.01 Aligned_cols=16 Identities=44% Similarity=0.864 Sum_probs=12.9
Q ss_pred HHHHHHHhccccCCCc
Q psy10106 50 LAAYIKKEFENKYGPA 65 (101)
Q Consensus 50 iA~~IK~~lD~~yg~~ 65 (101)
.-+.+++.+.++|||-
T Consensus 41 ~~~~l~~~Ye~~yGPL 56 (78)
T PF12652_consen 41 QRKQLKKEYEKRYGPL 56 (78)
T ss_pred HHHHHHHHHHHHhCCC
Confidence 4466889999999984
No 11
>COG1570 XseA Exonuclease VII, large subunit [DNA replication, recombination, and repair]
Probab=63.76 E-value=15 Score=29.96 Aligned_cols=49 Identities=29% Similarity=0.520 Sum_probs=36.0
Q ss_pred HHHHHHHHHhccccCCCceEEEEeCCceeeEEecCCcEEEEEeC----CEEEEEEec
Q psy10106 48 ADLAAYIKKEFENKYGPAWHCVVGQGFGSFVTHDRSSFIYLYIG----MDGILLFRH 100 (101)
Q Consensus 48 kdiA~~IK~~lD~~yg~~WhcIVG~~Fgs~vth~~~~~i~F~~~----~~~illfKt 100 (101)
.++..+||..||..+|..| |-|+ =|.+|+-.....||.+. .+...+|+.
T Consensus 9 Seln~~ik~llE~~~~~V~--v~GE--ISn~t~~~sgH~YFtLKD~~A~i~c~mf~~ 61 (440)
T COG1570 9 SELNDYIKRLLERDLGQVW--VRGE--ISNFTRPASGHLYFTLKDERAQIRCVMFKG 61 (440)
T ss_pred HHHHHHHHHHHHhcCCeEE--EEEE--ecCCccCCCccEEEEEccCCceEEEEEEcC
Confidence 5799999999999999888 5566 13345334448999985 566777764
No 12
>PF13742 tRNA_anti_2: OB-fold nucleic acid binding domain
Probab=62.05 E-value=28 Score=22.09 Aligned_cols=49 Identities=24% Similarity=0.471 Sum_probs=33.9
Q ss_pred HHHHHHHHHhcccc--CCCceEEEEeCCceeeEEecCCcEEEEEeC----CEEEEEEec
Q psy10106 48 ADLAAYIKKEFENK--YGPAWHCVVGQGFGSFVTHDRSSFIYLYIG----MDGILLFRH 100 (101)
Q Consensus 48 kdiA~~IK~~lD~~--yg~~WhcIVG~~Fgs~vth~~~~~i~F~~~----~~~illfKt 100 (101)
.++.++||..++.. ++..| |.|+=-+ ++.-.+.++||.+. .+...+|++
T Consensus 5 s~l~~~ik~~le~~~~~~~vw--V~GEIs~--~~~~~~gh~YftLkD~~a~i~~~~~~~ 59 (99)
T PF13742_consen 5 SELNNYIKDLLERDPPLPNVW--VEGEISN--LKRHSSGHVYFTLKDEEASISCVIFRS 59 (99)
T ss_pred HHHHHHHHHHHhcCCCcCCEE--EEEEEee--cEECCCceEEEEEEcCCcEEEEEEEHH
Confidence 47899999999998 56777 5666211 22226777899883 566777764
No 13
>PRK00286 xseA exodeoxyribonuclease VII large subunit; Reviewed
Probab=59.35 E-value=26 Score=27.87 Aligned_cols=47 Identities=26% Similarity=0.552 Sum_probs=36.0
Q ss_pred HHHHHHHHHhccccCCCceEEEEeC--CceeeEEecCCcEEEEEeC----CEEEEEEec
Q psy10106 48 ADLAAYIKKEFENKYGPAWHCVVGQ--GFGSFVTHDRSSFIYLYIG----MDGILLFRH 100 (101)
Q Consensus 48 kdiA~~IK~~lD~~yg~~WhcIVG~--~Fgs~vth~~~~~i~F~~~----~~~illfKt 100 (101)
.++..+||..||..++..| |.|+ +|-. -..-++||.+. .+...+|++
T Consensus 9 sel~~~ik~~le~~~~~v~--v~gEis~~~~----~~sGH~Yf~Lkd~~a~i~~~~~~~ 61 (438)
T PRK00286 9 SELNRYVKSLLERDLGQVW--VRGEISNFTR----HSSGHWYFTLKDEIAQIRCVMFKG 61 (438)
T ss_pred HHHHHHHHHHHHhhCCcEE--EEEEeCCCee----CCCCeEEEEEEcCCcEEEEEEEcC
Confidence 5899999999999988887 6666 5543 24557999984 578888885
No 14
>PF11858 DUF3378: Domain of unknown function (DUF3378); InterPro: IPR024568 Ribonuclease HIII 3.1.26 from EC cleaves RNA from DNA-RNA hybrids. It catalyses endonucleolytic cleavage to 5'-phospho-monoesters. Two types of ribonuclease H in Bacillus subtilis, RNase HII (rnhB) and RNase HIII (rnhC), are both known experimentally and are quite similar to each other []. The only RNase H homologue in the Mycoplasmas resembles rnhC. Archaeal forms resemble HII more closely than HIII. This entry represents the uncharacterised N-terminal domain of bacterial RNase HIII.; PDB: 3ASM_A 2D0C_A 2D0B_A 2D0A_A.
Probab=56.17 E-value=18 Score=22.58 Aligned_cols=20 Identities=5% Similarity=-0.062 Sum_probs=16.3
Q ss_pred CCcEEEEEeCCEEEEEEecC
Q psy10106 82 RSSFIYLYIGMDGILLFRHS 101 (101)
Q Consensus 82 ~~~~i~F~~~~~~illfKt~ 101 (101)
++..+.++.++..|.+|+||
T Consensus 29 p~~~f~aK~~~~tIt~Y~SG 48 (81)
T PF11858_consen 29 PYAVFQAKYNGVTITAYKSG 48 (81)
T ss_dssp TTEEEEEEETTEEEEEETTS
T ss_pred CCEEEEEeCCCeEEEEEeCC
Confidence 45556678899999999997
No 15
>KOG2451|consensus
Probab=53.04 E-value=29 Score=28.43 Aligned_cols=42 Identities=24% Similarity=0.317 Sum_probs=33.8
Q ss_pred ceEEeCCCCHHHHHHHHHHHHHHHhhCCC--hHHHHHHHHHhcc
Q psy10106 18 PLIKSTDMPNDMQQEVITVAKAAYERCNN--FADLAAYIKKEFE 59 (101)
Q Consensus 18 ~~I~~sdM~~~mq~~~~~~a~~a~~~~~~--~kdiA~~IK~~lD 59 (101)
+.=...||+-+.-+++++.|.+|++.+.+ -+|-++.|++|.|
T Consensus 51 ii~~V~~~~V~e~~kAI~aA~EaF~s~~~~takeRs~lLrkwy~ 94 (503)
T KOG2451|consen 51 IIGKVADMTVEEAEKAIDAAYEAFKSYRNLTAKERSALLRKWYE 94 (503)
T ss_pred hhhcccCCcHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHHH
Confidence 55578999999999999999999999875 3566666666654
No 16
>PF05184 SapB_1: Saposin-like type B, region 1; InterPro: IPR007856 Synonym(s):cerebroside sulphate activator, CSAct Saposin B is a small non-enzymatic glycoprotein required for the breakdown of cerebroside sulphates (sulphatides) in lysosomes. Saposin B contains three intramolecular disulphide bridges, exists as a dimer and is remarkably heat, protease, and pH stable. The crystal structure of human saposin B reveals an unusual shell-like dimer consisting of a monolayer of alpha-helices enclosing a large hydrophobic cavity. Although the secondary structure of saposin B is similar to that of the known monomeric members of the saposin-like superfamily, the helices are repacked into a different tertiary arrangement to form the homodimer. A comparison of the two forms of the saposin B dimer suggests that extraction of target lipids from membranes involves a conformational change that facilitates access to the inner cavity [].; GO: 0006629 lipid metabolic process; PDB: 1N69_C 1QDM_C 4DDJ_A 2DOB_A 1OF9_A 2Z9A_A 1M12_A 2GTG_A 1SN6_A 2QYP_B ....
Probab=52.16 E-value=32 Score=17.58 Aligned_cols=29 Identities=7% Similarity=0.094 Sum_probs=22.9
Q ss_pred HHHHHHHHHHHHHhhCCChHHHHHHHHHh
Q psy10106 29 MQQEVITVAKAAYERCNNFADLAAYIKKE 57 (101)
Q Consensus 29 mq~~~~~~a~~a~~~~~~~kdiA~~IK~~ 57 (101)
.-+.+++.+...++...++.+|.+.|.+.
T Consensus 6 ~C~~~v~~i~~~l~~~~t~~~I~~~l~~~ 34 (39)
T PF05184_consen 6 ICKFVVKEIEKLLKNNKTEEEIKKALEKA 34 (39)
T ss_dssp HHHHHHHHHHHHHHSTCHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHcCccHHHHHHHHHHH
Confidence 45678888888888777788888888764
No 17
>PF06150 ChaB: ChaB; InterPro: IPR009317 This family of proteins contain a conserved 60 residue region. This protein is known as ChaB in Escherichia coli and is found next to ChaA, which is a cation transporter protein. ChaB may be regulate ChaA function in some way.; PDB: 1SG7_A.
Probab=51.70 E-value=47 Score=19.35 Aligned_cols=44 Identities=20% Similarity=0.396 Sum_probs=28.9
Q ss_pred CCCCHHHHHHHHHHHHHHHhhCCChH---HHH-HHHHHhccccCCCceE
Q psy10106 23 TDMPNDMQQEVITVAKAAYERCNNFA---DLA-AYIKKEFENKYGPAWH 67 (101)
Q Consensus 23 sdM~~~mq~~~~~~a~~a~~~~~~~k---diA-~~IK~~lD~~yg~~Wh 67 (101)
..||..-|+--++....|++.++++. .+| ..+|+.+.+ -++.|.
T Consensus 7 ~~LP~~Aq~if~~afn~a~~~~~de~~A~~vAw~AVk~~Y~k-~~g~W~ 54 (57)
T PF06150_consen 7 EHLPEHAQRIFRKAFNSAWEEYGDEERAHRVAWAAVKRKYEK-VNGRWV 54 (57)
T ss_dssp TT--SHHHHHHHHHHHHHHHH--SHHHHHHHHHHHHHHHEEE-SSS-EE
T ss_pred hHCCHHHHHHHHHHHHHHHHhcCCHhHHHHHHHHHHHHHhee-cCCEee
Confidence 46777777777777888889997752 133 678999999 677785
No 18
>PF08776 VASP_tetra: VASP tetramerisation domain; InterPro: IPR014885 Vasodilator-stimulated phosphoprotein (VASP) is an actin cytoskeletal regulatory protein. This region corresponds to the tetramerisation domain which forms a right handed alpha helical coiled coil structure []. ; PDB: 1USE_A 1USD_A.
Probab=47.51 E-value=31 Score=18.97 Aligned_cols=32 Identities=13% Similarity=0.413 Sum_probs=24.3
Q ss_pred HHHHHHHHHHHHHHHhhCCChHHHHHHHHHhccc
Q psy10106 27 NDMQQEVITVAKAAYERCNNFADLAAYIKKEFEN 60 (101)
Q Consensus 27 ~~mq~~~~~~a~~a~~~~~~~kdiA~~IK~~lD~ 60 (101)
+.+.++|++.++.-+++.+ .+|-..|+++|.+
T Consensus 6 e~~KqEIL~EvrkEl~K~K--~EIIeA~~~eL~r 37 (40)
T PF08776_consen 6 ERLKQEILEEVRKELQKVK--EEIIEAIRQELSR 37 (40)
T ss_dssp HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHhc
Confidence 3578888888888888776 4577778888765
No 19
>PF12362 DUF3646: DNA polymerase III gamma and tau subunits C terminal; InterPro: IPR022107 This domain family is found in bacteria, and is approximately 120 amino acids in length. The family is found in association with PF00004 from PFAM. The proteins in this family are frequently annotated as the gamma and tau subunits of DNA polymerase III, however there is little accompanying literature to back this up.
Probab=46.89 E-value=25 Score=23.49 Aligned_cols=29 Identities=21% Similarity=0.376 Sum_probs=25.7
Q ss_pred hHHHHHHHHHhccccCCCceEEEEeCCce
Q psy10106 47 FADLAAYIKKEFENKYGPAWHCVVGQGFG 75 (101)
Q Consensus 47 ~kdiA~~IK~~lD~~yg~~WhcIVG~~Fg 75 (101)
.+++|..|.+.|..-.|..|.|.+.+.=|
T Consensus 48 p~dl~~~L~~~L~~wTG~rW~V~~s~~~g 76 (117)
T PF12362_consen 48 PKDLAQRLSRKLQEWTGQRWIVSLSNEPG 76 (117)
T ss_pred CHHHHHHHHHHHHHHhCCCeEEEEeCCCC
Confidence 37899999999999999999999998654
No 20
>PF00352 TBP: Transcription factor TFIID (or TATA-binding protein, TBP); InterPro: IPR000814 The TATA-box binding protein (TBP) is required for the initiation of transcription by RNA polymerases I, II and III, from promoters with or without a TATA box [, ]. TBP associates with a host of factors, including the general transcription factors TFIIA, -B, -D, -E, and -H, to form huge multi-subunit pre-initiation complexes on the core promoter. Through its association with different transcription factors, TBP can initiate transcription from different RNA polymerases. There are several related TBPs, including TBP-like (TBPL) proteins []. The C-terminal core of TBP (~180 residues) is highly conserved and contains two 77-amino acid repeats that produce a saddle-shaped structure that straddles the DNA; this region binds to the TATA box and interacts with transcription factors and regulatory proteins []. By contrast, the N-terminal region varies in both length and sequence.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0006367 transcription initiation from RNA polymerase II promoter; PDB: 1D3U_A 1PCZ_B 1AIS_A 1NGM_A 1TBP_A 1TBA_B 1YTB_A 1RM1_A 1YTF_A 1NH2_A ....
Probab=46.73 E-value=44 Score=20.57 Aligned_cols=25 Identities=16% Similarity=0.308 Sum_probs=17.8
Q ss_pred eEEecCCcE--EEEEeC--CEEEEEEecC
Q psy10106 77 FVTHDRSSF--IYLYIG--MDGILLFRHS 101 (101)
Q Consensus 77 ~vth~~~~~--i~F~~~--~~~illfKt~ 101 (101)
.+.|+++.| +.+++. ...++||+||
T Consensus 30 ~~~YePe~fpgl~~r~~~p~~t~~IF~sG 58 (86)
T PF00352_consen 30 NVEYEPERFPGLIYRLRNPKATVLIFSSG 58 (86)
T ss_dssp TEEEETTTESSEEEEETTTTEEEEEETTS
T ss_pred CcEEeeccCCeEEEeecCCcEEEEEEcCC
Confidence 346777754 556663 8899999887
No 21
>cd03472 Rieske_RO_Alpha_BPDO_like Rieske non-heme iron oxygenase (RO) family, Biphenyl dioxygenase (BPDO)-like subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; composed of the oxygenase alpha subunits of BPDO and similar proteins including cumene dioxygenase (CumDO), nitrobenzene dioxygenase (NBDO), alkylbenzene dioxygenase (AkbDO) and dibenzofuran 4,4a-dioxygenase (DFDO). ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. BPDO degrades biphenyls and polychlorinated biphenyls (PCB's) while CumDO degrades cumene (isopropylbenzene), an aromatic hydrocarbon that is i
Probab=46.07 E-value=41 Score=22.34 Aligned_cols=38 Identities=16% Similarity=0.482 Sum_probs=26.0
Q ss_pred ccccCCCceEEEEeC-CceeeEEecCCcEEEEEeCCEEEEEEec
Q psy10106 58 FENKYGPAWHCVVGQ-GFGSFVTHDRSSFIYLYIGMDGILLFRH 100 (101)
Q Consensus 58 lD~~yg~~WhcIVG~-~Fgs~vth~~~~~i~F~~~~~~illfKt 100 (101)
|++-+...|+.|-=. .+. +.+.+..+.+++..|+|+|.
T Consensus 1 ~~~i~~~~W~~v~~~~el~-----~~g~~~~~~~~~~~i~l~r~ 39 (128)
T cd03472 1 LERVFARSWLLLGHETHIP-----KAGDYLTTYMGEDPVIVVRQ 39 (128)
T ss_pred CcchhhCCCeEeEEHHHCC-----CCCCEEEEEECCceEEEEEC
Confidence 456678889986544 331 34666677788888888874
No 22
>PF08958 DUF1871: Domain of unknown function (DUF1871); InterPro: IPR015053 This set of hypothetical proteins is produced by prokaryotes pertaining to the Bacillus genus. ; PDB: 1U84_A.
Probab=45.10 E-value=23 Score=22.09 Aligned_cols=29 Identities=24% Similarity=0.273 Sum_probs=21.3
Q ss_pred HHHHHHhhCCChHHHHHHHHHhccccCCC
Q psy10106 36 VAKAAYERCNNFADLAAYIKKEFENKYGP 64 (101)
Q Consensus 36 ~a~~a~~~~~~~kdiA~~IK~~lD~~yg~ 64 (101)
.+..++....+..++|+.|+.-|...||.
T Consensus 26 ~Iv~~v~~~~~~~~LA~~Iq~If~~SF~e 54 (79)
T PF08958_consen 26 DIVQAVHENDDPEELAKKIQSIFEFSFGE 54 (79)
T ss_dssp HHHHHHTT-S-HHHHHHHHHHHHHHHHSS
T ss_pred HHHHHHHhCCCHHHHHHHHHHHHHHHHcc
Confidence 34556677777789999999999888874
No 23
>PRK13007 succinyl-diaminopimelate desuccinylase; Reviewed
Probab=43.25 E-value=1.4e+02 Score=22.39 Aligned_cols=37 Identities=14% Similarity=0.161 Sum_probs=28.6
Q ss_pred CCHHHHHHHHHHHHHHHhhCC---ChHHHHHHHHHhcccc
Q psy10106 25 MPNDMQQEVITVAKAAYERCN---NFADLAAYIKKEFENK 61 (101)
Q Consensus 25 M~~~mq~~~~~~a~~a~~~~~---~~kdiA~~IK~~lD~~ 61 (101)
|+.+...++++...+.++-.. ++.++|++|++.|++.
T Consensus 1 ~~~~~~~~~~~~l~~li~ips~s~~e~~~~~~l~~~l~~~ 40 (352)
T PRK13007 1 MTLDLAADLAELTAALVDIPSVSGDEKALADAVEAALRAL 40 (352)
T ss_pred CccchHHHHHHHHHHHhcCCCCCchHHHHHHHHHHHHHhC
Confidence 445566788888888876543 4678999999999985
No 24
>TIGR00237 xseA exodeoxyribonuclease VII, large subunit. This family consist of exodeoxyribonuclease VII, large subunit XseA which catalyses exonucleolytic cleavage in either the 5'-3' or 3'-5' direction to yield 5'-phosphomononucleotides. Exonuclease VII consists of one large subunit and four small subunits.
Probab=40.95 E-value=97 Score=24.90 Aligned_cols=47 Identities=26% Similarity=0.461 Sum_probs=36.0
Q ss_pred HHHHHHHHHhccccCCCceEEEEeC--CceeeEEecCCcEEEEEeC----CEEEEEEec
Q psy10106 48 ADLAAYIKKEFENKYGPAWHCVVGQ--GFGSFVTHDRSSFIYLYIG----MDGILLFRH 100 (101)
Q Consensus 48 kdiA~~IK~~lD~~yg~~WhcIVG~--~Fgs~vth~~~~~i~F~~~----~~~illfKt 100 (101)
.++..+||..||..++..| |.|+ ||-. | ..-++||.+. .+...+|++
T Consensus 3 sel~~~ik~~le~~~~~v~--V~GEisn~~~---~-~sGH~YFtLkD~~a~i~~vmf~~ 55 (432)
T TIGR00237 3 SELNAQIKALLEATFLQVW--IQGEISNFTQ---P-VSGHWYFTLKDENAQVRCVMFRG 55 (432)
T ss_pred HHHHHHHHHHHHhhCCcEE--EEEEecCCee---C-CCceEEEEEEcCCcEEEEEEEcC
Confidence 4788999999999899888 5676 6652 3 3446899984 678888886
No 25
>PF14900 DUF4493: Domain of unknown function (DUF4493)
Probab=40.61 E-value=69 Score=23.19 Aligned_cols=37 Identities=30% Similarity=0.533 Sum_probs=28.6
Q ss_pred HhccccCCCceEEEEeCCceeeEEe--cCC---cEEEEEeCC
Q psy10106 56 KEFENKYGPAWHCVVGQGFGSFVTH--DRS---SFIYLYIGM 92 (101)
Q Consensus 56 ~~lD~~yg~~WhcIVG~~Fgs~vth--~~~---~~i~F~~~~ 92 (101)
..|.+.|+..|++-|...-+..+++ ... .-.||..+.
T Consensus 119 ~~f~~~f~~~y~vtV~~~~~~~~~~~~~~~~~~~~~Yf~~~~ 160 (235)
T PF14900_consen 119 DEFKKYFGSDYSVTVSTGAGGSVTFNKDETTSDRSAYFKAGE 160 (235)
T ss_pred HHHHhhhccceEEEEEccCCccEEEeeccCCCCcceEEECCC
Confidence 4466667777999999865777777 555 889999987
No 26
>TIGR02084 leud 3-isopropylmalate dehydratase, small subunit. Several pairs of archaeal proteins resemble the leuC and leuD pair in length and sequence but even more closely resemble the respective domains of homoaconitase, and their identity is uncertain. The members of the seed for this model are those sequences which are gene clustered with other genes involved in leucine biosynthesis and include some archaea.
Probab=40.20 E-value=27 Score=24.45 Aligned_cols=34 Identities=21% Similarity=0.248 Sum_probs=22.7
Q ss_pred ChHHHHHHHHHhccc----cCCCceEEEEeCCceeeEE
Q psy10106 46 NFADLAAYIKKEFEN----KYGPAWHCVVGQGFGSFVT 79 (101)
Q Consensus 46 ~~kdiA~~IK~~lD~----~yg~~WhcIVG~~Fgs~vt 79 (101)
+..++|+++-+.+|. ++.+..-+|.|+|||+=-|
T Consensus 25 ~~~~l~~~~f~~~~p~f~~~~~~g~iiVaG~NFG~GSS 62 (156)
T TIGR02084 25 DPKELAKHCMEDLDKDFVKKVKEGDIIVAGENFGCGSS 62 (156)
T ss_pred CHHHHHhhhhccCChhHHhhcCCCCEEEccCcccCCCc
Confidence 345677777666663 4556777888889997433
No 27
>PF06457 Ectatomin: Ectatomin; InterPro: IPR009458 Ectatomin is a toxin from the venom of the ant Ectatomma tuberculatum. Ectatomin can efficiently insert into the plasma membrane, where it can form channels. Ectatomin was shown to inhibit L-type calcium currents in isolated rat cardiac myocytes []. In these cells, ectatomin induces a gradual, irreversible increase in ion leakage across the membrane, which can lead to cell death. Ectatomin is comprised of two subunits, A and B, which are homologous. The structure of ectatomin reveals that each subunit consists of two alpha helices with a connecting hinge region, which form a hairpin structure that is stabilised by disulphide bridges. A disulphide bridge between the hinge regions of the two subunits links the heterodimer together, forming a closed bundle of four helices with a left-handed twist [].; GO: 0005216 ion channel activity, 0009405 pathogenesis, 0005576 extracellular region; PDB: 1ECI_A.
Probab=36.91 E-value=16 Score=19.12 Aligned_cols=22 Identities=23% Similarity=0.523 Sum_probs=13.9
Q ss_pred HHHHHhhCCChHHHHHHHHHhccc
Q psy10106 37 AKAAYERCNNFADLAAYIKKEFEN 60 (101)
Q Consensus 37 a~~a~~~~~~~kdiA~~IK~~lD~ 60 (101)
...-.+++. -+||.+||+..|+
T Consensus 13 ~~~~akkc~--g~iat~ik~~c~k 34 (34)
T PF06457_consen 13 VKPWAKKCS--GSIATMIKRKCDK 34 (34)
T ss_dssp HHHHHCTBS--CCHHHHHHHHCH-
T ss_pred cHHHHHHcc--ccHHHHHHHHhCC
Confidence 333334444 4699999998764
No 28
>cd03538 Rieske_RO_Alpha_AntDO Rieske non-heme iron oxygenase (RO) family, Anthranilate 1,2-dioxygenase (AntDO) subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. AntDO converts anthranilate to catechol, a naturally occurring compound formed through tryptophan degradation and an important intermediate in the metabolism of many N-heterocyclic compounds such as indole, o-nitrobenzoate, carbazole, and quinaldine.
Probab=36.75 E-value=91 Score=21.13 Aligned_cols=44 Identities=25% Similarity=0.453 Sum_probs=30.7
Q ss_pred HHHHHhccccCCCceEEEEeC-CceeeEEecCCcEEEEEeCCEEEEEEec
Q psy10106 52 AYIKKEFENKYGPAWHCVVGQ-GFGSFVTHDRSSFIYLYIGMDGILLFRH 100 (101)
Q Consensus 52 ~~IK~~lD~~yg~~WhcIVG~-~Fgs~vth~~~~~i~F~~~~~~illfKt 100 (101)
+....+++.-+...|+.|.-. .. -+++.++-+.+++..|+|+|.
T Consensus 9 ~~~~~e~~~i~~~~W~~v~~~~el-----p~~G~~~~~~i~g~~i~v~r~ 53 (146)
T cd03538 9 EIFALEMERLFGNAWIYVGHESQV-----PNPGDYITTRIGDQPVVMVRH 53 (146)
T ss_pred HHHHHHHHHHhhcCCEEEEEHHHC-----CCCCCEEEEEECCeeEEEEEC
Confidence 344566677788899986532 22 135778888889899998884
No 29
>PF12550 GCR1_C: Transcriptional activator of glycolytic enzymes; InterPro: IPR022210 This domain family is found in eukaryotes, and is approximately 80 amino acids in length. This family is activates the transcription of glycolytic enzymes.
Probab=36.59 E-value=11 Score=23.21 Aligned_cols=12 Identities=42% Similarity=1.157 Sum_probs=9.7
Q ss_pred HhccccCCCceE
Q psy10106 56 KEFENKYGPAWH 67 (101)
Q Consensus 56 ~~lD~~yg~~Wh 67 (101)
+.|+++||..|-
T Consensus 29 ~~le~~yG~~WR 40 (81)
T PF12550_consen 29 RSLEKKYGSKWR 40 (81)
T ss_pred HHHHHHhChhhc
Confidence 357889999996
No 30
>PF08594 UPF0300: Uncharacterised protein family (UPF0300); InterPro: IPR013903 This entry of proteins appear to be specific to Schizosaccharomyces pombe (Fission yeast).
Probab=35.60 E-value=1.2e+02 Score=22.59 Aligned_cols=58 Identities=12% Similarity=0.247 Sum_probs=34.3
Q ss_pred CCHHHHHHHHHHHHHHHhhCCChHHHHHHHHHhccccCC------------------CceEEEEeCCceeeEEecC
Q psy10106 25 MPNDMQQEVITVAKAAYERCNNFADLAAYIKKEFENKYG------------------PAWHCVVGQGFGSFVTHDR 82 (101)
Q Consensus 25 M~~~mq~~~~~~a~~a~~~~~~~kdiA~~IK~~lD~~yg------------------~~WhcIVG~~Fgs~vth~~ 82 (101)
-|+.+|+.+=..+..-+++-.....-.+.-+..-.+++| -.|-||+-++|-+++-=+.
T Consensus 37 Yp~~lq~aLW~AV~~yv~~~v~~~~yt~lh~~Aa~kriG~IRmyLV~P~Diy~v~~~~~W~~I~~k~F~c~I~l~~ 112 (215)
T PF08594_consen 37 YPEFLQEALWKAVEHYVNNNVSSGGYTKLHKRAAQKRIGHIRMYLVDPHDIYSVDHSNSWIAICSKNFMCNIHLDQ 112 (215)
T ss_pred CcHHHHHHHHHHHHHHHHhhcccccchHHHHHHHHHhcCCeEEEEecccceEEecCCccEEEEecCcceEEEEecC
Confidence 455666555444444445433333334444444455555 3799999999999986443
No 31
>PF10925 DUF2680: Protein of unknown function (DUF2680); InterPro: IPR024485 Members in this family of proteins are annotated as YckD however currently no function is known.
Probab=35.53 E-value=44 Score=19.65 Aligned_cols=16 Identities=25% Similarity=0.310 Sum_probs=13.5
Q ss_pred hHHHHHHHHHhccccC
Q psy10106 47 FADLAAYIKKEFENKY 62 (101)
Q Consensus 47 ~kdiA~~IK~~lD~~y 62 (101)
-+|-|+.||+++|+++
T Consensus 34 TqeqAd~ik~~id~~~ 49 (59)
T PF10925_consen 34 TQEQADAIKKHIDQRQ 49 (59)
T ss_pred CHHHHHHHHHHHHHHH
Confidence 3688999999999875
No 32
>KOG2130|consensus
Probab=34.89 E-value=33 Score=27.34 Aligned_cols=19 Identities=21% Similarity=0.328 Sum_probs=14.9
Q ss_pred CCceEEEEeCCceeeEEec
Q psy10106 63 GPAWHCVVGQGFGSFVTHD 81 (101)
Q Consensus 63 g~~WhcIVG~~Fgs~vth~ 81 (101)
++.||||+--.....||+.
T Consensus 277 ~GWWHvVlNle~TIAiTqN 295 (407)
T KOG2130|consen 277 SGWWHVVLNLEPTIAITQN 295 (407)
T ss_pred CCeEEEEeccCceeeeeec
Confidence 5689999988777777753
No 33
>cd00454 Trunc_globin Truncated hemoglobins (trHbs) are a family of oxygen-binding heme proteins found in cyanobacteria, eubacteria, unicellular eukaryotes, and plants. The truncated hemoglobins have a characteristic two-over-two alpha helical folding pattern that is distinct from the three-over-three pattern found in other globins. A subset of these have been demonstrated to form homodimers.
Probab=34.47 E-value=1.2e+02 Score=19.04 Aligned_cols=37 Identities=14% Similarity=0.111 Sum_probs=29.7
Q ss_pred CCCCHHHHHHHHHHHHHHHhhCCChHHHHHHHHHhcc
Q psy10106 23 TDMPNDMQQEVITVAKAAYERCNNFADLAAYIKKEFE 59 (101)
Q Consensus 23 sdM~~~mq~~~~~~a~~a~~~~~~~kdiA~~IK~~lD 59 (101)
-.++++.-...++...+|++++....++++.+...+.
T Consensus 71 ~~I~~~~f~~~l~~l~~al~~~~~~~~~~~~~~~~~~ 107 (116)
T cd00454 71 FPITEEEFDAWLELLRDALDELGVPAELADALLARAE 107 (116)
T ss_pred CCCCHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHH
Confidence 3788999999999999999999887666666665554
No 34
>PF08006 DUF1700: Protein of unknown function (DUF1700); InterPro: IPR012963 This family contains many hypothetical bacterial proteins and two putative membrane proteins (Q6GFD0 from SWISSPROT and Q6G806 from SWISSPROT).
Probab=34.04 E-value=1.2e+02 Score=21.05 Aligned_cols=37 Identities=14% Similarity=0.371 Sum_probs=26.7
Q ss_pred CCCCHHHHHHHHHHHHHHHhhC--------------CChHHHHHHHHHhcc
Q psy10106 23 TDMPNDMQQEVITVAKAAYERC--------------NNFADLAAYIKKEFE 59 (101)
Q Consensus 23 sdM~~~mq~~~~~~a~~a~~~~--------------~~~kdiA~~IK~~lD 59 (101)
.+||++.++++++...+-++.- ++-+++|+.|+....
T Consensus 15 ~~lp~~e~~e~l~~Y~e~f~d~~~~G~sEeeii~~LG~P~~iA~~i~~~~~ 65 (181)
T PF08006_consen 15 KKLPEEEREEILEYYEEYFDDAGEEGKSEEEIIAELGSPKEIAREILAEYS 65 (181)
T ss_pred HcCCHHHHHHHHHHHHHHHHHhhhCCCCHHHHHHHcCCHHHHHHHHHHhhh
Confidence 4699999999999887776542 233678888876543
No 35
>PRK14023 homoaconitate hydratase small subunit; Provisional
Probab=33.79 E-value=59 Score=22.97 Aligned_cols=33 Identities=18% Similarity=0.198 Sum_probs=20.9
Q ss_pred hHHHHHHHHHhcc----ccCCCceEEEEeCCceeeEE
Q psy10106 47 FADLAAYIKKEFE----NKYGPAWHCVVGQGFGSFVT 79 (101)
Q Consensus 47 ~kdiA~~IK~~lD----~~yg~~WhcIVG~~Fgs~vt 79 (101)
..++++++-..+| +++.+..-+|.|+|||+=-|
T Consensus 28 ~~~l~~~~f~~~~p~f~~~~~~g~IIVaG~NFG~GSS 64 (166)
T PRK14023 28 EDRFHNYAFAHLRPEFASTVRPGDILVAGRNFGLGSS 64 (166)
T ss_pred HHHHHhhhccCCChhhHhhcCCCCEEEccCcccCCcc
Confidence 3456666655555 44556677788889997433
No 36
>COG2101 SPT15 TATA-box binding protein (TBP), component of TFIID and TFIIIB [Transcription]
Probab=33.10 E-value=35 Score=24.73 Aligned_cols=16 Identities=19% Similarity=0.378 Sum_probs=12.3
Q ss_pred EEEEe--CCEEEEEEecC
Q psy10106 86 IYLYI--GMDGILLFRHS 101 (101)
Q Consensus 86 i~F~~--~~~~illfKt~ 101 (101)
+.+++ .+.++||||||
T Consensus 45 lv~Rl~ePk~a~LIF~SG 62 (185)
T COG2101 45 LVYRLEEPKTAALIFRSG 62 (185)
T ss_pred eEEEecCCcceEEEEecC
Confidence 55565 48899999987
No 37
>PF08015 Pheromone: Fungal mating-type pheromone; InterPro: IPR012597 This family corresponds to mating-type pheromone proteins. The homobasidiomycetes, or mushroom fungi, have arguably the most complex mating system of all known organisms. Many species possess a mating system known as bifactorial incompatibility, where two unlinked loci control the mating-type of an individual incompatibility loci (the A and B mating-type loci). Each A mating-type sublocus encodes a pair of divergently transcribed homeodomain transcription factors while the genes responsible for B mating-type activity encode lipopeptide pheromones and G-protein -coupled pheromone receptors [].; GO: 0000772 mating pheromone activity, 0016020 membrane
Probab=32.86 E-value=25 Score=20.97 Aligned_cols=14 Identities=36% Similarity=0.764 Sum_probs=8.7
Q ss_pred hccccCCC--ceEEEE
Q psy10106 57 EFENKYGP--AWHCVV 70 (101)
Q Consensus 57 ~lD~~yg~--~WhcIV 70 (101)
..|++.|+ +|-|||
T Consensus 54 d~Er~~~g~~~~fCVI 69 (69)
T PF08015_consen 54 DFERRGGGGAGAFCVI 69 (69)
T ss_pred CccccCCCCceEEEeC
Confidence 34555544 688886
No 38
>PF06840 DUF1241: Protein of unknown function (DUF1241); InterPro: IPR009652 This family consists of several programmed cell death 10 protein (PDCD10 or TFAR15) sequences. The function of this family is unknown.; PDB: 3L8I_A 3RQG_B 3RQE_B 3L8J_A 3RQF_B 3AJM_B.
Probab=32.41 E-value=29 Score=24.47 Aligned_cols=13 Identities=31% Similarity=0.473 Sum_probs=12.1
Q ss_pred HHHHHHHHHhccc
Q psy10106 48 ADLAAYIKKEFEN 60 (101)
Q Consensus 48 kdiA~~IK~~lD~ 60 (101)
||||..||+-||.
T Consensus 123 K~IAsaIK~lLdA 135 (154)
T PF06840_consen 123 KEIASAIKKLLDA 135 (154)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHH
Confidence 7999999999995
No 39
>KOG3650|consensus
Probab=32.34 E-value=82 Score=20.90 Aligned_cols=39 Identities=15% Similarity=0.208 Sum_probs=23.3
Q ss_pred CCCCCCCCCCCCCCCCceEEeCCCCHHHHHHHHHHHHHHHhh
Q psy10106 2 GPKPKKGGAAPKDDKAPLIKSTDMPNDMQQEVITVAKAAYER 43 (101)
Q Consensus 2 ~~~~~~~~~~m~~~~~~~I~~sdM~~~mq~~~~~~a~~a~~~ 43 (101)
|..||+.-|+|... ..-..+|+.+.+++-.+++...++.
T Consensus 30 grtprs~~P~~~~~---~l~a~e~~~d~~EEKaRlItQVLEL 68 (120)
T KOG3650|consen 30 GRTPRSLLPKMMNA---DLDAVEAENDVEEEKARLITQVLEL 68 (120)
T ss_pred CCCccccCcccccc---cccccccccChHHHHHHHHHHHHHH
Confidence 45589998988863 3555666665554444444444443
No 40
>PF07742 BTG: BTG family; InterPro: IPR002087 Anti-proliferative proteins have been shown to include mammalian and avian protein BTG1 (which appears to be involved in negative regulation of cell proliferation) and rat/mouse NGF-inducible protein PC3/TIS21 (BTG2) [, , ]. These proteins have from 158 to 363 amino acid residues, that are highly similar and include 3 conserved cysteine residues. BTG2 seems to have a signal sequence; while the other proteins may lack such a domain. The sequence of the N-terminal half of these proteins is well conserved.; PDB: 3DJU_B 3E9V_A 2Z15_D 2D5R_B 3DJN_B.
Probab=31.01 E-value=65 Score=21.48 Aligned_cols=19 Identities=16% Similarity=0.392 Sum_probs=14.6
Q ss_pred HHHHHHHHhccccCCCceE
Q psy10106 49 DLAAYIKKEFENKYGPAWH 67 (101)
Q Consensus 49 diA~~IK~~lD~~yg~~Wh 67 (101)
..|+.|.+.|-++|-+.|+
T Consensus 29 ~F~~~L~~~L~~ry~~HW~ 47 (118)
T PF07742_consen 29 RFAEELENLLCERYKGHWY 47 (118)
T ss_dssp HHHHHHHHHHHHHHTTS--
T ss_pred HHHHHHHHHHHHHHhCCCC
Confidence 4688899999999988887
No 41
>PF09868 DUF2095: Uncharacterized protein conserved in archaea (DUF2095); InterPro: IPR018662 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=30.69 E-value=1e+02 Score=21.06 Aligned_cols=15 Identities=13% Similarity=0.098 Sum_probs=13.4
Q ss_pred hHHHHHHHHHhcccc
Q psy10106 47 FADLAAYIKKEFENK 61 (101)
Q Consensus 47 ~kdiA~~IK~~lD~~ 61 (101)
..+.|+.|+..|.++
T Consensus 94 t~e~A~eLr~~L~~k 108 (128)
T PF09868_consen 94 TPEEAKELRSILVKK 108 (128)
T ss_pred CHHHHHHHHHHHHHh
Confidence 468999999999987
No 42
>PRK10667 Hha toxicity attenuator; Provisional
Probab=30.55 E-value=75 Score=21.54 Aligned_cols=20 Identities=15% Similarity=0.311 Sum_probs=18.3
Q ss_pred hCCChHHHHHHHHHhccccC
Q psy10106 43 RCNNFADLAAYIKKEFENKY 62 (101)
Q Consensus 43 ~~~~~kdiA~~IK~~lD~~y 62 (101)
||+++.+++..|-+.||++|
T Consensus 65 KYp~~~~l~~~ideYLDeTy 84 (122)
T PRK10667 65 KYPEDSKLIEQIDEYLDDTY 84 (122)
T ss_pred cCCcHhhHHHHHHHHHHHHH
Confidence 68888999999999999987
No 43
>PF06925 MGDG_synth: Monogalactosyldiacylglycerol (MGDG) synthase; InterPro: IPR009695 This entry represents a conserved region of approximately 180 residues found towirds the N terminus of a number of plant and bacterial diacylglycerol glucosyltransferases, such as monogalactosyldiacylglycerol synthase [].; GO: 0016758 transferase activity, transferring hexosyl groups, 0009247 glycolipid biosynthetic process
Probab=30.42 E-value=55 Score=22.25 Aligned_cols=27 Identities=19% Similarity=0.194 Sum_probs=22.8
Q ss_pred HHHHHHHHHhccccCCCceEEEEeCCc
Q psy10106 48 ADLAAYIKKEFENKYGPAWHCVVGQGF 74 (101)
Q Consensus 48 kdiA~~IK~~lD~~yg~~WhcIVG~~F 74 (101)
...|+.|++.|+..||+...|.|..-|
T Consensus 2 ~~aA~Al~eal~~~~~~~~~v~v~D~~ 28 (169)
T PF06925_consen 2 NSAARALAEALERRRGPDAEVEVVDFL 28 (169)
T ss_pred HHHHHHHHHHHHhhcCCCCEEEEEehH
Confidence 357899999999999999999988744
No 44
>cd05503 Bromo_BAZ2A_B_like Bromodomain, BAZ2A/BAZ2B_like subfamily. Bromo adjacent to zinc finger 2A (BAZ2A) and 2B (BAZ2B) were identified as a novel human bromodomain gene by cDNA library screening. BAZ2A is also known as Tip5 (Transcription termination factor I-interacting protein 5) and hWALp3. The proteins may play roles in transcriptional regulation. Human Tip5 is part of a complex termed NoRC (nucleolar remodeling complex), which induces nucleosome sliding and may play a role in the regulation of the rDNA locus. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=29.83 E-value=1.1e+02 Score=19.14 Aligned_cols=36 Identities=14% Similarity=0.203 Sum_probs=23.0
Q ss_pred HHHHHHHHHHHHHHHhhCCCh---HHHHHHHHHhccccC
Q psy10106 27 NDMQQEVITVAKAAYERCNNF---ADLAAYIKKEFENKY 62 (101)
Q Consensus 27 ~~mq~~~~~~a~~a~~~~~~~---kdiA~~IK~~lD~~y 62 (101)
++...++-.+...|..-++.. -..|..|++.|++++
T Consensus 59 ~ef~~D~~li~~Na~~yN~~~s~i~~~a~~l~~~f~~~~ 97 (97)
T cd05503 59 EEFAEDVRLVFDNCETFNEDDSEVGRAGHNMRKFFEKRW 97 (97)
T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHhC
Confidence 455666666666665544443 245888999988764
No 45
>cd00520 RRF Ribosome recycling factor (RRF). Ribosome recycling factor dissociates the posttermination complex, composed of the ribosome, deacylated tRNA, and mRNA, after termination of translation. Thus ribosomes are "recycled" and ready for another round of protein synthesis. RRF is believed to bind the ribosome at the A-site in a manner that mimics tRNA, but the specific mechanisms remain unclear. RRF is essential for bacterial growth. It is not necessary for cell growth in archaea or eukaryotes, but is found in mitochondria or chloroplasts of some eukaryotic species.
Probab=29.25 E-value=1.5e+02 Score=20.94 Aligned_cols=40 Identities=20% Similarity=0.199 Sum_probs=29.9
Q ss_pred CceEEeCCCCHHHHHHHHHHHHHHHhhCCCh-----HHHHHHHHH
Q psy10106 17 APLIKSTDMPNDMQQEVITVAKAAYERCNNF-----ADLAAYIKK 56 (101)
Q Consensus 17 ~~~I~~sdM~~~mq~~~~~~a~~a~~~~~~~-----kdiA~~IK~ 56 (101)
.+.|.---|++|.+++.++.|....++.... ++.-+.||.
T Consensus 93 ~iri~iP~lT~E~R~~lvK~~k~~~E~~Kv~iRniR~~~~~~lKk 137 (179)
T cd00520 93 VIRVNLPPLTEERRKELVKDAKKIAEEAKVAIRNIRRDANDKIKK 137 (179)
T ss_pred EEEecCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3677778999999999999999998887642 344455554
No 46
>PRK10425 DNase TatD; Provisional
Probab=29.25 E-value=1.5e+02 Score=21.99 Aligned_cols=46 Identities=17% Similarity=0.267 Sum_probs=33.5
Q ss_pred HHHHHHHHHHHHHHHhhCCC---------hHHHHHHHHHhccccCCCceEEEEeC
Q psy10106 27 NDMQQEVITVAKAAYERCNN---------FADLAAYIKKEFENKYGPAWHCVVGQ 72 (101)
Q Consensus 27 ~~mq~~~~~~a~~a~~~~~~---------~kdiA~~IK~~lD~~yg~~WhcIVG~ 72 (101)
.+.|+++.+.-.+...+++. .+++.+.||+...+.-++.+||.-|.
T Consensus 103 ~~~Q~~vF~~ql~lA~~~~~Pv~iH~r~a~~~~l~iL~~~~~~~~~~i~H~fsG~ 157 (258)
T PRK10425 103 PEEQERAFVAQLAIAAELNMPVFMHCRDAHERFMALLEPWLDKLPGAVLHCFTGT 157 (258)
T ss_pred HHHHHHHHHHHHHHHHHhCCCeEEEEeCchHHHHHHHHHhccCCCCeEEEecCCC
Confidence 56788888888887777653 46788888875433336789998887
No 47
>cd05498 Bromo_Brdt_II_like Bromodomain, Brdt_like subfamily, repeat II. Human Brdt is a testis-specific member of the BET subfamily of bromodomain proteins; the first bromodomain in Brdt has been shown to be essential for male germ cell differentiation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=28.83 E-value=1.2e+02 Score=19.05 Aligned_cols=36 Identities=14% Similarity=0.276 Sum_probs=24.3
Q ss_pred HHHHHHHHHHHHHHHhhCCCh---HHHHHHHHHhccccC
Q psy10106 27 NDMQQEVITVAKAAYERCNNF---ADLAAYIKKEFENKY 62 (101)
Q Consensus 27 ~~mq~~~~~~a~~a~~~~~~~---kdiA~~IK~~lD~~y 62 (101)
.+...++--+...|..-++.. -..|..|++.|++++
T Consensus 64 ~ef~~D~~li~~Na~~yn~~~s~i~~~a~~l~~~fe~~~ 102 (102)
T cd05498 64 QEFAADVRLMFSNCYKYNPPDHPVHAMARKLQDVFEDRW 102 (102)
T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHhC
Confidence 456666666666665555443 357999999998864
No 48
>PRK00083 frr ribosome recycling factor; Reviewed
Probab=28.63 E-value=1.2e+02 Score=21.59 Aligned_cols=43 Identities=16% Similarity=0.202 Sum_probs=31.7
Q ss_pred CCCCceEEeCCCCHHHHHHHHHHHHHHHhhCCCh-----HHHHHHHHH
Q psy10106 14 DDKAPLIKSTDMPNDMQQEVITVAKAAYERCNNF-----ADLAAYIKK 56 (101)
Q Consensus 14 ~~~~~~I~~sdM~~~mq~~~~~~a~~a~~~~~~~-----kdiA~~IK~ 56 (101)
++..+.|..-.|++|.+++.++.|.+..++.... ++.-+.||+
T Consensus 94 dg~~Iri~iP~lT~E~R~elvK~~k~~~E~aKv~iRniRr~~~~~iKk 141 (185)
T PRK00083 94 DGTVIRLPIPPLTEERRKELVKQVKKEAEEAKVAIRNIRRDANDKLKK 141 (185)
T ss_pred CCCEEEecCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444778888999999999999999988877642 344455554
No 49
>PF13798 PCYCGC: Protein of unknown function with PCYCGC motif
Probab=28.42 E-value=22 Score=25.26 Aligned_cols=33 Identities=15% Similarity=0.160 Sum_probs=23.0
Q ss_pred HHHHHHHHHHHHhhCCChHHHHHHHHHhccccCC
Q psy10106 30 QQEVITVAKAAYERCNNFADLAAYIKKEFENKYG 63 (101)
Q Consensus 30 q~~~~~~a~~a~~~~~~~kdiA~~IK~~lD~~yg 63 (101)
...-+++|..|+..+...+. ...|++..|++|.
T Consensus 114 C~vCl~ia~~a~~~~~~Gks-~~eIR~~ID~kYk 146 (158)
T PF13798_consen 114 CGVCLDIAVQAVQMYQEGKS-PKEIRQYIDEKYK 146 (158)
T ss_pred cHHHHHHHHHHHHHHHcCCC-HHHHHHHHHHHHH
Confidence 34456777888777766553 5678888888875
No 50
>cd03545 Rieske_RO_Alpha_OHBDO_like Rieske non-heme iron oxygenase (RO) family, Ortho-halobenzoate-1,2-dioxygenase (OHBDO)-like subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; composed of the oxygenase alpha subunits of OHBDO, salicylate 5-hydroxylase (S5H), terephthalate 1,2-dioxygenase system (TERDOS) and similar proteins. ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. OHBDO converts 2-chlorobenzoate (2-CBA) to catechol as well as 2,4-dCBA and 2,5-dCBA to 4-chlorocatechol, as part of the chlorobenzoate degradation pathway. Although ortho-substituted chlorobe
Probab=28.06 E-value=1.8e+02 Score=19.76 Aligned_cols=44 Identities=20% Similarity=0.411 Sum_probs=31.3
Q ss_pred HHHHHhccccCC-CceEEEEeC-CceeeEEecCCcEEEEEeCCEEEEEEec
Q psy10106 52 AYIKKEFENKYG-PAWHCVVGQ-GFGSFVTHDRSSFIYLYIGMDGILLFRH 100 (101)
Q Consensus 52 ~~IK~~lD~~yg-~~WhcIVG~-~Fgs~vth~~~~~i~F~~~~~~illfKt 100 (101)
+....++++-|. ..|+.|.-. .. -+++.++-+.+++..++|+|.
T Consensus 11 ~~~~~E~~~if~~~~W~~v~~~~el-----~~~g~~~~~~i~g~~iiv~r~ 56 (150)
T cd03545 11 AYFDREQERIFRGKTWSYVGLEAEI-----PNAGDFKSTFVGDTPVVVTRA 56 (150)
T ss_pred HHHHHHHHhhhCCCceEEEEEHHHC-----CCCCCEEEEEECCceEEEEEC
Confidence 456677778885 899998755 22 134677778888888888873
No 51
>PF02283 CobU: Cobinamide kinase / cobinamide phosphate guanyltransferase; InterPro: IPR003203 This family is composed of a group of bifunctional cobalbumin biosynthesis enzymes which display cobinamide kinase and cobinamide phosphate guanyltransferase activity. The crystal structure of the enzyme reveals the molecule to be a trimer with a propeller-like shape [].; GO: 0000166 nucleotide binding, 0043752 adenosylcobinamide kinase activity, 0051188 cofactor biosynthetic process; PDB: 1CBU_C 1C9K_B.
Probab=27.84 E-value=68 Score=22.38 Aligned_cols=24 Identities=8% Similarity=0.238 Sum_probs=20.7
Q ss_pred hHHHHHHHHHhccccCCCceEEEEe
Q psy10106 47 FADLAAYIKKEFENKYGPAWHCVVG 71 (101)
Q Consensus 47 ~kdiA~~IK~~lD~~yg~~WhcIVG 71 (101)
+.|+++.|+++-..+ +..|+.|=-
T Consensus 36 D~em~~RI~~H~~~R-~~~w~tiE~ 59 (167)
T PF02283_consen 36 DEEMRERIARHRQRR-PKGWITIEE 59 (167)
T ss_dssp HHHHHHHHHHHHHHS-STCEEEEE-
T ss_pred CHHHHHHHHHHHHhC-CCCcEEEec
Confidence 468999999999999 999999865
No 52
>PF03701 UPF0181: Uncharacterised protein family (UPF0181); InterPro: IPR005371 This family contains small proteins of about 50 amino acids of unknown function. The family includes YoaH P76260 from SWISSPROT.
Probab=27.65 E-value=1.3e+02 Score=17.36 Aligned_cols=33 Identities=18% Similarity=0.195 Sum_probs=25.6
Q ss_pred CCHHHHHHHHHHHHHHHhhCCCh----HHHHHHHHHh
Q psy10106 25 MPNDMQQEVITVAKAAYERCNNF----ADLAAYIKKE 57 (101)
Q Consensus 25 M~~~mq~~~~~~a~~a~~~~~~~----kdiA~~IK~~ 57 (101)
++-+.|+.|++.+.+.+..=-+. +-||+.|++.
T Consensus 8 LtHeeQQ~AvE~Iq~LMaqGmSsgEAI~~VA~~iRe~ 44 (51)
T PF03701_consen 8 LTHEEQQQAVERIQELMAQGMSSGEAIAIVAQEIREE 44 (51)
T ss_pred CCHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHH
Confidence 45688999999999999876554 4578888764
No 53
>TIGR03849 arch_ComA phosphosulfolactate synthase. This model finds the ComA (Coenzyme M biosynthesis A) protein, phosphosulfolactate synthase, in methanogenic archaea. The ComABC pathway is one of at least two pathways to the intermediate sulfopyruvate. Coenzyme M occurs rarely and sporadically outside of the archaea, as for expoxide metabolism in Xanthobacter autotrophicus Py2, but candidate phosphosulfolactate synthases from that and other species occur fall below the cutoff and outside the scope of this model. This model deliberately is narrower in scope than pfam02679.
Probab=27.51 E-value=1.5e+02 Score=22.19 Aligned_cols=46 Identities=24% Similarity=0.294 Sum_probs=31.6
Q ss_pred eCCCCHHHHHHHHHHHHHH-H-----------hhC--CChHHHHHHHHHhccccCCCceEEEE
Q psy10106 22 STDMPNDMQQEVITVAKAA-Y-----------ERC--NNFADLAAYIKKEFENKYGPAWHCVV 70 (101)
Q Consensus 22 ~sdM~~~mq~~~~~~a~~a-~-----------~~~--~~~kdiA~~IK~~lD~~yg~~WhcIV 70 (101)
..+||++.+..+++.+.+. + ... ..-.+..+.++..||. | .|.+||
T Consensus 94 ~~~i~~~~~~rlI~~~~~~g~~v~~EvG~K~~~~~~~~~~~~~i~~~~~~LeA--G-A~~Vii 153 (237)
T TIGR03849 94 SMEISLEERCNLIERAKDNGFMVLSEVGKKSPEKDSELTPDDRIKLINKDLEA--G-ADYVII 153 (237)
T ss_pred ccCCCHHHHHHHHHHHHhCCCeEeccccccCCcccccCCHHHHHHHHHHHHHC--C-CcEEEE
Confidence 4689999999999999854 1 111 1124677888888986 4 666665
No 54
>cd01579 AcnA_Bact_Swivel Bacterial Aconitase-like swivel domain. Aconitase (aconitate hydratase or citrate hydrolyase) catalyzes the reversible isomerization of citrate and isocitrate as part of the TCA cycle. Cis-aconitate is formed as an intermediate product during the course of the reaction. This is the aconitase-like swivel domain, which is believed to undergo swivelling conformational change in the enzyme mechanism. This distinct subfamily is found only in bacteria and archea. Its exact characteristics are not known.
Probab=27.16 E-value=42 Score=22.48 Aligned_cols=17 Identities=18% Similarity=0.126 Sum_probs=12.1
Q ss_pred CCCceEEEEeCCceeeE
Q psy10106 62 YGPAWHCVVGQGFGSFV 78 (101)
Q Consensus 62 yg~~WhcIVG~~Fgs~v 78 (101)
+|..+-+|.|+|||+=-
T Consensus 46 ~~~~~iiVaG~nFG~GS 62 (121)
T cd01579 46 AAGPGFIVGGENYGQGS 62 (121)
T ss_pred cCCCeEEEcCCcCCCCc
Confidence 45567778888999743
No 55
>PF06884 DUF1264: Protein of unknown function (DUF1264); InterPro: IPR010686 This family contains a number of bacterial and eukaryotic proteins of unknown function that are approximately 200 residues long. Some family members are annotated as putative lipoproteins.
Probab=26.93 E-value=45 Score=23.93 Aligned_cols=19 Identities=21% Similarity=0.520 Sum_probs=13.6
Q ss_pred HHHHHHHhccccCCCceEE
Q psy10106 50 LAAYIKKEFENKYGPAWHC 68 (101)
Q Consensus 50 iA~~IK~~lD~~yg~~Whc 68 (101)
+.+.+-+.+-+.||.+||.
T Consensus 98 ae~~~m~~l~~tYGKt~Ht 116 (171)
T PF06884_consen 98 AEKAEMEKLVKTYGKTWHT 116 (171)
T ss_pred HHHHHHHHHHhhhCCeEEe
Confidence 3455566677889999884
No 56
>PRK00394 transcription factor; Reviewed
Probab=26.86 E-value=90 Score=22.20 Aligned_cols=25 Identities=16% Similarity=0.260 Sum_probs=19.8
Q ss_pred eEEecCCcE--EEEEeC--CEEEEEEecC
Q psy10106 77 FVTHDRSSF--IYLYIG--MDGILLFRHS 101 (101)
Q Consensus 77 ~vth~~~~~--i~F~~~--~~~illfKt~ 101 (101)
...|+|..| +-|++. ...+|||.||
T Consensus 120 ~~~YePe~fPglvyR~~~pk~~~lIF~SG 148 (179)
T PRK00394 120 NIEYEPEQFPGLVYRLDDPKVVVLLFGSG 148 (179)
T ss_pred CcEECcccCceEEEEecCCcEEEEEEcCC
Confidence 478888875 667774 7999999997
No 57
>PF01743 PolyA_pol: Poly A polymerase head domain; InterPro: IPR002646 This group includes nucleic acid independent RNA polymerases, such as polynucleotide adenylyltransferase (2.7.7.19 from EC), which adds the poly (A) tail to mRNA. This group also includes the tRNA nucleotidyltransferase that adds the CCA to the 3' of the tRNA 2.7.7.25 from EC.; GO: 0003723 RNA binding, 0016779 nucleotidyltransferase activity, 0006396 RNA processing; PDB: 1VFG_A 3H38_A 3H3A_B 3H39_B 3H37_A 1MIY_A 1MIV_B 1MIW_B 1OU5_B 3AQN_A ....
Probab=26.55 E-value=1.8e+02 Score=18.99 Aligned_cols=37 Identities=19% Similarity=0.217 Sum_probs=27.5
Q ss_pred HHHHHHhccccCCCceEEEEeCCceeeEEecCCcEEEEE
Q psy10106 51 AAYIKKEFENKYGPAWHCVVGQGFGSFVTHDRSSFIYLY 89 (101)
Q Consensus 51 A~~IK~~lD~~yg~~WhcIVG~~Fgs~vth~~~~~i~F~ 89 (101)
+..+.+.|.+.++..|++ |++|+..--+..+..+.+.
T Consensus 28 ~~~~~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~~di~ 64 (126)
T PF01743_consen 28 PEEFAKLLAKKLGGVFVV--GKRFGTVRVVFGGGSIDIA 64 (126)
T ss_dssp HHHHHHHHCTTCCEEEEE--ETTTTEEEEEETTCEEEEE
T ss_pred HHHHHHHHHhhccccccc--ccccceeeecCCCcccccc
Confidence 456777788888887777 9999987777666666554
No 58
>COG1352 CheR Methylase of chemotaxis methyl-accepting proteins [Cell motility and secretion / Signal transduction mechanisms]
Probab=26.05 E-value=82 Score=23.92 Aligned_cols=49 Identities=14% Similarity=0.154 Sum_probs=31.7
Q ss_pred CceEEeCCCCHHHHHHHHHHHHHHHhhCCChHHH-HHHHHHhccccCCCceEE
Q psy10106 17 APLIKSTDMPNDMQQEVITVAKAAYERCNNFADL-AAYIKKEFENKYGPAWHC 68 (101)
Q Consensus 17 ~~~I~~sdM~~~mq~~~~~~a~~a~~~~~~~kdi-A~~IK~~lD~~yg~~Whc 68 (101)
++.|..+|.+....+.|..-.-.+... .+.+ .+.+++.|++.-++.|.+
T Consensus 129 ~~~I~AtDId~~~L~~A~~G~Y~~~~~---~~~~~~~~~~ryF~~~~~~~y~v 178 (268)
T COG1352 129 RVKILATDIDLSVLEKARAGIYPSREL---LRGLPPELLRRYFERGGDGSYRV 178 (268)
T ss_pred ceEEEEEECCHHHHHHHhcCCCChhHh---hccCCHHHHhhhEeecCCCcEEE
Confidence 489999999998777654433221111 1222 567789998887776765
No 59
>KOG4194|consensus
Probab=25.75 E-value=39 Score=29.46 Aligned_cols=23 Identities=26% Similarity=0.431 Sum_probs=18.8
Q ss_pred CCceEEEEeCCceeeEEecCCcE
Q psy10106 63 GPAWHCVVGQGFGSFVTHDRSSF 85 (101)
Q Consensus 63 g~~WhcIVG~~Fgs~vth~~~~~ 85 (101)
.+-.||||...|||.+++-.+--
T Consensus 586 ~grYQCVvtN~FGStysqk~Klt 608 (873)
T KOG4194|consen 586 EGRYQCVVTNHFGSTYSQKAKLT 608 (873)
T ss_pred CceEEEEEecccCcchhheeEEE
Confidence 57899999999999988765433
No 60
>KOG1569|consensus
Probab=25.73 E-value=1.4e+02 Score=23.45 Aligned_cols=28 Identities=11% Similarity=0.126 Sum_probs=19.0
Q ss_pred HHHHHHHHHhccc------cCCCce-EEEEeC-Cce
Q psy10106 48 ADLAAYIKKEFEN------KYGPAW-HCVVGQ-GFG 75 (101)
Q Consensus 48 kdiA~~IK~~lD~------~yg~~W-hcIVG~-~Fg 75 (101)
.+|++.|-+...- .++..| ||.||+ +|.
T Consensus 226 ~nipemieeFk~G~~i~~d~~~~~~~~~~vGkl~mt 261 (323)
T KOG1569|consen 226 RNIPEMIEEFKNGHEIKFDEERENILQIKVGKLDMT 261 (323)
T ss_pred cchHHHHHHhhCCcccccccccCceeeeeeeeecCC
Confidence 4677766554432 246788 999999 775
No 61
>PF09570 RE_SinI: SinI restriction endonuclease; InterPro: IPR019070 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone []. This entry includes the restriction endonuclease SinI, which recognises and cleaves the double-stranded sequence G^GWCC. ; GO: 0003677 DNA binding, 0009036 Type II site-specific deoxyribonuclease activity, 0009307 DNA restriction-modification system
Probab=25.71 E-value=1.1e+02 Score=22.86 Aligned_cols=50 Identities=16% Similarity=0.307 Sum_probs=28.4
Q ss_pred CCCCHHHHHHHHHHHHHHH----hhCC--------ChHHHHHHHHHhccccCCC-ceEEEEeC
Q psy10106 23 TDMPNDMQQEVITVAKAAY----ERCN--------NFADLAAYIKKEFENKYGP-AWHCVVGQ 72 (101)
Q Consensus 23 sdM~~~mq~~~~~~a~~a~----~~~~--------~~kdiA~~IK~~lD~~yg~-~WhcIVG~ 72 (101)
+..|+||...+++.+..+- +... .|+-+-..|-++||.+-.| +|++.-|.
T Consensus 75 ~TvpDE~vs~i~~~~~~~s~e~l~~i~~~HrlsM~aENivG~LLErYL~~~Le~~GW~wC~Gs 137 (221)
T PF09570_consen 75 TTVPDEMVSVIMNASFGYSQEDLELIKEGHRLSMAAENIVGALLERYLAEVLEPYGWIWCCGS 137 (221)
T ss_pred CCCChHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCceeeccc
Confidence 5567777777766433221 1111 1233445577777776544 69998887
No 62
>PF09477 Type_III_YscG: Bacterial type II secretion system chaperone protein (type_III_yscG); InterPro: IPR013348 YscG is a molecular chaperone for YscE, where both are part of the type III secretion system that in Yersinia is designated Ysc (Yersinia secretion). The secretion system delivers effector proteins, designated Yops (Yersinia outer proteins), in Yersinia. This entry consists of YscG from Yersinia, and functionally equivalent type III secretion proteins in other species: e.g. AscG in Aeromonas and LscG in Photorhabdus luminescens.; GO: 0009405 pathogenesis; PDB: 3PH0_D 2UWJ_G 2P58_C.
Probab=25.67 E-value=1.4e+02 Score=20.05 Aligned_cols=32 Identities=13% Similarity=0.228 Sum_probs=22.3
Q ss_pred CCHHHHHHHHHHHHHHHhhCCCh--HHHHHHHHH
Q psy10106 25 MPNDMQQEVITVAKAAYERCNNF--ADLAAYIKK 56 (101)
Q Consensus 25 M~~~mq~~~~~~a~~a~~~~~~~--kdiA~~IK~ 56 (101)
|+.+++....++|.-+...+--+ ..||..|+.
T Consensus 1 M~~~l~~lLAElAL~atG~HcH~EA~tIa~wL~~ 34 (116)
T PF09477_consen 1 MNRELRRLLAELALMATGHHCHQEANTIADWLEQ 34 (116)
T ss_dssp --HHHHHHHHHHHHHHHTTT-HHHHHHHHHHHHH
T ss_pred CchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHh
Confidence 78899999999999998877532 456666654
No 63
>PF10757 YbaJ: Biofilm formation regulator YbaJ; InterPro: IPR019693 YbaJ regulates biofilm formation. It also has an important role in the regulation of motility in the biofilm. YbaJ functions in increasing conjugation, aggregation and decreasing the motility, resulting in an increase of biofilm [].
Probab=25.01 E-value=91 Score=21.15 Aligned_cols=20 Identities=20% Similarity=0.419 Sum_probs=18.2
Q ss_pred hCCChHHHHHHHHHhccccC
Q psy10106 43 RCNNFADLAAYIKKEFENKY 62 (101)
Q Consensus 43 ~~~~~kdiA~~IK~~lD~~y 62 (101)
||+++.++...|-+.||++|
T Consensus 65 KYp~~~~l~~~ideYLDeTy 84 (122)
T PF10757_consen 65 KYPDESDLIELIDEYLDETY 84 (122)
T ss_pred ccCcHhhHHHHHHHHHHHHH
Confidence 68888999999999999987
No 64
>PF10440 WIYLD: Ubiquitin-binding WIYLD domain; InterPro: IPR018848 This entry represents a presumed domain which has been predicted to contain three alpha helices. It was named the WIYLD domain based on the pattern of the ost conserved residues []. This domain appears to be specific to plant SET-domain proteins. ; GO: 0018024 histone-lysine N-methyltransferase activity
Probab=24.91 E-value=72 Score=19.29 Aligned_cols=40 Identities=15% Similarity=0.271 Sum_probs=26.4
Q ss_pred HHHHHHHHhhCCChHH-HHHHHHHhccccCCCceEEEEeCCc
Q psy10106 34 ITVAKAAYERCNNFAD-LAAYIKKEFENKYGPAWHCVVGQGF 74 (101)
Q Consensus 34 ~~~a~~a~~~~~~~kd-iA~~IK~~lD~~yg~~WhcIVG~~F 74 (101)
++.|.+|+...+-.++ |-.-||+-| +-||+.|--|=-.||
T Consensus 11 ~daA~dam~~lG~~~~~v~~vl~~LL-~lY~~nW~lIEed~Y 51 (65)
T PF10440_consen 11 IDAALDAMRQLGFSKKQVRPVLKNLL-KLYDGNWELIEEDNY 51 (65)
T ss_pred HHHHHHHHHHcCCCHHHHHHHHHHHH-HHHcCCchhhhcccH
Confidence 5568888888886544 445555544 468888977654444
No 65
>CHL00123 rps6 ribosomal protein S6; Validated
Probab=24.68 E-value=1.7e+02 Score=18.46 Aligned_cols=35 Identities=9% Similarity=0.022 Sum_probs=26.8
Q ss_pred CCCCCCCceEEeCCCCHHHHHHHHHHHHHHHhhCC
Q psy10106 11 APKDDKAPLIKSTDMPNDMQQEVITVAKAAYERCN 45 (101)
Q Consensus 11 ~m~~~~~~~I~~sdM~~~mq~~~~~~a~~a~~~~~ 45 (101)
+|-...-+.|+..|++++..+.+++...+.+.+.+
T Consensus 4 ~mr~YE~~~Il~p~l~e~~~~~~~~~~~~~i~~~g 38 (97)
T CHL00123 4 KLNKYETMYLLKPDLNEEELLKWIENYKKLLRKRG 38 (97)
T ss_pred cccceeEEEEECCCCCHHHHHHHHHHHHHHHHHCC
Confidence 34333337789999999999999888888887765
No 66
>cd05392 RasGAP_Neurofibromin_like Neurofibromin-like proteins include the Saccharomyces cerevisiae RasGAP proteins Ira1 and Ira2, the closest homolog of neurofibromin, which is responsible for the human autosomal dominant disease neurofibromatosis type I (NF1). The RasGAP Ira1/2 proteins are negative regulators of the Ras-cAMP signaling pathway and conserved from yeast to human. In yeast Ras proteins are activated by GEFs, and inhibited by two GAPs, Ira1 and Ira2. Ras proteins activate the cAMP/protein kinase A (PKA) pathway, which controls metabolism, stress resistance, growth, and meiosis. Recent studies showed that the kelch proteins Gpb1 and Gpb2 inhibit Ras activity via association with Ira1 and Ira2. Gpb1/2 bind to a conserved C-terminal domain of Ira1/2, and loss of Gpb1/2 results in a destabilization of Ira1 and Ira2, leading to elevated levels of Ras2-GTP and uninhibited cAMP-PKA signaling. Since the Gpb1/2 binding domain on Ira1/2 is conserved in the human neurofibromin prote
Probab=24.58 E-value=55 Score=25.02 Aligned_cols=31 Identities=16% Similarity=0.190 Sum_probs=22.0
Q ss_pred HhhCCCh-HHHHHHHHHhccccCCCceEEEEe
Q psy10106 41 YERCNNF-ADLAAYIKKEFENKYGPAWHCVVG 71 (101)
Q Consensus 41 ~~~~~~~-kdiA~~IK~~lD~~yg~~WhcIVG 71 (101)
++.++.+ +.||+.|++.+.++|...+.-+||
T Consensus 143 ~~~~P~~lr~i~~~l~~~v~~kfp~~~~~~Vg 174 (323)
T cd05392 143 LDRFPPELREICHHIYEVVSEKFPDSALSAVG 174 (323)
T ss_pred HHhCCHHHHHHHHHHHHHHHHHCCCchHHHHH
Confidence 3444443 789999999999999875544444
No 67
>cd00197 VHS_ENTH_ANTH VHS, ENTH and ANTH domain superfamily; composed of proteins containing a VHS, ENTH or ANTH domain. The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It is located at the N-termini of proteins involved in intracellular membrane trafficking. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. VHS, ENTH and ANTH domains are structurally similar and are composed of a superhelix of eight alpha helices. ENTH adnd ANTH (E/ANTH) domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membra
Probab=24.34 E-value=42 Score=21.41 Aligned_cols=20 Identities=35% Similarity=0.313 Sum_probs=15.6
Q ss_pred hHHHHHHHHHhccccCCCceEEE
Q psy10106 47 FADLAAYIKKEFENKYGPAWHCV 69 (101)
Q Consensus 47 ~kdiA~~IK~~lD~~yg~~WhcI 69 (101)
-+++++.|++.|+.+ .|+++
T Consensus 35 ~~~~~~~l~kRl~~~---~~~~~ 54 (115)
T cd00197 35 PKEAVDAIKKRINNK---NPHVV 54 (115)
T ss_pred HHHHHHHHHHHhcCC---cHHHH
Confidence 478999999999875 67654
No 68
>PLN00062 TATA-box-binding protein; Provisional
Probab=23.95 E-value=1.1e+02 Score=21.81 Aligned_cols=24 Identities=17% Similarity=0.306 Sum_probs=15.8
Q ss_pred EEecCCcE--EEEEe--CCEEEEEEecC
Q psy10106 78 VTHDRSSF--IYLYI--GMDGILLFRHS 101 (101)
Q Consensus 78 vth~~~~~--i~F~~--~~~~illfKt~ 101 (101)
..|+++.| +.+++ ....+|||+||
T Consensus 29 ~eYePe~fpgli~Rl~~Pk~t~lIF~SG 56 (179)
T PLN00062 29 AEYNPKRFAAVIMRIREPKTTALIFASG 56 (179)
T ss_pred CEECCccCcEEEEEeCCCcEEEEEECCC
Confidence 45666553 55555 37888999887
No 69
>cd01578 AcnA_Mitochon_Swivel Mitochondrial aconitase A swivel domain. Aconitase (also known as aconitate hydratase and citrate hydro-lyase) catalyzes the reversible isomerization of citrate and isocitrate as part of the TCA cycle. This is the aconitase swivel domain, which undergoes swivelling conformational change in the enzyme mechanism. In eukaryotes two isozymes of aconitase are known to exist: one found in the mitochondrial matrix and the other found in the cytoplasm. This is the mitochondrial form. The mitochondrial product is coded by a nuclear gene. Most members of this subfamily are mitochondrial but there are some bacterial members.
Probab=23.95 E-value=46 Score=23.35 Aligned_cols=18 Identities=17% Similarity=0.423 Sum_probs=13.5
Q ss_pred CCCceEEEEeCCceeeEE
Q psy10106 62 YGPAWHCVVGQGFGSFVT 79 (101)
Q Consensus 62 yg~~WhcIVG~~Fgs~vt 79 (101)
-|-.|-+|.|++||+=-|
T Consensus 67 ~g~~~iIVaG~nyG~GSS 84 (149)
T cd01578 67 HGIKWVVIGDENYGEGSS 84 (149)
T ss_pred cCCCeEEEccCccCCCCc
Confidence 355799999999996433
No 70
>COG3140 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.78 E-value=1.7e+02 Score=17.31 Aligned_cols=34 Identities=12% Similarity=0.148 Sum_probs=24.5
Q ss_pred CCCHHHHHHHHHHHHHHHhhCCCh----HHHHHHHHHh
Q psy10106 24 DMPNDMQQEVITVAKAAYERCNNF----ADLAAYIKKE 57 (101)
Q Consensus 24 dM~~~mq~~~~~~a~~a~~~~~~~----kdiA~~IK~~ 57 (101)
.++-+.|+.+++.+.+.+..=-+. .-+|+.|++.
T Consensus 7 ~LtHeqQQ~AVE~Iq~lMaeGmSsGEAIa~VA~elRe~ 44 (60)
T COG3140 7 SLTHEQQQKAVERIQELMAEGMSSGEAIALVAQELREN 44 (60)
T ss_pred cccHHHHHHHHHHHHHHHHccccchhHHHHHHHHHHHH
Confidence 467789999999999988765443 3357777653
No 71
>cd04518 TBP_archaea archaeal TATA box binding protein (TBP): TBPs are transcription factors present in archaea and eukaryotes, that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA.
Probab=23.72 E-value=84 Score=22.27 Aligned_cols=25 Identities=16% Similarity=0.227 Sum_probs=19.6
Q ss_pred eEEecCCcE--EEEEeC--CEEEEEEecC
Q psy10106 77 FVTHDRSSF--IYLYIG--MDGILLFRHS 101 (101)
Q Consensus 77 ~vth~~~~~--i~F~~~--~~~illfKt~ 101 (101)
...|+|..| +-|++. ...+|||.||
T Consensus 119 ~~~YePe~fpglvyR~~~pk~~~lIF~SG 147 (174)
T cd04518 119 NAEYEPEQFPGLVYRLDEPKVVLLLFSSG 147 (174)
T ss_pred CCccCcccCceEEEEecCCcEEEEEeCCC
Confidence 567888775 667774 8999999987
No 72
>PF06869 DUF1258: Protein of unknown function (DUF1258); InterPro: IPR009667 This family represents a conserved region approximately 260 residues long within a number of hypothetical proteins of unknown function that seem to be specific to Caenorhabditis elegans. Note that this family contains a number of conserved cysteine and histidine residues.
Probab=23.72 E-value=1.9e+02 Score=22.18 Aligned_cols=49 Identities=16% Similarity=0.281 Sum_probs=41.3
Q ss_pred ceEEeCCCCHHHHHHHHHHHHHHHhhCCC------hHHHHHHHHHhccccCCCce
Q psy10106 18 PLIKSTDMPNDMQQEVITVAKAAYERCNN------FADLAAYIKKEFENKYGPAW 66 (101)
Q Consensus 18 ~~I~~sdM~~~mq~~~~~~a~~a~~~~~~------~kdiA~~IK~~lD~~yg~~W 66 (101)
+.++-.|||+...+...++..+++-.... .+.|...|+..++...|..|
T Consensus 122 vsl~l~DLp~~~mqks~nvil~gi~e~s~nPSt~~wn~i~p~i~sd~e~~~g~v~ 176 (258)
T PF06869_consen 122 VSLMLVDLPTSEMQKSTNVILEGIVECSENPSTKVWNRIIPMIFSDMESRTGEVG 176 (258)
T ss_pred EEEEeecCChhHhhhhHHHhhhheeeccCCCchHHHHHHHHHHHHHhhccccEEc
Confidence 56788999999999999999999977654 36799999999998887643
No 73
>KOG3938|consensus
Probab=23.28 E-value=1.4e+02 Score=23.32 Aligned_cols=23 Identities=17% Similarity=0.329 Sum_probs=20.8
Q ss_pred CCCCHHHHHHHHHHHHHHHhhCC
Q psy10106 23 TDMPNDMQQEVITVAKAAYERCN 45 (101)
Q Consensus 23 sdM~~~mq~~~~~~a~~a~~~~~ 45 (101)
-|+++++.+.+++.+.+.++.|-
T Consensus 250 Ed~~~e~~~~~i~kindllesym 272 (334)
T KOG3938|consen 250 EDQPDEFEEAAIEKINDLLESYM 272 (334)
T ss_pred ccchhHHHHHHHHHHHHHHHHhc
Confidence 58999999999999999999884
No 74
>PRK13293 F420-0--gamma-glutamyl ligase; Reviewed
Probab=23.17 E-value=76 Score=23.98 Aligned_cols=29 Identities=17% Similarity=0.195 Sum_probs=26.3
Q ss_pred HHHHHHHHhccccCCCceEEEEeCCceee
Q psy10106 49 DLAAYIKKEFENKYGPAWHCVVGQGFGSF 77 (101)
Q Consensus 49 diA~~IK~~lD~~yg~~WhcIVG~~Fgs~ 77 (101)
.-|+.|++.|.+++|..=-|||..+||--
T Consensus 127 ~SA~~ir~~l~~~~g~~v~VIItDt~gr~ 155 (245)
T PRK13293 127 ESAERIREGLEELTGKKVGVIITDTNGRP 155 (245)
T ss_pred HHHHHHHHHHHHHHCCCEEEEEEcCCCcc
Confidence 47999999999999999999999999854
No 75
>cd00652 TBP_TLF TATA box binding protein (TBP): Present in archaea and eukaryotes, TBPs are transcription factors that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA. New members of the TBP family, called TBP-like proteins (TBLP, TLF, TLP) or TBP-related factors (TRF1, TRF2,TRP), are similar to the core domain of TBPs, with identical or chemically similar amino acids at many
Probab=23.10 E-value=1.2e+02 Score=21.27 Aligned_cols=26 Identities=12% Similarity=0.175 Sum_probs=20.3
Q ss_pred eeEEecCCcE--EEEEeC--CEEEEEEecC
Q psy10106 76 SFVTHDRSSF--IYLYIG--MDGILLFRHS 101 (101)
Q Consensus 76 s~vth~~~~~--i~F~~~--~~~illfKt~ 101 (101)
..+.|+|..| +-|++. ...+|||.||
T Consensus 119 ~~~~YePe~fpgli~r~~~pk~t~lIF~sG 148 (174)
T cd00652 119 ENASYEPELFPGLIYRMDEPKVVLLIFVSG 148 (174)
T ss_pred cccEECCccCceEEEEecCCcEEEEEEcCC
Confidence 4788999865 556664 8899999987
No 76
>PF10302 DUF2407: DUF2407 ubiquitin-like domain; InterPro: IPR019413 This entry represents a family of proteins of unknown function found in fungi. They contain a characteristic GFDRL sequence motif.
Probab=23.07 E-value=40 Score=21.67 Aligned_cols=8 Identities=38% Similarity=0.962 Sum_probs=6.3
Q ss_pred ceEEEEeC
Q psy10106 65 AWHCVVGQ 72 (101)
Q Consensus 65 ~WhcIVG~ 72 (101)
.=||+||.
T Consensus 89 yIhCsIGd 96 (97)
T PF10302_consen 89 YIHCSIGD 96 (97)
T ss_pred EEEEeccC
Confidence 35999996
No 77
>PF01726 LexA_DNA_bind: LexA DNA binding domain; InterPro: IPR006199 This is the DNA binding domain of the LexA SOS regulon repressor which prevents expression of DNA repair proteins in bacteria. The aligned region contains a variant form of the helix-turn-helix DNA binding motif []. This domain usually at the N terminus is found associated with IPR006198 from INTERPRO the auto-proteolytic domain of LexA 3.4.21.88 from EC.; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1LEA_A 1JHH_A 3JSP_A 1JHF_A 3JSO_B 1LEB_A 3K2Z_A.
Probab=23.03 E-value=1.6e+02 Score=17.25 Aligned_cols=28 Identities=18% Similarity=0.306 Sum_probs=19.6
Q ss_pred HHHHHHHHHHHHHHHhhCC---ChHHHHHHH
Q psy10106 27 NDMQQEVITVAKAAYERCN---NFADLAAYI 54 (101)
Q Consensus 27 ~~mq~~~~~~a~~a~~~~~---~~kdiA~~I 54 (101)
.+.|.+|++...+-+++++ +.+|||+.+
T Consensus 5 T~rQ~~vL~~I~~~~~~~G~~Pt~rEIa~~~ 35 (65)
T PF01726_consen 5 TERQKEVLEFIREYIEENGYPPTVREIAEAL 35 (65)
T ss_dssp -HHHHHHHHHHHHHHHHHSS---HHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHcCCCCCHHHHHHHh
Confidence 4678889998888887754 457777654
No 78
>PF01545 Cation_efflux: Cation efflux family; InterPro: IPR002524 Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are considered to be efflux pumps that remove these ions from cells [, ], however others are implicated in ion uptake []. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport, 0055085 transmembrane transport, 0016021 integral to membrane; PDB: 3BYP_B 3BYR_A 3H90_B 2QFI_B 2ZZT_A.
Probab=22.43 E-value=96 Score=22.56 Aligned_cols=22 Identities=14% Similarity=0.412 Sum_probs=19.9
Q ss_pred HHHHHHHHhccccCCCceEEEE
Q psy10106 49 DLAAYIKKEFENKYGPAWHCVV 70 (101)
Q Consensus 49 diA~~IK~~lD~~yg~~WhcIV 70 (101)
++++.|++.+-++|++.++|.|
T Consensus 255 ~i~~~i~~~l~~~~~~i~~v~I 276 (284)
T PF01545_consen 255 EIRERIEKRLREKFPGIYDVTI 276 (284)
T ss_dssp HHHHHHHHHHHHHSTTCEEEEE
T ss_pred HHHHHHHHHHHHhCCCcEEEEE
Confidence 5899999999999999999776
No 79
>COG1498 SIK1 Protein implicated in ribosomal biogenesis, Nop56p homolog [Translation, ribosomal structure and biogenesis]
Probab=22.10 E-value=1.1e+02 Score=24.76 Aligned_cols=41 Identities=15% Similarity=0.230 Sum_probs=34.6
Q ss_pred EEeCCCCHHHHHHHHHHHHHHHhhCCChHHHHHHHHHhccc
Q psy10106 20 IKSTDMPNDMQQEVITVAKAAYERCNNFADLAAYIKKEFEN 60 (101)
Q Consensus 20 I~~sdM~~~mq~~~~~~a~~a~~~~~~~kdiA~~IK~~lD~ 60 (101)
-...+|++.+...|...|....+.+..-+++..||...++.
T Consensus 199 SmG~~~~~~Di~~i~~~ae~i~~L~~~R~~l~~Yi~~~M~~ 239 (395)
T COG1498 199 SMGADLSEEDIDNIRELAEIILELYELREQLEEYIESKMSE 239 (395)
T ss_pred ccccCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35678999999999999999988888777889999888765
No 80
>PF07062 Clc-like: Clc-like; InterPro: IPR010761 Clc proteins are a nine-member gene family of chloride channels that have diverse roles in the plasma membrane and in intracellular organelles, especially membrane excitability and the maintenance of osmotic balance [, ]. This family contains a number of Clc-like proteins that are approximately 250 residues long and their homologues. ; GO: 0016021 integral to membrane
Probab=22.01 E-value=75 Score=23.40 Aligned_cols=19 Identities=26% Similarity=0.758 Sum_probs=15.7
Q ss_pred CCCceEEEEeCCceeeEEe
Q psy10106 62 YGPAWHCVVGQGFGSFVTH 80 (101)
Q Consensus 62 yg~~WhcIVG~~Fgs~vth 80 (101)
+-|.||++--++++.+..|
T Consensus 23 ~SPsWQvv~~~e~~~~h~h 41 (211)
T PF07062_consen 23 FSPSWQVVNIREFRAYHQH 41 (211)
T ss_pred cCCceEEEEecccCcEEec
Confidence 5699999999999988443
No 81
>PF04622 ERG2_Sigma1R: ERG2 and Sigma1 receptor like protein; InterPro: IPR006716 This family consists of the fungal C-8 sterol isomerase and mammalian sigma1 receptor. C-8 sterol isomerase (delta-8--delta-7 sterol isomerase), catalyses a reaction in ergosterol biosynthesis, which results in unsaturation at C-7 in the B ring of sterols []. Sigma 1 receptor is a low molecular mass mammalian protein located in the endoplasmic reticulum [], which interacts with endogenous steroid hormones, such as progesterone and testosterone []. It also binds the sigma ligands, which are a set of chemically unrelated drugs including haloperidol, pentazocine, and ditolylguanidine []. Sigma1 effectors are not well understood, but sigma1 agonists have been observed to affect NMDA receptor function, the alpha-adrenergic system and opioid analgesia.; GO: 0000247 C-8 sterol isomerase activity, 0006696 ergosterol biosynthetic process, 0005783 endoplasmic reticulum
Probab=21.83 E-value=1.6e+02 Score=21.82 Aligned_cols=34 Identities=24% Similarity=0.302 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHhhCCCh-HHHHHHHHHhccccCCC
Q psy10106 31 QEVITVAKAAYERCNNF-ADLAAYIKKEFENKYGP 64 (101)
Q Consensus 31 ~~~~~~a~~a~~~~~~~-kdiA~~IK~~lD~~yg~ 64 (101)
+.+.++|++++.+++++ .++-+.|-+.|-+.|++
T Consensus 34 ~~l~~ia~~~ia~~~~~~~~~~~~l~~~L~~~y~~ 68 (216)
T PF04622_consen 34 KVLHEIAKKAIARHPNDTEEILSDLVDELRKKYPD 68 (216)
T ss_pred HHHHHHHHHHHhhcCCCHHHHHHHHHHHHHhHCCC
Confidence 34556677777777553 66778888888888886
No 82
>PF02806 Alpha-amylase_C: Alpha amylase, C-terminal all-beta domain; InterPro: IPR006048 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Alpha-amylase is classified as family 13 of the glycosyl hydrolases and is present in archaea, bacteria, plants and animals. Alpha-amylase is an essential enzyme in alpha-glucan metabolism, acting to catalyse the hydrolysis of alpha-1,4-glucosidic bonds of glycogen, starch and related polysaccharides. Although all alpha-amylases possess the same catalytic function, they can vary with respect to sequence. In general, they are composed of three domains: a TIM barrel containing the active site residues and chloride ion-binding site (domain A), a long loop region inserted between the third beta strand and the alpha-helix of domain A that contains calcium-binding site(s) (domain B), and a C-terminal beta-sheet domain that appears to show some variability in sequence and length between amylases (domain C) []. Amylases have at least one conserved calcium-binding site, as calcium is essential for the stability of the enzyme. The chloride-binding functions to activate the enzyme, which acts by a two-step mechanism involving a catalytic nucleophile base (usually an Asp) and a catalytic proton donor (usually a Glu) that are responsible for the formation of the beta-linked glycosyl-enzyme intermediate. This entry represents the all-beta domain that is found in several alpha-amylases, usually at the C terminus, and which forms a Greek key beta-barrel fold in these enzymes []. More information about this protein can be found at Protein of the Month: alpha-Amylase [].; GO: 0003824 catalytic activity, 0043169 cation binding, 0005975 carbohydrate metabolic process; PDB: 1TCM_A 1CXL_A 1PJ9_A 1OT2_A 2DIJ_A 1CGV_A 1CXK_A 1PEZ_A 1CGX_A 2CXG_A ....
Probab=21.79 E-value=1e+02 Score=18.71 Aligned_cols=38 Identities=18% Similarity=0.229 Sum_probs=23.6
Q ss_pred CCceEEEEeC---Ccee-------eEEecCCcEEEEEeCCEEEEEEec
Q psy10106 63 GPAWHCVVGQ---GFGS-------FVTHDRSSFIYLYIGMDGILLFRH 100 (101)
Q Consensus 63 g~~WhcIVG~---~Fgs-------~vth~~~~~i~F~~~~~~illfKt 100 (101)
++.|++++.. .||. .++...+.-+.+.+..++.+|||-
T Consensus 46 ~g~y~~vlnsd~~~~~g~~~~~~~~v~~~~~g~~~~~lp~~s~~vl~~ 93 (95)
T PF02806_consen 46 AGRYKEVLNSDDEEYGGSGKGNSGEVTVDSNGRITVTLPPYSALVLKL 93 (95)
T ss_dssp SEEEEETTTTTCEEEEESSCSETSEEEEETTSEEEEEESTTEEEEEEE
T ss_pred cceeeEEeCCCccEECCcccccCceEEEeeCCEEEEEECCCEEEEEEE
Confidence 4566666555 3443 233323334889999999999873
No 83
>PF08908 DUF1852: Domain of unknown function (DUF1852); InterPro: IPR015004 This group of proteins are functionally uncharacterised.
Probab=21.39 E-value=51 Score=25.58 Aligned_cols=26 Identities=27% Similarity=0.480 Sum_probs=21.0
Q ss_pred HH-HhccccCCCceEEEEeCCceeeEE
Q psy10106 54 IK-KEFENKYGPAWHCVVGQGFGSFVT 79 (101)
Q Consensus 54 IK-~~lD~~yg~~WhcIVG~~Fgs~vt 79 (101)
|| .-+|++-|..-.=|||.+|.|+|.
T Consensus 94 Lkt~IvD~kt~~rieGivGNnFSSYVR 120 (322)
T PF08908_consen 94 LKTNIVDHKTNERIEGIVGNNFSSYVR 120 (322)
T ss_pred eeccEEecCCCceecceeccccccccc
Confidence 44 346888888888899999999985
No 84
>PF14326 DUF4384: Domain of unknown function (DUF4384)
Probab=21.15 E-value=2e+02 Score=17.34 Aligned_cols=30 Identities=37% Similarity=0.589 Sum_probs=22.8
Q ss_pred EEeCCceeeEEecCCcEEE-EEeC--CEEEEEE
Q psy10106 69 VVGQGFGSFVTHDRSSFIY-LYIG--MDGILLF 98 (101)
Q Consensus 69 IVG~~Fgs~vth~~~~~i~-F~~~--~~~illf 98 (101)
-+|+.+-..++.....|++ |.++ +...+||
T Consensus 4 ~~Ge~v~~~~~~~~~~Yl~l~~~~~~G~v~~L~ 36 (83)
T PF14326_consen 4 RVGERVRFRVTSNRDGYLYLFYIDADGKVTLLF 36 (83)
T ss_pred cCCCEEEEEEEeCCCeEEEEEEECCCCCEEEEe
Confidence 4678888888888999988 6664 6666666
No 85
>cd04516 TBP_eukaryotes eukaryotic TATA box binding protein (TBP): Present in archaea and eukaryotes, TBPs are transcription factors that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA.
Probab=21.14 E-value=1.4e+02 Score=21.19 Aligned_cols=26 Identities=15% Similarity=0.235 Sum_probs=19.7
Q ss_pred eeEEecCCcE--EEEEeC--CEEEEEEecC
Q psy10106 76 SFVTHDRSSF--IYLYIG--MDGILLFRHS 101 (101)
Q Consensus 76 s~vth~~~~~--i~F~~~--~~~illfKt~ 101 (101)
-.++||++.| +-|++. ...+|||.||
T Consensus 118 ~~~~YePE~fPgliyr~~~pk~~~liF~sG 147 (174)
T cd04516 118 QFSSYEPELFPGLIYRMVKPKIVLLIFVSG 147 (174)
T ss_pred hccEeCCccCceEEEEecCCcEEEEEeCCC
Confidence 3578999876 556664 7789999987
No 86
>PRK13888 conjugal transfer protein TrbN; Provisional
Probab=20.96 E-value=29 Score=25.59 Aligned_cols=19 Identities=21% Similarity=0.370 Sum_probs=16.5
Q ss_pred HHHHHHhccccCCCceEEE
Q psy10106 51 AAYIKKEFENKYGPAWHCV 69 (101)
Q Consensus 51 A~~IK~~lD~~yg~~WhcI 69 (101)
|-.|+..|+++||-+|.+|
T Consensus 91 AWILr~~i~~~~G~~W~AV 109 (206)
T PRK13888 91 AWRLRMHIRNDKGDLWTKA 109 (206)
T ss_pred HHHHHHHHHHhhCchHHHH
Confidence 5678899998899999986
No 87
>PF15571 Imm25: Immunity protein 25
Probab=20.67 E-value=60 Score=22.15 Aligned_cols=22 Identities=23% Similarity=0.332 Sum_probs=18.2
Q ss_pred HHHHHHHHHhccccCCCceEEE
Q psy10106 48 ADLAAYIKKEFENKYGPAWHCV 69 (101)
Q Consensus 48 kdiA~~IK~~lD~~yg~~WhcI 69 (101)
.++...||+.+|..||..|+-|
T Consensus 18 r~~r~~Ik~~~~~~~g~~~~~I 39 (124)
T PF15571_consen 18 REIRNEIKELNDNLYGIEIESI 39 (124)
T ss_pred HHHHHHHHHHHccccccchhhh
Confidence 4678888999999999888654
No 88
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=20.58 E-value=1.2e+02 Score=23.65 Aligned_cols=23 Identities=4% Similarity=0.127 Sum_probs=17.7
Q ss_pred CCCCHHHHHHHHHHHHHHHhhCC
Q psy10106 23 TDMPNDMQQEVITVAKAAYERCN 45 (101)
Q Consensus 23 sdM~~~mq~~~~~~a~~a~~~~~ 45 (101)
-|=++||++-+-+.=.||+++|.
T Consensus 54 Y~nDpEmK~iid~~n~eaikkyq 76 (295)
T TIGR01478 54 YHNDPELKEIIDKLNEEAIKKYQ 76 (295)
T ss_pred CCCcHHHHHHHHHHhHHHhhhhh
Confidence 34478888888787888998885
No 89
>PF13565 HTH_32: Homeodomain-like domain
Probab=20.24 E-value=1.9e+02 Score=16.62 Aligned_cols=35 Identities=14% Similarity=0.258 Sum_probs=22.7
Q ss_pred CHHHHHHHHHHHHHHHhhCCChHHHHHHHHHhccccC
Q psy10106 26 PNDMQQEVITVAKAAYERCNNFADLAAYIKKEFENKY 62 (101)
Q Consensus 26 ~~~mq~~~~~~a~~a~~~~~~~kdiA~~IK~~lD~~y 62 (101)
++++++.|++++.+-- .-+-.+||..|.+.+...+
T Consensus 32 ~~e~~~~i~~~~~~~p--~wt~~~i~~~L~~~~g~~~ 66 (77)
T PF13565_consen 32 DPEQRERIIALIEEHP--RWTPREIAEYLEEEFGISV 66 (77)
T ss_pred cHHHHHHHHHHHHhCC--CCCHHHHHHHHHHHhCCCC
Confidence 6667666666554321 2245789999999887654
Done!