Query psy4115
Match_columns 149
No_of_seqs 107 out of 164
Neff 5.6
Searched_HMMs 46136
Date Fri Aug 16 17:22:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4115.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4115hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF10256 Erf4: Golgin subfamil 100.0 8.1E-43 1.7E-47 256.8 11.9 115 24-138 1-118 (118)
2 KOG4069|consensus 100.0 4.7E-42 1E-46 258.5 13.1 125 22-146 26-150 (154)
3 KOG4101|consensus 99.9 2.5E-22 5.4E-27 154.1 4.8 114 34-147 52-172 (175)
4 PF15496 DUF4646: Domain of un 97.5 0.00018 3.8E-09 53.9 4.3 91 32-122 26-123 (123)
5 KOG1792|consensus 66.4 34 0.00074 28.3 7.4 68 49-116 115-206 (230)
6 PF00036 EF-hand_1: EF hand; 51.3 7.8 0.00017 21.6 0.8 16 48-63 14-29 (29)
7 PF06305 DUF1049: Protein of u 50.7 56 0.0012 20.9 5.0 40 77-116 24-63 (68)
8 PF10607 CLTH: CTLH/CRA C-term 45.6 46 0.00099 24.2 4.4 45 44-88 95-139 (145)
9 smart00555 GIT Helical motif i 45.0 33 0.00071 19.7 2.8 22 50-71 7-28 (31)
10 PF06122 TraH: Conjugative rel 41.1 66 0.0014 27.9 5.4 51 50-117 67-118 (361)
11 KOG3066|consensus 39.5 34 0.00074 28.7 3.2 87 28-115 106-208 (271)
12 COG0166 Pgi Glucose-6-phosphat 39.1 1.2E+02 0.0025 27.5 6.8 102 26-130 192-299 (446)
13 PF10166 DUF2368: Uncharacteri 37.7 69 0.0015 24.3 4.4 38 87-124 87-124 (131)
14 PF09373 PMBR: Pseudomurein-bi 37.2 50 0.0011 18.9 2.8 25 48-72 2-26 (33)
15 cd00299 GST_C_family Glutathio 36.3 1.2E+02 0.0025 19.4 4.9 59 53-111 38-98 (100)
16 PF13202 EF-hand_5: EF hand; P 35.1 20 0.00043 19.1 0.8 13 48-60 13-25 (25)
17 COG4956 Integral membrane prot 33.3 43 0.00094 29.4 2.9 47 77-123 40-88 (356)
18 PF00674 DUP: DUP family; Int 32.7 62 0.0013 23.1 3.3 35 48-82 54-89 (108)
19 PF13132 DUF3950: Domain of un 32.6 65 0.0014 18.6 2.6 23 60-82 2-25 (30)
20 cd03205 GST_C_6 GST_C family, 32.1 1.6E+02 0.0034 19.7 6.0 59 52-111 38-96 (98)
21 PF10015 DUF2258: Uncharacteri 32.0 70 0.0015 22.2 3.3 26 46-71 23-48 (75)
22 PF06889 DUF1266: Protein of u 31.8 1.6E+02 0.0035 21.7 5.6 35 47-82 97-131 (172)
23 TIGR01639 P_fal_TIGR01639 Plas 31.6 51 0.0011 21.5 2.5 26 46-71 5-30 (61)
24 PF08060 NOSIC: NOSIC (NUC001) 31.3 33 0.00072 21.7 1.5 24 39-63 27-51 (53)
25 smart00757 CRA CT11-RanBPM. pr 31.1 1.6E+02 0.0035 19.7 5.1 41 44-84 43-84 (99)
26 PF14425 Imm3: Immunity protei 30.7 52 0.0011 24.6 2.7 29 44-72 80-108 (117)
27 cd03203 GST_C_Lambda GST_C fam 30.6 1.8E+02 0.0039 20.4 5.5 73 42-115 22-101 (120)
28 PF05903 Peptidase_C97: PPPDE 29.6 56 0.0012 24.8 2.8 40 50-89 81-124 (151)
29 COG1422 Predicted membrane pro 29.3 2.8E+02 0.0062 22.7 6.9 68 48-115 18-86 (201)
30 cd03187 GST_C_Phi GST_C family 29.2 1.8E+02 0.0039 19.5 5.2 62 53-115 47-110 (118)
31 COG2306 Predicted RNA-binding 28.8 1.4E+02 0.0031 24.0 5.0 42 48-89 134-177 (183)
32 cd03179 GST_C_1 GST_C family, 28.7 1.2E+02 0.0025 20.1 4.0 59 52-113 44-104 (105)
33 PF13405 EF-hand_6: EF-hand do 28.3 36 0.00079 18.5 1.2 15 48-62 14-28 (31)
34 COG5415 Predicted integral mem 27.9 1E+02 0.0023 25.7 4.2 34 82-115 77-110 (251)
35 PF05164 ZapA: Cell division p 27.8 1.9E+02 0.004 19.2 8.2 68 49-116 16-87 (89)
36 PF06831 H2TH: Formamidopyrimi 27.5 60 0.0013 22.7 2.4 34 42-75 53-86 (92)
37 PF13410 GST_C_2: Glutathione 27.1 1.6E+02 0.0034 18.2 5.1 59 53-111 8-68 (69)
38 PF09851 SHOCT: Short C-termin 26.9 69 0.0015 18.0 2.2 16 48-63 14-29 (31)
39 PF07108 PipA: PipA protein; 26.7 1.4E+02 0.003 24.0 4.5 48 50-111 53-101 (200)
40 COG3462 Predicted membrane pro 26.7 54 0.0012 24.5 2.1 16 48-63 100-115 (117)
41 PRK15022 ferritin-like protein 26.6 1.8E+02 0.0038 22.8 5.2 58 55-116 6-64 (167)
42 KOG3431|consensus 26.4 58 0.0013 24.8 2.3 22 46-67 84-105 (129)
43 COG2143 Thioredoxin-related pr 26.2 76 0.0016 25.4 3.0 27 46-72 131-157 (182)
44 PF06957 COPI_C: Coatomer (COP 26.2 4.1E+02 0.0089 24.0 8.0 76 23-111 177-253 (422)
45 PRK06474 hypothetical protein; 26.0 70 0.0015 24.9 2.8 23 50-72 135-157 (178)
46 PF09693 Phage_XkdX: Phage unc 25.9 39 0.00085 20.3 1.1 14 48-61 24-37 (40)
47 PF06304 DUF1048: Protein of u 25.5 2.4E+02 0.0052 20.4 5.4 37 53-89 21-57 (103)
48 PF14752 RBP_receptor: Retinol 25.3 2E+02 0.0044 27.2 6.1 49 56-107 304-352 (617)
49 TIGR03042 PS_II_psbQ_bact phot 25.1 1.5E+02 0.0033 22.8 4.4 38 76-113 2-45 (142)
50 smart00054 EFh EF-hand, calciu 24.9 47 0.001 15.8 1.1 15 48-62 14-28 (29)
51 PHA02650 hypothetical protein; 23.8 1.9E+02 0.0041 20.4 4.3 19 52-70 16-34 (81)
52 PF11922 DUF3440: Domain of un 23.7 1.7E+02 0.0037 23.5 4.6 77 50-135 26-128 (181)
53 PF06720 Phi-29_GP16_7: Bacter 23.7 26 0.00057 26.4 0.0 16 102-117 28-43 (130)
54 PF12409 P5-ATPase: P5-type AT 23.6 87 0.0019 22.7 2.8 23 74-96 14-36 (119)
55 PF08388 GIIM: Group II intron 23.1 2.2E+02 0.0048 18.4 6.2 37 74-111 20-56 (80)
56 KOG0164|consensus 22.3 14 0.00031 35.9 -2.0 75 51-127 346-430 (1001)
57 TIGR00691 spoT_relA (p)ppGpp s 22.0 3.7E+02 0.0081 25.6 7.2 23 50-72 164-186 (683)
58 KOG3096|consensus 21.9 3.8E+02 0.0082 22.2 6.3 47 59-115 112-159 (225)
59 PF09457 RBD-FIP: FIP domain ; 21.9 95 0.0021 19.6 2.3 15 101-115 21-35 (48)
60 TIGR02603 CxxCH_TIGR02603 puta 21.4 91 0.002 22.9 2.5 23 41-63 108-130 (133)
61 PF12193 Sulf_coat_C: Sulfolob 21.4 2.3E+02 0.005 18.9 4.1 35 53-88 35-69 (69)
62 KOG3014|consensus 21.3 1.4E+02 0.003 25.3 3.8 38 101-138 101-138 (257)
63 COG4064 MtrG Tetrahydromethano 21.0 34 0.00074 23.6 0.1 21 100-120 21-41 (75)
64 TIGR01669 phage_XkdX phage unc 20.5 70 0.0015 19.8 1.5 13 49-61 30-42 (45)
65 PF03304 Mlp: Mlp lipoprotein 20.3 1.4E+02 0.0031 23.3 3.4 60 55-118 78-140 (152)
66 PRK01026 tetrahydromethanopter 20.2 1.1E+02 0.0023 21.4 2.5 24 50-73 11-34 (77)
No 1
>PF10256 Erf4: Golgin subfamily A member 7/ERF4 family; InterPro: IPR019383 Proteins in this entry include Golgin subfamily A member 7 and the Ras modification protein ERF4.
Probab=100.00 E-value=8.1e-43 Score=256.84 Aligned_cols=115 Identities=32% Similarity=0.620 Sum_probs=112.5
Q ss_pred EEEEccCC--CCCcccccccCCchhhcCCCchHHHHHHHHHHHHHHHHhhcc-chhhHHHHHHHHHHHHHHHhhhcchhh
Q psy4115 24 KIFIQRDY--TDGTMVRFQTHFPQELEGKIERQVYEYTINQMNVYFQEAESV-SCKSYCEGCMACLTAYFVYLCAETHYE 100 (149)
Q Consensus 24 ~i~I~RDy--s~g~~~qF~t~fP~eL~g~Is~eeF~~iI~~IN~~L~~A~~~-s~~n~~e~~L~clT~~l~~~~~~t~~k 100 (149)
+||||||| ++|+.|+|++.||.+|+|+|+++||+++|++||.+|++|+.+ +++++++++++|||+|+|.+++.+||+
T Consensus 1 ivri~Rdy~~~~~~~~~Fs~~~P~~L~~~is~~ef~~iI~~IN~~l~~a~~~~~~~~~~~~~l~~lt~~l~~~~~~~~~~ 80 (118)
T PF10256_consen 1 IVRIPRDYDTSSGIFPQFSTEYPGELSGYISPEEFEEIINTINQILKEAFEPISWRNIIENILGCLTLGLSSLCFKTHYK 80 (118)
T ss_pred CEEEeeecCCCCCcCCccCccCCHhhcCCCCHHHHHHHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 69999999 789999999999999999999999999999999999999875 999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHhhcCcCCCceeeCccccceEEEEE
Q psy4115 101 KCLKKVAKFIAEQNENVFLPRALMLVNPAERGLRVIEI 138 (149)
Q Consensus 101 r~l~~le~yIe~~N~~~~~~~Gl~ii~Pr~~G~lslei 138 (149)
|.+++||+||+++|+++|+++||+|++||++|||+|||
T Consensus 81 ~~~~~le~~l~~~N~~~~~~~gi~ii~pr~~g~lsv~~ 118 (118)
T PF10256_consen 81 RKLRELEKYLEQLNEELFKPRGIKIISPRRSGYLSVDF 118 (118)
T ss_pred HHHHHHHHHHHHHHHHhcccCCcEEEchhHceEEEEeC
Confidence 99999999999999999999999999999999999996
No 2
>KOG4069|consensus
Probab=100.00 E-value=4.7e-42 Score=258.54 Aligned_cols=125 Identities=48% Similarity=0.838 Sum_probs=122.4
Q ss_pred eeEEEEccCCCCCcccccccCCchhhcCCCchHHHHHHHHHHHHHHHHhhccchhhHHHHHHHHHHHHHHHhhhcchhhH
Q psy4115 22 CAKIFIQRDYTDGTMVRFQTHFPQELEGKIERQVYEYTINQMNVYFQEAESVSCKSYCEGCMACLTAYFVYLCAETHYEK 101 (149)
Q Consensus 22 ~~~i~I~RDys~g~~~qF~t~fP~eL~g~Is~eeF~~iI~~IN~~L~~A~~~s~~n~~e~~L~clT~~l~~~~~~t~~kr 101 (149)
..+||||||||.|+.+||.+.||.+|+++|+++.|+++|.+||.++++|+..+.++++|+||||+|+|+++.|.+|||+|
T Consensus 26 ~~qifiqrdys~g~~vqf~~~~pa~le~~i~R~vfE~Ti~rlN~~yAeAE~~~~qty~EgclgC~TaY~iy~ctethYek 105 (154)
T KOG4069|consen 26 NGQIFIQRDYSKGLDVQFEAEYPARLEEKIPRDVFENTIVRLNRIYAEAEAITPQTYFEGCLGCFTAYAIYACTETHYEK 105 (154)
T ss_pred cceEEEEecccCCcceEEeecCcHHHhccCcHHHHHHHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHH
Confidence 47899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHhhcCcCCCceeeCccccceEEEEEEEecCCCC
Q psy4115 102 CLKKVAKFIAEQNENVFLPRALMLVNPAERGLRVIEIAILDQPIV 146 (149)
Q Consensus 102 ~l~~le~yIe~~N~~~~~~~Gl~ii~Pr~~G~lslei~i~~~~~~ 146 (149)
+|++|.+||..+|+++|++.|+++++|+++|+++|+|.|+++++.
T Consensus 106 ~L~klskfl~~qNe~IY~~~Gl~l~dP~eRGLRVieiti~~~~g~ 150 (154)
T KOG4069|consen 106 KLDKLSKFLNRQNEEIYHHVGLHLRDPMERGLRVIEITIIVSFGS 150 (154)
T ss_pred HHHHHHHHHHhhhhhhccccceeecCchhhceEEEEEEEEecccc
Confidence 999999999999999999999999999999999999999998864
No 3
>KOG4101|consensus
Probab=99.86 E-value=2.5e-22 Score=154.09 Aligned_cols=114 Identities=21% Similarity=0.324 Sum_probs=102.2
Q ss_pred CcccccccCCchhhcCCCchHHHHHHHHHHHHHHHHhhccchhhHHHHHHHHHHHHHHHhhhcchh-hHHHHHHHHHHHH
Q psy4115 34 GTMVRFQTHFPQELEGKIERQVYEYTINQMNVYFQEAESVSCKSYCEGCMACLTAYFVYLCAETHY-EKCLKKVAKFIAE 112 (149)
Q Consensus 34 g~~~qF~t~fP~eL~g~Is~eeF~~iI~~IN~~L~~A~~~s~~n~~e~~L~clT~~l~~~~~~t~~-kr~l~~le~yIe~ 112 (149)
|+..+|.++||.+|.|+|+||||..+|.++|..|+++..++.+|+++||++|+|...+++|..-+. ||+...|||.+++
T Consensus 52 GlsnrFntEfP~~Ltg~vAPEEfkati~RvNsvLKk~lpvnvrWLlCGc~cCCCtlGCSmWPViCLnkRT~lsleKllew 131 (175)
T KOG4101|consen 52 GLSNRFNTEFPSELTGKVAPEEFKATIGRVNSVLKKHLPVNVRWLLCGCLCCCCTLGCSMWPVICLNKRTRLSLEKLLEW 131 (175)
T ss_pred eccccccccCChhhccccCHHHHHHHHHHHHHHHHhcCCchhHHHHhhHHHHhhccCccccceeeecHHHHHHHHHHHhh
Confidence 899999999999999999999999999999999999999999999999999999999999999988 8899999999999
Q ss_pred HHhhcCcCCCceeeCccc---c--ce-EEEEEEEecCCCCC
Q psy4115 113 QNENVFLPRALMLVNPAE---R--GL-RVIEIAILDQPIVP 147 (149)
Q Consensus 113 ~N~~~~~~~Gl~ii~Pr~---~--G~-lslei~i~~~~~~~ 147 (149)
+|.++||+.||||.--++ + -. .+|.|+.+.+.+.-
T Consensus 132 ENn~LYHKLgLHW~L~kr~ces~~mmEYVilieflPK~pi~ 172 (175)
T KOG4101|consen 132 ENNRLYHKLGLHWRLAKRKCESNNMMEYVILIEFLPKTPIF 172 (175)
T ss_pred hhhhHHHHHhhhhhhhhccccccchHhHhhheeecCCCcCC
Confidence 999999999999974332 1 11 45778877776543
No 4
>PF15496 DUF4646: Domain of unknown function (DUF4646)
Probab=97.45 E-value=0.00018 Score=53.94 Aligned_cols=91 Identities=20% Similarity=0.271 Sum_probs=57.8
Q ss_pred CCCcccccccCC-chhhcC-CCchHHHHHHHHHHHHH--HHHhhcc-chhhHHHHHHHH-HHHHHHHhhhcchh-hHHHH
Q psy4115 32 TDGTMVRFQTHF-PQELEG-KIERQVYEYTINQMNVY--FQEAESV-SCKSYCEGCMAC-LTAYFVYLCAETHY-EKCLK 104 (149)
Q Consensus 32 s~g~~~qF~t~f-P~eL~g-~Is~eeF~~iI~~IN~~--L~~A~~~-s~~n~~e~~L~c-lT~~l~~~~~~t~~-kr~l~ 104 (149)
++.+.-.|...| |..|.. -|++++|...++.|+.. |...... .+.--+...+++ +.+++...-....- ++.-.
T Consensus 26 s~~l~~gFp~~~~P~~l~~~DVs~eDW~~F~~dl~~aa~ls~~~~~~~~~~~~~~~v~~Gi~~~~v~~~~~~~~~~~k~~ 105 (123)
T PF15496_consen 26 SDSLSSGFPYLYPPPPLASHDVSEEDWTRFLNDLSEAASLSPSQSIVAGVGPIVMGVGFGIPAYLVAKAIRKAMKEKKRG 105 (123)
T ss_pred CCccccCCCCcCCCchhhhcCCCHHHHHHHHHHHHHHHhcCcccceeeeeccccccccccchhhhhhHhhhhcccccchH
Confidence 444555676655 778877 59999999999999877 3333321 111111111111 44555544444443 35566
Q ss_pred HHHHHHHHHHhhcCcCCC
Q psy4115 105 KVAKFIAEQNENVFLPRA 122 (149)
Q Consensus 105 ~le~yIe~~N~~~~~~~G 122 (149)
+|+..|+.||+.+|+++|
T Consensus 106 ~v~~~i~~WN~~FF~PRg 123 (123)
T PF15496_consen 106 EVESTIDQWNEGFFRPRG 123 (123)
T ss_pred HHHHHHHHHHHHhccCCC
Confidence 899999999999999987
No 5
>KOG1792|consensus
Probab=66.39 E-value=34 Score=28.28 Aligned_cols=68 Identities=10% Similarity=0.134 Sum_probs=49.5
Q ss_pred CCCchHHHHHHHH----HHHHHHHHhhcc-chhhHHHHHHHHHHHHHHHhhh-------------------cchhhHHHH
Q psy4115 49 GKIERQVYEYTIN----QMNVYFQEAESV-SCKSYCEGCMACLTAYFVYLCA-------------------ETHYEKCLK 104 (149)
Q Consensus 49 g~Is~eeF~~iI~----~IN~~L~~A~~~-s~~n~~e~~L~clT~~l~~~~~-------------------~t~~kr~l~ 104 (149)
=.|+++++.+..+ .+|..+.+..++ ..+++.+.....+-.|+.+.++ -..|+|--.
T Consensus 115 i~ipee~~~~~a~~~~~~in~~l~~l~~ia~~~d~~~~lk~~v~lw~lS~vGs~fn~lTll~ig~v~~~TvP~~YEky~d 194 (230)
T KOG1792|consen 115 ITIPEEFVLALASSLRVEINQALSELRDIALGRDLKDFLKVAVGLWILSYVGSLFNFLTLLYIGLVLLFTVPVLYEKYED 194 (230)
T ss_pred eecCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccchhHHhHH
Confidence 3688999988888 999999999885 6667777666666555554222 236788888
Q ss_pred HHHHHHHHHHhh
Q psy4115 105 KVAKFIAEQNEN 116 (149)
Q Consensus 105 ~le~yIe~~N~~ 116 (149)
+++.|++...++
T Consensus 195 ~ID~~~~~~~~~ 206 (230)
T KOG1792|consen 195 QIDPYLGKVMEE 206 (230)
T ss_pred HHhHHHHHHHHH
Confidence 888888876653
No 6
>PF00036 EF-hand_1: EF hand; InterPro: IPR018248 Many calcium-binding proteins belong to the same evolutionary family and share a type of calcium-binding domain known as the EF-hand. This type of domain consists of a twelve residue loop flanked on both sides by a twelve residue alpha-helical domain. In an EF-hand loop the calcium ion is coordinated in a pentagonal bipyramidal configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12; these residues are denoted by X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position 12 provides two oxygens for liganding Ca (bidentate ligand).; PDB: 1BJF_A 1XFW_R 1XFV_O 2K0J_A 2F3Z_A 3BYA_A 1XFU_Q 2R28_B 1ZOT_B 3G43_D ....
Probab=51.27 E-value=7.8 Score=21.60 Aligned_cols=16 Identities=19% Similarity=0.551 Sum_probs=13.4
Q ss_pred cCCCchHHHHHHHHHH
Q psy4115 48 EGKIERQVYEYTINQM 63 (149)
Q Consensus 48 ~g~Is~eeF~~iI~~I 63 (149)
+|+|+.+||..++.++
T Consensus 14 dG~I~~~Ef~~~~~~L 29 (29)
T PF00036_consen 14 DGKIDFEEFKEMMKKL 29 (29)
T ss_dssp SSEEEHHHHHHHHHHT
T ss_pred CCcCCHHHHHHHHHhC
Confidence 5889999999988763
No 7
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=50.72 E-value=56 Score=20.88 Aligned_cols=40 Identities=8% Similarity=0.166 Sum_probs=29.3
Q ss_pred hHHHHHHHHHHHHHHHhhhcchhhHHHHHHHHHHHHHHhh
Q psy4115 77 SYCEGCMACLTAYFVYLCAETHYEKCLKKVAKFIAEQNEN 116 (149)
Q Consensus 77 n~~e~~L~clT~~l~~~~~~t~~kr~l~~le~yIe~~N~~ 116 (149)
=++.-.+|++.+++..+...-..++..+++++=+++.-++
T Consensus 24 il~~f~~G~llg~l~~~~~~~~~r~~~~~~~k~l~~le~e 63 (68)
T PF06305_consen 24 ILIAFLLGALLGWLLSLPSRLRLRRRIRRLRKELKKLEKE 63 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455667888888888887777788888887777665544
No 8
>PF10607 CLTH: CTLH/CRA C-terminal to LisH motif domain; InterPro: IPR019589 This entry represents the CRA (or CT11-RanBPM) domain, which is a protein-protein interaction domain present in crown eukaryotes (plants, animals, fungi) and which is found in Ran-binding proteins such as Ran-binding protein 9 (RanBP9 or RanBPM) and RanBP10. RanBPM is a scaffolding protein important in regulating cellular function in both the immune system and the nervous system, and may act as an adapter protein to couple membrane receptors to intracellular signaling pathways. This domain is at the C terminus of the proteins and is the binding domain for the CRA motif, which is comprised of approximately 100 amino acids at the C-terminal of RanBPM. It was found to be important for the interaction of RanBPM with fragile X mental retardation protein (FMRP), but its functional significance has yet to be determined [].
Probab=45.61 E-value=46 Score=24.22 Aligned_cols=45 Identities=11% Similarity=0.034 Sum_probs=36.0
Q ss_pred chhhcCCCchHHHHHHHHHHHHHHHHhhccchhhHHHHHHHHHHH
Q psy4115 44 PQELEGKIERQVYEYTINQMNVYFQEAESVSCKSYCEGCMACLTA 88 (149)
Q Consensus 44 P~eL~g~Is~eeF~~iI~~IN~~L~~A~~~s~~n~~e~~L~clT~ 88 (149)
++.+...++++.++++.+.+|+.+-+.+..+..+-++.++.-.+.
T Consensus 95 ~s~~~~l~~~~~~~~la~~~~~~~l~~~~~~~~s~L~~~~~~g~~ 139 (145)
T PF10607_consen 95 PSPYKELLSPERREELAEEFNSAILKSYGLPKESPLEVILKAGLS 139 (145)
T ss_pred chHHHHHhChHHHHHHHHHHHHHHHHHhCcCCCCHHHHHHHHHHH
Confidence 345667789999999999999999999998777777777655443
No 9
>smart00555 GIT Helical motif in the GIT family of ADP-ribosylation factor GTPase-activating proteins. Helical motif in the GIT family of ADP-ribosylation factor GTPase-activating proteins, and in yeast Spa2p and Sph1p (CPP; unpublished results). In p95-APP1 the N-terminal GIT motif might be involved in binding PIX.
Probab=44.97 E-value=33 Score=19.72 Aligned_cols=22 Identities=9% Similarity=0.321 Sum_probs=18.7
Q ss_pred CCchHHHHHHHHHHHHHHHHhh
Q psy4115 50 KIERQVYEYTINQMNVYFQEAE 71 (149)
Q Consensus 50 ~Is~eeF~~iI~~IN~~L~~A~ 71 (149)
++++++|.+.+..||..+.+-+
T Consensus 7 ~L~~~~F~~L~~Dv~~El~RR~ 28 (31)
T smart00555 7 RLSDEQFQKLLTDLNDELKRRE 28 (31)
T ss_pred hcCHHHHHHHHHHHHHHHHHhh
Confidence 5889999999999999886543
No 10
>PF06122 TraH: Conjugative relaxosome accessory transposon protein; InterPro: IPR010927 Six Tra proteins encoded by the F plasmid and required by F(+) cells to elaborate F pili. The six proteins are TraH, TraF, TraW, TraU, TrbI, and TrbB. Except for TrbI, these proteins were all identified as hallmarks of F-like type IV secretion systems (TFSSs), with no homologues among TFSS genes of P-type or I-type systems. With the exception of TrbI, which is an inner membrane protein, the remaining proteins are or are predicted to be periplasmic. TrbI consists of one membrane-spanning segment near its N terminus and an 88-residue, hydrophilic domain that extends into the periplasm []. It has been proposed that the TraH interaction group is to control F-pilus extension and retraction during conjugation [, , ].
Probab=41.15 E-value=66 Score=27.87 Aligned_cols=51 Identities=14% Similarity=0.173 Sum_probs=41.3
Q ss_pred CCchHHHHHHHHHH-HHHHHHhhccchhhHHHHHHHHHHHHHHHhhhcchhhHHHHHHHHHHHHHHhhc
Q psy4115 50 KIERQVYEYTINQM-NVYFQEAESVSCKSYCEGCMACLTAYFVYLCAETHYEKCLKKVAKFIAEQNENV 117 (149)
Q Consensus 50 ~Is~eeF~~iI~~I-N~~L~~A~~~s~~n~~e~~L~clT~~l~~~~~~t~~kr~l~~le~yIe~~N~~~ 117 (149)
.|+.+||-+....| +....=||..-..+++..| +..+.+|++.++++|+-.
T Consensus 67 fIn~dqlVq~lr~Ia~nA~gyAF~LAL~t~~p~~-----------------~~~~~~lq~~~~~lN~~~ 118 (361)
T PF06122_consen 67 FINSDQLVQMLRNIASNAPGYAFQLALQTLCPQC-----------------GNIMDKLQKIAQALNQMN 118 (361)
T ss_pred cCCHHHHHHHHHHHHHhhHHHHHHHHHHHhCHHH-----------------HHHHHHHHHHHHHHHhhh
Confidence 68999999999888 6666777776666766666 889999999999999843
No 11
>KOG3066|consensus
Probab=39.54 E-value=34 Score=28.73 Aligned_cols=87 Identities=11% Similarity=0.253 Sum_probs=62.4
Q ss_pred ccCCCCCcccccccCCchhhcCCCchHHHHHHH--------HHHHHHHHH---hhccchhhHHHHHHHH--HHHHHHHhh
Q psy4115 28 QRDYTDGTMVRFQTHFPQELEGKIERQVYEYTI--------NQMNVYFQE---AESVSCKSYCEGCMAC--LTAYFVYLC 94 (149)
Q Consensus 28 ~RDys~g~~~qF~t~fP~eL~g~Is~eeF~~iI--------~~IN~~L~~---A~~~s~~n~~e~~L~c--lT~~l~~~~ 94 (149)
-|.-+|-.+-||+.-+-..|-.|+..-.|...+ +.||.-|.- .+..+ -|++|-+||+ |||=+..+|
T Consensus 106 ~~EL~G~d~~kf~rA~t~GlQEYVEAvtF~~f~lsgtLc~~dein~~lvpl~~~~rl~-in~iDYvLGvaDlTGElMRm~ 184 (271)
T KOG3066|consen 106 KRELAGLDADKFSRACTHGLQEYVEAVTFKFFLLSGTLCQTDEINSCLVPLDSSFRLS-INFIDYVLGVADLTGELMRML 184 (271)
T ss_pred HHHhcCCcHHHHHHhhcccHHHHHHHHHHHHHHHhccccchhhhhheecccCCcccee-eeHHHHHHHHhhhHHHHHHHH
Confidence 344455567788887877888889888898877 889998843 33322 2889988887 688899999
Q ss_pred hcchhh---HHHHHHHHHHHHHHh
Q psy4115 95 AETHYE---KCLKKVAKFIAEQNE 115 (149)
Q Consensus 95 ~~t~~k---r~l~~le~yIe~~N~ 115 (149)
....-+ ..+.++-+|+.++-+
T Consensus 185 I~~~s~g~I~~~~~~~qFlRq~h~ 208 (271)
T KOG3066|consen 185 ITNGSKGSIQQLTQQVQFLRQLHK 208 (271)
T ss_pred HhcCcCcchhhHHHHHHHHHHHHh
Confidence 877654 355566677766554
No 12
>COG0166 Pgi Glucose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=39.07 E-value=1.2e+02 Score=27.54 Aligned_cols=102 Identities=9% Similarity=0.052 Sum_probs=63.9
Q ss_pred EEccCCCCCcccccccCCchh-hcCCCchHHHHHHHHHHHHHHHHhhccchhhHHHHHHHHHHHHHHHhh-----hcchh
Q psy4115 26 FIQRDYTDGTMVRFQTHFPQE-LEGKIERQVYEYTINQMNVYFQEAESVSCKSYCEGCMACLTAYFVYLC-----AETHY 99 (149)
Q Consensus 26 ~I~RDys~g~~~qF~t~fP~e-L~g~Is~eeF~~iI~~IN~~L~~A~~~s~~n~~e~~L~clT~~l~~~~-----~~t~~ 99 (149)
|.==||=+| ||+...+-. |.-.+....|.+.+.--...-+.......+.=..-.++++.-|..... ....|
T Consensus 192 f~ipd~VGG---RfS~~SaVG~l~~a~~~~~~~~lL~Ga~~~d~~~~~~~l~~N~~~l~ali~~~~~~~~G~~~~~i~~Y 268 (446)
T COG0166 192 FEIPDWVGG---RYSVLSAVGLLPLALGGIDFKELLEGAAAADEHFRTTPLEENLAVLYALIGIWYYNFKGAEIEVILPY 268 (446)
T ss_pred EECCCCCCC---ccchhHHHHHHHHHHhcccHHHHHHHHHHHHHHhccCChhhcHHHHHHHHHHHHhcccCCCceEEEec
Confidence 333355555 998777766 555788889998888877665555554444444444555544444322 22467
Q ss_pred hHHHHHHHHHHHHHHhhcCcCCCceeeCccc
Q psy4115 100 EKCLKKVAKFIAEQNENVFLPRALMLVNPAE 130 (149)
Q Consensus 100 kr~l~~le~yIe~~N~~~~~~~Gl~ii~Pr~ 130 (149)
...|+.+.+|.+|.|-+--.|..-.+-+|..
T Consensus 269 ~~~l~~f~~~~qQL~~ES~GK~~~~~~~~~~ 299 (446)
T COG0166 269 DPYLEYFPAWLQQLNMESNGKSVKGIGPEVN 299 (446)
T ss_pred cHHHHHHHHHHHHHHHhccCCCccCcCCccc
Confidence 8899999999999997543444334444433
No 13
>PF10166 DUF2368: Uncharacterised conserved protein (DUF2368); InterPro: IPR019319 This family is conserved from nematodes to humans. The function is not known.
Probab=37.67 E-value=69 Score=24.30 Aligned_cols=38 Identities=16% Similarity=0.253 Sum_probs=26.5
Q ss_pred HHHHHHhhhcchhhHHHHHHHHHHHHHHhhcCcCCCce
Q psy4115 87 TAYFVYLCAETHYEKCLKKVAKFIAEQNENVFLPRALM 124 (149)
Q Consensus 87 T~~l~~~~~~t~~kr~l~~le~yIe~~N~~~~~~~Gl~ 124 (149)
++|=++++.++..+|-..+.|+.+++...-+.-|.|+-
T Consensus 87 ~~Yq~D~ayG~kl~RIr~eAE~Il~~e~~ll~lP~G~~ 124 (131)
T PF10166_consen 87 LGYQYDMAYGTKLQRIREEAERILEEEPDLLELPGGLP 124 (131)
T ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHhCHHHhcCCCCCC
Confidence 33444566667777778888998888777666677763
No 14
>PF09373 PMBR: Pseudomurein-binding repeat; InterPro: IPR018975 Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum) is a methanogenic Gram-positive microorganism with a cell wall consisting of pseudomurein. This repeat specifically binds to pseudomurein. This repeat is found at the N terminus of PeiW and PeiP which are pseudomurein binding phage proteins.
Probab=37.16 E-value=50 Score=18.86 Aligned_cols=25 Identities=16% Similarity=0.412 Sum_probs=20.8
Q ss_pred cCCCchHHHHHHHHHHHHHHHHhhc
Q psy4115 48 EGKIERQVYEYTINQMNVYFQEAES 72 (149)
Q Consensus 48 ~g~Is~eeF~~iI~~IN~~L~~A~~ 72 (149)
.+.|+.+|+.+...+++.+..+--.
T Consensus 2 ~~~i~~~~~~d~a~rv~~f~~~ngR 26 (33)
T PF09373_consen 2 SGTISKEEYLDMASRVNNFYESNGR 26 (33)
T ss_pred CceecHHHHHHHHHHHHHHHHHcCC
Confidence 3679999999999999998766544
No 15
>cd00299 GST_C_family Glutathione S-transferase (GST) family, C-terminal alpha helical domain; a large, diverse group of cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. In addition, GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. This family, also referred to as soluble GSTs, is the largest family of GSH transferases and is only distantly related to the mitochondrial GSTs (GSTK). Soluble GSTs bear no structural similarity to microsomal GSTs (MAPEG family) and display additional activities unique to their group, such as catalyzing thiolysis, reduction and isomerization of certain compounds. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an activ
Probab=36.33 E-value=1.2e+02 Score=19.43 Aligned_cols=59 Identities=10% Similarity=0.035 Sum_probs=35.4
Q ss_pred hHHHHHHHHHHHHHHHHhhcc--chhhHHHHHHHHHHHHHHHhhhcchhhHHHHHHHHHHH
Q psy4115 53 RQVYEYTINQMNVYFQEAESV--SCKSYCEGCMACLTAYFVYLCAETHYEKCLKKVAKFIA 111 (149)
Q Consensus 53 ~eeF~~iI~~IN~~L~~A~~~--s~~n~~e~~L~clT~~l~~~~~~t~~kr~l~~le~yIe 111 (149)
.+++++.+..||..|.+.... .--++.|-+++++..++.............-.+.+|.+
T Consensus 38 ~~~~~~~~~~l~~~L~~~~~~~g~~~t~aDi~~~~~l~~~~~~~~~~~~~~~~p~l~~~~~ 98 (100)
T cd00299 38 REELAAALAALEKLLAGRPYLAGDRFSLADIALAPVLARLDLLGPLLGLLDEYPRLAAWYD 98 (100)
T ss_pred HHHHHHHHHHHHHHHccCCCCCCCCcCHHHHHHHHHHHHHHHhhhhhhhhccCccHHHHHH
Confidence 457888999999999765432 22478888888777666543222111233444555554
No 16
>PF13202 EF-hand_5: EF hand; PDB: 3DD4_A 2Q4U_A 2BE4_A 1UHJ_B 1UHI_A 1UHH_B 1EJ3_B 1UHK_A 2ZFD_A 1UHN_A ....
Probab=35.07 E-value=20 Score=19.13 Aligned_cols=13 Identities=23% Similarity=0.516 Sum_probs=10.4
Q ss_pred cCCCchHHHHHHH
Q psy4115 48 EGKIERQVYEYTI 60 (149)
Q Consensus 48 ~g~Is~eeF~~iI 60 (149)
+|.|+.+||...+
T Consensus 13 DG~is~~E~~~~~ 25 (25)
T PF13202_consen 13 DGKISFEEFQRLV 25 (25)
T ss_dssp SSEEEHHHHHHHH
T ss_pred CCcCCHHHHHHHC
Confidence 5789999998753
No 17
>COG4956 Integral membrane protein (PIN domain superfamily) [General function prediction only]
Probab=33.26 E-value=43 Score=29.43 Aligned_cols=47 Identities=13% Similarity=0.219 Sum_probs=39.7
Q ss_pred hHHHHHHHHHHHHHHHhhhcchhhHHHHHHHHHHHHHHhh--cCcCCCc
Q psy4115 77 SYCEGCMACLTAYFVYLCAETHYEKCLKKVAKFIAEQNEN--VFLPRAL 123 (149)
Q Consensus 77 n~~e~~L~clT~~l~~~~~~t~~kr~l~~le~yIe~~N~~--~~~~~Gl 123 (149)
.+++++++.+-+++.+.|+.....+.++++|.+|.+.+-+ +|...|+
T Consensus 40 ~~v~~ligai~~~li~~~~~~~~~~~~~~le~~i~k~~~~~ilf~tiGL 88 (356)
T COG4956 40 EYVDALIGAIIFFLISFWFGKYVLNWLKRLEEQIRKLPVTTILFGTIGL 88 (356)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHHH
Confidence 5778899999999999999988899999999999998853 6665555
No 18
>PF00674 DUP: DUP family; InterPro: IPR001142 A number of uncharacterised integral membrane proteins from yeast contain an internal duplication due to duplicated genes. Duplicated copies of genes may be classified in two types of cluster organisation. The first type includes genes sharing a significant level of identity in the amino acid sequences of their predicted protein product. They are recovered on two different chromosomes, transcribed in the same orientation and the distance between them is conserved. The second type of cluster is based on one gene unit tandemly repeated. This duplication is itself repeated elsewhere in the genome. The basic gene unit is recovered many times in the genome and is a component of a multigene family of unknown function. These organisations in clusters of genes suggest a 'Lego organisation' of the yeast chromosomes []. The proteins belonging to this family are of unknown function.
Probab=32.67 E-value=62 Score=23.11 Aligned_cols=35 Identities=14% Similarity=0.258 Sum_probs=26.8
Q ss_pred cCCCchHHHHHHHHHHHHHHHHhhcc-chhhHHHHH
Q psy4115 48 EGKIERQVYEYTINQMNVYFQEAESV-SCKSYCEGC 82 (149)
Q Consensus 48 ~g~Is~eeF~~iI~~IN~~L~~A~~~-s~~n~~e~~ 82 (149)
.+.++.++|+.|-.++|++|.+.-.. +-.-+.||-
T Consensus 54 ~P~~d~~~Wd~IA~~mN~yly~~~~w~t~~~FyDG~ 89 (108)
T PF00674_consen 54 KPGVDMKKWDIIASRMNQYLYEEGLWNTPYFFYDGK 89 (108)
T ss_pred CCCCCcccHHHHHHHHHHHHHhCCCCCCcCcccCcH
Confidence 44678899999999999999988764 444555553
No 19
>PF13132 DUF3950: Domain of unknown function (DUF3950)
Probab=32.58 E-value=65 Score=18.62 Aligned_cols=23 Identities=13% Similarity=0.319 Sum_probs=16.5
Q ss_pred HHHHHHHHHHhhccchh-hHHHHH
Q psy4115 60 INQMNVYFQEAESVSCK-SYCEGC 82 (149)
Q Consensus 60 I~~IN~~L~~A~~~s~~-n~~e~~ 82 (149)
|++||..+..+-+-++. |++|.|
T Consensus 2 leQI~~a~~~~~~~NFSaWV~dAC 25 (30)
T PF13132_consen 2 LEQIEIALEQEGSGNFSAWVKDAC 25 (30)
T ss_pred hHHHHHHHHhccCcChHHHHHHHH
Confidence 68899999888776555 455555
No 20
>cd03205 GST_C_6 GST_C family, unknown subfamily 6; composed of uncharacterized bacterial proteins with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain.
Probab=32.11 E-value=1.6e+02 Score=19.69 Aligned_cols=59 Identities=12% Similarity=0.252 Sum_probs=35.5
Q ss_pred chHHHHHHHHHHHHHHHHhhccchhhHHHHHHHHHHHHHHHhhhcchhhHHHHHHHHHHH
Q psy4115 52 ERQVYEYTINQMNVYFQEAESVSCKSYCEGCMACLTAYFVYLCAETHYEKCLKKVAKFIA 111 (149)
Q Consensus 52 s~eeF~~iI~~IN~~L~~A~~~s~~n~~e~~L~clT~~l~~~~~~t~~kr~l~~le~yIe 111 (149)
..+++++....+++.|.+.- ..--++-|-+++|+-.|.-............-.|.+|.+
T Consensus 38 ~~~~~~~~l~~le~~L~~~~-~d~~TlADi~l~~~l~~~~~~~~~~~~~~~~p~l~~w~~ 96 (98)
T cd03205 38 QRGKIERALDALEAELAKLP-LDPLDLADIAVACALGYLDFRHPDLDWRAAHPALAAWYA 96 (98)
T ss_pred HHHHHHHHHHHHHHhhhhCC-CCCCCHHHHHHHHHHHHHHhHccCcchhhhChHHHHHHH
Confidence 35778888888888886531 244477888888877776432222111233455666654
No 21
>PF10015 DUF2258: Uncharacterized protein conserved in archaea (DUF2258); InterPro: IPR017140 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=32.00 E-value=70 Score=22.24 Aligned_cols=26 Identities=19% Similarity=0.207 Sum_probs=22.6
Q ss_pred hhcCCCchHHHHHHHHHHHHHHHHhh
Q psy4115 46 ELEGKIERQVYEYTINQMNVYFQEAE 71 (149)
Q Consensus 46 eL~g~Is~eeF~~iI~~IN~~L~~A~ 71 (149)
.|.+++.+++-...+..+|..|.+-+
T Consensus 23 ~l~~~v~~~ei~ra~aeLNk~ly~~l 48 (75)
T PF10015_consen 23 ALRGKVPPEEIVRAAAELNKKLYEKL 48 (75)
T ss_pred HHcCCCCHHHHHHHHHHHHHHHHHHH
Confidence 36899999999999999999987644
No 22
>PF06889 DUF1266: Protein of unknown function (DUF1266); InterPro: IPR009677 This family consists of several hypothetical bacterial proteins of around 235 residues in length. Members of this family seem to be found exclusively in the Enterobacteria Salmonella typhimurium and Escherichia coli. The function of this family is unknown.
Probab=31.78 E-value=1.6e+02 Score=21.66 Aligned_cols=35 Identities=17% Similarity=0.045 Sum_probs=27.4
Q ss_pred hcCCCchHHHHHHHHHHHHHHHHhhccchhhHHHHH
Q psy4115 47 LEGKIERQVYEYTINQMNVYFQEAESVSCKSYCEGC 82 (149)
Q Consensus 47 L~g~Is~eeF~~iI~~IN~~L~~A~~~s~~n~~e~~ 82 (149)
.-|+|+++|+...+.++=....+.|+ ||.-|..++
T Consensus 97 ~~Gyi~eeea~~~~~~~~~~a~~~y~-sWee~~~sy 131 (172)
T PF06889_consen 97 QAGYITEEEAWEYLLKAAQRAQQHYS-SWEEYAASY 131 (172)
T ss_pred HCCCCCHHHHHHHHHHHHHHHHHHCC-CHHHHHHHH
Confidence 56899999999999999888877774 366555554
No 23
>TIGR01639 P_fal_TIGR01639 Plasmodium falciparum uncharacterized domain TIGR01639. This model represents a conserved sequence region of about 60 amino acids found in over 40 predicted proteins of Plasmodium falciparum. It is not found elsewhere, including closely related species such as Plasmodium yoelii. No member of this family is characterized.
Probab=31.62 E-value=51 Score=21.45 Aligned_cols=26 Identities=15% Similarity=0.277 Sum_probs=20.4
Q ss_pred hhcCCCchHHHHHHHHHHHHHHHHhh
Q psy4115 46 ELEGKIERQVYEYTINQMNVYFQEAE 71 (149)
Q Consensus 46 eL~g~Is~eeF~~iI~~IN~~L~~A~ 71 (149)
.|+-.++++|+.+.|+.+++..-...
T Consensus 5 Dls~~lTeEEl~~~i~~L~~~~~~~d 30 (61)
T TIGR01639 5 DLSKKLSKEELNELINSLDEIPNRND 30 (61)
T ss_pred HHhHHccHHHHHHHHHhhcCCCCHHH
Confidence 46677999999999999987664443
No 24
>PF08060 NOSIC: NOSIC (NUC001) domain; InterPro: IPR012976 This is the central domain in Nop56/SIK1-like proteins [].; PDB: 3PLA_K 3ICX_B 3ID6_A 3ID5_E 3NVM_A 3NMU_B 2NNW_C 3NVI_A 3NVK_A 2OZB_E ....
Probab=31.35 E-value=33 Score=21.67 Aligned_cols=24 Identities=29% Similarity=0.488 Sum_probs=17.3
Q ss_pred cccCCchhhcC-CCchHHHHHHHHHH
Q psy4115 39 FQTHFPQELEG-KIERQVYEYTINQM 63 (149)
Q Consensus 39 F~t~fP~eL~g-~Is~eeF~~iI~~I 63 (149)
++..|| ||+. -.++.+|..+|..|
T Consensus 27 Y~~~FP-EL~~lv~~~~~Y~~vV~~i 51 (53)
T PF08060_consen 27 YSWHFP-ELESLVPNPIDYAKVVKII 51 (53)
T ss_dssp HTTTST-THHHHS-SHHHHHHHHHHT
T ss_pred HHccch-hHHHHcCCHHHHHHHHHHh
Confidence 456799 6887 56788888888764
No 25
>smart00757 CRA CT11-RanBPM. protein-protein interaction domain present in crown eukaryotes (plants, animals, fungi)
Probab=31.11 E-value=1.6e+02 Score=19.68 Aligned_cols=41 Identities=17% Similarity=0.086 Sum_probs=32.1
Q ss_pred chhhcCCCchHHHHHHHHHHHHHHHHhh-ccchhhHHHHHHH
Q psy4115 44 PQELEGKIERQVYEYTINQMNVYFQEAE-SVSCKSYCEGCMA 84 (149)
Q Consensus 44 P~eL~g~Is~eeF~~iI~~IN~~L~~A~-~~s~~n~~e~~L~ 84 (149)
++.+...++++.++.+.+.+|..+-+.+ ..+..+-++.++.
T Consensus 43 ~sp~~~ll~~~~~~~la~~~n~~~l~~~~~~~~~s~L~~~~~ 84 (99)
T smart00757 43 PSPYKELLSPSQREKLAEELNSAILELLHGKSSESPLEILLS 84 (99)
T ss_pred CccHHHHCCHHHHHHHHHHHHHHHHHHccCCCCCChHHHHHH
Confidence 4456677999999999999999998887 6555666666655
No 26
>PF14425 Imm3: Immunity protein Imm3
Probab=30.71 E-value=52 Score=24.63 Aligned_cols=29 Identities=17% Similarity=0.257 Sum_probs=25.0
Q ss_pred chhhcCCCchHHHHHHHHHHHHHHHHhhc
Q psy4115 44 PQELEGKIERQVYEYTINQMNVYFQEAES 72 (149)
Q Consensus 44 P~eL~g~Is~eeF~~iI~~IN~~L~~A~~ 72 (149)
|..+.+.++.+|+++++.++|..|..-..
T Consensus 80 ~~~~~~eLt~eE~~dL~~R~nkVL~~l~~ 108 (117)
T PF14425_consen 80 FEEVKGELTQEEKEDLSQRINKVLDGLEK 108 (117)
T ss_pred hHHHHhHhhHHHHHHHHHHHHHHHHHHhc
Confidence 55567889999999999999999987665
No 27
>cd03203 GST_C_Lambda GST_C family, Class Lambda subfamily; composed of plant-specific class Lambda GSTs. GSTs are cytosolic, usually dimeric, proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The class Lambda subfamily was recently discovered, together with dehydroascorbate reductases (DHARs), as two outlying groups of the GST superfamily in Arabidopsis thaliana, which contain conserved active site cysteines. Characterization of recombinant A. thaliana proteins show that Lambda class GSTs are monomeric, similar
Probab=30.63 E-value=1.8e+02 Score=20.44 Aligned_cols=73 Identities=10% Similarity=0.068 Sum_probs=43.9
Q ss_pred CCchhhcCCCchHHHHHHHHHHHHHHHH----hhccchhhHHHHHHHHHHHHHHHhh---hcchhhHHHHHHHHHHHHHH
Q psy4115 42 HFPQELEGKIERQVYEYTINQMNVYFQE----AESVSCKSYCEGCMACLTAYFVYLC---AETHYEKCLKKVAKFIAEQN 114 (149)
Q Consensus 42 ~fP~eL~g~Is~eeF~~iI~~IN~~L~~----A~~~s~~n~~e~~L~clT~~l~~~~---~~t~~kr~l~~le~yIe~~N 114 (149)
-+|..+.++- .++.++..+.+++.|.+ .|-.+--++.|-++++...++.... .........-+|.+|.+...
T Consensus 22 ~~~~~~~~~~-~~~~~~~l~~Le~~L~~~~~~~fl~G~~tlADi~l~~~~~~~~~~~~~~~~~~~~~~~P~l~~W~~~~~ 100 (120)
T cd03203 22 LYSSLIKGDP-SAEAAAALDYIENALSKFDDGPFFLGQFSLVDIAYVPFIERFQIFLSELFNYDITEGRPNLAAWIEEMN 100 (120)
T ss_pred HHHHHhcCCc-hHHHHHHHHHHHHHHHhcCCCCCcCCCccHHHHHHHHHHHHHHHHHHHhcCccccccCcHHHHHHHHHh
Confidence 4577776765 55889999999999974 2222423777888777665443211 11111123356888888775
Q ss_pred h
Q psy4115 115 E 115 (149)
Q Consensus 115 ~ 115 (149)
+
T Consensus 101 ~ 101 (120)
T cd03203 101 K 101 (120)
T ss_pred c
Confidence 4
No 28
>PF05903 Peptidase_C97: PPPDE putative peptidase domain; InterPro: IPR008580 This domain consists of the N-terminal portion of several eukaryotic sequences. The function of this domain is unknown.; PDB: 2WP7_A 3EBQ_A.
Probab=29.61 E-value=56 Score=24.81 Aligned_cols=40 Identities=13% Similarity=0.356 Sum_probs=28.4
Q ss_pred CCchHHHHHHHHHHHHHH-HHhhcc---chhhHHHHHHHHHHHH
Q psy4115 50 KIERQVYEYTINQMNVYF-QEAESV---SCKSYCEGCMACLTAY 89 (149)
Q Consensus 50 ~Is~eeF~~iI~~IN~~L-~~A~~~---s~~n~~e~~L~clT~~ 89 (149)
.+++++|++++..|-..+ ..+|++ +|.+|.+.+...|++-
T Consensus 81 ~~~~~~~~~~l~~l~~~~~~~~Y~Ll~~NCNhFs~~l~~~L~g~ 124 (151)
T PF05903_consen 81 TLSEEEFEEILRSLSREFTGDSYHLLNRNCNHFSDALCQFLTGK 124 (151)
T ss_dssp ---HHHHHHHHHHHHTT-SGGG-BTTTBSHHHHHHHHHHHHHS-
T ss_pred cCCHHHHHHHHHHHHhhccCCcchhhhhhhhHHHHHHHHHhCCC
Confidence 589999999999888754 677774 8888888888888764
No 29
>COG1422 Predicted membrane protein [Function unknown]
Probab=29.27 E-value=2.8e+02 Score=22.65 Aligned_cols=68 Identities=15% Similarity=0.114 Sum_probs=44.3
Q ss_pred cCCCchHHHHHHHHHHHHHHHHhhccchhhHHHHHHHHHHHHHHHhhhcchh-hHHHHHHHHHHHHHHh
Q psy4115 48 EGKIERQVYEYTINQMNVYFQEAESVSCKSYCEGCMACLTAYFVYLCAETHY-EKCLKKVAKFIAEQNE 115 (149)
Q Consensus 48 ~g~Is~eeF~~iI~~IN~~L~~A~~~s~~n~~e~~L~clT~~l~~~~~~t~~-kr~l~~le~yIe~~N~ 115 (149)
-|++.+..++.+=..+|..|.---++-.--+.=-++|++||....+.-.-.. ..+++++.+-.++.|+
T Consensus 18 ~g~~~~~~~~~i~~~ln~~f~P~i~~~~p~lvilV~avi~gl~~~i~~~~liD~ekm~~~qk~m~efq~ 86 (201)
T COG1422 18 FGIMFSSIRDGIGGALNVVFGPLLSPLPPHLVILVAAVITGLYITILQKLLIDQEKMKELQKMMKEFQK 86 (201)
T ss_pred HHHHhHHHHHHHHHHHHHHHhhhccccccHHHHHHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHH
Confidence 3556667888888899999988777633445666778888877765554333 2455555555555444
No 30
>cd03187 GST_C_Phi GST_C family, Class Phi subfamily; composed of plant-specific class Phi GSTs and related fungal and bacterial proteins. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The class Phi GST subfamily has experience extensive gene duplication. The Arabidopsis and Oryza genomes contain 13 and 16 Tau GSTs, respectively. They are primarily responsible for herbicide detoxification together with class Tau GSTs, showing class specificity in substrate preference. Phi enzymes a
Probab=29.25 E-value=1.8e+02 Score=19.55 Aligned_cols=62 Identities=3% Similarity=-0.001 Sum_probs=36.1
Q ss_pred hHHHHHHHHHHHHHHHHh-hcc-chhhHHHHHHHHHHHHHHHhhhcchhhHHHHHHHHHHHHHHh
Q psy4115 53 RQVYEYTINQMNVYFQEA-ESV-SCKSYCEGCMACLTAYFVYLCAETHYEKCLKKVAKFIAEQNE 115 (149)
Q Consensus 53 ~eeF~~iI~~IN~~L~~A-~~~-s~~n~~e~~L~clT~~l~~~~~~t~~kr~l~~le~yIe~~N~ 115 (149)
.+++.+.++.+++.|... |-. +--++.|-++.++..++.. ............|.+|++....
T Consensus 47 ~~~~~~~l~~le~~L~~~~~l~G~~~t~aDi~l~~~~~~~~~-~~~~~~~~~~p~l~~~~~~~~~ 110 (118)
T cd03187 47 EEKLKKVLDVYEARLSKSKYLAGDSFTLADLSHLPYLQYLMA-TPFAKLFDSRPHVKAWWEDISA 110 (118)
T ss_pred HHHHHHHHHHHHHHcccCcccCCCCccHHHHHHHHHHHHHHH-ccchhhhhcCchHHHHHHHHHh
Confidence 456777777788888653 222 3357888888777666543 1111111234567788887665
No 31
>COG2306 Predicted RNA-binding protein, associated with RNAses E/G family [General function prediction only]
Probab=28.84 E-value=1.4e+02 Score=23.99 Aligned_cols=42 Identities=19% Similarity=0.143 Sum_probs=32.0
Q ss_pred cCCCchHHHHHHHHHHHHHHHHhhc--cchhhHHHHHHHHHHHH
Q psy4115 48 EGKIERQVYEYTINQMNVYFQEAES--VSCKSYCEGCMACLTAY 89 (149)
Q Consensus 48 ~g~Is~eeF~~iI~~IN~~L~~A~~--~s~~n~~e~~L~clT~~ 89 (149)
.|.|++++|++.+.+.|.+.+++.+ ...++..--|+...=+|
T Consensus 134 ~GlIT~~qf~~Ay~~a~aVe~elr~g~~~~q~~~~~~~~~vw~~ 177 (183)
T COG2306 134 YGLITPEQFEKAYRTANAVEAELRNGKFPPQWLATKGMELVWAY 177 (183)
T ss_pred cCCCCHHHHHHHHHHHHHHHHHHhcCCcchhHHhhhhHHHhhhh
Confidence 5789999999999999999998887 25666665555554333
No 32
>cd03179 GST_C_1 GST_C family, unknown subfamily 1; composed of uncharacterized bacterial proteins, with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain.
Probab=28.69 E-value=1.2e+02 Score=20.06 Aligned_cols=59 Identities=8% Similarity=-0.053 Sum_probs=33.6
Q ss_pred chHHHHHHHHHHHHHHHHh-hcc-chhhHHHHHHHHHHHHHHHhhhcchhhHHHHHHHHHHHHH
Q psy4115 52 ERQVYEYTINQMNVYFQEA-ESV-SCKSYCEGCMACLTAYFVYLCAETHYEKCLKKVAKFIAEQ 113 (149)
Q Consensus 52 s~eeF~~iI~~IN~~L~~A-~~~-s~~n~~e~~L~clT~~l~~~~~~t~~kr~l~~le~yIe~~ 113 (149)
..+++++....++..|... |=. .--++.|-+++++..|+-..... ....-+|.+|++..
T Consensus 44 ~~~~~~~~l~~le~~L~~~~~l~g~~~slaDi~~~~~~~~~~~~~~~---~~~~p~l~~~~~~~ 104 (105)
T cd03179 44 LRERGHAALAVLEAHLAGRDFLVGDALTIADIALAAYTHVADEGGFD---LADYPAIRAWLARI 104 (105)
T ss_pred HHHHHHHHHHHHHHHHccCccccCCCCCHHHHHHHHHHHhccccCCC---hHhCccHHHHHHhh
Confidence 4567777888888888432 111 23378888888877775321111 12233466676654
No 33
>PF13405 EF-hand_6: EF-hand domain; PDB: 2AMI_A 3QRX_A 1W7J_B 1OE9_B 1W7I_B 1KFU_S 1KFX_S 2BL0_B 1Y1X_B 3MSE_B ....
Probab=28.25 E-value=36 Score=18.49 Aligned_cols=15 Identities=13% Similarity=0.257 Sum_probs=12.9
Q ss_pred cCCCchHHHHHHHHH
Q psy4115 48 EGKIERQVYEYTINQ 62 (149)
Q Consensus 48 ~g~Is~eeF~~iI~~ 62 (149)
+|+|+.+||..++++
T Consensus 14 dG~I~~~el~~~l~~ 28 (31)
T PF13405_consen 14 DGFIDFEELRAILRK 28 (31)
T ss_dssp SSEEEHHHHHHHHHH
T ss_pred CCcCcHHHHHHHHHH
Confidence 578999999999875
No 34
>COG5415 Predicted integral membrane metal-binding protein [General function prediction only]
Probab=27.93 E-value=1e+02 Score=25.69 Aligned_cols=34 Identities=12% Similarity=0.099 Sum_probs=24.0
Q ss_pred HHHHHHHHHHHhhhcchhhHHHHHHHHHHHHHHh
Q psy4115 82 CMACLTAYFVYLCAETHYEKCLKKVAKFIAEQNE 115 (149)
Q Consensus 82 ~L~clT~~l~~~~~~t~~kr~l~~le~yIe~~N~ 115 (149)
+||.+++|++.|.....|+.++.+.+.-++.+|.
T Consensus 77 llgs~slymfrwal~~lye~r~~r~~~~L~kLra 110 (251)
T COG5415 77 LLGSGSLYMFRWALTKLYEFRNNRRLRKLAKLRA 110 (251)
T ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHH
Confidence 4556777888888888887777666666666554
No 35
>PF05164 ZapA: Cell division protein ZapA; InterPro: IPR007838 This entry a structural domain found in the cell division protein ZapA, as well as in related proteins. This domain has a core structure consisting of two layers alpha/beta, and has a long C-terminal helix that forms dimeric parallel and tetrameric antiparallel coiled coils []. ZapA interacts with FtsZ, where FtsZ is part of a mid-cell cytokinetic structure termed the Z-ring that recruits a hierarchy of fission related proteins early in the bacterial cell cycle. ZapA drives the polymerisation and filament bundling of FtsZ, thereby contributing to the spatio-temporal tuning of the Z-ring.; PDB: 1T3U_B 1W2E_B 3HNW_A.
Probab=27.78 E-value=1.9e+02 Score=19.15 Aligned_cols=68 Identities=12% Similarity=0.016 Sum_probs=38.9
Q ss_pred CCCchHHHHHHHHHHHHHHHHhhccchhhHHHHHHHHHHHHHHHhhh--cchhhHHHH--HHHHHHHHHHhh
Q psy4115 49 GKIERQVYEYTINQMNVYFQEAESVSCKSYCEGCMACLTAYFVYLCA--ETHYEKCLK--KVAKFIAEQNEN 116 (149)
Q Consensus 49 g~Is~eeF~~iI~~IN~~L~~A~~~s~~n~~e~~L~clT~~l~~~~~--~t~~kr~l~--~le~yIe~~N~~ 116 (149)
+.=+++..+++...||+.+.+.-.-+..-=.+.++..++..+..-+. ....+.... ++++-|++.+++
T Consensus 16 ~~~~ee~l~~~a~~i~~~i~~~~~~~~~~~~~~~~vlaaLnla~e~~~~~~~~~~~~~~~~l~~~i~~L~~~ 87 (89)
T PF05164_consen 16 PDEDEEYLRKAAELINEKINEIKKKYPKLSPERLAVLAALNLADELLKLKRELDELEELERLEERIEELNER 87 (89)
T ss_dssp TGCGHHHHHHHHHHHHHHHHHHCTTCCTSSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHh
Confidence 33478899999999999998876632111123334444444444333 223333333 677777776654
No 36
>PF06831 H2TH: Formamidopyrimidine-DNA glycosylase H2TH domain; InterPro: IPR015886 This entry represents a helix-2turn-helix DNA-binding domain found in DNA glycosylase/AP lyase enzymes, which are involved in base excision repair of DNA damaged by oxidation or by mutagenic agents. Most damage to bases in DNA is repaired by the base excision repair pathway []. These enzymes are primarily from bacteria, and have both DNA glycosylase activity (3.2.2 from EC) and AP lyase activity (4.2.99.18 from EC). Examples include formamidopyrimidine-DNA glycosylases (Fpg; MutM) and endonuclease VIII (Nei). Formamidopyrimidine-DNA glycosylases (Fpg, MutM) is a trifunctional DNA base excision repair enzyme that removes a wide range of oxidation-damaged bases (N-glycosylase activity; 3.2.2.23 from EC) and cleaves both the 3'- and 5'-phosphodiester bonds of the resulting apurinic/apyrimidinic site (AP lyase activity; 4.2.99.18 from EC). Fpg has a preference for oxidised purines, excising oxidized purine bases such as 7,8-dihydro-8-oxoguanine (8-oxoG). ITs AP (apurinic/apyrimidinic) lyase activity introduces nicks in the DNA strand, cleaving the DNA backbone by beta-delta elimination to generate a single-strand break at the site of the removed base with both 3'- and 5'-phosphates. Fpg is a monomer composed of 2 domains connected by a flexible hinge []. The two DNA-binding motifs (a zinc finger and the helix-two-turns-helix motifs) suggest that the oxidized base is flipped out from double-stranded DNA in the binding mode and excised by a catalytic mechanism similar to that of bifunctional base excision repair enzymes []. Fpg binds one ion of zinc at the C terminus, which contains four conserved and essential cysteines [, ]. Endonuclease VIII (Nei) has the same enzyme activities as Fpg above (3.2.2 from EC, 4.2.99.18 from EC), but with a preference for oxidized pyrimidines, such as thymine glycol, 5,6-dihydrouracil and 5,6-dihydrothymine []. These protein contains three structural domains: an N-terminal catalytic core domain, a central helix-two turn-helix (H2TH) module and a C-terminal zinc finger []. The N-terminal catalytic domain and the C-terminal zinc finger straddle the DNA with the long axis of the protein oriented roughly orthogonal to the helical axis of the DNA. Residues that contact DNA are located in the catalytic domain and in a beta-hairpin loop formed by the zinc finger []. This entry represents the central domain containing the DNA-binding helix-two turn-helix domain [].; GO: 0003684 damaged DNA binding, 0003906 DNA-(apurinic or apyrimidinic site) lyase activity, 0008270 zinc ion binding, 0016799 hydrolase activity, hydrolyzing N-glycosyl compounds, 0006289 nucleotide-excision repair; PDB: 3GQ3_A 3JR5_A 3SAT_A 3GPX_A 2F5Q_A 3SBJ_A 3U6S_A 3SAU_A 3SAR_A 2F5P_A ....
Probab=27.50 E-value=60 Score=22.68 Aligned_cols=34 Identities=12% Similarity=0.135 Sum_probs=28.4
Q ss_pred CCchhhcCCCchHHHHHHHHHHHHHHHHhhccch
Q psy4115 42 HFPQELEGKIERQVYEYTINQMNVYFQEAESVSC 75 (149)
Q Consensus 42 ~fP~eL~g~Is~eeF~~iI~~IN~~L~~A~~~s~ 75 (149)
.-|......|++++++.+...+-..|.+|.....
T Consensus 53 i~P~~~~~~L~~~~~~~l~~~~~~vl~~ai~~gg 86 (92)
T PF06831_consen 53 IHPERPASSLSEEELRRLHEAIKRVLREAIEVGG 86 (92)
T ss_dssp B-TTSBGGGSHHHHHHHHHHHHHHHHHHHHHTT-
T ss_pred CCccCccccCCHHHHHHHHHHHHHHHHHHHHcCC
Confidence 4577777889999999999999999999987643
No 37
>PF13410 GST_C_2: Glutathione S-transferase, C-terminal domain; PDB: 4DEJ_H 3IC8_A 2JL4_A 2V6K_B 3CBU_B 1JLW_B 3F6D_B 3G7I_A 3F63_A 3G7J_B ....
Probab=27.14 E-value=1.6e+02 Score=18.15 Aligned_cols=59 Identities=7% Similarity=0.090 Sum_probs=35.3
Q ss_pred hHHHHHHHHHHHHHHHHhhc-c-chhhHHHHHHHHHHHHHHHhhhcchhhHHHHHHHHHHH
Q psy4115 53 RQVYEYTINQMNVYFQEAES-V-SCKSYCEGCMACLTAYFVYLCAETHYEKCLKKVAKFIA 111 (149)
Q Consensus 53 ~eeF~~iI~~IN~~L~~A~~-~-s~~n~~e~~L~clT~~l~~~~~~t~~kr~l~~le~yIe 111 (149)
.+++++.+..++..|...-- . +--++.|-++...-.++-..............|.+|.+
T Consensus 8 ~~~~~~~l~~le~~L~~~~fl~G~~~s~aD~~l~~~l~~~~~~~~~~~~~~~~p~l~~w~~ 68 (69)
T PF13410_consen 8 RAQLEAALDALEDHLADGPFLFGDRPSLADIALAPFLWRLRFVGPDFDLLEAYPNLRAWYE 68 (69)
T ss_dssp HHHHHHHHHHHHHHHTTSSBTTBSS--HHHHHHHHHHHHHHHCTHTCCHHTTSHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHhhCCCCCCCCCCHHHHHHHHHHHHHHHhCcCcCccccCHHHHHHHh
Confidence 47788889999998876542 2 44478887766554444443332233455566777765
No 38
>PF09851 SHOCT: Short C-terminal domain; InterPro: IPR018649 This family of hypothetical prokaryotic proteins has no known function.
Probab=26.91 E-value=69 Score=18.02 Aligned_cols=16 Identities=25% Similarity=0.524 Sum_probs=12.8
Q ss_pred cCCCchHHHHHHHHHH
Q psy4115 48 EGKIERQVYEYTINQM 63 (149)
Q Consensus 48 ~g~Is~eeF~~iI~~I 63 (149)
.|-||++||+..-.+|
T Consensus 14 ~G~IseeEy~~~k~~l 29 (31)
T PF09851_consen 14 KGEISEEEYEQKKARL 29 (31)
T ss_pred cCCCCHHHHHHHHHHH
Confidence 4789999999876654
No 39
>PF07108 PipA: PipA protein; InterPro: IPR010777 This family consists of several Salmonella PipA (pathogenicity island-encoded protein A) and related phage sequences. PipA is thought to contribute to enteric but not to systemic salmonellosis [].
Probab=26.71 E-value=1.4e+02 Score=23.99 Aligned_cols=48 Identities=19% Similarity=0.320 Sum_probs=38.1
Q ss_pred CCchHHHHHHHHHHHHHHHHhhccchhhHHHHHHHHHHHHHHHhhhcchhhHHHHHHH-HHHH
Q psy4115 50 KIERQVYEYTINQMNVYFQEAESVSCKSYCEGCMACLTAYFVYLCAETHYEKCLKKVA-KFIA 111 (149)
Q Consensus 50 ~Is~eeF~~iI~~IN~~L~~A~~~s~~n~~e~~L~clT~~l~~~~~~t~~kr~l~~le-~yIe 111 (149)
-|+...+.+--..+-++|..||.-| ..+.-++.+.|++.|..+| +|+-
T Consensus 53 EiDdd~Y~eC~~eLS~il~~AYtqs--------------~TFRRLmNyAydkeL~DveqrWll 101 (200)
T PF07108_consen 53 EIDDDIYDECYDELSSILQNAYTQS--------------ETFRRLMNYAYDKELHDVEQRWLL 101 (200)
T ss_pred cccchHHHHHHHHHHHHHHHHHhhh--------------HHHHHHHHHHhhhhhhhhHHHhhh
Confidence 3567788888899999999999743 3445667888999999999 7884
No 40
>COG3462 Predicted membrane protein [Function unknown]
Probab=26.67 E-value=54 Score=24.54 Aligned_cols=16 Identities=25% Similarity=0.536 Sum_probs=14.2
Q ss_pred cCCCchHHHHHHHHHH
Q psy4115 48 EGKIERQVYEYTINQM 63 (149)
Q Consensus 48 ~g~Is~eeF~~iI~~I 63 (149)
.|-||+|||+.++++|
T Consensus 100 kGEItEEEY~r~~~~i 115 (117)
T COG3462 100 KGEITEEEYRRIIRTI 115 (117)
T ss_pred cCCCCHHHHHHHHHHh
Confidence 4679999999999987
No 41
>PRK15022 ferritin-like protein; Provisional
Probab=26.64 E-value=1.8e+02 Score=22.79 Aligned_cols=58 Identities=9% Similarity=0.071 Sum_probs=42.7
Q ss_pred HHHHHHHHHHHHHHHhhcc-chhhHHHHHHHHHHHHHHHhhhcchhhHHHHHHHHHHHHHHhh
Q psy4115 55 VYEYTINQMNVYFQEAESV-SCKSYCEGCMACLTAYFVYLCAETHYEKCLKKVAKFIAEQNEN 116 (149)
Q Consensus 55 eF~~iI~~IN~~L~~A~~~-s~~n~~e~~L~clT~~l~~~~~~t~~kr~l~~le~yIe~~N~~ 116 (149)
--..+=.+||..|..|+-+ +...||+.. .+-.+.-|+..+....+...+++++.+|.+
T Consensus 6 ~~~~LN~QIn~E~~aSy~YLsMa~~~~~~----~L~GfA~ff~~qa~EEreHA~k~~~yl~~r 64 (167)
T PRK15022 6 MLLKLNSQMNLEFYASNLYLHLSEWCSEQ----SLNGTATFLRAQAQSNVTQMMRMFNFMKSA 64 (167)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CChhHHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 3345557899999999986 777887633 344555666777778888889999999874
No 42
>KOG3431|consensus
Probab=26.43 E-value=58 Score=24.79 Aligned_cols=22 Identities=9% Similarity=0.117 Sum_probs=19.9
Q ss_pred hhcCCCchHHHHHHHHHHHHHH
Q psy4115 46 ELEGKIERQVYEYTINQMNVYF 67 (149)
Q Consensus 46 eL~g~Is~eeF~~iI~~IN~~L 67 (149)
.++++|++++|-.+.++||.--
T Consensus 84 Qvs~Kise~~lisiLe~is~Qt 105 (129)
T KOG3431|consen 84 QVSHKISEAELISILEKISAQT 105 (129)
T ss_pred CccccccHHHHHHHHHHHHHhh
Confidence 4889999999999999999866
No 43
>COG2143 Thioredoxin-related protein [Posttranslational modification, protein turnover, chaperones]
Probab=26.24 E-value=76 Score=25.43 Aligned_cols=27 Identities=11% Similarity=0.066 Sum_probs=22.4
Q ss_pred hhcCCCchHHHHHHHHHHHHHHHHhhc
Q psy4115 46 ELEGKIERQVYEYTINQMNVYFQEAES 72 (149)
Q Consensus 46 eL~g~Is~eeF~~iI~~IN~~L~~A~~ 72 (149)
+|.||++|++|..+++-|-+.+.+-+.
T Consensus 131 ~lPGY~ppe~Fl~vlkYVa~g~ykd~~ 157 (182)
T COG2143 131 ELPGYMPPEQFLAVLKYVADGKYKDTK 157 (182)
T ss_pred ecCCCCCHHHHHHHHHHHHHHHHhhhc
Confidence 488999999999999988777766554
No 44
>PF06957 COPI_C: Coatomer (COPI) alpha subunit C-terminus; InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=26.19 E-value=4.1e+02 Score=24.01 Aligned_cols=76 Identities=8% Similarity=0.139 Sum_probs=37.9
Q ss_pred eEEEEccCCCCCcccccccCCchhhcCCCchHHHHHHHHHHHHHHHHhhccchhhHHHHHHHHHHHHHHHhhh-cchhhH
Q psy4115 23 AKIFIQRDYTDGTMVRFQTHFPQELEGKIERQVYEYTINQMNVYFQEAESVSCKSYCEGCMACLTAYFVYLCA-ETHYEK 101 (149)
Q Consensus 23 ~~i~I~RDys~g~~~qF~t~fP~eL~g~Is~eeF~~iI~~IN~~L~~A~~~s~~n~~e~~L~clT~~l~~~~~-~t~~kr 101 (149)
-.+.|+|+|.+...+. ..|-.. ..++.|...|+++|.....+=|..++.++-.-|..+.+ ....++
T Consensus 177 l~~~~~r~~~~~~~~~---~lP~i~----------~~l~~L~~~Lk~gyk~~t~gKF~eA~~~Fr~iL~~i~l~vv~~~~ 243 (422)
T PF06957_consen 177 LPSYIRRNWDESNPKN---GLPAIP----------LSLSSLEERLKEGYKLFTAGKFEEAIEIFRSILHSIPLLVVESRE 243 (422)
T ss_dssp EEEEEBCTTTTSSSCC---G-BB--------------HHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHC--BSSCH
T ss_pred ccccccCCcccccccc---CCCcCc----------CCHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHhheeeecCHH
Confidence 4578999999753333 233211 12344555677777754444455555555555554333 233344
Q ss_pred HHHHHHHHHH
Q psy4115 102 CLKKVAKFIA 111 (149)
Q Consensus 102 ~l~~le~yIe 111 (149)
...++.++|+
T Consensus 244 E~~e~~eli~ 253 (422)
T PF06957_consen 244 EEDEAKELIE 253 (422)
T ss_dssp HHHHHHHHHH
T ss_pred HHHHHHHHHH
Confidence 4555544443
No 45
>PRK06474 hypothetical protein; Provisional
Probab=26.05 E-value=70 Score=24.94 Aligned_cols=23 Identities=13% Similarity=0.219 Sum_probs=20.4
Q ss_pred CCchHHHHHHHHHHHHHHHHhhc
Q psy4115 50 KIERQVYEYTINQMNVYFQEAES 72 (149)
Q Consensus 50 ~Is~eeF~~iI~~IN~~L~~A~~ 72 (149)
++|++||++....|+..+.+-..
T Consensus 135 ~Lt~ee~~el~~el~~ll~~y~~ 157 (178)
T PRK06474 135 KLDEEEFEEFQSELNELMIKYYN 157 (178)
T ss_pred ecCHHHHHHHHHHHHHHHHHHHh
Confidence 69999999999999999987654
No 46
>PF09693 Phage_XkdX: Phage uncharacterised protein (Phage_XkdX); InterPro: IPR010022 This entry is represented by Bacteriophage 69, Orf86. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry identifies a family of small (about 50 amino acid) phage proteins, found in at least 12 different phage and prophage regions of Gram-positive bacteria. In a number of these phage, the gene for this protein is found near the holin and endolysin genes.
Probab=25.88 E-value=39 Score=20.25 Aligned_cols=14 Identities=21% Similarity=0.359 Sum_probs=11.8
Q ss_pred cCCCchHHHHHHHH
Q psy4115 48 EGKIERQVYEYTIN 61 (149)
Q Consensus 48 ~g~Is~eeF~~iI~ 61 (149)
-|.|+++||++|+.
T Consensus 24 ~g~IT~eey~eITG 37 (40)
T PF09693_consen 24 AGWITKEEYKEITG 37 (40)
T ss_pred cCeECHHHHHHhhC
Confidence 37899999999874
No 47
>PF06304 DUF1048: Protein of unknown function (DUF1048); InterPro: IPR008316 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.; PDB: 2O3L_B 2HH6_A 2O4T_A.
Probab=25.51 E-value=2.4e+02 Score=20.44 Aligned_cols=37 Identities=11% Similarity=0.114 Sum_probs=25.6
Q ss_pred hHHHHHHHHHHHHHHHHhhccchhhHHHHHHHHHHHH
Q psy4115 53 RQVYEYTINQMNVYFQEAESVSCKSYCEGCMACLTAY 89 (149)
Q Consensus 53 ~eeF~~iI~~IN~~L~~A~~~s~~n~~e~~L~clT~~ 89 (149)
|++++.+...|+.+|-..-..++++.++.+-++|-++
T Consensus 21 P~dY~~ay~~i~~Yl~~~~~~~g~~~~~il~dildlf 57 (103)
T PF06304_consen 21 PEDYRIAYKAIQKYLWYFGPTDGRDMMEILSDILDLF 57 (103)
T ss_dssp -HHHHHHHHHHHHHHHHHTBSSHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHH
Confidence 6889999999999998766557776555554444433
No 48
>PF14752 RBP_receptor: Retinol binding protein receptor
Probab=25.29 E-value=2e+02 Score=27.20 Aligned_cols=49 Identities=22% Similarity=0.230 Sum_probs=36.9
Q ss_pred HHHHHHHHHHHHHHhhccchhhHHHHHHHHHHHHHHHhhhcchhhHHHHHHH
Q psy4115 56 YEYTINQMNVYFQEAESVSCKSYCEGCMACLTAYFVYLCAETHYEKCLKKVA 107 (149)
Q Consensus 56 F~~iI~~IN~~L~~A~~~s~~n~~e~~L~clT~~l~~~~~~t~~kr~l~~le 107 (149)
-.++|..++.++.-++ .-++...+++|++.....+-+-.+|+|.|++|-
T Consensus 304 ~~~~i~~v~~~~~~~~---v~~~~a~~la~l~~~~~l~~~lv~yR~~l~~L~ 352 (617)
T PF14752_consen 304 RQEVIELVKEYIWVLE---VCWVVATVLACLTTVSYLFHILVCYRKHLKRLY 352 (617)
T ss_pred hhHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3466777776666544 346788899999999888888888899888764
No 49
>TIGR03042 PS_II_psbQ_bact photosystem II protein PsbQ. This protein through the member sll1638 from Synechocystis sp. PCC 6803, was shown to be part of the cyanobacteria photosystem II. It is homologous to (but quite diverged from) the chloroplast PsbQ protein, called oxygen-evolving enhancer protein 3 (OEE3). We designate this cyanobacteria protein PsbQ by homology.
Probab=25.05 E-value=1.5e+02 Score=22.84 Aligned_cols=38 Identities=13% Similarity=0.101 Sum_probs=26.3
Q ss_pred hhHHHHHHHHHHHHHHHhhh------cchhhHHHHHHHHHHHHH
Q psy4115 76 KSYCEGCMACLTAYFVYLCA------ETHYEKCLKKVAKFIAEQ 113 (149)
Q Consensus 76 ~n~~e~~L~clT~~l~~~~~------~t~~kr~l~~le~yIe~~ 113 (149)
|.++-.||.++++++++.-. .|.....|.+|++|++..
T Consensus 2 r~~~s~~Lv~~~~~Lvsc~~p~~~~p~tysp~~l~~i~~~~~~i 45 (142)
T TIGR03042 2 RSLASLLLVLLLTFLVSCSGPAAAVPPTYSPAQLAQIQRQAEGI 45 (142)
T ss_pred hhHHHHHHHHHHHHHHHcCCCcccCCCCCCHHHHHHHHHHHHHH
Confidence 45677778877777776332 355578888888888763
No 50
>smart00054 EFh EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
Probab=24.86 E-value=47 Score=15.79 Aligned_cols=15 Identities=20% Similarity=0.477 Sum_probs=12.2
Q ss_pred cCCCchHHHHHHHHH
Q psy4115 48 EGKIERQVYEYTINQ 62 (149)
Q Consensus 48 ~g~Is~eeF~~iI~~ 62 (149)
+|+|+.++|..++..
T Consensus 14 ~g~i~~~e~~~~~~~ 28 (29)
T smart00054 14 DGKIDFEEFKDLLKA 28 (29)
T ss_pred CCcEeHHHHHHHHHh
Confidence 578999999888765
No 51
>PHA02650 hypothetical protein; Provisional
Probab=23.76 E-value=1.9e+02 Score=20.38 Aligned_cols=19 Identities=5% Similarity=0.176 Sum_probs=16.4
Q ss_pred chHHHHHHHHHHHHHHHHh
Q psy4115 52 ERQVYEYTINQMNVYFQEA 70 (149)
Q Consensus 52 s~eeF~~iI~~IN~~L~~A 70 (149)
+.+.|++.|+-+=..|.+.
T Consensus 16 ~DdDFnnFI~VVkSVLtD~ 34 (81)
T PHA02650 16 TDDDFNNFIDVVKSVLSDE 34 (81)
T ss_pred cHHHHHHHHHHHHHHHcCC
Confidence 5689999999999999775
No 52
>PF11922 DUF3440: Domain of unknown function (DUF3440); InterPro: IPR021845 This presumed domain is functionally uncharacterised. This domain is found in bacteria. This domain is typically between 53 to 190 amino acids in length. This domain is found associated with PF01507 from PFAM. This domain has a conserved KND sequence motif.
Probab=23.70 E-value=1.7e+02 Score=23.54 Aligned_cols=77 Identities=17% Similarity=0.160 Sum_probs=45.2
Q ss_pred CCchHHHHHHHHHHHHHH-------------HHhhc---cchhhHHHHHHHHHHHHHHHhhhcchhhHHHHHHHHHHHHH
Q psy4115 50 KIERQVYEYTINQMNVYF-------------QEAES---VSCKSYCEGCMACLTAYFVYLCAETHYEKCLKKVAKFIAEQ 113 (149)
Q Consensus 50 ~Is~eeF~~iI~~IN~~L-------------~~A~~---~s~~n~~e~~L~clT~~l~~~~~~t~~kr~l~~le~yIe~~ 113 (149)
.|+|+-|..+|.++|..= ...-. .||+.|.+-+|+-+ +...-+.-+++++.||+.|
T Consensus 26 vlEPetW~k~v~RV~Ganfg~iY~~t~~~G~~~i~lP~~~TWk~Y~~FLL~Tl--------P~~tr~~Y~~Ki~~~i~~w 97 (181)
T PF11922_consen 26 VLEPETWGKMVGRVNGANFGAIYGGTSAMGYRKIKLPEGHTWKSYAEFLLDTL--------PEKTREHYRNKIAVSIRWW 97 (181)
T ss_pred hcCHHHHHHHHHHHccccccccccCcchhhhhccCCCCCCCHHHHHHHHHHcC--------CHHHHHHHHHHHHHHHHHH
Confidence 489999999999999752 11111 26666665554433 2222245667788888866
Q ss_pred Hhh---c-------CcCCCceeeCccccceEE
Q psy4115 114 NEN---V-------FLPRALMLVNPAERGLRV 135 (149)
Q Consensus 114 N~~---~-------~~~~Gl~ii~Pr~~G~ls 135 (149)
=++ + ...+|++|...-. +|+.
T Consensus 98 ~~~g~~lpde~i~~L~~~~~~~~~~~~-~~~t 128 (181)
T PF11922_consen 98 RKKGGCLPDETIQELEEAGIPIEVNGK-NYRT 128 (181)
T ss_pred HHcCCCCCHHHHHHHHhcCCceeeccc-cccc
Confidence 542 1 1224666665544 5543
No 53
>PF06720 Phi-29_GP16_7: Bacteriophage phi-29 early protein GP16.7; InterPro: IPR009595 The early-expressed gene 16.7 is conserved in bacteriophage phi-29 and related phages. It encodes a membrane protein, GP16.7, consisting of an N-terminal transmembrane domain and a C-terminal DNA-binding and dimerisation domain. GP16.7 plays an important role in organising membrane-associated bacteriophage DNA replication [, ]. The C-terminal domain has a similar secondary structure similar to homeodomains, but forms a fundamentally different tertiary structure consisting of a six-helical dimeric fold []. Multimerisation of this dimer leads to efficient DNA binding.; PDB: 2C5R_B 2BNK_A 1ZAE_B.
Probab=23.69 E-value=26 Score=26.36 Aligned_cols=16 Identities=13% Similarity=0.430 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHhhc
Q psy4115 102 CLKKVAKFIAEQNENV 117 (149)
Q Consensus 102 ~l~~le~yIe~~N~~~ 117 (149)
....+|+||+++|+.+
T Consensus 28 E~~e~edyiEdL~k~i 43 (130)
T PF06720_consen 28 EARELEDYIEDLNKRI 43 (130)
T ss_dssp ----------------
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4566899999999865
No 54
>PF12409 P5-ATPase: P5-type ATPase cation transporter
Probab=23.56 E-value=87 Score=22.66 Aligned_cols=23 Identities=17% Similarity=0.158 Sum_probs=19.4
Q ss_pred chhhHHHHHHHHHHHHHHHhhhc
Q psy4115 74 SCKSYCEGCMACLTAYFVYLCAE 96 (149)
Q Consensus 74 s~~n~~e~~L~clT~~l~~~~~~ 96 (149)
.++.++--++++||+++..+++.
T Consensus 14 ~~r~~l~~~l~ilT~Gll~L~~~ 36 (119)
T PF12409_consen 14 KWRTILYYFLCILTLGLLYLVFR 36 (119)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHh
Confidence 57899999999999999876553
No 55
>PF08388 GIIM: Group II intron, maturase-specific domain; InterPro: IPR013597 This region is found mainly in various bacterial and archaeal species, but a few members of this family are expressed by fungal and chlamydomonal species. It has been implicated in the binding of intron RNA during reverse transcription and splicing [].
Probab=23.15 E-value=2.2e+02 Score=18.42 Aligned_cols=37 Identities=14% Similarity=0.170 Sum_probs=18.3
Q ss_pred chhhHHHHHHHHHHHHHHHhhhcchhhHHHHHHHHHHH
Q psy4115 74 SCKSYCEGCMACLTAYFVYLCAETHYEKCLKKVAKFIA 111 (149)
Q Consensus 74 s~~n~~e~~L~clT~~l~~~~~~t~~kr~l~~le~yIe 111 (149)
+....++.+=..+-||.=++-.. +..+.+.+|+.||.
T Consensus 20 ~~~~~i~~LN~~lrGW~nYy~~~-~~~~~f~~ld~~v~ 56 (80)
T PF08388_consen 20 SLEELIKKLNPILRGWANYYRIG-NSSKTFSKLDHYVW 56 (80)
T ss_pred CHHHHHHHHHHHHHHHHHhhcch-hHHHHHHHHHHHHH
Confidence 33444444444444444444433 33566666666664
No 56
>KOG0164|consensus
Probab=22.34 E-value=14 Score=35.85 Aligned_cols=75 Identities=12% Similarity=0.247 Sum_probs=46.1
Q ss_pred CchHHHHHHHHHHHHHHHHhhccchhhHHHHHHHHHHHHHHH---------hhhcchhhHHHHH-HHHHHHHHHhhcCcC
Q psy4115 51 IERQVYEYTINQMNVYFQEAESVSCKSYCEGCMACLTAYFVY---------LCAETHYEKCLKK-VAKFIAEQNENVFLP 120 (149)
Q Consensus 51 Is~eeF~~iI~~IN~~L~~A~~~s~~n~~e~~L~clT~~l~~---------~~~~t~~kr~l~~-le~yIe~~N~~~~~~ 120 (149)
|=+.-|.=+|++||+.+.---..+.+.. +.++|+|-.|.+. +|..+|-+|-.+- +|--+ .+-++-|..
T Consensus 346 iY~RlF~Wiv~rIn~~i~~~~~~~~~~~-~~Vigvldiygfeif~~NSFEQfcINYCNEKLQQlFIel~L-KqEQEEY~r 423 (1001)
T KOG0164|consen 346 IYSRLFTWIVNRINRSIEVKGVITLRKG-NTVIGVLDIYGFEIFQDNSFEQFCINYCNEKLQQLFIELVL-KQEQEEYER 423 (1001)
T ss_pred HHHHHHHHHHHhhhhheecccccccccC-ceEEEEEEeeeEEeecCCcHHHHHHHHHHHHHHHHHHHHHH-HhhHHHHHh
Confidence 4456688899999998865433322222 5666666667663 7777776653332 23333 334556788
Q ss_pred CCceeeC
Q psy4115 121 RALMLVN 127 (149)
Q Consensus 121 ~Gl~ii~ 127 (149)
.||.|.+
T Consensus 424 EgI~W~~ 430 (1001)
T KOG0164|consen 424 EGIEWTH 430 (1001)
T ss_pred cCCCcee
Confidence 8997764
No 57
>TIGR00691 spoT_relA (p)ppGpp synthetase, RelA/SpoT family. (p)ppGpp is a regulatory metabolite of the stringent response, but appears also to be involved in antibiotic biosynthesis in some species.
Probab=22.04 E-value=3.7e+02 Score=25.55 Aligned_cols=23 Identities=17% Similarity=0.076 Sum_probs=18.0
Q ss_pred CCchHHHHHHHHHHHHHHHHhhc
Q psy4115 50 KIERQVYEYTINQMNVYFQEAES 72 (149)
Q Consensus 50 ~Is~eeF~~iI~~IN~~L~~A~~ 72 (149)
++.|++|..+.+.|+..-.+-+.
T Consensus 164 ~l~p~~y~~i~~~l~~~~~~~~~ 186 (683)
T TIGR00691 164 YLYPKEYENIKSLVNEQKVNREN 186 (683)
T ss_pred hcCHHHHHHHHHHHHHHHHHHHH
Confidence 57889999999999887665444
No 58
>KOG3096|consensus
Probab=21.93 E-value=3.8e+02 Score=22.24 Aligned_cols=47 Identities=28% Similarity=0.312 Sum_probs=36.6
Q ss_pred HHHHHHHHHHHhhcc-chhhHHHHHHHHHHHHHHHhhhcchhhHHHHHHHHHHHHHHh
Q psy4115 59 TINQMNVYFQEAESV-SCKSYCEGCMACLTAYFVYLCAETHYEKCLKKVAKFIAEQNE 115 (149)
Q Consensus 59 iI~~IN~~L~~A~~~-s~~n~~e~~L~clT~~l~~~~~~t~~kr~l~~le~yIe~~N~ 115 (149)
.|..+|-+|-+-+.. .|+.+.+.+-..|| |-+|.|++.-+-|+++|.
T Consensus 112 ~~r~~NLeLM~k~g~eaw~~~ne~le~~l~----------~aqkelq~~kk~iq~vn~ 159 (225)
T KOG3096|consen 112 LVRNENLELMSKHGEEAWKQYNEVLEVMLT----------HAQKELQKTKKLIQDVNR 159 (225)
T ss_pred HHHHhhHHHHHHhhHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHhH
Confidence 456688888777776 77777777655554 559999999999999996
No 59
>PF09457 RBD-FIP: FIP domain ; InterPro: IPR019018 The Rab11 GTPase regulates recycling of internalized plasma membrane receptors and is essential for completion of cytokinesis. A family of Rab11 interacting proteins (FIPs) that conserve a C-terminal Rab-binding domain (RBD) selectively recognise the active form of Rab11. FIPs are diverse in sequence length and composition toward their N-termini, presumably a feature that underpins their specific roles in Rab11-mediated vesicle trafficking. They have been divided into three subfamilies (classe I, II, and III)on the basis of domain architecture. Class I FIPs comprises a subfamily of three proteins (Rip11/pp75/FIP5, Rab-coupling protein (RCP), and FIP2) that possess an N- terminal C2 domain, localize to recycling endosomes, and regulate plasma membrane recycling. The class II subfamily consists of two proteins (FIP3/eferin/arfophilin and FIP4) with tandem EF hands and a proline-rich region. Class II FIPs localize to recycling endosomes, the trans-Golgi network, and have been implicated in the regulation of membrane trafficking during cytokinesis. The class III subfamily consists of a single protein, FIP1, which does not contain obvious homology domains or motifs other than the FIP-RBD [, , , ]. The FIP-RBD domain is also found in Rab6-interacting protein Erc1/Elks. Erc1 is the regulatory subunit of the IKK complex and probably recruits IkappaBalpha/NFKBIA to the complex []. It may be involved in the organisation of the cytomatrix at the nerve terminals active zone (CAZ) which regulates neurotransmitter release. It may also be involved in vesicle trafficking at the CAZ, as well as in Rab-6 regulated endosomes to Golgi transport []. The FIB-RBD domain consists of an N-terminal long alpha-helix, followed by a 90 degrees bend at a conserved proline residue, a 3(10) helix and a C-terminal short beta-strand, adopting an "L" shape. The long alpha-helix forms a parallel coiled-coil homodimer that symmetrically interacts with two Rab11 molecules on both sides, forming a quaternary Rab11-(FIP)2-Rab11 complex. The Rab11-interacting region of FIP-RBD is confined to the C-terminal 24 amino acids, which cover the C-terminal half of the long alpha-helix and the short beta-strand [, , , ]. This entry represents the FIP-RBD domain.; PDB: 2HV8_E 2D7C_D 2K6S_B 2GZD_D 2GZH_B.
Probab=21.85 E-value=95 Score=19.61 Aligned_cols=15 Identities=7% Similarity=0.204 Sum_probs=11.2
Q ss_pred HHHHHHHHHHHHHHh
Q psy4115 101 KCLKKVAKFIAEQNE 115 (149)
Q Consensus 101 r~l~~le~yIe~~N~ 115 (149)
..+++|++||+..=.
T Consensus 21 ~~v~eLe~YiD~LL~ 35 (48)
T PF09457_consen 21 SRVRELEDYIDNLLV 35 (48)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHH
Confidence 467779999997643
No 60
>TIGR02603 CxxCH_TIGR02603 putative heme-binding domain, Pirellula/Verrucomicrobium type. This model represents a domain limited to very few species but expanded into large paralogous families in some species that conain it. We find it in over 20 copies each in Pirellula sp. strain 1 (phylum Planctomycetes) and Verrucomicrobium spinosum DSM 4136 (phylum Verrucomicrobia), and no matches above trusted cutoff an any other species so far. This domain, about 140 amino acids long, contains an absolutely conserved motif CxxCH, the cytochrome c family heme-binding site signature (PS00190).
Probab=21.36 E-value=91 Score=22.92 Aligned_cols=23 Identities=17% Similarity=0.219 Sum_probs=20.1
Q ss_pred cCCchhhcCCCchHHHHHHHHHH
Q psy4115 41 THFPQELEGKIERQVYEYTINQM 63 (149)
Q Consensus 41 t~fP~eL~g~Is~eeF~~iI~~I 63 (149)
+.+|+.|...++++||.+.|.-|
T Consensus 108 S~MP~gl~~~Lt~~e~~dL~aYL 130 (133)
T TIGR02603 108 SLMPEGLEMGLSDQDLADLVAYL 130 (133)
T ss_pred CcCCchhhccCCHHHHHHHHHHH
Confidence 45899999999999999998755
No 61
>PF12193 Sulf_coat_C: Sulfolobus virus coat protein C terminal; InterPro: IPR022014 This domain family is found in viruses, and is approximately 70 amino acids in length. It is the C-terminal of a coat protein in sulfolobus viruses. ; PDB: 3F2E_A.
Probab=21.36 E-value=2.3e+02 Score=18.90 Aligned_cols=35 Identities=14% Similarity=0.203 Sum_probs=23.6
Q ss_pred hHHHHHHHHHHHHHHHHhhccchhhHHHHHHHHHHH
Q psy4115 53 RQVYEYTINQMNVYFQEAESVSCKSYCEGCMACLTA 88 (149)
Q Consensus 53 ~eeF~~iI~~IN~~L~~A~~~s~~n~~e~~L~clT~ 88 (149)
-..+-..|+.+-.++..||+.+ ..++|.+..+.|+
T Consensus 35 gpalvnminglkayyisafnan-p~vldavtniitg 69 (69)
T PF12193_consen 35 GPALVNMINGLKAYYISAFNAN-PQVLDAVTNIITG 69 (69)
T ss_dssp SHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHS
T ss_pred chHHHHHHhhhHHHhhHhhcCC-cHHHHHHHHHhcC
Confidence 3556778888888999999875 4577777666653
No 62
>KOG3014|consensus
Probab=21.34 E-value=1.4e+02 Score=25.32 Aligned_cols=38 Identities=18% Similarity=0.174 Sum_probs=30.5
Q ss_pred HHHHHHHHHHHHHHhhcCcCCCceeeCccccceEEEEE
Q psy4115 101 KCLKKVAKFIAEQNENVFLPRALMLVNPAERGLRVIEI 138 (149)
Q Consensus 101 r~l~~le~yIe~~N~~~~~~~Gl~ii~Pr~~G~lslei 138 (149)
-.++++++-+.-.|.++....-.+...|...+|++|.+
T Consensus 101 ~~~kk~~Ev~~~VnnELg~~~~~~~~~~~~k~~lFIS~ 138 (257)
T KOG3014|consen 101 AALKKVEEVMKMVNNELGYQQIENQCWPKIKTFLFISV 138 (257)
T ss_pred HHHHHHHHHHHHHHhhcCCcccccccccceeEEEEEEe
Confidence 36788888899999988888777777788888888643
No 63
>COG4064 MtrG Tetrahydromethanopterin S-methyltransferase, subunit G [Coenzyme metabolism]
Probab=21.01 E-value=34 Score=23.62 Aligned_cols=21 Identities=19% Similarity=0.251 Sum_probs=11.6
Q ss_pred hHHHHHHHHHHHHHHhhcCcC
Q psy4115 100 EKCLKKVAKFIAEQNENVFLP 120 (149)
Q Consensus 100 kr~l~~le~yIe~~N~~~~~~ 120 (149)
.++|.++|+=+|-.|.++|..
T Consensus 21 ~kRLdeieekvef~~~Ev~Qr 41 (75)
T COG4064 21 HKRLDEIEEKVEFVNGEVYQR 41 (75)
T ss_pred HHHHHHHHHHHHhhHHHHHHH
Confidence 345555666666666555543
No 64
>TIGR01669 phage_XkdX phage uncharacterized protein, XkdX family. This model represents a family of small (about 50 amino acid) phage proteins, found in at least 12 different phage and prophage regions of Gram-positive bacteria. In a number of these phage, the gene for this protein is found near the holin and endolysin genes.
Probab=20.52 E-value=70 Score=19.78 Aligned_cols=13 Identities=23% Similarity=0.327 Sum_probs=11.4
Q ss_pred CCCchHHHHHHHH
Q psy4115 49 GKIERQVYEYTIN 61 (149)
Q Consensus 49 g~Is~eeF~~iI~ 61 (149)
+.|+++|+++|+.
T Consensus 30 ~~IT~eey~eITG 42 (45)
T TIGR01669 30 KLITREQYKVITG 42 (45)
T ss_pred CccCHHHHHHHhC
Confidence 6899999999874
No 65
>PF03304 Mlp: Mlp lipoprotein family; InterPro: IPR004983 The Mlp (for Multicopy Lipoprotein) family of lipoproteins is found in Borrelia species []. This family were previously known as 2.9 lipoprotein genes []. These surface expressed genes may represent new candidate vaccinogens for Lyme disease []. Members of this family generally are downstream of four ORFs called A,B,C and D that are involved in hemolytic activity.
Probab=20.26 E-value=1.4e+02 Score=23.28 Aligned_cols=60 Identities=15% Similarity=0.382 Sum_probs=39.8
Q ss_pred HHHHHHHHHHHHHHHhhc--cchhhHHHHHHHHHHHHHHHhhhcchhhHHH-HHHHHHHHHHHhhcC
Q psy4115 55 VYEYTINQMNVYFQEAES--VSCKSYCEGCMACLTAYFVYLCAETHYEKCL-KKVAKFIAEQNENVF 118 (149)
Q Consensus 55 eF~~iI~~IN~~L~~A~~--~s~~n~~e~~L~clT~~l~~~~~~t~~kr~l-~~le~yIe~~N~~~~ 118 (149)
|+..--++++.+|..+.+ .++-.++.+.++|+++- +-...|+... ..|+.|.......+|
T Consensus 78 El~~af~~vy~FLe~k~sk~k~~~~yIk~~id~~~~~----~~~~k~~~~~~n~I~~~F~g~~~d~f 140 (152)
T PF03304_consen 78 ELLNAFTEVYNFLEKKESKIKDFDEYIKGAIDCCTGN----DNANKYKNTFKNEIEQFFRGVAIDIF 140 (152)
T ss_pred HHHHhHHHHHHHHHHHHhhhhHHHHHHHHHHHhccCc----cchhhcccchHHHHHHHHhccchhhc
Confidence 555566777888877655 27778888888888876 3334555444 667777766654444
No 66
>PRK01026 tetrahydromethanopterin S-methyltransferase subunit G; Provisional
Probab=20.15 E-value=1.1e+02 Score=21.41 Aligned_cols=24 Identities=4% Similarity=0.106 Sum_probs=18.8
Q ss_pred CCchHHHHHHHHHHHHHHHHhhcc
Q psy4115 50 KIERQVYEYTINQMNVYFQEAESV 73 (149)
Q Consensus 50 ~Is~eeF~~iI~~IN~~L~~A~~~ 73 (149)
.+++.+|+++.+++|++=.+-+..
T Consensus 11 iv~~~d~~~i~~rLD~iEeKVEft 34 (77)
T PRK01026 11 VVDPKDFKEIQKRLDEIEEKVEFT 34 (77)
T ss_pred ecCHHHHHHHHHHHHHHHHHHHHH
Confidence 478999999999999876655543
Done!