Query psy18141
Match_columns 113
No_of_seqs 118 out of 172
Neff 4.4
Searched_HMMs 46136
Date Fri Aug 16 23:40:23 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy18141.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/18141hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4100|consensus 100.0 1.1E-46 2.3E-51 273.2 9.8 99 1-100 17-115 (125)
2 PF13233 Complex1_LYR_2: Compl 99.3 4E-12 8.6E-17 87.6 5.3 85 1-89 6-103 (104)
3 PF05347 Complex1_LYR: Complex 98.4 3E-07 6.5E-12 57.1 3.4 50 1-50 8-59 (59)
4 PF13232 Complex1_LYR_1: Compl 98.2 3.3E-06 7.2E-11 53.4 4.5 51 1-51 8-60 (61)
5 KOG4620|consensus 78.9 1.9 4.2E-05 29.7 2.4 34 1-34 16-49 (80)
6 PF13220 DUF4028: Protein of u 73.9 2.1 4.6E-05 28.2 1.4 20 13-32 17-36 (65)
7 PF14304 CSTF_C: Transcription 70.1 5.6 0.00012 24.8 2.6 27 69-95 16-42 (46)
8 KOG3801|consensus 52.5 58 0.0012 23.2 5.5 57 1-57 14-72 (94)
9 PF07813 LTXXQ: LTXXQ motif fa 49.9 22 0.00048 22.9 3.0 24 77-100 14-37 (100)
10 PF11460 DUF3007: Protein of u 48.1 39 0.00084 24.4 4.2 37 39-87 63-99 (104)
11 PF06299 DUF1045: Protein of u 47.1 11 0.00024 28.8 1.3 43 9-51 86-136 (160)
12 PF04282 DUF438: Family of unk 47.0 26 0.00056 23.5 2.9 30 3-39 6-36 (71)
13 KOG3346|consensus 40.9 15 0.00033 28.6 1.3 37 19-55 145-182 (185)
14 PRK10363 cpxP periplasmic repr 40.0 25 0.00054 27.2 2.3 20 77-96 50-69 (166)
15 PF14659 Phage_int_SAM_3: Phag 39.2 54 0.0012 18.9 3.3 29 16-44 27-57 (58)
16 PLN03085 nucleobase:cation sym 35.9 72 0.0016 25.8 4.4 69 8-94 70-139 (221)
17 PRK10455 periplasmic protein; 35.5 31 0.00067 26.0 2.2 20 77-96 56-75 (161)
18 PF13801 Metal_resist: Heavy-m 35.1 38 0.00083 22.0 2.3 21 75-95 39-59 (125)
19 PF00715 IL2: Interleukin 2 Th 34.2 41 0.00088 25.6 2.6 20 33-52 125-144 (145)
20 PRK12750 cpxP periplasmic repr 33.6 49 0.0011 25.1 3.0 20 74-93 133-152 (170)
21 PF14794 DUF4479: Domain of un 33.1 40 0.00087 22.4 2.2 19 75-93 45-63 (73)
22 PRK12750 cpxP periplasmic repr 32.1 44 0.00095 25.4 2.5 18 77-94 52-69 (170)
23 TIGR00787 dctP tripartite ATP- 31.6 56 0.0012 25.1 3.1 32 68-99 209-240 (257)
24 PF11647 PMT_C: C-terminal reg 31.2 67 0.0014 21.1 3.0 16 79-94 37-52 (66)
25 smart00189 IL2 Interleukin-2 f 30.8 48 0.001 25.3 2.5 19 34-52 132-150 (154)
26 COG0334 GdhA Glutamate dehydro 28.9 1.3E+02 0.0027 26.5 5.0 26 26-52 69-94 (411)
27 PRK12751 cpxP periplasmic stre 28.0 68 0.0015 24.4 2.9 25 69-93 121-145 (162)
28 KOG2712|consensus 27.7 45 0.00098 24.3 1.8 19 77-95 82-100 (108)
29 PF08158 NUC130_3NT: NUC130/3N 27.6 35 0.00076 21.4 1.1 15 2-16 22-37 (52)
30 PRK07080 hypothetical protein; 27.4 65 0.0014 27.3 2.9 33 24-56 174-209 (317)
31 PF00046 Homeobox: Homeobox do 26.7 1.1E+02 0.0023 18.0 3.2 25 77-101 6-30 (57)
32 smart00496 IENR2 Intron-encode 26.7 35 0.00076 18.9 0.9 18 66-83 1-18 (26)
33 KOG4049|consensus 26.6 53 0.0011 26.9 2.2 33 17-52 172-204 (248)
34 PF11304 DUF3106: Protein of u 26.2 1.6E+02 0.0034 20.6 4.4 35 9-47 58-92 (107)
35 PHA02902 putative IMV membrane 25.1 58 0.0012 22.0 1.8 27 64-90 38-66 (70)
36 PF12616 DUF3775: Protein of u 24.6 74 0.0016 21.4 2.3 22 74-95 9-30 (75)
37 PF10655 DUF2482: Hypothetical 24.1 61 0.0013 23.2 1.9 29 68-96 6-34 (100)
38 KOG1070|consensus 23.8 97 0.0021 31.7 3.7 41 10-50 1641-1683(1710)
39 PF06937 EURL: EURL protein; 23.7 88 0.0019 26.3 3.0 41 68-108 209-250 (285)
40 cd08816 CARD_RIG-I_1 Caspase a 23.3 56 0.0012 23.0 1.6 15 31-45 18-32 (89)
41 PRK00306 50S ribosomal protein 22.6 1.4E+02 0.0031 19.0 3.3 19 73-91 4-23 (66)
42 PF14842 FliG_N: FliG N-termin 22.4 59 0.0013 22.5 1.6 40 65-104 17-56 (108)
43 PF08863 YolD: YolD-like prote 22.4 72 0.0016 20.6 1.9 20 80-99 2-21 (92)
44 PF08035 Op_neuropeptide: Opio 22.3 12 0.00027 21.6 -1.5 10 39-48 3-12 (31)
45 PF12651 RHH_3: Ribbon-helix-h 22.3 1.4E+02 0.003 17.7 3.0 29 77-105 8-36 (44)
46 PF13024 DUF3884: Protein of u 22.2 44 0.00096 22.7 0.9 11 12-22 22-32 (77)
47 PF06853 DUF1249: Protein of u 22.2 77 0.0017 22.8 2.2 15 35-49 98-112 (120)
48 smart00309 PAH Pancreatic horm 21.9 1.4E+02 0.0029 17.7 2.8 22 31-52 11-32 (36)
49 PF06207 DUF1002: Protein of u 21.7 1.1E+02 0.0023 24.5 3.1 23 78-100 192-214 (225)
50 PF12767 SAGA-Tad1: Transcript 21.7 64 0.0014 25.4 1.8 31 42-83 25-60 (252)
51 TIGR03759 conj_TIGR03759 integ 21.1 1.8E+02 0.004 23.2 4.2 43 41-91 16-61 (200)
No 1
>KOG4100|consensus
Probab=100.00 E-value=1.1e-46 Score=273.17 Aligned_cols=99 Identities=51% Similarity=0.798 Sum_probs=93.0
Q ss_pred ChhhcchhcCChHHHHhhhHHHHHHHhhhcCCCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCcccCCCCHHHHhhcCH
Q psy18141 1 MEMATLHRGLPPELQILGHQYVRDEFRRHKGCNPSEAQVFMNEWAKYAVNLAEQLGIRGPKFAKPLGQSLDEQALNDLRD 80 (113)
Q Consensus 1 RrILRlHR~LP~~mR~LGD~YVK~EFRrHk~~np~~~~~Fl~eW~~Y~~~L~~Q~~~~g~~~~~~~G~~L~~~~le~lsd 80 (113)
|||||+||.||+++|+|||+||||||||||+|||.|+++||+||+.||.+|++|+++.|.. ++.+|.+||++.+++|||
T Consensus 17 kriLrlHr~lp~~~R~lGD~YVkdEFrrHk~vnp~~~~~FlteW~~Ya~~l~qql~~~g~~-K~~~g~~ld~d~le~l~d 95 (125)
T KOG4100|consen 17 KRILRLHRGLPAELRALGDQYVKDEFRRHKTVNPLEAQGFLTEWERYAVALSQQLSSAGKW-KGEIGSDLDSDKLEQLSD 95 (125)
T ss_pred HHHHHHHccCChHHHHHHHHHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHhhhcCcc-cccccccCCHHHHHHcCH
Confidence 6899999999999999999999999999999999999999999999999999999876654 579999999999999999
Q ss_pred HHHHHHHHHHHHhcCCCcch
Q psy18141 81 DQIAQLYELMLAATQPEVLE 100 (113)
Q Consensus 81 EQi~QLyEL~~ea~k~~~~~ 100 (113)
|||+||||||+||+++..++
T Consensus 96 eqi~QLyELm~ea~k~~~~~ 115 (125)
T KOG4100|consen 96 EQIGQLYELMKEAQKHMGDD 115 (125)
T ss_pred HHHHHHHHHHHHHHhhhhhh
Confidence 99999999999999975543
No 2
>PF13233 Complex1_LYR_2: Complex1_LYR-like
Probab=99.30 E-value=4e-12 Score=87.57 Aligned_cols=85 Identities=32% Similarity=0.433 Sum_probs=64.7
Q ss_pred ChhhcchhcCChHHH--------HhhhHHH----HHHHhhhcCCC-HHHHHHHHHHHHHHHHHHHHHhcccCCCCCCccc
Q psy18141 1 MEMATLHRGLPPELQ--------ILGHQYV----RDEFRRHKGCN-PSEAQVFMNEWAKYAVNLAEQLGIRGPKFAKPLG 67 (113)
Q Consensus 1 RrILRlHR~LP~~mR--------~LGD~YV----K~EFRrHk~~n-p~~~~~Fl~eW~~Y~~~L~~Q~~~~g~~~~~~~G 67 (113)
|+|||.|+..|.... +.||.|| ++||++|+.++ +..+..|+.+|+.|+..|.+|..-..--..-..|
T Consensus 6 R~lLRel~~~~~~~~~~~~~~~~~~~~~~v~~~~~~ef~~~~~~~~~~~~~~~~~~~~~y~~~L~~qr~y~eL~erYnpg 85 (104)
T PF13233_consen 6 RSLLRELRRYPRRSKIHQLKAPRSPGDQYVLEQARAEFRRHKSANDEEEAQEFLQEWENYATFLKNQREYKELLERYNPG 85 (104)
T ss_pred HHHHHHHHhhcchhhhhhhhchhHHHHHHHHHHHHHHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc
Confidence 678999987665542 8999999 99999999997 9999999999999999999887310000123678
Q ss_pred CCCCHHHHhhcCHHHHHHHHHH
Q psy18141 68 QSLDEQALNDLRDDQIAQLYEL 89 (113)
Q Consensus 68 ~~L~~~~le~lsdEQi~QLyEL 89 (113)
.+||+++. + ++..+++.|
T Consensus 86 ~~m~qee~--v--~~tA~~VGL 103 (104)
T PF13233_consen 86 LDMTQEER--V--RRTARRVGL 103 (104)
T ss_pred cCCCHHHH--H--HHHHHHhcC
Confidence 88998876 2 555555543
No 3
>PF05347 Complex1_LYR: Complex 1 protein (LYR family); InterPro: IPR008011 This family of short proteins includes proteins from the NADH-ubiquinone oxidoreductase complex I. The family includes the B14 subunit from bovine NADH-ubiquinone oxidoreductase B14 subunit Q02366 from SWISSPROT, and the B22 subunit from the human enzyme Q9Y6M9 from SWISSPROT. The family has been named LYR after a highly conserved tripeptide motif close to the N terminus of these proteins. Members of this family also found in yeast which do contain this complex. In these organisms they are believed to be be required for iron-sulphur custer biogenesis.
Probab=98.40 E-value=3e-07 Score=57.14 Aligned_cols=50 Identities=14% Similarity=0.265 Sum_probs=45.3
Q ss_pred ChhhcchhcCChH-HHHhhhHHHHHHHhhhcCC-CHHHHHHHHHHHHHHHHH
Q psy18141 1 MEMATLHRGLPPE-LQILGHQYVRDEFRRHKGC-NPSEAQVFMNEWAKYAVN 50 (113)
Q Consensus 1 RrILRlHR~LP~~-mR~LGD~YVK~EFRrHk~~-np~~~~~Fl~eW~~Y~~~ 50 (113)
|.+||+.+.+|.+ .|...-.+|++||+.|++. +|..|..+++.+..|+.+
T Consensus 8 R~lLR~~~~~~~~~~r~~~~~~iR~~Fr~n~~~~d~~~I~~~l~~g~~~l~~ 59 (59)
T PF05347_consen 8 RQLLRAARSFPDDSEREYIRAEIRQEFRKNRNETDPEKIEELLKKGEEELEM 59 (59)
T ss_pred HHHHHHHHHcCCcchHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHhcC
Confidence 6899999988865 4999999999999999998 999999999999999863
No 4
>PF13232 Complex1_LYR_1: Complex1_LYR-like
Probab=98.18 E-value=3.3e-06 Score=53.36 Aligned_cols=51 Identities=22% Similarity=0.304 Sum_probs=46.5
Q ss_pred ChhhcchhcCC-hHHHHhhhHHHHHHHhhhcCC-CHHHHHHHHHHHHHHHHHH
Q psy18141 1 MEMATLHRGLP-PELQILGHQYVRDEFRRHKGC-NPSEAQVFMNEWAKYAVNL 51 (113)
Q Consensus 1 RrILRlHR~LP-~~mR~LGD~YVK~EFRrHk~~-np~~~~~Fl~eW~~Y~~~L 51 (113)
|.+||..+.+| +..|.--..+|+++||+|+++ +|..|..++.+|+.-+..|
T Consensus 8 R~lLR~~~~~~~~~~r~~~~~~ir~~Fr~~~~~td~~~i~~~l~~~~~~L~~l 60 (61)
T PF13232_consen 8 RQLLREASKFPDYNFRSYFRRRIRDRFRRNKNVTDPEKIAKLLKEGRKELELL 60 (61)
T ss_pred HHHHHHhhhcCCcchHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHc
Confidence 67899999888 788988899999999999998 9999999999999988765
No 5
>KOG4620|consensus
Probab=78.93 E-value=1.9 Score=29.68 Aligned_cols=34 Identities=26% Similarity=0.376 Sum_probs=24.4
Q ss_pred ChhhcchhcCChHHHHhhhHHHHHHHhhhcCCCH
Q psy18141 1 MEMATLHRGLPPELQILGHQYVRDEFRRHKGCNP 34 (113)
Q Consensus 1 RrILRlHR~LP~~mR~LGD~YVK~EFRrHk~~np 34 (113)
|+.||+-+.=|.+-..==-.||..|||.|.+.+-
T Consensus 16 R~~lraa~~Kp~~~~~~~m~fvh~EFrk~~~lpr 49 (80)
T KOG4620|consen 16 RDLLRAARGKPGAEARRWMAFVHAEFRKHAGLPR 49 (80)
T ss_pred HHHHHHhcCCCchHHHHHHHHHHHHHHHhcCCcH
Confidence 4667777766654444445799999999999843
No 6
>PF13220 DUF4028: Protein of unknown function (DUF4028)
Probab=73.92 E-value=2.1 Score=28.16 Aligned_cols=20 Identities=30% Similarity=0.454 Sum_probs=16.6
Q ss_pred HHHHhhhHHHHHHHhhhcCC
Q psy18141 13 ELQILGHQYVRDEFRRHKGC 32 (113)
Q Consensus 13 ~mR~LGD~YVK~EFRrHk~~ 32 (113)
.+-.=||+|||.=|+.|.+-
T Consensus 17 vqnkng~~yvkkwf~khen~ 36 (65)
T PF13220_consen 17 VQNKNGDQYVKKWFSKHENN 36 (65)
T ss_pred EecccHhHHHHHHHHhccch
Confidence 34456999999999999975
No 7
>PF14304 CSTF_C: Transcription termination and cleavage factor C-terminal; PDB: 2J8P_A.
Probab=70.09 E-value=5.6 Score=24.83 Aligned_cols=27 Identities=26% Similarity=0.324 Sum_probs=22.2
Q ss_pred CCCHHHHhhcCHHHHHHHHHHHHHhcC
Q psy18141 69 SLDEQALNDLRDDQIAQLYELMLAATQ 95 (113)
Q Consensus 69 ~L~~~~le~lsdEQi~QLyEL~~ea~k 95 (113)
.|+++.++.|.++|..+.-.|++...+
T Consensus 16 ~Lt~eQI~~LPp~qR~~I~~Lr~ql~~ 42 (46)
T PF14304_consen 16 QLTPEQINALPPDQRQQILQLRQQLMR 42 (46)
T ss_dssp TS-HHHHHTS-HHHHTHHHHHHHHHH-
T ss_pred cCCHHHHHhCCHHHHHHHHHHHHHHHh
Confidence 799999999999999999999987664
No 8
>KOG3801|consensus
Probab=52.48 E-value=58 Score=23.19 Aligned_cols=57 Identities=21% Similarity=0.243 Sum_probs=47.9
Q ss_pred ChhhcchhcCC-hHHHHhhhHHHHHHHhhhcCC-CHHHHHHHHHHHHHHHHHHHHHhcc
Q psy18141 1 MEMATLHRGLP-PELQILGHQYVRDEFRRHKGC-NPSEAQVFMNEWAKYAVNLAEQLGI 57 (113)
Q Consensus 1 RrILRlHR~LP-~~mR~LGD~YVK~EFRrHk~~-np~~~~~Fl~eW~~Y~~~L~~Q~~~ 57 (113)
|.+||.-+.+| =..|+.--.=.+|=||..|++ +|.++..-+.|=+.=+..|..|...
T Consensus 14 r~~lr~s~qfp~YNyReY~~RrtRD~Fr~Nkn~~Dp~e~~~l~~eakk~LevikRQ~ii 72 (94)
T KOG3801|consen 14 RNLLRESKQFPQYNYREYFQRRTRDTFRANKNVCDPAEIKKLYKEAKKQLEVIKRQSII 72 (94)
T ss_pred HHHHHHHhhCCcccHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34666666677 677888888899999999998 9999999999999999999988753
No 9
>PF07813 LTXXQ: LTXXQ motif family protein; InterPro: IPR012899 This five residue motif is found in a number of bacterial proteins bearing similarity to the protein CpxP (P32158 from SWISSPROT). This is a periplasmic protein that aids in combating extracytoplasmic protein-mediated toxicity, and may also be involved in the response to alkaline pH []. Another member of this family, Spy (P77754 from SWISSPROT) is also a periplasmic protein that may be involved in the response to stress []. The homology between CpxP and Spy may indicate that these two proteins are functionally related []. The motif is found repeated twice in many members of this entry. ; GO: 0042597 periplasmic space; PDB: 3ITF_B 3QZC_B 3OEO_D 3O39_A.
Probab=49.89 E-value=22 Score=22.93 Aligned_cols=24 Identities=21% Similarity=0.101 Sum_probs=17.5
Q ss_pred hcCHHHHHHHHHHHHHhcCCCcch
Q psy18141 77 DLRDDQIAQLYELMLAATQPEVLE 100 (113)
Q Consensus 77 ~lsdEQi~QLyEL~~ea~k~~~~~ 100 (113)
+||++|..++..|+.+........
T Consensus 14 ~LT~eQ~~~~~~i~~~~~~~~~~~ 37 (100)
T PF07813_consen 14 NLTDEQKAKWRAIRQAMKAKMKPL 37 (100)
T ss_dssp --THHHHHHHHHHHHHHCTTS---
T ss_pred CCCHHHHHHHHHHHHHHHHHHHhh
Confidence 589999999999999988776655
No 10
>PF11460 DUF3007: Protein of unknown function (DUF3007); InterPro: IPR021562 This is a family of uncharacterised proteins found in bacteria and eukaryotes.
Probab=48.09 E-value=39 Score=24.38 Aligned_cols=37 Identities=16% Similarity=0.408 Sum_probs=24.0
Q ss_pred HHHHHHHHHHHHHHHHhcccCCCCCCcccCCCCHHHHhhcCHHHHHHHH
Q psy18141 39 VFMNEWAKYAVNLAEQLGIRGPKFAKPLGQSLDEQALNDLRDDQIAQLY 87 (113)
Q Consensus 39 ~Fl~eW~~Y~~~L~~Q~~~~g~~~~~~~G~~L~~~~le~lsdEQi~QLy 87 (113)
-|+.+|++|-+...+.. ..+| +..++.||||.+.+|.
T Consensus 63 Ty~~Q~k~Ye~a~~~~~-----------~~~l-qkRle~l~~eE~~~L~ 99 (104)
T PF11460_consen 63 TYMQQRKDYEEAVDQLT-----------NEEL-QKRLEELSPEELEALQ 99 (104)
T ss_pred cHHHHHHHHHHHHHHHh-----------HHHH-HHHHHhCCHHHHHHHH
Confidence 48899999988775321 1112 2237888888877664
No 11
>PF06299 DUF1045: Protein of unknown function (DUF1045); InterPro: IPR009389 This family consists of several hypothetical proteins from Agrobacterium, Rhizobium and Brucella species. The function of this family is unknown.
Probab=47.10 E-value=11 Score=28.80 Aligned_cols=43 Identities=33% Similarity=0.425 Sum_probs=33.8
Q ss_pred cCChHHHHh----hhHHHHHHHhhhcCC----CHHHHHHHHHHHHHHHHHH
Q psy18141 9 GLPPELQIL----GHQYVRDEFRRHKGC----NPSEAQVFMNEWAKYAVNL 51 (113)
Q Consensus 9 ~LP~~mR~L----GD~YVK~EFRrHk~~----np~~~~~Fl~eW~~Y~~~L 51 (113)
+|.+.++.+ |.=||-+|||=|++. ++........-|+.+..-+
T Consensus 86 ~Ls~~Q~~~L~rWGYPYV~deFRFHmTLTg~l~~~~~~~~~~~l~~~f~~~ 136 (160)
T PF06299_consen 86 GLSPRQRANLERWGYPYVMDEFRFHMTLTGRLDPAERARVEAALEAHFAPL 136 (160)
T ss_pred cCCHHHHHHHHHhCCCceeCcCEeeEEeCCCCCHHHHHHHHHHHHHHHHhh
Confidence 466666654 888999999999975 5668888888888877765
No 12
>PF04282 DUF438: Family of unknown function (DUF438); InterPro: IPR007380 This is a a group of uncharacterised proteins.
Probab=46.96 E-value=26 Score=23.47 Aligned_cols=30 Identities=33% Similarity=0.561 Sum_probs=23.0
Q ss_pred hhcchhcCChHHHHhhhHHHHHHHhhhcC-CCHHHHHH
Q psy18141 3 MATLHRGLPPELQILGHQYVRDEFRRHKG-CNPSEAQV 39 (113)
Q Consensus 3 ILRlHR~LP~~mR~LGD~YVK~EFRrHk~-~np~~~~~ 39 (113)
|+|+|.+=+++ =||.+|..+-. ++|.+|..
T Consensus 6 i~~Lh~G~~~e-------~vk~~F~~~~~~Vs~~EI~~ 36 (71)
T PF04282_consen 6 IKRLHEGEDPE-------EVKEEFKKLFSDVSASEISA 36 (71)
T ss_pred HHHHhCCCCHH-------HHHHHHHHHHCCCCHHHHHH
Confidence 67888875554 48999999985 59998853
No 13
>KOG3346|consensus
Probab=40.89 E-value=15 Score=28.63 Aligned_cols=37 Identities=22% Similarity=0.496 Sum_probs=31.4
Q ss_pred hHHHHHHHhhhcCC-CHHHHHHHHHHHHHHHHHHHHHh
Q psy18141 19 HQYVRDEFRRHKGC-NPSEAQVFMNEWAKYAVNLAEQL 55 (113)
Q Consensus 19 D~YVK~EFRrHk~~-np~~~~~Fl~eW~~Y~~~L~~Q~ 55 (113)
+.+=..||.-|..- +|.-+..|.++|..|+-.+.+++
T Consensus 145 ~~F~~~~F~~~~~lg~PvA~~~f~aq~d~~~~~~~~~~ 182 (185)
T KOG3346|consen 145 GNFNTRKFAKKYELGTPVAGNFFQAQWDDYVPKLYKQL 182 (185)
T ss_pred cceeHHHHHHHhccCCchhhheehhhcchhhHHHHHhh
Confidence 45677899966655 99999999999999999998765
No 14
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed
Probab=39.98 E-value=25 Score=27.17 Aligned_cols=20 Identities=30% Similarity=0.390 Sum_probs=17.7
Q ss_pred hcCHHHHHHHHHHHHHhcCC
Q psy18141 77 DLRDDQIAQLYELMLAATQP 96 (113)
Q Consensus 77 ~lsdEQi~QLyEL~~ea~k~ 96 (113)
.|||+|..|+.+||++....
T Consensus 50 dLTdaQRqQmRdLm~~~r~~ 69 (166)
T PRK10363 50 SLTEHQRQQMRDLMQQARHE 69 (166)
T ss_pred CCCHHHHHHHHHHHHHHHhc
Confidence 68999999999999988754
No 15
>PF14659 Phage_int_SAM_3: Phage integrase, N-terminal SAM-like domain; PDB: 2KD1_A 2KOB_A 2KHQ_A 3LYS_E 2KIW_A 2KKP_A.
Probab=39.15 E-value=54 Score=18.94 Aligned_cols=29 Identities=24% Similarity=0.331 Sum_probs=13.7
Q ss_pred HhhhHHHHHHHhhhc--CCCHHHHHHHHHHH
Q psy18141 16 ILGHQYVRDEFRRHK--GCNPSEAQVFMNEW 44 (113)
Q Consensus 16 ~LGD~YVK~EFRrHk--~~np~~~~~Fl~eW 44 (113)
..-+.||...|-.-+ +++|.++..|+.++
T Consensus 27 ~~~~~~i~p~~g~~~i~~It~~~i~~~~~~l 57 (58)
T PF14659_consen 27 SIIKNHILPYFGNKKIKDITPRDIQNFINEL 57 (58)
T ss_dssp HHHHHHHHHHTTSSBGGG--HHHHHHHHHHH
T ss_pred HHHHHHHHHHHCcCcHHHCCHHHHHHHHHHc
Confidence 344455555553322 23666666666554
No 16
>PLN03085 nucleobase:cation symporter-1; Provisional
Probab=35.89 E-value=72 Score=25.84 Aligned_cols=69 Identities=23% Similarity=0.207 Sum_probs=34.4
Q ss_pred hcCChHHHHhhhHHHHHHHhhhcCCCHHHHHHHHHHHHHHHHHHHHHhcccCCC-CCCcccCCCCHHHHhhcCHHHHHHH
Q psy18141 8 RGLPPELQILGHQYVRDEFRRHKGCNPSEAQVFMNEWAKYAVNLAEQLGIRGPK-FAKPLGQSLDEQALNDLRDDQIAQL 86 (113)
Q Consensus 8 R~LP~~mR~LGD~YVK~EFRrHk~~np~~~~~Fl~eW~~Y~~~L~~Q~~~~g~~-~~~~~G~~L~~~~le~lsdEQi~QL 86 (113)
|+|||++|--|+ .|++| +.|..-+..-....-.+|.= .-...|+.|+-++ +.+-|.-.-.+
T Consensus 70 ~~~~~~~~~~~~-~~~~~----------------~~i~~laE~rI~eAm~~GeFdNLpG~GKPL~ldd-~p~vp~e~r~~ 131 (221)
T PLN03085 70 RKLPPELRGRRN-SVRSE----------------TDIINVVEQRIWHSMEEGHFENLPGKGKPLNLST-NPHADPAEDTL 131 (221)
T ss_pred ccCChhhccccc-cccch----------------HhHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCcc-cCCCCHHHHHH
Confidence 689999987665 35554 33444444332222122211 1124677776544 33333334456
Q ss_pred HHHHHHhc
Q psy18141 87 YELMLAAT 94 (113)
Q Consensus 87 yEL~~ea~ 94 (113)
|.+|+.+-
T Consensus 132 nRILknaG 139 (221)
T PLN03085 132 YRILSKNG 139 (221)
T ss_pred HHHHHHCC
Confidence 66666554
No 17
>PRK10455 periplasmic protein; Reviewed
Probab=35.53 E-value=31 Score=26.03 Aligned_cols=20 Identities=30% Similarity=0.416 Sum_probs=17.4
Q ss_pred hcCHHHHHHHHHHHHHhcCC
Q psy18141 77 DLRDDQIAQLYELMLAATQP 96 (113)
Q Consensus 77 ~lsdEQi~QLyEL~~ea~k~ 96 (113)
+|||+|..|+..||++....
T Consensus 56 ~LT~~Qrqqir~im~~~r~~ 75 (161)
T PRK10455 56 NLTDAQKQQIRDIMKAQRDQ 75 (161)
T ss_pred CCCHHHHHHHHHHHHHHHHh
Confidence 89999999999999887543
No 18
>PF13801 Metal_resist: Heavy-metal resistance; PDB: 3EPV_C 2Y3D_A 2Y3H_D 2Y3G_B 2Y3B_A 2Y39_A 3LAY_H.
Probab=35.07 E-value=38 Score=22.00 Aligned_cols=21 Identities=33% Similarity=0.306 Sum_probs=16.3
Q ss_pred HhhcCHHHHHHHHHHHHHhcC
Q psy18141 75 LNDLRDDQIAQLYELMLAATQ 95 (113)
Q Consensus 75 le~lsdEQi~QLyEL~~ea~k 95 (113)
.-.||+||..++.+++.+...
T Consensus 39 ~l~Lt~eQ~~~l~~~~~~~~~ 59 (125)
T PF13801_consen 39 MLNLTPEQQAKLRALMDEFRQ 59 (125)
T ss_dssp HS-TTHHHHHHHHHHHHHHHH
T ss_pred hcCCCHHHHHHHHHHHHHHHH
Confidence 457899999999998887664
No 19
>PF00715 IL2: Interleukin 2 This family is a subset of the SCOP family.; InterPro: IPR000779 T-Lymphocytes regulate the growth and differentiation of certain lymphopoietic and haemopoietic cells through the release of various secreted protein factors []. These factors, which include interleukin-2 (IL2), are secreted by lectin- or antigen-stimulated T-cells, and have various physiological effects. IL2 is a lymphokine that induces the proliferation of responsive T-cells. In addition, it acts on some B-cells, via receptor-specific binding [], as a growth factor and antibody production stimulant []. The protein is secreted as a single glycosylated polypeptide, and cleavage of a signal sequence is required for its activity []. Solution NMR suggests that the structure of IL2 comprises a bundle of 4 helices (termed A-D), flanked by 2 shorter helices and several poorly-defined loops. Residues in helix A, and in the loop region between helices A and B, are important for receptor binding. Secondary structure analysis has suggested similarity to IL4 and granulocyte-macrophage colony stimulating factor (GMCSF) [].; GO: 0005134 interleukin-2 receptor binding, 0008083 growth factor activity, 0006955 immune response, 0005576 extracellular region; PDB: 3QAZ_M 2B5I_A 1Z92_A 3QB1_F 2ERJ_H 1NBP_A 1PY2_C 1IRL_A 1QVN_A 3INK_C ....
Probab=34.20 E-value=41 Score=25.59 Aligned_cols=20 Identities=15% Similarity=0.517 Sum_probs=16.5
Q ss_pred CHHHHHHHHHHHHHHHHHHH
Q psy18141 33 NPSEAQVFMNEWAKYAVNLA 52 (113)
Q Consensus 33 np~~~~~Fl~eW~~Y~~~L~ 52 (113)
...-|++||..|=.+|..|-
T Consensus 125 et~tiveFln~WItFCQsii 144 (145)
T PF00715_consen 125 ETVTIVEFLNKWITFCQSII 144 (145)
T ss_dssp SEBEHHHHHHHHHHHHHHHH
T ss_pred ceehHHHHHHHHHHHHHHhh
Confidence 35568999999999998763
No 20
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=33.55 E-value=49 Score=25.12 Aligned_cols=20 Identities=30% Similarity=0.258 Sum_probs=16.7
Q ss_pred HHhhcCHHHHHHHHHHHHHh
Q psy18141 74 ALNDLRDDQIAQLYELMLAA 93 (113)
Q Consensus 74 ~le~lsdEQi~QLyEL~~ea 93 (113)
.+.-|+|||..+|.+++++=
T Consensus 133 ~~~vLTpEQRak~~e~~~~r 152 (170)
T PRK12750 133 MLSILTPEQKAKFQELQQER 152 (170)
T ss_pred HHHhCCHHHHHHHHHHHHHH
Confidence 37789999999999998764
No 21
>PF14794 DUF4479: Domain of unknown function (DUF4479); PDB: 3BU2_C.
Probab=33.15 E-value=40 Score=22.40 Aligned_cols=19 Identities=26% Similarity=0.298 Sum_probs=12.2
Q ss_pred HhhcCHHHHHHHHHHHHHh
Q psy18141 75 LNDLRDDQIAQLYELMLAA 93 (113)
Q Consensus 75 le~lsdEQi~QLyEL~~ea 93 (113)
.=.||+||+.+|.++-.++
T Consensus 45 ~V~Lt~eqv~~LN~~l~~~ 63 (73)
T PF14794_consen 45 QVFLTEEQVAKLNQALQKA 63 (73)
T ss_dssp -----HHHHHHHHHHHHHT
T ss_pred CEEcCHHHHHHHHHHHHHc
Confidence 4479999999999987765
No 22
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=32.07 E-value=44 Score=25.38 Aligned_cols=18 Identities=44% Similarity=0.407 Sum_probs=15.2
Q ss_pred hcCHHHHHHHHHHHHHhc
Q psy18141 77 DLRDDQIAQLYELMLAAT 94 (113)
Q Consensus 77 ~lsdEQi~QLyEL~~ea~ 94 (113)
.|||+|..||..++++..
T Consensus 52 ~LTdeQk~qik~i~~~~r 69 (170)
T PRK12750 52 DLTDAQKEQLKEMREANR 69 (170)
T ss_pred CCCHHHHHHHHHHHHHHH
Confidence 689999999999886654
No 23
>TIGR00787 dctP tripartite ATP-independent periplasmic transporter solute receptor, DctP family. TRAP-T (Tripartite ATP-independent Periplasmic Transporter) family proteins generally consist of three components, and these systems have so far been found in Gram-negative bacteria, Gram-postive bacteria and archaea. The best characterized example is the DctPQM system of Rhodobacter capsulatus, a C4 dicarboxylate (malate, fumarate, succinate) transporter. This model represents the DctP family, one of at least three major families of extracytoplasmic solute receptor for TRAP family transporters. Other are the SnoM family (see pfam03480) and TAXI (TRAP-associated extracytoplasmic immunogenic) family.
Probab=31.58 E-value=56 Score=25.14 Aligned_cols=32 Identities=16% Similarity=0.024 Sum_probs=26.2
Q ss_pred CCCCHHHHhhcCHHHHHHHHHHHHHhcCCCcc
Q psy18141 68 QSLDEQALNDLRDDQIAQLYELMLAATQPEVL 99 (113)
Q Consensus 68 ~~L~~~~le~lsdEQi~QLyEL~~ea~k~~~~ 99 (113)
--++.+..+.||+||...|.+...++.....+
T Consensus 209 ~~~n~~~~~~L~~e~q~~i~~a~~~~~~~~~~ 240 (257)
T TIGR00787 209 VVVNKAFWKSLPPDLQAVVKEAAKEAGEYQRK 240 (257)
T ss_pred EEEeHHHHhcCCHHHHHHHHHHHHHHHHHHHH
Confidence 45678999999999999999998888654444
No 24
>PF11647 PMT_C: C-terminal region of Pasteurella multocida toxin residues 569-1285; InterPro: IPR020972 This entry represents the C-terminal domain of Pasteurella multocida toxin (PMT) which displays a Trojan horse-like shape with three domains, C1, C2 and C3. The C3 domain possesses the Cys-His-Asp catalytic triad. PMT is an enzyme toxin carrying the cysteine protease-like catalytic triad which functions on the cytoplasmic face of the plasma membrane of target cells []. This entry is also found in the Vibrio cholerae RTX toxin [], a bacterial toxin that self-process by a cysteine peptidase mechanism. These cysteine peptidases belong to MEROPS peptidase family C80 (RTX self-cleaving toxin, clan CD). ; PDB: 4ERR_A 2EC5_B 2EBH_X 2EBF_X.
Probab=31.15 E-value=67 Score=21.14 Aligned_cols=16 Identities=25% Similarity=0.376 Sum_probs=14.0
Q ss_pred CHHHHHHHHHHHHHhc
Q psy18141 79 RDDQIAQLYELMLAAT 94 (113)
Q Consensus 79 sdEQi~QLyEL~~ea~ 94 (113)
..+++..||+|++...
T Consensus 37 ~~~~V~~l~~L~~~ie 52 (66)
T PF11647_consen 37 GREGVETLYELRKQIE 52 (66)
T ss_dssp CHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHH
Confidence 5999999999998764
No 25
>smart00189 IL2 Interleukin-2 family. Interleukin-2 is a cytokine produced by T-helper cells in response to antigenic or mitogenic stimulation. This protein is required for T-cell proliferation and other activities crucial to the regulation of the immune response.
Probab=30.84 E-value=48 Score=25.31 Aligned_cols=19 Identities=16% Similarity=0.511 Sum_probs=16.2
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy18141 34 PSEAQVFMNEWAKYAVNLA 52 (113)
Q Consensus 34 p~~~~~Fl~eW~~Y~~~L~ 52 (113)
..-|+.||+.|-.+|..|-
T Consensus 132 t~tivEFLn~WItfCQsi~ 150 (154)
T smart00189 132 SVTIVEFLNRWIAFCQSII 150 (154)
T ss_pred eehHHHHHHHHHHHHHHHH
Confidence 3468999999999999875
No 26
>COG0334 GdhA Glutamate dehydrogenase/leucine dehydrogenase [Amino acid transport and metabolism]
Probab=28.94 E-value=1.3e+02 Score=26.46 Aligned_cols=26 Identities=23% Similarity=0.273 Sum_probs=21.8
Q ss_pred HhhhcCCCHHHHHHHHHHHHHHHHHHH
Q psy18141 26 FRRHKGCNPSEAQVFMNEWAKYAVNLA 52 (113)
Q Consensus 26 FRrHk~~np~~~~~Fl~eW~~Y~~~L~ 52 (113)
.|-|-.+|.+++. ||+-|..|-..|.
T Consensus 69 iRfhP~v~~~ev~-~Ls~~MT~Knal~ 94 (411)
T COG0334 69 VRFHPYVTLEEVK-ALSFWMTLKNALA 94 (411)
T ss_pred eecCCCCCHHHHH-HHHHHHHHHHHHh
Confidence 3668888888777 9999999999886
No 27
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=27.97 E-value=68 Score=24.36 Aligned_cols=25 Identities=20% Similarity=0.118 Sum_probs=19.2
Q ss_pred CCCHHHHhhcCHHHHHHHHHHHHHh
Q psy18141 69 SLDEQALNDLRDDQIAQLYELMLAA 93 (113)
Q Consensus 69 ~L~~~~le~lsdEQi~QLyEL~~ea 93 (113)
+.-.+.++-|+|||..+|.+.+++-
T Consensus 121 ~~~~qmy~lLTPEQra~l~~~~e~r 145 (162)
T PRK12751 121 KVRNQMYNLLTPEQKEALNKKHQER 145 (162)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 3345568889999999999888764
No 28
>KOG2712|consensus
Probab=27.75 E-value=45 Score=24.26 Aligned_cols=19 Identities=21% Similarity=0.186 Sum_probs=16.3
Q ss_pred hcCHHHHHHHHHHHHHhcC
Q psy18141 77 DLRDDQIAQLYELMLAATQ 95 (113)
Q Consensus 77 ~lsdEQi~QLyEL~~ea~k 95 (113)
.||.+|..+|.+++.++.+
T Consensus 82 SLs~~qW~~Lk~~~~eId~ 100 (108)
T KOG2712|consen 82 SLSLEQWSKLKEHIEEIDK 100 (108)
T ss_pred ccCHHHHHHHHHHHHHHHH
Confidence 5788999999999998764
No 29
>PF08158 NUC130_3NT: NUC130/3NT domain; InterPro: IPR012977 This N-terminal domain is found in a novel nucleolar protein family defined by NUC130/133 [].
Probab=27.62 E-value=35 Score=21.40 Aligned_cols=15 Identities=27% Similarity=0.578 Sum_probs=11.7
Q ss_pred hhhcchh-cCChHHHH
Q psy18141 2 EMATLHR-GLPPELQI 16 (113)
Q Consensus 2 rILRlHR-~LP~~mR~ 16 (113)
.||+-|- .|+|+||.
T Consensus 22 ~lL~~~~~~L~p~lR~ 37 (52)
T PF08158_consen 22 DLLRNHHTVLDPDLRM 37 (52)
T ss_pred HHHHhccccCCHHHHH
Confidence 4677776 69999985
No 30
>PRK07080 hypothetical protein; Validated
Probab=27.40 E-value=65 Score=27.30 Aligned_cols=33 Identities=24% Similarity=0.518 Sum_probs=29.4
Q ss_pred HHHhhhcCC---CHHHHHHHHHHHHHHHHHHHHHhc
Q psy18141 24 DEFRRHKGC---NPSEAQVFMNEWAKYAVNLAEQLG 56 (113)
Q Consensus 24 ~EFRrHk~~---np~~~~~Fl~eW~~Y~~~L~~Q~~ 56 (113)
.|||++-=+ ||+++..|...|-.|...|..-++
T Consensus 174 ~~F~mrE~V~iGt~e~v~~~r~~w~e~~~~l~~~Lg 209 (317)
T PRK07080 174 QLFRMREYVRIGTPEQIVAFRQSWIERGTAMADALG 209 (317)
T ss_pred hheeeeEEEEecCHHHHHHHHHHHHHHHHHHHHHhC
Confidence 589998865 999999999999999999987775
No 31
>PF00046 Homeobox: Homeobox domain not present here.; InterPro: IPR001356 The homeobox domain was first identified in a number of drosophila homeotic and segmentation proteins, but is now known to be well-conserved in many other animals, including vertebrates [, , ]. Hox genes encode homeodomain-containing transcriptional regulators that operate differential genetic programs along the anterior-posterior axis of animal bodies []. The domain binds DNA through a helix-turn-helix (HTH) structure. The HTH motif is characterised by two alpha-helices, which make intimate contacts with the DNA and are joined by a short turn. The second helix binds to DNA via a number of hydrogen bonds and hydrophobic interactions, which occur between specific side chains and the exposed bases and thymine methyl groups within the major groove of the DNA []. The first helix helps to stabilise the structure. The motif is very similar in sequence and structure in a wide range of DNA-binding proteins (e.g., cro and repressor proteins, homeotic proteins, etc.). One of the principal differences between HTH motifs in these different proteins arises from the stereo-chemical requirement for glycine in the turn which is needed to avoid steric interference of the beta-carbon with the main chain: for cro and repressor proteins the glycine appears to be mandatory, while for many of the homeotic and other DNA-binding proteins the requirement is relaxed.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2DA3_A 1LFB_A 2LFB_A 2ECB_A 2DA5_A 3D1N_O 3A03_A 2XSD_C 3CMY_A 1AHD_P ....
Probab=26.73 E-value=1.1e+02 Score=17.98 Aligned_cols=25 Identities=20% Similarity=0.236 Sum_probs=18.8
Q ss_pred hcCHHHHHHHHHHHHHhcCCCcchH
Q psy18141 77 DLRDDQIAQLYELMLAATQPEVLEA 101 (113)
Q Consensus 77 ~lsdEQi~QLyEL~~ea~k~~~~~~ 101 (113)
.+|++|+..|.+.......|.....
T Consensus 6 ~~t~~q~~~L~~~f~~~~~p~~~~~ 30 (57)
T PF00046_consen 6 RFTKEQLKVLEEYFQENPYPSKEER 30 (57)
T ss_dssp SSSHHHHHHHHHHHHHSSSCHHHHH
T ss_pred CCCHHHHHHHHHHHHHhcccccccc
Confidence 5899999999999998555544433
No 32
>smart00496 IENR2 Intron-encoded nuclease repeat 2. Short helical motif of unknown function (unpublished results).
Probab=26.68 E-value=35 Score=18.87 Aligned_cols=18 Identities=11% Similarity=0.250 Sum_probs=14.3
Q ss_pred ccCCCCHHHHhhcCHHHH
Q psy18141 66 LGQSLDEQALNDLRDDQI 83 (113)
Q Consensus 66 ~G~~L~~~~le~lsdEQi 83 (113)
+|+..|++...+||+-..
T Consensus 1 ~GkkhSEETK~KMSea~~ 18 (26)
T smart00496 1 KGKKHSEETKKKMSEAAX 18 (26)
T ss_pred CCCCCCHHHHHHHHHhhc
Confidence 478899999999888643
No 33
>KOG4049|consensus
Probab=26.63 E-value=53 Score=26.92 Aligned_cols=33 Identities=36% Similarity=0.467 Sum_probs=27.7
Q ss_pred hhhHHHHHHHhhhcCCCHHHHHHHHHHHHHHHHHHH
Q psy18141 17 LGHQYVRDEFRRHKGCNPSEAQVFMNEWAKYAVNLA 52 (113)
Q Consensus 17 LGD~YVK~EFRrHk~~np~~~~~Fl~eW~~Y~~~L~ 52 (113)
-||.=++.|| .|.++.++..|..||+.=+.-+.
T Consensus 172 tGd~eerQef---iNLdEseA~qfd~ew~se~s~~~ 204 (248)
T KOG4049|consen 172 TGDVEERQEF---INLDESEAPQFDTEWASEASAIM 204 (248)
T ss_pred cccHHHHHHh---hcccccchhhHhHHHHHHHHhcc
Confidence 4899999998 47788899999999998777654
No 34
>PF11304 DUF3106: Protein of unknown function (DUF3106); InterPro: IPR021455 Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known.
Probab=26.24 E-value=1.6e+02 Score=20.63 Aligned_cols=35 Identities=29% Similarity=0.624 Sum_probs=21.9
Q ss_pred cCChHHHHhhhHHHHHHHhhhcCCCHHHHHHHHHHHHHH
Q psy18141 9 GLPPELQILGHQYVRDEFRRHKGCNPSEAQVFMNEWAKY 47 (113)
Q Consensus 9 ~LP~~mR~LGD~YVK~EFRrHk~~np~~~~~Fl~eW~~Y 47 (113)
.|||+.|. -|+..|+.=+..+|++=......|..|
T Consensus 58 ~LspeqR~----~~R~~~~~~~~Lpp~qR~~lr~~w~~y 92 (107)
T PF11304_consen 58 ALSPEQRQ----QARENYQRFKQLPPEQRQALRARWEAY 92 (107)
T ss_pred hCCHHHHH----HHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 36666654 356666666666666666666666666
No 35
>PHA02902 putative IMV membrane protein; Provisional
Probab=25.10 E-value=58 Score=21.97 Aligned_cols=27 Identities=30% Similarity=0.546 Sum_probs=19.6
Q ss_pred CcccCCCCHHH--HhhcCHHHHHHHHHHH
Q psy18141 64 KPLGQSLDEQA--LNDLRDDQIAQLYELM 90 (113)
Q Consensus 64 ~~~G~~L~~~~--le~lsdEQi~QLyEL~ 90 (113)
..+|.-|..+- -++|+|+|+.-|..|-
T Consensus 38 ~~~~~~l~~d~~F~D~lTpDQirAlHrlv 66 (70)
T PHA02902 38 ERFGDTLEDDPLFKDSLTPDQIKALHRLV 66 (70)
T ss_pred ccccccCCCCchhhccCCHHHHHHHHHHH
Confidence 45666664333 5799999999998874
No 36
>PF12616 DUF3775: Protein of unknown function (DUF3775); InterPro: IPR022254 This domain family is found in bacteria, and is approximately 80 amino acids in length. There is a single completely conserved residue G that may be functionally important.
Probab=24.57 E-value=74 Score=21.36 Aligned_cols=22 Identities=32% Similarity=0.466 Sum_probs=18.2
Q ss_pred HHhhcCHHHHHHHHHHHHHhcC
Q psy18141 74 ALNDLRDDQIAQLYELMLAATQ 95 (113)
Q Consensus 74 ~le~lsdEQi~QLyEL~~ea~k 95 (113)
-++.||++|...|.-||==-..
T Consensus 9 ~I~~l~~deqaeLvALmwiGRG 30 (75)
T PF12616_consen 9 FIEDLNEDEQAELVALMWIGRG 30 (75)
T ss_pred HHHhCCHHHHHHHHHHHHhcCC
Confidence 4889999999999999965443
No 37
>PF10655 DUF2482: Hypothetical protein of unknown function (DUF2482); InterPro: IPR018917 This entry is represented by Bacteriophage 80, Orf10. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. All the members of this very small, very short family are derived from bacteriophages, of the SA bacteriophages 11, Mu50B, system, and from the Staphylococcal_phi-Mu50B-like_prophages subsystem. All members are hypothetical proteins.
Probab=24.13 E-value=61 Score=23.25 Aligned_cols=29 Identities=21% Similarity=0.439 Sum_probs=24.6
Q ss_pred CCCCHHHHhhcCHHHHHHHHHHHHHhcCC
Q psy18141 68 QSLDEQALNDLRDDQIAQLYELMLAATQP 96 (113)
Q Consensus 68 ~~L~~~~le~lsdEQi~QLyEL~~ea~k~ 96 (113)
++|+.+++..|=.|--+-||+|-.|+.+.
T Consensus 6 KdMTqeelr~llseK~~ELydL~~eI~kE 34 (100)
T PF10655_consen 6 KDMTQEELRDLLSEKNGELYDLANEIDKE 34 (100)
T ss_pred hhhhHHHHHHHHHHhhHHHHHHHHHhccc
Confidence 57888888888788889999999999875
No 38
>KOG1070|consensus
Probab=23.77 E-value=97 Score=31.67 Aligned_cols=41 Identities=24% Similarity=0.475 Sum_probs=35.3
Q ss_pred CChHHHHhhhHHHHHHHhh--hcCCCHHHHHHHHHHHHHHHHH
Q psy18141 10 LPPELQILGHQYVRDEFRR--HKGCNPSEAQVFMNEWAKYAVN 50 (113)
Q Consensus 10 LP~~mR~LGD~YVK~EFRr--Hk~~np~~~~~Fl~eW~~Y~~~ 50 (113)
+.-+++..++.||++=|+| |++..|..+.-|+.-|=.|-..
T Consensus 1641 id~eik~~~~~~vR~lfeRvi~l~l~~kkmKfffKkwLeyEk~ 1683 (1710)
T KOG1070|consen 1641 IDMEIKHGDIKYVRDLFERVIELKLSIKKMKFFFKKWLEYEKS 1683 (1710)
T ss_pred HHHHHccCCHHHHHHHHHHHHhcCCChhHhHHHHHHHHHHHHh
Confidence 3457888999999999999 5566999999999999999764
No 39
>PF06937 EURL: EURL protein; InterPro: IPR009704 This family consists of several animal EURL proteins. EURL is preferentially expressed in chick retinal precursor cells as well as in the anterior epithelial cells of the lens at early stages of development. EURL transcripts are found primarily in the peripheral dorsal retina, i.e., the most undifferentiated part of the dorsal retina. EURL transcripts are also detected in the lens at stage 18 and remain abundant in the proliferating epithelial cells of the lens until at least day 11. The distribution pattern of EURL in the developing retina and lens suggest a role before the events leading to cell determination and differentiation [].
Probab=23.69 E-value=88 Score=26.26 Aligned_cols=41 Identities=22% Similarity=0.233 Sum_probs=33.0
Q ss_pred CCCCHHHHhhcCHHHHHHHHH-HHHHhcCCCcchHHHHHHHH
Q psy18141 68 QSLDEQALNDLRDDQIAQLYE-LMLAATQPEVLEAEEAQAKA 108 (113)
Q Consensus 68 ~~L~~~~le~lsdEQi~QLyE-L~~ea~k~~~~~~~~~~~~~ 108 (113)
...|.++|+.|+-+++.||.+ |.+.++..+.+..-..|.|-
T Consensus 209 ~~~SrEeL~~Mt~~EL~qL~~~L~~qIq~vfeeLt~~vQEKD 250 (285)
T PF06937_consen 209 PHYSREELNSMTLDELKQLNEKLLQQIQDVFEELTQQVQEKD 250 (285)
T ss_pred cccCHHHhhhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 467889999999999999975 78888888777776666553
No 40
>cd08816 CARD_RIG-I_1 Caspase activation and recruitment domain found in RIG-I, first repeat. Caspase activation and recruitment domain (CARD) found in RIG-I (Retinoic acid Inducible Gene I, also known as Ddx58), first repeat. RIG-I is a cytoplasmic RNA helicase that plays an important role in host antiviral response by sensing incoming viral RNA. RIG-I contains two N-terminal CARD domains and a C-terminal RNA helicase. Upon activation, the signal is transferred to downstream pathways via the adaptor molecule IPS-1 (MAVS, VISA, CARDIF), leading to the induction of type I interferons. Although very similar in sequence, RIG-I recognizes different sets of viruses compared to MDA5, a related RNA helicase. RIG-I associates with IPS-1 through a CARD-CARD interaction. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction doma
Probab=23.27 E-value=56 Score=23.03 Aligned_cols=15 Identities=33% Similarity=0.454 Sum_probs=13.2
Q ss_pred CCCHHHHHHHHHHHH
Q psy18141 31 GCNPSEAQVFMNEWA 45 (113)
Q Consensus 31 ~~np~~~~~Fl~eW~ 45 (113)
..+|..|.+||+.|=
T Consensus 18 tl~P~yIl~~m~~~~ 32 (89)
T cd08816 18 ILRPSYILGFMTTWL 32 (89)
T ss_pred hhchHHHHHHHHHhc
Confidence 459999999999994
No 41
>PRK00306 50S ribosomal protein L29; Reviewed
Probab=22.61 E-value=1.4e+02 Score=19.00 Aligned_cols=19 Identities=26% Similarity=0.461 Sum_probs=8.6
Q ss_pred HHHhhcCHHHH-HHHHHHHH
Q psy18141 73 QALNDLRDDQI-AQLYELML 91 (113)
Q Consensus 73 ~~le~lsdEQi-~QLyEL~~ 91 (113)
.++..||++++ ..|.+|.+
T Consensus 4 ~elr~ls~~eL~~~l~~lkk 23 (66)
T PRK00306 4 KELRELSVEELNEKLLELKK 23 (66)
T ss_pred HHHhhCCHHHHHHHHHHHHH
Confidence 34555555554 33344433
No 42
>PF14842 FliG_N: FliG N-terminal domain; PDB: 3HJL_A 3AJC_A 3USY_B.
Probab=22.44 E-value=59 Score=22.46 Aligned_cols=40 Identities=23% Similarity=0.322 Sum_probs=26.3
Q ss_pred cccCCCCHHHHhhcCHHHHHHHHHHHHHhcCCCcchHHHH
Q psy18141 65 PLGQSLDEQALNDLRDDQIAQLYELMLAATQPEVLEAEEA 104 (113)
Q Consensus 65 ~~G~~L~~~~le~lsdEQi~QLyEL~~ea~k~~~~~~~~~ 104 (113)
.+|.+-..+.+..|+++++.+|-.-|-....+..+..+..
T Consensus 17 ~Lgee~Aa~vlk~l~~~ei~~i~~~ma~l~~v~~~~~~~V 56 (108)
T PF14842_consen 17 ALGEEAAAEVLKHLDEEEIERISREMAKLGSVSPEEVEEV 56 (108)
T ss_dssp HS-HHHHHHHHHHS-HHHHHHHHHHHHT-----HHHHHHH
T ss_pred HHCHHHHHHHHccCCHHHHHHHHHHHHccCCCCHHHHHHH
Confidence 3677778888999999999999888888777776665554
No 43
>PF08863 YolD: YolD-like protein; InterPro: IPR014962 These proteins are functionally uncharacterised. However it has been predicted that these proteins are functionally equivalent to the UmuD subunit of polymerase V from Gram-negative bacteria [].
Probab=22.40 E-value=72 Score=20.57 Aligned_cols=20 Identities=20% Similarity=0.180 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHhcCCCcc
Q psy18141 80 DDQIAQLYELMLAATQPEVL 99 (113)
Q Consensus 80 dEQi~QLyEL~~ea~k~~~~ 99 (113)
|||...|.+++.+..+....
T Consensus 2 PEH~e~L~~~~~e~~k~~kp 21 (92)
T PF08863_consen 2 PEHKEALRELIKEQNKVEKP 21 (92)
T ss_pred ChHHHHHHHHHHHhcccCCC
Confidence 34555555555555544433
No 44
>PF08035 Op_neuropeptide: Opioids neuropeptide; InterPro: IPR013532 Pro-opiomelanocortin is present in high levels in the pituitary and is processed into 3 major peptide families: adrenocorticotrophin (ACTH); alpha-, beta- and gamma-melanocyte- stimulating hormones (MSH); and beta-endorphin []. ACTH regulates the synthesis and release of glucocorticoids and, to some extent, aldosterone in the adrenal cortex. It is synthesised and released in response to corticotrophin-releasing factor at times of stress (i.e. heat, cold, infection, etc.), its release leading to increased metabolism. The action of MSH in man is poorly understood, but it may be involved in temperature regulation []. Full activity of ACTH resides in the first 20 N-terminal amino acids, the first 13 of which are identical to alpha-MSH [, ]. This region corresponds to the conserved YGG motif that is found in a wide variety of opioid neuropeptides such as enkephalin
Probab=22.35 E-value=12 Score=21.61 Aligned_cols=10 Identities=30% Similarity=0.780 Sum_probs=8.0
Q ss_pred HHHHHHHHHH
Q psy18141 39 VFMNEWAKYA 48 (113)
Q Consensus 39 ~Fl~eW~~Y~ 48 (113)
+||++|+.+.
T Consensus 3 GFM~s~~e~s 12 (31)
T PF08035_consen 3 GFMKSWDERS 12 (31)
T ss_pred cccccccccc
Confidence 7999998764
No 45
>PF12651 RHH_3: Ribbon-helix-helix domain
Probab=22.28 E-value=1.4e+02 Score=17.72 Aligned_cols=29 Identities=24% Similarity=0.186 Sum_probs=22.9
Q ss_pred hcCHHHHHHHHHHHHHhcCCCcchHHHHH
Q psy18141 77 DLRDDQIAQLYELMLAATQPEVLEAEEAQ 105 (113)
Q Consensus 77 ~lsdEQi~QLyEL~~ea~k~~~~~~~~~~ 105 (113)
.+++|...+|.+|-.++.-+....-++|-
T Consensus 8 ~l~~el~~~L~~ls~~t~i~~S~Ll~eAl 36 (44)
T PF12651_consen 8 SLDKELYEKLKELSEETGIPKSKLLREAL 36 (44)
T ss_pred ecCHHHHHHHHHHHHHHCCCHHHHHHHHH
Confidence 46778889999999999887777766664
No 46
>PF13024 DUF3884: Protein of unknown function (DUF3884)
Probab=22.19 E-value=44 Score=22.74 Aligned_cols=11 Identities=64% Similarity=1.047 Sum_probs=10.0
Q ss_pred hHHHHhhhHHH
Q psy18141 12 PELQILGHQYV 22 (113)
Q Consensus 12 ~~mR~LGD~YV 22 (113)
|+|.+||+=||
T Consensus 22 ~~lk~LG~W~~ 32 (77)
T PF13024_consen 22 PELKSLGKWYV 32 (77)
T ss_pred HHHHHhhceee
Confidence 69999999888
No 47
>PF06853 DUF1249: Protein of unknown function (DUF1249); InterPro: IPR009659 This family consists of several hypothetical bacterial proteins of around 150 residues in length. The function of this family is unknown.
Probab=22.15 E-value=77 Score=22.78 Aligned_cols=15 Identities=27% Similarity=0.773 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHHHH
Q psy18141 35 SEAQVFMNEWAKYAV 49 (113)
Q Consensus 35 ~~~~~Fl~eW~~Y~~ 49 (113)
-++-.||.+|=.||-
T Consensus 98 ~q~N~FL~eWL~~CL 112 (120)
T PF06853_consen 98 WQLNRFLAEWLRYCL 112 (120)
T ss_pred HHHHHHHHHHHHHHH
Confidence 367899999999996
No 48
>smart00309 PAH Pancreatic hormones / neuropeptide F / peptide YY family. Pancreatic hormone is a regulator of pancreatic and gastrointestinal functions.
Probab=21.91 E-value=1.4e+02 Score=17.70 Aligned_cols=22 Identities=9% Similarity=0.357 Sum_probs=18.3
Q ss_pred CCCHHHHHHHHHHHHHHHHHHH
Q psy18141 31 GCNPSEAQVFMNEWAKYAVNLA 52 (113)
Q Consensus 31 ~~np~~~~~Fl~eW~~Y~~~L~ 52 (113)
+++|+++..++++-+.|...+.
T Consensus 11 ~a~~e~l~~Y~~~L~~Yinlit 32 (36)
T smart00309 11 DASPEDLRQYLAALREYINLIT 32 (36)
T ss_pred CCCHHHHHHHHHHHHHHHHHhc
Confidence 3588999999999999988774
No 49
>PF06207 DUF1002: Protein of unknown function (DUF1002); InterPro: IPR009343 This protein family has no known function. Its members are about 300 amino acids in length. It has so far been detected in Firmicute bacteria and some archaebacteria.
Probab=21.74 E-value=1.1e+02 Score=24.51 Aligned_cols=23 Identities=30% Similarity=0.322 Sum_probs=20.5
Q ss_pred cCHHHHHHHHHHHHHhcCCCcch
Q psy18141 78 LRDDQIAQLYELMLAATQPEVLE 100 (113)
Q Consensus 78 lsdEQi~QLyEL~~ea~k~~~~~ 100 (113)
||++|+.+|..||....+..-+.
T Consensus 192 ls~~q~~~i~~l~~~~~~~~~~~ 214 (225)
T PF06207_consen 192 LSDEQIQQIVNLMKKIQNLNIDW 214 (225)
T ss_pred CCHHHHHHHHHHHHHHHcCCCCH
Confidence 89999999999999988876665
No 50
>PF12767 SAGA-Tad1: Transcriptional regulator of RNA polII, SAGA, subunit; InterPro: IPR024738 The yeast Spt-Ada-Gcn5-Acetyl (SAGA) transferase complex is a multifunctional coactivator involved in multiple cellular processes [], including regulation of transcription by RNA polymerase II [, ]. It is formed of five major modular subunits and shows a high degree of structural conservation to human TFTC and STAGA []. This entry represents Ada1 (known as Tada1 in higher eukaryotes), one of the subunits that constitute the SAGA core. It also functions as a component of the SALSA and SLIK complexes. ; GO: 0070461 SAGA-type complex
Probab=21.73 E-value=64 Score=25.39 Aligned_cols=31 Identities=23% Similarity=0.533 Sum_probs=23.5
Q ss_pred HHHHHHHHHHHHHhcccCCCCCCcccCCCCHHHHhh-----cCHHHH
Q psy18141 42 NEWAKYAVNLAEQLGIRGPKFAKPLGQSLDEQALND-----LRDDQI 83 (113)
Q Consensus 42 ~eW~~Y~~~L~~Q~~~~g~~~~~~~G~~L~~~~le~-----lsdEQi 83 (113)
..|..|...|.. |+..+||.+++++ |..|++
T Consensus 25 ~~~~~Y~~~l~~-----------fl~~klsk~Efd~~~~~~L~~~~~ 60 (252)
T PF12767_consen 25 DRWKKYFQSLKR-----------FLSGKLSKEEFDKECRRILGRENV 60 (252)
T ss_pred HHHHHHHHHHHH-----------HHHhccCHHHHHHHHHHHhChhHH
Confidence 679999999974 6777888887664 556664
No 51
>TIGR03759 conj_TIGR03759 integrating conjugative element protein, PFL_4693 family. Members of this protein family, such as model protein PFL_4693 from Pseudomonas fluorescens Pf-5, belong to extended genomic regions that appear to be spread by conjugative transfer. Most members have a predicted N-terminal signal sequence. The function is unknown.
Probab=21.07 E-value=1.8e+02 Score=23.15 Aligned_cols=43 Identities=23% Similarity=0.462 Sum_probs=32.4
Q ss_pred HHHHHHHHHHHHHHhcccCCCCCCcccCCCCHHH---HhhcCHHHHHHHHHHHH
Q psy18141 41 MNEWAKYAVNLAEQLGIRGPKFAKPLGQSLDEQA---LNDLRDDQIAQLYELML 91 (113)
Q Consensus 41 l~eW~~Y~~~L~~Q~~~~g~~~~~~~G~~L~~~~---le~lsdEQi~QLyEL~~ 91 (113)
-.||+.|...+. |+ .+.....|||=. ++.=||+......||.-
T Consensus 16 ~~Ew~RY~~Lmq------G~--rG~~SPgLdPLtaLGIeArsd~ERrryAEl~v 61 (200)
T TIGR03759 16 EDEWQRYQQLMQ------GP--RGVYSPGLDPLTALGIEARSDEERRRYAELWV 61 (200)
T ss_pred HHHHHHHHHHhc------CC--ccCcCCCCChhhhhccccCCHHHHHHHHHHHH
Confidence 369999999875 32 245556687766 78889999999999864
Done!