Query 035921
Match_columns 305
No_of_seqs 41 out of 43
Neff 3.8
Searched_HMMs 46136
Date Fri Mar 29 07:39:56 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/035921.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/035921hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF11938 DUF3456: TLR4 regulat 98.9 2.9E-09 6.3E-14 91.7 6.8 123 43-166 2-151 (151)
2 KOG4052 Uncharacterized conser 95.6 0.015 3.3E-07 52.7 4.3 121 39-160 29-185 (190)
3 PF05184 SapB_1: Saposin-like 93.9 0.05 1.1E-06 36.3 2.4 36 43-86 2-37 (39)
4 KOG3782 Predicted membrane pro 92.9 0.45 9.7E-06 43.4 7.4 88 43-137 25-145 (189)
5 KOG4260 Uncharacterized conser 86.2 3.5 7.7E-05 40.6 8.1 94 42-167 27-133 (350)
6 PRK10144 formate-dependent nit 61.0 6.5 0.00014 34.1 2.3 36 43-82 41-85 (126)
7 smart00741 SapB Saposin (B) Do 57.5 15 0.00033 26.1 3.4 38 43-88 1-38 (76)
8 PF07172 GRP: Glycine rich pro 56.4 9 0.00019 31.5 2.3 25 1-26 1-25 (95)
9 PF15176 LRR19-TM: Leucine-ric 46.6 13 0.00029 31.4 1.8 24 11-51 23-46 (102)
10 TIGR03147 cyt_nit_nrfF cytochr 45.6 16 0.00035 31.7 2.3 36 43-82 41-85 (126)
11 KOG3130 Uncharacterized conser 37.8 23 0.00049 36.7 2.2 24 39-62 73-102 (514)
12 COG3088 CcmH Uncharacterized p 35.5 20 0.00043 32.3 1.3 35 43-81 45-88 (153)
13 PF04220 YihI: Der GTPase acti 32.1 25 0.00055 32.0 1.4 39 192-236 110-159 (169)
14 PF01194 RNA_pol_N: RNA polyme 31.4 15 0.00032 28.4 -0.2 36 40-79 2-37 (60)
15 PF11446 DUF2897: Protein of u 31.0 27 0.00058 26.3 1.1 23 7-29 6-28 (55)
16 PF10690 Myticin-prepro: Mytic 30.6 17 0.00036 30.5 0.0 29 181-211 70-98 (98)
17 PRK06287 cobalt transport prot 30.1 40 0.00086 28.2 2.1 27 1-27 1-27 (107)
18 PRK04016 DNA-directed RNA poly 29.5 29 0.00062 27.0 1.1 41 40-84 2-44 (62)
19 PF08372 PRT_C: Plant phosphor 29.4 1.1E+02 0.0024 27.4 5.0 46 232-281 39-85 (156)
20 PF08285 DPM3: Dolichol-phosph 28.4 36 0.00078 27.9 1.6 68 4-71 1-89 (91)
21 PRK14864 putative biofilm stre 27.3 73 0.0016 26.8 3.3 22 3-26 4-25 (104)
22 PLN00032 DNA-directed RNA poly 24.9 55 0.0012 26.1 2.0 36 40-79 2-37 (71)
23 PF00612 IQ: IQ calmodulin-bin 24.5 53 0.0011 19.4 1.4 13 280-292 2-14 (21)
24 smart00015 IQ Short calmodulin 22.5 64 0.0014 19.9 1.6 15 279-293 3-17 (26)
25 PF07771 TSGP1: Tick salivary 22.2 33 0.00071 29.4 0.3 25 4-28 1-25 (120)
26 PRK09810 entericidin A; Provis 21.6 91 0.002 22.4 2.4 19 4-22 2-20 (41)
27 PF01595 DUF21: Domain of unkn 20.9 54 0.0012 27.7 1.4 26 207-232 157-182 (183)
28 PF03918 CcmH: Cytochrome C bi 20.2 25 0.00053 31.0 -0.9 36 43-82 41-85 (148)
29 COG3417 FlgN Collagen-binding 20.1 1.4E+02 0.003 28.2 3.8 43 1-44 1-50 (200)
30 PF15017 AF1Q: Drug resistance 20.0 61 0.0013 26.6 1.4 18 232-249 66-85 (87)
No 1
>PF11938 DUF3456: TLR4 regulator and MIR-interacting MSAP; InterPro: IPR021852 This family of proteins are functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 182 to 356 amino acids in length.
Probab=98.90 E-value=2.9e-09 Score=91.68 Aligned_cols=123 Identities=23% Similarity=0.474 Sum_probs=87.9
Q ss_pred ccchhHHHHHHHHHHHHHHHHhhhC---------------CC----CCchHHHHHHHHhhcCCccc-----ccchheeee
Q 035921 43 ITCQVCEKLASQLYHQVQKKQAQIS---------------PK----KISEFEIIGIAENVCNLKKE-----EADWILKID 98 (305)
Q Consensus 43 IrC~VCe~iak~ly~~v~~k~~~~~---------------pk----Kv~E~dVie~iE~vCnp~k~-----eG~WI~~~D 98 (305)
-+|.||..+|.++...+..-..... .+ +.||.-++|++|+||+...+ ...+-.++-
T Consensus 2 ~~C~~C~~l~~el~~~l~~~~~~~~v~~~~~r~d~~g~~~~~~~~y~~SE~~l~EvlE~vC~~~~~Y~~~~~~~~~~~~~ 81 (151)
T PF11938_consen 2 LKCSACRALVEELEHRLAKTDPKKFVDVGGFRLDPDGERKKKKIPYARSELRLVEVLENVCEKMKDYVLHKDKSTGKRVY 81 (151)
T ss_pred CcchHHHHHHHHHHHHHHHhCCcCceeccccccccccccccccCccccchhHHHHHHHHHccCcccceeeecCCCCceEE
Confidence 3799999999999887654322211 12 35999999999999997652 111222222
Q ss_pred eeec---CCeeEEEeccccccccccchHHHHHHHHhhCCCCcchhhhhhccCCChHHHHHhhchhhhcccc
Q 035921 99 IVEQ---GDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACS 166 (305)
Q Consensus 99 IvEk---G~~L~lve~~~~gkC~~ECkTI~rACe~vmd~~dtDvae~L~~~~~~~~al~~~lC~d~S~aC~ 166 (305)
+... |....++.++....+..+.+++..+|+.|++.++.++.+++++.... ..|.+|||.+.++.|.
T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~C~~ilEe~Ed~i~~~~~~~~~~-~~l~~~LC~~~~~~C~ 151 (151)
T PF11938_consen 82 KRMSTTGGTNPELSEKDVKVDPSDENKRLKFYCERILEEYEDEIEEWFFSEQDE-DDLEEWLCSETSKVCC 151 (151)
T ss_pred EecCCCcCcccccccccceeecHHHHhHHHHHHHHHHHHhHHHHHHHHHhcCCc-cCHHHHhhhcccccCC
Confidence 2211 22222333444445899999999999999999999999999988766 7888999999999994
No 2
>KOG4052 consensus Uncharacterized conserved protein [Function unknown]
Probab=95.61 E-value=0.015 Score=52.66 Aligned_cols=121 Identities=26% Similarity=0.398 Sum_probs=72.7
Q ss_pred CCCc-ccchhHHHHHHHHHHHHHHHH---hhh---------CCCC-----CchHHHHHHHHhhcCCccc-----------
Q 035921 39 DVPY-ITCQVCEKLASQLYHQVQKKQ---AQI---------SPKK-----ISEFEIIGIAENVCNLKKE----------- 89 (305)
Q Consensus 39 Dvpy-IrC~VCe~iak~ly~~v~~k~---~~~---------~pkK-----v~E~dVie~iE~vCnp~k~----------- 89 (305)
+|-| -+|.||-.+|-++-....+-- +-+ .+++ .||...+|+.||+|+.+-+
T Consensus 29 ~vrlpsKCEvCkllatEleA~l~eTGks~eVi~~gy~~ld~k~k~~v~Y~rselrl~E~tEniCer~ley~~hker~gs~ 108 (190)
T KOG4052|consen 29 TVRLPSKCEVCKLLATELEAKLEETGKSKEVIEHGYTRLDFKFKWFVLYQRSELRLAEITENICERFLEYKIHKERTGSE 108 (190)
T ss_pred cccccchhHHHHHHHHHHHHHHhhcCCcceeeeeceeeecceeeeeeeeehhHhHHHHHHHHHHHHHHHHhhhccCcchh
Confidence 3444 799999999988764432110 000 2333 2899999999999997531
Q ss_pred -----ccchheee-eeeecCCeeEEEe-ccccccccccchHHHHHHHHhhCCCCcchhhhhhccCCChHHHHHhhchh
Q 035921 90 -----EADWILKI-DIVEQGDKLVLVE-QEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKD 160 (305)
Q Consensus 90 -----eG~WI~~~-DIvEkG~~L~lve-~~~~gkC~~ECkTI~rACe~vmd~~dtDvae~L~~~~~~~~al~~~lC~d 160 (305)
..+=|.++ -+|+||.+..|.- -+---+--.|---....||-.+. ++.|+.+--|-+..+...|+|+||.+
T Consensus 109 rfaraqsqt~~tlk~l~~KGvKvdlgip~elwd~psvevt~lk~qce~lle-eyed~i~ewyf~hq~e~~l~k~lce~ 185 (190)
T KOG4052|consen 109 RFARAQSQTIETLKGLVEKGVKVDLGIPFELWDQPSVEVTALKQQCESLLE-EYEDLIEEWYFNHQSEEDLFKQLCER 185 (190)
T ss_pred HHHHHhhhHHHHHHHHHHcCceeeeCCchHhhcCCcHHHHHHHHHHHHHHH-HHHHHHHHHHhccccHHHHHHHHHHh
Confidence 12223222 3466777655532 11112333444555667887665 56777755455556789999999953
No 3
>PF05184 SapB_1: Saposin-like type B, region 1; InterPro: IPR007856 Synonym(s):cerebroside sulphate activator, CSAct Saposin B is a small non-enzymatic glycoprotein required for the breakdown of cerebroside sulphates (sulphatides) in lysosomes. Saposin B contains three intramolecular disulphide bridges, exists as a dimer and is remarkably heat, protease, and pH stable. The crystal structure of human saposin B reveals an unusual shell-like dimer consisting of a monolayer of alpha-helices enclosing a large hydrophobic cavity. Although the secondary structure of saposin B is similar to that of the known monomeric members of the saposin-like superfamily, the helices are repacked into a different tertiary arrangement to form the homodimer. A comparison of the two forms of the saposin B dimer suggests that extraction of target lipids from membranes involves a conformational change that facilitates access to the inner cavity [].; GO: 0006629 lipid metabolic process; PDB: 1N69_C 1QDM_C 4DDJ_A 2DOB_A 1OF9_A 2Z9A_A 1M12_A 2GTG_A 1SN6_A 2QYP_B ....
Probab=93.93 E-value=0.05 Score=36.27 Aligned_cols=36 Identities=22% Similarity=0.445 Sum_probs=29.8
Q ss_pred ccchhHHHHHHHHHHHHHHHHhhhCCCCCchHHHHHHHHhhcCC
Q 035921 43 ITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNL 86 (305)
Q Consensus 43 IrC~VCe~iak~ly~~v~~k~~~~~pkKv~E~dVie~iE~vCnp 86 (305)
..|++|+.+++.+...+.. .-++.+|+..++++|+.
T Consensus 2 ~~C~~C~~~v~~i~~~l~~--------~~t~~~I~~~l~~~C~~ 37 (39)
T PF05184_consen 2 DECDICKFVVKEIEKLLKN--------NKTEEEIKKALEKACNK 37 (39)
T ss_dssp HHHHHHHHHHHHHHHHHHS--------TCHHHHHHHHHHHHHTT
T ss_pred CcchHHHHHHHHHHHHHHc--------CccHHHHHHHHHHHHhh
Confidence 3699999999999877751 12689999999999985
No 4
>KOG3782 consensus Predicted membrane protein, contains type II SA sequence [General function prediction only]
Probab=92.87 E-value=0.45 Score=43.44 Aligned_cols=88 Identities=25% Similarity=0.516 Sum_probs=58.3
Q ss_pred ccchhHHHHHHHHHHHHHHHHhhhCCC-------------------CC----chHHHHHHHHhhcCCcccccchheee--
Q 035921 43 ITCQVCEKLASQLYHQVQKKQAQISPK-------------------KI----SEFEIIGIAENVCNLKKEEADWILKI-- 97 (305)
Q Consensus 43 IrC~VCe~iak~ly~~v~~k~~~~~pk-------------------Kv----~E~dVie~iE~vCnp~k~eG~WI~~~-- 97 (305)
+.|.+|..++.++=-..+ +..|| || ||.-.-|++|.+|+.-++=+.=+-+.
T Consensus 25 ~~CgaC~alVtelE~~IA----~vDPkK~i~vgsFR~~p~G~q~~~kV~yarSE~hLTEl~E~iCd~mk~Y~~~~~~stg 100 (189)
T KOG3782|consen 25 VKCGACKALVTELEEAIA----KVDPKKMIDVGSFRLDPQGNQISKKVRYARSEMHLTELMEKICDKMKDYLKATYKSTG 100 (189)
T ss_pred cccchHHHHHHHHHHHHH----hcCchheeeecceEECCCCCeeeeeeccchhHHHHHHHHHHHHHHHHHHHHHhccCCC
Confidence 589999999988754433 22333 33 89999999999999887655433221
Q ss_pred --eeee------cCCeeEEEeccccccccccchHHHHHHHHhhCCCCc
Q 035921 98 --DIVE------QGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDT 137 (305)
Q Consensus 98 --DIvE------kG~~L~lve~~~~gkC~~ECkTI~rACe~vmd~~dt 137 (305)
-+|. +|+-+.|+.-..-|-|| +|+--+|++|+++-++
T Consensus 101 k~~yv~~~~~~~~~~es~l~~~~~d~d~n---~~l~~~c~evle~~~~ 145 (189)
T KOG3782|consen 101 KFTYVKMIINGQMNPESSLVDFVQDGDLN---KSLGHFCNEVLEDNDE 145 (189)
T ss_pred ceEEEEEeccCCCCCccccccccccchhh---HHHHHHHHHHHHHHHH
Confidence 1111 24555555544556665 7899999999985443
No 5
>KOG4260 consensus Uncharacterized conserved protein [Function unknown]
Probab=86.24 E-value=3.5 Score=40.58 Aligned_cols=94 Identities=27% Similarity=0.556 Sum_probs=57.7
Q ss_pred cccchhHHHHHHHHHHHH-HHHHhh------------hCCCCCchHHHHHHHHhhcCCcccccchheeeeeeecCCeeEE
Q 035921 42 YITCQVCEKLASQLYHQV-QKKQAQ------------ISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVL 108 (305)
Q Consensus 42 yIrC~VCe~iak~ly~~v-~~k~~~------------~~pkKv~E~dVie~iE~vCnp~k~eG~WI~~~DIvEkG~~L~l 108 (305)
.-.|-.|..++...-+-+ ...|.. ++.-|-||.-.+|+.|.+|--.. +...|.
T Consensus 27 p~pCrtC~~LVssFn~GlerT~r~hfaGGdTAWEEknL~kYk~SE~RLvEilEglCsks~-----~~n~Df--------- 92 (350)
T KOG4260|consen 27 PEPCRTCRGLVSSFNEGLERTARHHFAGGDTAWEEKNLSKYKTSETRLVEILEGLCSKSS-----LPNMDF--------- 92 (350)
T ss_pred CCCchHHHHHHHHHHHHHHHHhhhccCCCchhhhhhhhhhccccchhHHHHHHHhhhccC-----CCCCCh---------
Confidence 346999999987654322 222222 11223489999999999997554 333343
Q ss_pred EeccccccccccchHHHHHHHHhhCCCCcchhhhhhccCCChHHHHHhhchhhhccccC
Q 035921 109 VEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACST 167 (305)
Q Consensus 109 ve~~~~gkC~~ECkTI~rACe~vmd~~dtDvae~L~~~~~~~~al~~~lC~d~S~aC~~ 167 (305)
+|-|....=|+.+. ++-|+..-..--|++|||-|--+-|--
T Consensus 93 -----------eCh~lle~hEellE-------~w~~hkq~e~Pdl~~WlCvdqLkvCCp 133 (350)
T KOG4260|consen 93 -----------ECHTLLEKHEELLE-------EWWYHKQHESPDLFNWLCVDQLKVCCP 133 (350)
T ss_pred -----------HHHHHHHHHHHHHH-------HHHHHhhcCCchHHhHhhhhhheeccC
Confidence 58888888776653 444432222224679999877666653
No 6
>PRK10144 formate-dependent nitrite reductase complex subunit NrfF; Provisional
Probab=61.01 E-value=6.5 Score=34.14 Aligned_cols=36 Identities=36% Similarity=0.548 Sum_probs=26.5
Q ss_pred ccchhHH---------HHHHHHHHHHHHHHhhhCCCCCchHHHHHHHHh
Q 035921 43 ITCQVCE---------KLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82 (305)
Q Consensus 43 IrC~VCe---------~iak~ly~~v~~k~~~~~pkKv~E~dVie~iE~ 82 (305)
+||-||+ -+|+.+=.+|.++..+-. |+.+|++-+.+
T Consensus 41 LRC~vCqnqsiadSna~iA~dmR~~Vr~~i~~G~----sd~eI~~~~v~ 85 (126)
T PRK10144 41 LRCPQCQNQNLLESNAPVAVSMRHQVYSMVAEGK----SEVEIIGWMTE 85 (126)
T ss_pred CCCCCCCCCChhhcCCHHHHHHHHHHHHHHHcCC----CHHHHHHHHHH
Confidence 9999997 477777777777766332 68899887654
No 7
>smart00741 SapB Saposin (B) Domains. Present in multiple copies in prosaposin and in pulmonary surfactant-associated protein B. In plant aspartic proteinases, a saposin domain is circularly permuted. This causes the prediction algorithm to predict two such domains, where only one is truly present.
Probab=57.55 E-value=15 Score=26.09 Aligned_cols=38 Identities=29% Similarity=0.551 Sum_probs=30.0
Q ss_pred ccchhHHHHHHHHHHHHHHHHhhhCCCCCchHHHHHHHHhhcCCcc
Q 035921 43 ITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKK 88 (305)
Q Consensus 43 IrC~VCe~iak~ly~~v~~k~~~~~pkKv~E~dVie~iE~vCnp~k 88 (305)
+.|+.|..++..+-..+.... ++..|.+.++++|+...
T Consensus 1 ~~C~~C~~~v~~~~~~~~~~~--------~~~~i~~~~~~~C~~~~ 38 (76)
T smart00741 1 LLCELCEDVVKQLENLLKDNK--------TEEEIKKALEKVCKKLP 38 (76)
T ss_pred CcChHHHHHHHHHHHHHHhch--------HHHHHHHHHHHHHHHcC
Confidence 369999999998887776321 26789999999999754
No 8
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=56.40 E-value=9 Score=31.48 Aligned_cols=25 Identities=28% Similarity=0.271 Sum_probs=13.0
Q ss_pred ChhHHHHHHHHHHHHHHHhhcccccc
Q 035921 1 MARIKAAVLVYVAVILLLSTCAPISY 26 (305)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 26 (305)
|| =|+.++|.|++++++.....+++
T Consensus 1 Ma-SK~~llL~l~LA~lLlisSevaa 25 (95)
T PF07172_consen 1 MA-SKAFLLLGLLLAALLLISSEVAA 25 (95)
T ss_pred Cc-hhHHHHHHHHHHHHHHHHhhhhh
Confidence 55 46666666655444443334443
No 9
>PF15176 LRR19-TM: Leucine-rich repeat family 19 TM domain
Probab=46.63 E-value=13 Score=31.36 Aligned_cols=24 Identities=25% Similarity=0.312 Sum_probs=17.9
Q ss_pred HHHHHHHHhhcccccccCCCCCCCcCCCCCCcccchhHHHH
Q 035921 11 YVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKL 51 (305)
Q Consensus 11 ~~~~~~~~~~~~~~~~~~kkp~~~arKeDvpyIrC~VCe~i 51 (305)
+|+.++++|.+|.|+ +||.+|-..
T Consensus 23 Vv~~al~~SlLIala-----------------aKC~~~~k~ 46 (102)
T PF15176_consen 23 VVVTALVTSLLIALA-----------------AKCPVWYKY 46 (102)
T ss_pred HHHHHHHHHHHHHHH-----------------HHhHHHHHH
Confidence 455667777888887 799999653
No 10
>TIGR03147 cyt_nit_nrfF cytochrome c nitrite reductase, accessory protein NrfF.
Probab=45.59 E-value=16 Score=31.69 Aligned_cols=36 Identities=25% Similarity=0.420 Sum_probs=26.5
Q ss_pred ccchhHH---------HHHHHHHHHHHHHHhhhCCCCCchHHHHHHHHh
Q 035921 43 ITCQVCE---------KLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82 (305)
Q Consensus 43 IrC~VCe---------~iak~ly~~v~~k~~~~~pkKv~E~dVie~iE~ 82 (305)
+||-||+ -+|+.+=++|.++..+-. |+.+|++-+.+
T Consensus 41 LRC~vCqnqsiadS~a~iA~dmR~~Vr~~i~~G~----Sd~eI~~~~v~ 85 (126)
T TIGR03147 41 LRCPQCQNQNLVESNSPIAYDLRHEVYSMVNEGK----SNQQIIDFMTA 85 (126)
T ss_pred CCCCCCCCCChhhcCCHHHHHHHHHHHHHHHcCC----CHHHHHHHHHH
Confidence 9999997 477777777777665322 68899987754
No 11
>KOG3130 consensus Uncharacterized conserved protein [Function unknown]
Probab=37.82 E-value=23 Score=36.72 Aligned_cols=24 Identities=17% Similarity=0.380 Sum_probs=12.8
Q ss_pred CCCcccch------hHHHHHHHHHHHHHHH
Q 035921 39 DVPYITCQ------VCEKLASQLYHQVQKK 62 (305)
Q Consensus 39 DvpyIrC~------VCe~iak~ly~~v~~k 62 (305)
+=.|-.|. .|+.-.+++-+|..++
T Consensus 73 ~nyf~~CS~h~A~~I~~~R~~~~r~q~~~l 102 (514)
T KOG3130|consen 73 DNYFAKCSAHQAVGIVEHRKEHVRKQIDDL 102 (514)
T ss_pred cchHhhhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 55667775 4554444444444444
No 12
>COG3088 CcmH Uncharacterized protein involved in biosynthesis of c-type cytochromes [Posttranslational modification, protein turnover, chaperones]
Probab=35.45 E-value=20 Score=32.26 Aligned_cols=35 Identities=29% Similarity=0.448 Sum_probs=27.4
Q ss_pred ccchhHH---------HHHHHHHHHHHHHHhhhCCCCCchHHHHHHHH
Q 035921 43 ITCQVCE---------KLASQLYHQVQKKQAQISPKKISEFEIIGIAE 81 (305)
Q Consensus 43 IrC~VCe---------~iak~ly~~v~~k~~~~~pkKv~E~dVie~iE 81 (305)
.||-||+ -||..+-.+|.++..+-+ |+.+||.-+-
T Consensus 45 LRCp~CQNqsIadSnA~IA~DlR~~V~e~l~eGk----S~~qIid~mV 88 (153)
T COG3088 45 LRCPQCQNQSIADSNAPIARDLRHQVYELLQEGK----SDQQIIDYMV 88 (153)
T ss_pred cCCCcCCCCChhhhccHHHHHHHHHHHHHHHcCC----cHHHHHHHHH
Confidence 8999997 588888888888777443 5888888764
No 13
>PF04220 YihI: Der GTPase activator (YihI); InterPro: IPR007336 This entry contains Escherichia coli (strain K12) YihI. YihI activates the GTPase activity of Der, a 50S ribosomal subunit stability factor and can therefore be considered a GAP (GTPase activating)-like protein. The stimulation is specific to Der as YihI does not stimulate the GTPase activity of Era or ObgE. The interaction of YihI with Der requires only the C-terminal 78 amino acids of YihI []. A yihI deletion mutant is viable and shows a shorter lag period, but the same post-lag growth rate as a wild-type strain. yihI is expressed during the lag period. Overexpression of yihI inhibits cell growth and biogenesis of the 50S ribosomal subunit []. YihI is an unusual, highly hydrophilic protein with an uneven distribution of charged residues, resulting in an N-terminal region with high pI and a C-terminal region with low pI [].
Probab=32.08 E-value=25 Score=31.98 Aligned_cols=39 Identities=31% Similarity=0.644 Sum_probs=24.2
Q ss_pred HHHHHHHccCCCCCCCCCcCCHHHHH-----------HhhhcCCCCCCCCCCCCCC
Q 035921 192 MERIMRSMEGMPGAPGMQMYSKEDLM-----------NMKNFGGEDGDDNDDEDDD 236 (305)
Q Consensus 192 mErmM~~Me~m~g~~gm~mySRddl~-----------~M~~~G~~~~ddddddd~d 236 (305)
...++..++. | ...|.+|-. =|...|+.++||+++++++
T Consensus 110 L~~LLdrle~--G----e~Ls~~dQ~yvD~~LdRi~~Lm~~LGi~~ddd~e~~~~~ 159 (169)
T PF04220_consen 110 LNQLLDRLEE--G----ETLSAEDQKYVDEKLDRIEELMEELGIEDDDDDEEEEEE 159 (169)
T ss_pred HHHHHHHHHC--C----CcCCHHHHHHHHHHHHHHHHHHHHhCCCccccccccccc
Confidence 4456666663 3 666666541 3559999988886554333
No 14
>PF01194 RNA_pol_N: RNA polymerases N / 8 kDa subunit; InterPro: IPR000268 In eukaryotes, there are three different forms of DNA-dependent RNA polymerases (2.7.7.6 from EC) transcribing different sets of genes. Each class of RNA polymerase is an assemblage of ten to twelve different polypeptides. In archaebacteria, there is generally a single form of RNA polymerase which also consists of an oligomeric assemblage of 10 to 13 polypeptides. Archaebacterial subunit N (gene rpoN) [] is a small protein of about 8 kDa, it is evolutionary related [] to a 8.3 kDa component shared by all three forms of eukaryotic RNA polymerases (gene RPB10 in yeast and POLR2J in mammals) as well as to African swine fever virus (ASFV) protein CP80R []. There is a conserved region which is located at the N-terminal extremity of these polymerase subunits; this region contains two cysteines that binds a zinc ion [].; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_N 3HKZ_N 1EF4_A 3H0G_V 2Y0S_N 2R92_J 3M4O_J 3S2D_J 1R9S_J 1Y1W_J ....
Probab=31.42 E-value=15 Score=28.36 Aligned_cols=36 Identities=8% Similarity=0.202 Sum_probs=25.4
Q ss_pred CCcccchhHHHHHHHHHHHHHHHHhhhCCCCCchHHHHHH
Q 035921 40 VPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGI 79 (305)
Q Consensus 40 vpyIrC~VCe~iak~ly~~v~~k~~~~~pkKv~E~dVie~ 79 (305)
|.+|||=.|.++..+.|....+..++ ..++.++|..
T Consensus 2 iiPVRCFTCGkvi~~~~e~y~~~~~~----~~~~~~~Ld~ 37 (60)
T PF01194_consen 2 IIPVRCFTCGKVIGNKWEEYLERLEN----GEDPGDALDD 37 (60)
T ss_dssp --SSS-STTTSBTCGHHHHHHHHHHT----TS-HHHHHHH
T ss_pred CCceecCCCCCChhHhHHHHHHHHHc----CCCHHHHHHH
Confidence 46799999999999999998887765 2245666654
No 15
>PF11446 DUF2897: Protein of unknown function (DUF2897); InterPro: IPR021550 This is a bacterial family of uncharacterised proteins.
Probab=31.01 E-value=27 Score=26.33 Aligned_cols=23 Identities=17% Similarity=0.433 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHhhcccccccCC
Q 035921 7 AVLVYVAVILLLSTCAPISYCAK 29 (305)
Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~k 29 (305)
.++++||++|+++.++.|-+.+|
T Consensus 6 wlIIviVlgvIigNia~LK~sAk 28 (55)
T PF11446_consen 6 WLIIVIVLGVIIGNIAALKYSAK 28 (55)
T ss_pred hHHHHHHHHHHHhHHHHHHHhcc
Confidence 46778999999999999998886
No 16
>PF10690 Myticin-prepro: Myticin pre-proprotein from the mussel; InterPro: IPR019631 Myticin is a cysteine-rich peptide produced in three isoforms, A, B and C, by Mytilus galloprovincialis (Mediterranean mussel). Isoforms A and B show antibacterial activity against Gram-positive bacteria, while isoform B is additionally active against the fungus Fusarium oxysporum and a Gram-negative bacterium, Escherichia coli (streptomycin resistant strain D31) []. Myticin-prepro is the precursor peptide. The mature molecule, named myticin, consists of 40 residues, with four intramolecular disulphide bridges and a cysteine array in the primary structure different from that of previously characterised cysteine-rich antimicrobial peptides. The first 20 amino acids are a putative signal peptide, and the antimicrobial peptide sequence is a 36-residue C-terminal extension. Such a structure suggests that myticins are synthesised as prepro-proteins that are then processed by various proteolytic events before storage in the haemocytes as the active peptide. Myticin precursors are expressed mainly in the haemocytes. ; PDB: 2EEM_A.
Probab=30.64 E-value=17 Score=30.51 Aligned_cols=29 Identities=31% Similarity=0.460 Sum_probs=0.0
Q ss_pred CCCcCChhHHHHHHHHHHccCCCCCCCCCcC
Q 035921 181 PFVPKPAKEAEMERIMRSMEGMPGAPGMQMY 211 (305)
Q Consensus 181 ~f~~~~~ke~emErmM~~Me~m~g~~gm~my 211 (305)
.|.+++++++||--+|.+||... ++|+||
T Consensus 70 ~a~~~NE~~~e~~p~Mne~Enld--~~mdM~ 98 (98)
T PF10690_consen 70 SARPENEGEMEMSPQMNEMENLD--QEMDMF 98 (98)
T ss_dssp -------------------------------
T ss_pred ccccccccccccccccccccccc--ccccCC
Confidence 47788888999999999999875 478886
No 17
>PRK06287 cobalt transport protein CbiN; Validated
Probab=30.07 E-value=40 Score=28.21 Aligned_cols=27 Identities=26% Similarity=0.272 Sum_probs=20.7
Q ss_pred ChhHHHHHHHHHHHHHHHhhccccccc
Q 035921 1 MARIKAAVLVYVAVILLLSTCAPISYC 27 (305)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 27 (305)
|.++|+.++..++++++++.++++-++
T Consensus 1 ~~~~~~~~~~~~~~all~a~~~s~~AS 27 (107)
T PRK06287 1 MMDNKKFLIAGLIVALLIAILAPFLAS 27 (107)
T ss_pred CCcchhhHHHHHHHHHHHHHHHHhhhc
Confidence 347788888888888898888876543
No 18
>PRK04016 DNA-directed RNA polymerase subunit N; Provisional
Probab=29.51 E-value=29 Score=26.98 Aligned_cols=41 Identities=10% Similarity=0.291 Sum_probs=29.9
Q ss_pred CCcccchhHHHHHHHHHHHHHHHHhhhCCCCCchHHHHH--HHHhhc
Q 035921 40 VPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIG--IAENVC 84 (305)
Q Consensus 40 vpyIrC~VCe~iak~ly~~v~~k~~~~~pkKv~E~dVie--~iE~vC 84 (305)
|..|||-.|.++..+.|.....+.++- .++.+||. -++.-|
T Consensus 2 iiPvRCFTCGkvi~~~we~y~~~~~~g----~~~~~vLd~Lg~~RyC 44 (62)
T PRK04016 2 MIPVRCFTCGKVIAEKWEEFKERVEAG----EDPGKVLDDLGVKRYC 44 (62)
T ss_pred CCCeEecCCCCChHHHHHHHHHHHHcC----CCHHHHHHHcCCcchh
Confidence 456999999999999999988877641 24566776 344444
No 19
>PF08372 PRT_C: Plant phosphoribosyltransferase C-terminal; InterPro: IPR013583 This domain is found at the C terminus of phosphoribosyltransferases and phosphoribosyltransferase-like proteins. It contains putative transmembrane regions. It often appears together with calcium-ion dependent C2 domains (IPR000008 from INTERPRO).
Probab=29.41 E-value=1.1e+02 Score=27.40 Aligned_cols=46 Identities=20% Similarity=0.258 Sum_probs=21.4
Q ss_pred CCCCCCCCCCCc-cccccccccCCCCcchHHHHHhhhhhhhHHHHHHHHHH
Q 035921 232 DEDDDDESDFPK-NLGKVFREKESKSGDWKQRITDGILNTGDVLKKQANKV 281 (305)
Q Consensus 232 ddd~d~d~~~p~-~~g~~~~~~~~~~~~~~~~v~~~~~~t~~~~k~~~~~~ 281 (305)
||.|+|.+.||+ +-+++++. .=|--+.|...+|....-+-....++
T Consensus 39 deldEEfD~~ps~~~~~~lr~----Rydrlr~va~rvQ~vlgd~At~gERl 85 (156)
T PF08372_consen 39 DELDEEFDTFPSSRPPDSLRM----RYDRLRSVAGRVQNVLGDVATQGERL 85 (156)
T ss_pred chhhhhhcccccccccHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444556678885 22333332 12233455555555544444444444
No 20
>PF08285 DPM3: Dolichol-phosphate mannosyltransferase subunit 3 (DPM3); InterPro: IPR013174 This family corresponds to subunit 3 of dolichol-phosphate mannosyltransferase, an enzyme which generates mannosyl donors for glycosylphosphatidylinositols, N-glycan and protein O- and C-mannosylation. DPM3 is an integral membrane protein and plays a role in stabilising the dolichol-phosphate mannosyl transferase complex [].
Probab=28.45 E-value=36 Score=27.86 Aligned_cols=68 Identities=18% Similarity=0.259 Sum_probs=40.1
Q ss_pred HHHHHHHHHHHHHHHhhcccccccCCCCCCCcCCCCCCc---------------------ccchhHHHHHHHHHHHHHHH
Q 035921 4 IKAAVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPY---------------------ITCQVCEKLASQLYHQVQKK 62 (305)
Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~kkp~~~arKeDvpy---------------------IrC~VCe~iak~ly~~v~~k 62 (305)
|.++.=|+++++++.+.|+.+.--.-+.......+-||| ..=.-|+..+++|-+++++-
T Consensus 1 Mtr~~~~l~~~~~l~~~w~~l~~~~~~~~~~~~~~ii~~lP~~~Lv~fG~Ysl~~lgy~v~tFnDcpeA~~eL~~eI~eA 80 (91)
T PF08285_consen 1 MTRLQQWLSALLLLSALWLALLLGLLPLPPEPQQEIIPYLPFYALVSFGCYSLFTLGYGVATFNDCPEAAKELQKEIKEA 80 (91)
T ss_pred CchHHHHHHHHHHHHHHHHHHHHccCCCCchhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhccCCCHHHHHHHHHHHHHH
Confidence 444555555556666666555433222222334455555 23346999999999999988
Q ss_pred HhhhCCCCC
Q 035921 63 QAQISPKKI 71 (305)
Q Consensus 63 ~~~~~pkKv 71 (305)
++....|.+
T Consensus 81 K~dLr~kGv 89 (91)
T PF08285_consen 81 KADLRKKGV 89 (91)
T ss_pred HHHHHHcCC
Confidence 887755443
No 21
>PRK14864 putative biofilm stress and motility protein A; Provisional
Probab=27.35 E-value=73 Score=26.81 Aligned_cols=22 Identities=23% Similarity=0.410 Sum_probs=14.0
Q ss_pred hHHHHHHHHHHHHHHHhhcccccc
Q 035921 3 RIKAAVLVYVAVILLLSTCAPISY 26 (305)
Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~ 26 (305)
-||+.. ++++++++|++-.++.
T Consensus 4 ~mk~~~--~l~~~l~LS~~s~~~~ 25 (104)
T PRK14864 4 VMRRFA--SLLLTLLLSACSALQG 25 (104)
T ss_pred HHHHHH--HHHHHHHHhhhhhccc
Confidence 356633 5577778888766664
No 22
>PLN00032 DNA-directed RNA polymerase; Provisional
Probab=24.93 E-value=55 Score=26.13 Aligned_cols=36 Identities=14% Similarity=0.200 Sum_probs=26.8
Q ss_pred CCcccchhHHHHHHHHHHHHHHHHhhhCCCCCchHHHHHH
Q 035921 40 VPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGI 79 (305)
Q Consensus 40 vpyIrC~VCe~iak~ly~~v~~k~~~~~pkKv~E~dVie~ 79 (305)
|..|||=.|.++..+.|....++.++- .++.++|..
T Consensus 2 iiPVRCFTCGkvig~~we~y~~~~~~g----~~~~~~LD~ 37 (71)
T PLN00032 2 IIPVRCFTCGKVIGNKWDTYLDLLQAD----YSEGDALDA 37 (71)
T ss_pred CCceeecCCCCCcHHHHHHHHHHHhcC----CCHHHHHHH
Confidence 456999999999999999988777542 244555544
No 23
>PF00612 IQ: IQ calmodulin-binding motif; InterPro: IPR000048 The IQ motif is an extremely basic unit of about 23 amino acids, whose conserved core usually fits the consensus A-x(3)-I-Q-x(2)-F-R-x(4)-K-K. The IQ motif, which can be present in one or more copies, serves as a binding site for different EF-hand proteins including the essential and regulatory myosin light chains, calmodulin (CaM), and CaM-like proteins [, ].Many IQ motifs are protein kinase C (PKC) phosphorylation sites [, ]. Resolution of the 3D structure of scallop myosin has shown that the IQ motif forms a basic amphipathic helix []. Some proteins known to contain an IQ motif are listed below: A number of conventional and unconventional myosins. Neuromodulin (GAP-43). This protein is associated with nerve growth. It is a major component of the motile "growth cones" that form the tips of elongating axons. Neurogranin (NG/p17). Acts as a "third messenger" substrate of protein kinase C-mediated molecular cascades during synaptic development and remodeling. Sperm surface protein Sp17. Ras GTPase-activating-like protein IQGAP1. IQGAP1 contains 4 IQ motifs. This entry covers the entire IQ motif.; GO: 0005515 protein binding; PDB: 2DFS_A 2IX7_C 1OE9_A 1W7J_A 1W7I_A 1KQM_A 1KK7_A 1WDC_A 1DFL_A 1B7T_A ....
Probab=24.50 E-value=53 Score=19.43 Aligned_cols=13 Identities=31% Similarity=0.744 Sum_probs=9.9
Q ss_pred HHHHHHHHhhccc
Q 035921 280 KVSNHVQRWWSGI 292 (305)
Q Consensus 280 ~~s~~v~~wwkg~ 292 (305)
+++-.|++||+|-
T Consensus 2 ~aai~iQ~~~R~~ 14 (21)
T PF00612_consen 2 KAAIIIQSYWRGY 14 (21)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHH
Confidence 4556789999983
No 24
>smart00015 IQ Short calmodulin-binding motif containing conserved Ile and Gln residues. Calmodulin-binding motif.
Probab=22.48 E-value=64 Score=19.91 Aligned_cols=15 Identities=20% Similarity=0.419 Sum_probs=11.2
Q ss_pred HHHHHHHHHhhcccc
Q 035921 279 NKVSNHVQRWWSGIK 293 (305)
Q Consensus 279 ~~~s~~v~~wwkg~k 293 (305)
++++-.|++||||-.
T Consensus 3 ~~aa~~IQa~~Rg~~ 17 (26)
T smart00015 3 TRAAIIIQAAWRGYL 17 (26)
T ss_pred HHHHHHHHHHHHHHH
Confidence 455667899999944
No 25
>PF07771 TSGP1: Tick salivary peptide group 1; InterPro: IPR011694 This entry contains a group of peptides derived from a salivary gland cDNA library of the tick Ixodes scapularis (Black-legged tick) []. Also present are peptides from a related tick species, Ixodes ricinus (Sheep tick). They are characterised by a putative signal peptide, indicative of secretion, and conserved cysteine residues.
Probab=22.24 E-value=33 Score=29.44 Aligned_cols=25 Identities=24% Similarity=0.272 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHHHHHhhcccccccC
Q 035921 4 IKAAVLVYVAVILLLSTCAPISYCA 28 (305)
Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~ 28 (305)
|+.+++++++++|++..-.....|+
T Consensus 1 m~~~~~~~vlv~la~~~~~~~~~Cp 25 (120)
T PF07771_consen 1 MGLTVVTLVLVSLAFFGSAAAHGCP 25 (120)
T ss_pred CcceeehhhHHhHHhhhhhhhhcCC
Confidence 5556665666666555444444444
No 26
>PRK09810 entericidin A; Provisional
Probab=21.60 E-value=91 Score=22.42 Aligned_cols=19 Identities=21% Similarity=0.469 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHHhhcc
Q 035921 4 IKAAVLVYVAVILLLSTCA 22 (305)
Q Consensus 4 ~~~~~~~~~~~~~~~~~~~ 22 (305)
||+.++++++++++++++-
T Consensus 2 Mkk~~~l~~~~~~~L~aCN 20 (41)
T PRK09810 2 MKRLIVLVLLASTLLTGCN 20 (41)
T ss_pred hHHHHHHHHHHHHHHhhhh
Confidence 6777766665566555543
No 27
>PF01595 DUF21: Domain of unknown function DUF21; InterPro: IPR002550 This transmembrane region has no known function. Many of the sequences in this family are annotated as hemolysins, however this is due to a similarity to Q54318 from SWISSPROT that does not contain this domain. This domain is found in the N terminus of the proteins adjacent to two intracellular CBS domains (IPR000644 from INTERPRO).
Probab=20.91 E-value=54 Score=27.75 Aligned_cols=26 Identities=23% Similarity=0.303 Sum_probs=21.1
Q ss_pred CCCcCCHHHHHHhhhcCCCCCCCCCC
Q 035921 207 GMQMYSKEDLMNMKNFGGEDGDDNDD 232 (305)
Q Consensus 207 gm~mySRddl~~M~~~G~~~~ddddd 232 (305)
...-+|+|||..|.+.|.+++.-+++
T Consensus 157 ~~~~~s~eel~~lv~~~~e~G~i~~~ 182 (183)
T PF01595_consen 157 EDPAVSEEELRSLVEEGEEEGVIEEE 182 (183)
T ss_pred ccCCCCHHHHHHHHHhHHHCCCCCCC
Confidence 34789999999999998888877444
No 28
>PF03918 CcmH: Cytochrome C biogenesis protein; InterPro: IPR005616 Members of this family include NrfF, CcmH, CycL, Ccl2.; PDB: 2KW0_A 2HL7_A.
Probab=20.22 E-value=25 Score=30.95 Aligned_cols=36 Identities=25% Similarity=0.446 Sum_probs=23.9
Q ss_pred ccchhHH---------HHHHHHHHHHHHHHhhhCCCCCchHHHHHHHHh
Q 035921 43 ITCQVCE---------KLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82 (305)
Q Consensus 43 IrC~VCe---------~iak~ly~~v~~k~~~~~pkKv~E~dVie~iE~ 82 (305)
+||.||+ -+|..+=++|.++..+- -|+++|++.+-.
T Consensus 41 LrCp~Cq~qsi~~s~a~~A~dmR~~I~~~l~~G----~s~~eI~~~~v~ 85 (148)
T PF03918_consen 41 LRCPVCQNQSIADSNAPIARDMRREIREMLAEG----KSDEEIIDYFVE 85 (148)
T ss_dssp CE-TTTTS-CTTT--SHHHHHHHHHHHHHHHHT------HHHHHHHHHH
T ss_pred ccCCCCCCCchhhcCcHHHHHHHHHHHHHHHcC----CCHHHHHHHHHH
Confidence 8999996 46777766666666532 268999988764
No 29
>COG3417 FlgN Collagen-binding surface adhesin SpaP (antigen I/II family) [General function prediction only]
Probab=20.15 E-value=1.4e+02 Score=28.16 Aligned_cols=43 Identities=19% Similarity=0.348 Sum_probs=26.4
Q ss_pred ChhHHHHHHHHHHHHHHHhhccc-------ccccCCCCCCCcCCCCCCccc
Q 035921 1 MARIKAAVLVYVAVILLLSTCAP-------ISYCAKKPVAGARKEDVPYIT 44 (305)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~kkp~~~arKeDvpyIr 44 (305)
|-.||.-.. ++++++++++|.. +..+++-+....-+.|++-|.
T Consensus 1 mtk~k~~~~-il~~al~l~GCs~~~a~~~sv~~a~~~tt~~v~~td~~~~a 50 (200)
T COG3417 1 MTKMKIYAS-ILLLALFLSGCSSEPAPKESVKDAPENTTEPVDPTDLELIA 50 (200)
T ss_pred CchHHHHHH-HHHHHHHHhhcccCCCcchhcCcCcccccCCCCccchhhcc
Confidence 344555554 7888899999985 444444455555556666543
No 30
>PF15017 AF1Q: Drug resistance and apoptosis regulator
Probab=20.03 E-value=61 Score=26.65 Aligned_cols=18 Identities=33% Similarity=0.770 Sum_probs=11.4
Q ss_pred CCCCCCCCCC--Cccccccc
Q 035921 232 DEDDDDESDF--PKNLGKVF 249 (305)
Q Consensus 232 ddd~d~d~~~--p~~~g~~~ 249 (305)
++++|++... |||.+++-
T Consensus 66 e~~ddD~gGWITPsNIkqiq 85 (87)
T PF15017_consen 66 EEEDDDGGGWITPSNIKQIQ 85 (87)
T ss_pred ccccCCCCccccchhhhhhc
Confidence 4444445555 99988764
Done!