Query psy14055
Match_columns 493
No_of_seqs 340 out of 1667
Neff 5.7
Searched_HMMs 46136
Date Fri Aug 16 18:25:35 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14055.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14055hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3913|consensus 100.0 1E-79 2.2E-84 624.5 20.6 236 181-477 47-356 (356)
2 smart00097 WNT1 found in Wnt-1 100.0 2.9E-72 6.3E-77 568.8 20.8 232 185-477 2-305 (305)
3 PF00110 wnt: wnt family; Int 100.0 2.1E-72 4.6E-77 572.3 5.5 236 181-477 1-310 (310)
4 COG3590 PepO Predicted metallo 99.0 2.7E-10 5.9E-15 122.3 6.1 83 21-112 88-171 (654)
5 PF05649 Peptidase_M13_N: Pept 99.0 1E-10 2.2E-15 120.1 1.4 88 22-110 71-159 (390)
6 KOG3913|consensus 98.7 2.8E-09 6.2E-14 110.0 1.2 56 388-443 163-229 (356)
7 smart00097 WNT1 found in Wnt-1 98.5 3.2E-08 6.9E-13 101.5 1.0 56 388-443 114-180 (305)
8 KOG3624|consensus 98.3 9.4E-07 2E-11 99.7 6.7 84 22-112 127-212 (687)
9 PF00110 wnt: wnt family; Int 98.1 3.4E-07 7.3E-12 94.3 -1.8 57 387-443 117-184 (310)
10 PHA00626 hypothetical protein 27.9 22 0.00048 28.1 0.3 28 462-490 12-39 (59)
11 PF07459 CTX_RstB: CTX phage R 23.7 91 0.002 28.1 3.3 31 62-93 70-100 (117)
12 smart00412 Cu_FIST Copper-Fist 22.1 22 0.00047 26.0 -0.7 30 463-492 9-38 (39)
No 1
>KOG3913|consensus
Probab=100.00 E-value=1e-79 Score=624.53 Aligned_cols=236 Identities=50% Similarity=0.958 Sum_probs=214.9
Q ss_pred ccchhhccCHhHHHHhhcchhhhhHHHhhHhhhhhhhhhhhhhcccccccHHHHHHHHHHHHHHHHHhcccCcccCCCCC
Q psy14055 181 ICTRFDFLGHRQQYLCTLNENILNVYLIVCVLIKYQQVDLIRSTRQVYVYLTVVAAGARMGIEECQNQFKMSRWNCTTFG 260 (493)
Q Consensus 181 ~~~~~~~l~~~Q~~lC~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~sv~~Ga~~ai~ECq~QFr~~RWNCS~~~ 260 (493)
+|..++||+++|+++|++||++| ++|++|+++||+|||+|||++|||||+.+
T Consensus 47 ~C~~l~gL~~~Q~~~Cr~~p~~~----------------------------~sv~~G~~~~i~ECq~QFr~~RWNCs~~~ 98 (356)
T KOG3913|consen 47 LCDNLPGLSPRQRRLCRRNPDVM----------------------------PSVAEGAREGIQECQFQFRFRRWNCSTLD 98 (356)
T ss_pred chhhccccCHHHHHHHHhCcchH----------------------------HHHHHHHHHHHHHHHHHHHhhccCCCCCc
Confidence 89999999999999999999999 99999999999999999999999999987
Q ss_pred CCCcccccccccCCcchhhHhHHHhHHHHHHHHhhccCCCCCcccCCCccCCCCCCCCccccCCCCCcCCC---------
Q psy14055 261 NTSQVFGSVLTFKSRETAFVYAISSAGVAYAVTRACSRGELNECSCDNRVRLKKPRTSWQWGGCSERFDRG--------- 331 (493)
Q Consensus 261 ~~~~~f~~~l~~gtREtAFv~AIsSAgv~~~ItraCs~G~l~~C~C~~~~~~~~~~~~w~WgGCsdn~~~g--------- 331 (493)
..++|+++|++|+||+||||||+||||+|+||||||+|.|+.||||...++.+.+++|+||||||||+||
T Consensus 99 -~~~~~g~~l~~g~REsAFv~AIssAgV~havtraCs~G~l~~CgCd~~~~~~~~~~~w~WGGCsDnv~fG~~fsr~FlD 177 (356)
T KOG3913|consen 99 -QLPVFGPLLSRGTRETAFVYAISSAGVAHAVTRACSQGNLESCGCDPSPNGKSGPEGWEWGGCSDNVDFGIRFSRKFLD 177 (356)
T ss_pred -cccccchhhcccchHHHHHHHHHHhHHHHHHHHHhcCCCCCCcCCCCCCCCCCCCCCccccCCCCchHHHHHHHHHhcc
Confidence 6789999999999999999999999999999999999999999999887766555669999999999999
Q ss_pred --------------ccchhHHHHhhhhccchhh-----------------------------hcccccc-----------
Q psy14055 332 --------------NCNRYGLIVVNNQRKRNVK-----------------------------RLRSAVR----------- 357 (493)
Q Consensus 332 --------------~~~~~g~~~v~~~~~~~~k-----------------------------~~~~~~~----------- 357 (493)
|||++||++|.+.+++.+| ||..|.+
T Consensus 178 ~re~~~d~r~lmnlHNNeaGR~av~~~m~~~CKCHGvSGSC~~KTCW~~lp~Fr~vG~~Lk~KYd~A~~V~~~~~~~~~~ 257 (356)
T KOG3913|consen 178 AREKRKDARALMNLHNNEAGRKAVKKNMRRECKCHGVSGSCTVKTCWKQLPDFREVGDYLKEKYDGAIKVTVNNRGRRSA 257 (356)
T ss_pred ccccccCHHHHHHHhhhHHHHHHHHHhhhhcccccCccccchhhhHHhhCccHHHHHHHHHHHhhhheEEeeccCCcccc
Confidence 3899999999877765431 3444431
Q ss_pred --------cCCCCCCcceEEecCCccccccccccccccccchhhhhhhhhhcCCccccccHHHHHhhhccccccC---CC
Q psy14055 358 --------DAKQPNRTELVYMEESPDYCQRNETRVRLWRDIHFGEKFSRDFVDSKEDEDSEEALMNLHNNEAGRR---RS 426 (493)
Q Consensus 358 --------~~~~~~~~~Lvy~~~SpdyC~~~~~~~~~~dnI~fG~~fs~~FlD~~e~~~~~~~lmnlHNn~aGRk---~s 426 (493)
..++|+++||||+|+|||||++|+.+|++ .++||. +|
T Consensus 258 ~~~~~~~~~~~~~~~~dLVYle~SPdfC~~~~~~Gs~--------------------------------GT~GR~Cn~ts 305 (356)
T KOG3913|consen 258 PALRPEKPRFKPPTETDLVYLEDSPDYCERNKKTGSL--------------------------------GTQGRECNKTS 305 (356)
T ss_pred ccccccccccCCCCCCceEEecCCChhhccCccCCCC--------------------------------CCCCcccCCCC
Confidence 23567889999999999999999988774 489999 58
Q ss_pred CCCCCccccccCCCceeEEEEEEEeeccEEEeeeEEeCcccceEEEEEecC
Q psy14055 427 LGLDGCKLLCCGRGYMTRIREVEEKCNCKFVWCCNVKCEICRYKREEYLNP 477 (493)
Q Consensus 427 ~g~~sC~~LCCGRGy~t~~~~~~e~CnCkF~WCC~V~C~~C~~~~~~~~C~ 477 (493)
.++|||++|||||||+|++++++|+|+|||||||+|+|++|++++++||||
T Consensus 306 ~g~dgC~~LCCGRGynt~~~~~~e~C~CkFhWCC~V~C~~C~~~~~v~tCk 356 (356)
T KOG3913|consen 306 RGSDGCDLLCCGRGYNTRRVEVVERCHCKFHWCCYVKCKECRERVEVYTCK 356 (356)
T ss_pred CCCCCCccccCCCCCceeEEEEEEecCCEEEEeeEEECcccccEEEeeecC
Confidence 999999999999999999999999999999999999999999999999997
No 2
>smart00097 WNT1 found in Wnt-1.
Probab=100.00 E-value=2.9e-72 Score=568.83 Aligned_cols=232 Identities=52% Similarity=0.964 Sum_probs=203.7
Q ss_pred hhccCHhHHHHhhcchhhhhHHHhhHhhhhhhhhhhhhhcccccccHHHHHHHHHHHHHHHHHhcccCcccCCCCCCCCc
Q psy14055 185 FDFLGHRQQYLCTLNENILNVYLIVCVLIKYQQVDLIRSTRQVYVYLTVVAAGARMGIEECQNQFKMSRWNCTTFGNTSQ 264 (493)
Q Consensus 185 ~~~l~~~Q~~lC~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~sv~~Ga~~ai~ECq~QFr~~RWNCS~~~~~~~ 264 (493)
+++|+++|+++|+++|++| ++|++|+++|++|||+||+++|||||+.. ..+
T Consensus 2 ~~~l~~~Q~~~C~~~~~~~----------------------------~~v~~g~~~ai~ECq~QF~~~rWNCs~~~-~~~ 52 (305)
T smart00097 2 LPGLSRRQRRLCRANPDVM----------------------------ISVAEGAQEGIEECQHQFRFRRWNCSTLD-NAS 52 (305)
T ss_pred CcccCHHHHHHHHhCHHHH----------------------------HHHHHHHHHHHHHHHHHhCCCCCCCCCCc-CCc
Confidence 5689999999999999999 99999999999999999999999999875 467
Q ss_pred ccccccccCCcchhhHhHHHhHHHHHHHHhhccCCCCCcccCCCccCCCCCCCCccccCCCCCcCCCc------------
Q psy14055 265 VFGSVLTFKSRETAFVYAISSAGVAYAVTRACSRGELNECSCDNRVRLKKPRTSWQWGGCSERFDRGN------------ 332 (493)
Q Consensus 265 ~f~~~l~~gtREtAFv~AIsSAgv~~~ItraCs~G~l~~C~C~~~~~~~~~~~~w~WgGCsdn~~~g~------------ 332 (493)
.|+++|.+||||+||||||+||||+|+|||||++|.|..|+||...++.++..+|+||||+|||+||+
T Consensus 53 ~~~~~l~~~trEtAfv~Ai~sAgv~~~itraCs~G~l~~C~Cd~~~~~~~~~~~w~WgGCsdnv~~G~~~s~~FlD~~~~ 132 (305)
T smart00097 53 VFGKVLRQGTRETAFVYAISSAGVAHAVTRACSEGELDSCGCDYRKRGRAGGRGWKWGGCSDNIDFGIRFSREFVDARER 132 (305)
T ss_pred ccccccccccchHHHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCCCCCCCCCcccCCCCccHHHHHHHHHHHHhcccc
Confidence 89999999999999999999999999999999999999999998766544434699999999999993
Q ss_pred -----------cchhHHHHhhhhccchh-----------------------------hhcccccc---------------
Q psy14055 333 -----------CNRYGLIVVNNQRKRNV-----------------------------KRLRSAVR--------------- 357 (493)
Q Consensus 333 -----------~~~~g~~~v~~~~~~~~-----------------------------k~~~~~~~--------------- 357 (493)
||++||.+|.+.+.+.+ ++|..|.+
T Consensus 133 ~~d~r~lmnlHNn~aGR~~v~~~~~~~CkCHGvSGSC~~kTCw~~l~~Fr~Ig~~Lk~kY~~A~~V~~~~~~~~~~l~~~ 212 (305)
T smart00097 133 GKDARALMNLHNNEAGRLAVKKTMKRECKCHGVSGSCTVKTCWLQLPDFRKVGDYLKEKYDGASEVEVDKRGTNKAPVPK 212 (305)
T ss_pred cccHHHHHHHhhhHHHHHHHHHHHHHhccccCccCCccccchHhhCCCHHHHHHHHHHHhccceEeeecccccccccccC
Confidence 78999998866554422 23444422
Q ss_pred --cCCCCCCcceEEecCCccccccccccccccccchhhhhhhhhhcCCccccccHHHHHhhhccccccC---CCCCCCCc
Q psy14055 358 --DAKQPNRTELVYMEESPDYCQRNETRVRLWRDIHFGEKFSRDFVDSKEDEDSEEALMNLHNNEAGRR---RSLGLDGC 432 (493)
Q Consensus 358 --~~~~~~~~~Lvy~~~SpdyC~~~~~~~~~~dnI~fG~~fs~~FlD~~e~~~~~~~lmnlHNn~aGRk---~s~g~~sC 432 (493)
..++|.++||||+|+|||||++|...|++ .+.||. ++.+++||
T Consensus 213 ~~~~~~~~~~dLVYle~SPdyC~~n~~~G~~--------------------------------GT~GR~Cn~~s~~~~~C 260 (305)
T smart00097 213 NSTFKPPTETDLVYLESSPDFCEKNPKTGSL--------------------------------GTQGRQCNKTSKGLDGC 260 (305)
T ss_pred CccCCCCCCCCeEEeCCCCcccccCCCcCCC--------------------------------CCCCCccCCCCCCCCCh
Confidence 12357889999999999999999887653 488999 56778999
Q ss_pred cccccCCCceeEEEEEEEeeccEEEeeeEEeCcccceEEEEEecC
Q psy14055 433 KLLCCGRGYMTRIREVEEKCNCKFVWCCNVKCEICRYKREEYLNP 477 (493)
Q Consensus 433 ~~LCCGRGy~t~~~~~~e~CnCkF~WCC~V~C~~C~~~~~~~~C~ 477 (493)
++|||||||+|++++++++|||||||||+|+|++|.+++++|+|+
T Consensus 261 ~~LCCgRGy~t~~~~~~~~C~CkF~WCC~V~C~~C~~~~~~~~C~ 305 (305)
T smart00097 261 DLLCCGRGYNTRTVEVVERCNCKFHWCCYVKCKQCRETVEKHTCK 305 (305)
T ss_pred hhcCCCCCceeEEEEEEEecCCEEEEeeEEECccCCcEEEEEEeC
Confidence 999999999999999999999999999999999999999999996
No 3
>PF00110 wnt: wnt family; InterPro: IPR005817 Wnt proteins constitute a large family of secreted molecules that are involved in intercellular signalling during development. The name derives from the first 2 members of the family to be discovered: int-1 (mouse) and wingless (Drosophila). It is now recognised that Wnt signalling controls many cell fate decisions in a variety of different organisms, including mammals. Wnt signalling has been implicated in tumourigenesis, early mesodermal patterning of the embryo, morphogenesis of the brain and kidneys, regulation of mammary gland proliferation and Alzheimer's disease. Wnt-mediated signalling is believed to proceed initially through binding to cell surface receptors of the frizzled family; the signal is subsequently transduced through several cytoplasmic components to beta-catenin, which enters the nucleus and activates the transcription of several genes important in development. Several non-canonical Wnt signalling pathways have also been elucidated that act independently of beta-catenin. Canonical and non-canonical Wnt signalling branches are highly interconnected and cross-regulate each other. Members of the Wnt gene family are defined by their sequence similarity to mouse Wnt-1 and Wingless in Drosophila. They encode proteins of ~350-400 residues in length, with orthologues identified in several, mostly vertebrate, species. Very little is known about the structure of Wnts as they are notoriously insoluble, but they share the following features characteristic of secretory proteins: a signal peptide, several potential N-glycosylation sites and 22 conserved cysteines that are probably involved in disulphide bonds. The Wnt proteins seem to adhere to the plasma membrane of the secreting cells and are therefore likely to signal over only a few cell diameters. Fifteen major Wnt gene families have been identified in vertebrates, with multiple subtypes within some classes.
In humans, 19 Wnt proteins have been identified that share 27% to 83% amino-acid sequence identity and a conserved pattern of 23 or 24 cysteine residues. Wnt genes are highly conserved between vertebrate species, sharing overall sequence identity and gene structure, and are slightly less conserved between vertebrates and invertebrates.; GO: 0005102 receptor binding, 0007275 multicellular organismal development, 0016055 Wnt receptor signaling pathway, 0005576 extracellular region; PDB: 4F0A_B.
Probab=100.00 E-value=2.1e-72 Score=572.28 Aligned_cols=236 Identities=49% Similarity=0.948 Sum_probs=150.8
Q ss_pred ccchhhccCHhHHHHhhcchhhhhHHHhhHhhhhhhhhhhhhhcccccccHHHHHHHHHHHHHHHHHhcccCcccCCCCC
Q psy14055 181 ICTRFDFLGHRQQYLCTLNENILNVYLIVCVLIKYQQVDLIRSTRQVYVYLTVVAAGARMGIEECQNQFKMSRWNCTTFG 260 (493)
Q Consensus 181 ~~~~~~~l~~~Q~~lC~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~sv~~Ga~~ai~ECq~QFr~~RWNCS~~~ 260 (493)
||.+++||+++|+++|+++|++| ++|++|+++|++|||+||+++|||||+..
T Consensus 1 iC~~~~gL~~~Q~~~C~~~p~~m----------------------------~~i~~G~~~ai~ECq~QF~~~RWNCs~~~ 52 (310)
T PF00110_consen 1 ICDKIPGLTKRQRRLCRRNPDLM----------------------------PSIAEGAKMAIEECQHQFRNRRWNCSTVD 52 (310)
T ss_dssp ------------------HHHHH----------------------------HHHHHHHHHHHHHHHHHTTTSSS-----T
T ss_pred CCcccccccccccccccCCHHHH----------------------------HHHHHHHHHHHHHHHHHHhccCCCCCCCc
Confidence 79999999999999999999999 99999999999999999999999999987
Q ss_pred CCCccccc-ccccCCcchhhHhHHHhHHHHHHHHhhccCCCCCcccCCCccCCCCCCCCccccCCCCCcCCCc-------
Q psy14055 261 NTSQVFGS-VLTFKSRETAFVYAISSAGVAYAVTRACSRGELNECSCDNRVRLKKPRTSWQWGGCSERFDRGN------- 332 (493)
Q Consensus 261 ~~~~~f~~-~l~~gtREtAFv~AIsSAgv~~~ItraCs~G~l~~C~C~~~~~~~~~~~~w~WgGCsdn~~~g~------- 332 (493)
..+.|++ ++.+||||+||||||+||||+|+|||||++|.|..|+|+......+.+..|+||||+|||+||.
T Consensus 53 -~~~~f~~~~~~~gtrE~Afv~Ai~sAgv~~~itraCs~G~l~~C~C~~~~~~~~~~~~~~wggCsdni~~G~~~sr~Fl 131 (310)
T PF00110_consen 53 -NNPVFGPPILKKGTRETAFVYAISSAGVAHSITRACSRGKLKSCGCDRNPRGSSSQNTWQWGGCSDNIKFGIKFSRRFL 131 (310)
T ss_dssp --THHHH-TT--S--HHHHHHHHHHHHHHHHHHHHHHHTTT-SS-----TTTTSEEETTEEE-S----HHHHHHHHHHHH
T ss_pred -ccccccccccccCcccceeeEhhhcCchHHHHHHHhhcccCCcCCCcccccccccccccccCCcccccccchHHHHHHH
Confidence 4577888 9999999999999999999999999999999999999998887655444599999999999993
Q ss_pred ----------------cchhHHHHhhhhccchh-----------------------------hhccccccc---------
Q psy14055 333 ----------------CNRYGLIVVNNQRKRNV-----------------------------KRLRSAVRD--------- 358 (493)
Q Consensus 333 ----------------~~~~g~~~v~~~~~~~~-----------------------------k~~~~~~~~--------- 358 (493)
||++||.+|.+.++..+ ++|..|.+.
T Consensus 132 d~~~~~~~~~~~mn~HNn~aGR~~v~~~~~~~CkCHGvSGSC~~ktCw~~l~~f~~Ig~~Lk~kY~~A~~v~~~~~~~~~ 211 (310)
T PF00110_consen 132 DAREKGSDARSLMNLHNNEAGRKAVKKNMKKKCKCHGVSGSCTVKTCWRSLPPFREIGDYLKEKYDSAVKVKPNNNGNRS 211 (310)
T ss_dssp HHH--SSSHHHHHHHHHHHHHHHHHHHT-EEEEEE-SGGG-TTSEEEEEE---HHHHHHHHHHHHHT-EE----------
T ss_pred HhhhhhhhhhhhhhhhhhhhhhhhhcccccceEccccccCCCcceEEEEcCCcHHHHHHHHHHHhhcccccccccccccc
Confidence 78999999877655432 234444321
Q ss_pred ---------CCCCCCcceEEecCCccccccccccccccccchhhhhhhhhhcCCccccccHHHHHhhhccccccC---CC
Q psy14055 359 ---------AKQPNRTELVYMEESPDYCQRNETRVRLWRDIHFGEKFSRDFVDSKEDEDSEEALMNLHNNEAGRR---RS 426 (493)
Q Consensus 359 ---------~~~~~~~~Lvy~~~SpdyC~~~~~~~~~~dnI~fG~~fs~~FlD~~e~~~~~~~lmnlHNn~aGRk---~s 426 (493)
.++|..+||||+++|||||++|+..|++ .+.||. ++
T Consensus 212 ~~~~~~~~~~~~~~~~~LvY~~~SPdyC~~d~~~G~~--------------------------------GT~GR~C~~~~ 259 (310)
T PF00110_consen 212 KPSPRKNKRFKPPTSTDLVYLEKSPDYCEPDPSTGSL--------------------------------GTRGRECNKTS 259 (310)
T ss_dssp -------HHHHTS-SSS-EE-S----TTSEETTTTEE---------------------------------STT-EE----
T ss_pred ccccccccccCCCCCceEEEECCCCcceeeccccccC--------------------------------cccccccCCCC
Confidence 1357899999999999999999877653 488999 45
Q ss_pred CCCCCccccccCCCceeEEEEEEEeeccEEEeeeEEeCcccceEEEEEecC
Q psy14055 427 LGLDGCKLLCCGRGYMTRIREVEEKCNCKFVWCCNVKCEICRYKREEYLNP 477 (493)
Q Consensus 427 ~g~~sC~~LCCGRGy~t~~~~~~e~CnCkF~WCC~V~C~~C~~~~~~~~C~ 477 (493)
.+++||+.|||||||+|++++++|+|||||+|||+|+|++|++++++|+||
T Consensus 260 ~~~~~C~~LCCGrGy~t~~~~~~~~CnCkF~WCC~V~C~~C~~~~~~~~Ck 310 (310)
T PF00110_consen 260 SGPDSCDNLCCGRGYRTRTEEVEEKCNCKFHWCCEVKCDTCKKTVTVYTCK 310 (310)
T ss_dssp THHTHHHHHTGT-EEEEEEEEEEEE-S-B--SSS--B--EEEEEEEEEEE-
T ss_pred CCCCCCceeEcCCccceEEEEEEeeECCEEEeeeEEECCcCcEEEEEEEeC
Confidence 678999999999999999999999999999999999999999999999996
No 4
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=99.02 E-value=2.7e-10 Score=122.30 Aligned_cols=83 Identities=20% Similarity=0.200 Sum_probs=75.3
Q ss_pred cCCHHHHhhcCcchhHHHHhhhCCCCCCCCCCCCChHHHHHHHHHHHhhccCcceEEEEEecCCCCCcceeEEEecCCCC
Q psy14055 21 NPSQETIESQGVEPLTSILDSLGGWPLISPAWTPAQFDMNRLFAQSIRRWSVHHLFSVFVNVDRSDSSQKNVHFGSGATS 100 (493)
Q Consensus 21 ~mD~~~IEk~G~~PL~~~L~~iGGWPvl~s~W~e~~~dl~~~La~l~~~yG~~~Lf~~~V~~D~kNSS~niIyidQ~gLg 100 (493)
-||++.||+.|.+||++.|.+|. .+.+ .-++...|+++.. +|+..+|+|+|++|.||+++|++|+.||||+
T Consensus 88 ~mD~~~~E~~g~~Pl~~~La~i~---~~~s-----~sdf~a~l~~~~~-~g~~~~f~~~vspD~kds~~~v~~~sq~Glg 158 (654)
T COG3590 88 FMDEAKREKAGVDPLKPELAEID---SLAS-----FSDFAAALGQLER-AGQGNPFGFSVSPDFKDSTRYVLYFSQSGLG 158 (654)
T ss_pred hccHHHHHhcCCCchhHHHHHHH---hhcc-----HHHHHHHHHHHHh-ccCCCCceeeeccccccchhheeeeccCCCC
Confidence 49999999999999999999997 4554 5688999999985 8999999999999999999999999999999
Q ss_pred c-chhhccCcccc
Q psy14055 101 V-AEQLGLQESSA 112 (493)
Q Consensus 101 L-~RdYYl~~~~~ 112 (493)
| +++||.++..+
T Consensus 159 LPD~~YY~de~~~ 171 (654)
T COG3590 159 LPDTTYYRDEQHA 171 (654)
T ss_pred CCchhhhhhhhHH
Confidence 9 99999877544
No 5
>PF05649 Peptidase_M13_N: Peptidase family M13 This is family M13 in the peptidase classification; InterPro: IPR008753 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule.
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys, and at least one other residue, which may play an electrophilic role, is required for catalysis. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA, and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens, e.g. Kell. Neprilysin (EC 3.4.24.11) is another member of this group; it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP).
It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion.; GO: 0008237 metallopeptidase activity, 0006508 proteolysis; PDB: 3DWB_A 3ZUK_A 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A.
Probab=98.99 E-value=1e-10 Score=120.10 Aligned_cols=88 Identities=27% Similarity=0.575 Sum_probs=74.8
Q ss_pred CCHHHHhhcCcchhHHHHhhhCCCCCCCCCCCCChHHHHHHHHHHHhhccCcceEEEEEecCCCCCcceeEEEecCCCCc
Q psy14055 22 PSQETIESQGVEPLTSILDSLGGWPLISPAWTPAQFDMNRLFAQSIRRWSVHHLFSVFVNVDRSDSSQKNVHFGSGATSV 101 (493)
Q Consensus 22 mD~~~IEk~G~~PL~~~L~~iGGWPvl~s~W~e~~~dl~~~La~l~~~yG~~~Lf~~~V~~D~kNSS~niIyidQ~gLgL 101 (493)
||++++++.|.+||.++|+++|+||.++ +|+++.++|.++++.+...+|.++||++.|.+|..|++.++|+|++|.++|
T Consensus 71 ~~~~~~~~~~~~~l~~~l~~~~~~p~~~-~~~~~~~~~~~~l~~l~~~~~~~~l~~~~v~~d~~~~~~~~l~i~~~~~~l 149 (390)
T PF05649_consen 71 MDTDAREKDGLEPLKEFLRSIGGWPFLS-DWNESKFDLLDTLARLSRRYGIDPLFSLYVDPDPQNPSKYILYIDPPELGL 149 (390)
T ss_dssp H-HHHHHHHTTHHHHHHHHHCTCBCCCS-SHHTTCCHHHHHHHHHHHTC---SSSEEEEEEETTEEEEEEEEEEE---SS
T ss_pred HHhhhcchhhhhhHHHHHHHhhhcccCC-cccCCHhHHHHHHHHHHhhccccceeeeEeeccccchheeEeecccCCCCC
Confidence 7889999999999999999999999884 577788999999999998889999999999999999999999999999999
Q ss_pred -chhhccCcc
Q psy14055 102 -AEQLGLQES 110 (493)
Q Consensus 102 -~RdYYl~~~ 110 (493)
.++||.++.
T Consensus 150 ~~~~~~~~~~ 159 (390)
T PF05649_consen 150 PSKEYYRDPH 159 (390)
T ss_dssp SSGGGGCTCG
T ss_pred cchHHhhcch
Confidence 888887654
No 6
>KOG3913|consensus
Probab=98.73 E-value=2.8e-09 Score=110.01 Aligned_cols=56 Identities=41% Similarity=0.787 Sum_probs=49.8
Q ss_pred ccchhhhhhhhhhcCCccccccHHHHHhhhccccccC----------CCCC-CCCccccccCCCcee
Q psy14055 388 RDIHFGEKFSRDFVDSKEDEDSEEALMNLHNNEAGRR----------RSLG-LDGCKLLCCGRGYMT 443 (493)
Q Consensus 388 dnI~fG~~fs~~FlD~~e~~~~~~~lmnlHNn~aGRk----------~s~g-~~sC~~LCCGRGy~t 443 (493)
|||.||..||++|+|++|+++|++++||||||+|||+ --+| ++||..-.|......
T Consensus 163 Dnv~fG~~fsr~FlD~re~~~d~r~lmnlHNNeaGR~av~~~m~~~CKCHGvSGSC~~KTCW~~lp~ 229 (356)
T KOG3913|consen 163 DNVDFGIRFSRKFLDAREKRKDARALMNLHNNEAGRKAVKKNMRRECKCHGVSGSCTVKTCWKQLPD 229 (356)
T ss_pred CchHHHHHHHHHhccccccccCHHHHHHHhhhHHHHHHHHHhhhhcccccCccccchhhhHHhhCcc
Confidence 8999999999999999999999999999999999999 1356 689999999876443
No 7
>smart00097 WNT1 found in Wnt-1.
Probab=98.49 E-value=3.2e-08 Score=101.53 Aligned_cols=56 Identities=45% Similarity=0.745 Sum_probs=49.8
Q ss_pred ccchhhhhhhhhhcCCccccccHHHHHhhhccccccC----------CCCC-CCCccccccCCCcee
Q psy14055 388 RDIHFGEKFSRDFVDSKEDEDSEEALMNLHNNEAGRR----------RSLG-LDGCKLLCCGRGYMT 443 (493)
Q Consensus 388 dnI~fG~~fs~~FlD~~e~~~~~~~lmnlHNn~aGRk----------~s~g-~~sC~~LCCGRGy~t 443 (493)
|||.||..||++|+|+++..++++++||||||+|||+ --+| ++||..-.|.+....
T Consensus 114 dnv~~G~~~s~~FlD~~~~~~d~r~lmnlHNn~aGR~~v~~~~~~~CkCHGvSGSC~~kTCw~~l~~ 180 (305)
T smart00097 114 DNIDFGIRFSREFVDARERGKDARALMNLHNNEAGRLAVKKTMKRECKCHGVSGSCTVKTCWLQLPD 180 (305)
T ss_pred ccHHHHHHHHHHHHhcccccccHHHHHHHhhhHHHHHHHHHHHHHhccccCccCCccccchHhhCCC
Confidence 7999999999999999988889999999999999999 2356 689999999986543
No 8
>KOG3624|consensus
Probab=98.29 E-value=9.4e-07 Score=99.70 Aligned_cols=84 Identities=29% Similarity=0.563 Sum_probs=73.8
Q ss_pred CCHHHHhhc-CcchhHHHHhhhCCCCCCCCCCCCChHHHHHHHHHHHhhccCcceEEEEEecCCCCCcceeEEEecCCCC
Q psy14055 22 PSQETIESQ-GVEPLTSILDSLGGWPLISPAWTPAQFDMNRLFAQSIRRWSVHHLFSVFVNVDRSDSSQKNVHFGSGATS 100 (493)
Q Consensus 22 mD~~~IEk~-G~~PL~~~L~~iGGWPvl~s~W~e~~~dl~~~La~l~~~yG~~~Lf~~~V~~D~kNSS~niIyidQ~gLg 100 (493)
|+....+.. +..||.++|+.+||||++.++|++.+|+|.++++.+..+||..+||.+.|..|..|++ |+.++
T Consensus 127 ~~~~~~~~~~~~~~l~~~i~~~G~wP~l~~~w~~~~f~~~~~l~~~~~~yg~~~l~~~~v~~~~~~~~-------~~~~~ 199 (687)
T KOG3624|consen 127 LDAKALESSGALQLLFRIIQSIGGWPLLEGNWDESKFNLNEMLANLLRRYGLTTLFLLEVALDYKNSS-------QPGLI 199 (687)
T ss_pred hchhhhhhhcchHHHHHHHHHhCCCcCCCCCCCcccCCHHHHHHHHHHHcCccceeEEEEecccccCc-------ccccC
Confidence 555555555 5889999999999999999889999999999999977779999999999999999998 99999
Q ss_pred c-chhhccCcccc
Q psy14055 101 V-AEQLGLQESSA 112 (493)
Q Consensus 101 L-~RdYYl~~~~~ 112 (493)
+ .+++|......
T Consensus 200 l~~~~~~~~~~~~ 212 (687)
T KOG3624|consen 200 LPSRSKYLATDSD 212 (687)
T ss_pred cchHhhhhccccH
Confidence 9 78999866554
No 9
>PF00110 wnt: wnt family; InterPro: IPR005817 Wnt proteins constitute a large family of secreted molecules that are involved in intercellular signalling during development. The name derives from the first 2 members of the family to be discovered: int-1 (mouse) and wingless (Drosophila). It is now recognised that Wnt signalling controls many cell fate decisions in a variety of different organisms, including mammals. Wnt signalling has been implicated in tumourigenesis, early mesodermal patterning of the embryo, morphogenesis of the brain and kidneys, regulation of mammary gland proliferation and Alzheimer's disease. Wnt-mediated signalling is believed to proceed initially through binding to cell surface receptors of the frizzled family; the signal is subsequently transduced through several cytoplasmic components to beta-catenin, which enters the nucleus and activates the transcription of several genes important in development. Several non-canonical Wnt signalling pathways have also been elucidated that act independently of beta-catenin. Canonical and non-canonical Wnt signalling branches are highly interconnected and cross-regulate each other. Members of the Wnt gene family are defined by their sequence similarity to mouse Wnt-1 and Wingless in Drosophila. They encode proteins of ~350-400 residues in length, with orthologues identified in several, mostly vertebrate, species. Very little is known about the structure of Wnts as they are notoriously insoluble, but they share the following features characteristic of secretory proteins: a signal peptide, several potential N-glycosylation sites and 22 conserved cysteines that are probably involved in disulphide bonds. The Wnt proteins seem to adhere to the plasma membrane of the secreting cells and are therefore likely to signal over only a few cell diameters. Fifteen major Wnt gene families have been identified in vertebrates, with multiple subtypes within some classes.
In humans, 19 Wnt proteins have been identified that share 27% to 83% amino-acid sequence identity and a conserved pattern of 23 or 24 cysteine residues. Wnt genes are highly conserved between vertebrate species, sharing overall sequence identity and gene structure, and are slightly less conserved between vertebrates and invertebrates.; GO: 0005102 receptor binding, 0007275 multicellular organismal development, 0016055 Wnt receptor signaling pathway, 0005576 extracellular region; PDB: 4F0A_B.
Probab=98.10 E-value=3.4e-07 Score=94.29 Aligned_cols=57 Identities=44% Similarity=0.764 Sum_probs=42.9
Q ss_pred cccchhhhhhhhhhcCCccccccHHHHHhhhccccccC----------CCCC-CCCccccccCCCcee
Q psy14055 387 WRDIHFGEKFSRDFVDSKEDEDSEEALMNLHNNEAGRR----------RSLG-LDGCKLLCCGRGYMT 443 (493)
Q Consensus 387 ~dnI~fG~~fs~~FlD~~e~~~~~~~lmnlHNn~aGRk----------~s~g-~~sC~~LCCGRGy~t 443 (493)
.|||.||..||++|+|+++..++.+++||+|||+|||+ --+| ++||..-.|.|....
T Consensus 117 sdni~~G~~~sr~Fld~~~~~~~~~~~mn~HNn~aGR~~v~~~~~~~CkCHGvSGSC~~ktCw~~l~~ 184 (310)
T PF00110_consen 117 SDNIKFGIKFSRRFLDAREKGSDARSLMNLHNNEAGRKAVKKNMKKKCKCHGVSGSCTVKTCWRSLPP 184 (310)
T ss_dssp ---HHHHHHHHHHHHHHH--SSSHHHHHHHHHHHHHHHHHHHT-EEEEEE-SGGG-TTSEEEEEE---
T ss_pred ccccccchHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhcccccceEccccccCCCcceEEEEcCCc
Confidence 37999999999999999998899999999999999999 1255 689999999876543
No 10
>PHA00626 hypothetical protein
Probab=27.90 E-value=22 Score=28.05 Aligned_cols=28 Identities=29% Similarity=0.265 Sum_probs=23.1
Q ss_pred EeCcccceEEEEEecCCchhhhhhhcccc
Q psy14055 462 VKCEICRYKREEYLNPRNCEKLFSKRAKG 490 (493)
Q Consensus 462 V~C~~C~~~~~~~~C~~~~~~~~~~~~~~ 490 (493)
|+|.+|+.....|.| +.|.--|+|.|-|
T Consensus 12 vrcg~cr~~snrYkC-kdCGY~ft~~~~~ 39 (59)
T PHA00626 12 AKEKTMRGWSDDYVC-CDCGYNDSKDAFG 39 (59)
T ss_pred eeeceecccCcceEc-CCCCCeechhhhh
Confidence 689999999999999 5677788887755
No 11
>PF07459 CTX_RstB: CTX phage RstB protein; InterPro: IPR010008 This family contains a number of RstB proteins approximately 120 residues long, including RstB1 and RstB2, from the Vibrio cholerae phage CTX. Functional analyses indicate that rstB2 is required for integration of the CTXphi phage into the V. cholerae chromosome.
Probab=23.69 E-value=91 Score=28.10 Aligned_cols=31 Identities=23% Similarity=0.187 Sum_probs=24.1
Q ss_pred HHHHHHhhccCcceEEEEEecCCCCCcceeEE
Q psy14055 62 LFAQSIRRWSVHHLFSVFVNVDRSDSSQKNVH 93 (493)
Q Consensus 62 ~La~l~~~yG~~~Lf~~~V~~D~kNSS~niIy 93 (493)
+++++. ..-...+..+..++|+.|+++|++-
T Consensus 70 ll~~~~-~i~fP~~veL~lepdP~dpsrNiVv 100 (117)
T PF07459_consen 70 LLAKFK-QIQFPVLVELELEPDPEDPSRNIVV 100 (117)
T ss_pred HHHHHh-cCcCceEEEEEccCCCCCCcccEEE
Confidence 444444 3556788999999999999999874
No 12
>smart00412 Cu_FIST Copper-Fist. Binds DNA only in the presence of copper or silver
Probab=22.12 E-value=22 Score=26.05 Aligned_cols=30 Identities=23% Similarity=0.363 Sum_probs=27.1
Q ss_pred eCcccceEEEEEecCCchhhhhhhccccCC
Q psy14055 463 KCEICRYKREEYLNPRNCEKLFSKRAKGQL 492 (493)
Q Consensus 463 ~C~~C~~~~~~~~C~~~~~~~~~~~~~~~~ 492 (493)
-|+.|.+--...+|+++-..||.-+.||.-
T Consensus 9 aC~~CirGHR~s~C~H~dRpL~~i~kkGRP 38 (39)
T smart00412 9 ACESCIRGHRSSTCNHNDRPLIPVRPRGRP 38 (39)
T ss_pred cCHHHHCcCccCCcccCCccceeecCCCCC
Confidence 488999999999999999999999999863
Done!