Query psy10730
Match_columns 150
No_of_seqs 33 out of 35
Neff 2.8
Searched_HMMs 46136
Date Fri Aug 16 17:06:17 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10730.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10730hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00117 LU Ly-6 antigen / uPA 99.2 6.9E-12 1.5E-16 84.8 3.2 75 26-118 1-77 (79)
2 PF00021 UPAR_LY6: u-PAR/Ly-6 99.0 1.5E-10 3.2E-15 73.1 2.5 77 28-118 1-77 (77)
3 smart00134 LU Ly-6 antigen / u 98.9 1E-09 2.2E-14 74.6 4.5 78 26-118 1-79 (79)
4 cd00206 snake_toxin Snake toxi 98.5 1.2E-07 2.6E-12 64.2 3.1 64 26-118 1-64 (64)
5 PF00087 Toxin_1: Snake toxin; 98.2 1.2E-06 2.6E-11 59.3 3.5 63 26-118 1-63 (63)
6 PF02988 PLA2_inh: Phospholipa 97.9 1.1E-05 2.3E-10 58.9 3.5 78 27-118 2-82 (83)
7 PF01064 Activin_recp: Activin 96.2 0.0032 6.8E-08 42.6 2.1 74 26-118 1-76 (83)
8 KOG2052|consensus 89.0 1.1 2.4E-05 42.2 6.2 78 25-122 34-112 (513)
9 PF06211 BAMBI: BMP and activi 89.0 0.29 6.3E-06 37.8 2.1 75 25-120 24-103 (107)
10 PF05444 DUF753: Protein of un 83.3 1.5 3.3E-05 33.5 3.4 55 45-119 21-75 (152)
11 PF05444 DUF753: Protein of un 61.8 11 0.00024 28.7 3.5 45 24-84 81-131 (152)
12 PF02178 AT_hook: AT hook moti 51.7 6.9 0.00015 20.1 0.6 10 4-13 1-10 (13)
13 KOG3653|consensus 37.3 40 0.00087 32.3 3.6 74 27-119 44-127 (534)
14 PF02428 Prot_inhib_II: Potato 31.7 25 0.00055 24.1 1.0 17 108-124 29-46 (52)
15 smart00384 AT_hook DNA binding 27.5 36 0.00079 20.6 1.1 11 4-14 1-11 (26)
16 PF01684 ET: ET module; Inter 22.5 45 0.00098 24.2 1.0 43 74-118 35-78 (82)
No 1
>cd00117 LU Ly-6 antigen / uPA receptor -like domain; occurs singly in GPI-linked cell-surface glycoproteins (Ly-6 family,CD59, thymocyte B cell antigen, Sgp-2) or as three-fold repeated domain in urokinase-type plasminogen activator receptor. Topology of these domains is similar to that of snake venom neurotoxins.
Probab=99.22 E-value=6.9e-12 Score=84.79 Aligned_cols=75 Identities=32% Similarity=0.619 Sum_probs=56.8
Q ss_pred eEEeecCCCCCCHHHHhcccccccCCCCeeEeeecCCCCCeeccC--CceeeEEecccCCchhchHhhhccCCCcceeec
Q psy10730 26 LECYVCTNQEANTEKCLKTIKTCDQDEDACLSTINWGSPPYWSQG--ATKQYYVSKSCASKKFCEEFKKKNMPSCTYIWY 103 (150)
Q Consensus 26 LeCYvC~~q~~N~ekCLntiktC~~~ed~ClT~I~wgs~py~s~G--a~kQ~yVSK~Ca~k~~C~~~~~~~m~~ct~iwy 103 (150)
|+||+|.++ |.+.|++ .++|++++|+|++. ++... ..+...+.|+|+. .|+......++.-.++
T Consensus 1 L~C~~C~~~--~~~~C~~-~~~C~~~~~~C~~~-------~~~~~~~~~~~~~~~rgC~~--~C~~~~~~~~~~~~~~-- 66 (79)
T cd00117 1 LECYSCTGV--STSSCST-ETNCPSPDDQCLTA-------VATVIEESVRLSLVVRGCAS--DCPFTNVFGQLSITFL-- 66 (79)
T ss_pred CccCcCCCC--CCCCCCC-CCccCCCCCEeeEE-------EEEEEeeccccceEECcccC--CCCCCCccCccccceE--
Confidence 789999998 3468998 66999999999986 33322 2245579999999 5998877655322232
Q ss_pred ccccccCCCCCCCCC
Q psy10730 104 QDWKCSECCSGDRCN 118 (150)
Q Consensus 104 ~dw~c~eCC~gD~CN 118 (150)
.+.||++|+||
T Consensus 67 ----~~~CC~tD~CN 77 (79)
T cd00117 67 ----KVSCCQEDLCN 77 (79)
T ss_pred ----eeeeCCCCccC
Confidence 58999999999
No 2
>PF00021 UPAR_LY6: u-PAR/Ly-6 domain omitted due to poor similarity.; InterPro: IPR001526 CD59 (also called 1F-5Ag, H19, HRF20, MACIF, MIRL, P-18 or protectin) inhibits formation of membrane attack complex (MAC), thus protecting cells from complement mediated lysis. It has a signalling role, as a GPI-anchored molecule, in T cell activation and appears to have some role in cell adhesion through CD2 (controversial). CD59 associates with C9, inhibiting incorporation into C5b-8 preventing terminal steps in polymerisation of the (MAC) in plasma membranes. Genetic defects in GPI-anchor attachment that cause a reduction or loss of both CD59 and CD55 on erythrocytes produce the symptoms of the disease paroxysmal nocturnal haemoglobinuria (PNH). A variety of GPI-linked cell-surface glycoproteins are composed of one or more copies of a conserved domain of about 100 amino-acid residues [], []. Among these proteins, U-PAR contains three tandem copies of the domain, while all the others are made up of a single domain. As shown in the following schematic, this conserved domain contains 10 cysteine residues involved in five disulphide bonds - in U-PAR, the first copy of the domain lacks the fourth disulphide bond. +------+ +------------------------+ +---+ | | | | | | xCxxCxxxxxxCxxxxxCxxxxxCxxxxxxxxxxxxxxxxxxCxxxxCxxxxxxxxxxxxxxCCxxxCxxxxxxxx | | | | +---------------------+ +--------------+ 'C': conserved cysteine involved in a disulphide bond. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/).; PDB: 2I9B_H 1YWH_E 3BT2_U 2FD6_U 3U74_U 3BT1_U 3U73_U 2L03_A 3LAQ_V 2J8B_A ....
Probab=99.02 E-value=1.5e-10 Score=73.11 Aligned_cols=77 Identities=27% Similarity=0.560 Sum_probs=55.3
Q ss_pred EeecCCCCCCHHHHhcccccccCCCCeeEeeecCCCCCeeccCCceeeEEecccCCchhchHhhhccCCCcceeeccccc
Q psy10730 28 CYVCTNQEANTEKCLKTIKTCDQDEDACLSTINWGSPPYWSQGATKQYYVSKSCASKKFCEEFKKKNMPSCTYIWYQDWK 107 (150)
Q Consensus 28 CYvC~~q~~N~ekCLntiktC~~~ed~ClT~I~wgs~py~s~Ga~kQ~yVSK~Ca~k~~C~~~~~~~m~~ct~iwy~dw~ 107 (150)
||+|..+ ++ ..|... ++|+..+++|++..- ....|..+...+.|+|+..+.|.......... . .+.
T Consensus 1 C~~C~~~-~~-~~C~~~-~~C~~~~~~C~~~~~-----~~~~g~~~~~~~~kgC~~~~~C~~~~~~~~~~--~----~~~ 66 (77)
T PF00021_consen 1 CYSCYSP-SS-SSCSGE-VTCPGSEDVCYTATV-----TVTSGGNSVTLVSKGCATQSSCRFLSDFSNGG--I----IKV 66 (77)
T ss_dssp EEEEECS-TT-TTSCTE-EEECTTTSEEEEEEE-----EEETTTCCEEEEEEEEEETTCCSEEEEEESSS--E----ECE
T ss_pred CCcccCC-CC-CCCCCC-CCcCCCCCeeEEEEE-----EeeecccCceEEEeeccccCCccccccccccc--e----eEE
Confidence 8999998 43 699985 999999999999531 44555567789999999888893222221111 1 122
Q ss_pred ccCCCCCCCCC
Q psy10730 108 CSECCSGDRCN 118 (150)
Q Consensus 108 c~eCC~gD~CN 118 (150)
-++||++||||
T Consensus 67 ~~~CC~~dlCN 77 (77)
T PF00021_consen 67 SVSCCNTDLCN 77 (77)
T ss_dssp EEEEESSTTTT
T ss_pred EEEeCCCCCCC
Confidence 56999999999
No 3
>smart00134 LU Ly-6 antigen / uPA receptor -like domain. Three-fold repeated domain in urokinase-type plasminogen activator receptor; occurs singly in other GPI-linked cell-surface glycoproteins (Ly-6 family, CD59, thymocyte B cell antigen, Sgp-2). Topology of these domains is similar to that of snake venom neurotoxins.
Probab=98.95 E-value=1e-09 Score=74.58 Aligned_cols=78 Identities=27% Similarity=0.617 Sum_probs=54.2
Q ss_pred eEEeecCCCCCCHHHHhcccccccCCCCeeEeeecCCCCCeeccCCceeeEEecccCCchhchHhhhcc-CCCcceeecc
Q psy10730 26 LECYVCTNQEANTEKCLKTIKTCDQDEDACLSTINWGSPPYWSQGATKQYYVSKSCASKKFCEEFKKKN-MPSCTYIWYQ 104 (150)
Q Consensus 26 LeCYvC~~q~~N~ekCLntiktC~~~ed~ClT~I~wgs~py~s~Ga~kQ~yVSK~Ca~k~~C~~~~~~~-m~~ct~iwy~ 104 (150)
|+||+|.++. + ..|.. .++|++++|+|++.+. -.|..+. +..|+|+|+....|+...... .+.-..+
T Consensus 1 L~C~~C~~~~-~-~~C~~-~~~C~~~~~~C~~~~~----~~~~~~~--~~~~~rgC~~~~~Cp~~~~~~~~~~~~~~--- 68 (79)
T smart00134 1 LQCYSCTGNP-D-SSCSS-EEECRSPDDVCLTVVA----EVISGSV--GSVVYKGCATSPICPGSHSYEIHLTIANL--- 68 (79)
T ss_pred CcceecCCcC-C-CCCCC-cceeCCCCCEeEEEEE----EEEecCC--CceEEcCccCCCCCCCCccccccccEEEE---
Confidence 7899998865 3 47977 7789999999998631 1244443 457999999655799776411 1111122
Q ss_pred cccccCCCCCCCCC
Q psy10730 105 DWKCSECCSGDRCN 118 (150)
Q Consensus 105 dw~c~eCC~gD~CN 118 (150)
-+.||++|+||
T Consensus 69 ---~~~CC~tDlCN 79 (79)
T smart00134 69 ---KVSCCQTDLCN 79 (79)
T ss_pred ---eeEecCCCCCC
Confidence 47899999999
No 4
>cd00206 snake_toxin Snake toxin domain, present in short and long neurotoxins, cytotoxins and short toxins, and in other miscellaneous venom peptides. The toxin acts by binding to the nicotinic acetylcholine receptors in the postsynaptic membrane of skeletal muscles and preventing the binding of acetylcholine, thereby blocking the excitation of muscles. This domain contains 60-75 amino acids that are fixed by 4-5 disulfide bridges and is nearly all beta sheet; it exists as either monomers or dimers.
Probab=98.47 E-value=1.2e-07 Score=64.18 Aligned_cols=64 Identities=28% Similarity=0.787 Sum_probs=48.5
Q ss_pred eEEeecCCCCCCHHHHhcccccccCCCCeeEeeecCCCCCeeccCCceeeEEecccCCchhchHhhhccCCCcceeeccc
Q psy10730 26 LECYVCTNQEANTEKCLKTIKTCDQDEDACLSTINWGSPPYWSQGATKQYYVSKSCASKKFCEEFKKKNMPSCTYIWYQD 105 (150)
Q Consensus 26 LeCYvC~~q~~N~ekCLntiktC~~~ed~ClT~I~wgs~py~s~Ga~kQ~yVSK~Ca~k~~C~~~~~~~m~~ct~iwy~d 105 (150)
|.||.|..+. ++. .++|+++|+.|.+. +|... ...+.|.++|+.. |++.+.. .
T Consensus 1 ~~C~~~~~~~-----~~~-~~tC~~ge~~Cyk~-------~~~~~-~~g~~i~rGCa~t--CP~~~~~---~-------- 53 (64)
T cd00206 1 LTCYNCPSPF-----PFT-TETCPDGENICYKK-------SWKDT-PRGVRIERGCAAT--CPKVKPG---E-------- 53 (64)
T ss_pred CCccCCCCCC-----CCc-ceeCCcccCcccee-------EEEcC-CCCCEEEccccCc--CcCCCCC---c--------
Confidence 5789887753 333 89999999999987 56541 2234699999995 9977543 2
Q ss_pred ccccCCCCCCCCC
Q psy10730 106 WKCSECCSGDRCN 118 (150)
Q Consensus 106 w~c~eCC~gD~CN 118 (150)
-+.||.+|+||
T Consensus 54 --~v~CC~TD~CN 64 (64)
T cd00206 54 --YVTCCTTDKCN 64 (64)
T ss_pred --ceEecCCCCCC
Confidence 38999999999
No 5
>PF00087 Toxin_1: Snake toxin; InterPro: IPR003571 Snake toxins belong to a family of proteins [] which groups short and long neurotoxins, cytotoxins and short toxins, as well as a other miscellaneous venom peptides. Most of these toxins act by binding to the nicotinic acetylcholine receptors in the postsynaptic membrane of skeletal muscles and prevent the binding of acetylcholine, thereby blocking the excitation of muscles. Snake toxins are proteins that consist of sixty to seventy five amino acids. Among the invariant residues are eight cysteines all involved in disulphide bonds. The structure is small, disulphide-rich, nearly all beta sheet.; GO: 0005576 extracellular region; PDB: 3NEQ_B 2H5F_A 3PLC_A 1NOR_A 1UG4_A 1FFJ_A 1CCQ_A 1CB9_A 1VB0_A 1ONJ_A ....
Probab=98.23 E-value=1.2e-06 Score=59.28 Aligned_cols=63 Identities=32% Similarity=0.788 Sum_probs=46.6
Q ss_pred eEEeecCCCCCCHHHHhcccccccCCCCeeEeeecCCCCCeeccCCceeeEEecccCCchhchHhhhccCCCcceeeccc
Q psy10730 26 LECYVCTNQEANTEKCLKTIKTCDQDEDACLSTINWGSPPYWSQGATKQYYVSKSCASKKFCEEFKKKNMPSCTYIWYQD 105 (150)
Q Consensus 26 LeCYvC~~q~~N~ekCLntiktC~~~ed~ClT~I~wgs~py~s~Ga~kQ~yVSK~Ca~k~~C~~~~~~~m~~ct~iwy~d 105 (150)
|.||.+.. .. .. .++|+++|+.|.+. +|..+ .....|.++|+.. |+..+. +.
T Consensus 1 ~~C~~~~~-~~-----~~-~~tC~~ge~~Cy~k-------~~~~~-~~~~~~~rGCa~t--CP~~~~---~~-------- 52 (63)
T PF00087_consen 1 RTCYNCPS-SC-----FT-TKTCPEGENICYKK-------TWMDH-PRGMRVERGCAAT--CPTPKP---GE-------- 52 (63)
T ss_dssp -EEEESST-SS-----TE-EEEESTTSSEEEEE-------EEEET-STEEEEEEEEESS--SSSSBS---SE--------
T ss_pred CCccCCCC-CC-----Cc-ceECCCccCeeEEE-------EEecC-CCCceEecchhcc--CcCCCC---CC--------
Confidence 56887776 22 23 89999999999986 66663 3344899999984 987655 22
Q ss_pred ccccCCCCCCCCC
Q psy10730 106 WKCSECCSGDRCN 118 (150)
Q Consensus 106 w~c~eCC~gD~CN 118 (150)
-+.||.+|+||
T Consensus 53 --~v~CC~TD~CN 63 (63)
T PF00087_consen 53 --KVKCCSTDKCN 63 (63)
T ss_dssp --EEEEESSTTTT
T ss_pred --eEEEcCCCCCC
Confidence 37899999999
No 6
>PF02988 PLA2_inh: Phospholipase A2 inhibitor; InterPro: IPR004126 Proteins in this entry inhibit basic phospholipase A2 isozymes in snake's venom [, ].; GO: 0004859 phospholipase inhibitor activity, 0005576 extracellular region
Probab=97.91 E-value=1.1e-05 Score=58.85 Aligned_cols=78 Identities=29% Similarity=0.616 Sum_probs=55.8
Q ss_pred EEeecCCCCCCHHHHhcccccccCCCCeeEeeecCCCCCeecc--CCceeeEEecccCCchhchHhhhc-cCCCcceeec
Q psy10730 27 ECYVCTNQEANTEKCLKTIKTCDQDEDACLSTINWGSPPYWSQ--GATKQYYVSKSCASKKFCEEFKKK-NMPSCTYIWY 103 (150)
Q Consensus 27 eCYvC~~q~~N~ekCLntiktC~~~ed~ClT~I~wgs~py~s~--Ga~kQ~yVSK~Ca~k~~C~~~~~~-~m~~ct~iwy 103 (150)
+|.+|.+. + .+|....++|+++||.|.+.+ +-. +.....++.|.|.+...|...... .||.=.++
T Consensus 2 ~CEvC~~~-G--~~C~G~~~tC~~~eDtC~~~~-------~E~~~~~~s~~~~~K~C~~S~~C~~~~~~~n~g~~~y~-- 69 (83)
T PF02988_consen 2 SCEVCHGS-G--KDCSGKMKTCEDGEDTCVTVV-------TEVSSAGVSFRTTHKGCFSSSDCHLGYVSTNMGHGDYM-- 69 (83)
T ss_pred CcceecCc-C--CCCCCCeeEcCCCCCEEEEEE-------EEeccCCeEEEEEEecccCccccCCCCEEEecCCCCEE--
Confidence 69999993 3 379988999999999999952 222 223455889999999999876665 34420000
Q ss_pred ccccccCCCCCCCCC
Q psy10730 104 QDWKCSECCSGDRCN 118 (150)
Q Consensus 104 ~dw~c~eCC~gD~CN 118 (150)
=.-..||++|.||
T Consensus 70 --rs~~~CC~gd~C~ 82 (83)
T PF02988_consen 70 --RSRIHCCQGDGCN 82 (83)
T ss_pred --EeeeeeECCCccC
Confidence 0147899999999
No 7
>PF01064 Activin_recp: Activin types I and II receptor domain; InterPro: IPR000472 Transforming growth factor-beta (TGF-beta) forms a family with other growth factors described in PDOC00223 from PROSITEDOC. The receptors for most of the members of this growth factor family are related. These proteins are receptor-type kinases of Ser/Thr type PDOC00100 from PROSITEDOC), which have a single transmembrane domain and a specific hydrophilic Cys-rich ligand-binding domain [, , ]. The C-terminal part of the extracellular domain is conserved. Some of the receptors of this family contain subclass-specific N-terminal extensions of this homology domain. The type I receptors also possess 7 extracellular residues preceding the cysteine box.; GO: 0004675 transmembrane receptor protein serine/threonine kinase activity, 0005024 transforming growth factor beta-activated receptor activity, 0016020 membrane; PDB: 3EVS_C 1BTE_A 1LX5_B 2GOO_C 2HLR_A 4FAO_W 2H64_C 2H62_D 1S4Y_C 1NYU_C ....
Probab=96.23 E-value=0.0032 Score=42.57 Aligned_cols=74 Identities=28% Similarity=0.574 Sum_probs=47.3
Q ss_pred eEEeecCCCCCCHHHHhcccccccCCCCeeEeeecCCCCCeeccCCce-eeEEecccCCchhchHhhhccCCCcceeecc
Q psy10730 26 LECYVCTNQEANTEKCLKTIKTCDQDEDACLSTINWGSPPYWSQGATK-QYYVSKSCASKKFCEEFKKKNMPSCTYIWYQ 104 (150)
Q Consensus 26 LeCYvC~~q~~N~ekCLntiktC~~~ed~ClT~I~wgs~py~s~Ga~k-Q~yVSK~Ca~k~~C~~~~~~~m~~ct~iwy~ 104 (150)
|+|| |.... |..+-.+|+. ..+|.+.+ .....+ ...|.|+|...+.-..... -.|.. ...
T Consensus 1 l~C~-C~~~~-----C~~~n~~C~t-~~~Cf~~~--------~~~~~~~~~~~~~GC~~~~~~~~~~~---~~C~~-~~~ 61 (83)
T PF01064_consen 1 LKCY-CDSDD-----CNETNQTCET-GGFCFTSW--------EQDNGGGVEIVKKGCWSNEEDCPLCR---NQCRS-SSS 61 (83)
T ss_dssp EEEE-EETTC-----HCTTTEEEEE-SCEEEEEE--------EEETTESEEEEEEEEEESTGGGTTTS---SSEEE-CSS
T ss_pred CEeE-eCccc-----CCCCCCEeCC-CCEEEEEE--------EEeeccceeEEEcCccCCccCcceec---ceeec-cCC
Confidence 7899 88733 7665588887 88999974 333332 5588899998754333211 12322 112
Q ss_pred cccccCCCC-CCCCC
Q psy10730 105 DWKCSECCS-GDRCN 118 (150)
Q Consensus 105 dw~c~eCC~-gD~CN 118 (150)
.+.-+.||. +|+||
T Consensus 62 ~~~~~~CC~~~D~CN 76 (83)
T PF01064_consen 62 PSSVVFCCCSGDFCN 76 (83)
T ss_dssp STCEEEEEESSTTGG
T ss_pred CCCeEEECCCCCccC
Confidence 245678888 99999
No 8
>KOG2052|consensus
Probab=89.03 E-value=1.1 Score=42.17 Aligned_cols=78 Identities=24% Similarity=0.549 Sum_probs=46.7
Q ss_pred eeEEeecCCCCCCHHHHhcccccccCCCCeeEeeecCCCCCeeccCCceeeEEecccCCchhchHhhhccCCC-cceeec
Q psy10730 25 GLECYVCTNQEANTEKCLKTIKTCDQDEDACLSTINWGSPPYWSQGATKQYYVSKSCASKKFCEEFKKKNMPS-CTYIWY 103 (150)
Q Consensus 25 aLeCYvC~~q~~N~ekCLntiktC~~~ed~ClT~I~wgs~py~s~Ga~kQ~yVSK~Ca~k~~C~~~~~~~m~~-ct~iwy 103 (150)
.++||-+..+..|- -||.- +-.|.+.+. .+...+--..++|++.+ +..-....-. |++ -
T Consensus 34 ~~~C~C~~~~~~n~-------~tC~t-~g~C~~s~~--------~~~~g~~~~~~gC~~~~--~~~~~~~~~~~~~~--s 93 (513)
T KOG2052|consen 34 LLKCYCSSCLCSNA-------NTCET-DGACFVSVE--------ENDDGKEQHHRGCMTLE--ESLPRCNPFKCAHS--S 93 (513)
T ss_pred eEEEEecCCCcCCC-------Ceeee-cCeEEEEEE--------ecCCCceEEEecccccc--cccccCCCccccCC--C
Confidence 68899887777772 46765 778988742 23333335678999974 1111111111 222 1
Q ss_pred ccccccCCCCCCCCCceee
Q psy10730 104 QDWKCSECCSGDRCNYFVT 122 (150)
Q Consensus 104 ~dw~c~eCC~gD~CNy~vi 122 (150)
.|-...+||.||.||...|
T Consensus 94 ~~~~~~~CC~~d~CN~~~i 112 (513)
T KOG2052|consen 94 PDFRNIECCYGDYCNNILI 112 (513)
T ss_pred CCceEEEecCccccccccC
Confidence 2455789999999996333
No 9
>PF06211 BAMBI: BMP and activin membrane-bound inhibitor (BAMBI) N-terminal domain; InterPro: IPR009345 This family consists of several eukaryotic BMP and activin membrane-bound inhibitor (BAMBI) proteins. Members of the transforming growth factor-beta (TGF-beta) superfamily, including TGF-beta, bone morphogenetic proteins (BMPs), activins and nodals, are vital for regulating growth and differentiation. BAMBI is related to TGF-beta-family type I receptors but lacks an intracellular kinase domain. BAMBI is co-expressed with the ventralising morphogen BMP4 during Xenopus embryogenesis and requires BMP signalling for its expression. The protein stably associates with TGF-beta-family receptors and inhibits BMP and activin as well as TGF-beta signalling [].
Probab=89.03 E-value=0.29 Score=37.79 Aligned_cols=75 Identities=29% Similarity=0.697 Sum_probs=46.7
Q ss_pred eeEEeecCCCCCCHHHHhcccccccCCCCeeEeeecCCCCCeeccCCceeeEEeccc-----CCchhchHhhhccCCCcc
Q psy10730 25 GLECYVCTNQEANTEKCLKTIKTCDQDEDACLSTINWGSPPYWSQGATKQYYVSKSC-----ASKKFCEEFKKKNMPSCT 99 (150)
Q Consensus 25 aLeCYvC~~q~~N~ekCLntiktC~~~ed~ClT~I~wgs~py~s~Ga~kQ~yVSK~C-----a~k~~C~~~~~~~m~~ct 99 (150)
..+|| | |...|..|..+|....+.|.|++- .| .++.. -.+-+| .+.+.|...+...-..
T Consensus 24 EIRCY-C-----n~P~CV~TGYMCkSel~~CfSkll---dp---~n~ns--pl~HGCld~~~~~~~~C~~~~~~~~~~-- 87 (107)
T PF06211_consen 24 EIRCY-C-----NAPHCVATGYMCKSELNACFSKLL---DP---QNTNS--PLTHGCLDSLASTADICKSEQAQNHSG-- 87 (107)
T ss_pred ceEEe-c-----CCcceecccceeccccccceeecc---Cc---ccCCC--cccccccccccCCCCcccCccccccCC--
Confidence 47888 3 447899989999999999999861 12 22211 122233 3334455444332221
Q ss_pred eeecccccccCCCCCCCCCce
Q psy10730 100 YIWYQDWKCSECCSGDRCNYF 120 (150)
Q Consensus 100 ~iwy~dw~c~eCC~gD~CNy~ 120 (150)
.+...+||.-|.|||-
T Consensus 88 -----~~~~l~CCh~DMCNyr 103 (107)
T PF06211_consen 88 -----CPSPLECCHDDMCNYR 103 (107)
T ss_pred -----CCcccchhhhhhcccC
Confidence 3446899999999973
No 10
>PF05444 DUF753: Protein of unknown function (DUF753); InterPro: IPR008472 This entry contains sequences which are repeated in several uncharacterised proteins from Drosophila melanogaster.
Probab=83.34 E-value=1.5 Score=33.48 Aligned_cols=55 Identities=31% Similarity=0.637 Sum_probs=33.1
Q ss_pred cccccCCCCeeEeeecCCCCCeeccCCceeeEEecccCCchhchHhhhccCCCcceeecccccccCCCCCCCCCc
Q psy10730 45 IKTCDQDEDACLSTINWGSPPYWSQGATKQYYVSKSCASKKFCEEFKKKNMPSCTYIWYQDWKCSECCSGDRCNY 119 (150)
Q Consensus 45 iktC~~~ed~ClT~I~wgs~py~s~Ga~kQ~yVSK~Ca~k~~C~~~~~~~m~~ct~iwy~dw~c~eCC~gD~CNy 119 (150)
.+.|++.++.|.|.+ ...| .+.++|.... +... ...|.. ..+=.|..| .++.||-
T Consensus 21 ~~~C~~~~~~C~t~~-------~~~~-----~~~RGC~~~~--~~~~---~~~C~~--~~~~~C~~C-~~~~CN~ 75 (152)
T PF05444_consen 21 TEECPQQDDQCYTRF-------DDDG-----TTIRGCLSDL--PAEE---QSSCSA--GNDANCVTC-SGDGCNN 75 (152)
T ss_pred cccCCCCCCCCEEEE-------eCCC-----ceEcCchhhc--chhh---cccccC--CCCCcccCc-CCCCCCC
Confidence 579999999999984 3333 3888888852 1000 111222 011137777 8999994
No 11
>PF05444 DUF753: Protein of unknown function (DUF753); InterPro: IPR008472 This entry contains sequences which are repeated in several uncharacterised proteins from Drosophila melanogaster.
Probab=61.78 E-value=11 Score=28.72 Aligned_cols=45 Identities=33% Similarity=0.756 Sum_probs=32.7
Q ss_pred ceeEEeecCCCCCCHHHH---h-cccccccCC--CCeeEeeecCCCCCeeccCCceeeEEecccCCc
Q psy10730 24 KGLECYVCTNQEANTEKC---L-KTIKTCDQD--EDACLSTINWGSPPYWSQGATKQYYVSKSCASK 84 (150)
Q Consensus 24 ~aLeCYvC~~q~~N~ekC---L-ntiktC~~~--ed~ClT~I~wgs~py~s~Ga~kQ~yVSK~Ca~k 84 (150)
..|.||.|+ .+ +.| . .+.+.|+.. .|.|.|.+ ...+ .|.++|.+.
T Consensus 81 ~~~~C~qC~---~~-~~C~~~~~~~~~~C~~~~~~d~Cyt~~-------~~~~-----~~~RGC~s~ 131 (152)
T PF05444_consen 81 DRLSCYQCT---DD-SDCVSSDSSTAKPCPNYSEDDQCYTRV-------DDGG-----VVERGCLSD 131 (152)
T ss_pred cCCEeeeCC---CC-CCCCCCCCcCcCcCcCCCCCCcCEEEE-------eeCC-----EEEeCCCCc
Confidence 469999999 33 578 1 345889885 69999984 2233 788999986
No 12
>PF02178 AT_hook: AT hook motif; InterPro: IPR017956 AT hooks are DNA-binding motifs with a preference for A/T rich regions. These motifs are found in a variety of proteins, including the high mobility group (HMG) proteins [], in DNA-binding proteins from plants [] and in hBRG1 protein, a central ATPase of the human switching/sucrose non-fermenting (SWI/SNF) remodeling complex []. High mobility group (HMG) proteins are a family of relatively low molecular weight non-histone components in chromatin []. HMG-I and HMG-Y (HMGA) are proteins of about 100 amino acid residues which are produced by the alternative splicing of a single gene. HMG-I/Y proteins bind preferentially to the minor groove of AT-rich regions in double-stranded DNA in a non-sequence specific manner [, ]. It is suggested that these proteins could function in nucleosome phasing and in the 3' end processing of mRNA transcripts. They are also involved in the transcription regulation of genes containing, or in close proximity to, AT-rich regions. ; GO: 0003677 DNA binding; PDB: 2EZE_A 2EZD_A 2EZF_A 2EZG_A.
Probab=51.67 E-value=6.9 Score=20.06 Aligned_cols=10 Identities=50% Similarity=1.066 Sum_probs=3.9
Q ss_pred CCCCCCCCCc
Q psy10730 4 TRPKGRPRNQ 13 (150)
Q Consensus 4 ~~~~~~~~~~ 13 (150)
+|+.|||+..
T Consensus 1 ~r~RGRP~k~ 10 (13)
T PF02178_consen 1 KRKRGRPRKN 10 (13)
T ss_dssp S--SS--TT-
T ss_pred CCcCCCCccc
Confidence 4788999864
No 13
>KOG3653|consensus
Probab=37.28 E-value=40 Score=32.31 Aligned_cols=74 Identities=27% Similarity=0.499 Sum_probs=42.8
Q ss_pred EEeecCCCCCCHHHHhcccccccCCCCeeEeeecCCCCCeeccCCceeeEEecccCC----------chhchHhhhccCC
Q psy10730 27 ECYVCTNQEANTEKCLKTIKTCDQDEDACLSTINWGSPPYWSQGATKQYYVSKSCAS----------KKFCEEFKKKNMP 96 (150)
Q Consensus 27 eCYvC~~q~~N~ekCLntiktC~~~ed~ClT~I~wgs~py~s~Ga~kQ~yVSK~Ca~----------k~~C~~~~~~~m~ 96 (150)
.|-.+..+++-.|-|-- ..|+... .|.+. |.+-..+--+|.++|.. ..+|..+.....+
T Consensus 44 ~~~~~~~~es~~e~c~~--~~c~k~~-~C~av--------W~~t~~~~~~v~qGC~~~~~D~~~c~~~~eCv~s~~~~~g 112 (534)
T KOG3653|consen 44 GCETGGLQESGIEHCYG--IECDKGR-HCFAV--------WNKTSGTIEVVKQGCWSHITDDINCEDSSECVVSAEPPPG 112 (534)
T ss_pred ccccccccccccccCcc--cccCCCC-ceEEE--------eeccCCceeEEeecCccccCCccccccccccccCCCCCCC
Confidence 33344444444444433 3677777 99994 55555544488999984 2334444433332
Q ss_pred CcceeecccccccCCCCCCCCCc
Q psy10730 97 SCTYIWYQDWKCSECCSGDRCNY 119 (150)
Q Consensus 97 ~ct~iwy~dw~c~eCC~gD~CNy 119 (150)
. |- =+=||.||+||-
T Consensus 113 ~-t~-------~~CcCs~~~CN~ 127 (534)
T KOG3653|consen 113 Q-TL-------YFCCCSTDFCNA 127 (534)
T ss_pred C-eE-------EEEecCCCcccC
Confidence 2 11 145899999994
No 14
>PF02428 Prot_inhib_II: Potato type II proteinase inhibitor family; InterPro: IPR003465 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. Members of the Pin2 family are proteinase inhibitors that belong to MEROPS inhibitor family I20, clan IA and are restricted to plants. They inhibit serine peptidases belonging to MEROPS peptidase family S1 [] (IPR001254 from INTERPRO). They have a multidomain structure [], which permits circular permutation of the sequences. It was been shown that some naturally occurring Pin2 proteins, have an `ancestral' circularly permuted structure []. Circular permutation/ rearrangements of sequences has also been observed between species, such as favin from Vicia faba and the lectin concanavalin A from Canavalia ensiformis [] or amongst members of the plant aspartyl proteinases and human lung surfactant proteins []. The Pin2 family of proteinase inhibitors are present in seeds, leaves and other organs. Perhaps the best known representatives are the wound-induced proteinase inhibitors [, ], which contain up to eight sequence-repeats (the `IP repeats'). The sequence of the IP repeats is quite variable, only the cysteines constituting the four disulphide bridges and a single proline residue are conserved throughout all the known repeat sequences. The structure of the proteinase inhibitor complex is known [].; GO: 0004867 serine-type endopeptidase inhibitor activity; PDB: 1PJU_C 1OYV_I 1QH2_B 1FYB_A 1CE3_A 2JZM_A 1YTP_A 2JYY_A 1TIH_A 4SGB_I ....
Probab=31.67 E-value=25 Score=24.08 Aligned_cols=17 Identities=47% Similarity=1.458 Sum_probs=14.3
Q ss_pred ccCCCCC-CCCCceeecc
Q psy10730 108 CSECCSG-DRCNYFVTLG 124 (150)
Q Consensus 108 c~eCC~g-D~CNy~visG 124 (150)
|..||.+ ..||||--.|
T Consensus 29 CtnCC~g~KGC~yy~~dG 46 (52)
T PF02428_consen 29 CTNCCAGYKGCNYYSADG 46 (52)
T ss_dssp SSSTTTSBTTSEEEETTT
T ss_pred CccccccccccEEECCCC
Confidence 7779999 8899997665
No 15
>smart00384 AT_hook DNA binding domain with preference for A/T rich regions. Small DNA-binding motif first described in the high mobility group non-histone chromosomal protein HMG-I(Y).
Probab=27.47 E-value=36 Score=20.56 Aligned_cols=11 Identities=45% Similarity=0.848 Sum_probs=8.2
Q ss_pred CCCCCCCCCcE
Q psy10730 4 TRPKGRPRNQY 14 (150)
Q Consensus 4 ~~~~~~~~~~~ 14 (150)
.|+.||||.+-
T Consensus 1 kRkRGRPrK~~ 11 (26)
T smart00384 1 KRKRGRPRKAP 11 (26)
T ss_pred CCCCCCCCCCC
Confidence 37889998753
No 16
>PF01684 ET: ET module; InterPro: IPR002603 The proteins in this entry have no known function, and are found in Caenorhabditis elegans and in Caenorhabditis briggsae. Each repeat contains 8-10 conserved cysteines that probably form 4-5 disulphide bridges. By inspection of the conservation of cysteines it looks like cysteines 1, 2, 3, 4, 9 and 10 are always present and that sometimes the pair 5 and 8 or the pair 6 and 7 are missing. This suggests that cysteines 5/8 and 6/7 make disulphide bridges.
Probab=22.48 E-value=45 Score=24.18 Aligned_cols=43 Identities=26% Similarity=0.524 Sum_probs=31.8
Q ss_pred eeEEecccCCchhchHhhhccCCCcceeecccccccCCCC-CCCCC
Q psy10730 74 QYYVSKSCASKKFCEEFKKKNMPSCTYIWYQDWKCSECCS-GDRCN 118 (150)
Q Consensus 74 Q~yVSK~Ca~k~~C~~~~~~~m~~ct~iwy~dw~c~eCC~-gD~CN 118 (150)
+....=.|.+...|.....+. .|..||...-.-.-||. +|.||
T Consensus 35 ~~~t~y~C~P~~vC~~L~l~n--~C~~i~~~~~vtgCCC~~~dnC~ 78 (82)
T PF01684_consen 35 HNVTLYTCDPTSVCRSLNLNN--SCNTIEGGREVTGCCCNNSDNCN 78 (82)
T ss_pred ceEEEEEechHHHHhhhcccC--cccccCCCCcEEEEECCCCcccc
Confidence 335566788888887766553 58888776555677999 89999
Done!