Query psy10936
Match_columns 410
No_of_seqs 86 out of 88
Neff 3.0
Searched_HMMs 46136
Date Fri Aug 16 22:17:54 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10936.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10936hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF05428 CRF-BP: Corticotropin 100.0 9E-111 2E-115 809.5 26.3 255 49-370 48-310 (311)
2 PF05428 CRF-BP: Corticotropin 100.0 9.8E-31 2.1E-35 254.1 11.0 102 2-107 56-184 (311)
3 cd00041 CUB CUB domain; extrac 97.7 0.00039 8.4E-09 54.4 9.3 83 79-177 24-112 (113)
4 PF00431 CUB: CUB domain CUB d 97.1 0.0041 8.8E-08 48.6 8.4 82 78-175 22-109 (110)
5 smart00042 CUB Domain first fo 96.8 0.012 2.7E-07 45.9 9.3 84 78-175 13-101 (102)
6 cd00041 CUB CUB domain; extrac 81.0 23 0.00051 27.5 9.4 31 24-54 24-58 (113)
7 PRK05463 hypothetical protein; 75.4 1.2 2.6E-05 44.4 0.8 37 341-377 146-182 (262)
8 cd02847 Chitobiase_C_term Chit 73.7 2.6 5.5E-05 34.8 2.2 63 159-237 6-68 (78)
9 PF00431 CUB: CUB domain CUB d 68.9 52 0.0011 25.5 9.3 34 23-56 22-59 (110)
10 PF02770 Acyl-CoA_dh_M: Acyl-C 56.2 4.2 9E-05 30.1 0.3 14 114-127 23-36 (52)
11 smart00042 CUB Domain first fo 38.7 92 0.002 24.1 5.4 31 23-53 13-43 (102)
12 PF14289 DUF4369: Domain of un 36.9 83 0.0018 24.9 4.9 11 12-22 46-56 (106)
13 PF07286 DUF1445: Protein of u 33.9 35 0.00075 31.7 2.6 37 340-376 35-71 (143)
14 KOG4586|consensus 29.3 2.3E+02 0.005 26.7 7.0 80 79-174 63-149 (156)
15 COG2921 Uncharacterized conser 27.1 45 0.00098 28.9 2.0 29 343-372 50-78 (90)
16 PF11202 PRTase_1: Phosphoribo 22.8 37 0.00081 34.1 0.8 13 111-123 146-158 (257)
17 PF01190 Pollen_Ole_e_I: Polle 22.0 2E+02 0.0044 23.5 4.9 47 47-93 26-80 (97)
18 PF07177 Neuralized: Neuralize 21.7 86 0.0019 25.1 2.6 38 331-370 10-47 (69)
19 PF06581 p31comet: Mad1 and Cd 21.2 37 0.00079 34.3 0.4 16 162-177 221-236 (264)
No 1
>PF05428 CRF-BP: Corticotropin-releasing factor binding protein (CRF-BP); InterPro: IPR008435 This family consists of several eukaryotic corticotropin-releasing factor binding proteins (CRF-BP or CRH-BP). Corticotropin-releasing hormone (CRH) plays multiple roles in vertebrate species. In mammals, it is the major hypothalamic releasing factor for pituitary adrenocorticotropin secretion, and is a neurotransmitter or neuromodulator at other sites in the central nervous system. In non-mammalian vertebrates, CRH not only acts as a neurotransmitter and hypophysiotropin, it also acts as a potent thyrotropin-releasing factor, allowing CRH to regulate both the adrenal and thyroid axes, especially in development. CRH-BP is thought to play an inhibitory role in which it binds CRH and other CRH-like ligands and prevents the activation of CRH receptors. There is however evidence that CRH-BP may also exhibit diverse extra and intracellular roles in a cell specific fashion and at specific times in development [].
Probab=100.00 E-value=9e-111 Score=809.47 Aligned_cols=255 Identities=38% Similarity=0.706 Sum_probs=241.3
Q ss_pred CCCCCCCeeeeeeeeeeecCcceEeeeCCCCCceEEEEEecCCCCeEEEEEEeecCCCCCCCeEEEEeeeeEccccCCCC
Q psy10936 49 VPCETGGLVSVNCMLVTSKEGHFFYKSKSVEPVACGVFLLTDPDKLVEVYFDYLDVPCETGGLVSFVDGWELNGQFFPSL 128 (410)
Q Consensus 49 IdCE~G~rralrCmD~~SeeGef~F~ssrs~Q~vCgiFFI~EPdq~IeItv~~VdiPC~~g~ll~~~DGWeLnGe~FPs~ 128 (410)
++.|..|+|+++||||+|++|+|+|+|++ ||+|||+|||+|||++|||+|+++||||++|||||||||||||||||||+
T Consensus 48 ~~~e~~yrr~lrCldm~s~~G~f~f~a~~-pq~~Ca~y~iaePd~~IeI~~~~vdidC~~G~ll~v~DGW~LnGe~FPs~ 126 (311)
T PF05428_consen 48 LSEEFIYRRPLRCLDMLSEEGQFTFTASR-PQLVCAAYFIAEPDELIEIEFDHVDIDCEGGDLLKVFDGWELNGEKFPSS 126 (311)
T ss_pred cCccccccchhhheeeeccCceEEEecCC-CCceeEEEEEeCCCeEEEEEEEEeeccCCCCCEEEEEeceEECccCCcCc
Confidence 68899999999999999999999999997 89999999999999999999999999999999999999999999999999
Q ss_pred CCCCCcchhhhhhhcCCCccceeeeeccceEEEEEeecCCCCceEEEEeecCCCCCcceecccccceeEEeecCccccce
Q psy10936 129 EDHEKPLHERFTEFCGTKRIKTVFRSSQNAALIQYRVPTRGKGFSFYVRYHKNPTPCNILLEGLSDVYTLRNYGKSVNCS 208 (410)
Q Consensus 129 ~DH~lpl~eR~~efC~~~~~~~~~rSSQN~AliqyRvp~~G~gF~~~vr~~~Np~PCNil~~~~~~~yTl~n~g~~rNCS 208 (410)
+||||||+|||+|||+++..||+|||||||||||||||.|||||+++|||++||+|||||+|+|+|+|||||||||||||
T Consensus 127 ~DHplpl~eR~~efC~~~~~~~~~rSSQNvAmi~fRip~~GsgFt~~vR~~~Np~PCNVisq~~~g~fTl~n~gq~rNCS 206 (311)
T PF05428_consen 127 QDHPLPLEERYTEFCDSGPNRRSFRSSQNVAMIQFRIPSPGSGFTLTVRFIKNPFPCNVISQSPEGRFTLRNPGQRRNCS 206 (311)
T ss_pred ccCCCchHHHHHHhccCCCCcceeEeccceEEEEEEecCCCCceEEEEEeCCCCCCceEeccCCCcceEEECCCcccCcE
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eeeeeeeEEEEecCCCCCCcceeecCCCCCCCccchhccccccceeecccCccccceeeeeecceeeEEEeecccccccc
Q psy10936 209 LTTLYPAQVKVLSGKGESQPFRERTSSPQDNIPCNILLEGLSDVYTLRNYGKSVNCSLTTLYPAQVKVLSLGVGLSNSKI 288 (410)
Q Consensus 209 fsiiyPv~i~i~~~~~~~~~~r~r~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~nCsl~~l~pa~v~~l~Lg~g~~~~~~ 288 (410)
|||||||+|+|++ ++||+.+.. ++
T Consensus 207 fs~iyP~~i~i~~-------------------------------------------------------L~lG~~~~l-~~ 230 (311)
T PF05428_consen 207 FSIIYPVVISISD-------------------------------------------------------LSLGHTNGL-EL 230 (311)
T ss_pred EEEEeeeEEEEEE-------------------------------------------------------EecccccCc-cc
Confidence 9999999999999 888887554 32
Q ss_pred cccccceeecceeeecccccccceeeecCCCCCCceeee--------cCCCceeEEeeCceEEEEeeccceeeeEEEEEe
Q psy10936 289 KRNQDVEVETGTLHKADKRKFQYRIGLGRGKGRNRYVLL--------DTNTTTYSILCGVTTVRLVSSGEFDNAVTVALT 360 (410)
Q Consensus 289 k~~~~~~~~tgt~~~~d~~~~~D~velgG~~Gld~~~m~--------d~~~~~~~i~C~~t~VRlVSSG~~~N~vt~~~~ 360 (410)
| +|+.++|+ +.+||||||||+|||++||+ ++++++|+|+||+|||||||||+|+|+|||+||
T Consensus 231 ~--------~~~~~~C~--~~~D~Vei~Gg~GlD~skM~~~~~lCg~~~~p~~~~I~Cd~TvVRLVSSG~~~N~vtf~yr 300 (311)
T PF05428_consen 231 Q--------RGVIHGCQ--GSGDFVEIGGGNGLDTSKMFPVADLCGSFSGPAQMKIGCDNTVVRLVSSGRYINRVTFSYR 300 (311)
T ss_pred c--------cccccCCC--CcCCeEEECCcCCcCcccCEEhhHhhccCCCCceEEEecCCcEEEEEecCcceeEEEEEEe
Confidence 2 34456665 66899999999999999995 447999999999999999999999999999999
Q ss_pred cCCcccchhh
Q psy10936 361 PAQVEDLDAA 370 (410)
Q Consensus 361 ~~~~~~l~~a 370 (410)
+|+++||+.|
T Consensus 301 ~l~~~dl~~~ 310 (311)
T PF05428_consen 301 PLDEQDLELA 310 (311)
T ss_pred eCChhhhhcc
Confidence 9999999976
No 2
>PF05428 CRF-BP: Corticotropin-releasing factor binding protein (CRF-BP); InterPro: IPR008435 This family consists of several eukaryotic corticotropin-releasing factor binding proteins (CRF-BP or CRH-BP). Corticotropin-releasing hormone (CRH) plays multiple roles in vertebrate species. In mammals, it is the major hypothalamic releasing factor for pituitary adrenocorticotropin secretion, and is a neurotransmitter or neuromodulator at other sites in the central nervous system. In non-mammalian vertebrates, CRH not only acts as a neurotransmitter and hypophysiotropin, it also acts as a potent thyrotropin-releasing factor, allowing CRH to regulate both the adrenal and thyroid axes, especially in development. CRH-BP is thought to play an inhibitory role in which it binds CRH and other CRH-like ligands and prevents the activation of CRH receptors. There is however evidence that CRH-BP may also exhibit diverse extra and intracellular roles in a cell specific fashion and at specific times in development [].
Probab=99.97 E-value=9.8e-31 Score=254.07 Aligned_cols=102 Identities=23% Similarity=0.521 Sum_probs=96.8
Q ss_pred CcccccceeecceeeEEeeCCCCCceeEEEEecCCCceEEEEeeeecCCCCCCCeeeeeeeeeeecCcceEeeeC-----
Q psy10936 2 ASEDCMLVTSKEGHFFYKSKSIEPVACGVFLLTDPDKLVEVYFDYLDVPCETGGLVSVNCMLVTSKEGHFFYKSK----- 76 (410)
Q Consensus 2 ~~~rCm~m~S~eG~FtF~~a~~pql~Ca~ffIaePd~~I~I~~d~vdIdCE~G~rralrCmD~~SeeGef~F~ss----- 76 (410)
++.|||||+|++|+|+|+ |++||+|||+||||||||+|||+|++|||||++|++++ .+|+|.++|| +||++
T Consensus 56 r~lrCldm~s~~G~f~f~-a~~pq~~Ca~y~iaePd~~IeI~~~~vdidC~~G~ll~--v~DGW~LnGe-~FPs~~DHpl 131 (311)
T PF05428_consen 56 RPLRCLDMLSEEGQFTFT-ASRPQLVCAAYFIAEPDELIEIEFDHVDIDCEGGDLLK--VFDGWELNGE-KFPSSQDHPL 131 (311)
T ss_pred chhhheeeeccCceEEEe-cCCCCceeEEEEEeCCCeEEEEEEEEeeccCCCCCEEE--EEeceEECcc-CCcCcccCCC
Confidence 478999999999999999 79999999999999999999999999999999999977 6999999999 99998
Q ss_pred -------------------CCCCceEEEEE-ecCCCCeEEEEEEeec--CCCC
Q psy10936 77 -------------------SVEPVACGVFL-LTDPDKLVEVYFDYLD--VPCE 107 (410)
Q Consensus 77 -------------------rs~Q~vCgiFF-I~EPdq~IeItv~~Vd--iPC~ 107 (410)
||+|||+++|| |+.||++|+|+++++. +|||
T Consensus 132 pl~eR~~efC~~~~~~~~~rSSQNvAmi~fRip~~GsgFt~~vR~~~Np~PCN 184 (311)
T PF05428_consen 132 PLEERYTEFCDSGPNRRSFRSSQNVAMIQFRIPSPGSGFTLTVRFIKNPFPCN 184 (311)
T ss_pred chHHHHHHhccCCCCcceeEeccceEEEEEEecCCCCceEEEEEeCCCCCCce
Confidence 47999999999 9999999999999999 7777
No 3
>cd00041 CUB CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
Probab=97.69 E-value=0.00039 Score=54.40 Aligned_cols=83 Identities=20% Similarity=0.386 Sum_probs=66.3
Q ss_pred CCceEEEEEecCCCCeEEEEEEeecCC----CCCCCeEEEEeeeeEccccCCCCCCCCCcchhhhhhhcCCCccceeeee
Q psy10936 79 EPVACGVFLLTDPDKLVEVYFDYLDVP----CETGGLVSFVDGWELNGQFFPSLEDHEKPLHERFTEFCGTKRIKTVFRS 154 (410)
Q Consensus 79 ~Q~vCgiFFI~EPdq~IeItv~~VdiP----C~~g~ll~~~DGWeLnGe~FPs~~DH~lpl~eR~~efC~~~~~~~~~rS 154 (410)
+...|.--+-+.+++.|.|+|..++++ |. .+.|.+.||... ...+...||+... ...+.|
T Consensus 24 ~~~~C~w~i~~~~g~~i~l~f~~~~l~~~~~C~-~d~l~i~~g~~~--------------~~~~~~~~Cg~~~-~~~~~s 87 (113)
T cd00041 24 NNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCS-YDYLEIYDGPST--------------SSPLLGRFCGSTL-PPPIIS 87 (113)
T ss_pred CCCcEEEEEEcCCCCEEEEEEeCcccccCCCCC-CcEEEEEcCCCC--------------ccccceeeECCCC-CCCEEe
Confidence 568897556788889999999999987 87 688999998643 1224566899987 577999
Q ss_pred ccceEEEEEeecC--CCCceEEEEe
Q psy10936 155 SQNAALIQYRVPT--RGKGFSFYVR 177 (410)
Q Consensus 155 SQN~AliqyRvp~--~G~gF~~~vr 177 (410)
+.|.+.|.|+-.. .+.||.++.+
T Consensus 88 ~~~~~~i~f~s~~~~~~~GF~~~y~ 112 (113)
T cd00041 88 SGNSLTVRFRSDSSVTGRGFKATYS 112 (113)
T ss_pred cCCEEEEEEEeCCCCCCCCEEEEEE
Confidence 9999999999876 4778888765
No 4
>PF00431 CUB: CUB domain CUB domain entry Spermadhesins family entry Link to schematic domain picture by Peer Bork. ; InterPro: IPR000859 The CUB domain (for complement C1r/C1s, Uegf, Bmp1) is a structural motif of approximately 110 residues found almost exclusively in extracellular and plasma membrane-associated proteins, many of which are developmentally regulated [, ]. These proteins are involved in a diverse range of functions, including complement activation, developmental patterning, tissue repair, axon guidance and angiogenesis, cell signalling, fertilisation, haemostasis, inflammation, neurotransmission, receptor-mediated endocytosis, and tumour suppression [, ]. Many CUB-containing proteins are peptidases belonging to MEROPS peptidase families M12A (astacin) and S1A (chymotrypsin). Proteins containing a CUB domain include: Mammalian complement subcomponents C1s/C1r, which form the calcium-dependent complex C1, the first component of the classical pathway of the complement system. Cricetidae sp. (Hamster) serine protease Casp, which degrades type I and IV collagen and fibronectin in the presence of calcium. Mammalian complement-activating component of Ra-reactive factor (RARF), a protease that cleaves the C4 component of complement. Vertebrate enteropeptidase (3.4.21.9 from EC), a type II membrane protein of the intestinal brush border, which activates trypsinogen. Vertebrate bone morphogenic protein 1 (BMP-1), a protein which induces cartilage and bone formation and expresses metalloendopeptidase activity. Sea urchin blastula proteins BP10 and SpAN. Caenorhabditis elegans hypothetical proteins F42A10.8 and R151.5. Neuropilin (A5 antigen), a calcium-independent cell adhesion molecule that functions during the formation of certain neuronal circuits. Fibropellins I and III from Strongylocentrotus purpuratus (Purple sea urchin). Mammalian hyaluronate-binding protein TSG-6 (or PS4), a serum and growth factor induced protein. Mammalian spermadhesins. Xenopus laevis embryonic protein UVS.2, which is expressed during dorsoanterior development. Several of the above proteins consist of a catalytic domain together with several CUB domains interspersed by calcium-binding EGF domains. Some CUB domains appear to be involved in oligomerisation and/or recognition of substrates and binding partners. For example, in the complement proteases, the CUB domains mediate dimerisation and binding to collagen-like regions of target proteins (e.g. C1q for C1r/C1s). The structure of CUB domains consists of a beta-sandwich with a jelly-roll fold. Almost all CUB domains contain four conserved cysteines that probably form two disulphide bridges (C1-C2, C3-C4). The CUB1 domains of C1s and Map19 have calcium-binding sites [].; PDB: 1SFP_A 3KQ4_B 2WNO_A 2QQK_A 2QQL_A 2QQO_B 2QQM_A 3POJ_A 3POB_A 3POG_B ....
Probab=97.08 E-value=0.0041 Score=48.58 Aligned_cols=82 Identities=20% Similarity=0.383 Sum_probs=61.0
Q ss_pred CCCceEEEEEecCCCCeEEEEEEeecCC----CCCCCeEEEEeeeeEccccCCCCCCCCCcchhhhhhhcCCCccceeee
Q psy10936 78 VEPVACGVFLLTDPDKLVEVYFDYLDVP----CETGGLVSFVDGWELNGQFFPSLEDHEKPLHERFTEFCGTKRIKTVFR 153 (410)
Q Consensus 78 s~Q~vCgiFFI~EPdq~IeItv~~VdiP----C~~g~ll~~~DGWeLnGe~FPs~~DH~lpl~eR~~efC~~~~~~~~~r 153 (410)
.++..|--=+-+.|+..|.|+|..++++ |. .+.|+|.||.... ......||++.. ...+.
T Consensus 22 ~~~~~C~w~i~~~~~~~I~l~f~~~~~~~~~~c~-~d~l~v~~g~~~~--------------~~~~~~~cg~~~-~~~i~ 85 (110)
T PF00431_consen 22 PSNSDCTWTITAPPGHRIRLTFLSFDLESSDSCC-QDYLEVYDGNDES--------------SPLLGRFCGSSP-PPSII 85 (110)
T ss_dssp SSSEEEEEEEE-STTEEEEEEEEEEEB--TTTST-SSEEEEESSSSTT--------------SEEEEEESSSSC-CEEEE
T ss_pred CCCCcEeEEEEecccceeeeccccccceeeeeec-ccceeEEeecccc--------------ceeeeeccCCcC-CccEE
Confidence 4688999666888999999999999999 65 5788999886432 233457998544 46789
Q ss_pred eccceEEEEEeecCC--CCceEEE
Q psy10936 154 SSQNAALIQYRVPTR--GKGFSFY 175 (410)
Q Consensus 154 SSQN~AliqyRvp~~--G~gF~~~ 175 (410)
|+.|.+.|.|+.... ++||.++
T Consensus 86 s~~~~l~i~f~s~~~~~~~gF~~~ 109 (110)
T PF00431_consen 86 SSSNSLFIRFHSDSSNSSRGFKAT 109 (110)
T ss_dssp ESSSEEEEEEEESSSSTTSEEEEE
T ss_pred ECCCEEEEEEEECCCCCCccEEEE
Confidence 999999999999665 4555543
No 5
>smart00042 CUB Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
Probab=96.84 E-value=0.012 Score=45.87 Aligned_cols=84 Identities=21% Similarity=0.326 Sum_probs=59.9
Q ss_pred CCCceEEEEEecCCCCeEEEEEEeecCCCCC---CCeEEEEeeeeEccccCCCCCCCCCcchhhhhhhcCCCccceeeee
Q psy10936 78 VEPVACGVFLLTDPDKLVEVYFDYLDVPCET---GGLVSFVDGWELNGQFFPSLEDHEKPLHERFTEFCGTKRIKTVFRS 154 (410)
Q Consensus 78 s~Q~vCgiFFI~EPdq~IeItv~~VdiPC~~---g~ll~~~DGWeLnGe~FPs~~DH~lpl~eR~~efC~~~~~~~~~rS 154 (410)
++...|.--+.+.+++.|.|++..++++... .+.|.|.||=. +....+..||+.....+.+.|
T Consensus 13 ~~~~~C~w~i~~~~g~~i~l~f~~~~l~~~~~C~~d~l~i~~g~~--------------~~~~~~~~~Cg~~~~~~~~~s 78 (102)
T smart00042 13 PNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPS--------------ASSPLLGRFCGSELPPPVISS 78 (102)
T ss_pred CCCCcEEEEEECCCCeEEEEEEEEEeccCCCCeeEeEEEEEeCCC--------------CCCceeEEEecCcCCCCeEEc
Confidence 4678996555778888999999999976432 46777776522 112233489999876788888
Q ss_pred ccceEEEEEeecCC--CCceEEE
Q psy10936 155 SQNAALIQYRVPTR--GKGFSFY 175 (410)
Q Consensus 155 SQN~AliqyRvp~~--G~gF~~~ 175 (410)
+.|.+.|.|+-... +.||.++
T Consensus 79 ~~n~~~i~f~s~~~~~~~GF~~~ 101 (102)
T smart00042 79 SSNSLTVTFVSDSSVQKRGFSAR 101 (102)
T ss_pred CCCEEEEEEEeCCCCCCCCeEEE
Confidence 89999999986543 3578764
No 6
>cd00041 CUB CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
Probab=80.96 E-value=23 Score=27.49 Aligned_cols=31 Identities=16% Similarity=0.393 Sum_probs=26.6
Q ss_pred CCceeEEEEecCCCceEEEEeeeecCC----CCCC
Q psy10936 24 EPVACGVFLLTDPDKLVEVYFDYLDVP----CETG 54 (410)
Q Consensus 24 pql~Ca~ffIaePd~~I~I~~d~vdId----CE~G 54 (410)
+.+.|---+-+.+.+.|.++|+.+++. |..-
T Consensus 24 ~~~~C~w~i~~~~g~~i~l~f~~~~l~~~~~C~~d 58 (113)
T cd00041 24 NNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYD 58 (113)
T ss_pred CCCcEEEEEEcCCCCEEEEEEeCcccccCCCCCCc
Confidence 458998888889999999999999998 7643
No 7
>PRK05463 hypothetical protein; Provisional
Probab=75.36 E-value=1.2 Score=44.36 Aligned_cols=37 Identities=22% Similarity=0.283 Sum_probs=32.6
Q ss_pred eEEEEeeccceeeeEEEEEecCCcccchhhhhccccC
Q psy10936 341 TTVRLVSSGEFDNAVTVALTPAQVEDLDAASLCDKRG 377 (410)
Q Consensus 341 t~VRlVSSG~~~N~vt~~~~~~~~~~l~~atkc~krg 377 (410)
|.+.+.++|.|.-..-|+.||...++++.+...+.|=
T Consensus 146 Tni~c~paG~F~g~mVVSMRP~~~~~v~~a~~iT~~~ 182 (262)
T PRK05463 146 TNIPCRPAGRFSGPMVVSMRPIPAADAIRAVQITSRF 182 (262)
T ss_pred cCCccccCCCcCCCEEEEeccCCHHHHHHHHHHhccC
Confidence 6677789999999999999999999999888888773
No 8
>cd02847 Chitobiase_C_term Chitobiase C-terminus domain. Chitobiase (AKA N-acetylglucosaminidase) digests the beta, 1-4 glycosidic bonds of the N-acetylglucosamine (NAG) oligomers found in chitin, an important structural element of fungal cell wall and arthropod exoskeletons. It is thought to proceed through an acid-base reaction mechanism, in which one protein carboxylate acts as catalytic acid, while the nucleophile is the polar acetamido group of the sugar in a substrate-assisted reaction with retention of the anomeric configuration. The C-terminus of chitobiase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chit
Probab=73.74 E-value=2.6 Score=34.80 Aligned_cols=63 Identities=27% Similarity=0.289 Sum_probs=41.6
Q ss_pred EEEEEeecCCCCceEEEEeecCCCCCcceecccccceeEEeecCccccceeeeeeeeEEEEecCCCCCCcceeecCCCC
Q psy10936 159 ALIQYRVPTRGKGFSFYVRYHKNPTPCNILLEGLSDVYTLRNYGKSVNCSLTTLYPAQVKVLSGKGESQPFRERTSSPQ 237 (410)
Q Consensus 159 AliqyRvp~~G~gF~~~vr~~~Np~PCNil~~~~~~~yTl~n~g~~rNCSfsiiyPv~i~i~~~~~~~~~~r~r~~~~~ 237 (410)
|=|.||||.+| .+.-.-... =|+..++.+-.||+-+..+ +-.|.+.+++.. ....++|..+|.
T Consensus 6 aGi~YRvP~pg-a~i~~g~l~-----~n~~~pg~~i~Yt~dgg~~------w~~Y~~p~~~~v----~~~v~vr~~s~~ 68 (78)
T cd02847 6 AGIDYRLPVPG-AKVENGKLE-----MNVSLPGLTLQYSTDGGKN------WNIYDAAAKPVV----SGDVQIRSVSFD 68 (78)
T ss_pred cCCccccCCCC-eEEEcCEEE-----EeccCCCcEEEEEecCCcc------Ceeccccccccc----cccEEEEEEcCC
Confidence 44789999999 333333333 3556788899999997755 455776554432 456778877664
No 9
>PF00431 CUB: CUB domain CUB domain entry Spermadhesins family entry Link to schematic domain picture by Peer Bork. ; InterPro: IPR000859 The CUB domain (for complement C1r/C1s, Uegf, Bmp1) is a structural motif of approximately 110 residues found almost exclusively in extracellular and plasma membrane-associated proteins, many of which are developmentally regulated [, ]. These proteins are involved in a diverse range of functions, including complement activation, developmental patterning, tissue repair, axon guidance and angiogenesis, cell signalling, fertilisation, haemostasis, inflammation, neurotransmission, receptor-mediated endocytosis, and tumour suppression [, ]. Many CUB-containing proteins are peptidases belonging to MEROPS peptidase families M12A (astacin) and S1A (chymotrypsin). Proteins containing a CUB domain include: Mammalian complement subcomponents C1s/C1r, which form the calcium-dependent complex C1, the first component of the classical pathway of the complement system. Cricetidae sp. (Hamster) serine protease Casp, which degrades type I and IV collagen and fibronectin in the presence of calcium. Mammalian complement-activating component of Ra-reactive factor (RARF), a protease that cleaves the C4 component of complement. Vertebrate enteropeptidase (3.4.21.9 from EC), a type II membrane protein of the intestinal brush border, which activates trypsinogen. Vertebrate bone morphogenic protein 1 (BMP-1), a protein which induces cartilage and bone formation and expresses metalloendopeptidase activity. Sea urchin blastula proteins BP10 and SpAN. Caenorhabditis elegans hypothetical proteins F42A10.8 and R151.5. Neuropilin (A5 antigen), a calcium-independent cell adhesion molecule that functions during the formation of certain neuronal circuits. Fibropellins I and III from Strongylocentrotus purpuratus (Purple sea urchin). Mammalian hyaluronate-binding protein TSG-6 (or PS4), a serum and growth factor induced protein. Mammalian spermadhesins. Xenopus laevis embryonic protein UVS.2, which is expressed during dorsoanterior development. Several of the above proteins consist of a catalytic domain together with several CUB domains interspersed by calcium-binding EGF domains. Some CUB domains appear to be involved in oligomerisation and/or recognition of substrates and binding partners. For example, in the complement proteases, the CUB domains mediate dimerisation and binding to collagen-like regions of target proteins (e.g. C1q for C1r/C1s). The structure of CUB domains consists of a beta-sandwich with a jelly-roll fold. Almost all CUB domains contain four conserved cysteines that probably form two disulphide bridges (C1-C2, C3-C4). The CUB1 domains of C1s and Map19 have calcium-binding sites [].; PDB: 1SFP_A 3KQ4_B 2WNO_A 2QQK_A 2QQL_A 2QQO_B 2QQM_A 3POJ_A 3POB_A 3POG_B ....
Probab=68.93 E-value=52 Score=25.49 Aligned_cols=34 Identities=18% Similarity=0.292 Sum_probs=27.1
Q ss_pred CCCceeEEEEecCCCceEEEEeeeecCC----CCCCCe
Q psy10936 23 IEPVACGVFLLTDPDKLVEVYFDYLDVP----CETGGL 56 (410)
Q Consensus 23 ~pql~Ca~ffIaePd~~I~I~~d~vdId----CE~G~r 56 (410)
+++..|---+-+.|++.|.|+|..++++ |...++
T Consensus 22 ~~~~~C~w~i~~~~~~~I~l~f~~~~~~~~~~c~~d~l 59 (110)
T PF00431_consen 22 PSNSDCTWTITAPPGHRIRLTFLSFDLESSDSCCQDYL 59 (110)
T ss_dssp SSSEEEEEEEE-STTEEEEEEEEEEEB--TTTSTSSEE
T ss_pred CCCCcEeEEEEecccceeeeccccccceeeeeecccce
Confidence 4579999999999999999999999999 665444
No 10
>PF02770 Acyl-CoA_dh_M: Acyl-CoA dehydrogenase, middle domain; InterPro: IPR006091 Acyl-CoA dehydrogenases (1.3.99.3 from EC) are a family of flavoproteins that catalyse the alpha,beta-dehydrogenation of acyl-CoA thioesters to the corresponding trans 2,3-enoyl CoA-products with the concomitant reduction of enzyme-bound FAD. Different family members share a high sequence identity, catalytic mechanisms, and structural properties, but differ in the position of their catalytic bases and in their substrate binding specificity. Butyryl-CoA dehydrogenase [] prefers short chain substrates, medium chain- and long-chain acyl-CoA dehydrogenases prefer medium and long chain substrates, respectively, and Isovaleryl-CoA dehydrogenase [] prefers branched-chain substrates. The monomeric enzyme is folded into three domains of approximately equal size, where the N-terminal domain is all-alpha, the middle domain is an open (5,8) barrel, and the C-terminal domain is a four-helical bundle. The constituent families differ in the numbers of C-terminal domains. This entry represents the middle beta-barrel domain found in medium chain acyl-CoA dehydrogenases, as well as in the related peroxisomal acyl-CoA oxidase-II enzymes. Acyl-CoA oxidase (ACO; 1.3.3.6 from EC) catalyzes the first and rate-determining step of the peroxisomal beta-oxidation of fatty acids [].; GO: 0003995 acyl-CoA dehydrogenase activity, 0055114 oxidation-reduction process; PDB: 3MDD_B 1UDY_C 3MDE_B 2UXW_A 3B96_A 1SIQ_A 1SIR_A 2R0N_A 2R0M_A 2D29_B ....
Probab=56.24 E-value=4.2 Score=30.11 Aligned_cols=14 Identities=36% Similarity=0.252 Sum_probs=11.0
Q ss_pred EEeeeeEccccCCC
Q psy10936 114 FVDGWELNGQFFPS 127 (410)
Q Consensus 114 ~~DGWeLnGe~FPs 127 (410)
.=|||+|||+|..-
T Consensus 23 ~~~~~~L~G~K~~v 36 (52)
T PF02770_consen 23 DGDGYVLNGEKRFV 36 (52)
T ss_dssp ETTEEEEEEEEEEE
T ss_pred ccceEEEeeEEEEE
Confidence 44789999998753
No 11
>smart00042 CUB Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
Probab=38.66 E-value=92 Score=24.10 Aligned_cols=31 Identities=16% Similarity=0.276 Sum_probs=26.3
Q ss_pred CCCceeEEEEecCCCceEEEEeeeecCCCCC
Q psy10936 23 IEPVACGVFLLTDPDKLVEVYFDYLDVPCET 53 (410)
Q Consensus 23 ~pql~Ca~ffIaePd~~I~I~~d~vdIdCE~ 53 (410)
++.+.|---+.+.+.+.|.|.|+.++++...
T Consensus 13 ~~~~~C~w~i~~~~g~~i~l~f~~~~l~~~~ 43 (102)
T smart00042 13 PNNLDCVWTIRAPPGYRIELQFTDFDLESSD 43 (102)
T ss_pred CCCCcEEEEEECCCCeEEEEEEEEEeccCCC
Confidence 4569998888899999999999999997543
No 12
>PF14289 DUF4369: Domain of unknown function (DUF4369)
Probab=36.94 E-value=83 Score=24.87 Aligned_cols=11 Identities=36% Similarity=0.652 Sum_probs=8.4
Q ss_pred cceeeEEeeCC
Q psy10936 12 KEGHFFYKSKS 22 (410)
Q Consensus 12 ~eG~FtF~~a~ 22 (410)
.+|.|+|+-.-
T Consensus 46 ~nG~F~f~~~~ 56 (106)
T PF14289_consen 46 KNGKFSFKGPL 56 (106)
T ss_pred eCCEEEEEEeC
Confidence 48999999543
No 13
>PF07286 DUF1445: Protein of unknown function (DUF1445); InterPro: IPR009906 This family represents a conserved region approximately 150 residues long within a number of hypothetical bacterial and eukaryotic proteins of unknown function.; PDB: 3DB9_A 2PIF_A.
Probab=33.93 E-value=35 Score=31.65 Aligned_cols=37 Identities=19% Similarity=0.376 Sum_probs=28.4
Q ss_pred ceEEEEeeccceeeeEEEEEecCCcccchhhhhcccc
Q psy10936 340 VTTVRLVSSGEFDNAVTVALTPAQVEDLDAASLCDKR 376 (410)
Q Consensus 340 ~t~VRlVSSG~~~N~vt~~~~~~~~~~l~~atkc~kr 376 (410)
.|.+++.++|.|....-|+.||...++++.+.+.+.|
T Consensus 35 ~T~i~~~~aG~F~g~~VVSMRp~~~~~~~~a~~iT~~ 71 (143)
T PF07286_consen 35 RTNIPCCPAGPFSGPMVVSMRPIPPDKVERAVQITSR 71 (143)
T ss_dssp EEEEE---BTTB--EEEEEEEEEECCCCHHHHHHCC-
T ss_pred ecccccccccccCCCeEEEEecCCHHHHHHHHHHHhh
Confidence 4888999999999999999999999999988888766
No 14
>KOG4586|consensus
Probab=29.27 E-value=2.3e+02 Score=26.66 Aligned_cols=80 Identities=25% Similarity=0.386 Sum_probs=57.1
Q ss_pred CCceEEEEEecCCCCeEEEEEEeec-----CCCCCCCeEEEEeeeeEccccCCCCCCCCCcchhhhhhhcCCCccceeee
Q psy10936 79 EPVACGVFLLTDPDKLVEVYFDYLD-----VPCETGGLVSFVDGWELNGQFFPSLEDHEKPLHERFTEFCGTKRIKTVFR 153 (410)
Q Consensus 79 ~Q~vCgiFFI~EPdq~IeItv~~Vd-----iPC~~g~ll~~~DGWeLnGe~FPs~~DH~lpl~eR~~efC~~~~~~~~~r 153 (410)
|.--|.-+.=..|.+-|++.|+.+= .-|. -|++.|=|| |.-..-=+--||+.+..+ .+|
T Consensus 63 p~r~cv~vi~~~p~~~ve~~Fde~y~IEps~EC~-fD~iEvrDG--------------pfGFSPlI~rfCG~~nPp-~Ir 126 (156)
T KOG4586|consen 63 PNRDCVRVIHSRPQHDVEVKFDEVYHIEPSYECP-FDFIEVRDG--------------PFGFSPLIARFCGDRNPP-EIR 126 (156)
T ss_pred CCcceEEeEecccccceEEeeeeeEEecccccCC-CCcccccCC--------------CcCccHHHHHHhccCCCh-hhe
Confidence 5566775557788998999998764 7788 677777766 333333345589999875 899
Q ss_pred eccceEEEEEeecCC--CCceEE
Q psy10936 154 SSQNAALIQYRVPTR--GKGFSF 174 (410)
Q Consensus 154 SSQN~AliqyRvp~~--G~gF~~ 174 (410)
|+-----|-||-..- -.||+.
T Consensus 127 s~grFlWIkF~sD~ele~~gfsa 149 (156)
T KOG4586|consen 127 SVGRFLWIKFRSDSELEYQGFSA 149 (156)
T ss_pred ecCcEEEEEEcccchhhhcccce
Confidence 999988899987633 235554
No 15
>COG2921 Uncharacterized conserved protein [Function unknown]
Probab=27.10 E-value=45 Score=28.92 Aligned_cols=29 Identities=17% Similarity=0.367 Sum_probs=24.2
Q ss_pred EEEeeccceeeeEEEEEecCCcccchhhhh
Q psy10936 343 VRLVSSGEFDNAVTVALTPAQVEDLDAASL 372 (410)
Q Consensus 343 VRlVSSG~~~N~vt~~~~~~~~~~l~~atk 372 (410)
+|-=|-|+| ++|++.++..+.|.+|++.+
T Consensus 50 ~k~SSkGnY-~svsI~i~A~~~EQ~e~ly~ 78 (90)
T COG2921 50 WKPSSKGNY-LSVSITIRATNIEQVEALYR 78 (90)
T ss_pred eccCCCCce-EEEEEEEEECCHHHHHHHHH
Confidence 466677887 78999999999999998765
No 16
>PF11202 PRTase_1: Phosphoribosyl transferase (PRTase); InterPro: IPR011215 This is a family of uncharacterised bacterial proteins.
Probab=22.76 E-value=37 Score=34.12 Aligned_cols=13 Identities=46% Similarity=0.989 Sum_probs=11.0
Q ss_pred eEEEEeeeeEccc
Q psy10936 111 LVSFVDGWELNGQ 123 (410)
Q Consensus 111 ll~~~DGWeLnGe 123 (410)
=+.|||||.=||-
T Consensus 146 ~ivFVDGWTGKGa 158 (257)
T PF11202_consen 146 SIVFVDGWTGKGA 158 (257)
T ss_pred ceEEEecCccccH
Confidence 3789999999884
No 17
>PF01190 Pollen_Ole_e_I: Pollen proteins Ole e I like; InterPro: IPR006041 Allergies are hypersensitivity reactions of the immune system to specific substances called allergens (such as pollen, stings, drugs, or food) that, in most people, result in no symptoms. A nomenclature system has been established for antigens (allergens) that cause IgE-mediated atopic allergies in humans [WHO/IUIS Allergen Nomenclature Subcommittee King T.P., Hoffmann D., Loewenstein H., Marsh D.G., Platts-Mills T.A.E., Thomas W. Bull. World Health Organ. 72:797-806(1994)]. This nomenclature system is defined by a designation that is composed of the first three letters of the genus; a space; the first letter of the species name; a space and an arabic number. In the event that two species names have identical designations, they are discriminated from one another by adding one or more letters (as necessary) to each species designation. The allergens in this family include allergens with the following designations: Ole e 1. A number of plant pollen proteins, whose biological function is not yet known, are structurally related []. These proteins are most probably secreted and consist of about 145 residues. There are six cysteines which are conserved in the sequence of these proteins. They seem to be involved in disulphide bonds.
Probab=22.01 E-value=2e+02 Score=23.46 Aligned_cols=47 Identities=28% Similarity=0.420 Sum_probs=34.4
Q ss_pred ecCCCCCCCe-eeeeeeeeeecCcceEeeeCC-------CCCceEEEEEecCCCC
Q psy10936 47 LDVPCETGGL-VSVNCMLVTSKEGHFFYKSKS-------VEPVACGVFLLTDPDK 93 (410)
Q Consensus 47 vdIdCE~G~r-ralrCmD~~SeeGef~F~ssr-------s~Q~vCgiFFI~EPdq 93 (410)
|.|.|..+.- ..+.-...++..|.|+..-.. .....|-+.++..|++
T Consensus 26 V~v~C~~~~~~~~~~~~~~Td~~G~F~i~l~~~~~~~~~~~~~~C~v~l~~sp~~ 80 (97)
T PF01190_consen 26 VSVECKDGNGGVVFSAEAKTDENGYFSIELPSDPGSSSPHLSSSCRVKLVSSPDP 80 (97)
T ss_pred EEEECCCCCCCcEEEEEEEeCCCCEEEEEecCccccccCCCCCCcEEEEeCCCcC
Confidence 4455777542 344557889999999988643 2568999999999985
No 18
>PF07177 Neuralized: Neuralized; InterPro: IPR006573 NEUZ is a domain of unknown function found in neuralized proteins, i.e. proteins involved in the specification of the neuroblast during cellular differentiation. ; PDB: 2YUE_A 2E63_A.
Probab=21.72 E-value=86 Score=25.13 Aligned_cols=38 Identities=13% Similarity=0.182 Sum_probs=22.8
Q ss_pred CceeEEeeCceEEEEeeccceeeeEEEEEecCCcccchhh
Q psy10936 331 TTTYSILCGVTTVRLVSSGEFDNAVTVALTPAQVEDLDAA 370 (410)
Q Consensus 331 ~~~~~i~C~~t~VRlVSSG~~~N~vt~~~~~~~~~~l~~a 370 (410)
+....+..+.+++|- ...|.|.+.|+-||+..+|.=.+
T Consensus 10 G~nV~L~~~~~~A~R--~~sf~~giVFS~rPl~~~E~~~v 47 (69)
T PF07177_consen 10 GKNVRLSNDGTVARR--VSSFNNGIVFSSRPLRIGEKFEV 47 (69)
T ss_dssp -TTEEE-SSS-EEEE--ST-SSS-EEEESS-B-TT-EEEE
T ss_pred CCCEEEcCCCeEEEe--cccCCceEEEecCCccCCCEEEE
Confidence 445567777787777 66799999999999987765433
No 19
>PF06581 p31comet: Mad1 and Cdc20-bound-Mad2 binding; InterPro: IPR009511 This entry represents Mad1 and Cdc20-bound-Mad2 binding proteins that are involved in the cell-cycle surveillance mechanism called the spindle checkpoint []. This mechanism monitors the proper bipolar attachment of sister chromatids to spindle microtubules and ensures the fidelity of chromosome segregation during mitosis. A key player in mitosis is Mad2, which exhibits an unusual two-state behaviour. A Mad1-Mad2 core complex recruits cytosolic Mad2 to kinetochores through Mad2 dimerisation and converts Mad2 to a conformer amenable to Cdc20 binding. p31comet inactivates the checkpoint by binding to Mad1- or Cdc20-bound Mad2 in such a way as to stop Mad2 activation and to promote the dissociation of the Mad2-Cdc20 complex [].; GO: 0007096 regulation of exit from mitosis, 0005634 nucleus; PDB: 2QYF_B.
Probab=21.23 E-value=37 Score=34.30 Aligned_cols=16 Identities=44% Similarity=0.881 Sum_probs=9.9
Q ss_pred EEeecCCCCceEEEEe
Q psy10936 162 QYRVPTRGKGFSFYVR 177 (410)
Q Consensus 162 qyRvp~~G~gF~~~vr 177 (410)
+||||+||+-|++++-
T Consensus 221 nykvPtrg~~~~I~ls 236 (264)
T PF06581_consen 221 NYKVPTRGKKLTITLS 236 (264)
T ss_dssp T----SS-EEEEEEEE
T ss_pred ccccCCCCceEEEEEe
Confidence 6999999999999985
Done!