Query psy4861
Match_columns 101
No_of_seqs 106 out of 771
Neff 7.1
Searched_HMMs 46136
Date Fri Aug 16 19:33:40 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4861.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4861hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1094|consensus 99.9 1.7E-23 3.7E-28 167.4 7.1 92 6-98 51-142 (807)
2 smart00231 FA58C Coagulation f 99.9 7.7E-22 1.7E-26 132.3 10.2 84 5-98 20-103 (139)
3 cd00057 FA58C Substituted upda 99.7 2.1E-15 4.6E-20 100.0 10.8 84 6-97 24-107 (143)
4 KOG3516|consensus 99.5 5.2E-15 1.1E-19 124.7 3.4 80 12-98 56-135 (1306)
5 PF00754 F5_F8_type_C: F5/8 ty 99.0 3.5E-09 7.5E-14 67.9 8.4 62 18-85 25-86 (129)
6 smart00607 FTP eel-Fucolectin 85.2 3.5 7.7E-05 28.2 5.6 43 30-74 48-90 (151)
7 KOG4276|consensus 84.7 2.7 5.8E-05 27.3 4.4 50 31-83 5-54 (113)
8 PF02012 BNR: BNR/Asp-box repe 59.9 11 0.00023 15.2 1.7 10 71-80 2-11 (12)
9 smart00136 LamNT Laminin N-ter 59.4 26 0.00056 25.7 4.9 57 20-83 74-130 (238)
10 PF07738 Sad1_UNC: Sad1 / UNC- 57.2 11 0.00023 24.7 2.4 53 18-75 21-74 (135)
11 PF13285 DUF4073: Domain of un 56.4 8.8 0.00019 26.5 1.9 26 68-94 108-133 (158)
12 KOG1143|consensus 53.4 20 0.00043 29.0 3.6 43 29-71 159-208 (591)
13 PF00055 Laminin_N: Laminin N- 52.0 47 0.001 24.3 5.2 44 33-83 85-128 (237)
14 cd02847 Chitobiase_C_term Chit 51.2 13 0.00029 22.7 1.9 19 67-85 33-51 (78)
15 PF03404 Mo-co_dimer: Mo-co ox 36.7 1.3E+02 0.0027 19.9 5.1 38 38-83 22-60 (131)
16 KOG2687|consensus 35.2 84 0.0018 25.2 4.5 44 5-53 279-323 (414)
17 PHA00026 cp coat protein 34.7 10 0.00022 24.8 -0.5 25 74-99 78-103 (129)
18 KOG0296|consensus 29.8 1.1E+02 0.0024 24.3 4.3 29 70-100 145-173 (399)
19 PF06588 Muskelin_N: Muskelin 28.3 94 0.002 22.5 3.5 55 20-80 34-88 (199)
20 PF07210 DUF1416: Protein of u 28.2 96 0.0021 19.4 3.1 14 38-51 2-15 (85)
21 PF07081 DUF1349: Protein of u 25.7 82 0.0018 21.9 2.8 52 32-84 75-143 (183)
22 KOG1120|consensus 23.6 1.6E+02 0.0034 20.0 3.7 38 45-91 53-90 (134)
23 KOG2055|consensus 22.5 64 0.0014 26.4 1.9 48 48-100 317-369 (514)
24 PF04648 MF_alpha: Yeast matin 21.2 49 0.0011 13.5 0.6 9 35-43 3-11 (13)
25 PF13464 DUF4115: Domain of un 20.9 1.2E+02 0.0026 17.7 2.5 17 77-93 8-24 (77)
No 1
>KOG1094|consensus
Probab=99.89 E-value=1.7e-23 Score=167.45 Aligned_cols=92 Identities=33% Similarity=0.676 Sum_probs=87.6
Q ss_pred CCcCcccccCCCCCceeeeCCCCCCCCCCeEEEecCCcEEEEEEEecCCcCCCCccceeeEEEEEEECCCCCCcEEEEcC
Q psy4861 6 EEIYEDMVRTDYNGGAWCPKNQITTEPREWIEINLHFVHVITATGVQGRFGNGQGAEFTEAYLIDYWRPKLGKWVRYKDA 85 (101)
Q Consensus 6 ~~~~~aRL~~~~~~gAW~~~~~~~~d~~qwLQVDL~~~~~It~V~TQGr~~~~~~~~wVt~y~v~ys~dg~~~W~~y~~~ 85 (101)
..|++|||+.+.++|||||+.++.....+||||||...+.||.|.||||++.|++.||.+.|+|.|+++| ..|..|++.
T Consensus 51 t~~~~~rl~se~g~GAWCPk~~v~~~~~E~LQvdl~~~hlit~V~TQGR~~~G~G~Efa~~y~I~Y~Rpg-~~Wi~wk~r 129 (807)
T KOG1094|consen 51 TGPQHARLHSEDGSGAWCPKGQVNSKSKEYLQVDLQRLHLITSVETQGRHAGGLGKEFARAYKIRYSRPG-LRWISWKDR 129 (807)
T ss_pred cccccccccccCCCcccCCCcccCccchhheEEeeeceEEEEEeeeccccCCCccceehhheeeeeccCc-chheeeccc
Confidence 4689999999999999999998877889999999999999999999999999999999999999999999 899999999
Q ss_pred CCcEEEeCcCCCC
Q psy4861 86 KGNEMYSYVQTGT 98 (101)
Q Consensus 86 ~~~~vF~GN~d~~ 98 (101)
.|.++..||.|-+
T Consensus 130 ~g~evi~gN~dt~ 142 (807)
T KOG1094|consen 130 WGQEVIPGNEDTE 142 (807)
T ss_pred cCceecCCCCCcc
Confidence 9999999999854
No 2
>smart00231 FA58C Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
Probab=99.87 E-value=7.7e-22 Score=132.30 Aligned_cols=84 Identities=23% Similarity=0.474 Sum_probs=71.1
Q ss_pred CCCcCcccccCCCCCceeeeCCCCCCCCCCeEEEecCCcEEEEEEEecCCcCCCCccceeeEEEEEEECCCCCCcEEEEc
Q psy4861 5 YEEIYEDMVRTDYNGGAWCPKNQITTEPREWIEINLHFVHVITATGVQGRFGNGQGAEFTEAYLIDYWRPKLGKWVRYKD 84 (101)
Q Consensus 5 y~~~~~aRL~~~~~~gAW~~~~~~~~d~~qwLQVDL~~~~~It~V~TQGr~~~~~~~~wVt~y~v~ys~dg~~~W~~y~~ 84 (101)
++.|++|||+.. ..++|||+. ++..|||||||++++.|++|+|||+.+. .+||+.|.+ ||.|+ ..|..|.+
T Consensus 20 ~~~~~~~rL~~~-~~~~W~~~~---~~~~~wlqvDLg~~~~v~~i~~qg~~~~---~~~~~~~~~-~s~dg-~~W~~~~~ 90 (139)
T smart00231 20 YWAAKIARLNGG-SDGAWCPAK---NSLPPWIQVDLGRTRTVTGVITGGRHGD---GDGVTYKLL-YSDDG-NNWTTYKD 90 (139)
T ss_pred CCCCCccCcCcC-CCCcccCCC---CCCCceeEeeccCcEEEEEEEecccCCC---CcEEEEEEE-EEeCC-CCEeEEeC
Confidence 578999999964 467999985 3668999999999999999999997532 479999988 99999 89999987
Q ss_pred CCCcEEEeCcCCCC
Q psy4861 85 AKGNEMYSYVQTGT 98 (101)
Q Consensus 85 ~~~~~vF~GN~d~~ 98 (101)
...++|.||+|..
T Consensus 91 -~~~~~f~gn~d~~ 103 (139)
T smart00231 91 -GNSKVFPGNSDAG 103 (139)
T ss_pred -CCeEEEeCccCCC
Confidence 4457999997654
No 3
>cd00057 FA58C Substituted updates: Jan 31, 2002
Probab=99.65 E-value=2.1e-15 Score=100.02 Aligned_cols=84 Identities=24% Similarity=0.527 Sum_probs=70.0
Q ss_pred CCcCcccccCCCCCceeeeCCCCCCCCCCeEEEecCCcEEEEEEEecCCcCCCCccceeeEEEEEEECCCCCCcEEEEcC
Q psy4861 6 EEIYEDMVRTDYNGGAWCPKNQITTEPREWIEINLHFVHVITATGVQGRFGNGQGAEFTEAYLIDYWRPKLGKWVRYKDA 85 (101)
Q Consensus 6 ~~~~~aRL~~~~~~gAW~~~~~~~~d~~qwLQVDL~~~~~It~V~TQGr~~~~~~~~wVt~y~v~ys~dg~~~W~~y~~~ 85 (101)
+.|.+++++. ...||+... +..|||||||++++.|++|.+|++.+. ...++++.|+|+||.|+ .+|..+.+.
T Consensus 24 ~~~~~~dg~~---~t~W~~~~~---~~~~wi~vDL~~~~~i~~v~i~~~~~~-~~~~~~~~~~i~~s~dg-~~W~~~~~~ 95 (143)
T cd00057 24 WEASRARLNS---DNAWTPAVN---DPPQWLQVDLGKTRRVTGIQTQGRKGG-GSSEWVTSYKVQYSLDG-ETWTTYKDK 95 (143)
T ss_pred CCcCeeecCC---CCcccCCCC---CCCceEEEECCCCEEEEEEEEccCCCC-CccCeeEEEEEEEECCC-CCEeEEEcC
Confidence 4577777764 578999852 578999999999999999999998652 23579999999999999 999999887
Q ss_pred CCcEEEeCcCCC
Q psy4861 86 KGNEMYSYVQTG 97 (101)
Q Consensus 86 ~~~~vF~GN~d~ 97 (101)
....+|.||.|+
T Consensus 96 ~~~~~~~~~~~~ 107 (143)
T cd00057 96 GEEKVFTGNSDG 107 (143)
T ss_pred CcEEEEeCCcCC
Confidence 666789999874
No 4
>KOG3516|consensus
Probab=99.52 E-value=5.2e-15 Score=124.73 Aligned_cols=80 Identities=20% Similarity=0.493 Sum_probs=71.3
Q ss_pred cccCCCCCceeeeCCCCCCCCCCeEEEecCCcEEEEEEEecCCcCCCCccceeeEEEEEEECCCCCCcEEEEcCCCcEEE
Q psy4861 12 MVRTDYNGGAWCPKNQITTEPREWIEINLHFVHVITATGVQGRFGNGQGAEFTEAYLIDYWRPKLGKWVRYKDAKGNEMY 91 (101)
Q Consensus 12 RL~~~~~~gAW~~~~~~~~d~~qwLQVDL~~~~~It~V~TQGr~~~~~~~~wVt~y~v~ys~dg~~~W~~y~~~~~~~vF 91 (101)
||+...+.+||.|.. ++.+||||+||+.++.|++||||||+++ ..|||+|.+.||+.+ ..|..|...+....|
T Consensus 56 ~~~~~~G~~gwsp~~---~~~~~wlq~dLg~~~ev~aVatqg~~~S---ddwvt~y~l~~sD~g-~~w~~y~~~~~~wtf 128 (1306)
T KOG3516|consen 56 KLNRRVGISGWSPKI---SNYNQWLQLDLGKRMEVTAVATQGRYGS---DDWVTSYILQYSDPG-RNWKTYSQEGSSWTF 128 (1306)
T ss_pred hhhhhcCcccccccc---CcccceeeeccCCceeeEEEeeccccCc---chHHHHHHHhccCCc-cceEEEEeCCCcccc
Confidence 555556789999995 5889999999999999999999999876 579999999999999 999999998778899
Q ss_pred eCcCCCC
Q psy4861 92 SYVQTGT 98 (101)
Q Consensus 92 ~GN~d~~ 98 (101)
.||.+..
T Consensus 129 ~Gn~n~~ 135 (1306)
T KOG3516|consen 129 VGNVNAD 135 (1306)
T ss_pred ccccccc
Confidence 9998764
No 5
>PF00754 F5_F8_type_C: F5/8 type C domain; InterPro: IPR000421 Blood coagulation factors V and VIII contain a C-terminal, twice repeated, domain of about 150 amino acids, which is called F5/8 type C, FA58C, or C1/C2- like domain. In the Dictyostelium discoideum (Slime mold) cell adhesion protein discoidin, a related domain, named discoidin I-like domain, DLD, or DS, has been found which shares a common C-terminal region of about 110 amino acids with the FA58C domain, but whose N-terminal 40 amino acids are much less conserved. Similar domains have been detected in other extracellular and membrane proteins [, , ] In coagulation factors V and VIII the repeated domains compose part of a larger functional domain which promotes binding to anionic phospholipids on the surface of platelets and endothelial cells []. The C-terminal domain of the second FA58C repeat (C2) of coagulation factor VIII has been shown to be responsible for phosphatidylserine-binding and essential for activity [, ]. It forms an amphipathic alpha-helix, which binds to the membrane []. FA58C contains two conserved cysteines in most proteins, which link the extremities of the domain by a disulphide bond [, , ]. A further disulphide bond is located near the C-terminal of the second FA58C domain in MFGM Q08431 from SWISSPROT []. +------------------------------------------------------------------------+ | +-+ | | | | | CxPLGxxQITASxxxxxRLxxxWxxxxWxxxxxxQGxxxxxxxxxxxxGNxxxxxxxxxxRxPxcxcLRxExGC 'C': conserved cysteine involved in a disulphide bond. 'c': cysteine involved in a disulphide bond in MFGM Q08431 from SWISSPROT. 'x': any amino acid. upper case letters: conserved residues. ; GO: 0007155 cell adhesion; PDB: 2J22_A 3CQO_C 1SDD_B 2VCA_A 2VC9_A 2VCC_A 2VCB_A 2J1E_A 2J7M_A 2J1A_A ....
Probab=99.00 E-value=3.5e-09 Score=67.92 Aligned_cols=62 Identities=21% Similarity=0.448 Sum_probs=53.2
Q ss_pred CCceeeeCCCCCCCCCCeEEEecCCcEEEEEEEecCCcCCCCccceeeEEEEEEECCCCCCcEEEEcC
Q psy4861 18 NGGAWCPKNQITTEPREWIEINLHFVHVITATGVQGRFGNGQGAEFTEAYLIDYWRPKLGKWVRYKDA 85 (101)
Q Consensus 18 ~~gAW~~~~~~~~d~~qwLQVDL~~~~~It~V~TQGr~~~~~~~~wVt~y~v~ys~dg~~~W~~y~~~ 85 (101)
...+||+.. .+..+||+|||++++.|.+|..+++.... ..++++|+|++|.|+ ..|..+.+.
T Consensus 25 ~~t~W~~~~---~~~~~~i~idl~~~~~i~~i~i~~~~~~~--~~~~~~~~i~~s~dg-~~w~~~~~~ 86 (129)
T PF00754_consen 25 PSTAWCSNW---DDSPQWIQIDLGKPYTISGISIQFRNDGN--NGRPKSFKIEYSNDG-SNWTTVASQ 86 (129)
T ss_dssp TTSSEEESS---SSSTEEEEEEEEEEEEEEEEEEEEESSTT--TEEEEEEEEEEESSS-SSEEEEEET
T ss_pred CCCEEECCC---CCCCceEEEEeeeeEecceeeeccccccc--ceeeeeeeeeeeccc-ccccccccc
Confidence 458899973 26789999999999999999999996543 249999999999999 999998763
No 6
>smart00607 FTP eel-Fucolectin Tachylectin-4 Pentaxrin-1 Domain.
Probab=85.18 E-value=3.5 Score=28.21 Aligned_cols=43 Identities=14% Similarity=0.089 Sum_probs=32.6
Q ss_pred CCCCCeEEEecCCcEEEEEEEecCCcCCCCccceeeEEEEEEECC
Q psy4861 30 TEPREWIEINLHFVHVITATGVQGRFGNGQGAEFTEAYLIDYWRP 74 (101)
Q Consensus 30 ~d~~qwLQVDL~~~~~It~V~TQGr~~~~~~~~wVt~y~v~ys~d 74 (101)
.+.++|-+|||+..+.|..|..=-|.+-- .+-..-|.|.-+.+
T Consensus 48 ~e~~PWW~VDL~~~~~V~~V~I~NR~Dcc--~~rl~~~eI~IG~s 90 (151)
T smart00607 48 KRSNPWWRVDLLQYMTIHSVTITNRGDCC--GERITGARILIGNS 90 (151)
T ss_pred CCCCCeEEEeCCCeEEeeEEEEecCCCCC--CccccceEEEECCc
Confidence 35689999999999999999998875422 34566777776644
No 7
>KOG4276|consensus
Probab=84.65 E-value=2.7 Score=27.35 Aligned_cols=50 Identities=16% Similarity=0.308 Sum_probs=35.8
Q ss_pred CCCCeEEEecCCcEEEEEEEecCCcCCCCccceeeEEEEEEECCCCCCcEEEE
Q psy4861 31 EPREWIEINLHFVHVITATGVQGRFGNGQGAEFTEAYLIDYWRPKLGKWVRYK 83 (101)
Q Consensus 31 d~~qwLQVDL~~~~~It~V~TQGr~~~~~~~~wVt~y~v~ys~dg~~~W~~y~ 83 (101)
+.+-|.-||||..+..++----=+.+- +.......+++=|.|| .+|+...
T Consensus 5 ~k~awf~iDLG~~vip~~y~lrh~rgy--gRsalRnW~fQgS~Dg-ktWt~l~ 54 (113)
T KOG4276|consen 5 DKNAWFAIDLGLEVIPTAYTLRHARGY--GRSALRNWKFQGSKDG-KTWTDLR 54 (113)
T ss_pred CcceeEEEecCceEeeeeeeeeecccc--cHHHhhheeeeccccC-CcceeEE
Confidence 567899999999887766543222221 2457777889999999 9998653
No 8
>PF02012 BNR: BNR/Asp-box repeat; InterPro: IPR002860 Members of this entry contain multiple BNR (bacterial neuraminidase repeat) repeats or Asp-boxes. The repeats are short, however the repeats are never found closer than 40 residues together suggesting that the repeat is structurally longer. These repeats are found in a variety of non-homologous proteins, including bacterial ribonucleases, sulphite oxidases, reelin, netrins, sialidases, neuraminidases, some lipoprotein receptors, and a variety of glycosyl hydrolases [].; PDB: 2JKB_A 2VW0_A 2VW2_A 2VW1_A 2CN2_D 2CN3_B 2VK7_B 2VK5_A 2VK6_A 2BF6_A ....
Probab=59.95 E-value=11 Score=15.19 Aligned_cols=10 Identities=20% Similarity=0.321 Sum_probs=6.1
Q ss_pred EECCCCCCcE
Q psy4861 71 YWRPKLGKWV 80 (101)
Q Consensus 71 ys~dg~~~W~ 80 (101)
+|.|++.+|.
T Consensus 2 ~S~D~G~TW~ 11 (12)
T PF02012_consen 2 YSTDGGKTWK 11 (12)
T ss_dssp EESSTTSS-E
T ss_pred EeCCCcccCc
Confidence 5777657786
No 9
>smart00136 LamNT Laminin N-terminal domain (domain VI). N-terminal domain of laminins and laminin-related protein such as Unc-6/ netrins.
Probab=59.43 E-value=26 Score=25.74 Aligned_cols=57 Identities=7% Similarity=0.019 Sum_probs=34.6
Q ss_pred ceeeeCCCCCCCCCCeEEEecCCcEEEEEEEecCCcCCCCccceeeEEEEEEECCCCCCcEEEE
Q psy4861 20 GAWCPKNQITTEPREWIEINLHFVHVITATGVQGRFGNGQGAEFTEAYLIDYWRPKLGKWVRYK 83 (101)
Q Consensus 20 gAW~~~~~~~~d~~qwLQVDL~~~~~It~V~TQGr~~~~~~~~wVt~y~v~ys~dg~~~W~~y~ 83 (101)
--|-+........+--|++||++..-|+-|..|=+.... ..+| ++=|+.| .+|..|+
T Consensus 74 TwWQS~~~~~~~~~VtitLdL~k~fevtyi~l~F~s~RP--a~~i----~erSd~G-~tW~p~q 130 (238)
T smart00136 74 TWWQSEPLSNGPQNVNLTLDLGKEFHVTYVILKFCSPRP--SLWI----LERSDFG-KTWQPWQ 130 (238)
T ss_pred ceecCCCcCCCCccEEEEEecCCEEEEEEEEEEecCCCC--ceEE----EeecCCC-CCCcEee
Confidence 446655321111235789999999999999988752211 2233 3345667 8888764
No 10
>PF07738 Sad1_UNC: Sad1 / UNC-like C-terminal ; InterPro: IPR012919 The Caenorhabditis elegans UNC-84 protein is a nuclear envelope protein that is involved in nuclear anchoring and migration during development. The S. pombe Sad1 protein localises at the spindle pole body. UNC-84 and Sad1 share a common C-terminal region that is often termed the SUN (Sad1 and UNC) domain [, ]. In mammals, the SUN domain is present in two proteins, Sun1 and Sun2 []. The SUN domain of Sun2 has been demonstrated to be in the periplasm []. ; PDB: 4DXT_A 3UNP_A 4DXR_A 4DXS_A.
Probab=57.24 E-value=11 Score=24.68 Aligned_cols=53 Identities=11% Similarity=0.102 Sum_probs=34.3
Q ss_pred CCceeeeCCCCCCCCCCeEEEecCCcEEEEEEEecCCcCCCC-ccceeeEEEEEEECCC
Q psy4861 18 NGGAWCPKNQITTEPREWIEINLHFVHVITATGVQGRFGNGQ-GAEFTEAYLIDYWRPK 75 (101)
Q Consensus 18 ~~gAW~~~~~~~~d~~qwLQVDL~~~~~It~V~TQGr~~~~~-~~~wVt~y~v~ys~dg 75 (101)
-+-+||.+ ....||-|.|.++..|++|..+=-..... ...=.+.|.|.-+.+.
T Consensus 21 ~g~Cw~~~-----g~~~~~~I~L~~~I~v~~v~i~h~~~~~~~~sSapk~f~V~g~~~~ 74 (135)
T PF07738_consen 21 PGPCWAFE-----GSKGWLTIELSEPIYVTSVTIEHVEKSIAPFSSAPKDFEVWGSVDY 74 (135)
T ss_dssp TT-SEEEE-----TT-EEEEEEEEEEEEEEEEEEE---GGGSG-TTSB-EEEEEEESSS
T ss_pred CCccCccC-----CCceEEEEEcCCcEEEEEEEEEEeccccCCcCCCCcEEEEEEEecc
Confidence 35679877 45689999999999999998876322100 1234678888877776
No 11
>PF13285 DUF4073: Domain of unknown function (DUF4073)
Probab=56.44 E-value=8.8 Score=26.53 Aligned_cols=26 Identities=15% Similarity=0.236 Sum_probs=17.3
Q ss_pred EEEEECCCCCCcEEEEcCCCcEEEeCc
Q psy4861 68 LIDYWRPKLGKWVRYKDAKGNEMYSYV 94 (101)
Q Consensus 68 ~v~ys~dg~~~W~~y~~~~~~~vF~GN 94 (101)
.++||.+|...|..|.. ...+-|.||
T Consensus 108 ~MEYsv~g~~~W~~y~p-~npP~~~G~ 133 (158)
T PF13285_consen 108 YMEYSVDGGNNWHTYDP-ANPPDFSGD 133 (158)
T ss_pred EEEEeecCccccEeCCc-CCCCCCCCc
Confidence 35899998448999954 333456555
No 12
>KOG1143|consensus
Probab=53.38 E-value=20 Score=29.02 Aligned_cols=43 Identities=16% Similarity=0.271 Sum_probs=31.3
Q ss_pred CCCCCCeEEEecC-------CcEEEEEEEecCCcCCCCccceeeEEEEEE
Q psy4861 29 TTEPREWIEINLH-------FVHVITATGVQGRFGNGQGAEFTEAYLIDY 71 (101)
Q Consensus 29 ~~d~~qwLQVDL~-------~~~~It~V~TQGr~~~~~~~~wVt~y~v~y 71 (101)
.+|.+|+|+|-+. .+..+.||.|||..+++++..-+..|+-.+
T Consensus 159 vPd~QqfievRvAVlGg~D~GKSTLlGVLTQgeLDnG~GrARln~FRh~H 208 (591)
T KOG1143|consen 159 VPDSQQFIEVRVAVLGGCDVGKSTLLGVLTQGELDNGNGRARLNIFRHPH 208 (591)
T ss_pred CCCcccceEEEEEEecCcccCcceeeeeeecccccCCCCeeeeehhcchh
Confidence 3688999997542 345678999999998888766666665443
No 13
>PF00055 Laminin_N: Laminin N-terminal (Domain VI); InterPro: IPR008211 Laminin is a large molecular weight glycoprotein present only in basement membranes in almost every animal tissue. Laminin is thought to mediate the attachment, migration and organisation of cells into tissues during embryonic development by interacting with other extracellular matrix components []. Each laminin is a heterotrimer assembled from alpha, beta and gamma chain subunits, secreted and incorporated into cell-associated extracellular matrices []. Basement membrane assembly is a cooperative process in which laminins polymerise through their N-terminal domain (LN or domain VI) and anchor to the cell surface through their G domains. Netrins may also associate with this network through heterotypic LN domain interactions []. This leads to cell signalling through integrins and dystroglycan (and possibly other receptors) recruited to the adherent laminin. This LN domain dependent self-assembly is considered to be crucial for the integrity of basement membranes, as highlighted by genetic forms of muscular dystrophy containing the deletion of the LN module from the alpha 2 laminin chain []. The laminin N-terminal domain is found in all laminin and netrin subunits except laminin alpha 3A, alpha 4 and gamma 2. ; PDB: 3TBD_A 3ZYG_B 3ZYI_B 3ZYJ_D 2Y38_A.
Probab=51.95 E-value=47 Score=24.31 Aligned_cols=44 Identities=18% Similarity=0.137 Sum_probs=30.4
Q ss_pred CCeEEEecCCcEEEEEEEecCCcCCCCccceeeEEEEEEECCCCCCcEEEE
Q psy4861 33 REWIEINLHFVHVITATGVQGRFGNGQGAEFTEAYLIDYWRPKLGKWVRYK 83 (101)
Q Consensus 33 ~qwLQVDL~~~~~It~V~TQGr~~~~~~~~wVt~y~v~ys~dg~~~W~~y~ 83 (101)
+--|++||++..-|+-|..|=+.... ..+ .|+=|.|. .+|..|+
T Consensus 85 ~VtitLdL~~~f~v~~v~l~F~spRP--~~m----iierS~D~-gtW~p~q 128 (237)
T PF00055_consen 85 NVTITLDLGKEFEVTYVILQFCSPRP--AAM----IIERSSDF-GTWQPWQ 128 (237)
T ss_dssp -EEEEEEEEEEEEEEEEEEEESS-----SEE----EEEEESST-TSEEEEE
T ss_pred ceEEEEcccceEEEEEEEEEEcCCCC--CeE----EEEEccCC-CceeEeH
Confidence 45699999999999999988752221 223 34448888 5999774
No 14
>cd02847 Chitobiase_C_term Chitobiase C-terminus domain. Chitobiase (AKA N-acetylglucosaminidase) digests the beta, 1-4 glycosidic bonds of the N-acetylglucosamine (NAG) oligomers found in chitin, an important structural element of fungal cell wall and arthropod exoskeletons. It is thought to proceed through an acid-base reaction mechanism, in which one protein carboxylate acts as catalytic acid, while the nucleophile is the polar acetamido group of the sugar in a substrate-assisted reaction with retention of the anomeric configuration. The C-terminus of chitobiase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chit
Probab=51.24 E-value=13 Score=22.70 Aligned_cols=19 Identities=21% Similarity=0.356 Sum_probs=15.6
Q ss_pred EEEEEECCCCCCcEEEEcC
Q psy4861 67 YLIDYWRPKLGKWVRYKDA 85 (101)
Q Consensus 67 y~v~ys~dg~~~W~~y~~~ 85 (101)
..|+|+.|+...|..|...
T Consensus 33 ~~i~Yt~dgg~~w~~Y~~p 51 (78)
T cd02847 33 LTLQYSTDGGKNWNIYDAA 51 (78)
T ss_pred cEEEEEecCCccCeecccc
Confidence 4689999964899999874
No 15
>PF03404 Mo-co_dimer: Mo-co oxidoreductase dimerisation domain; InterPro: IPR005066 The majority of molybdenum-containing enzymes utilise a molybdenum cofactor (MoCF or Moco) consisting of a Mo atom coordinated via a cis-dithiolene moiety to molybdopterin (MPT). MoCF is ubiquitous in nature, and the pathway for MoCF biosynthesis is conserved in all three domains of life. MoCF-containing enzymes function as oxidoreductases in carbon, nitrogen, and sulphur metabolism [, ]. In Escherichia coli, biosynthesis of MoCF is a three stage process. It begins with the MoaA and MoaC conversion of GTP to the meta-stable pterin intermediate precursor Z. The second stage involves MPT synthase (MoaD and MoaE), which converts precursor Z to MPT; MoeB is involved in the recycling of MPT synthase. The final step in MoCF synthesis is the attachment of mononuclear Mo to MPT, a process that requires MoeA and which is enhanced by MogA in an Mg2 ATP-dependent manner []. MoCF is the active co-factor in eukaryotic and some prokaryotic molybdo-enzymes, but the majority of bacterial enzymes requiring MoCF, need a modification of MTP for it to be active; MobA is involved in the attachment of a nucleotide monophosphate to MPT resulting in the MGD co-factor, the active co-factor for most prokaryotic molybdo-enzymes. Bacterial two-hybrid studies have revealed the close interactions between MoeA, MogA, and MobA in the synthesis of MoCF []. Moreover the close functional association of MoeA and MogA in the synthesis of MoCF is supported by fact that the known eukaryotic homologues to MoeA and MogA exist as fusion proteins: CNX1 (Q39054 from SWISSPROT) of Arabidopsis thaliana (Mouse-ear cress), mammalian Gephryin (e.g. Q9NQX3 from SWISSPROT) and Drosophila melanogaster (Fruit fly) Cinnamon (P39205 from SWISSPROT) []. This domain is found in molybdopterin cofactor oxidoreductases, such as in the C-terminal of Mo-containing sulphite oxidase, which catalyses the conversion of sulphite to sulphate, the terminal step in the oxidative degradation of cysteine and methionine []. This domain is involved in dimer formation, and has an Ig-fold structure [].; GO: 0016491 oxidoreductase activity, 0030151 molybdenum ion binding, 0055114 oxidation-reduction process; PDB: 2C9X_A 2CA3_A 2BLF_A 2CA4_A 2BPB_A 2XTS_C 2BII_A 2BIH_A 1OGP_A 2A9A_B ....
Probab=36.72 E-value=1.3e+02 Score=19.94 Aligned_cols=38 Identities=8% Similarity=-0.085 Sum_probs=25.0
Q ss_pred EecC-CcEEEEEEEecCCcCCCCccceeeEEEEEEECCCCCCcEEEE
Q psy4861 38 INLH-FVHVITATGVQGRFGNGQGAEFTEAYLIDYWRPKLGKWVRYK 83 (101)
Q Consensus 38 VDL~-~~~~It~V~TQGr~~~~~~~~wVt~y~v~ys~dg~~~W~~y~ 83 (101)
|.++ ....|.|+|.=|-. .- -=+|+.|.|++.+|...+
T Consensus 22 v~~~~~~v~i~G~A~~g~g------~~--I~rVEVS~DgG~tW~~A~ 60 (131)
T PF03404_consen 22 VKAGDGTVTIRGYAWSGGG------RG--IARVEVSTDGGKTWQEAT 60 (131)
T ss_dssp EESESEEEEEEEEEE-STT----------EEEEEEESSTTSSEEE-E
T ss_pred EccCCcEEEEEEEEEeCCC------cc--eEEEEEEeCCCCCcEEeE
Confidence 3444 78899999998842 12 235789999867898764
No 16
>KOG2687|consensus
Probab=35.18 E-value=84 Score=25.23 Aligned_cols=44 Identities=14% Similarity=0.271 Sum_probs=32.8
Q ss_pred CCCcCcccccCCC-CCceeeeCCCCCCCCCCeEEEecCCcEEEEEEEecC
Q psy4861 5 YEEIYEDMVRTDY-NGGAWCPKNQITTEPREWIEINLHFVHVITATGVQG 53 (101)
Q Consensus 5 y~~~~~aRL~~~~-~~gAW~~~~~~~~d~~qwLQVDL~~~~~It~V~TQG 53 (101)
|+.|...-|.... -+.+||-+ ....++.|+|.+...+++|-.|=
T Consensus 279 ~~~~P~vil~~~v~PG~Cw~Fk-----Gsrg~v~V~La~~Iip~avTleH 323 (414)
T KOG2687|consen 279 HHQPPRVILQPNVSPGECWAFK-----GSRGYVTVRLARFIIPSAVTLEH 323 (414)
T ss_pred ccCCCceEecCCCCccceeeec-----CCceEEEEEecceEEeeeeeeec
Confidence 4455555554333 36899988 46789999999999999987764
No 17
>PHA00026 cp coat protein
Probab=34.66 E-value=10 Score=24.76 Aligned_cols=25 Identities=16% Similarity=0.408 Sum_probs=19.2
Q ss_pred CCCCCcEEEEcCCCc-EEEeCcCCCCc
Q psy4861 74 PKLGKWVRYKDAKGN-EMYSYVQTGTL 99 (101)
Q Consensus 74 dg~~~W~~y~~~~~~-~vF~GN~d~~~ 99 (101)
.+ ..|..|.+..-. +||..|+|-++
T Consensus 78 p~-aaw~~y~n~~ltipifa~nddc~l 103 (129)
T PHA00026 78 PG-AAWKAYANIDLTIPIFAANDDCEL 103 (129)
T ss_pred cH-HHHHhhccceeeeeEEecCCcHHH
Confidence 45 689999887754 69999998653
No 18
>KOG0296|consensus
Probab=29.80 E-value=1.1e+02 Score=24.33 Aligned_cols=29 Identities=14% Similarity=0.470 Sum_probs=21.1
Q ss_pred EEECCCCCCcEEEEcCCCcEEEeCcCCCCcc
Q psy4861 70 DYWRPKLGKWVRYKDAKGNEMYSYVQTGTLW 100 (101)
Q Consensus 70 ~ys~dg~~~W~~y~~~~~~~vF~GN~d~~~~ 100 (101)
.+..++ ..|..+.. .+.-+|.|-.||..|
T Consensus 145 ~~e~~d-ieWl~WHp-~a~illAG~~DGsvW 173 (399)
T KOG0296|consen 145 DQEVED-IEWLKWHP-RAHILLAGSTDGSVW 173 (399)
T ss_pred ecccCc-eEEEEecc-cccEEEeecCCCcEE
Confidence 366666 77877644 556688888888887
No 19
>PF06588 Muskelin_N: Muskelin N-terminus; InterPro: IPR010565 This entry represents the N-terminal region of muskelin and is found in conjunction with several IPR006652 from INTERPRO repeats. Muskelin is an intracellular, kelch repeat protein that is needed in cell-spreading responses to the matrix adhesion molecule, thrombospondin-1 [].
Probab=28.31 E-value=94 Score=22.48 Aligned_cols=55 Identities=9% Similarity=0.247 Sum_probs=40.9
Q ss_pred ceeeeCCCCCCCCCCeEEEecCCcEEEEEEEecCCcCCCCccceeeEEEEEEECCCCCCcE
Q psy4861 20 GAWCPKNQITTEPREWIEINLHFVHVITATGVQGRFGNGQGAEFTEAYLIDYWRPKLGKWV 80 (101)
Q Consensus 20 gAW~~~~~~~~d~~qwLQVDL~~~~~It~V~TQGr~~~~~~~~wVt~y~v~ys~dg~~~W~ 80 (101)
.-|.+.. +...|||-+.|.++.+|+.| |=|.+...+ ..=++.|+|-=+.+. ....
T Consensus 34 SRWss~~---~~~~QyiiLkL~~paiV~sI-tFGKy~K~H-vCNlK~fkv~gG~~~-~~m~ 88 (199)
T PF06588_consen 34 SRWSSSS---NSPPQYIILKLESPAIVKSI-TFGKYEKPH-VCNLKKFKVYGGMDE-ENMI 88 (199)
T ss_pred ccccCCC---CCCCcEEEEEcCCCeEEEEE-eccccccCc-cccceeeEEeccCCH-HHHH
Confidence 4488885 47899999999999999876 457654433 345899999888776 4443
No 20
>PF07210 DUF1416: Protein of unknown function (DUF1416); InterPro: IPR010814 This family consists of several hypothetical bacterial proteins of around 100 residues in length. Members of this family appear to be Actinomycete specific. The function of this family is unknown.
Probab=28.20 E-value=96 Score=19.44 Aligned_cols=14 Identities=21% Similarity=0.214 Sum_probs=13.3
Q ss_pred EecCCcEEEEEEEe
Q psy4861 38 INLHFVHVITATGV 51 (101)
Q Consensus 38 VDL~~~~~It~V~T 51 (101)
||+.+...|+|..+
T Consensus 2 vd~~ke~VItG~V~ 15 (85)
T PF07210_consen 2 VDVEKETVITGRVT 15 (85)
T ss_pred cCccceEEEEEEEe
Confidence 79999999999999
No 21
>PF07081 DUF1349: Protein of unknown function (DUF1349); InterPro: IPR009784 This family consists of several hypothetical bacterial proteins but contains one sequence (P40893 from SWISSPROT) from Saccharomyces cerevisiae. Members of this family are typically around 200 residues in length. The function of this family is unknown.; PDB: 3MEP_B 3O12_A.
Probab=25.68 E-value=82 Score=21.86 Aligned_cols=52 Identities=12% Similarity=0.191 Sum_probs=29.7
Q ss_pred CCCeEEEec----CCcEEEEEEEecCCcCCCC-------cccee------eEEEEEEECCCCCCcEEEEc
Q psy4861 32 PREWIEINL----HFVHVITATGVQGRFGNGQ-------GAEFT------EAYLIDYWRPKLGKWVRYKD 84 (101)
Q Consensus 32 ~~qwLQVDL----~~~~~It~V~TQGr~~~~~-------~~~wV------t~y~v~ys~dg~~~W~~y~~ 84 (101)
...||...+ .....|..|+|-|..+-.. ..-|+ ..|.|.||.|| ..|...+.
T Consensus 75 ~~~WiK~giE~~~~g~~~l~sV~t~~~SDws~~~~~~~~~~~~lrv~R~g~~~~~~ys~DG-~~w~~~R~ 143 (183)
T PF07081_consen 75 EDNWIKAGIEYSNDGTPRLSSVVTNGYSDWSLSPLPSDGQSVWLRVERRGDDLWIYYSADG-KTWTLLRI 143 (183)
T ss_dssp TTEEEEEEEEE-ETTCEEEEEEEEESSEEEEEEE--SBTTSEEEEEEEETTEEEEEEESSS-S---EEEE
T ss_pred CcccEEEEEEEecCCCceEEEEeccCcccccccccCCCCCEEEEEEEEeCCEEEEEEEcCC-CEEEEEEE
Confidence 346887544 3556777888887532111 11132 35788999999 89987654
No 22
>KOG1120|consensus
Probab=23.55 E-value=1.6e+02 Score=19.96 Aligned_cols=38 Identities=21% Similarity=0.440 Sum_probs=25.4
Q ss_pred EEEEEEecCCcCCCCccceeeEEEEEEECCCCCCcEEEEcCCCcEEE
Q psy4861 45 VITATGVQGRFGNGQGAEFTEAYLIDYWRPKLGKWVRYKDAKGNEMY 91 (101)
Q Consensus 45 ~It~V~TQGr~~~~~~~~wVt~y~v~ys~dg~~~W~~y~~~~~~~vF 91 (101)
.=-||-+-|+.+. +|.++|-..- ..+-...+.+|.+||
T Consensus 53 lrigVk~rGCnGl--------sYtleY~~~k-gkfDE~VeqdGv~I~ 90 (134)
T KOG1120|consen 53 LRIGVKQRGCNGL--------SYTLEYTKTK-GKFDEVVEQDGVRIF 90 (134)
T ss_pred eEEEEecCCcCcc--------eeeeeeeccC-CCCcceeeecCcEEE
Confidence 3348888888543 7899998886 555554455565555
No 23
>KOG2055|consensus
Probab=22.50 E-value=64 Score=26.42 Aligned_cols=48 Identities=15% Similarity=0.225 Sum_probs=28.9
Q ss_pred EEEecCCcCCC-----CccceeeEEEEEEECCCCCCcEEEEcCCCcEEEeCcCCCCcc
Q psy4861 48 ATGVQGRFGNG-----QGAEFTEAYLIDYWRPKLGKWVRYKDAKGNEMYSYVQTGTLW 100 (101)
Q Consensus 48 ~V~TQGr~~~~-----~~~~wVt~y~v~ys~dg~~~W~~y~~~~~~~vF~GN~d~~~~ 100 (101)
-|+++|+.+.- ...+|+++|+|.=...+ -.|+ ..+.+|..-+.||+-|
T Consensus 317 fia~~G~~G~I~lLhakT~eli~s~KieG~v~~-~~fs----Sdsk~l~~~~~~GeV~ 369 (514)
T KOG2055|consen 317 FIAIAGNNGHIHLLHAKTKELITSFKIEGVVSD-FTFS----SDSKELLASGGTGEVY 369 (514)
T ss_pred eEEEcccCceEEeehhhhhhhhheeeeccEEee-EEEe----cCCcEEEEEcCCceEE
Confidence 46788864321 13589999998766665 5555 2345555555555544
No 24
>PF04648 MF_alpha: Yeast mating factor alpha hormone; InterPro: IPR006742 This repeated sequence,WHWLQLKPGQPMY, characterises the mating factor alpha-1 or alpha-1 mating pheromone [contains: Mating factor alpha].The hormone is excreted into the culture medium by haploid cells of the alpha mating type and acts on cells of the opposite mating type (type A) by binding to a cognate G-protein coupled receptor which is coupled to a downstream signal transduction pathway. It inhibits DNA synthesis in type A cells synchronising them with type alpha, and so mediates the conjugation process.; GO: 0000772 mating pheromone activity, 0019953 sexual reproduction, 0005576 extracellular region
Probab=21.18 E-value=49 Score=13.54 Aligned_cols=9 Identities=11% Similarity=0.482 Sum_probs=5.9
Q ss_pred eEEEecCCc
Q psy4861 35 WIEINLHFV 43 (101)
Q Consensus 35 wLQVDL~~~ 43 (101)
||+++-|++
T Consensus 3 WL~~~~GqP 11 (13)
T PF04648_consen 3 WLRLSPGQP 11 (13)
T ss_pred ceeccCCCc
Confidence 677766654
No 25
>PF13464 DUF4115: Domain of unknown function (DUF4115)
Probab=20.88 E-value=1.2e+02 Score=17.74 Aligned_cols=17 Identities=35% Similarity=0.714 Sum_probs=8.7
Q ss_pred CCcEEEEcCCCcEEEeC
Q psy4861 77 GKWVRYKDAKGNEMYSY 93 (101)
Q Consensus 77 ~~W~~y~~~~~~~vF~G 93 (101)
.+|...++.+|..+|.|
T Consensus 8 ~sWv~V~d~dG~~~~~~ 24 (77)
T PF13464_consen 8 DSWVEVTDADGKVLFSG 24 (77)
T ss_pred CeEEEEEeCCCcEeeee
Confidence 45555555455445544
Done!