Query psy6330
Match_columns 123
No_of_seqs 107 out of 1028
Neff 6.9
Searched_HMMs 46136
Date Fri Aug 16 23:40:48 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6330.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6330hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00231 FA58C Coagulation f 99.9 1.5E-22 3.2E-27 142.7 12.5 103 5-118 21-132 (139)
2 KOG1094|consensus 99.8 1.1E-19 2.3E-24 153.4 7.6 111 6-118 52-172 (807)
3 cd00057 FA58C Substituted upda 99.8 2.5E-17 5.4E-22 114.9 13.2 104 5-118 24-137 (143)
4 KOG3516|consensus 99.7 1.5E-17 3.2E-22 147.2 5.2 101 11-118 56-166 (1306)
5 PF00754 F5_F8_type_C: F5/8 ty 99.5 1.3E-13 2.8E-18 93.3 8.9 96 17-118 25-125 (129)
6 PF06588 Muskelin_N: Muskelin 95.6 0.068 1.5E-06 40.4 7.1 91 17-113 32-134 (199)
7 smart00607 FTP eel-Fucolectin 94.7 0.28 6E-06 35.4 7.9 75 30-108 49-131 (151)
8 KOG4276|consensus 94.6 0.08 1.7E-06 36.0 4.5 50 30-82 5-54 (113)
9 PF07738 Sad1_UNC: Sad1 / UNC- 92.6 1 2.2E-05 31.2 7.4 87 18-109 22-116 (135)
10 KOG2437|consensus 68.8 21 0.00045 31.3 6.6 97 7-109 24-129 (723)
11 smart00136 LamNT Laminin N-ter 66.2 19 0.0004 27.9 5.4 57 19-82 74-130 (238)
12 PF13163 DUF3999: Protein of u 60.7 19 0.00042 30.0 5.0 68 37-108 118-195 (429)
13 PF02012 BNR: BNR/Asp-box repe 59.1 12 0.00025 15.8 1.8 11 70-80 2-12 (12)
14 cd08159 APC10-like APC10-like 56.6 67 0.0015 22.6 9.6 91 17-115 24-119 (129)
15 PF00055 Laminin_N: Laminin N- 53.4 32 0.00069 26.5 4.8 58 18-82 69-128 (237)
16 KOG4350|consensus 44.8 1.2E+02 0.0027 26.0 7.2 92 8-106 451-546 (620)
17 PF03404 Mo-co_dimer: Mo-co ox 43.4 67 0.0015 22.3 4.8 38 38-83 23-61 (131)
18 cd02847 Chitobiase_C_term Chit 40.1 24 0.00053 22.6 1.9 21 64-84 31-51 (78)
19 cd02110 SO_family_Moco_dimer S 30.7 2.8E+02 0.006 22.1 8.2 37 40-85 222-258 (317)
20 KOG4350|consensus 25.3 47 0.001 28.4 1.7 50 30-84 324-373 (620)
21 KOG4432|consensus 24.8 1.4E+02 0.003 24.5 4.2 56 60-118 204-271 (405)
22 PF04648 MF_alpha: Yeast matin 24.6 37 0.0008 14.6 0.5 11 33-43 2-12 (13)
23 cd02114 bact_SorA_Moco sulfite 23.2 1.9E+02 0.0042 23.7 4.9 36 39-83 273-308 (367)
24 PF07081 DUF1349: Protein of u 20.9 1.1E+02 0.0023 22.4 2.7 53 30-83 74-143 (183)
25 PF11743 DUF3301: Protein of u 20.4 2.5E+02 0.0053 18.5 4.2 15 60-74 60-74 (97)
26 KOG3512|consensus 20.3 3E+02 0.0065 24.0 5.5 64 13-82 104-169 (592)
No 1
>smart00231 FA58C Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
Probab=99.89 E-value=1.5e-22 Score=142.74 Aligned_cols=103 Identities=27% Similarity=0.451 Sum_probs=85.6
Q ss_pred ccCCceeeccCCCCCccccCCCCCCCcCceEEEecCCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEec-
Q psy6330 5 RVKAKRRLKVEQAGGAWCPKNPVEKDIREFIQIDLGSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYYG- 83 (123)
Q Consensus 5 ~~~~~aRLn~~~~~~aW~p~~~~~~~~~qWLQVDLg~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~~- 83 (123)
+.|++|||+.. ..++|||..+ +..|||||||++++.|++|++||+... ..|++.|.+ ||.|+ .+|..|.+
T Consensus 21 ~~~~~~rL~~~-~~~~W~~~~~---~~~~wlqvDLg~~~~v~~i~~qg~~~~---~~~~~~~~~-~s~dg-~~W~~~~~~ 91 (139)
T smart00231 21 WAAKIARLNGG-SDGAWCPAKN---SLPPWIQVDLGRTRTVTGVITGGRHGD---GDGVTYKLL-YSDDG-NNWTTYKDG 91 (139)
T ss_pred CCCCccCcCcC-CCCcccCCCC---CCCceeEeeccCcEEEEEEEecccCCC---CcEEEEEEE-EEeCC-CCEeEEeCC
Confidence 56899999963 3579999887 778999999999999999999996433 479999988 99999 89999864
Q ss_pred --------CCCcEEEEEcCCCceEeeeEEeeeeeeCCCCcccc
Q psy6330 84 --------GNGKAEYTEEYILEYWRPGLRDWKQYYGGNGKAVD 118 (123)
Q Consensus 84 --------~~~~~~~~~~l~~~~~~~~~R~~p~~~~~~~~~~~ 118 (123)
.+.+.+..+.|.+|+.||||||+|..| .+.+||
T Consensus 92 ~~~~f~gn~d~~~~~~~~~~~~v~ARyvRi~p~~~--~~~~~~ 132 (139)
T smart00231 92 NSKVFPGNSDAGTVVVNEFPPPIVARYIRILPTGW--NGNIIL 132 (139)
T ss_pred CeEEEeCccCCCceEEEecCCCEEEEEEEEEEeEc--CCCcEE
Confidence 124456678899999999999999975 455776
No 2
>KOG1094|consensus
Probab=99.80 E-value=1.1e-19 Score=153.38 Aligned_cols=111 Identities=29% Similarity=0.482 Sum_probs=97.8
Q ss_pred cCCceeeccCCCCCccccCCCCCCCcCceEEEecCCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEecCC
Q psy6330 6 VKAKRRLKVEQAGGAWCPKNPVEKDIREFIQIDLGSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYYGGN 85 (123)
Q Consensus 6 ~~~~aRLn~~~~~~aW~p~~~~~~~~~qWLQVDLg~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~~~~ 85 (123)
-|++|||+++.+++||||+........+||||||.....||.|-||||...+.+.+|.+.|+|.|+.+| ..|-.|++.-
T Consensus 52 ~~~~~rl~se~g~GAWCPk~~v~~~~~E~LQvdl~~~hlit~V~TQGR~~~G~G~Efa~~y~I~Y~Rpg-~~Wi~wk~r~ 130 (807)
T KOG1094|consen 52 GPQHARLHSEDGSGAWCPKGQVNSKSKEYLQVDLQRLHLITSVETQGRHAGGLGKEFARAYKIRYSRPG-LRWISWKDRW 130 (807)
T ss_pred ccccccccccCCCcccCCCcccCccchhheEEeeeceEEEEEeeeccccCCCccceehhheeeeeccCc-chheeecccc
Confidence 468999999989999999998877889999999999999999999999988888999999999999999 8999999977
Q ss_pred CcEEE----------EEcCCCceEeeeEEeeeeeeCCCCcccc
Q psy6330 86 GKAEY----------TEEYILEYWRPGLRDWKQYYGGNGKAVD 118 (123)
Q Consensus 86 ~~~~~----------~~~l~~~~~~~~~R~~p~~~~~~~~~~~ 118 (123)
|..++ .+.|.||++||++||+|.+- ..=.+||
T Consensus 131 g~evi~gN~dt~~~v~r~L~ppivAr~vRf~P~s~-~~rtVCm 172 (807)
T KOG1094|consen 131 GQEVIPGNEDTEGVVLRDLDPPIVARRVRFVPVSD-RTRTVCM 172 (807)
T ss_pred CceecCCCCCcchhhhhhcCchhhheeEEEEeccC-CcceEEE
Confidence 76554 57799999999999999872 2334454
No 3
>cd00057 FA58C Substituted updates: Jan 31, 2002
Probab=99.76 E-value=2.5e-17 Score=114.89 Aligned_cols=104 Identities=28% Similarity=0.484 Sum_probs=85.2
Q ss_pred ccCCceeeccCCCCCccccCCCCCCCcCceEEEecCCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEecC
Q psy6330 5 RVKAKRRLKVEQAGGAWCPKNPVEKDIREFIQIDLGSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYYGG 84 (123)
Q Consensus 5 ~~~~~aRLn~~~~~~aW~p~~~~~~~~~qWLQVDLg~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~~~ 84 (123)
..|.+|+++. ...||+... +..|||||||++++.|++|.+|++.+.. ...+++.|+|++|.|| .+|+.+.+.
T Consensus 24 ~~~~~~dg~~---~t~W~~~~~---~~~~wi~vDL~~~~~i~~v~i~~~~~~~-~~~~~~~~~i~~s~dg-~~W~~~~~~ 95 (143)
T cd00057 24 WEASRARLNS---DNAWTPAVN---DPPQWLQVDLGKTRRVTGIQTQGRKGGG-SSEWVTSYKVQYSLDG-ETWTTYKDK 95 (143)
T ss_pred CCcCeeecCC---CCcccCCCC---CCCceEEEECCCCEEEEEEEEccCCCCC-ccCeeEEEEEEEECCC-CCEeEEEcC
Confidence 4567777775 589999887 7789999999999999999999987521 2589999999999999 999998862
Q ss_pred C----------CcEEEEEcCCCceEeeeEEeeeeeeCCCCcccc
Q psy6330 85 N----------GKAEYTEEYILEYWRPGLRDWKQYYGGNGKAVD 118 (123)
Q Consensus 85 ~----------~~~~~~~~l~~~~~~~~~R~~p~~~~~~~~~~~ 118 (123)
. +..+....|.+|+.||||||.|..+.. .+|+
T Consensus 96 ~~~~~~~~~~~~~~~~~~~l~~pv~aRyvRl~~~~~~~--~~~l 137 (143)
T cd00057 96 GEEKVFTGNSDGSTPVTNDFPPPIVARYIRILPTTWNG--NISL 137 (143)
T ss_pred CcEEEEeCCcCCCeEEEEeCCCCEEEEEEEEEEeecCC--ccEE
Confidence 2 135677889999999999999998544 5554
No 4
>KOG3516|consensus
Probab=99.69 E-value=1.5e-17 Score=147.23 Aligned_cols=101 Identities=29% Similarity=0.476 Sum_probs=90.7
Q ss_pred eeccCCCCCccccCCCCCCCcCceEEEecCCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEecCC-----
Q psy6330 11 RLKVEQAGGAWCPKNPVEKDIREFIQIDLGSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYYGGN----- 85 (123)
Q Consensus 11 RLn~~~~~~aW~p~~~~~~~~~qWLQVDLg~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~~~~----- 85 (123)
||+...+.++|.|..+ +..||||+||++++.|++|+|||+.++ +.||++|.++|++.| .+|++|...+
T Consensus 56 ~~~~~~G~~gwsp~~~---~~~~wlq~dLg~~~ev~aVatqg~~~S---ddwvt~y~l~~sD~g-~~w~~y~~~~~~wtf 128 (1306)
T KOG3516|consen 56 KLNRRVGISGWSPKIS---NYNQWLQLDLGKRMEVTAVATQGRYGS---DDWVTSYILQYSDPG-RNWKTYSQEGSSWTF 128 (1306)
T ss_pred hhhhhcCccccccccC---cccceeeeccCCceeeEEEeeccccCc---chHHHHHHHhccCCc-cceEEEEeCCCcccc
Confidence 4555457899999999 999999999999999999999999887 489999999999999 9999998753
Q ss_pred -----CcEEEEEcCCCceEeeeEEeeeeeeCCCCcccc
Q psy6330 86 -----GKAEYTEEYILEYWRPGLRDWKQYYGGNGKAVD 118 (123)
Q Consensus 86 -----~~~~~~~~l~~~~~~~~~R~~p~~~~~~~~~~~ 118 (123)
.+.++.++|.||+.||++||.|..|.+.|+|=|
T Consensus 129 ~Gn~n~~sVv~~~l~~~~~ar~vr~~pl~wnp~grig~ 166 (1306)
T KOG3516|consen 129 VGNVNADSVVYHELEPPIEARFVRILPLDWNPKGRIGM 166 (1306)
T ss_pred ccccccceEEeccccCcccceEEeeeeeeeCCCCccee
Confidence 346788999999999999999999999999865
No 5
>PF00754 F5_F8_type_C: F5/8 type C domain; InterPro: IPR000421 Blood coagulation factors V and VIII contain a C-terminal, twice repeated, domain of about 150 amino acids, which is called F5/8 type C, FA58C, or C1/C2- like domain. In the Dictyostelium discoideum (Slime mold) cell adhesion protein discoidin, a related domain, named discoidin I-like domain, DLD, or DS, has been found which shares a common C-terminal region of about 110 amino acids with the FA58C domain, but whose N-terminal 40 amino acids are much less conserved. Similar domains have been detected in other extracellular and membrane proteins [, , ] In coagulation factors V and VIII the repeated domains compose part of a larger functional domain which promotes binding to anionic phospholipids on the surface of platelets and endothelial cells []. The C-terminal domain of the second FA58C repeat (C2) of coagulation factor VIII has been shown to be responsible for phosphatidylserine-binding and essential for activity [, ]. It forms an amphipathic alpha-helix, which binds to the membrane []. FA58C contains two conserved cysteines in most proteins, which link the extremities of the domain by a disulphide bond [, , ]. A further disulphide bond is located near the C-terminal of the second FA58C domain in MFGM Q08431 from SWISSPROT []. +------------------------------------------------------------------------+ | +-+ | | | | | CxPLGxxQITASxxxxxRLxxxWxxxxWxxxxxxQGxxxxxxxxxxxxGNxxxxxxxxxxRxPxcxcLRxExGC 'C': conserved cysteine involved in a disulphide bond. 'c': cysteine involved in a disulphide bond in MFGM Q08431 from SWISSPROT. 'x': any amino acid. upper case letters: conserved residues. ; GO: 0007155 cell adhesion; PDB: 2J22_A 3CQO_C 1SDD_B 2VCA_A 2VC9_A 2VCC_A 2VCB_A 2J1E_A 2J7M_A 2J1A_A ....
Probab=99.51 E-value=1.3e-13 Score=93.27 Aligned_cols=96 Identities=26% Similarity=0.416 Sum_probs=79.7
Q ss_pred CCCccccCCCCCCCcCceEEEecCCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEecC-----CCcEEEE
Q psy6330 17 AGGAWCPKNPVEKDIREFIQIDLGSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYYGG-----NGKAEYT 91 (123)
Q Consensus 17 ~~~aW~p~~~~~~~~~qWLQVDLg~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~~~-----~~~~~~~ 91 (123)
...+||+... ...+||+|||++++.|.+|.++++.+.. ..++++|+|++|.|| .+|+.+.+. .......
T Consensus 25 ~~t~W~~~~~---~~~~~i~idl~~~~~i~~i~i~~~~~~~--~~~~~~~~i~~s~dg-~~w~~~~~~~~~~~~~~~~~~ 98 (129)
T PF00754_consen 25 PSTAWCSNWD---DSPQWIQIDLGKPYTISGISIQFRNDGN--NGRPKSFKIEYSNDG-SNWTTVASQFYGNTNSGSVVT 98 (129)
T ss_dssp TTSSEEESSS---SSTEEEEEEEEEEEEEEEEEEEEESSTT--TEEEEEEEEEEESSS-SSEEEEEETEEEESSSSSSEE
T ss_pred CCCEEECCCC---CCCceEEEEeeeeEecceeeeccccccc--ceeeeeeeeeeeccc-ccccccccceeeccCCCceEE
Confidence 4689999865 7799999999999999999999987653 149999999999999 999999863 3445566
Q ss_pred EcCCCceEeeeEEeeeeeeCCCCcccc
Q psy6330 92 EEYILEYWRPGLRDWKQYYGGNGKAVD 118 (123)
Q Consensus 92 ~~l~~~~~~~~~R~~p~~~~~~~~~~~ 118 (123)
..|.+|+.+|||||.+..+...+..++
T Consensus 99 ~~~~~~~~~ryiri~~~~~~~~~~~~~ 125 (129)
T PF00754_consen 99 ISFFPPVKARYIRITVTSWNGNNGWVS 125 (129)
T ss_dssp EEEEEEEEEEEEEEEEEEEESCSSEEE
T ss_pred EEeCCCeEEEEEEEEEEEECCCCceEE
Confidence 688999999999999987656665543
No 6
>PF06588 Muskelin_N: Muskelin N-terminus; InterPro: IPR010565 This entry represents the N-terminal region of muskelin and is found in conjunction with several IPR006652 from INTERPRO repeats. Muskelin is an intracellular, kelch repeat protein that is needed in cell-spreading responses to the matrix adhesion molecule, thrombospondin-1 [].
Probab=95.61 E-value=0.068 Score=40.38 Aligned_cols=91 Identities=11% Similarity=0.176 Sum_probs=64.2
Q ss_pred CCCccccCCCCCCCcCceEEEecCCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEec-----CCCcEEE-
Q psy6330 17 AGGAWCPKNPVEKDIREFIQIDLGSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYYG-----GNGKAEY- 90 (123)
Q Consensus 17 ~~~aW~p~~~~~~~~~qWLQVDLg~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~~-----~~~~~~~- 90 (123)
+.+-|.+..+ ...|||-+.|..+..|+.|.. |-... ++.-=++.|+|..+++. .+...+.. ....+.|
T Consensus 32 qsSRWss~~~---~~~QyiiLkL~~paiV~sItF-GKy~K-~HvCNlK~fkv~gG~~~-~~m~ell~~gLkND~~~Etf~ 105 (199)
T PF06588_consen 32 QSSRWSSSSN---SPPQYIILKLESPAIVKSITF-GKYEK-PHVCNLKKFKVYGGMDE-ENMIELLHGGLKNDSNPETFN 105 (199)
T ss_pred ccccccCCCC---CCCcEEEEEcCCCeEEEEEec-ccccc-CccccceeeEEeccCCH-HHHHHHHhhhccCCCCCceEE
Confidence 4677999998 899999999999999999876 32221 11234689999999998 66665532 2222333
Q ss_pred -EEcC----CCceEeeeEEeee-eeeCCC
Q psy6330 91 -TEEY----ILEYWRPGLRDWK-QYYGGN 113 (123)
Q Consensus 91 -~~~l----~~~~~~~~~R~~p-~~~~~~ 113 (123)
.+.. ..+|-.+||.|.| ++|+.+
T Consensus 106 l~~~~~~~~~~~fP~rYIKIvPL~swGps 134 (199)
T PF06588_consen 106 LKHKTNNGIENYFPCRYIKIVPLQSWGPS 134 (199)
T ss_pred eEEecCCcccceeeeeeeEEechhhcCCC
Confidence 2333 2488999999999 777763
No 7
>smart00607 FTP eel-Fucolectin Tachylectin-4 Pentaxrin-1 Domain.
Probab=94.70 E-value=0.28 Score=35.42 Aligned_cols=75 Identities=11% Similarity=-0.094 Sum_probs=48.3
Q ss_pred CcCceEEEecCCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCc---EE-----EecCCCcEEEEEcCCCceEee
Q psy6330 30 DIREFIQIDLGSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDW---KQ-----YYGGNGKAEYTEEYILEYWRP 101 (123)
Q Consensus 30 ~~~qWLQVDLg~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W---~~-----y~~~~~~~~~~~~l~~~~~~~ 101 (123)
+..+|.+|||+..+.|..|.+--|.+-- .+-...|.|--+++. ... .. ....++ ..+.-.-.+|+..|
T Consensus 49 e~~PWW~VDL~~~~~V~~V~I~NR~Dcc--~~rl~~~eI~IG~s~-~~~~~~n~~c~~~~~~~~~-~~~~~~c~~~~~GR 124 (151)
T smart00607 49 RSNPWWRVDLLQYMTIHSVTITNRGDCC--GERITGARILIGNSL-ENGGINNPNCSTGGLMAGG-ETKTFCCPPPMIGR 124 (151)
T ss_pred CCCCeEEEeCCCeEEeeEEEEecCCCCC--CccccceEEEECCcc-cccCCcCcccCCCceeCCC-ceEEEeCCCCCeeE
Confidence 5689999999999999999997765432 245667777776543 111 10 011112 23333445679999
Q ss_pred eEEeeee
Q psy6330 102 GLRDWKQ 108 (123)
Q Consensus 102 ~~R~~p~ 108 (123)
||+|.-.
T Consensus 125 YV~i~Lp 131 (151)
T smart00607 125 YVTVYLP 131 (151)
T ss_pred EEEEEec
Confidence 9999754
No 8
>KOG4276|consensus
Probab=94.63 E-value=0.08 Score=36.01 Aligned_cols=50 Identities=18% Similarity=0.326 Sum_probs=39.8
Q ss_pred CcCceEEEecCCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEe
Q psy6330 30 DIREFIQIDLGSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYY 82 (123)
Q Consensus 30 ~~~qWLQVDLg~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~ 82 (123)
+.+-|.-||||.....++-...-+.+.+ ....+..+++.|.|| ++|+.+.
T Consensus 5 ~k~awf~iDLG~~vip~~y~lrh~rgyg--RsalRnW~fQgS~Dg-ktWt~l~ 54 (113)
T KOG4276|consen 5 DKNAWFAIDLGLEVIPTAYTLRHARGYG--RSALRNWKFQGSKDG-KTWTDLR 54 (113)
T ss_pred CcceeEEEecCceEeeeeeeeeeccccc--HHHhhheeeeccccC-CcceeEE
Confidence 6789999999988877776665443332 567888899999999 9999985
No 9
>PF07738 Sad1_UNC: Sad1 / UNC-like C-terminal ; InterPro: IPR012919 The Caenorhabditis elegans UNC-84 protein is a nuclear envelope protein that is involved in nuclear anchoring and migration during development. The S. pombe Sad1 protein localises at the spindle pole body. UNC-84 and Sad1 share a common C-terminal region that is often termed the SUN (Sad1 and UNC) domain [, ]. In mammals, the SUN domain is present in two proteins, Sun1 and Sun2 []. The SUN domain of Sun2 has been demonstrated to be in the periplasm []. ; PDB: 4DXT_A 3UNP_A 4DXR_A 4DXS_A.
Probab=92.56 E-value=1 Score=31.24 Aligned_cols=87 Identities=13% Similarity=0.045 Sum_probs=49.9
Q ss_pred CCccccCCCCCCCcCceEEEecCCcEEEEEEEecCcccCCC-CcceEeEEEEEEEcCCccC-cEEEe----cCCCcEEEE
Q psy6330 18 GGAWCPKNPVEKDIREFIQIDLGSPHVVTGVQTQGRYDHGR-GQEYTEEYILEYWRPGLRD-WKQYY----GGNGKAEYT 91 (123)
Q Consensus 18 ~~aW~p~~~~~~~~~qWLQVDLg~~~~ItgI~tQG~~~~~~-~~~~V~sy~i~ys~dg~~~-W~~y~----~~~~~~~~~ 91 (123)
+.+|+.... ..||-|.|.++..|++|.++--....+ .+.=.+.|.|..+.+.... |..+. +.+++.+=+
T Consensus 22 g~Cw~~~g~-----~~~~~I~L~~~I~v~~v~i~h~~~~~~~~sSapk~f~V~g~~~~~~~~~~~LG~f~y~~~~~~~Qt 96 (135)
T PF07738_consen 22 GPCWAFEGS-----KGWLTIELSEPIYVTSVTIEHVEKSIAPFSSAPKDFEVWGSVDYPDEEWVLLGEFEYDSDGNSIQT 96 (135)
T ss_dssp T-SEEEETT------EEEEEEEEEEEEEEEEEEE---GGGSG-TTSB-EEEEEEESSSSTS--EEEEEEE--TTS-SEEE
T ss_pred CccCccCCC-----ceEEEEEcCCcEEEEEEEEEEeccccCCcCCCCcEEEEEEEecccccceeeeceEEEcCCcCccEe
Confidence 678977554 689999999999999999984332000 1345789999999998223 76553 344554434
Q ss_pred EcCCCce--EeeeEEeeeee
Q psy6330 92 EEYILEY--WRPGLRDWKQY 109 (123)
Q Consensus 92 ~~l~~~~--~~~~~R~~p~~ 109 (123)
..|..+. ..++|+|.-.+
T Consensus 97 F~l~~~~~~~~~~i~l~i~s 116 (135)
T PF07738_consen 97 FPLPRPPRQPVRYIKLRILS 116 (135)
T ss_dssp EE-SS--S--EEEEEEEE--
T ss_pred eeeccCchhcCcEEEEEEEc
Confidence 4555554 77888877665
No 10
>KOG2437|consensus
Probab=68.80 E-value=21 Score=31.27 Aligned_cols=97 Identities=9% Similarity=0.149 Sum_probs=62.9
Q ss_pred CCceeeccCC-CCCccccCCCCCCCcCceEEEecCCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEe---
Q psy6330 7 KAKRRLKVEQ-AGGAWCPKNPVEKDIREFIQIDLGSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYY--- 82 (123)
Q Consensus 7 ~~~aRLn~~~-~~~aW~p~~~~~~~~~qWLQVDLg~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~--- 82 (123)
|.+-..+++- ..+-|+..++ +..||+-..|.++..|+-|.. |....- +--=++.|+|..+.++ .....+.
T Consensus 24 P~NIl~d~P~d~NSRWS~~~N---~~~QYiiLKl~~~aLV~~ITF-GKy~k~-HvcNlKkf~v~ggR~~-~~m~elL~~G 97 (723)
T KOG2437|consen 24 PENILVDKPNDQNSRWSSESN---YPPQYLILKLERPALVQNITF-GKYEKT-HVCNLKKFKVFGGRNE-ENMTELLSSG 97 (723)
T ss_pred ccceeeCCCCCcccccCCCCC---CCceeEEEEcCCceeEEEEec-cccccc-cccchhheEEecccch-hhhHHHHhhh
Confidence 3334444442 3567999888 889999999999999988765 433321 1124689999999998 5554442
Q ss_pred --cCCCcEEEE--EcC-CCceEeeeEEeeeee
Q psy6330 83 --GGNGKAEYT--EEY-ILEYWRPGLRDWKQY 109 (123)
Q Consensus 83 --~~~~~~~~~--~~l-~~~~~~~~~R~~p~~ 109 (123)
+....+.|. +.- .-.|.++|+.|.|..
T Consensus 98 LkNDSn~ETF~L~~K~~e~~f~c~YiKIvPL~ 129 (723)
T KOG2437|consen 98 LKNDSNKETFTLKHKIDEQMFPCRYIKIVPLL 129 (723)
T ss_pred cCCccccceeeeEEeeccceeeeeeEEEEEHh
Confidence 222223332 111 356789999999964
No 11
>smart00136 LamNT Laminin N-terminal domain (domain VI). N-terminal domain of laminins and laminin-related protein such as Unc-6/ netrins.
Probab=66.15 E-value=19 Score=27.88 Aligned_cols=57 Identities=21% Similarity=0.230 Sum_probs=36.6
Q ss_pred CccccCCCCCCCcCceEEEecCCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEe
Q psy6330 19 GAWCPKNPVEKDIREFIQIDLGSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYY 82 (123)
Q Consensus 19 ~aW~p~~~~~~~~~qWLQVDLg~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~ 82 (123)
.-|.......-...--|++||++...|+-|.++=....- ..+ .++=|++| .+|..|.
T Consensus 74 TwWQS~~~~~~~~~VtitLdL~k~fevtyi~l~F~s~RP--a~~----i~erSd~G-~tW~p~q 130 (238)
T smart00136 74 TWWQSEPLSNGPQNVNLTLDLGKEFHVTYVILKFCSPRP--SLW----ILERSDFG-KTWQPWQ 130 (238)
T ss_pred ceecCCCcCCCCccEEEEEecCCEEEEEEEEEEecCCCC--ceE----EEeecCCC-CCCcEee
Confidence 567666532111236788999999999999998543221 122 23446777 9999885
No 12
>PF13163 DUF3999: Protein of unknown function (DUF3999)
Probab=60.68 E-value=19 Score=30.00 Aligned_cols=68 Identities=12% Similarity=0.074 Sum_probs=44.8
Q ss_pred EecCCc-EEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEec--------CCCcEEEEEcCC-CceEeeeEEee
Q psy6330 37 IDLGSP-HVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYYG--------GNGKAEYTEEYI-LEYWRPGLRDW 106 (123)
Q Consensus 37 VDLg~~-~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~~--------~~~~~~~~~~l~-~~~~~~~~R~~ 106 (123)
+|+... -.++.+..+=... + ..|+..|.|+.|.|- .+|..+.. .+++....+++. +...+||||+.
T Consensus 118 ld~s~~~~~l~~L~L~w~~~-~--~~~~~~v~VeaSdDl-~~W~~l~~~~~l~~L~~~~~~l~~~~I~L~~~~~rYLRl~ 193 (429)
T PF13163_consen 118 LDLSALKEPLDALRLDWPQS-N--FNWQARVSVEASDDL-QHWRPLAGDAQLMDLSNGGQRLVQDRIELPGSNARYLRLT 193 (429)
T ss_pred EECcccccchhheEEEeecC-C--CCceEEEEEEEecCc-ccceEccCCceEEEeccCCcceeeeeEccCCCCCceEEEE
Confidence 566544 3555555543211 1 479999999999999 99999932 234445554543 33599999998
Q ss_pred ee
Q psy6330 107 KQ 108 (123)
Q Consensus 107 p~ 108 (123)
..
T Consensus 194 ~~ 195 (429)
T PF13163_consen 194 WN 195 (429)
T ss_pred eC
Confidence 73
No 13
>PF02012 BNR: BNR/Asp-box repeat; InterPro: IPR002860 Members of this entry contain multiple BNR (bacterial neuraminidase repeat) repeats or Asp-boxes. The repeats are short, however the repeats are never found closer than 40 residues together suggesting that the repeat is structurally longer. These repeats are found in a variety of non-homologous proteins, including bacterial ribonucleases, sulphite oxidases, reelin, netrins, sialidases, neuraminidases, some lipoprotein receptors, and a variety of glycosyl hydrolases [].; PDB: 2JKB_A 2VW0_A 2VW2_A 2VW1_A 2CN2_D 2CN3_B 2VK7_B 2VK5_A 2VK6_A 2BF6_A ....
Probab=59.10 E-value=12 Score=15.84 Aligned_cols=11 Identities=36% Similarity=0.830 Sum_probs=6.7
Q ss_pred EEcCCccCcEE
Q psy6330 70 YWRPGLRDWKQ 80 (123)
Q Consensus 70 ys~dg~~~W~~ 80 (123)
+|.|++.+|+.
T Consensus 2 ~S~D~G~TW~~ 12 (12)
T PF02012_consen 2 YSTDGGKTWKK 12 (12)
T ss_dssp EESSTTSS-EE
T ss_pred EeCCCcccCcC
Confidence 56776688863
No 14
>cd08159 APC10-like APC10-like DOC1 domains in E3 ubiquitin ligases that mediate substrate ubiquitination. This family contains the single domain protein, APC10, a subunit of the anaphase-promoting complex (APC), as well as the DOC1 domain of multi-domain proteins present in E3 ubiquitin ligases. E3 ubiquitin ligases mediate substrate ubiquitination (or ubiquitylation), a component of the ubiquitin-26S proteasome pathway for selective proteolytic degradation. The APC, a multi-protein complex (or cyclosome), is a cell cycle-regulated, E3 ubiquitin ligase that controls important transitions in mitosis and the G1 phase by ubiquitinating regulatory proteins, thereby targeting them for degradation. APC10-like DOC1 domains such as those present in HECT (Homologous to the E6-AP Carboxyl Terminus) and Cullin-RING (Really Interesting New Gene) E3 ubiquitin ligase proteins, HECTD3, and CUL7, respectively, are also included in this hierarchy. CUL7 is a member of the Cullin-RING ligase family and f
Probab=56.58 E-value=67 Score=22.57 Aligned_cols=91 Identities=8% Similarity=-0.060 Sum_probs=59.3
Q ss_pred CCCccccCCCCCCCcCceEEEecCCcEEEEEEEecCcccCCCCcceEe-EEEEEEEcCC----ccCcEEEecCCCcEEEE
Q psy6330 17 AGGAWCPKNPVEKDIREFIQIDLGSPHVVTGVQTQGRYDHGRGQEYTE-EYILEYWRPG----LRDWKQYYGGNGKAEYT 91 (123)
Q Consensus 17 ~~~aW~p~~~~~~~~~qWLQVDLg~~~~ItgI~tQG~~~~~~~~~~V~-sy~i~ys~dg----~~~W~~y~~~~~~~~~~ 91 (123)
....|..... ...+|+.+.|.+.+.|..|++.-.. .+ +.|.. .-.|.-+.+. +..+..+.+..|--.+.
T Consensus 24 ~~tyWQSdG~---qphh~i~l~f~k~v~i~~l~i~~~~-~D--eSYtP~~I~V~~G~~~~dL~e~~~v~l~~p~Gwv~i~ 97 (129)
T cd08159 24 YDTYWQSDGS---QGSHWIRLFMKKDVLIRVLAIFVDM-AD--SSYMPSLVVVYGGHSPSDLRELKDVNIRPSNGWVALL 97 (129)
T ss_pred CCccCCCCCC---CCCEEEEEEEcCCcEEEEEEEEecC-CC--CCcCCcEEEEEecCCHHHhheeEEEEecCCCceEEEE
Confidence 3578988876 6678999999999999999987665 43 35663 4444444443 11233333455554443
Q ss_pred EcCCCceEeeeEEeeeeeeCCCCc
Q psy6330 92 EEYILEYWRPGLRDWKQYYGGNGK 115 (123)
Q Consensus 92 ~~l~~~~~~~~~R~~p~~~~~~~~ 115 (123)
+..++.+.+|+|.-..=-.||+
T Consensus 98 --~~~~~~~~~iqI~I~~nhqnG~ 119 (129)
T cd08159 98 --EDDTLKCPYIEIRIKRCRSDGI 119 (129)
T ss_pred --cCCcEEEEEEEEEEHHhccCCc
Confidence 3457889999988776445665
No 15
>PF00055 Laminin_N: Laminin N-terminal (Domain VI); InterPro: IPR008211 Laminin is a large molecular weight glycoprotein present only in basement membranes in almost every animal tissue. Laminin is thought to mediate the attachment, migration and organisation of cells into tissues during embryonic development by interacting with other extracellular matrix components []. Each laminin is a heterotrimer assembled from alpha, beta and gamma chain subunits, secreted and incorporated into cell-associated extracellular matrices []. Basement membrane assembly is a cooperative process in which laminins polymerise through their N-terminal domain (LN or domain VI) and anchor to the cell surface through their G domains. Netrins may also associate with this network through heterotypic LN domain interactions []. This leads to cell signalling through integrins and dystroglycan (and possibly other receptors) recruited to the adherent laminin. This LN domain dependent self-assembly is considered to be crucial for the integrity of basement membranes, as highlighted by genetic forms of muscular dystrophy containing the deletion of the LN module from the alpha 2 laminin chain []. The laminin N-terminal domain is found in all laminin and netrin subunits except laminin alpha 3A, alpha 4 and gamma 2. ; PDB: 3TBD_A 3ZYG_B 3ZYI_B 3ZYJ_D 2Y38_A.
Probab=53.43 E-value=32 Score=26.48 Aligned_cols=58 Identities=21% Similarity=0.149 Sum_probs=34.5
Q ss_pred CCccccCCCC--CCCcCceEEEecCCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEe
Q psy6330 18 GGAWCPKNPV--EKDIREFIQIDLGSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYY 82 (123)
Q Consensus 18 ~~aW~p~~~~--~~~~~qWLQVDLg~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~ 82 (123)
..-|....-. .+...--|++||++...|+-|.++=....- .+..|+=|.|+ .+|..|.
T Consensus 69 ~TwWQS~~~~~g~~~~~VtitLdL~~~f~v~~v~l~F~spRP------~~miierS~D~-gtW~p~q 128 (237)
T PF00055_consen 69 STWWQSETLQNGVQYENVTITLDLGKEFEVTYVILQFCSPRP------AAMIIERSSDF-GTWQPWQ 128 (237)
T ss_dssp ---EEB--STTTTSTT-EEEEEEEEEEEEEEEEEEEESS---------SEEEEEEESST-TSEEEEE
T ss_pred CceecCCccCCCCcCcceEEEEcccceEEEEEEEEEEcCCCC------CeEEEEEccCC-CceeEeH
Confidence 3567666221 112356799999999999999998542221 23334459999 5999885
No 16
>KOG4350|consensus
Probab=44.78 E-value=1.2e+02 Score=25.95 Aligned_cols=92 Identities=11% Similarity=0.058 Sum_probs=58.9
Q ss_pred CceeeccCCCCCccccCCCCCCCcCceEEEecCCcEEEEEEEec--CcccCCCCcceEeEEEEEEEcCCccCcEEEecCC
Q psy6330 8 AKRRLKVEQAGGAWCPKNPVEKDIREFIQIDLGSPHVVTGVQTQ--GRYDHGRGQEYTEEYILEYWRPGLRDWKQYYGGN 85 (123)
Q Consensus 8 ~~aRLn~~~~~~aW~p~~~~~~~~~qWLQVDLg~~~~ItgI~tQ--G~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~~~~ 85 (123)
.+|.||......-|-....-.+-...-+.|-|++|+.|.++-+- .+-+ -..+|.|+-|.+- ..|.++.|..
T Consensus 451 RNALiNGD~~~YDWDsGYTCHQlGSG~IvvqLaQPY~igSmRlLLWdCDD------R~YSyYvevStNq-~eW~mvvDRt 523 (620)
T KOG4350|consen 451 RNALINGDSSSYDWDSGYTCHQLGSGLIVVQLAQPYIIGSMRLLLWDCDD------RFYSYYVEVSTNQ-DEWVMVVDRT 523 (620)
T ss_pred hhhhccCCcccccCcCCcchhhcCCceEEEEecCceeeeeeeEEEEecCC------CceeEEEEEEeCc-hheEEeeecc
Confidence 45667654334556665544445567889999999999876654 4422 2348999999999 9999998876
Q ss_pred CcEEE--EEcCCCceEeeeEEee
Q psy6330 86 GKAEY--TEEYILEYWRPGLRDW 106 (123)
Q Consensus 86 ~~~~~--~~~l~~~~~~~~~R~~ 106 (123)
...-- ++-.-.|.-+-|+||.
T Consensus 524 n~~c~sWQ~~~F~p~PvvyIRiV 546 (620)
T KOG4350|consen 524 NEECHSWQELIFDPLPVVYIRIV 546 (620)
T ss_pred cccccchhheeecCCceEEEEEE
Confidence 54321 1111133445566665
No 17
>PF03404 Mo-co_dimer: Mo-co oxidoreductase dimerisation domain; InterPro: IPR005066 The majority of molybdenum-containing enzymes utilise a molybdenum cofactor (MoCF or Moco) consisting of a Mo atom coordinated via a cis-dithiolene moiety to molybdopterin (MPT). MoCF is ubiquitous in nature, and the pathway for MoCF biosynthesis is conserved in all three domains of life. MoCF-containing enzymes function as oxidoreductases in carbon, nitrogen, and sulphur metabolism [, ]. In Escherichia coli, biosynthesis of MoCF is a three stage process. It begins with the MoaA and MoaC conversion of GTP to the meta-stable pterin intermediate precursor Z. The second stage involves MPT synthase (MoaD and MoaE), which converts precursor Z to MPT; MoeB is involved in the recycling of MPT synthase. The final step in MoCF synthesis is the attachment of mononuclear Mo to MPT, a process that requires MoeA and which is enhanced by MogA in an Mg2 ATP-dependent manner []. MoCF is the active co-factor in eukaryotic and some prokaryotic molybdo-enzymes, but the majority of bacterial enzymes requiring MoCF, need a modification of MTP for it to be active; MobA is involved in the attachment of a nucleotide monophosphate to MPT resulting in the MGD co-factor, the active co-factor for most prokaryotic molybdo-enzymes. Bacterial two-hybrid studies have revealed the close interactions between MoeA, MogA, and MobA in the synthesis of MoCF []. Moreover the close functional association of MoeA and MogA in the synthesis of MoCF is supported by fact that the known eukaryotic homologues to MoeA and MogA exist as fusion proteins: CNX1 (Q39054 from SWISSPROT) of Arabidopsis thaliana (Mouse-ear cress), mammalian Gephryin (e.g. Q9NQX3 from SWISSPROT) and Drosophila melanogaster (Fruit fly) Cinnamon (P39205 from SWISSPROT) []. This domain is found in molybdopterin cofactor oxidoreductases, such as in the C-terminal of Mo-containing sulphite oxidase, which catalyses the conversion of sulphite to sulphate, the terminal step in the oxidative degradation of cysteine and methionine []. This domain is involved in dimer formation, and has an Ig-fold structure [].; GO: 0016491 oxidoreductase activity, 0030151 molybdenum ion binding, 0055114 oxidation-reduction process; PDB: 2C9X_A 2CA3_A 2BLF_A 2CA4_A 2BPB_A 2XTS_C 2BII_A 2BIH_A 1OGP_A 2A9A_B ....
Probab=43.44 E-value=67 Score=22.33 Aligned_cols=38 Identities=16% Similarity=0.156 Sum_probs=25.3
Q ss_pred ecC-CcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEec
Q psy6330 38 DLG-SPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYYG 83 (123)
Q Consensus 38 DLg-~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~~ 83 (123)
..+ ++..|.|+|.-|... -|. +|+.|.||+.+|....-
T Consensus 23 ~~~~~~v~i~G~A~~g~g~------~I~--rVEVS~DgG~tW~~A~l 61 (131)
T PF03404_consen 23 KAGDGTVTIRGYAWSGGGR------GIA--RVEVSTDGGKTWQEATL 61 (131)
T ss_dssp ESESEEEEEEEEEE-STT--------EE--EEEEESSTTSSEEE-EE
T ss_pred ccCCcEEEEEEEEEeCCCc------ceE--EEEEEeCCCCCcEEeEe
Confidence 444 688999999988631 232 46789998688998863
No 18
>cd02847 Chitobiase_C_term Chitobiase C-terminus domain. Chitobiase (AKA N-acetylglucosaminidase) digests the beta, 1-4 glycosidic bonds of the N-acetylglucosamine (NAG) oligomers found in chitin, an important structural element of fungal cell wall and arthropod exoskeletons. It is thought to proceed through an acid-base reaction mechanism, in which one protein carboxylate acts as catalytic acid, while the nucleophile is the polar acetamido group of the sugar in a substrate-assisted reaction with retention of the anomeric configuration. The C-terminus of chitobiase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chit
Probab=40.13 E-value=24 Score=22.59 Aligned_cols=21 Identities=24% Similarity=0.534 Sum_probs=16.8
Q ss_pred eEEEEEEEcCCccCcEEEecC
Q psy6330 64 EEYILEYWRPGLRDWKQYYGG 84 (123)
Q Consensus 64 ~sy~i~ys~dg~~~W~~y~~~ 84 (123)
....|.|+.||..+|..|...
T Consensus 31 pg~~i~Yt~dgg~~w~~Y~~p 51 (78)
T cd02847 31 PGLTLQYSTDGGKNWNIYDAA 51 (78)
T ss_pred CCcEEEEEecCCccCeecccc
Confidence 356789999965899999763
No 19
>cd02110 SO_family_Moco_dimer Subgroup of sulfite oxidase (SO) family molybdopterin binding domains that contains conserved dimerization domain. This molybdopterin cofactor (Moco) binding domain is found in a variety of oxidoreductases, main members of this family are nitrate reductase (NR) and sulfite oxidase (SO).
Probab=30.66 E-value=2.8e+02 Score=22.12 Aligned_cols=37 Identities=22% Similarity=0.176 Sum_probs=25.3
Q ss_pred CCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEecCC
Q psy6330 40 GSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYYGGN 85 (123)
Q Consensus 40 g~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~~~~ 85 (123)
++...|.|+|--|.. -| -+|.+|.||+.+|+.+.-..
T Consensus 222 ~~~~~i~G~A~~g~~-------~I--~rVEvS~DgG~tW~~A~l~~ 258 (317)
T cd02110 222 GGRVEIGGVAWSGGR-------GI--RRVEVSLDGGRTWQEARLEG 258 (317)
T ss_pred CCeEEEEEEEEcCCC-------CE--EEEEEEeCCCCcceEeEccC
Confidence 556777777775531 13 34899999967999987543
No 20
>KOG4350|consensus
Probab=25.30 E-value=47 Score=28.42 Aligned_cols=50 Identities=22% Similarity=0.291 Sum_probs=36.8
Q ss_pred CcCceEEEecCCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEecC
Q psy6330 30 DIREFIQIDLGSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYYGG 84 (123)
Q Consensus 30 ~~~qWLQVDLg~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~~~ 84 (123)
+...=+|||||++..|..|-+--= +.+ .-.-+|.|.-|.|. ..|..+.|-
T Consensus 324 ~~~sgi~i~LG~p~iINhIrmllW-drd---sraysY~veVSmD~-~hW~rV~Dy 373 (620)
T KOG4350|consen 324 ENISGIQIDLGKPFIINHIRMLLW-DRD---SRAYSYQVEVSMDD-AHWNRVADY 373 (620)
T ss_pred ccceeEEEecCCchhhhhhhhhhh-ccc---ccceEEEEEEecch-hhhHHhhhh
Confidence 445679999999999888776531 111 33458999999999 999887653
No 21
>KOG4432|consensus
Probab=24.76 E-value=1.4e+02 Score=24.50 Aligned_cols=56 Identities=21% Similarity=0.268 Sum_probs=41.6
Q ss_pred cceEeEEEEEEEcCC-ccCcEEEecCCCcEE-----------EEEcCCCceEeeeEEeeeeeeCCCCcccc
Q psy6330 60 QEYTEEYILEYWRPG-LRDWKQYYGGNGKAE-----------YTEEYILEYWRPGLRDWKQYYGGNGKAVD 118 (123)
Q Consensus 60 ~~~V~sy~i~ys~dg-~~~W~~y~~~~~~~~-----------~~~~l~~~~~~~~~R~~p~~~~~~~~~~~ 118 (123)
+-|++-+++-|..+| +++|...+-.+...+ +...|.||+.+-.+|+.. ++.|+.||
T Consensus 204 spy~~P~rl~y~qNGi~knWDl~k~hdSvt~iL~n~srk~LVlvqqfRpaVy~G~~~~~~---~g~~~~vD 271 (405)
T KOG4432|consen 204 SPYYNPKRLIYTQNGITKNWDLAKCHDSVTCILVNMSRKELVLVQQFRPAVYVGKNRFLK---EGIGKPVD 271 (405)
T ss_pred CCCcCceEEEEEecCcccccchhhCCCceEEEEEeccchheehhhhcCcceeecceeecc---cCCCCccc
Confidence 468899999999999 789998775555433 346789999888888842 24566666
No 22
>PF04648 MF_alpha: Yeast mating factor alpha hormone; InterPro: IPR006742 This repeated sequence,WHWLQLKPGQPMY, characterises the mating factor alpha-1 or alpha-1 mating pheromone [contains: Mating factor alpha].The hormone is excreted into the culture medium by haploid cells of the alpha mating type and acts on cells of the opposite mating type (type A) by binding to a cognate G-protein coupled receptor which is coupled to a downstream signal transduction pathway. It inhibits DNA synthesis in type A cells synchronising them with type alpha, and so mediates the conjugation process.; GO: 0000772 mating pheromone activity, 0019953 sexual reproduction, 0005576 extracellular region
Probab=24.57 E-value=37 Score=14.60 Aligned_cols=11 Identities=18% Similarity=0.679 Sum_probs=7.4
Q ss_pred ceEEEecCCcE
Q psy6330 33 EFIQIDLGSPH 43 (123)
Q Consensus 33 qWLQVDLg~~~ 43 (123)
.||+++-|+++
T Consensus 2 hWL~~~~GqP~ 12 (13)
T PF04648_consen 2 HWLRLSPGQPM 12 (13)
T ss_pred cceeccCCCcC
Confidence 47777777654
No 23
>cd02114 bact_SorA_Moco sulfite:cytochrome c oxidoreductase subunit A (SorA), molybdopterin binding domain. SorA is involved in oxidation of sulfur compounds during chemolithothrophic growth. Together with SorB, a small c-type heme containing subunit, it forms a hetrodimer. It is a member of the sulfite oxidase (SO) family of molybdopterin binding domains. This molybdopterin cofactor (Moco) binding domain is found in a variety of oxidoreductases, main members of this family are nitrate reductase (NR) and sulfite oxidase (SO). Common features of all known members of this family are that they contain one single pterin cofactor and part of the coordination of the metal (Mo) is a cysteine ligand of the protein and that they catalyze the transfer of an oxygen to or from a lone pair of electrons on the substrate.
Probab=23.20 E-value=1.9e+02 Score=23.69 Aligned_cols=36 Identities=14% Similarity=0.122 Sum_probs=25.8
Q ss_pred cCCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEec
Q psy6330 39 LGSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYYG 83 (123)
Q Consensus 39 Lg~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~~ 83 (123)
.++++.|+|+|--|.. -|+ .|..|.||+.+|+.+.-
T Consensus 273 ~~~~~~i~G~A~~G~~-------~I~--rVEVS~DgG~tW~~A~l 308 (367)
T cd02114 273 PAGELALRGIAFDGGS-------GIR--RVDVSADGGDSWTQATL 308 (367)
T ss_pred CCCeEEEEEEEEcCCC-------CEE--EEEEEeCCCCcceEeEe
Confidence 4567888888876531 133 47889998689999864
No 24
>PF07081 DUF1349: Protein of unknown function (DUF1349); InterPro: IPR009784 This family consists of several hypothetical bacterial proteins but contains one sequence (P40893 from SWISSPROT) from Saccharomyces cerevisiae. Members of this family are typically around 200 residues in length. The function of this family is unknown.; PDB: 3MEP_B 3O12_A.
Probab=20.88 E-value=1.1e+02 Score=22.41 Aligned_cols=53 Identities=17% Similarity=0.295 Sum_probs=31.8
Q ss_pred CcCceEEEec----CCcEEEEEEEecCcccCC-------CCcceEe------EEEEEEEcCCccCcEEEec
Q psy6330 30 DIREFIQIDL----GSPHVVTGVQTQGRYDHG-------RGQEYTE------EYILEYWRPGLRDWKQYYG 83 (123)
Q Consensus 30 ~~~qWLQVDL----g~~~~ItgI~tQG~~~~~-------~~~~~V~------sy~i~ys~dg~~~W~~y~~ 83 (123)
+...|+...+ .....+..|.|.|..+-. ...-|++ .|.+.||.|| ..|..++.
T Consensus 74 ~~~~WiK~giE~~~~g~~~l~sV~t~~~SDws~~~~~~~~~~~~lrv~R~g~~~~~~ys~DG-~~w~~~R~ 143 (183)
T PF07081_consen 74 DEDNWIKAGIEYSNDGTPRLSSVVTNGYSDWSLSPLPSDGQSVWLRVERRGDDLWIYYSADG-KTWTLLRI 143 (183)
T ss_dssp ETTEEEEEEEEE-ETTCEEEEEEEEESSEEEEEEE--SBTTSEEEEEEEETTEEEEEEESSS-S---EEEE
T ss_pred CCcccEEEEEEEecCCCceEEEEeccCcccccccccCCCCCEEEEEEEEeCCEEEEEEEcCC-CEEEEEEE
Confidence 3457887644 345678888888654311 0122332 6889999999 99998864
No 25
>PF11743 DUF3301: Protein of unknown function (DUF3301); InterPro: IPR021732 This family is conserved in Proteobacteria, but the function is not known.
Probab=20.41 E-value=2.5e+02 Score=18.47 Aligned_cols=15 Identities=20% Similarity=0.450 Sum_probs=13.6
Q ss_pred cceEeEEEEEEEcCC
Q psy6330 60 QEYTEEYILEYWRPG 74 (123)
Q Consensus 60 ~~~V~sy~i~ys~dg 74 (123)
..|.+.|..+||.||
T Consensus 60 ~~~~r~y~FEFS~~G 74 (97)
T PF11743_consen 60 LRWRRVYQFEFSSDG 74 (97)
T ss_pred eEEEEEEEEEEeCCC
Confidence 468899999999999
No 26
>KOG3512|consensus
Probab=20.32 E-value=3e+02 Score=24.04 Aligned_cols=64 Identities=14% Similarity=0.134 Sum_probs=36.3
Q ss_pred ccCCCCCccccCCCCC--CCcCceEEEecCCcEEEEEEEecCcccCCCCcceEeEEEEEEEcCCccCcEEEe
Q psy6330 13 KVEQAGGAWCPKNPVE--KDIREFIQIDLGSPHVVTGVQTQGRYDHGRGQEYTEEYILEYWRPGLRDWKQYY 82 (123)
Q Consensus 13 n~~~~~~aW~p~~~~~--~~~~qWLQVDLg~~~~ItgI~tQG~~~~~~~~~~V~sy~i~ys~dg~~~W~~y~ 82 (123)
|..+....|...+.+. ...+-=|.+.|+++..+|-|.++-..... .+-.|.=|.|-+.+|++|.
T Consensus 104 n~~~n~TcWqS~tw~~~~~PlnVtlTLSlgKkfELT~Vsl~FcS~rP------dsmaL~KS~D~GrTWqPfQ 169 (592)
T KOG3512|consen 104 NGPGNATCWQSETWSRYPSPLNVTLTLSLGKKFELTYVSLTFCSGRP------DSMALEKSLDYGRTWQPFQ 169 (592)
T ss_pred CCCCCcceeeccccCCCCCCceEEEEEecCcEEEEEEEEEEEecCCC------ceeeeeeccccCCcccccc
Confidence 3333456677753211 12234466778999999998887443221 1223333455439999884
Done!