Query psy6778
Match_columns 314
No_of_seqs 275 out of 1307
Neff 6.2
Searched_HMMs 46136
Date Fri Aug 16 18:09:58 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6778.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6778hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3512|consensus 100.0 7.7E-84 1.7E-88 623.0 20.7 275 23-306 32-389 (592)
2 KOG0994|consensus 100.0 9.2E-74 2E-78 590.5 18.3 269 26-313 29-404 (1758)
3 smart00136 LamNT Laminin N-ter 100.0 3.2E-51 7E-56 376.9 13.4 157 32-197 2-164 (238)
4 KOG1836|consensus 100.0 8E-49 1.7E-53 431.0 17.6 299 4-313 6-400 (1705)
5 PF00055 Laminin_N: Laminin N- 100.0 2.8E-48 6E-53 357.5 9.0 154 37-197 1-163 (237)
6 KOG0994|consensus 99.7 4.5E-18 9.7E-23 178.5 7.5 112 197-313 334-464 (1758)
7 KOG3512|consensus 99.7 7.7E-18 1.7E-22 164.7 6.4 98 197-301 329-438 (592)
8 KOG1836|consensus 99.4 3.5E-13 7.5E-18 150.6 8.0 107 197-313 726-837 (1705)
9 cd00055 EGF_Lam Laminin-type e 98.7 2.1E-08 4.6E-13 71.0 4.8 42 201-245 1-45 (50)
10 cd00055 EGF_Lam Laminin-type e 98.6 6.8E-08 1.5E-12 68.3 4.6 42 259-303 1-45 (50)
11 PF00053 Laminin_EGF: Laminin 98.5 4.4E-08 9.6E-13 68.8 1.7 41 202-245 1-44 (49)
12 KOG1388|consensus 98.4 1.1E-07 2.4E-12 85.9 2.9 89 198-303 48-140 (217)
13 PF00053 Laminin_EGF: Laminin 98.4 7.2E-08 1.6E-12 67.8 0.7 43 260-305 1-46 (49)
14 smart00180 EGF_Lam Laminin-typ 98.4 3.9E-07 8.5E-12 63.5 3.9 37 202-241 1-40 (46)
15 smart00180 EGF_Lam Laminin-typ 98.2 1.5E-06 3.2E-11 60.6 3.3 37 260-299 1-40 (46)
16 KOG3509|consensus 98.0 1.1E-05 2.3E-10 87.0 7.7 98 197-304 749-854 (964)
17 KOG3509|consensus 97.7 4.9E-05 1.1E-09 82.0 6.2 101 201-312 703-803 (964)
18 PF00754 F5_F8_type_C: F5/8 ty 97.3 0.00058 1.3E-08 54.8 5.7 68 84-164 16-85 (129)
19 KOG4289|consensus 96.9 0.0017 3.7E-08 71.9 6.4 71 219-295 1739-1841(2531)
20 cd00057 FA58C Substituted upda 96.8 0.0032 6.9E-08 52.3 6.4 68 82-163 23-93 (143)
21 KOG1388|consensus 96.6 0.0021 4.5E-08 58.5 3.8 96 202-313 3-99 (217)
22 KOG4260|consensus 95.8 0.0092 2E-07 56.3 3.8 63 222-302 131-193 (350)
23 KOG1219|consensus 94.1 0.052 1.1E-06 63.3 4.3 31 206-236 3873-3903(4289)
24 KOG4289|consensus 93.6 0.064 1.4E-06 60.2 3.9 71 209-297 1251-1322(2531)
25 PF02012 BNR: BNR/Asp-box repe 92.3 0.094 2E-06 26.6 1.3 12 150-161 1-12 (12)
26 KOG1225|consensus 91.9 0.3 6.4E-06 50.4 5.8 71 219-300 265-364 (525)
27 cd08159 APC10-like APC10-like 91.7 0.46 9.9E-06 40.3 5.8 56 84-151 15-72 (129)
28 cd08666 APC10-HECTD3 APC10-lik 91.3 0.47 1E-05 40.5 5.4 56 83-150 19-76 (134)
29 KOG1219|consensus 91.3 0.27 5.8E-06 57.9 4.9 71 209-299 3915-3985(4289)
30 cd08366 APC10 APC10 subunit of 91.1 0.54 1.2E-05 40.3 5.7 60 82-154 17-79 (139)
31 KOG1226|consensus 90.6 0.49 1.1E-05 50.4 5.9 20 190-209 515-534 (783)
32 cd08365 APC10-like1 APC10-like 90.2 0.66 1.4E-05 39.5 5.3 56 84-151 16-73 (131)
33 KOG3437|consensus 89.7 0.68 1.5E-05 41.2 5.2 70 82-166 36-106 (184)
34 cd08665 APC10-CUL7 APC10-like 88.7 1.2 2.7E-05 37.8 5.9 58 82-151 13-72 (131)
35 cd08667 APC10-ZZEF1 APC10/DOC1 88.5 1.1 2.3E-05 38.2 5.4 56 84-151 15-72 (131)
36 KOG0196|consensus 87.5 0.95 2.1E-05 48.9 5.4 64 218-306 258-325 (996)
37 COG5156 DOC1 Anaphase-promotin 85.3 0.79 1.7E-05 40.2 3.0 71 79-166 37-110 (189)
38 PF07974 EGF_2: EGF-like domai 84.1 0.94 2E-05 29.1 2.2 26 203-232 7-32 (32)
39 PF03256 APC10: Anaphase-promo 82.2 1.8 3.8E-05 39.2 4.0 60 82-154 44-107 (193)
40 KOG1226|consensus 79.2 8.2 0.00018 41.5 8.2 14 220-233 479-492 (783)
41 KOG4260|consensus 77.3 1.4 3E-05 42.0 1.7 38 204-242 152-191 (350)
42 KOG1225|consensus 75.7 6.1 0.00013 41.0 6.0 15 219-233 296-310 (525)
43 cd08664 APC10-HERC2 APC10-like 74.7 8.1 0.00017 33.7 5.7 57 82-151 37-95 (152)
44 PF12661 hEGF: Human growth fa 70.4 1.9 4.1E-05 22.2 0.5 13 220-232 1-13 (13)
45 cd00185 TNFR Tumor necrosis fa 66.3 25 0.00054 27.8 6.5 63 222-303 26-89 (98)
46 smart00607 FTP eel-Fucolectin 60.3 17 0.00038 31.4 4.9 20 119-138 53-72 (151)
47 PF04863 EGF_alliinase: Alliin 57.9 4.1 9E-05 29.5 0.5 31 203-234 18-51 (56)
48 KOG4276|consensus 51.6 21 0.00046 29.1 3.6 50 120-170 10-61 (113)
49 KOG1214|consensus 49.2 18 0.00039 39.7 3.6 54 219-278 797-850 (1289)
50 smart00181 EGF Epidermal growt 47.1 21 0.00045 22.1 2.4 22 210-232 12-34 (35)
51 PF07738 Sad1_UNC: Sad1 / UNC- 44.2 57 0.0012 27.0 5.4 63 93-166 18-86 (135)
52 smart00231 FA58C Coagulation f 43.2 37 0.0008 28.0 4.1 45 117-163 44-89 (139)
53 PTZ00382 Variant-specific surf 40.6 54 0.0012 26.2 4.5 47 230-301 4-50 (96)
54 PHA03099 epidermal growth fact 37.8 21 0.00045 30.4 1.7 26 268-294 57-84 (139)
55 smart00179 EGF_CA Calcium-bind 33.0 51 0.0011 20.4 2.7 23 210-233 16-39 (39)
56 PHA02668 GM-CSF/IL-2 inhibitio 32.9 76 0.0016 29.7 4.6 40 116-163 81-120 (265)
57 cd00054 EGF_CA Calcium-binding 31.8 59 0.0013 19.7 2.8 16 275-291 23-38 (38)
58 KOG4350|consensus 31.3 17 0.00036 36.8 0.2 54 117-171 326-380 (620)
59 cd00053 EGF Epidermal growth f 28.0 67 0.0015 19.0 2.5 21 209-230 12-32 (36)
60 KOG1214|consensus 27.3 1.5E+02 0.0033 32.9 6.3 74 226-300 843-933 (1289)
61 smart00051 DSL delta serrate l 27.0 65 0.0014 23.7 2.6 23 205-232 41-63 (63)
62 PF01834 XRCC1_N: XRCC1 N term 26.7 36 0.00078 29.8 1.4 47 81-138 17-63 (153)
63 PF00008 EGF: EGF-like domain 24.1 21 0.00045 22.5 -0.4 23 209-231 10-32 (32)
64 KOG4611|consensus 22.3 1.2E+02 0.0025 30.5 4.1 66 230-308 50-118 (747)
65 PF06219 DUF1005: Protein of u 22.0 55 0.0012 33.2 1.9 128 23-168 166-305 (460)
66 PF12955 DUF3844: Domain of un 21.8 56 0.0012 26.7 1.5 11 203-214 14-24 (103)
67 PTZ00214 high cysteine membran 21.1 3E+02 0.0065 30.3 7.3 17 287-303 558-574 (800)
68 cd02847 Chitobiase_C_term Chit 20.8 64 0.0014 24.9 1.6 23 141-164 28-50 (78)
69 PF13956 Ibs_toxin: Toxin Ibs, 20.1 62 0.0013 18.3 1.0 13 1-13 2-14 (19)
No 1
>KOG3512|consensus
Probab=100.00 E-value=7.7e-84 Score=623.02 Aligned_cols=275 Identities=55% Similarity=1.067 Sum_probs=260.6
Q ss_pred CCCCCCc-cCCCcccCCCcchhhccCCceeecCccCCCCCccccCCC--CCcc-cCCCCCCCCCCCCceecccCCCCCce
Q psy6778 23 LNSDPCY-ENGRPRRCIPDFVNAAFGQEVKVNSVCGLESPERYCDTS--GACH-VCDAGSPRGRFPAEYLTDLNNPSNVT 98 (314)
Q Consensus 23 ~~~~~c~-~~g~~~~C~P~~~nla~g~~v~assTCG~~~~e~yC~~~--~~C~-~Cd~~~p~~~hp~~~~~D~~~~~~~t 98 (314)
++.|||+ +.|.|++|+|.|+|+|+|++|.|++||| ++||+||.+. ..|. .|||++|+++||+++|+|.+++.+.|
T Consensus 32 s~~dpc~~e~g~p~~C~P~~vnaa~g~~V~as~TCG-drPe~~c~~~~~~~~~~eCdAs~p~~AHpPalltD~n~~~n~T 110 (592)
T KOG3512|consen 32 SQVDPCEDETGEPRACQPEFVNAAFGKKVPASSTCG-DRPETFCSVENPYLCSNECDASNPDLAHPPALLTDLNGPGNAT 110 (592)
T ss_pred CCCCCccCCCCCeeecChhhhhhhhCCccCCccccC-CCccceeeecCCCcccccccCCCccccCChHHhcCCCCCCCcc
Confidence 3579999 7899999999999999999999999999 6999999987 7785 99999999999999999999999999
Q ss_pred eeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEeeeccCCCCCCceEEEeecCCCCCcccceeehhhhhcccCC-CC
Q psy6778 99 CWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLSFCPKSIKPDSLAIYKSQDFGKSWQPLQFYSSQCKKLYGR-TA 177 (314)
Q Consensus 99 wWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~~~s~rP~~~~iekS~D~g~tw~p~~yya~~C~~~f~~-~~ 177 (314)
+|||++|. ....|.||||||+|+|.||||||+|+|+ |+||++|+||||+|||+||+|||||+++|.+.|+| +.
T Consensus 111 cWqS~tw~----~~~~PlnVtlTLSlgKkfELT~Vsl~Fc--S~rPdsmaL~KS~D~GrTWqPfQFYss~C~~~fgr~~~ 184 (592)
T KOG3512|consen 111 CWQSETWS----RYPSPLNVTLTLSLGKKFELTYVSLTFC--SGRPDSMALEKSLDYGRTWQPFQFYSSDCRKAFGRSPR 184 (592)
T ss_pred eeeccccC----CCCCCceEEEEEecCcEEEEEEEEEEEe--cCCCceeeeeeccccCCcccccchhHHHHHHHhCCCCc
Confidence 99999874 2457899999999999999999999999 99999999999999999999999999999999999 66
Q ss_pred cCccccCCCCCceecccCc-------------------------------------------------------------
Q psy6778 178 TGVISRGNEQEALCTDRHK------------------------------------------------------------- 196 (314)
Q Consensus 178 ~~~~~~~~~~~~~Ct~~ys------------------------------------------------------------- 196 (314)
...+++.+++|++||++|+
T Consensus 185 ~a~isk~n~~Ea~Ct~s~~~~~~~~~~~~af~~l~~Rps~~dldss~vLqDwvTaTDiRvvl~Rp~s~G~~~dde~~l~r 264 (592)
T KOG3512|consen 185 RADISKSNEQEALCTESYSDGAGSGGGKIAFFELEDRPSAFDLDSSPVLQDWVTATDIRVVLLRPASLGEEYDDEANLAR 264 (592)
T ss_pred ccccccCCccceeeccccccCCCCCCceEEeeeecCCccccccccchhhhhheeeeeeEEEEechhhcCCcchhhhhhhh
Confidence 7778889999999999886
Q ss_pred --------ccccCCCCCCCCCCCccCCCCceeeeCCCCCCCCCCccCCCCCCCCCCCCCcccCCCCccCCcccCCCCCCC
Q psy6778 197 --------KQGGRCKCNGHASRCIKSPQGELVCECRHNTAGKDCEKCKPFFSDRPWGRATVYDANECKGGRCKCNGHASR 268 (314)
Q Consensus 197 --------~v~g~C~CnGHa~~C~~~~~g~~~C~C~~nT~G~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~C~Cn~h~~~ 268 (314)
.|+|||+|||||++|+.+.+++.+|+|+|||+|+.|++|||+|+++||++||...+++|. +|+||+|+.+
T Consensus 265 YfYAisdl~VgGRCKCNgHAs~Cv~d~~~~ltCdC~HNTaGPdCgrCKpfy~dRPW~raT~~~a~~c~--ac~Cn~harr 342 (592)
T KOG3512|consen 265 YFYAISDLAVGGRCKCNGHASRCVMDESSHLTCDCEHNTAGPDCGRCKPFYYDRPWGRATALPANECV--ACNCNGHARR 342 (592)
T ss_pred hheecccceeeeeeeecCccceeeeccCCceEEecccCCCCCCcccccccccCCCccccccCCCcccc--ccccchhhhh
Confidence 799999999999999998878899999999999999999999999999999999999999 9999999999
Q ss_pred cccC---------CCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy6778 269 CIKS---------PQGALVCECRHNTAGKDCEKCKPFFSDRPWGRAT 306 (314)
Q Consensus 269 C~~~---------~~~G~C~~C~~nt~G~~Ce~C~~Gyy~~p~~~~~ 306 (314)
|.|+ .++|+|++|+|||+|+||+.|++|||||+..+.+
T Consensus 343 crfn~Ely~lSgr~SggvClnCrHnTaGrhChyCreGyyRd~s~pl~ 389 (592)
T KOG3512|consen 343 CRFNMELYRLSGRRSGGVCLNCRHNTAGRHCHYCREGYYRDGSKPLT 389 (592)
T ss_pred cccchhhhcccCccccceEeecccCCCCcccccccCccccCCCCCCc
Confidence 9988 5689999999999999999999999999998876
No 2
>KOG0994|consensus
Probab=100.00 E-value=9.2e-74 Score=590.48 Aligned_cols=269 Identities=36% Similarity=0.801 Sum_probs=241.0
Q ss_pred CCCccCCCcccCCCcchhhccCC--ceeecCccCCCCCccccCCC-----CCcccCCCCCCCCCCC-Ccee-----cccC
Q psy6778 26 DPCYENGRPRRCIPDFVNAAFGQ--EVKVNSVCGLESPERYCDTS-----GACHVCDAGSPRGRFP-AEYL-----TDLN 92 (314)
Q Consensus 26 ~~c~~~g~~~~C~P~~~nla~g~--~v~assTCG~~~~e~yC~~~-----~~C~~Cd~~~p~~~hp-~~~~-----~D~~ 92 (314)
.+|.+ .+|||+++||++|| ++.|+||||+++||+||+++ ++|++||++.|.++|+ ++++ +.+-
T Consensus 29 ~~c~~----~sCyP~tGnLliGR~~~L~AsSTCGl~~pe~yCi~Shlqdq~KCf~Cdsr~p~~~~~~pshrienVvss~~ 104 (1758)
T KOG0994|consen 29 SGCSR----GSCYPATGNLLIGRARKLTASSTCGLNKPERYCIVSHLQDQKKCFKCDSRYPHKAHLNPSHRIENVVSSFA 104 (1758)
T ss_pred Ccccc----CCcCCCccceeecchhheeecccccCCCCcceEEeeccccccccccccCCCccccCCChhhhhhhhhhccC
Confidence 36775 48999999999999 79999999999999999998 8999999999988776 4444 3344
Q ss_pred CCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEeeeccCCCCCCceEEEeecCCCCCcccceeehhhhhcc
Q psy6778 93 NPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLSFCPKSIKPDSLAIYKSQDFGKSWQPLQFYSSQCKKL 172 (314)
Q Consensus 93 ~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~~~s~rP~~~~iekS~D~g~tw~p~~yya~~C~~~ 172 (314)
.+.+.|||||+++ .++|+|+|||+.+|+|||+||+|+ ++||++|+||||.|||+||.+|+|||+||..+
T Consensus 105 p~r~~~WWQSeng---------v~~vtIQLDLEAEFhFTHLImtFk--tfRPAAMliERS~DFGkTW~vYrYFAyDC~as 173 (1758)
T KOG0994|consen 105 PPRNITWWQSENG---------VENVTIQLDLEAEFHFTHLIMTFK--TFRPAAMLIERSSDFGKTWHVYRYFAYDCSAT 173 (1758)
T ss_pred CCcccchhhcccC---------CCcceEEeehhhheeeeeeeEeec--cCCcceeeeeecccccccceeeeeeecccccC
Confidence 5666799999998 589999999999999999999999 99999999999999999999999999999999
Q ss_pred cCCCCcCccccCCCCCceecccCc--------------------------------------------------------
Q psy6778 173 YGRTATGVISRGNEQEALCTDRHK-------------------------------------------------------- 196 (314)
Q Consensus 173 f~~~~~~~~~~~~~~~~~Ct~~ys-------------------------------------------------------- 196 (314)
|++.+.++..+ .++|||++.||
T Consensus 174 FPGv~~~~~kk--~~DviCtSrYS~~~PstgGEVifrvl~P~~~iedPYs~~IQ~~LKITNLRvn~tklhtlgdnllD~r 251 (1758)
T KOG0994|consen 174 FPGVPTGPPKK--WDDVICTSRYSDPEPSTGGEVIFRVLDPAIDIEDPYSAKIQELLKITNLRVNFTKLHTLGDNLLDSR 251 (1758)
T ss_pred CCCCCCCCccc--ccceeeecccCCCCCCCCCeEEEEecCCCCCCCCchhHHHHHHhhhhheeeeeEeeccccccccccc
Confidence 98877765443 57899999997
Q ss_pred --------------ccccCCCCCCCCCCCccCC------------CCceeeeCCCCCCCCCCccCCCCCCCCCCCCCccc
Q psy6778 197 --------------KQGGRCKCNGHASRCIKSP------------QGELVCECRHNTAGKDCEKCKPFFSDRPWGRATVY 250 (314)
Q Consensus 197 --------------~v~g~C~CnGHa~~C~~~~------------~g~~~C~C~~nT~G~~Ce~C~pgy~~~p~~~~t~~ 250 (314)
.|.|.|+|+|||++|.|.+ ..+++|.|+|||+|.+||+|+++|+++||+||...
T Consensus 252 ~E~~ekyyYAiy~~vVrGsCfCyGHAs~C~P~~g~~s~~~~~ta~mVHG~C~C~HNT~G~nCE~C~~fYnDlPWrpAeG~ 331 (1758)
T KOG0994|consen 252 EEIREKYYYAIYDLVVRGSCFCYGHASQCAPVDGARSAKAPGTAHMVHGRCMCKHNTAGLNCEHCAPFYNDLPWRPAEGK 331 (1758)
T ss_pred cccccchhheeeeeeeecceeecCchhhcccCCCCCcccCCCccceecceeEeccCCCCCChHHhhHhhcCCCCCccCCC
Confidence 7899999999999998853 12578999999999999999999999999999999
Q ss_pred CCCCccCCcccCCCCCCCcccC---------CCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCCCC
Q psy6778 251 DANECKGGRCKCNGHASRCIKS---------PQGALVCECRHNTAGKDCEKCKPFFSDRPWGRAT---VYDANEC 313 (314)
Q Consensus 251 ~~~~C~~~~C~Cn~h~~~C~~~---------~~~G~C~~C~~nt~G~~Ce~C~~Gyy~~p~~~~~---~~~~~~C 313 (314)
++++|+ .|+||+|+.+|+|| .++|||.+|+|||+|++||+|+|+|||+|.++.+ .|.+++|
T Consensus 332 ~~neCr--kC~CNgHa~sCHFD~aV~~ASG~vSGGVCDdCqHNT~G~~CE~CkP~fYRdprr~i~~p~vC~pC~C 404 (1758)
T KOG0994|consen 332 TSNECR--KCECNGHADTCHFDMAVYEASGNVSGGVCDDCQHNTEGQNCERCKPFFYRDPRRDISDPDVCKPCEC 404 (1758)
T ss_pred Cccccc--ccCCCCCcccccccHHHHhhcCCcccccCccccccccccchhhcCcccccCCCCCCCCccccccccC
Confidence 999999 99999999999998 5789999999999999999999999999987665 4455554
No 3
>smart00136 LamNT Laminin N-terminal domain (domain VI). N-terminal domain of laminins and laminin-related protein such as Unc-6/ netrins.
Probab=100.00 E-value=3.2e-51 Score=376.86 Aligned_cols=157 Identities=52% Similarity=1.000 Sum_probs=148.5
Q ss_pred CCcccCCCcchhhccCCceeecCccCCCCCccccCCC------CCcccCCCCCCCCCCCCceecccCCCCCceeeccCCc
Q psy6778 32 GRPRRCIPDFVNAAFGQEVKVNSVCGLESPERYCDTS------GACHVCDAGSPRGRFPAEYLTDLNNPSNVTCWRSEAQ 105 (314)
Q Consensus 32 g~~~~C~P~~~nla~g~~v~assTCG~~~~e~yC~~~------~~C~~Cd~~~p~~~hp~~~~~D~~~~~~~twWqS~~~ 105 (314)
+++++|+|+|+|||+||+|.|+||||+++||+||+++ ++|++||+++|.++||+++|+|.+++++.|||||+++
T Consensus 2 ~~~~~C~P~~~nla~g~~v~assTCG~~~pe~yC~~~~~~~~~~~C~~CDa~~p~~~Hp~~~l~D~~~~~~~TwWQS~~~ 81 (238)
T smart00136 2 GRPRSCYPPFVNLAFGREVTATSTCGEPGPERYCKLVGHTEQGKKCDYCDARNPRRSHPAENLTDGNNPNNPTWWQSEPL 81 (238)
T ss_pred CCCccccCchhhhhcCCeeeEecCCCCCCCcceeEeccccCcCCcCCCCCCCCccccCCHHHhhccCCCCCCceecCCCc
Confidence 6789999999999999999999999998999999982 8999999999999999999999999999999999998
Q ss_pred ccCCcCCCCCCceeeEeecCceeEEEEEEeeeccCCCCCCceEEEeecCCCCCcccceeehhhhhcccCCCCcCccccCC
Q psy6778 106 TSVNSLSASPDNVTLTLSLGKKFELTYISLSFCPKSIKPDSLAIYKSQDFGKSWQPLQFYSSQCKKLYGRTATGVISRGN 185 (314)
Q Consensus 106 ~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~~~s~rP~~~~iekS~D~g~tw~p~~yya~~C~~~f~~~~~~~~~~~~ 185 (314)
. +.+++|||||||+|+|||+||+|+|+ |+||++|+|||| |+|++|+||||||.+|.+.|++++++.+.+.+
T Consensus 82 ~------~~~~~VtitLdL~k~fevtyi~l~F~--s~RPa~~i~erS-d~G~tW~p~qyfa~dC~~~f~~~~~~~~~~~n 152 (238)
T smart00136 82 S------NGPQNVNLTLDLGKEFHVTYVILKFC--SPRPSLWILERS-DFGKTWQPWQYFSSDCRRTFGRPPRGPITKGN 152 (238)
T ss_pred C------CCCccEEEEEecCCEEEEEEEEEEec--CCCCceEEEeec-CCCCCCcEeeeeecchhhhcCCCCCCCCCCCC
Confidence 3 33568999999999999999999999 999999999999 99999999999999999999999998888789
Q ss_pred CCCceecccCcc
Q psy6778 186 EQEALCTDRHKK 197 (314)
Q Consensus 186 ~~~~~Ct~~ys~ 197 (314)
+++++|++.|+.
T Consensus 153 ~~~v~Ct~~ys~ 164 (238)
T smart00136 153 EDEVICTSEYSD 164 (238)
T ss_pred CCccEECCCCCC
Confidence 999999999974
No 4
>KOG1836|consensus
Probab=100.00 E-value=8e-49 Score=431.03 Aligned_cols=299 Identities=37% Similarity=0.685 Sum_probs=252.0
Q ss_pred hhHHHHHHHHHHHhcccccCC--CCCCc-cCCCcccCCCcchhhccCCceeecCccCCCCCccccCCC-------CCccc
Q psy6778 4 LKLLCVPLLIGAVCGEKIDLN--SDPCY-ENGRPRRCIPDFVNAAFGQEVKVNSVCGLESPERYCDTS-------GACHV 73 (314)
Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~--~~~c~-~~g~~~~C~P~~~nla~g~~v~assTCG~~~~e~yC~~~-------~~C~~ 73 (314)
|.||.++++..+........| ..+|| ..+++++|+|+|.|+|+...++++.|||+.++ .||.+. ..|.+
T Consensus 6 l~ll~~~~a~~~~~~~~~~~p~~~~~~~~~~~r~q~~~pe~~~aa~~~~~~~~~~cg~~~~-~~c~~~~~~g~~~~~C~~ 84 (1705)
T KOG1836|consen 6 LGLLWALLASLAASGGQSLFPQDINECYDQAGRQQASCPEFANAAGNLLATETNTCGLKGG-GYCVPCNCNGHSNGYCVI 84 (1705)
T ss_pred hHHHHHHHhhhccccccccCccccccccccCCcccccccccccccCCcceeEeccccCCCC-CcCcccCCcCCCCCccee
Confidence 444444444444443333223 44799 57899999999999999999999999999666 999876 36999
Q ss_pred CCCCCCCCCCCCceecccCCCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEE-EEeeeccCCCCCCceEEEee
Q psy6778 74 CDAGSPRGRFPAEYLTDLNNPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTY-ISLSFCPKSIKPDSLAIYKS 152 (314)
Q Consensus 74 Cd~~~p~~~hp~~~~~D~~~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~-v~l~f~~~s~rP~~~~iekS 152 (314)
|+++++...||+++|+|++. ..|||||+++.+ ....+.|++++++++.|++.+ +++.|- ||||++|+||||
T Consensus 85 C~~ag~~~~h~~~~~id~~~--~~twwQS~~l~~----g~q~~~~~~~~~~~g~~~~i~~~~~~~~--sprp~s~~l~k~ 156 (1705)
T KOG1836|consen 85 CNAAGEHCEHPASNLIDGAV--GETWWQSPPLPE----GHQYNFVTNLTDLLGKFRIITVVRKKAN--SPRPESFGLYKS 156 (1705)
T ss_pred CCCCCcccccCHhhhccCCc--CCccccCCCCCc----ccccchhhhhhhcCCeEEEEEeeeeccC--CCCCccceeecc
Confidence 99999999999999999986 789999999851 123567999888888877665 555555 999999999999
Q ss_pred cCCCCCcccceeehhhhhcccCCCCcCccccC-CCCCceecccCc-----------------------------------
Q psy6778 153 QDFGKSWQPLQFYSSQCKKLYGRTATGVISRG-NEQEALCTDRHK----------------------------------- 196 (314)
Q Consensus 153 ~D~g~tw~p~~yya~~C~~~f~~~~~~~~~~~-~~~~~~Ct~~ys----------------------------------- 196 (314)
+|.+.+|.|||||+.+|...+..+....++.. +++.++||+.||
T Consensus 157 ~~~~g~~~p~q~~s~s~~~~~~~~~~~~l~~~~~d~~~~Ct~e~S~i~PL~ng~i~~s~l~grpsa~~f~~S~~L~ef~t 236 (1705)
T KOG1836|consen 157 TDEDGPWCPWQFCSGSCRDCLSRPDRQALRGGENDDAAICTTEFSDISPLENGEIAFSTLEGRPSAYNFETSPELQEFTT 236 (1705)
T ss_pred cccCCCcCCceeecccCccccccCCcccccCCCCCccceeccccccCcccccchhhhhhhcCCCcccccCCCHHHHhhhc
Confidence 99999999999999999999888877777664 789999999887
Q ss_pred ----------------------------------------ccccCCCCCCCCCCCccCCCCce-eeeCCCCCCCCCCccC
Q psy6778 197 ----------------------------------------KQGGRCKCNGHASRCIKSPQGEL-VCECRHNTAGKDCEKC 235 (314)
Q Consensus 197 ----------------------------------------~v~g~C~CnGHa~~C~~~~~g~~-~C~C~~nT~G~~Ce~C 235 (314)
.|++||.|||||+.|+.+..... +|+|+|||.|..||+|
T Consensus 237 at~ir~~l~r~~T~~~~l~~~~~~~dp~v~~~yyyaisdi~vggrc~cnGha~~c~~~~~~~~lvc~c~hNT~g~~ce~c 316 (1705)
T KOG1836|consen 237 ATNIRIRLQRLNTFGLHLMSDEIFKDPKVTRSYYYAISDISVGGRCKCNGHASECDESEPQNLLVCRCQHNTCGLDCERC 316 (1705)
T ss_pred cchhhhhhhhhcccchhhcchhhhcCCceeeeeeeeHhhccccceEEEccchhhcCcccCCccceeeccccccccccccc
Confidence 78999999999999997643344 9999999999999999
Q ss_pred CCCCCCCCCCCCcccCCCCccCCcccCCCCCCCcccC-------CCCccccCCCCCCCCCCCCCCCCCCCCCCCC-CCCC
Q psy6778 236 KPFFSDRPWGRATVYDANECKGGRCKCNGHASRCIKS-------PQGALVCECRHNTAGKDCEKCKPFFSDRPWG-RATV 307 (314)
Q Consensus 236 ~pgy~~~p~~~~t~~~~~~C~~~~C~Cn~h~~~C~~~-------~~~G~C~~C~~nt~G~~Ce~C~~Gyy~~p~~-~~~~ 307 (314)
.|+|+++||+++|...+++|+ +|+|++++..|.|| ..+|+|.+|++||+|.|||+|..||||.+.. ...+
T Consensus 317 ~p~fn~rpw~~~ts~~~~Ec~--~cnc~g~S~ec~~d~~l~r~~~~gg~c~~c~entag~~CerC~~~f~R~~~~~~~~~ 394 (1705)
T KOG1836|consen 317 LPFFNDRPWARATSQTANECE--ACNCNGRSEECYFDRELDRRTGGGGHCLDCRENTAGVHCERCLLGFYRSRQVTEPNP 394 (1705)
T ss_pred ccchhcchhhhhhhcccCcee--eccCCCchHhhhhcHHHHHhhcCCccccCccccccCcchhhccccccccCCCCCCCc
Confidence 999999999999999999999 99999999999998 5689999999999999999999999997632 2345
Q ss_pred CCCCCC
Q psy6778 308 YDANEC 313 (314)
Q Consensus 308 ~~~~~C 313 (314)
++++.|
T Consensus 395 c~~C~c 400 (1705)
T KOG1836|consen 395 CRPCIC 400 (1705)
T ss_pred CcccCC
Confidence 555554
No 5
>PF00055 Laminin_N: Laminin N-terminal (Domain VI); InterPro: IPR008211 Laminin is a large molecular weight glycoprotein present only in basement membranes in almost every animal tissue. Laminin is thought to mediate the attachment, migration and organisation of cells into tissues during embryonic development by interacting with other extracellular matrix components []. Each laminin is a heterotrimer assembled from alpha, beta and gamma chain subunits, secreted and incorporated into cell-associated extracellular matrices []. Basement membrane assembly is a cooperative process in which laminins polymerise through their N-terminal domain (LN or domain VI) and anchor to the cell surface through their G domains. Netrins may also associate with this network through heterotypic LN domain interactions []. This leads to cell signalling through integrins and dystroglycan (and possibly other receptors) recruited to the adherent laminin. This LN domain dependent self-assembly is considered to be crucial for the integrity of basement membranes, as highlighted by genetic forms of muscular dystrophy containing the deletion of the LN module from the alpha 2 laminin chain []. The laminin N-terminal domain is found in all laminin and netrin subunits except laminin alpha 3A, alpha 4 and gamma 2. ; PDB: 3TBD_A 3ZYG_B 3ZYI_B 3ZYJ_D 2Y38_A.
Probab=100.00 E-value=2.8e-48 Score=357.54 Aligned_cols=154 Identities=45% Similarity=0.913 Sum_probs=111.2
Q ss_pred CCCcchhhccCCceeecCccCCCCCccccCCC------CCcccCCCCCCC--CCCCCceecccCCCCCceeeccCCcccC
Q psy6778 37 CIPDFVNAAFGQEVKVNSVCGLESPERYCDTS------GACHVCDAGSPR--GRFPAEYLTDLNNPSNVTCWRSEAQTSV 108 (314)
Q Consensus 37 C~P~~~nla~g~~v~assTCG~~~~e~yC~~~------~~C~~Cd~~~p~--~~hp~~~~~D~~~~~~~twWqS~~~~~~ 108 (314)
|+|+|+|||+||+|+||||||+++||+||++. .+|++||+++|. ++||+++|+|++++.+.|||||+++. .
T Consensus 1 C~P~~~nla~gr~v~assTCG~~~pe~yC~~~~~~~~~~~C~~Cda~~~~~~~~Hp~~~~~D~~~~~~~TwWQS~~~~-~ 79 (237)
T PF00055_consen 1 CYPPFGNLAFGREVTASSTCGEPGPERYCRLSGSQGRQRQCFICDASDPDPRRSHPPSYLTDSHNPNNSTWWQSETLQ-N 79 (237)
T ss_dssp -----EETTTT-EEECCCBS-SS--EEE--SSSTTT---E-EEE-TTSTSTTT---GGGGCSSSCTTB---EEB--ST-T
T ss_pred CCCCcchhhcCCEeEEEcCCCCCCCccceeccCccccCcccccCCCcccccccccChhhcccccccccCceecCCccC-C
Confidence 89999999999999999999999999999986 469999999876 99999999999998889999999763 1
Q ss_pred CcCCCCCCceeeEeecCceeEEEEEEeeeccCCCCCCceEEEeecCCCCCcccceeehhhhhcccCCCCcCcccc-CCCC
Q psy6778 109 NSLSASPDNVTLTLSLGKKFELTYISLSFCPKSIKPDSLAIYKSQDFGKSWQPLQFYSSQCKKLYGRTATGVISR-GNEQ 187 (314)
Q Consensus 109 ~~~~~~~~~vtitl~l~~~f~~~~v~l~f~~~s~rP~~~~iekS~D~g~tw~p~~yya~~C~~~f~~~~~~~~~~-~~~~ 187 (314)
| ...++|||||||+++|+|+||+|+|+ ||||++|+||||+|+| +|+||||||.+|.+.|++++.+.... .+++
T Consensus 80 g---~~~~~VtitLdL~~~f~v~~v~l~F~--spRP~~miierS~D~g-tW~p~qYfA~~C~~~f~~~~~~~~~~~~~~~ 153 (237)
T PF00055_consen 80 G---VQYENVTITLDLGKEFEVTYVILQFC--SPRPAAMIIERSSDFG-TWQPWQYFASDCSRTFGMPPNGPPPKFANPD 153 (237)
T ss_dssp T---TSTT-EEEEEEEEEEEEEEEEEEEES--S---SEEEEEEESSTT-SEEEEEEEESSHHHHCS--BTTGGG-STSTT
T ss_pred C---CcCcceEEEEcccceEEEEEEEEEEc--CCCCCeEEEEEccCCC-ceeEeHHhhcCChhhcCCcccccccccCCCC
Confidence 1 12469999999999999999999999 9999999999999999 99999999999999999998877664 6789
Q ss_pred CceecccCcc
Q psy6778 188 EALCTDRHKK 197 (314)
Q Consensus 188 ~~~Ct~~ys~ 197 (314)
+|+|++.|+.
T Consensus 154 ~v~Ct~~yS~ 163 (237)
T PF00055_consen 154 EVICTSRYSD 163 (237)
T ss_dssp ---EESTTTT
T ss_pred cCeecCcCCC
Confidence 9999999973
No 6
>KOG0994|consensus
Probab=99.73 E-value=4.5e-18 Score=178.54 Aligned_cols=112 Identities=33% Similarity=0.798 Sum_probs=91.4
Q ss_pred ccccCCCCCCCCCCCccCC-----C---Cceee-eCCCCCCCCCCccCCCCCCCCCCCCCcccCCCCccCCcccCCCCCC
Q psy6778 197 KQGGRCKCNGHASRCIKSP-----Q---GELVC-ECRHNTAGKDCEKCKPFFSDRPWGRATVYDANECKGGRCKCNGHAS 267 (314)
Q Consensus 197 ~v~g~C~CnGHa~~C~~~~-----~---g~~~C-~C~~nT~G~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~C~Cn~h~~ 267 (314)
.-+.+|+|||||.+|.+|. + ..|+| +|+|||.|.+||+|+|+||++|-+..+ .+..|+ ||+|+..++
T Consensus 334 neCrkC~CNgHa~sCHFD~aV~~ASG~vSGGVCDdCqHNT~G~~CE~CkP~fYRdprr~i~--~p~vC~--pC~CdP~GS 409 (1758)
T KOG0994|consen 334 NECRKCECNGHADTCHFDMAVYEASGNVSGGVCDDCQHNTEGQNCERCKPFFYRDPRRDIS--DPDVCK--PCECDPAGS 409 (1758)
T ss_pred ccccccCCCCCcccccccHHHHhhcCCcccccCccccccccccchhhcCcccccCCCCCCC--Cccccc--cccCCCCcC
Confidence 5689999999999999873 2 24799 799999999999999999999976554 788999 999997665
Q ss_pred ----Ccc--cCCC----CccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy6778 268 ----RCI--KSPQ----GALVCECRHNTAGKDCEKCKPFFSDRPWGRATVYDANEC 313 (314)
Q Consensus 268 ----~C~--~~~~----~G~C~~C~~nt~G~~Ce~C~~Gyy~~p~~~~~~~~~~~C 313 (314)
.|. .|.+ .|+| .||.|++|++|++|++|||+.......-|.++.|
T Consensus 410 ~~~g~cds~~Dp~~GlvaGqC-~CK~~V~G~RCd~Ck~Gywgl~~~dp~GC~~C~C 464 (1758)
T KOG0994|consen 410 QDGGICDSFCDPSTGLVAGQC-RCKEHVAGRRCDRCKDGYWGLTSADPYGCRPCDC 464 (1758)
T ss_pred cCCCccccccCcccccccccc-ccccCcCccccchhccCcccCccCCCCCcccccc
Confidence 231 1233 4999 7999999999999999999988666555655555
No 7
>KOG3512|consensus
Probab=99.71 E-value=7.7e-18 Score=164.68 Aligned_cols=98 Identities=32% Similarity=0.724 Sum_probs=84.2
Q ss_pred ccccCCCCCCCCCCCccCC-----C---Cceee-eCCCCCCCCCCccCCCCCCCCCCCCCcccCCCCccCCcccCCCCCC
Q psy6778 197 KQGGRCKCNGHASRCIKSP-----Q---GELVC-ECRHNTAGKDCEKCKPFFSDRPWGRATVYDANECKGGRCKCNGHAS 267 (314)
Q Consensus 197 ~v~g~C~CnGHa~~C~~~~-----~---g~~~C-~C~~nT~G~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~C~Cn~h~~ 267 (314)
.-+..|+||+||++|.++. + ..++| +|+|||+|+||+.|++|||+++.++.+ +.+.|+ .|+||..++
T Consensus 329 ~~c~ac~Cn~harrcrfn~Ely~lSgr~SggvClnCrHnTaGrhChyCreGyyRd~s~pl~--hrkaCk--~CdChpVGs 404 (592)
T KOG3512|consen 329 NECVACNCNGHARRCRFNMELYRLSGRRSGGVCLNCRHNTAGRHCHYCREGYYRDGSKPLT--HRKACK--ACDCHPVGS 404 (592)
T ss_pred ccccccccchhhhhcccchhhhcccCccccceEeecccCCCCcccccccCccccCCCCCCc--hhhhhh--hcCCccccc
Confidence 5688999999999998763 1 13689 899999999999999999999876655 678999 999998765
Q ss_pred ---CcccCCCCccccCCCCCCCCCCCCCCCCCCCCCC
Q psy6778 268 ---RCIKSPQGALVCECRHNTAGKDCEKCKPFFSDRP 301 (314)
Q Consensus 268 ---~C~~~~~~G~C~~C~~nt~G~~Ce~C~~Gyy~~p 301 (314)
+|. +.+|+| .|+++++|..|++|++|||..-
T Consensus 405 ~gktCN--q~tGqC-pCkeGvtG~tCnrCa~gyqqsr 438 (592)
T KOG3512|consen 405 AGKTCN--QTTGQC-PCKEGVTGLTCNRCAPGYQQSR 438 (592)
T ss_pred cccccc--ccCCcc-cCCCCCcccccccccchhhccc
Confidence 564 679999 7999999999999999999743
No 8
>KOG1836|consensus
Probab=99.40 E-value=3.5e-13 Score=150.58 Aligned_cols=107 Identities=26% Similarity=0.583 Sum_probs=88.1
Q ss_pred ccccCCCCCCCCCCCccCCCCceeeeCCCCCCCCCCccCCCCCCCCCCCCCcccCCCCccCCcccCCCCCCCcc--cCCC
Q psy6778 197 KQGGRCKCNGHASRCIKSPQGELVCECRHNTAGKDCEKCKPFFSDRPWGRATVYDANECKGGRCKCNGHASRCI--KSPQ 274 (314)
Q Consensus 197 ~v~g~C~CnGHa~~C~~~~~g~~~C~C~~nT~G~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~C~Cn~h~~~C~--~~~~ 274 (314)
..+..|.||||+..|++. +|+|.|+|||.|++|++|++|||+.|-. +..+.|+ +|.|+.- ..|. .+..
T Consensus 726 ~~c~~C~cngh~~~Cd~~---tG~C~C~~~t~G~~C~~C~~GfYg~~~~----~~~~dC~--~C~Cp~~-~~~~~~~~~~ 795 (1705)
T KOG1836|consen 726 CPCIPCDCNGHSNICDPR---TGQCKCKHNTFGGQCAQCVDGFYGLPDL----GTSGDCQ--PCPCPNG-GACGQTPEIL 795 (1705)
T ss_pred CcccccccCCccccccCC---CCceecccCCCCCchhhhcCCCCCcccc----CCCCCCc--cCCCCCC-hhhcCcCccc
Confidence 457789999999999964 4699999999999999999999999843 3445599 9999865 3331 1245
Q ss_pred CccccCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCC
Q psy6778 275 GALVCECRHNTAGKDCEKCKPFFSDRPWGRA---TVYDANEC 313 (314)
Q Consensus 275 ~G~C~~C~~nt~G~~Ce~C~~Gyy~~p~~~~---~~~~~~~C 313 (314)
.++|-+|+.+++|.+|+.|..|||++|..+. ..++.+.|
T Consensus 796 ~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c~~c~c 837 (1705)
T KOG1836|consen 796 EVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPCQSCQC 837 (1705)
T ss_pred ceecCCCCCCCcccccccCCCccccCCCCCCCCcccCcccee
Confidence 8999889999999999999999999998887 56666555
No 9
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=98.71 E-value=2.1e-08 Score=70.96 Aligned_cols=42 Identities=40% Similarity=0.980 Sum_probs=37.5
Q ss_pred CCCCCCCCC---CCccCCCCceeeeCCCCCCCCCCccCCCCCCCCCCC
Q psy6778 201 RCKCNGHAS---RCIKSPQGELVCECRHNTAGKDCEKCKPFFSDRPWG 245 (314)
Q Consensus 201 ~C~CnGHa~---~C~~~~~g~~~C~C~~nT~G~~Ce~C~pgy~~~p~~ 245 (314)
+|.|++|++ .|++. .++|.|+++++|++|++|+++||+.||.
T Consensus 1 ~C~C~~~g~~~~~C~~~---~G~C~C~~~~~G~~C~~C~~g~~~~~~~ 45 (50)
T cd00055 1 PCDCNGHGSLSGQCDPG---TGQCECKPNTTGRRCDRCAPGYYGLPSQ 45 (50)
T ss_pred CCcCcCCCCCCccccCC---CCEEeCCCcCCCCCCCCCCCCCccCCCC
Confidence 589999998 79863 4699999999999999999999999874
No 10
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=98.59 E-value=6.8e-08 Score=68.34 Aligned_cols=42 Identities=40% Similarity=0.973 Sum_probs=37.3
Q ss_pred cccCCCCCC---CcccCCCCccccCCCCCCCCCCCCCCCCCCCCCCCC
Q psy6778 259 RCKCNGHAS---RCIKSPQGALVCECRHNTAGKDCEKCKPFFSDRPWG 303 (314)
Q Consensus 259 ~C~Cn~h~~---~C~~~~~~G~C~~C~~nt~G~~Ce~C~~Gyy~~p~~ 303 (314)
+|+|+++++ .|. ..+|+| .|+++++|++|++|++|||+.|..
T Consensus 1 ~C~C~~~g~~~~~C~--~~~G~C-~C~~~~~G~~C~~C~~g~~~~~~~ 45 (50)
T cd00055 1 PCDCNGHGSLSGQCD--PGTGQC-ECKPNTTGRRCDRCAPGYYGLPSQ 45 (50)
T ss_pred CCcCcCCCCCCcccc--CCCCEE-eCCCcCCCCCCCCCCCCCccCCCC
Confidence 588999888 686 458999 599999999999999999999875
No 11
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=98.50 E-value=4.4e-08 Score=68.82 Aligned_cols=41 Identities=34% Similarity=0.837 Sum_probs=34.4
Q ss_pred CCCCCCC---CCCccCCCCceeeeCCCCCCCCCCccCCCCCCCCCCC
Q psy6778 202 CKCNGHA---SRCIKSPQGELVCECRHNTAGKDCEKCKPFFSDRPWG 245 (314)
Q Consensus 202 C~CnGHa---~~C~~~~~g~~~C~C~~nT~G~~Ce~C~pgy~~~p~~ 245 (314)
|.|++|+ ..|++ ..++|.|++|++|++|++|+++||+.|+.
T Consensus 1 C~C~~~~~~~~~C~~---~~G~C~C~~~~~G~~C~~C~~g~~~~~~~ 44 (49)
T PF00053_consen 1 CDCNPHGSSSQTCDP---STGQCVCKPGTTGPRCDQCKPGYFGLPSD 44 (49)
T ss_dssp ESSTTCCBCCSSEEE---TCEEESBSTTEESTTS-EE-TTEECSTTS
T ss_pred CcCcCCCCCCCcccC---CCCEEeccccccCCcCcCCCCccccccCC
Confidence 7899998 78986 35699999999999999999999999863
No 12
>KOG1388|consensus
Probab=98.43 E-value=1.1e-07 Score=85.87 Aligned_cols=89 Identities=35% Similarity=0.805 Sum_probs=70.5
Q ss_pred cccCCCCCCCCCCCccCCCCceee-eCCCCCCCCCCccCCCCCCCCCCCCCcccCCCCccCCcccCCCCCCCcccCCCCc
Q psy6778 198 QGGRCKCNGHASRCIKSPQGELVC-ECRHNTAGKDCEKCKPFFSDRPWGRATVYDANECKGGRCKCNGHASRCIKSPQGA 276 (314)
Q Consensus 198 v~g~C~CnGHa~~C~~~~~g~~~C-~C~~nT~G~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~C~Cn~h~~~C~~~~~~G 276 (314)
-+..|.|||| +.|.. .-+| .|++.|.|.+|++|+.|||++ .+...|+ ||+|++++..|. ...+
T Consensus 48 ~cP~~~cNGh-~~c~t----~~v~~~~~N~~~g~~c~kc~~g~~Gd-------tN~g~c~--~~~~~g~~~~~~--~~~~ 111 (217)
T KOG1388|consen 48 FCPLCQCNGH-SDCNT----QHVCWRCENGTTGAHCEKCIVGFYGD-------TNGGKCQ--PCDCNGGASACV--TLTG 111 (217)
T ss_pred cChHHHhcCC-CCccc----ceeeeeccCccccccCCceEEEEEec-------CCCCccC--HhhhcCCeeeee--ccCC
Confidence 3458999999 55541 3478 799999999999999999985 1667999 999999999987 4589
Q ss_pred cccCCC-CCCCCCCCCCCCCC--CCCCCCC
Q psy6778 277 LVCECR-HNTAGKDCEKCKPF--FSDRPWG 303 (314)
Q Consensus 277 ~C~~C~-~nt~G~~Ce~C~~G--yy~~p~~ 303 (314)
+| .|+ ..+.|++|++|..- |-+.|.+
T Consensus 112 ~c-~c~~kgvvgd~c~~~e~~N~~r~~~~k 140 (217)
T KOG1388|consen 112 KC-FCTTKGVVGDLCPKCEVPNRYRGKPLK 140 (217)
T ss_pred cc-ccccceEecccCcccccccccccCccc
Confidence 99 685 78899999998653 4445443
No 13
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=98.40 E-value=7.2e-08 Score=67.76 Aligned_cols=43 Identities=30% Similarity=0.747 Sum_probs=34.6
Q ss_pred ccCCCCC---CCcccCCCCccccCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy6778 260 CKCNGHA---SRCIKSPQGALVCECRHNTAGKDCEKCKPFFSDRPWGRA 305 (314)
Q Consensus 260 C~Cn~h~---~~C~~~~~~G~C~~C~~nt~G~~Ce~C~~Gyy~~p~~~~ 305 (314)
|+|++++ ..|.. .+|+|. |++|++|++|++|++|||++|..+.
T Consensus 1 C~C~~~~~~~~~C~~--~~G~C~-C~~~~~G~~C~~C~~g~~~~~~~~~ 46 (49)
T PF00053_consen 1 CDCNPHGSSSQTCDP--STGQCV-CKPGTTGPRCDQCKPGYFGLPSDPP 46 (49)
T ss_dssp ESSTTCCBCCSSEEE--TCEEES-BSTTEESTTS-EE-TTEECSTTSGC
T ss_pred CcCcCCCCCCCcccC--CCCEEe-ccccccCCcCcCCCCccccccCCCC
Confidence 6788887 67764 689995 9999999999999999999987443
No 14
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=98.38 E-value=3.9e-07 Score=63.48 Aligned_cols=37 Identities=35% Similarity=1.004 Sum_probs=32.7
Q ss_pred CCCC--CCC-CCCccCCCCceeeeCCCCCCCCCCccCCCCCCC
Q psy6778 202 CKCN--GHA-SRCIKSPQGELVCECRHNTAGKDCEKCKPFFSD 241 (314)
Q Consensus 202 C~Cn--GHa-~~C~~~~~g~~~C~C~~nT~G~~Ce~C~pgy~~ 241 (314)
|.|+ |++ ..|++. .++|.|+++++|++||+|++|||+
T Consensus 1 C~C~~~G~~~~~C~~~---~G~C~C~~~~~G~~C~~C~~g~~g 40 (46)
T smart00180 1 CDCDPGGSASGTCDPD---TGQCECKPNVTGRRCDRCAPGYYG 40 (46)
T ss_pred CcCCCCCCCCCcccCC---CCEEECCCCCCCCCCCcCCCCcCC
Confidence 7888 887 689864 459999999999999999999998
No 15
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=98.18 E-value=1.5e-06 Score=60.58 Aligned_cols=37 Identities=35% Similarity=1.013 Sum_probs=30.9
Q ss_pred ccCC--CCC-CCcccCCCCccccCCCCCCCCCCCCCCCCCCCC
Q psy6778 260 CKCN--GHA-SRCIKSPQGALVCECRHNTAGKDCEKCKPFFSD 299 (314)
Q Consensus 260 C~Cn--~h~-~~C~~~~~~G~C~~C~~nt~G~~Ce~C~~Gyy~ 299 (314)
|+|+ ++. ..|. ..+|+| .|+++++|++|++|++|||+
T Consensus 1 C~C~~~G~~~~~C~--~~~G~C-~C~~~~~G~~C~~C~~g~~g 40 (46)
T smart00180 1 CDCDPGGSASGTCD--PDTGQC-ECKPNVTGRRCDRCAPGYYG 40 (46)
T ss_pred CcCCCCCCCCCccc--CCCCEE-ECCCCCCCCCCCcCCCCcCC
Confidence 6777 554 5675 348999 59999999999999999999
No 16
>KOG3509|consensus
Probab=98.03 E-value=1.1e-05 Score=87.00 Aligned_cols=98 Identities=28% Similarity=0.531 Sum_probs=74.0
Q ss_pred ccccCCCCCCCCCCCccCCCCceee-eCCCCCCCCCCccCCCCCCCCCCCCCcccCCCCccCCc-------ccCCCCCCC
Q psy6778 197 KQGGRCKCNGHASRCIKSPQGELVC-ECRHNTAGKDCEKCKPFFSDRPWGRATVYDANECKGGR-------CKCNGHASR 268 (314)
Q Consensus 197 ~v~g~C~CnGHa~~C~~~~~g~~~C-~C~~nT~G~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~-------C~Cn~h~~~ 268 (314)
...-.|.||+|++.|..+ .+.| +|+|||.|.+|+.|.+|||+++ +.+....|+ + +.++.+...
T Consensus 749 ~~~~~c~~~~h~~~c~~~---~~~nt~~q~~~~~~~~~~~~~g~~~da----~~g~~~D~~--p~~~l~~~~~~~~r~~l 819 (964)
T KOG3509|consen 749 GLCEDCECNSHISQCEDD---LGYNTDCQNNTEGDRCELCSPGTYGDA----RRGTPEDCR--PATALTIQCSCNNRSPL 819 (964)
T ss_pred ccCcccccCCCccccccc---ccccccccccCccceeeecCCCccccC----ccCCcccCC--ccchhhhhhhhcccCcc
Confidence 667899999999999864 3488 8999999999999999999986 233444555 4 333332221
Q ss_pred cccCCCCccccCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy6778 269 CIKSPQGALVCECRHNTAGKDCEKCKPFFSDRPWGR 304 (314)
Q Consensus 269 C~~~~~~G~C~~C~~nt~G~~Ce~C~~Gyy~~p~~~ 304 (314)
- .+...-.|..|.+|++|.+|++++.+|++++...
T Consensus 820 ~-~~~~~~~~~~~p~~~~g~~~~~~~~~~~~~atdg 854 (964)
T KOG3509|consen 820 S-CDGFGPGCLLCPHNTEGTTCERVKAGYYGFATDG 854 (964)
T ss_pred c-cccCCCCcccCCCCccccchhhhccccccccCcC
Confidence 1 1123457889999999999999999999988654
No 17
>KOG3509|consensus
Probab=97.71 E-value=4.9e-05 Score=81.98 Aligned_cols=101 Identities=25% Similarity=0.495 Sum_probs=77.8
Q ss_pred CCCCCCCCCCCccCCCCceeeeCCCCCCCCCCccCCCCCCCCCCCCCcccCCCCccCCcccCCCCCCCcccCCCCccccC
Q psy6778 201 RCKCNGHASRCIKSPQGELVCECRHNTAGKDCEKCKPFFSDRPWGRATVYDANECKGGRCKCNGHASRCIKSPQGALVCE 280 (314)
Q Consensus 201 ~C~CnGHa~~C~~~~~g~~~C~C~~nT~G~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~C~Cn~h~~~C~~~~~~G~C~~ 280 (314)
-|..+||...++. -..+|.|+..+.|.+||.|.++|--. .. .......|. .|.|+.|+..|.. ..+.|..
T Consensus 703 ~~~~~g~~~~~~~---~~~~C~c~~g~~G~~ce~c~e~~~ls-~t--~~~~~~~~~--~c~~~~h~~~c~~--~~~~nt~ 772 (964)
T KOG3509|consen 703 LCYGGGKTDIIAA---EVEQCQCPKGLVGTSCEDCAEGYTLS-TT--GGLYPGLCE--DCECNSHISQCED--DLGYNTD 772 (964)
T ss_pred cccCCCCCchhhh---hccccccCccccCccccccccccccc-cc--CCcCcccCc--ccccCCCcccccc--ccccccc
Confidence 5666666544442 23489999999999999999999732 11 123556777 8999999999984 4789999
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy6778 281 CRHNTAGKDCEKCKPFFSDRPWGRATVYDANE 312 (314)
Q Consensus 281 C~~nt~G~~Ce~C~~Gyy~~p~~~~~~~~~~~ 312 (314)
|++|++|.+|+.|.+|||+++.. ++..|++.
T Consensus 773 ~q~~~~~~~~~~~~~g~~~da~~-g~~~D~~p 803 (964)
T KOG3509|consen 773 CQNNTEGDRCELCSPGTYGDARR-GTPEDCRP 803 (964)
T ss_pred ccccCccceeeecCCCccccCcc-CCcccCCc
Confidence 99999999999999999999863 55555543
No 18
>PF00754 F5_F8_type_C: F5/8 type C domain; InterPro: IPR000421 Blood coagulation factors V and VIII contain a C-terminal, twice repeated, domain of about 150 amino acids, which is called F5/8 type C, FA58C, or C1/C2- like domain. In the Dictyostelium discoideum (Slime mold) cell adhesion protein discoidin, a related domain, named discoidin I-like domain, DLD, or DS, has been found which shares a common C-terminal region of about 110 amino acids with the FA58C domain, but whose N-terminal 40 amino acids are much less conserved. Similar domains have been detected in other extracellular and membrane proteins [, , ] In coagulation factors V and VIII the repeated domains compose part of a larger functional domain which promotes binding to anionic phospholipids on the surface of platelets and endothelial cells []. The C-terminal domain of the second FA58C repeat (C2) of coagulation factor VIII has been shown to be responsible for phosphatidylserine-binding and essential for activity [, ]. It forms an amphipathic alpha-helix, which binds to the membrane []. FA58C contains two conserved cysteines in most proteins, which link the extremities of the domain by a disulphide bond [, , ]. A further disulphide bond is located near the C-terminal of the second FA58C domain in MFGM Q08431 from SWISSPROT []. +------------------------------------------------------------------------+ | +-+ | | | | | CxPLGxxQITASxxxxxRLxxxWxxxxWxxxxxxQGxxxxxxxxxxxxGNxxxxxxxxxxRxPxcxcLRxExGC 'C': conserved cysteine involved in a disulphide bond. 'c': cysteine involved in a disulphide bond in MFGM Q08431 from SWISSPROT. 'x': any amino acid. upper case letters: conserved residues. ; GO: 0007155 cell adhesion; PDB: 2J22_A 3CQO_C 1SDD_B 2VCA_A 2VC9_A 2VCC_A 2VCB_A 2J1E_A 2J7M_A 2J1A_A ....
Probab=97.27 E-value=0.00058 Score=54.85 Aligned_cols=68 Identities=28% Similarity=0.458 Sum_probs=55.6
Q ss_pred CCceecccCCCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEeeeccCCC--CCCceEEEeecCCCCCccc
Q psy6778 84 PAEYLTDLNNPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLSFCPKSI--KPDSLAIYKSQDFGKSWQP 161 (314)
Q Consensus 84 p~~~~~D~~~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~~~s~--rP~~~~iekS~D~g~tw~p 161 (314)
++++++|++ ..|+|.+... .....|+|||++.+.|..|.|.+..... +|..+.|+-|.| |..|++
T Consensus 16 ~~~~~~Dg~---~~t~W~~~~~---------~~~~~i~idl~~~~~i~~i~i~~~~~~~~~~~~~~~i~~s~d-g~~w~~ 82 (129)
T PF00754_consen 16 PASNAFDGD---PSTAWCSNWD---------DSPQWIQIDLGKPYTISGISIQFRNDGNNGRPKSFKIEYSND-GSNWTT 82 (129)
T ss_dssp GGGGGGSSS---TTSSEEESSS---------SSTEEEEEEEEEEEEEEEEEEEEESSTTTEEEEEEEEEEESS-SSSEEE
T ss_pred hHHheEeCC---CCCEEECCCC---------CCCceEEEEeeeeEecceeeecccccccceeeeeeeeeeecc-cccccc
Confidence 689999988 5789999642 3467899999999999999999993221 899999999988 689987
Q ss_pred cee
Q psy6778 162 LQF 164 (314)
Q Consensus 162 ~~y 164 (314)
+.-
T Consensus 83 ~~~ 85 (129)
T PF00754_consen 83 VAS 85 (129)
T ss_dssp EEE
T ss_pred ccc
Confidence 654
No 19
>KOG4289|consensus
Probab=96.89 E-value=0.0017 Score=71.94 Aligned_cols=71 Identities=23% Similarity=0.597 Sum_probs=51.6
Q ss_pred eeeeCCCCCCCCCCc-----cCCCCCCCCCCCCC-----c--------------------ccCCCCccCCcccCCC--CC
Q psy6778 219 LVCECRHNTAGKDCE-----KCKPFFSDRPWGRA-----T--------------------VYDANECKGGRCKCNG--HA 266 (314)
Q Consensus 219 ~~C~C~~nT~G~~Ce-----~C~pgy~~~p~~~~-----t--------------------~~~~~~C~~~~C~Cn~--h~ 266 (314)
..|+|..++.|++|| .|-.|+||.|-=.. + ....+.|. +|+|.- ++
T Consensus 1739 Y~C~C~~g~~G~~Ce~~~dq~CPrGWWG~P~CgpC~CavsKgfdp~CnKt~G~CqCKe~hy~~~~~Cl--~CdC~~Gs~S 1816 (2531)
T KOG4289|consen 1739 YTCECPPGYTGPYCELRADQPCPRGWWGFPTCGPCNCAVSKGFDPDCNKTNGQCQCKENHYRPIGSCL--PCDCYFGSDS 1816 (2531)
T ss_pred eeEECCCcccCcchhhhccCCCCCcccCCCCccCccccccCCCCCCccccCcceeeccccccCCCcce--eeccccCCCc
Confidence 579999999999997 59899999873110 0 01223455 777762 22
Q ss_pred CCcccCCCCccccCCCCCCCCCCCCCCCC
Q psy6778 267 SRCIKSPQGALVCECRHNTAGKDCEKCKP 295 (314)
Q Consensus 267 ~~C~~~~~~G~C~~C~~nt~G~~Ce~C~~ 295 (314)
..|. ..|+| .|+.+..|++|++|..
T Consensus 1817 r~C~---adGqC-~C~pgaiGRqCdrCd~ 1841 (2531)
T KOG4289|consen 1817 RECD---ADGQC-PCKPGAIGRQCDRCDN 1841 (2531)
T ss_pred cccc---CCCcC-CCCCccccccccccCC
Confidence 3453 57999 8999999999999984
No 20
>cd00057 FA58C Substituted updates: Jan 31, 2002
Probab=96.84 E-value=0.0032 Score=52.25 Aligned_cols=68 Identities=24% Similarity=0.264 Sum_probs=53.7
Q ss_pred CCCCceecccCCCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEeeeccC---CCCCCceEEEeecCCCCC
Q psy6778 82 RFPAEYLTDLNNPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLSFCPK---SIKPDSLAIYKSQDFGKS 158 (314)
Q Consensus 82 ~hp~~~~~D~~~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~~~---s~rP~~~~iekS~D~g~t 158 (314)
.+.++.+ |++. .|.|.+... ...--|+|||++.+.|+.|.|+...+ ..++..+.|+-|.| |.+
T Consensus 23 ~~~~~~~-dg~~---~t~W~~~~~---------~~~~wi~vDL~~~~~i~~v~i~~~~~~~~~~~~~~~~i~~s~d-g~~ 88 (143)
T cd00057 23 GWEASRA-RLNS---DNAWTPAVN---------DPPQWLQVDLGKTRRVTGIQTQGRKGGGSSEWVTSYKVQYSLD-GET 88 (143)
T ss_pred CCCcCee-ecCC---CCcccCCCC---------CCCceEEEECCCCEEEEEEEEccCCCCCccCeeEEEEEEEECC-CCC
Confidence 3667788 8873 688988653 23457999999999999999987632 55788999999999 999
Q ss_pred cccce
Q psy6778 159 WQPLQ 163 (314)
Q Consensus 159 w~p~~ 163 (314)
|+++.
T Consensus 89 W~~~~ 93 (143)
T cd00057 89 WTTYK 93 (143)
T ss_pred EeEEE
Confidence 99754
No 21
>KOG1388|consensus
Probab=96.59 E-value=0.0021 Score=58.52 Aligned_cols=96 Identities=26% Similarity=0.498 Sum_probs=65.4
Q ss_pred CCCCCCCCCCc-cCCCCceeeeCCCCCCCCCCccCCCCCCCCCCCCCcccCCCCccCCcccCCCCCCCcccCCCCccccC
Q psy6778 202 CKCNGHASRCI-KSPQGELVCECRHNTAGKDCEKCKPFFSDRPWGRATVYDANECKGGRCKCNGHASRCIKSPQGALVCE 280 (314)
Q Consensus 202 C~CnGHa~~C~-~~~~g~~~C~C~~nT~G~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~C~Cn~h~~~C~~~~~~G~C~~ 280 (314)
|.=|.++.-|. ++.+|.++|...|++.=.+|-.+..-+. -|+ -..|- .|+||||+.-+. .-+|-.
T Consensus 3 c~~~P~cgw~a~~~~~G~grcm~g~~~~p~d~~~~~~~~~--~W~------fl~cP--~~~cNGh~~c~t----~~v~~~ 68 (217)
T KOG1388|consen 3 CLRNPACGWCADGSNTGDGRCMEGHYRGPLDCVQLEQPRE--IWR------FLFCP--LCQCNGHSDCNT----QHVCWR 68 (217)
T ss_pred cccCCCcccccCCCCCCCCeEEECccccccchhhccCccc--hhh------hhcCh--HHHhcCCCCccc----ceeeee
Confidence 33444555553 3346778898888887777744443332 232 23466 899999887643 457778
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy6778 281 CRHNTAGKDCEKCKPFFSDRPWGRATVYDANEC 313 (314)
Q Consensus 281 C~~nt~G~~Ce~C~~Gyy~~p~~~~~~~~~~~C 313 (314)
|+.++.|.+|++|+.|||+| ..+..+++++|
T Consensus 69 ~~N~~~g~~c~kc~~g~~Gd--tN~g~c~~~~~ 99 (217)
T KOG1388|consen 69 CENGTTGAHCEKCIVGFYGD--TNGGKCQPCDC 99 (217)
T ss_pred ccCccccccCCceEEEEEec--CCCCccCHhhh
Confidence 99999999999999999998 44555555544
No 22
>KOG4260|consensus
Probab=95.79 E-value=0.0092 Score=56.30 Aligned_cols=63 Identities=27% Similarity=0.551 Sum_probs=44.4
Q ss_pred eCCCCCCCCCCccCCCCCCCCCCCCCcccCCCCccCCcccCCCCCCCcccCCCCccccCCCCCCCCCCCCCCCCCCCCCC
Q psy6778 222 ECRHNTAGKDCEKCKPFFSDRPWGRATVYDANECKGGRCKCNGHASRCIKSPQGALVCECRHNTAGKDCEKCKPFFSDRP 301 (314)
Q Consensus 222 ~C~~nT~G~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~C~Cn~h~~~C~~~~~~G~C~~C~~nt~G~~Ce~C~~Gyy~~p 301 (314)
=|+.+|-|++|..|.-|-- + .|-+ -=.|+|-+++ .+.|.| .|..+++|+.|.+|.++||-..
T Consensus 131 CCp~gtyGpdCl~Cpggse-r-----------~C~G-nG~C~GdGsR----~GsGkC-kC~~GY~Gp~C~~Cg~eyfes~ 192 (350)
T KOG4260|consen 131 CCPDGTYGPDCLQCPGGSE-R-----------PCFG-NGSCHGDGSR----EGSGKC-KCETGYTGPLCRYCGIEYFESS 192 (350)
T ss_pred ccCCCCcCCccccCCCCCc-C-----------CcCC-CCcccCCCCC----CCCCcc-cccCCCCCccccccchHHHHhh
Confidence 4889999999999954432 1 1220 0123333333 567999 8999999999999999999754
Q ss_pred C
Q psy6778 302 W 302 (314)
Q Consensus 302 ~ 302 (314)
.
T Consensus 193 R 193 (350)
T KOG4260|consen 193 R 193 (350)
T ss_pred c
Confidence 3
No 23
>KOG1219|consensus
Probab=94.11 E-value=0.052 Score=63.31 Aligned_cols=31 Identities=32% Similarity=0.787 Sum_probs=25.7
Q ss_pred CCCCCCccCCCCceeeeCCCCCCCCCCccCC
Q psy6778 206 GHASRCIKSPQGELVCECRHNTAGKDCEKCK 236 (314)
Q Consensus 206 GHa~~C~~~~~g~~~C~C~~nT~G~~Ce~C~ 236 (314)
-|+..|.....|+.+|.|..-+.|.+||.=.
T Consensus 3873 qhgG~C~~~~~ggy~CkCpsqysG~~CEi~~ 3903 (4289)
T KOG1219|consen 3873 QHGGTCISQPKGGYKCKCPSQYSGNHCEIDL 3903 (4289)
T ss_pred cCCCEecCCCCCceEEeCcccccCccccccc
Confidence 5788898766677899999999999998643
No 24
>KOG4289|consensus
Probab=93.61 E-value=0.064 Score=60.19 Aligned_cols=71 Identities=28% Similarity=0.706 Sum_probs=45.5
Q ss_pred CCCccCCCCceeeeCCCCCCCCCCccCCCCCCCCCCCCCcccCCCCccCCcccCCCCCCCcccCCCCccccCCC-CCCCC
Q psy6778 209 SRCIKSPQGELVCECRHNTAGKDCEKCKPFFSDRPWGRATVYDANECKGGRCKCNGHASRCIKSPQGALVCECR-HNTAG 287 (314)
Q Consensus 209 ~~C~~~~~g~~~C~C~~nT~G~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~C~Cn~h~~~C~~~~~~G~C~~C~-~nt~G 287 (314)
++|... +|...|+|+..++|.|||.=. .+..|. |=-|- .+..|.....+|.|..|+ +.+++
T Consensus 1251 g~C~sr-EggYtCeCrpg~tGehCEvs~--------------~agrCv--pGvC~-nggtC~~~~nggf~c~Cp~ge~e~ 1312 (2531)
T KOG4289|consen 1251 GRCRSR-EGGYTCECRPGFTGEHCEVSA--------------RAGRCV--PGVCK-NGGTCVNLLNGGFCCHCPYGEFED 1312 (2531)
T ss_pred CceEEe-cCceeEEecCCccccceeeec--------------ccCccc--cceec-CCCEEeecCCCceeccCCCcccCC
Confidence 445533 567899999999999998532 223344 22222 355676556688888888 55688
Q ss_pred CCCCCCCCCC
Q psy6778 288 KDCEKCKPFF 297 (314)
Q Consensus 288 ~~Ce~C~~Gy 297 (314)
++|+.=...|
T Consensus 1313 prC~v~trSF 1322 (2531)
T KOG4289|consen 1313 PRCEVTTRSF 1322 (2531)
T ss_pred CceEEEeecc
Confidence 8887644433
No 25
>PF02012 BNR: BNR/Asp-box repeat; InterPro: IPR002860 Members of this entry contain multiple BNR (bacterial neuraminidase repeat) repeats or Asp-boxes. The repeats are short, however the repeats are never found closer than 40 residues together suggesting that the repeat is structurally longer. These repeats are found in a variety of non-homologous proteins, including bacterial ribonucleases, sulphite oxidases, reelin, netrins, sialidases, neuraminidases, some lipoprotein receptors, and a variety of glycosyl hydrolases [].; PDB: 2JKB_A 2VW0_A 2VW2_A 2VW1_A 2CN2_D 2CN3_B 2VK7_B 2VK5_A 2VK6_A 2BF6_A ....
Probab=92.25 E-value=0.094 Score=26.63 Aligned_cols=12 Identities=50% Similarity=0.938 Sum_probs=9.3
Q ss_pred EeecCCCCCccc
Q psy6778 150 YKSQDFGKSWQP 161 (314)
Q Consensus 150 ekS~D~g~tw~p 161 (314)
++|.|.|+||++
T Consensus 1 ~~S~D~G~TW~~ 12 (12)
T PF02012_consen 1 YYSTDGGKTWKK 12 (12)
T ss_dssp EEESSTTSS-EE
T ss_pred CEeCCCcccCcC
Confidence 479999999974
No 26
>KOG1225|consensus
Probab=91.91 E-value=0.3 Score=50.42 Aligned_cols=71 Identities=30% Similarity=0.658 Sum_probs=42.0
Q ss_pred eeeeCCCCCCCCCCcc--CCCCCCCCCCCCCcccCCCCccC------Ccc-------cCCCCCCCcccCCCCccccCCCC
Q psy6778 219 LVCECRHNTAGKDCEK--CKPFFSDRPWGRATVYDANECKG------GRC-------KCNGHASRCIKSPQGALVCECRH 283 (314)
Q Consensus 219 ~~C~C~~nT~G~~Ce~--C~pgy~~~p~~~~t~~~~~~C~~------~~C-------~Cn~h~~~C~~~~~~G~C~~C~~ 283 (314)
++|.|.++++|.+|+. |--.+.+ +..-..+.|.. ..| +|++|+ .|. .|.| .|-.
T Consensus 265 G~CIC~~Gf~G~dC~e~~Cp~~cs~-----~g~~~~g~CiC~~g~~G~dCs~~~cpadC~g~G-~Ci----~G~C-~C~~ 333 (525)
T KOG1225|consen 265 GRCICPPGFTGDDCDELVCPVDCSG-----GGVCVDGECICNPGYSGKDCSIRRCPADCSGHG-KCI----DGEC-LCDE 333 (525)
T ss_pred CeEeCCCCCcCCCCCcccCCcccCC-----CceecCCEeecCCCccccccccccCCccCCCCC-ccc----CCce-EeCC
Confidence 4888999999999998 5443221 00001112210 011 244333 222 5778 5889
Q ss_pred CCCCCCCCC--------------CCCCCCCC
Q psy6778 284 NTAGKDCEK--------------CKPFFSDR 300 (314)
Q Consensus 284 nt~G~~Ce~--------------C~~Gyy~~ 300 (314)
+++|.-|++ |+.||-+.
T Consensus 334 Gy~G~~C~~~~C~~~g~cv~gC~C~~Gw~G~ 364 (525)
T KOG1225|consen 334 GYTGELCIQRACSGGGQCVNGCKCKKGWRGP 364 (525)
T ss_pred CCcCCcccccccCCCceeccCceeccCccCC
Confidence 999999988 88887763
No 27
>cd08159 APC10-like APC10-like DOC1 domains in E3 ubiquitin ligases that mediate substrate ubiquitination. This family contains the single domain protein, APC10, a subunit of the anaphase-promoting complex (APC), as well as the DOC1 domain of multi-domain proteins present in E3 ubiquitin ligases. E3 ubiquitin ligases mediate substrate ubiquitination (or ubiquitylation), a component of the ubiquitin-26S proteasome pathway for selective proteolytic degradation. The APC, a multi-protein complex (or cyclosome), is a cell cycle-regulated, E3 ubiquitin ligase that controls important transitions in mitosis and the G1 phase by ubiquitinating regulatory proteins, thereby targeting them for degradation. APC10-like DOC1 domains such as those present in HECT (Homologous to the E6-AP Carboxyl Terminus) and Cullin-RING (Really Interesting New Gene) E3 ubiquitin ligase proteins, HECTD3, and CUL7, respectively, are also included in this hierarchy. CUL7 is a member of the Cullin-RING ligase family and f
Probab=91.68 E-value=0.46 Score=40.31 Aligned_cols=56 Identities=23% Similarity=0.322 Sum_probs=39.4
Q ss_pred CCceecccCCCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEeeecc--CCCCCCceEEEe
Q psy6778 84 PAEYLTDLNNPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLSFCP--KSIKPDSLAIYK 151 (314)
Q Consensus 84 p~~~~~D~~~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~~--~s~rP~~~~iek 151 (314)
.+..|.|++ ..|+|||+... | .--|+|.+.+...|..|.|.+.+ +|-.|.-+.|.-
T Consensus 15 gv~~L~D~~---~~tyWQSdG~q--------p-hh~i~l~f~k~v~i~~l~i~~~~~DeSYtP~~I~V~~ 72 (129)
T cd08159 15 PVSRLTDGN---YDTYWQSDGSQ--------G-SHWIRLFMKKDVLIRVLAIFVDMADSSYMPSLVVVYG 72 (129)
T ss_pred cHHHhcCCC---CCccCCCCCCC--------C-CEEEEEEEcCCcEEEEEEEEecCCCCCcCCcEEEEEe
Confidence 678899976 57999998641 3 23456667777778777666653 577888887744
No 28
>cd08666 APC10-HECTD3 APC10-like DOC1 domain of HECTD3, a HECT E3 ubiquitin ligase protein that mediates substrate ubiquitination. This model represents the APC10/DOC1 domain present in HECTD3, a HECT (Homologous to the E6-AP Carboxyl Terminus) E3 ubiquitin ligase protein. HECT E3 ubiquitin ligases mediate substrate ubiquitination (or ubiquitylation), and are a component of the ubiquitin-26S proteasome pathway for selective proteolytic degradation. They also regulate the trafficking of many receptors, channels, transporters and viral proteins. HECTD3 (HECT domain-containing protein3) contains a C-terminal HECT domain with the active site for ubiquitin transfer onto substrates, and an N-terminal APC10/DOC1 domain, which is responsible for substrate recognition and binding. HECTD3 specifically recognizes the Trio-binding protein, Tara (Trio-associated repeat on actin), implicated in regulating actin cytoskeletal, cell motility and cell growth. Tara also binds to TRF1 and may participate i
Probab=91.25 E-value=0.47 Score=40.53 Aligned_cols=56 Identities=23% Similarity=0.367 Sum_probs=36.0
Q ss_pred CCCceecccCCCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEee--eccCCCCCCceEEE
Q psy6778 83 FPAEYLTDLNNPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLS--FCPKSIKPDSLAIY 150 (314)
Q Consensus 83 hp~~~~~D~~~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~--f~~~s~rP~~~~ie 150 (314)
+.+..|.|++ ..|+|||+.. +.++.++|. +.+.-.|..|.|. +..+|..|.-+.|.
T Consensus 19 ~gv~~L~D~~---~~tyWQSdG~-------qgpH~I~l~--f~~~v~i~~l~i~v~~~DeSYtP~~I~V~ 76 (134)
T cd08666 19 FNVSCLTDGD---PDTYWESDGS-------QGQHWIRLH--MKKGTIIKKLLLTVDATDDNYMPKRVAVY 76 (134)
T ss_pred CCHHHhccCC---CCccEecCCC-------CCCeEEEEE--ECCCcEeeEEEEEecCCCCCcCCcEEEEE
Confidence 4677889977 5799999743 234555554 5555555554444 44458888888773
No 29
>KOG1219|consensus
Probab=91.25 E-value=0.27 Score=57.88 Aligned_cols=71 Identities=28% Similarity=0.638 Sum_probs=50.3
Q ss_pred CCCccCCCCceeeeCCCCCCCCCCccCCCCCCCCCCCCCcccCCCCccCCcccCCCCCCCcccCCCCccccCCCCCCCCC
Q psy6778 209 SRCIKSPQGELVCECRHNTAGKDCEKCKPFFSDRPWGRATVYDANECKGGRCKCNGHASRCIKSPQGALVCECRHNTAGK 288 (314)
Q Consensus 209 ~~C~~~~~g~~~C~C~~nT~G~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~C~Cn~h~~~C~~~~~~G~C~~C~~nt~G~ 288 (314)
.+|++.. +...|+|++.++|.+||.= | -++|. .=.| +++..|.--.+.-.| +|.+++.|+
T Consensus 3915 gtCip~~-n~f~CnC~~gyTG~~Ce~~--G-------------i~eCs--~n~C-~~gg~C~n~~gsf~C-ncT~g~~gr 3974 (4289)
T KOG1219|consen 3915 GTCIPFY-NGFLCNCPNGYTGKRCEAR--G-------------ISECS--KNVC-GTGGQCINIPGSFHC-NCTPGILGR 3974 (4289)
T ss_pred CEEEecC-CCeeEeCCCCccCceeecc--c-------------ccccc--cccc-cCCceeeccCCceEe-ccChhHhcc
Confidence 6788764 4578999999999999753 1 24566 4334 477788632334589 899999999
Q ss_pred CCCCCCCCCCC
Q psy6778 289 DCEKCKPFFSD 299 (314)
Q Consensus 289 ~Ce~C~~Gyy~ 299 (314)
+|..=+|.-|-
T Consensus 3975 ~c~~~~pni~~ 3985 (4289)
T KOG1219|consen 3975 TCCAEKPNILS 3985 (4289)
T ss_pred cCccccCcccc
Confidence 99666665443
No 30
>cd08366 APC10 APC10 subunit of the anaphase-promoting complex (APC) that mediates substrate ubiquitination. This model represents the single domain protein APC10, a subunit of the anaphase-promoting complex (APC), which is a multi-subunit E3 ubiquitin ligase. E3 ubiquitin ligases mediate substrate ubiquitination (or ubiquitylation), a vital component of the ubiquitin-26S proteasome pathway for selective proteolytic degradation. The APC (also known as the cyclosome), is a cell cycle-regulated E3 ubiquitin ligase that controls important transitions in mitosis and the G1 phase by ubiquitinating regulatory proteins, thereby targeting them for degradation. In mitosis, the APC initiates sister chromatid separation by ubiquitinating the anaphase inhibitor securin and triggers exit from mitosis by ubiquitinating cyclin B. The C-terminus of APC10 binds to CDC27/APC3, an APC subunit that contains multiple tetratrico peptide repeats. APC10 domains are homologous to the DOC1 domains present in the
Probab=91.09 E-value=0.54 Score=40.31 Aligned_cols=60 Identities=20% Similarity=0.338 Sum_probs=40.5
Q ss_pred CCCCceecccCCCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEeeecc---CCCCCCceEEEeecC
Q psy6778 82 RFPAEYLTDLNNPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLSFCP---KSIKPDSLAIYKSQD 154 (314)
Q Consensus 82 ~hp~~~~~D~~~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~~---~s~rP~~~~iekS~D 154 (314)
-+.+..|+|.+ ..|+|||+.. .|+.++ |.+.+...+..|.|-+.. +|..|..+.|.--.+
T Consensus 17 g~gv~~L~D~~---~~TyWQSDg~--------qPH~I~--i~f~k~~~i~~l~i~vd~~~DeSYtP~~I~v~~G~~ 79 (139)
T cd08366 17 GNGVDQLRDDS---LDTYWQSDGP--------QPHLIN--IQFSKKTDISAVALYLDYKLDESYTPSKISIRAGTS 79 (139)
T ss_pred CCCHHHhcCCC---CCccCCCCCC--------CCEEEE--EEecCCcEEEEEEEEEccCCCCCCCCEEEEEEEECC
Confidence 46788899977 5799999743 255555 455677777766554442 588888888755333
No 31
>KOG1226|consensus
Probab=90.64 E-value=0.49 Score=50.42 Aligned_cols=20 Identities=25% Similarity=0.690 Sum_probs=13.3
Q ss_pred eecccCcccccCCCCCCCCC
Q psy6778 190 LCTDRHKKQGGRCKCNGHAS 209 (314)
Q Consensus 190 ~Ct~~ys~v~g~C~CnGHa~ 209 (314)
+|+..-.=++|+|.||+-..
T Consensus 515 vCSgrG~C~CGqC~C~~~~~ 534 (783)
T KOG1226|consen 515 VCSGRGDCVCGQCVCHKPDN 534 (783)
T ss_pred CcCCCCcEeCCceEecCCCC
Confidence 55555445788888887655
No 32
>cd08365 APC10-like1 APC10-like DOC1 domains of E3 ubiquitin ligases that mediate substrate ubiquitination. This model represens the APC10-like DOC1 domain of multi-domain proteins present in E3 ubiquitin ligases. E3 ubiquitin ligases mediate substrate ubiquitination (or ubiquitylation), a component of the ubiquitin-26S proteasome pathway for selective proteolytic degradation. APC10/DOC1 domains such as those present in HECT (Homologous to the E6-AP Carboxyl Terminus) and Cullin-RING (Really Interesting New Gene) E3 ubiquitin ligase proteins, HECTD3, and CUL7, respectively, are also included here. CUL7 is a member of the Cullin-RING ligase family and functions as a molecular scaffold assembling a SCF-ROC1-like E3 ubiquitin ligase complex consisting of Skp1, CUL7, Fbx29 F-box protein, and ROC1 (RING-box protein 1) and promotes ubiquitination. CUL7 is a multi-domain protein with a C-terminal cullin domain that binds ROC1 and a centrally positioned APC10/DOC1 domain. HECTD3 contains a C-te
Probab=90.16 E-value=0.66 Score=39.49 Aligned_cols=56 Identities=23% Similarity=0.314 Sum_probs=39.3
Q ss_pred CCceecccCCCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEeeec--cCCCCCCceEEEe
Q psy6778 84 PAEYLTDLNNPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLSFC--PKSIKPDSLAIYK 151 (314)
Q Consensus 84 p~~~~~D~~~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~--~~s~rP~~~~iek 151 (314)
.+..|.|++ ..|+|||+.. +.++ .|+|.+.+...|..|.|.+. ++|..|.-+.|.-
T Consensus 16 gv~~L~D~~---~~tyWQSDG~-------q~pH--~I~i~f~k~v~i~~l~l~vd~~DeSYtP~~I~V~~ 73 (131)
T cd08365 16 DASRLTDGN---TSTYWQSDGS-------QGSH--WIRLKMKPDVLVRHLSLAVDATDSSYMPQRVVVAG 73 (131)
T ss_pred hHHHhhcCC---CCceEccCCC-------CCCE--EEEEEEcCCcEEEEEEEEecCCCCCCCCcEEEEEe
Confidence 567788976 5799999853 2234 66677777777777666653 3588888887744
No 33
>KOG3437|consensus
Probab=89.67 E-value=0.68 Score=41.23 Aligned_cols=70 Identities=24% Similarity=0.417 Sum_probs=46.1
Q ss_pred CCCCceecccCCCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEeeecc-CCCCCCceEEEeecCCCCCcc
Q psy6778 82 RFPAEYLTDLNNPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLSFCP-KSIKPDSLAIYKSQDFGKSWQ 160 (314)
Q Consensus 82 ~hp~~~~~D~~~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~~-~s~rP~~~~iekS~D~g~tw~ 160 (314)
-||++.|.|.+ ..|+|||..+ .|+.++|+++=.-..+...|.+.|.. .|.-|.-+-| +.|.+..
T Consensus 36 g~pvd~l~Ddn---~etyWqSdG~--------~PH~i~I~F~K~~~I~~v~if~~f~~DeSYtPs~i~I----~~G~g~~ 100 (184)
T KOG3437|consen 36 GFPVDNLRDDN---PETYWQSDGS--------QPHLINIQFHKRVDIQYVVIFLDFKQDESYTPSKIKI----RAGNGFN 100 (184)
T ss_pred CCChHHhhcCC---hhHheecCCC--------CCeEEEEEEEeEEEEEEEEEEEEEecccccCceeEEE----EecCChh
Confidence 48999999977 6899999876 37788887654433444445555554 3666766654 3466666
Q ss_pred cceeeh
Q psy6778 161 PLQFYS 166 (314)
Q Consensus 161 p~~yya 166 (314)
-++++.
T Consensus 101 dl~~~~ 106 (184)
T KOG3437|consen 101 DLWEIQ 106 (184)
T ss_pred heeeee
Confidence 666654
No 34
>cd08665 APC10-CUL7 APC10-like DOC1 domain of CUL7, subunit of the SCF-ROC1-like E3 ubiquitin ligase complex that mediates substrate ubiquitination. This model represents the APC10/DOC1 domain present in CUL7, a subunit of the SCF-ROC1-like E3 Ubiquitin (Ub) ligase complex, which mediates substrate ubiquitination (or ubiquitylation), and is a component of the ubiquitin-26S proteasome pathway for selective proteolytic degradation. CUL7 is a member of the Cullin-RING ligase family and functions as a molecular scaffold assembling the SCF-ROC1-like E3 Ub ligase complex consisting of the adapter protein Skp1, CUL7, the WD40 repeat-containing F-box Fbw8 (also known as Fbx29), and ROC1 (RING-box protein 1). CUL7 is a large protein with a C-terminal cullin domain that binds ROC1 and additional domains, including an APC10/DOC1 domain. While the Fbw8 protein is responsible for substrate protein recognition, the ROC1 RING domain recruits an Ub-charged E2 Ub-conjugating enzyme for substrate ubiqui
Probab=88.71 E-value=1.2 Score=37.85 Aligned_cols=58 Identities=17% Similarity=0.290 Sum_probs=39.0
Q ss_pred CCCCceecccCCCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEeeecc--CCCCCCceEEEe
Q psy6778 82 RFPAEYLTDLNNPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLSFCP--KSIKPDSLAIYK 151 (314)
Q Consensus 82 ~hp~~~~~D~~~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~~--~s~rP~~~~iek 151 (314)
.|-+..|.|++ ..|+|||... +.|+.+ +|.+.+.-.|..|.|...+ +|..|.-+.|.-
T Consensus 13 ~~~~~~L~D~~---~~tyWQSdG~-------q~pH~I--~i~f~k~v~I~~l~i~v~~~DeSYtPs~I~V~~ 72 (131)
T cd08665 13 PHRANKLTDGN---PKTYWESNGS-------TGSHYI--NIHMHRGVVIRQLYMLVASEDSSYMPARVVVLG 72 (131)
T ss_pred cccHHHhhcCC---CCceEccCCC-------CCCeEE--EEEECCCcEEEEEEEEecCCCCCcCCeeEEEEe
Confidence 35567889977 5799999864 124444 4557777777776666554 477888777743
No 35
>cd08667 APC10-ZZEF1 APC10/DOC1-like domain of uncharacterized Zinc finger ZZ-type and EF-hand domain-containing protein 1 (ZZEF1) and homologs. This model represents the APC10/DOC1-like domain present in the uncharacterized Zinc finger ZZ-type and EF-hand domain-containing protein 1 (ZZEF1) of Mus musculus. Members of this family contain EF-hand, APC10, CUB, and zinc finger ZZ-type domains. ZZEF1-like APC10 domains are homologous to the APC10 subunit/DOC1 domains present in E3 ubiquitin ligases, which mediate substrate ubiquitination (or ubiquitylation), and are components of the ubiquitin-26S proteasome pathway for selective proteolytic degradation.
Probab=88.48 E-value=1.1 Score=38.21 Aligned_cols=56 Identities=20% Similarity=0.348 Sum_probs=40.3
Q ss_pred CCceecccCCCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEeeecc--CCCCCCceEEEe
Q psy6778 84 PAEYLTDLNNPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLSFCP--KSIKPDSLAIYK 151 (314)
Q Consensus 84 p~~~~~D~~~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~~--~s~rP~~~~iek 151 (314)
.++.|.|++ ..|+|||+.. +.|+ .|+|.+.+...|..|.|...+ +|..|..+.|.-
T Consensus 15 gv~~L~D~~---~~TYWQSDG~-------q~pH--~I~i~f~~~v~i~~l~i~v~~~DeSYtPs~I~I~~ 72 (131)
T cd08667 15 DIDRMTDGE---TSTYWQSDGS-------ARSH--WIRLKMKPDVVLRHLSIAVAATDQSYMPQQVTVSV 72 (131)
T ss_pred hhHHhhcCC---CCccCccCCC-------CCCe--EEEEEecCCeEEEEEEEEEcCCCCCcCCcEEEEEe
Confidence 567889977 5799999853 2244 566778888888887766643 488888887744
No 36
>KOG0196|consensus
Probab=87.49 E-value=0.95 Score=48.85 Aligned_cols=64 Identities=20% Similarity=0.370 Sum_probs=45.3
Q ss_pred ceeeeCCCCC----CCCCCccCCCCCCCCCCCCCcccCCCCccCCcccCCCCCCCcccCCCCccccCCCCCCCCCCCCCC
Q psy6778 218 ELVCECRHNT----AGKDCEKCKPFFSDRPWGRATVYDANECKGGRCKCNGHASRCIKSPQGALVCECRHNTAGKDCEKC 293 (314)
Q Consensus 218 ~~~C~C~~nT----~G~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~C~Cn~h~~~C~~~~~~G~C~~C~~nt~G~~Ce~C 293 (314)
.|.|.|+.++ .|..|+-|.+|||...- ....|. +|.=|.|+.. ...-.| +|
T Consensus 258 iG~C~C~aGye~~~~~~~C~aCp~G~yK~~~------~~~~C~--~CP~~S~s~~----ega~~C-------------~C 312 (996)
T KOG0196|consen 258 IGGCVCKAGYEEAENGKACQACPPGTYKASQ------GDSLCL--PCPPNSHSSS----EGATSC-------------TC 312 (996)
T ss_pred cCceeecCCCCcccCCCcceeCCCCcccCCC------CCCCCC--CCCCCCCCCC----CCCCcc-------------cc
Confidence 3689998776 57899999999995431 345788 8877776644 223455 47
Q ss_pred CCCCCCCCCCCCC
Q psy6778 294 KPFFSDRPWGRAT 306 (314)
Q Consensus 294 ~~Gyy~~p~~~~~ 306 (314)
..||||.+..+..
T Consensus 313 ~~gyyRA~~Dp~~ 325 (996)
T KOG0196|consen 313 ENGYYRADSDPPS 325 (996)
T ss_pred cCCcccCCCCCCC
Confidence 8899997765543
No 37
>COG5156 DOC1 Anaphase-promoting complex (APC), subunit 10 [Cell division and chromosome partitioning / Posttranslational modification, protein turnover, chaperones]
Probab=85.33 E-value=0.79 Score=40.17 Aligned_cols=71 Identities=23% Similarity=0.340 Sum_probs=49.5
Q ss_pred CCCCCCCceecccCCCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEe--eecc-CCCCCCceEEEeecCC
Q psy6778 79 PRGRFPAEYLTDLNNPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISL--SFCP-KSIKPDSLAIYKSQDF 155 (314)
Q Consensus 79 p~~~hp~~~~~D~~~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l--~f~~-~s~rP~~~~iekS~D~ 155 (314)
..+.||...+-|.+ -.|+|||+.. ..-+|.|.+.|...|.+|.| .|-. +|.-|..+-+..
T Consensus 37 fK~g~p~r~~lddn---~dtyWqsDg~----------qPH~i~I~F~kr~~I~~v~lfls~t~DeSYtPs~i~v~a---- 99 (189)
T COG5156 37 FKRGHPLRELLDDN---MDTYWQSDGV----------QPHSIQISFDKRRYIQSVQLFLSFTQDESYTPSKIGVRA---- 99 (189)
T ss_pred cccCCcHHHHhhcc---hhhhhccCCC----------CCceEEEEEeEEEeeeeehhhhhhhcccccCcceeEEec----
Confidence 45789999999876 5799999875 34678888888888887654 3332 467777766643
Q ss_pred CCCcccceeeh
Q psy6778 156 GKSWQPLQFYS 166 (314)
Q Consensus 156 g~tw~p~~yya 166 (314)
|.+.+..++++
T Consensus 100 G~~~~D~r~~~ 110 (189)
T COG5156 100 GLTREDVREIS 110 (189)
T ss_pred cCChhhheeEE
Confidence 55555555554
No 38
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=84.08 E-value=0.94 Score=29.09 Aligned_cols=26 Identities=35% Similarity=0.906 Sum_probs=20.9
Q ss_pred CCCCCCCCCccCCCCceeeeCCCCCCCCCC
Q psy6778 203 KCNGHASRCIKSPQGELVCECRHNTAGKDC 232 (314)
Q Consensus 203 ~CnGHa~~C~~~~~g~~~C~C~~nT~G~~C 232 (314)
.|++| ..|+.. .++|.|..++.|++|
T Consensus 7 ~C~~~-G~C~~~---~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGH-GTCVSP---CGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCC-CEEeCC---CCEEECCCCCcCCCC
Confidence 48888 788742 359999999999987
No 39
>PF03256 APC10: Anaphase-promoting complex, subunit 10 (APC10); InterPro: IPR004939 The anaphase-promoting complex (APC) or cyclosome is a multi-subunit E3 protein ubiquitin ligase that regulates important events in mitosis such as the initiation of anaphase and exit from telophase. The APC, in conjunction with other enzymes, assembles multi-ubiquitin chains on a variety of regulatory proteins, thereby targeting them for proteolysis by the 26S proteasome. Anaphase is initiated when the APC triggers the destruction of securin, thereby allowing the protease, separase, to disrupt sister-chromatid cohesion. Securin ubiquitination by the APC is inhibited by cyclin-dependent kinase 1 (Cdk1)-dependent phosphorylation []. Forkhead Box M1 (FoxM1), which is a transcription factor that is over-expressed in many cancers, is degraded in late mitosis and early G1 phase by the APC/cyclosome (APC/C) E3 ubiquitin ligase []. The APC/C targets mitotic cyclins for destruction in mitosis and G1 phase and is then inactivated at S phase. It thereby generates alternating states of high and low cyclin-Cdk activity, which is required for the alternation of mitosis and DNA replication []. The APC/C is composed of at least 13 subunits that stay tightly associated throughout the cell cycle: APC1, APC2, APC4, APC5, APC9, APC11, CDC16, CDC23, CDC26, CDC27, DOC1, MND2 and SWM1[], []. In fission yeast the 13 subunits are known as: Apc1, Apc2, Nuc2, Apc4, Apc5, Cut9, Apc8, Apc10, Apc11, Hcn1, Apc13, Apc14 and Apc15 []. One of the subunits of the APC that is required for ubiquitination activity is APC10, a one-domain protein homologous to a sequence element, termed the DOC domain, found in several hypothetical proteins that may also mediate ubiquitination reactions, because they contain combinations of either RING finger (see PDOC00449 from PROSITEDOC), cullin (see PDOC00967 from PROSITEDOC) or HECT (see PDOC50237 from PROSITEDOC) domains [, , ]. The DOC domain consists of a beta-sandwich, in which a five-stranded antiparallel beta-sheet is packed on top of a three stranded antiparallel beta-sheet, exhibiting a 'jellyroll' fold [, ]. Proteins known to contain a DOC domain include: Eucaryotic Doc1/Apc10. Mammalian protein associated with the transcription factor Myc (PAM). Mouse runty-jerky-sterile (RJS) protein. Human HERC2, the ortholog of RJS. ; PDB: 1GQP_B 1JHJ_A.
Probab=82.18 E-value=1.8 Score=39.18 Aligned_cols=60 Identities=20% Similarity=0.378 Sum_probs=36.3
Q ss_pred CCCCc-eecccCCCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEeeec---cCCCCCCceEEEeecC
Q psy6778 82 RFPAE-YLTDLNNPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLSFC---PKSIKPDSLAIYKSQD 154 (314)
Q Consensus 82 ~hp~~-~~~D~~~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~---~~s~rP~~~~iekS~D 154 (314)
.+.+. .|+|.+ ..|+|||+.. .| =.|+|.+.+...|..|.|-+. .+|..|..+.|.--.+
T Consensus 44 g~gv~~~LrD~~---~~TyWQSDG~--------qp--H~I~i~f~k~v~I~~l~iyvd~~~DeSYtPs~I~V~aG~~ 107 (193)
T PF03256_consen 44 GFGVAELLRDGS---TETYWQSDGS--------QP--HWITIQFHKRVIISSLSIYVDYKQDESYTPSKISVRAGNS 107 (193)
T ss_dssp TBSCHGHCHSS----TT--EE--SS--------SS--EEEEEEEEEEEEE-EEEEEEEHHHHGGGSEEEEEEEEESS
T ss_pred CCCchheeeCCC---hhHhhccCCC--------CC--EEEEEEEcCCcEEEEEEEEEccCcCCCCCceEEEEEEeCC
Confidence 35666 777876 6899999754 13 467777777778887766543 2588888888865433
No 40
>KOG1226|consensus
Probab=79.21 E-value=8.2 Score=41.52 Aligned_cols=14 Identities=43% Similarity=1.025 Sum_probs=7.4
Q ss_pred eeeCCCCCCCCCCc
Q psy6778 220 VCECRHNTAGKDCE 233 (314)
Q Consensus 220 ~C~C~~nT~G~~Ce 233 (314)
+|.|..++.|..||
T Consensus 479 ~C~C~~G~~G~~CE 492 (783)
T KOG1226|consen 479 QCRCDEGWLGKKCE 492 (783)
T ss_pred ceecCCCCCCCccc
Confidence 44555555555554
No 41
>KOG4260|consensus
Probab=77.30 E-value=1.4 Score=42.01 Aligned_cols=38 Identities=24% Similarity=0.613 Sum_probs=29.5
Q ss_pred CCCCCCCCccCC--CCceeeeCCCCCCCCCCccCCCCCCCC
Q psy6778 204 CNGHASRCIKSP--QGELVCECRHNTAGKDCEKCKPFFSDR 242 (314)
Q Consensus 204 CnGHa~~C~~~~--~g~~~C~C~~nT~G~~Ce~C~pgy~~~ 242 (314)
|+|. ..|.-|. .|.|.|.|+.+++|+.|..|.++||-.
T Consensus 152 C~Gn-G~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes 191 (350)
T KOG4260|consen 152 CFGN-GSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFES 191 (350)
T ss_pred cCCC-CcccCCCCCCCCCcccccCCCCCccccccchHHHHh
Confidence 5555 4565432 467899999999999999999999953
No 42
>KOG1225|consensus
Probab=75.72 E-value=6.1 Score=40.98 Aligned_cols=15 Identities=40% Similarity=0.866 Sum_probs=12.2
Q ss_pred eeeeCCCCCCCCCCc
Q psy6778 219 LVCECRHNTAGKDCE 233 (314)
Q Consensus 219 ~~C~C~~nT~G~~Ce 233 (314)
++|.|...+.|.+|+
T Consensus 296 g~CiC~~g~~G~dCs 310 (525)
T KOG1225|consen 296 GECICNPGYSGKDCS 310 (525)
T ss_pred CEeecCCCccccccc
Confidence 388888888888884
No 43
>cd08664 APC10-HERC2 APC10-like DOC1 domain present in HERC2 (HECT domain and RLD2). This model represents the APC10/DOC1 domain present in HERC2 (HECT domain and RLD2), a large multi-domain protein with three RCC1-like domains (RLDs), additional internal domains including a zinc finger ZZ-type and Cyt-b5 (Cytochrome b5-like Heme/Steroid binding) domains, and a C-terminal HECT (Homologous to the E6-AP Carboxyl Terminus) domain. The APC10/DOC1 domain of HERC2 is a homolog of the APC10 subunit and the DOC1 domain present in E3 ubiquitin ligases which mediate substrate ubiquitination (or ubiquitylation), a component of the ubiquitin-26S proteasome pathway for selective proteolytic degradation. As suggested by structural relationships between HERC2 and other proteins such as HERC1, the proposed role for HERC2 in protein trafficking and degradation pathways is consistent with observations that mutations in HERC2 lead to neuromuscular secretory vesicle and sperm acrosome defects, other develo
Probab=74.73 E-value=8.1 Score=33.74 Aligned_cols=57 Identities=18% Similarity=0.239 Sum_probs=37.9
Q ss_pred CCCCceecccCCCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEeeecc--CCCCCCceEEEe
Q psy6778 82 RFPAEYLTDLNNPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLSFCP--KSIKPDSLAIYK 151 (314)
Q Consensus 82 ~hp~~~~~D~~~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~~--~s~rP~~~~iek 151 (314)
.|-+..|+|+ ..|+|||... +.++.++ |.+.+...|..+.|...+ +|..|.-+.|.-
T Consensus 37 ~~~~~~L~D~----~~TYWQSdG~-------q~~HwI~--l~~~~~v~I~~L~i~vd~~DeSY~Ps~I~V~~ 95 (152)
T cd08664 37 ENQAKRLIDG----SGSYWQSSGS-------QGKHWIR--LELHPDVLIHSLKIIVDPADSSYMPSLVVVSG 95 (152)
T ss_pred cccHHHhcCC----CCCeeccCCC-------CCceEEE--EEECCCcEEEEEEEEecCCCCCcCCceEEEEe
Confidence 3567778886 3699999864 2234555 456666677777776664 477888777743
No 44
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=70.44 E-value=1.9 Score=22.25 Aligned_cols=13 Identities=31% Similarity=1.022 Sum_probs=9.6
Q ss_pred eeeCCCCCCCCCC
Q psy6778 220 VCECRHNTAGKDC 232 (314)
Q Consensus 220 ~C~C~~nT~G~~C 232 (314)
.|.|+.+++|++|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 4788888888877
No 45
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=66.26 E-value=25 Score=27.80 Aligned_cols=63 Identities=27% Similarity=0.606 Sum_probs=34.8
Q ss_pred eCCCCCCCCCCccCCCCCCCCCCCCCcccCCCCccCCccc-CCCCCCCcccCCCCccccCCCCCCCCCCCCCCCCCCCCC
Q psy6778 222 ECRHNTAGKDCEKCKPFFSDRPWGRATVYDANECKGGRCK-CNGHASRCIKSPQGALVCECRHNTAGKDCEKCKPFFSDR 300 (314)
Q Consensus 222 ~C~~nT~G~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~C~-Cn~h~~~C~~~~~~G~C~~C~~nt~G~~Ce~C~~Gyy~~ 300 (314)
.|. .+....|+.|.+|+|...|. ....|+ +|. |. .+..- .. .| ..|.-..|+ |++|||-+
T Consensus 26 ~C~-~~~~t~C~~C~~g~ys~~~~-----~~~~C~--~c~~C~-~g~~~-----~~---~c-t~t~dt~C~-C~~G~y~~ 86 (98)
T cd00185 26 DCT-PGSDTVCEPCPPGTYTDSWN-----HLPKCL--SCRTCD-SGLVE-----KA---PC-TATRNTVCG-CKPGFYCL 86 (98)
T ss_pred cCC-CCCCCeecCCCCCCcccCCC-----CCCcCC--cCccCC-CCCEE-----Ec---cC-CCCCCCeEe-CCCCCEec
Confidence 344 33456788899988865442 123566 542 33 11110 11 23 245556676 89999987
Q ss_pred CCC
Q psy6778 301 PWG 303 (314)
Q Consensus 301 p~~ 303 (314)
...
T Consensus 87 ~~~ 89 (98)
T cd00185 87 TKA 89 (98)
T ss_pred CCC
Confidence 654
No 46
>smart00607 FTP eel-Fucolectin Tachylectin-4 Pentaxrin-1 Domain.
Probab=60.26 E-value=17 Score=31.42 Aligned_cols=20 Identities=5% Similarity=0.046 Sum_probs=16.7
Q ss_pred eeEeecCceeEEEEEEeeec
Q psy6778 119 TLTLSLGKKFELTYISLSFC 138 (314)
Q Consensus 119 titl~l~~~f~~~~v~l~f~ 138 (314)
--.+||++.|.|..|.|...
T Consensus 53 WW~VDL~~~~~V~~V~I~NR 72 (151)
T smart00607 53 WWRVDLLQYMTIHSVTITNR 72 (151)
T ss_pred eEEEeCCCeEEeeEEEEecC
Confidence 35789999999999988765
No 47
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=57.89 E-value=4.1 Score=29.53 Aligned_cols=31 Identities=35% Similarity=0.865 Sum_probs=17.0
Q ss_pred CCCCCCCCCccCC---CCceeeeCCCCCCCCCCcc
Q psy6778 203 KCNGHASRCIKSP---QGELVCECRHNTAGKDCEK 234 (314)
Q Consensus 203 ~CnGHa~~C~~~~---~g~~~C~C~~nT~G~~Ce~ 234 (314)
.|+||...-. |. .|...|+|..-++|++|+.
T Consensus 18 ~CSGHGr~fl-Dg~~~dG~p~CECn~Cy~GpdCS~ 51 (56)
T PF04863_consen 18 SCSGHGRAFL-DGLIADGSPVCECNSCYGGPDCST 51 (56)
T ss_dssp --TTSEE--T-TS-EETTEE--EE-TTEESTTS-E
T ss_pred CcCCCCeeee-ccccccCCccccccCCcCCCCccc
Confidence 5889955442 21 3567899999999999964
No 48
>KOG4276|consensus
Probab=51.62 E-value=21 Score=29.06 Aligned_cols=50 Identities=20% Similarity=0.286 Sum_probs=38.4
Q ss_pred eEeecCceeEEEEEEeeeccCCCCCC--ceEEEeecCCCCCcccceeehhhhh
Q psy6778 120 LTLSLGKKFELTYISLSFCPKSIKPD--SLAIYKSQDFGKSWQPLQFYSSQCK 170 (314)
Q Consensus 120 itl~l~~~f~~~~v~l~f~~~s~rP~--~~~iekS~D~g~tw~p~~yya~~C~ 170 (314)
..||||-....+--.|+|.....|.+ -|.+.-|+| |++|+...--..|+.
T Consensus 10 f~iDLG~~vip~~y~lrh~rgygRsalRnW~fQgS~D-gktWt~l~vH~DD~s 61 (113)
T KOG4276|consen 10 FAIDLGLEVIPTAYTLRHARGYGRSALRNWKFQGSKD-GKTWTDLRVHVDDKS 61 (113)
T ss_pred EEEecCceEeeeeeeeeecccccHHHhhheeeecccc-CCcceeEEEEecccc
Confidence 36788888888877788874444543 489999999 999999888877765
No 49
>KOG1214|consensus
Probab=49.19 E-value=18 Score=39.66 Aligned_cols=54 Identities=24% Similarity=0.467 Sum_probs=29.6
Q ss_pred eeeeCCCCCCCCCCccCCCCCCCCCCCCCcccCCCCccCCcccCCCCCCCcccCCCCccc
Q psy6778 219 LVCECRHNTAGKDCEKCKPFFSDRPWGRATVYDANECKGGRCKCNGHASRCIKSPQGALV 278 (314)
Q Consensus 219 ~~C~C~~nT~G~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~C~Cn~h~~~C~~~~~~G~C 278 (314)
++|.|-|+=.|..==.|+|||-|+-...- ...+|. |-.|+. +..|+.+.+.-.|
T Consensus 797 g~a~c~~hGgs~y~C~CLPGfsGDG~~c~---dvDeC~--psrChp-~A~CyntpgsfsC 850 (1289)
T KOG1214|consen 797 GQARCVHHGGSTYSCACLPGFSGDGHQCT---DVDECS--PSRCHP-AATCYNTPGSFSC 850 (1289)
T ss_pred CceEEEecCCceEEEeecCCccCCccccc---cccccC--ccccCC-CceEecCCCccee
Confidence 46777777677654579999987632211 225666 555553 2244443333344
No 50
>smart00181 EGF Epidermal growth factor-like domain.
Probab=47.14 E-value=21 Score=22.08 Aligned_cols=22 Identities=36% Similarity=0.992 Sum_probs=14.6
Q ss_pred CCccCCCCceeeeCCCCCCC-CCC
Q psy6778 210 RCIKSPQGELVCECRHNTAG-KDC 232 (314)
Q Consensus 210 ~C~~~~~g~~~C~C~~nT~G-~~C 232 (314)
.|... .+...|.|+.++.| ..|
T Consensus 12 ~C~~~-~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 12 TCINT-PGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred EEECC-CCCeEeECCCCCccCCcc
Confidence 56643 34567888888887 555
No 51
>PF07738 Sad1_UNC: Sad1 / UNC-like C-terminal ; InterPro: IPR012919 The Caenorhabditis elegans UNC-84 protein is a nuclear envelope protein that is involved in nuclear anchoring and migration during development. The S. pombe Sad1 protein localises at the spindle pole body. UNC-84 and Sad1 share a common C-terminal region that is often termed the SUN (Sad1 and UNC) domain [, ]. In mammals, the SUN domain is present in two proteins, Sun1 and Sun2 []. The SUN domain of Sun2 has been demonstrated to be in the periplasm []. ; PDB: 4DXT_A 3UNP_A 4DXR_A 4DXS_A.
Probab=44.20 E-value=57 Score=27.01 Aligned_cols=63 Identities=21% Similarity=0.389 Sum_probs=40.1
Q ss_pred CCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEeeecc-----CCCCCCceEEEeecCCCCC-cccceeeh
Q psy6778 93 NPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLSFCP-----KSIKPDSLAIYKSQDFGKS-WQPLQFYS 166 (314)
Q Consensus 93 ~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~~-----~s~rP~~~~iekS~D~g~t-w~p~~yya 166 (314)
......||-.+. ..-.|+|.|.+...++.|.|..-+ .+..|..+-|.-+.|.... |..+.-|.
T Consensus 18 ~~~~g~Cw~~~g-----------~~~~~~I~L~~~I~v~~v~i~h~~~~~~~~sSapk~f~V~g~~~~~~~~~~~LG~f~ 86 (135)
T PF07738_consen 18 SYMPGPCWAFEG-----------SKGWLTIELSEPIYVTSVTIEHVEKSIAPFSSAPKDFEVWGSVDYPDEEWVLLGEFE 86 (135)
T ss_dssp --STT-SEEEET-----------T-EEEEEEEEEEEEEEEEEEE---GGGSG-TTSB-EEEEEEESSSSTS--EEEEEEE
T ss_pred cccCCccCccCC-----------CceEEEEEcCCcEEEEEEEEEEeccccCCcCCCCcEEEEEEEecccccceeeeceEE
Confidence 344567995432 346788999999999998886542 1557888999999887766 88776554
No 52
>smart00231 FA58C Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
Probab=43.22 E-value=37 Score=28.05 Aligned_cols=45 Identities=16% Similarity=0.166 Sum_probs=28.3
Q ss_pred ceeeEeecCceeEEEEEEeeeccCCCC-CCceEEEeecCCCCCcccce
Q psy6778 117 NVTLTLSLGKKFELTYISLSFCPKSIK-PDSLAIYKSQDFGKSWQPLQ 163 (314)
Q Consensus 117 ~vtitl~l~~~f~~~~v~l~f~~~s~r-P~~~~iekS~D~g~tw~p~~ 163 (314)
.--|+|||++.+.|+-|.++....... -....+ -|.| |.+|.++.
T Consensus 44 ~~wlqvDLg~~~~v~~i~~qg~~~~~~~~~~~~~-~s~d-g~~W~~~~ 89 (139)
T smart00231 44 PPWIQVDLGRTRTVTGVITGGRHGDGDGVTYKLL-YSDD-GNNWTTYK 89 (139)
T ss_pred CceeEeeccCcEEEEEEEecccCCCCcEEEEEEE-EEeC-CCCEeEEe
Confidence 347899999999999998875311111 112223 4555 77898664
No 53
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=40.60 E-value=54 Score=26.21 Aligned_cols=47 Identities=21% Similarity=0.414 Sum_probs=29.4
Q ss_pred CCCccCCCCCCCCCCCCCcccCCCCccCCcccCCCCCCCcccCCCCccccCCCCCCCCCCCCCCCCCCCCCC
Q psy6778 230 KDCEKCKPFFSDRPWGRATVYDANECKGGRCKCNGHASRCIKSPQGALVCECRHNTAGKDCEKCKPFFSDRP 301 (314)
Q Consensus 230 ~~Ce~C~pgy~~~p~~~~t~~~~~~C~~~~C~Cn~h~~~C~~~~~~G~C~~C~~nt~G~~Ce~C~~Gyy~~p 301 (314)
..|..|.++|+-.. +...|. +|+= + -|..|.. ...|..|..||+-+.
T Consensus 4 ~~Ct~C~~g~~~~~-------~~~~C~--~C~~----~---------~C~~C~~---~~~C~~C~~GY~~~~ 50 (96)
T PTZ00382 4 AVCTSCDSDKKPNK-------DGSGCV--LCSV----G---------NCKSCVV---DGVCGECNSGFSLDN 50 (96)
T ss_pred cccCcCCCCCccCC-------CCCcCC--cCCC----C---------CCcCCCC---CCccccCcCCcccCC
Confidence 46888999997431 234688 6641 1 2445532 356888999998643
No 54
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=37.80 E-value=21 Score=30.42 Aligned_cols=26 Identities=27% Similarity=0.695 Sum_probs=20.4
Q ss_pred CcccC--CCCccccCCCCCCCCCCCCCCC
Q psy6778 268 RCIKS--PQGALVCECRHNTAGKDCEKCK 294 (314)
Q Consensus 268 ~C~~~--~~~G~C~~C~~nt~G~~Ce~C~ 294 (314)
.|.+- ...-.| +|..++.|.+||+=.
T Consensus 57 ~C~yI~dl~~~~C-rC~~GYtGeRCEh~d 84 (139)
T PHA03099 57 DCIHARDIDGMYC-RCSHGYTGIRCQHVV 84 (139)
T ss_pred EEEeeccCCCcee-ECCCCccccccccee
Confidence 67654 346788 899999999999865
No 55
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=33.02 E-value=51 Score=20.41 Aligned_cols=23 Identities=35% Similarity=1.018 Sum_probs=13.9
Q ss_pred CCccCCCCceeeeCCCCCC-CCCCc
Q psy6778 210 RCIKSPQGELVCECRHNTA-GKDCE 233 (314)
Q Consensus 210 ~C~~~~~g~~~C~C~~nT~-G~~Ce 233 (314)
.|... .+...|.|+.++. |.+|+
T Consensus 16 ~C~~~-~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 16 TCVNT-VGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred EeECC-CCCeEeECCCCCccCCcCC
Confidence 46532 3445677777776 77664
No 56
>PHA02668 GM-CSF/IL-2 inhibition factor; Provisional
Probab=32.86 E-value=76 Score=29.65 Aligned_cols=40 Identities=15% Similarity=0.351 Sum_probs=27.4
Q ss_pred CceeeEeecCceeEEEEEEeeeccCCCCCCceEEEeecCCCCCcccce
Q psy6778 116 DNVTLTLSLGKKFELTYISLSFCPKSIKPDSLAIYKSQDFGKSWQPLQ 163 (314)
Q Consensus 116 ~~vtitl~l~~~f~~~~v~l~f~~~s~rP~~~~iekS~D~g~tw~p~~ 163 (314)
-||+|.+--.+.|.+.||-+.|. . --..|-+ .+| +|.||-
T Consensus 81 INVSVali~~gvip~syigv~fn--p-~~~ymyV--nvs---Sw~PW~ 120 (265)
T PHA02668 81 INVSVSAINCGFFDMRQVEVTYD--T-ARRQMYV--YLD---TWDPWV 120 (265)
T ss_pred eEEEEEEeecceeeeeeEeeeec--C-CcceEEE--Eec---ccCccc
Confidence 36666665567799999999998 2 2233444 333 789987
No 57
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=31.79 E-value=59 Score=19.69 Aligned_cols=16 Identities=25% Similarity=0.655 Sum_probs=9.7
Q ss_pred CccccCCCCCCCCCCCC
Q psy6778 275 GALVCECRHNTAGKDCE 291 (314)
Q Consensus 275 ~G~C~~C~~nt~G~~Ce 291 (314)
+..| .|..++.|.+|+
T Consensus 23 ~~~C-~C~~g~~g~~C~ 38 (38)
T cd00054 23 SYRC-SCPPGYTGRNCE 38 (38)
T ss_pred CeEe-ECCCCCcCCcCC
Confidence 3456 577777776653
No 58
>KOG4350|consensus
Probab=31.31 E-value=17 Score=36.83 Aligned_cols=54 Identities=20% Similarity=0.400 Sum_probs=41.3
Q ss_pred ceeeEeecCceeEEEEEEeeeccCCCCCCceEEEeecCCCCCcccceee-hhhhhc
Q psy6778 117 NVTLTLSLGKKFELTYISLSFCPKSIKPDSLAIYKSQDFGKSWQPLQFY-SSQCKK 171 (314)
Q Consensus 117 ~vtitl~l~~~f~~~~v~l~f~~~s~rP~~~~iekS~D~g~tw~p~~yy-a~~C~~ 171 (314)
.--|.|+||+.|.|.+|+|.|=.+..|..+.-+|-|+| ..-|.-.-=| ..+|+.
T Consensus 326 ~sgi~i~LG~p~iINhIrmllWdrdsraysY~veVSmD-~~hW~rV~DyS~Y~CRs 380 (620)
T KOG4350|consen 326 ISGIQIDLGKPFIINHIRMLLWDRDSRAYSYQVEVSMD-DAHWNRVADYSEYDCRS 380 (620)
T ss_pred ceeEEEecCCchhhhhhhhhhhcccccceEEEEEEecc-hhhhHHhhhhhhhcccc
Confidence 35689999999999999998875677899999999999 5666543323 245553
No 59
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=27.97 E-value=67 Score=19.01 Aligned_cols=21 Identities=29% Similarity=0.869 Sum_probs=11.9
Q ss_pred CCCccCCCCceeeeCCCCCCCC
Q psy6778 209 SRCIKSPQGELVCECRHNTAGK 230 (314)
Q Consensus 209 ~~C~~~~~g~~~C~C~~nT~G~ 230 (314)
..|.... +...|.|+.++.|.
T Consensus 12 ~~C~~~~-~~~~C~C~~g~~g~ 32 (36)
T cd00053 12 GTCVNTP-GSYRCVCPPGYTGD 32 (36)
T ss_pred CEEecCC-CCeEeECCCCCccc
Confidence 3454332 34567777777666
No 60
>KOG1214|consensus
Probab=27.28 E-value=1.5e+02 Score=32.85 Aligned_cols=74 Identities=19% Similarity=0.350 Sum_probs=36.8
Q ss_pred CCCCCCCccCCCCCCCCCCCCCcc-cCCCCccCC---cccCCCCCCCccc-CCCCccccCCCCCCCC-----------CC
Q psy6778 226 NTAGKDCEKCKPFFSDRPWGRATV-YDANECKGG---RCKCNGHASRCIK-SPQGALVCECRHNTAG-----------KD 289 (314)
Q Consensus 226 nT~G~~Ce~C~pgy~~~p~~~~t~-~~~~~C~~~---~C~Cn~h~~~C~~-~~~~G~C~~C~~nt~G-----------~~ 289 (314)
||.|.+==+|+|||+|+-.+-.-. .....|..- |=.|++.+.-|.. |..+-+| .|+..-.| ..
T Consensus 843 ntpgsfsC~C~pGy~GDGf~CVP~~~~~T~C~~er~hpl~chg~t~~~~~~Dp~~~e~-p~~~~ppG~~~~~c~~~~~~~ 921 (1289)
T KOG1214|consen 843 NTPGSFSCRCQPGYYGDGFQCVPDTSSLTPCEQERFHPLQCHGSTGFCWCVDPDGHEV-PGTQTPPGSTPPHCGPSPEQY 921 (1289)
T ss_pred cCCCcceeecccCccCCCceecCCCccCCccccccccceeeccccceeEeeCCCcccC-CCCCCCCCCCCCCCCCccccc
Confidence 566776668888888753321110 011122200 1124443333321 2334566 57777777 66
Q ss_pred CCCCCC-CCCCC
Q psy6778 290 CEKCKP-FFSDR 300 (314)
Q Consensus 290 Ce~C~~-Gyy~~ 300 (314)
|.+|.. |||-.
T Consensus 922 vp~Cd~hgh~ap 933 (1289)
T KOG1214|consen 922 VPQCDDHGHFAP 933 (1289)
T ss_pred CCCccccccccc
Confidence 677755 66653
No 61
>smart00051 DSL delta serrate ligand.
Probab=27.05 E-value=65 Score=23.72 Aligned_cols=23 Identities=26% Similarity=0.401 Sum_probs=16.8
Q ss_pred CCCCCCCccCCCCceeeeCCCCCCCCCC
Q psy6778 205 NGHASRCIKSPQGELVCECRHNTAGKDC 232 (314)
Q Consensus 205 nGHa~~C~~~~~g~~~C~C~~nT~G~~C 232 (314)
.|| -.|++ .|.|.|..+..|++|
T Consensus 41 ~~~-~~Cd~----~G~~~C~~Gw~G~~C 63 (63)
T smart00051 41 FGH-YTCDE----NGNKGCLEGWMGPYC 63 (63)
T ss_pred cCC-ccCCc----CCCEecCCCCcCCCC
Confidence 566 66764 247888888888876
No 62
>PF01834 XRCC1_N: XRCC1 N terminal domain; InterPro: IPR002706 DNA-repair protein Xrcc1 functions in the repair of single-strand DNA breaks in mammalian cells and forms a repair complex with beta-Pol, ligase III and PARP []. The NMR solution structure of the Xrcc1 N-terminal domain (Xrcc1 NTD) shows that the structural core is a beta-sandwich with beta-strands connected by loops, three helices and two short two-stranded beta-sheets at each connection side. The Xrcc1 NTD specifically binds single-strand break DNA (gapped and nicked) and a gapped DNA-beta-Pol complex [].; GO: 0003684 damaged DNA binding, 0000012 single strand break repair, 0005634 nucleus; PDB: 3K77_E 3K75_C 3LQC_A 1XNA_A 1XNT_A.
Probab=26.69 E-value=36 Score=29.78 Aligned_cols=47 Identities=21% Similarity=0.173 Sum_probs=31.8
Q ss_pred CCCCCceecccCCCCCceeeccCCcccCCcCCCCCCceeeEeecCceeEEEEEEeeec
Q psy6778 81 GRFPAEYLTDLNNPSNVTCWRSEAQTSVNSLSASPDNVTLTLSLGKKFELTYISLSFC 138 (314)
Q Consensus 81 ~~hp~~~~~D~~~~~~~twWqS~~~~~~~~~~~~~~~vtitl~l~~~f~~~~v~l~f~ 138 (314)
..||++++.+.+. ...|+-+..+ ...+.|-|-|+++-+|++|-|.=+
T Consensus 17 ~~~~A~NLL~~d~--~r~W~~~~~g---------ek~~~V~lQl~~~~~I~~IDIGN~ 63 (153)
T PF01834_consen 17 PVHPAENLLKSDT--YRKWKCAKAG---------EKQASVELQLEKASQITSIDIGNE 63 (153)
T ss_dssp SSSHGGGGSCGGG--CHHEEHSSTT----------SEEEEEEEEEEEE--SEEEEEEE
T ss_pred CCCchhhccCccc--CCcccccCCC---------CceEEEEEEecCceEEEEEeccCC
Confidence 4799999988653 3567655443 235677888888889999888755
No 63
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=24.13 E-value=21 Score=22.51 Aligned_cols=23 Identities=39% Similarity=0.865 Sum_probs=16.3
Q ss_pred CCCccCCCCceeeeCCCCCCCCC
Q psy6778 209 SRCIKSPQGELVCECRHNTAGKD 231 (314)
Q Consensus 209 ~~C~~~~~g~~~C~C~~nT~G~~ 231 (314)
..|.....+...|.|+.+++|++
T Consensus 10 g~C~~~~~~~y~C~C~~G~~G~~ 32 (32)
T PF00008_consen 10 GTCIDLPGGGYTCECPPGYTGKR 32 (32)
T ss_dssp EEEEEESTSEEEEEEBTTEESTT
T ss_pred eEEEeCCCCCEEeECCCCCccCC
Confidence 56765433567899999998874
No 64
>KOG4611|consensus
Probab=22.29 E-value=1.2e+02 Score=30.50 Aligned_cols=66 Identities=23% Similarity=0.417 Sum_probs=36.0
Q ss_pred CCCccCCCCCCCCCCCCCcccCCCCcc---CCcccCCCCCCCcccCCCCccccCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy6778 230 KDCEKCKPFFSDRPWGRATVYDANECK---GGRCKCNGHASRCIKSPQGALVCECRHNTAGKDCEKCKPFFSDRPWGRAT 306 (314)
Q Consensus 230 ~~Ce~C~pgy~~~p~~~~t~~~~~~C~---~~~C~Cn~h~~~C~~~~~~G~C~~C~~nt~G~~Ce~C~~Gyy~~p~~~~~ 306 (314)
+.||.|..|-...| +...|. .|.|.|...+..- |....|.= -|...-|-.|+.|||++-.+-.+
T Consensus 50 ptceecpegtlssp-------dqtgclncnngtchcpsqstli-frdasgnl-----ltndafcgncasgfyrndngyct 116 (747)
T KOG4611|consen 50 PTCEECPEGTLSSP-------DQTGCLNCNNGTCHCPSQSTLI-FRDASGNL-----LTNDAFCGNCASGFYRNDNGYCT 116 (747)
T ss_pred CccccCCCcccCCC-------ccCCceecCCCccCCCCcceEE-EEcCCCCe-----eccccccccccccceECCCcccc
Confidence 56888888776655 333455 2234444333221 11222322 24456788899999998765544
Q ss_pred CC
Q psy6778 307 VY 308 (314)
Q Consensus 307 ~~ 308 (314)
.|
T Consensus 117 kc 118 (747)
T KOG4611|consen 117 KC 118 (747)
T ss_pred cc
Confidence 33
No 65
>PF06219 DUF1005: Protein of unknown function (DUF1005); InterPro: IPR010410 This is a family of plant proteins with undetermined function.
Probab=22.03 E-value=55 Score=33.18 Aligned_cols=128 Identities=16% Similarity=0.186 Sum_probs=63.3
Q ss_pred CCCCCCc--cCCCcccCCCcchhhccCCceeecCccCCCCCccccCCCCCcccCCCCCCCC-CCCCceecccCCCCCcee
Q psy6778 23 LNSDPCY--ENGRPRRCIPDFVNAAFGQEVKVNSVCGLESPERYCDTSGACHVCDAGSPRG-RFPAEYLTDLNNPSNVTC 99 (314)
Q Consensus 23 ~~~~~c~--~~g~~~~C~P~~~nla~g~~v~assTCG~~~~e~yC~~~~~C~~Cd~~~p~~-~hp~~~~~D~~~~~~~tw 99 (314)
+-+||=| +.+..-.|.|.++.+--..+ +|---|+ +.+|.....+ ........|.... ..|
T Consensus 166 ~EpDPRfVFQFdgepecSPQV~QiqG~~r----------QPvFsCK-----Fs~~~~~~~~~~~sr~~~~~~~~s--r~W 228 (460)
T PF06219_consen 166 AEPDPRFVFQFDGEPECSPQVFQIQGNIR----------QPVFSCK-----FSRDRNSQRDPLRSRSLSSDPSSS--RSW 228 (460)
T ss_pred ccCCCeeEEEcCCccccCcEEEEecCccc----------cceeEEE-----EEecccCCcccccccccccccccc--ccc
Confidence 3478888 55666789999888752211 1222222 4455433111 1222344454322 345
Q ss_pred eccCCcccCCcCCCCCCceeeEe-ecCce-----eEEEEEEeeecc---CCCCCCceEEEeecCCCCCcccceeehhh
Q psy6778 100 WRSEAQTSVNSLSASPDNVTLTL-SLGKK-----FELTYISLSFCP---KSIKPDSLAIYKSQDFGKSWQPLQFYSSQ 168 (314)
Q Consensus 100 WqS~~~~~~~~~~~~~~~vtitl-~l~~~-----f~~~~v~l~f~~---~s~rP~~~~iekS~D~g~tw~p~~yya~~ 168 (314)
+.|....+.-...+.-..-.||| ||.+. +-+|-..---.- .-.-|.+|+|-|..-.| +|+||.-...+
T Consensus 229 ~~s~~~se~e~~~rERKGW~VtIHDLSGSPVAaASMvTPFVPSpGSDrVSRSNPGAWLILRP~~~g-sWkPWGRLEAW 305 (460)
T PF06219_consen 229 SSSSFGSERERPRRERKGWSVTIHDLSGSPVAAASMVTPFVPSPGSDRVSRSNPGAWLILRPDGDG-SWKPWGRLEAW 305 (460)
T ss_pred cccccccccccccccccCcEEEEEcCCCCHHHHhhhcCCccCCCCCCceeccCCCeEEEEecCCCC-Ccccchhhhhh
Confidence 56554421111111223366777 67642 333321100000 02369999999974445 99999988653
No 66
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=21.78 E-value=56 Score=26.71 Aligned_cols=11 Identities=45% Similarity=1.328 Sum_probs=8.1
Q ss_pred CCCCCCCCCccC
Q psy6778 203 KCNGHASRCIKS 214 (314)
Q Consensus 203 ~CnGHa~~C~~~ 214 (314)
+|+|| ..|...
T Consensus 14 ~CsgH-G~C~~~ 24 (103)
T PF12955_consen 14 NCSGH-GSCVKK 24 (103)
T ss_pred CCCCC-ceEeec
Confidence 58899 778743
No 67
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=21.06 E-value=3e+02 Score=30.32 Aligned_cols=17 Identities=24% Similarity=0.368 Sum_probs=11.9
Q ss_pred CCCCCCCCCCCCCCCCC
Q psy6778 287 GKDCEKCKPFFSDRPWG 303 (314)
Q Consensus 287 G~~Ce~C~~Gyy~~p~~ 303 (314)
...|..|.+|||-.+..
T Consensus 558 ~g~Ct~C~~GYyl~~~~ 574 (800)
T PTZ00214 558 NGKCTTTKKGYGISPDG 574 (800)
T ss_pred CCcccccccCcccCCCC
Confidence 35688888888876654
No 68
>cd02847 Chitobiase_C_term Chitobiase C-terminus domain. Chitobiase (AKA N-acetylglucosaminidase) digests the beta, 1-4 glycosidic bonds of the N-acetylglucosamine (NAG) oligomers found in chitin, an important structural element of fungal cell wall and arthropod exoskeletons. It is thought to proceed through an acid-base reaction mechanism, in which one protein carboxylate acts as catalytic acid, while the nucleophile is the polar acetamido group of the sugar in a substrate-assisted reaction with retention of the anomeric configuration. The C-terminus of chitobiase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chit
Probab=20.84 E-value=64 Score=24.90 Aligned_cols=23 Identities=30% Similarity=0.432 Sum_probs=17.2
Q ss_pred CCCCCceEEEeecCCCCCccccee
Q psy6778 141 SIKPDSLAIYKSQDFGKSWQPLQF 164 (314)
Q Consensus 141 s~rP~~~~iekS~D~g~tw~p~~y 164 (314)
...|. +.|+-++|.|.+|++|+-
T Consensus 28 ~~~pg-~~i~Yt~dgg~~w~~Y~~ 50 (78)
T cd02847 28 VSLPG-LTLQYSTDGGKNWNIYDA 50 (78)
T ss_pred ccCCC-cEEEEEecCCccCeeccc
Confidence 54554 567889999999998653
No 69
>PF13956 Ibs_toxin: Toxin Ibs, type I toxin-antitoxin system
Probab=20.14 E-value=62 Score=18.33 Aligned_cols=13 Identities=46% Similarity=0.679 Sum_probs=8.6
Q ss_pred CchhhHHHHHHHH
Q psy6778 1 MHFLKLLCVPLLI 13 (314)
Q Consensus 1 ~~~~~~~~~~~~~ 13 (314)
|+++++|.+.+++
T Consensus 2 Mk~vIIlvvLLli 14 (19)
T PF13956_consen 2 MKLVIILVVLLLI 14 (19)
T ss_pred ceehHHHHHHHhc
Confidence 7777777665444
Done!