Query psy12980
Match_columns 609
No_of_seqs 208 out of 253
Neff 5.3
Searched_HMMs 46136
Date Fri Aug 16 16:04:23 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy12980.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12980hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF05454 DAG1: Dystroglycan (D 100.0 4.4E-82 9.6E-87 649.3 0.1 272 322-609 1-290 (290)
2 KOG3781|consensus 100.0 2.2E-47 4.7E-52 408.1 17.6 327 214-594 227-558 (559)
3 smart00736 CADG Dystroglycan-t 99.8 4.5E-19 9.6E-24 155.9 12.6 94 218-316 2-95 (97)
4 PF05454 DAG1: Dystroglycan (D 99.7 1.4E-17 3E-22 172.6 0.0 95 2-100 18-112 (290)
5 smart00736 CADG Dystroglycan-t 99.0 7.2E-10 1.6E-14 97.5 6.8 90 118-212 2-96 (97)
6 PF05510 Sarcoglycan_2: Sarcog 98.8 3.1E-07 6.7E-12 99.1 17.8 259 222-491 22-310 (386)
7 KOG3781|consensus 98.8 1.3E-10 2.9E-15 126.4 -7.8 288 113-430 226-526 (559)
8 PF05345 He_PIG: Putative Ig d 98.3 1.4E-06 3.1E-11 68.2 6.1 47 253-301 2-49 (49)
9 KOG4482|consensus 97.9 8E-05 1.7E-09 79.5 11.3 205 222-430 28-248 (449)
10 cd00031 CA Cadherin repeat dom 96.9 0.018 4E-07 55.1 13.4 162 140-313 21-198 (199)
11 PF12877 DUF3827: Domain of un 95.8 0.064 1.4E-06 61.5 11.3 78 342-430 161-252 (684)
12 PF15102 TMEM154: TMEM154 prot 95.5 0.026 5.7E-07 53.9 5.7 37 463-499 54-90 (146)
13 PF10873 DUF2668: Protein of u 95.4 0.03 6.5E-07 53.2 5.8 22 474-495 68-89 (155)
14 PF13908 Shisa: Wnt and FGF in 95.3 0.069 1.5E-06 52.2 8.2 9 468-476 81-89 (179)
15 PF12273 RCR: Chitin synthesis 93.8 0.031 6.8E-07 52.0 1.7 27 474-500 6-32 (130)
16 smart00112 CA Cadherin repeats 93.0 0.91 2E-05 37.5 9.1 68 242-314 2-73 (79)
17 PF00028 Cadherin: Cadherin do 92.5 1.6 3.4E-05 37.2 10.3 68 240-312 20-90 (93)
18 PF02439 Adeno_E3_CR2: Adenovi 91.6 0.13 2.8E-06 38.5 2.1 11 490-500 28-38 (38)
19 PF04478 Mid2: Mid2 like cell 90.2 0.19 4.1E-06 48.4 2.4 31 467-497 50-80 (154)
20 cd00031 CA Cadherin repeat dom 89.9 3.2 7E-05 39.6 10.6 70 240-314 21-94 (199)
21 PF07495 Y_Y_Y: Y_Y_Y domain; 89.2 2.5 5.5E-05 33.8 7.9 44 267-312 20-65 (66)
22 cd00146 PKD polycystic kidney 88.6 0.96 2.1E-05 37.7 5.2 27 286-312 55-81 (81)
23 PF15050 SCIMP: SCIMP protein 88.5 0.46 1E-05 44.1 3.4 28 473-500 12-40 (133)
24 PF15418 DUF4625: Domain of un 87.2 4.3 9.2E-05 38.4 9.1 86 220-312 27-131 (132)
25 PF01102 Glycophorin_A: Glycop 86.4 0.38 8.3E-06 44.9 1.6 28 471-498 65-95 (122)
26 PF14991 MLANA: Protein melan- 84.2 0.32 6.9E-06 44.7 0.1 9 548-556 99-107 (118)
27 PF12245 Big_3_2: Bacterial Ig 81.9 3.5 7.5E-05 33.5 5.2 40 274-313 9-48 (60)
28 PF01299 Lamp: Lysosome-associ 77.8 2.2 4.7E-05 45.3 3.6 61 422-500 241-303 (306)
29 PF11669 WBP-1: WW domain-bind 75.3 2.6 5.5E-05 38.2 2.8 19 537-556 72-90 (102)
30 PF01835 A2M_N: MG2 domain; I 75.1 7.4 0.00016 33.8 5.7 59 250-310 37-99 (99)
31 PF13750 Big_3_3: Bacterial Ig 74.6 25 0.00055 34.0 9.7 34 282-315 117-150 (158)
32 PF02494 HYR: HYR domain; Int 73.5 4.9 0.00011 33.9 4.0 26 287-312 56-81 (81)
33 PF05478 Prominin: Prominin; 72.8 1.4 3.1E-05 52.8 0.8 33 470-502 96-129 (806)
34 PF06667 PspB: Phage shock pro 72.6 1.3 2.9E-05 38.0 0.3 31 472-502 6-36 (75)
35 KOG4289|consensus 72.1 1.4E+02 0.003 38.6 16.7 69 240-312 807-877 (2531)
36 PF10577 UPF0560: Uncharacteri 71.7 1.7 3.6E-05 51.6 0.9 30 466-495 268-299 (807)
37 TIGR01965 VCBS_repeat VCBS rep 69.4 17 0.00037 32.9 6.7 49 278-330 25-79 (99)
38 PHA03283 envelope glycoprotein 69.1 13 0.00028 42.5 7.1 25 480-504 411-437 (542)
39 PF02480 Herpes_gE: Alphaherpe 68.2 1.7 3.6E-05 48.8 0.0 49 225-275 143-194 (439)
40 PRK09458 pspB phage shock prot 66.8 2.6 5.7E-05 36.2 0.9 30 474-503 8-37 (75)
41 KOG1094|consensus 64.0 15 0.00033 42.8 6.5 30 471-500 392-422 (807)
42 smart00089 PKD Repeats in poly 63.3 12 0.00026 30.9 4.2 25 287-312 54-78 (79)
43 PF02439 Adeno_E3_CR2: Adenovi 63.2 4.3 9.3E-05 30.6 1.3 27 474-500 7-35 (38)
44 PF05345 He_PIG: Putative Ig d 62.5 9.6 0.00021 29.9 3.3 27 160-186 14-43 (49)
45 PF05568 ASFV_J13L: African sw 62.4 5.3 0.00011 38.5 2.1 55 482-544 43-97 (189)
46 PRK13211 N-acetylglucosamine-b 61.1 30 0.00065 39.5 8.1 64 250-314 342-406 (478)
47 PF14610 DUF4448: Protein of u 60.3 8.4 0.00018 38.0 3.3 29 466-496 158-186 (189)
48 PF00974 Rhabdo_glycop: Rhabdo 59.8 3 6.4E-05 47.6 0.0 17 230-246 146-162 (501)
49 PF13754 Big_3_4: Bacterial Ig 58.8 28 0.00061 27.6 5.4 27 286-312 22-49 (54)
50 TIGR02976 phageshock_pspB phag 58.5 3.5 7.6E-05 35.4 0.2 27 471-499 7-33 (75)
51 PF12273 RCR: Chitin synthesis 58.4 3.1 6.8E-05 38.7 -0.1 26 475-500 4-29 (130)
52 TIGR00864 PCC polycystin catio 57.0 1.7E+02 0.0037 40.1 14.7 61 245-311 1799-1864(2740)
53 KOG1219|consensus 56.0 1.2E+02 0.0026 41.2 12.4 186 112-311 949-1150(4289)
54 PF15069 FAM163: FAM163 family 55.6 9 0.00019 36.8 2.4 18 469-486 5-22 (143)
55 PF08693 SKG6: Transmembrane a 55.1 10 0.00022 29.0 2.1 23 470-495 17-39 (40)
56 PF14979 TMEM52: Transmembrane 54.7 6.1 0.00013 38.1 1.2 26 463-489 15-40 (154)
57 PF08374 Protocadherin: Protoc 54.7 11 0.00024 38.5 3.0 30 464-493 36-65 (221)
58 PF14575 EphA2_TM: Ephrin type 53.6 9.7 0.00021 32.6 2.1 13 488-500 20-32 (75)
59 KOG4331|consensus 52.4 3.9 8.5E-05 48.7 -0.6 37 463-500 103-141 (865)
60 PF01034 Syndecan: Syndecan do 51.9 5 0.00011 33.5 0.1 35 468-503 11-47 (64)
61 TIGR01965 VCBS_repeat VCBS rep 51.7 25 0.00054 31.8 4.5 17 202-218 72-88 (99)
62 PF06365 CD34_antigen: CD34/Po 50.5 13 0.00027 37.8 2.7 23 358-384 38-60 (202)
63 KOG4289|consensus 49.6 1.9E+02 0.0042 37.5 12.5 84 241-332 503-593 (2531)
64 PHA03286 envelope glycoprotein 49.0 7.1 0.00015 43.8 0.7 26 471-496 395-420 (492)
65 PHA03291 envelope glycoprotein 49.0 8.6 0.00019 41.9 1.3 29 282-313 136-164 (401)
66 PF00558 Vpu: Vpu protein; In 48.0 13 0.00028 32.5 2.1 30 470-499 8-37 (81)
67 PF05393 Hum_adeno_E3A: Human 44.7 13 0.00028 33.0 1.5 12 489-500 54-66 (94)
68 PHA03099 epidermal growth fact 44.3 17 0.00037 34.4 2.4 10 384-393 38-47 (139)
69 PF14914 LRRC37AB_C: LRRC37A/B 44.3 10 0.00022 36.7 0.9 21 475-495 129-149 (154)
70 PF13908 Shisa: Wnt and FGF in 43.1 15 0.00032 35.9 1.9 22 466-487 75-96 (179)
71 PF15065 NCU-G1: Lysosomal tra 42.2 7.2 0.00016 42.6 -0.5 29 276-304 117-145 (350)
72 PRK00269 zipA cell division pr 42.2 14 0.0003 39.4 1.6 27 475-501 9-36 (293)
73 PHA03099 epidermal growth fact 41.5 24 0.00051 33.5 2.8 16 487-502 118-133 (139)
74 PF02038 ATP1G1_PLM_MAT8: ATP1 41.3 11 0.00023 30.1 0.5 25 471-495 15-39 (50)
75 PF07204 Orthoreo_P10: Orthore 40.9 12 0.00026 33.6 0.8 7 432-438 6-12 (98)
76 PF09394 Inhibitor_I42: Chagas 40.5 59 0.0013 27.7 5.1 74 223-313 1-74 (92)
77 TIGR00864 PCC polycystin catio 39.9 2.7E+02 0.0059 38.4 12.6 61 239-314 1449-1509(2740)
78 PF07523 Big_3: Bacterial Ig-l 39.4 81 0.0018 25.7 5.5 50 250-312 18-67 (67)
79 PF08329 ChitinaseA_N: Chitina 37.3 55 0.0012 31.2 4.6 49 287-336 82-130 (133)
80 PF15471 TMEM171: Transmembran 36.0 1.2E+02 0.0026 32.3 7.2 27 463-490 156-182 (319)
81 PF15298 AJAP1_PANP_C: AJAP1/P 33.8 63 0.0014 32.8 4.6 54 465-518 98-159 (205)
82 PF05337 CSF-1: Macrophage col 33.7 14 0.0003 39.2 0.0 27 470-499 229-255 (285)
83 PF13753 SWM_repeat: Putative 33.6 5.9E+02 0.013 26.8 15.0 207 80-314 11-242 (317)
84 PF00801 PKD: PKD domain; Int 33.4 2.5E+02 0.0055 22.5 8.0 18 287-304 51-68 (69)
85 PF11770 GAPT: GRB2-binding ad 33.1 38 0.00083 32.9 2.8 17 480-496 20-36 (158)
86 PF02009 Rifin_STEVOR: Rifin/s 32.2 11 0.00025 40.3 -0.9 20 478-497 266-286 (299)
87 KOG1219|consensus 32.0 3.6E+02 0.0078 37.2 11.3 185 107-313 1048-1251(4289)
88 PF15347 PAG: Phosphoprotein a 31.9 70 0.0015 35.4 4.9 36 468-504 15-52 (428)
89 PF10164 DUF2367: Uncharacteri 31.6 33 0.00072 31.0 2.0 23 474-496 68-90 (98)
90 TIGR03660 T1SS_rpt_143 T1SS-14 31.1 3.4E+02 0.0074 25.9 8.9 31 286-316 83-113 (137)
91 PTZ00382 Variant-specific surf 30.4 40 0.00086 30.2 2.4 11 485-495 85-95 (96)
92 PF04234 CopC: CopC domain; I 30.0 3.2E+02 0.0069 23.9 8.0 62 251-313 35-97 (97)
93 cd07912 Tweety_N N-terminal do 29.6 41 0.00088 37.8 2.8 11 468-478 41-51 (418)
94 PF05915 DUF872: Eukaryotic pr 29.1 25 0.00054 32.6 0.9 25 472-496 77-105 (115)
95 PF11857 DUF3377: Domain of un 28.9 88 0.0019 27.0 4.0 31 467-499 31-61 (74)
96 PF05545 FixQ: Cbb3-type cytoc 28.7 36 0.00078 26.4 1.6 18 482-500 21-38 (49)
97 PF13860 FlgD_ig: FlgD Ig-like 28.4 1.2E+02 0.0027 25.6 5.0 12 288-299 69-80 (81)
98 PRK12633 flgD flagellar basal 28.2 2.1E+02 0.0045 29.6 7.4 25 279-305 166-190 (230)
99 PTZ00364 dipeptidyl-peptidase 27.9 48 0.001 38.5 3.1 38 463-500 468-514 (548)
100 COG4736 CcoQ Cbb3-type cytochr 27.6 30 0.00066 28.6 1.0 28 490-528 28-55 (60)
101 PF06679 DUF1180: Protein of u 26.6 47 0.001 32.7 2.3 16 498-513 127-142 (163)
102 PF15415 DUF4622: Protein of u 26.3 1.5E+02 0.0032 31.3 5.8 147 207-385 148-305 (310)
103 PF09680 Tiny_TM_bacill: Prote 25.3 66 0.0014 22.1 2.1 16 474-491 8-23 (24)
104 PF09315 DUF1973: Domain of un 25.1 4.7E+02 0.01 26.0 9.0 76 250-332 42-120 (179)
105 PF12768 Rax2: Cortical protei 24.9 88 0.0019 33.3 4.1 22 474-495 237-258 (281)
106 PRK06439 NADH dehydrogenase su 24.6 66 0.0014 27.7 2.5 29 467-495 43-71 (72)
107 TIGR03370 PEPCTERM_Roseo varia 24.6 33 0.00072 23.9 0.6 18 480-497 8-25 (26)
108 COG5513 Predicted secreted pro 24.5 5.9E+02 0.013 23.8 9.5 90 224-329 22-111 (113)
109 PLN03150 hypothetical protein; 24.5 29 0.00062 40.6 0.5 7 308-314 340-346 (623)
110 PF08496 Peptidase_S49_N: Pept 24.3 59 0.0013 31.6 2.5 32 472-503 14-45 (155)
111 TIGR00868 hCaCC calcium-activa 24.0 6.2E+02 0.013 31.4 11.3 102 223-332 495-604 (863)
112 PF06365 CD34_antigen: CD34/Po 23.9 73 0.0016 32.5 3.2 26 469-495 101-128 (202)
113 PRK12634 flgD flagellar basal 23.8 1.5E+02 0.0033 30.4 5.5 26 280-307 160-185 (221)
114 PF05140 ResB: ResB-like famil 23.7 42 0.0009 37.7 1.6 33 464-496 396-431 (464)
115 PF00974 Rhabdo_glycop: Rhabdo 23.7 26 0.00057 40.1 0.0 9 462-470 449-457 (501)
116 PF08372 PRT_C: Plant phosphor 23.4 51 0.0011 32.2 1.9 24 520-543 36-59 (156)
117 PHA03289 envelope glycoprotein 23.1 2.6E+02 0.0056 30.6 7.1 29 282-313 129-157 (352)
118 KOG3521|consensus 22.8 1.5E+02 0.0032 35.4 5.6 38 504-551 768-807 (846)
119 PF01601 Corona_S2: Coronaviru 22.3 29 0.00063 40.4 0.0 24 121-144 146-170 (610)
120 KOG4221|consensus 22.1 64 0.0014 40.7 2.7 34 460-494 1052-1085(1381)
121 PF05808 Podoplanin: Podoplani 22.0 30 0.00065 34.0 0.0 35 465-501 124-161 (162)
122 PF12669 P12: Virus attachment 21.7 64 0.0014 26.3 1.9 8 491-498 20-27 (58)
123 PF15102 TMEM154: TMEM154 prot 21.6 60 0.0013 31.4 1.9 40 461-500 49-88 (146)
124 PF14967 FAM70: FAM70 protein 21.5 2.2E+02 0.0048 30.8 6.2 41 449-489 175-216 (327)
125 PF10577 UPF0560: Uncharacteri 21.4 32 0.00068 41.4 0.0 30 460-489 267-297 (807)
126 PF13750 Big_3_3: Bacterial Ig 20.5 2.1E+02 0.0045 27.7 5.5 36 288-324 15-51 (158)
No 1
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=100.00 E-value=4.4e-82 Score=649.26 Aligned_cols=272 Identities=42% Similarity=0.692 Sum_probs=1.0
Q ss_pred eEEEEEecCCCCCcCchHHH-HHHHHHHHHHhCCCCCCcEEEEeeecCCCCCCCcEEEEEEecCCCCCCCCCHHHHHHHH
Q psy12980 322 ASFSLTLARPSSTFAQPAAQ-RAFLEKLAGLFGDRNASKIVNIEFEDNPQEPGKSTVVRWANRTLASNVECPETTIGLLR 400 (609)
Q Consensus 322 vef~L~l~~~~~~ft~~~~q-~~LVekLA~llGD~d~s~ItVrsI~~~~~~~~gSTvV~w~N~TLp~~~~CP~~eI~~L~ 400 (609)
|+|+++|++++++|+++.++ ++||++||.+|||+|+++|+||+|+ +|||+|+|||+||+. +.||+++|++|+
T Consensus 1 ~~F~~~l~~d~~~f~~dv~~ki~lVekLA~~~GD~nts~ItV~sIt------~gstiVtwtNnTLp~-~~CP~eeI~~L~ 73 (290)
T PF05454_consen 1 VEFSATLDIDYESFNNDVQRKILLVEKLARLFGDRNTSSITVRSIT------SGSTIVTWTNNTLPT-SPCPKEEIEKLR 73 (290)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred CeeEEEEcCCHHHhhhhHHHHHHHHHHHHHHhCCCCCCeEEEEEec------CCCEEEEEEcCCCCC-CCCCHHHHHHHH
Confidence 68999999999999986544 7899999999999999999999999 899999999999998 899999999999
Q ss_pred HHHhhccchhhHHHHHhhcccceeeEEeeeeccccCCCCCCCCCCCCCCCCC------CCCCCCCCCCCccEEEEeehHH
Q psy12980 401 EVLLTEDDLLQESLVTSLEKDYGVKHASVIPTGICEGLKTPLHTPGVERKPA------YSGKNTPANASVEYLITFVAPL 474 (609)
Q Consensus 401 ~kL~~~dg~~s~~f~~am~peF~V~~Vs~~g~G~C~~~t~~~~~pe~~~p~~------~~~~~~~~na~~~~Llv~ViPa 474 (609)
++|.+++|+++++|+++|+|+|+|++|++.++|+|++.......+.+..|.. +......+.++++||+ ++||+
T Consensus 74 ~~L~~~~g~~~~~f~~am~pef~V~svsv~~~G~C~~~~~ip~~~~~~iP~~~~~t~~p~~~~~~~s~~d~yL~-T~Ipa 152 (290)
T PF05454_consen 74 KRLVDDDGKPSQEFVRAMGPEFKVKSVSVIPIGSCQDTSFIPTPPDSEIPTPAPPTNTPDEPSPKSSFSDDYLH-TFIPA 152 (290)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHhcCCCCcCHHHHHHhCCCCceeEEEEEEeeccCCCccCCCCCcCcCCCCCCCCCCCCCCCCCcccccchHH-HHHHH
Confidence 9999999999999999999999999999999999999766543222223321 1111123444565655 55799
Q ss_pred HHHHHHHHHHHHhheeeecccCCCCcccCCCcccccccccCCCCEEcccccccccCCCCCCCCCeeccCCCCCCCCCCCC
Q psy12980 475 VVIVIMLLCAALIACLLYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEERLEVSSGTKSPIILKEEKPPLPPPEYN 554 (609)
Q Consensus 475 VIIa~mllia~iIaC~cyRrKRkgK~~i~d~deE~~tF~~KGiPvIF~dElde~~~~~~~~~~P~IlkeEkpplpppey~ 554 (609)
|||++|||||++||||||||||||||++ |||++||+|||||||||||||. ||++++|+||||||||||||||||+
T Consensus 153 VVI~~iLLIA~iIa~icyrrkR~GK~~~----ee~~~f~~KGiPvIF~dElee~-kp~~~~~~P~IlkeEkPPl~pp~y~ 227 (290)
T PF05454_consen 153 VVIAAILLIAGIIACICYRRKRKGKMSL----EEQKTFISKGIPVIFQDELEES-KPEPGSKSPVILKEEKPPLPPPEYP 227 (290)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHhhhhhhcccccc----chhHHHHhcCCceecccccccc-CCCCCCCCCeeecccCCCCCCCCCC
Confidence 9999999999999999999999999998 5699999999999999999995 4999999999999999999999999
Q ss_pred CCCCCCCCCcCccCCCCCCccccccCCCCCCCCCCCCCCC---------C--CCCCCCCCCCCCCC
Q psy12980 555 RDKNDFSRDKNDYSRSQPNATTALLSDIDDTSPYHPPPPI---------R--PAPTYRLPPPYSPP 609 (609)
Q Consensus 555 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~pppp~---------r--~~~~yr~pppyvpp 609 (609)
+++++++++|+.++.+ +..++.++|+|++|||||||| | +||+||+|||||||
T Consensus 228 ~~~~~~~~p~~~~~~~---~~~~~~~~~~~~~~y~ppp~~~~~~~~~~~~p~~~~~yr~PPpyvpp 290 (290)
T PF05454_consen 228 NSNMPESTPLNQDTLG---EYTPLRDEDPDAPPYQPPPPFTSPGEGRRSRPKNTPPYRSPPPYVPP 290 (290)
T ss_dssp -------------------------------------------------------S----------
T ss_pred CCCCCCCCCccccccc---cccccccCCCCCCccCCCCCcccCCCCCCCCCCCCCCCCCCCCCCCC
Confidence 9999999988433311 112767777899999999999 3 69999999999998
No 2
>KOG3781|consensus
Probab=100.00 E-value=2.2e-47 Score=408.06 Aligned_cols=327 Identities=25% Similarity=0.367 Sum_probs=289.7
Q ss_pred ceecCccceeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEE
Q psy12980 214 PLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQ 293 (609)
Q Consensus 214 Pvv~n~i~~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~ 293 (609)
|++.|++..+++++|+.|...||+|||.|.||+++..+++...+.||.. .|++|+..++-++|+|++ +|+++|.
T Consensus 227 P~l~n~~~~~~~~~g~~~~m~IpSdTf~dsed~~~f~~kl~ilsld~ae----vwv~f~sNskd~~glPms--~gkf~y~ 300 (559)
T KOG3781|consen 227 PLLKNQVLRVNCLSGQHCHMHIPSDTFEDSEDITSFIIKLEILSLDGAE----VWVKFVSNSKDSIGLPMS--CGKFHYV 300 (559)
T ss_pred ccccCcceeeeeecchhhheeCCccchhhhhhcccceeeeeeecCCCce----eEEEeccCCccccccccc--ccceeee
Confidence 9999999999999999999999999999999999999999999999976 899999988999999999 6999999
Q ss_pred EEEEcCCCCeeEeeEEEEEECCCCCCcceEEEEEecCCCCCcCchHH-HHHHHHHHHHHhCCCCCCcEEEEeeecCCCCC
Q psy12980 294 LVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFAQPAA-QRAFLEKLAGLFGDRNASKIVNIEFEDNPQEP 372 (609)
Q Consensus 294 LtATDs~G~sasd~F~V~V~~~p~~~~~vef~L~l~~~~~~ft~~~~-q~~LVekLA~llGD~d~s~ItVrsI~~~~~~~ 372 (609)
+.|+|.+|++++|+|+|+|.+.+..++.|.|.+.+..+ +++.+++. +-.|++.||..+|++. ..|+++.|...
T Consensus 301 lea~dk~Gq~a~~pf~vnv~~~~s~n~~~~f~~~~p~d-~~~~~np~~rt~Lv~~La~a~~~~~-~~itLq~I~~~---- 374 (559)
T KOG3781|consen 301 LEASDKDGQSATDPFVVNVIPLPSANHLHTFHFPFPPD-TFYDQNPGNRTHLVLELASALGYPS-GEITLQFIMTI---- 374 (559)
T ss_pred eeeecCCCCcccCCeEEeeeecCCCccceEEeccCCCc-hhhhcCCcchhHHHHHHHHhhcCCc-cchhhhhhhhh----
Confidence 99999999999999999999998888988888887755 45555555 4689999999999986 69999999842
Q ss_pred CCcEEEEEEecCCCCCCCCCHHHHHHHHH-HHhhccchhhHHHHHhhcccceeeEEeeeeccccCCCCCCCCCCCCCCCC
Q psy12980 373 GKSTVVRWANRTLASNVECPETTIGLLRE-VLLTEDDLLQESLVTSLEKDYGVKHASVIPTGICEGLKTPLHTPGVERKP 451 (609)
Q Consensus 373 ~gSTvV~w~N~TLp~~~~CP~~eI~~L~~-kL~~~dg~~s~~f~~am~peF~V~~Vs~~g~G~C~~~t~~~~~pe~~~p~ 451 (609)
.++++++|.|+++|+ +.|+.++|+.+.. ++..++|+++.+|.+.|+++|+++.+++..+|+|.+..++...
T Consensus 375 r~~~~vsf~nnslp~-e~C~s~qI~~va~kmi~~~~~rpr~~f~~tm~~dfk~kt~s~~k~gnc~hl~fI~lt------- 446 (559)
T KOG3781|consen 375 RYGTPVSFDNNSLPY-EFCDSDQIGLVAQKMITIELGRPRSSFCHTMGLDFKVKTASFIKRGNCIHLFFIKLT------- 446 (559)
T ss_pred ccceeEEeecccCcc-ccCcHHHHHHHhhhhhHHhcCCccchhhhhccccceeeEEEeeccccccceeeeecc-------
Confidence 789999999999999 8899999999995 6677788999999999999999999999999999876433210
Q ss_pred CCCCCCCCCCCCccEEEEeehHHHHHHHHHHHHHHhheeeecccCCCCcccCCCcccccccccCCCCEEcccccccccCC
Q psy12980 452 AYSGKNTPANASVEYLITFVAPLVVIVIMLLCAALIACLLYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEERLEV 531 (609)
Q Consensus 452 ~~~~~~~~~na~~~~Llv~ViPaVIIa~mllia~iIaC~cyRrKRkgK~~i~d~deE~~tF~~KGiPvIF~dElde~~~~ 531 (609)
-||++..+.|.++++.++..|.||+|+| .++|||++.++ ..|.+||.||||.+|+++
T Consensus 447 ------------PD~~l~~~~~~l~l~tl~t~a~Ii~~~~--~lk~Gk~ks~~-----~~~~kk~lpIvf~lel~d---- 503 (559)
T KOG3781|consen 447 ------------PDYLLSSNPPQLVLITLRTGATIISWIC--ALKKGKSKSLK-----WCPNKKILPIVFKLELPD---- 503 (559)
T ss_pred ------------chhhhccCCceeeehhhHhhhHHHHhHH--HHhhccccccc-----eeeccCceeEEEEeecCc----
Confidence 1788889999999999999999999998 67789999875 799999999999999998
Q ss_pred CCCCCCCeeccCCCCCCCCCCCCCCCCCCCCCc---CccCCCCCCccccccCCCCCCCCCCCCCCC
Q psy12980 532 SSGTKSPIILKEEKPPLPPPEYNRDKNDFSRDK---NDYSRSQPNATTALLSDIDDTSPYHPPPPI 594 (609)
Q Consensus 532 ~~~~~~P~IlkeEkpplpppey~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~y~pppp~ 594 (609)
++.+|+++|||||||++|+|+|. -..+.+| |+|. .....++++++||+|+|+
T Consensus 504 --~~~~pmlvkeek~plk~~~~~N~-i~~t~pl~t~g~~~--------l~~~nden~p~~q~p~p~ 558 (559)
T KOG3781|consen 504 --GNVHPMLVKEEKPPLKIPTVENI-ICLTLPLQTIGNSI--------LTSSNDENSPWIQNPIPA 558 (559)
T ss_pred --cCCchhHHHhhcCCCCccccCcc-cccCCChhhcCCce--------ecccCCccCCcccCCCCC
Confidence 59999999999999999999987 5555555 5554 334456799999999986
No 3
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=99.80 E-value=4.5e-19 Score=155.88 Aligned_cols=94 Identities=30% Similarity=0.558 Sum_probs=87.8
Q ss_pred CccceeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEEEEEE
Q psy12980 218 NPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCK 297 (609)
Q Consensus 218 n~i~~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtAT 297 (609)
|.|.+++|++|+.|+|+||.++|.|.| |+ +|+|++...+|.+|| +||+||+.+++|+|+|...++|.|+|+|+|+
T Consensus 2 ~~i~~~~a~~g~~f~~~i~~~tF~d~d-~~--~lty~~~~~~~~~lP--~Wl~fd~~~~~~~GtP~~~~~g~~~i~v~a~ 76 (97)
T smart00736 2 NAIGDQTATEGESFSYTIPSSTFTDAD-GD--TLTYSATLSDGSALP--SWLSFDSDTGTLSGTPTNSDVGSLSLKVTAT 76 (97)
T ss_pred CcccceEEeCCCeEEEEcChhceECCC-CC--eEEEEEEeCCCCCCC--CeEEEeCCCCEEEEECCCCCCcEEEEEEEEE
Confidence 678999999999999999999999996 77 888888888888887 8999999999999999999999999999999
Q ss_pred cCCCCeeEeeEEEEEECCC
Q psy12980 298 DSNGLMANDGLEVIVKSPG 316 (609)
Q Consensus 298 Ds~G~sasd~F~V~V~~~p 316 (609)
|+.|.++++.|+|.|.+++
T Consensus 77 D~~g~~~~~~f~i~V~~~~ 95 (97)
T smart00736 77 DSSGASASDTFTITVVNTN 95 (97)
T ss_pred ECCCCEEEEEEEEEEeCCC
Confidence 9999999999999998753
No 4
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=99.65 E-value=1.4e-17 Score=172.56 Aligned_cols=95 Identities=33% Similarity=0.610 Sum_probs=0.0
Q ss_pred cchhhHHHHHHHHHhCCCCCcccEEEEeeecCCCcEEEEEecCCCCCCCCCHHHHHHHHHHhccCCCCCchHHHHHhhcc
Q psy12980 2 VDWQIFVYNILLDLYGNSGNSSYITVRSVVTNKEPYVFTYTNESLPRNECPTAQINNLLEVISNTTTGQAKPVLTRALGR 81 (609)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~f~~~~~~~p~~~c~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 81 (609)
++|+++++++||++|||. |+++|+||+|+ ++.+.|+|+|.|||.+.||+++|++|.++|.+. .|+.+++|+++|+|
T Consensus 18 v~~ki~lVekLA~~~GD~-nts~ItV~sIt--~gstiVtwtNnTLp~~~CP~eeI~~L~~~L~~~-~g~~~~~f~~am~p 93 (290)
T PF05454_consen 18 VQRKILLVEKLARLFGDR-NTSSITVRSIT--SGSTIVTWTNNTLPTSPCPKEEIEKLRKRLVDD-DGKPSQEFVRAMGP 93 (290)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHhCCC-CCCeEEEEEec--CCCEEEEEEcCCCCCCCCCHHHHHHHHHHHhcC-CCCcCHHHHHHhCC
Confidence 688999999999999998 99999999998 578999999999999999999999999999977 99999999999999
Q ss_pred ceeeeeeeeecccccCCCC
Q psy12980 82 LMTVKNITWRGIGHCEDSP 100 (609)
Q Consensus 82 ~~~vk~~~~~~~g~~~~~~ 100 (609)
++.|+++++.++|+|....
T Consensus 94 ef~V~svsv~~~G~C~~~~ 112 (290)
T PF05454_consen 94 EFKVKSVSVIPIGSCQDTS 112 (290)
T ss_dssp -------------------
T ss_pred CCceeEEEEEEeeccCCCc
Confidence 9999999999999999854
No 5
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=99.00 E-value=7.2e-10 Score=97.55 Aligned_cols=90 Identities=22% Similarity=0.266 Sum_probs=67.4
Q ss_pred CcCccceeecCccEEEEcCCCCcccCCCCCeeeEEEEEecc---cceecccC--ccCcCccccccccceeeEEEeeeecc
Q psy12980 118 NPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTM---DRYVISNT--TTGQAKPVLTRALGRLMTVKNITWRG 192 (609)
Q Consensus 118 ~pi~~~~a~e~~~fs~~iP~~tF~D~d~Gdt~~ltlsl~~~---d~~~~~~t--~T~sgtp~~~d~~~~~v~v~~~~~~~ 192 (609)
|.|.+++|++|+.|+|+||.++|.|.| |++.+|++++..+ -+|+.||. ++++|+|..++.+..++.|+..+..+
T Consensus 2 ~~i~~~~a~~g~~f~~~i~~~tF~d~d-~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~GtP~~~~~g~~~i~v~a~D~~g 80 (97)
T smart00736 2 NAIGDQTATEGESFSYTIPSSTFTDAD-GDTLTYSATLSDGSALPSWLSFDSDTGTLSGTPTNSDVGSLSLKVTATDSSG 80 (97)
T ss_pred CcccceEEeCCCeEEEEcChhceECCC-CCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEECCCCCCcEEEEEEEEEECCC
Confidence 678999999999999999999999998 8875555544332 25999976 66789999999988888888766654
Q ss_pred cccccCCCCCCceecccCCC
Q psy12980 193 IGHCEDSPPPPVVKVKLENQ 212 (609)
Q Consensus 193 ~~~~~~~~~~~tv~v~~~N~ 212 (609)
.. +.. .+++.|.+.|+
T Consensus 81 ~~-~~~---~f~i~V~~~~~ 96 (97)
T smart00736 81 AS-ASD---TFTITVVNTND 96 (97)
T ss_pred CE-EEE---EEEEEEeCCCC
Confidence 22 211 36666666665
No 6
>PF05510 Sarcoglycan_2: Sarcoglycan alpha/epsilon; InterPro: IPR008908 Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear []. This family contains alpha and epsilon members.; GO: 0016012 sarcoglycan complex
Probab=98.76 E-value=3.1e-07 Score=99.08 Aligned_cols=259 Identities=15% Similarity=0.220 Sum_probs=158.7
Q ss_pred eeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCC----ceEEeecCCCCcee-EEEEEEE
Q psy12980 222 HLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKN----QEFYGIPQLTDLGR-REYQLVC 296 (609)
Q Consensus 222 ~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t----~tf~G~P~~~dvG~-~~f~LtA 296 (609)
.+++.+|..|.|+|=..+|-.-.++.+...+++....+-=.|| +||.|-..+ +-+||+|...++|+ ..+.|+|
T Consensus 22 ~~~~~~g~~Fv~~l~~~~F~~~~~~~~~~~~f~~sL~g~PDLP--~WLry~q~~~~~~GfLYGTP~~~~~g~~~~iEVia 99 (386)
T PF05510_consen 22 TLTALVGELFVHELEPETFNWTFQPSSDQFTFRASLQGYPDLP--SWLRYTQSHPYHPGFLYGTPTSEDAGKPVVIEVIA 99 (386)
T ss_pred eeccccceEEEEEeCHHHCCCcCCCCcceEEEecccCCCCCCc--chhhhccCCCCCCeeEecCCCCccCCCceEEEEEE
Confidence 6789999999999999999555556666788888755443466 999998875 47999999999999 9999999
Q ss_pred EcCCCC-eeEeeEEEEEECCCC-CCcceEEEEEecCCCCCcCchHHHHHHHHHHHHHhCCCCCCcEEEEeeecCCCC---
Q psy12980 297 KDSNGL-MANDGLEVIVKSPGP-AYYSASFSLTLARPSSTFAQPAAQRAFLEKLAGLFGDRNASKIVNIEFEDNPQE--- 371 (609)
Q Consensus 297 TDs~G~-sasd~F~V~V~~~p~-~~~~vef~L~l~~~~~~ft~~~~q~~LVekLA~llGD~d~s~ItVrsI~~~~~~--- 371 (609)
-++..- ++...+.|.|..... ..+.+||.+.- -+.+.+.....+..|...+..+...+ ..++.|.+|+.. .+
T Consensus 100 ~Nr~tyet~r~~lvi~i~~~~~~~~~qae~~I~n-lnvEdll~~~~~~~l~~~~r~lW~~~-~~~L~v~~i~SA-ld~Gg 176 (386)
T PF05510_consen 100 LNRDTYETRRQVLVINIGEKEGPTPYQAEFKIDN-LNVEDLLDSERIERLKNIFRKLWPNS-PEDLYVVNITSA-LDRGG 176 (386)
T ss_pred eecccceeeeEEEEEEeccCCCCCCeEEEEEEcC-CCHHHhcChhHHHHHHHHHHHHhhcc-cCCceEEeeeeh-hhccC
Confidence 998653 555577777766432 56777777652 33456665445677777888888865 479999888642 22
Q ss_pred --------CCCcEEEEEEecCCCCCCCCCHHHHHHHHHHHhhccc-hhhHHHHHhhccccee--eEEeeeeccccCC---
Q psy12980 372 --------PGKSTVVRWANRTLASNVECPETTIGLLREVLLTEDD-LLQESLVTSLEKDYGV--KHASVIPTGICEG--- 437 (609)
Q Consensus 372 --------~~gSTvV~w~N~TLp~~~~CP~~eI~~L~~kL~~~dg-~~s~~f~~am~peF~V--~~Vs~~g~G~C~~--- 437 (609)
+.| ++| .+.|-..|+.|=.+-+.-|.+. .=..+ .+-....+.+.+.|+| |++++...-.=..
T Consensus 177 RvPL~~~~~EG-V~V--~vGS~~~FS~~L~~lv~pl~~~-~C~~k~t~v~~~~~~f~p~F~iDWC~f~Lv~~s~~~~~~~ 252 (386)
T PF05510_consen 177 RVPLPPNQKEG-VYV--KVGSQAPFSPCLLELVRPLQKL-SCPYKRTSVESCFRNFAPGFQIDWCNFRLVDLSSPEPMPE 252 (386)
T ss_pred CCCCCccccce-EEE--EecCCCCcCHHHHHHhChHhhc-cCCCCCCcceecccccCCCceeeeeeeEEEccCcCCcCCc
Confidence 122 222 4555555677766554444444 22222 3333333445677775 4555432222100
Q ss_pred CCCC-CC---C-CCCCC-CCCCCCCCCCCCCCccEEEEeehHHHHHHHHHHHHHHhheee
Q psy12980 438 LKTP-LH---T-PGVER-KPAYSGKNTPANASVEYLITFVAPLVVIVIMLLCAALIACLL 491 (609)
Q Consensus 438 ~t~~-~~---~-pe~~~-p~~~~~~~~~~na~~~~Llv~ViPaVIIa~mllia~iIaC~c 491 (609)
.... +. . .+..+ |+ .......+-.++++++++||++|.++++++.+.|-||.
T Consensus 253 ~~~~~~g~gi~~~~~~~~Pp--~~~~p~R~y~~d~~vtl~iPl~i~llL~llLs~Imc~r 310 (386)
T PF05510_consen 253 QPTQPYGSGILEEDGWFNPP--KESVPGRDYFPDFLVTLAIPLIIALLLLLLLSYIMCCR 310 (386)
T ss_pred ccccccCCCccCCCCCCCCC--cccccccccHHHHHHHHHHHHHHHHHHHHHHHHHheec
Confidence 0000 00 0 00001 11 11112233455677778888877776666666665553
No 7
>KOG3781|consensus
Probab=98.76 E-value=1.3e-10 Score=126.44 Aligned_cols=288 Identities=21% Similarity=0.289 Sum_probs=189.2
Q ss_pred CCcccCcCccceeecCccEEEEcCCCCcccCCCCCeeeEEEEEecccc---eecccCc--cCcCccccccccceeeEEEe
Q psy12980 113 VPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDR---YVISNTT--TGQAKPVLTRALGRLMTVKN 187 (609)
Q Consensus 113 ~P~v~~pi~~~~a~e~~~fs~~iP~~tF~D~d~Gdt~~ltlsl~~~d~---~~~~~t~--T~sgtp~~~d~~~~~v~v~~ 187 (609)
-|++.|++...++++|++|...||.|||.|.+++++.++.+.....|| |..|+.. -..|=|-. .|...--++.
T Consensus 226 ~P~l~n~~~~~~~~~g~~~~m~IpSdTf~dsed~~~f~~kl~ilsld~aevwv~f~sNskd~~glPms--~gkf~y~lea 303 (559)
T KOG3781|consen 226 TPLLKNQVLRVNCLSGQHCHMHIPSDTFEDSEDITSFIIKLEILSLDGAEVWVKFVSNSKDSIGLPMS--CGKFHYVLEA 303 (559)
T ss_pred CccccCcceeeeeecchhhheeCCccchhhhhhcccceeeeeeecCCCceeEEEeccCCccccccccc--ccceeeeeee
Confidence 389999999999999999999999999999999999999999999998 9999554 33466654 2333222221
Q ss_pred eeecccccccCCCCCCceecccCCCCceecCccceeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCc
Q psy12980 188 ITWRGIGHCEDSPPPPVVKVKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTH 267 (609)
Q Consensus 188 ~~~~~~~~~~~~~~~~tv~v~~~N~aPvv~n~i~~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~s 267 (609)
.+-+ | ..+..|++-|.++...+..+..|.+..|.++|+|-++|..--|..+|..+-|.+.. ..
T Consensus 304 ~dk~--G--------------q~a~~pf~vnv~~~~s~n~~~~f~~~~p~d~~~~~np~~rt~Lv~~La~a~~~~~~-~i 366 (559)
T KOG3781|consen 304 SDKD--G--------------QSATDPFVVNVIPLPSANHLHTFHFPFPPDTFYDQNPGNRTHLVLELASALGYPSG-EI 366 (559)
T ss_pred ecCC--C--------------CcccCCeEEeeeecCCCccceEEeccCCCchhhhcCCcchhHHHHHHHHhhcCCcc-ch
Confidence 1111 1 24567888888999999999999999999999999999843477777778888877 78
Q ss_pred cceeccCCceEEeecCCCCceeEEEEEEEEcCCCCeeEeeEEEEEECCCCCCcceEEEEEecCCCCCcCchHHH---HHH
Q psy12980 268 WLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFAQPAAQ---RAF 344 (609)
Q Consensus 268 WL~Fd~~t~tf~G~P~~~dvG~~~f~LtATDs~G~sasd~F~V~V~~~p~~~~~vef~L~l~~~~~~ft~~~~q---~~L 344 (609)
||||.-..+ ||++.+=+--..-|...++|..|+.+.--...+-.. |.-...+-+..++....-+|... .+ ..|
T Consensus 367 tLq~I~~~r--~~~~vsf~nnslp~e~C~s~qI~~va~kmi~~~~~r-pr~~f~~tm~~dfk~kt~s~~k~-gnc~hl~f 442 (559)
T KOG3781|consen 367 TLQFIMTIR--YGTPVSFDNNSLPYEFCDSDQIGLVAQKMITIELGR-PRSSFCHTMGLDFKVKTASFIKR-GNCIHLFF 442 (559)
T ss_pred hhhhhhhhc--cceeEEeecccCccccCcHHHHHHHhhhhhHHhcCC-ccchhhhhccccceeeEEEeecc-ccccceee
Confidence 999997633 444433322334444477887777775555444433 11222222222222222223221 12 133
Q ss_pred HHHHHHHhCCCCCCcEEEEeeecCCCCCCCcEEEEEEe-----cCCCCCCCCCHHHHHHHHHHHhhccchhhHHHHHhhc
Q psy12980 345 LEKLAGLFGDRNASKIVNIEFEDNPQEPGKSTVVRWAN-----RTLASNVECPETTIGLLREVLLTEDDLLQESLVTSLE 419 (609)
Q Consensus 345 VekLA~llGD~d~s~ItVrsI~~~~~~~~gSTvV~w~N-----~TLp~~~~CP~~eI~~L~~kL~~~dg~~s~~f~~am~ 419 (609)
|.-.-+++.-.+...+.|+.+. .+++++.|.+ ++-.. +.|+-++|-...=++.-++|....-+.+.+.
T Consensus 443 I~ltPD~~l~~~~~~l~l~tl~------t~a~Ii~~~~~lk~Gk~ks~-~~~~~kk~lpIvf~lel~d~~~~pmlvkeek 515 (559)
T KOG3781|consen 443 IKLTPDYLLSSNPPQLVLITLR------TGATIISWICALKKGKSKSL-KWCPNKKILPIVFKLELPDGNVHPMLVKEEK 515 (559)
T ss_pred eeccchhhhccCCceeeehhhH------hhhHHHHhHHHHhhcccccc-ceeeccCceeEEEEeecCccCCchhHHHhhc
Confidence 3222144444556788888888 7888999988 11112 3577666654444454455666677788777
Q ss_pred ccceeeEEeee
Q psy12980 420 KDYGVKHASVI 430 (609)
Q Consensus 420 peF~V~~Vs~~ 430 (609)
+.+++-.+.-.
T Consensus 516 ~plk~~~~~N~ 526 (559)
T KOG3781|consen 516 PPLKIPTVENI 526 (559)
T ss_pred CCCCccccCcc
Confidence 77777665543
No 8
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=98.31 E-value=1.4e-06 Score=68.18 Aligned_cols=47 Identities=21% Similarity=0.372 Sum_probs=40.9
Q ss_pred EEEeccCCCCCCCCccceeccCCceEEeecCCC-CceeEEEEEEEEcCCC
Q psy12980 253 LHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLT-DLGRREYQLVCKDSNG 301 (609)
Q Consensus 253 lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~-dvG~~~f~LtATDs~G 301 (609)
+++....+..|| +||.||+.+++|+|.|... ..|.|.|.|+|+|..|
T Consensus 2 ys~~~~~~~~LP--~gLs~d~~tG~isGtp~~~~~~G~y~~~vtatd~~G 49 (49)
T PF05345_consen 2 YSLTTPTGGGLP--SGLSLDPSTGTISGTPTSSVQPGTYTFTVTATDGSG 49 (49)
T ss_pred EEEEcCCCCCCC--CcEEEeCCCCEEEeecCCCccccEEEEEEEEEcCCC
Confidence 445556677788 9999999999999999988 5699999999999987
No 9
>KOG4482|consensus
Probab=97.90 E-value=8e-05 Score=79.51 Aligned_cols=205 Identities=15% Similarity=0.213 Sum_probs=116.9
Q ss_pred eeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCC----ceEEeecCCCCceeEEEEEEEE
Q psy12980 222 HLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKN----QEFYGIPQLTDLGRREYQLVCK 297 (609)
Q Consensus 222 ~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t----~tf~G~P~~~dvG~~~f~LtAT 297 (609)
.+.+++|..|.++|-..+|.-----.+-..++......--.+| +||.+=++. +-+||+|.....|+..+.|+|-
T Consensus 28 ~v~a~vG~lFvhtLe~e~F~~~h~p~~~~v~y~a~L~~yPdlP--~WLRy~~s~~yh~gfLyGtp~ae~~gk~~leVtA~ 105 (449)
T KOG4482|consen 28 HVTATVGKLFVHTLESEHFFWTHQPSPVKVKYEATLNGYPDLP--NWLRYLQSRPYHIGFLYGTPTAEVQGKVTLEVTAY 105 (449)
T ss_pred eeecccceEEEEeccHHHhcCcCcCCcceeEeehhccCCCCch--hHHHhcccCCCCcceEecCCCcccCceeEEEEEEe
Confidence 4689999999999999998832111223456666544333356 999998875 5899999999999999999998
Q ss_pred cCCCCee-EeeEEEEE--ECCCCCCcceEEEEEecCCCCCcCchHHHHHHHHHHHH-HhCCCCCCcEEEEeeecC-----
Q psy12980 298 DSNGLMA-NDGLEVIV--KSPGPAYYSASFSLTLARPSSTFAQPAAQRAFLEKLAG-LFGDRNASKIVNIEFEDN----- 368 (609)
Q Consensus 298 Ds~G~sa-sd~F~V~V--~~~p~~~~~vef~L~l~~~~~~ft~~~~q~~LVekLA~-llGD~d~s~ItVrsI~~~----- 368 (609)
....--. .--..|++ .......+.+||.+. .-+.+.+......-.|...|.. +-|-.+-+.|-+.+...-
T Consensus 106 nrdnfdt~r~~vVini~~ae~fptp~v~e~~In-nldveell~s~~rs~l~nalrn~lw~~e~l~~i~~tsAld~ggRvp 184 (449)
T KOG4482|consen 106 NRDNFDTQRRMVVINIVEAELFPTPYVAEFPIN-NLDVEELLSSRDRSQLLNALRNTLWGPEVLPYINGTSALDRGGRVP 184 (449)
T ss_pred cccchhhhhheEEEEEeccccCCCcceeeeeec-CCCHHHhcCcccHHHHHHHHhccccCcccccceeeeehhhcCCCCC
Confidence 8754322 21233333 333456677788755 2233444432122355555555 333223345554443210
Q ss_pred -CCCCCCcEEEEEEecCCCCCCCCCHHHHHHHHHHHhhccchhhHHHHHhhccccee--eEEeee
Q psy12980 369 -PQEPGKSTVVRWANRTLASNVECPETTIGLLREVLLTEDDLLQESLVTSLEKDYGV--KHASVI 430 (609)
Q Consensus 369 -~~~~~gSTvV~w~N~TLp~~~~CP~~eI~~L~~kL~~~dg~~s~~f~~am~peF~V--~~Vs~~ 430 (609)
+.. +.+-=+-=++.+-..+++|=.+-.+.+...--.....+...-.+.++|.|+| |+|.+.
T Consensus 185 l~i~-~rkeGv~V~VGs~~~Fsp~l~ev~~~~~~l~c~~~~psv~~~~k~fap~F~vDWCnv~lV 248 (449)
T KOG4482|consen 185 LPIF-KRKEGVYVYVGSQQPFSPCLMEVVHPLQTLRCFQSNPSVVVCNKYFAPTFPVDWCNVALV 248 (449)
T ss_pred cCcc-cccceeEEEecccCCcchhHHHhhccccccchhccCCceEEhhhhcCCcceeeeeeeEee
Confidence 000 1111122366777778899775544333321222223344455677888886 566653
No 10
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=96.93 E-value=0.018 Score=55.11 Aligned_cols=162 Identities=15% Similarity=0.139 Sum_probs=94.2
Q ss_pred cccCCCCCeeeEEEEEecccc--eecccCccCcC--cccc----ccccceeeEEEeeeecccccccCCCCCCceecccCC
Q psy12980 140 FFDHEDGGTRKLKLHLMTMDR--YVISNTTTGQA--KPVL----TRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLEN 211 (609)
Q Consensus 140 F~D~d~Gdt~~ltlsl~~~d~--~~~~~t~T~sg--tp~~----~d~~~~~v~v~~~~~~~~~~~~~~~~~~tv~v~~~N 211 (609)
-.|.|.|....+++++...+. |..-+..||.- +..+ .....+.|.+++. +. ........++|.|.+.|
T Consensus 21 a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~---g~-~~~~~~~~v~I~V~d~N 96 (199)
T cd00031 21 ATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDG---GG-PPLSSTATVTVTVLDVN 96 (199)
T ss_pred EECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEEC---Cc-CcceeEEEEEEEEccCC
Confidence 356666643456667666653 44445555531 1111 1123334444432 11 11112234667777888
Q ss_pred C-CceecCccceeeeeeCC---eEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEee-cCCCC
Q psy12980 212 Q-VPLIRNPIDHLEAISGE---LLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGI-PQLTD 286 (609)
Q Consensus 212 ~-aPvv~n~i~~~~at~G~---~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~-P~~~d 286 (609)
+ +|......-.+.+.++. ....++- ..|.+.|....++|++..... ..|+++|..++.++-. ++.-.
T Consensus 97 d~~P~~~~~~~~~~v~e~~~~~~~i~~~~---a~D~D~~~~~~~~y~l~~~~~-----~~~f~i~~~~G~i~~~~~ld~e 168 (199)
T cd00031 97 DNPPVFEQSSYEASVPENAPPGTVVGTVT---ATDADSGENAKLTYSILSGND-----KELFSIDPNTGIITLAKPLDRE 168 (199)
T ss_pred CCCCcccccceEEEEeCCCCCCCEEEEEE---EEcCCCCCCccEEEEEeCCCC-----CCEEEEeCCceEEEeCCccCCc
Confidence 7 67776655566665553 2223332 677766533478888865433 3799999988877766 34333
Q ss_pred c-eeEEEEEEEEcCC--CCeeEeeEEEEEE
Q psy12980 287 L-GRREYQLVCKDSN--GLMANDGLEVIVK 313 (609)
Q Consensus 287 v-G~~~f~LtATDs~--G~sasd~F~V~V~ 313 (609)
. +.|.+.|+|+|.+ +.+.+-.+.|.|.
T Consensus 169 ~~~~~~l~v~a~D~~~~~~~~~~~i~i~v~ 198 (199)
T cd00031 169 EKSSYELTVVATDGGGPPLSSTATVTVTVL 198 (199)
T ss_pred cCceEEEEEEEEECCCCCceeEEEEEEEEE
Confidence 3 5999999999998 6777777777664
No 11
>PF12877 DUF3827: Domain of unknown function (DUF3827); InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells.
Probab=95.83 E-value=0.064 Score=61.52 Aligned_cols=78 Identities=18% Similarity=0.208 Sum_probs=50.1
Q ss_pred HHHHHHHHHHhCCC-------------CCCcEEEEeeecCCCC-CCCcEEEEEEecCCCCCCCCCHHHHHHHHHHHhhcc
Q psy12980 342 RAFLEKLAGLFGDR-------------NASKIVNIEFEDNPQE-PGKSTVVRWANRTLASNVECPETTIGLLREVLLTED 407 (609)
Q Consensus 342 ~~LVekLA~llGD~-------------d~s~ItVrsI~~~~~~-~~gSTvV~w~N~TLp~~~~CP~~eI~~L~~kL~~~d 407 (609)
.++-+.||++|+.. +...+.+.+++ ... +++-+-++||...... ++.++..-+++++.|..+
T Consensus 161 r~laqll~e~~~~~~~~~~R~rRattvGn~tVQmV~~s--Rl~G~~nP~EL~YyV~~~~G-~pl~a~~AA~~Ln~ld~Q- 236 (684)
T PF12877_consen 161 RRLAQLLAELFGVAQRQRRRFRRATTVGNYTVQMVNMS--RLEGPDNPVELTYYVEGQNG-KPLPAVTAAKDLNLLDSQ- 236 (684)
T ss_pred HHHHHHHHHHhhhhhcccceeeccccccceEEEEEEee--eccCCCCceEEEEEEEcCCC-cCCcHHHHHHHHhccCHH-
Confidence 34445667777665 22344444444 222 1334899999987777 899999999999988755
Q ss_pred chhhHHHHHhhcccceeeEEeee
Q psy12980 408 DLLQESLVTSLEKDYGVKHASVI 430 (609)
Q Consensus 408 g~~s~~f~~am~peF~V~~Vs~~ 430 (609)
+. ++...|.|..+--+
T Consensus 237 -----~~--Al~LGy~V~~~~Aq 252 (684)
T PF12877_consen 237 -----RM--ALILGYRVQGIVAQ 252 (684)
T ss_pred -----HH--HHhcCceecccccc
Confidence 22 45567777665443
No 12
>PF15102 TMEM154: TMEM154 protein family
Probab=95.50 E-value=0.026 Score=53.85 Aligned_cols=37 Identities=24% Similarity=0.585 Sum_probs=27.8
Q ss_pred CccEEEEeehHHHHHHHHHHHHHHhheeeecccCCCC
Q psy12980 463 SVEYLITFVAPLVVIVIMLLCAALIACLLYRRRHTGK 499 (609)
Q Consensus 463 ~~~~Llv~ViPaVIIa~mllia~iIaC~cyRrKRkgK 499 (609)
....++..+||+|+++++||++.+++.+|+|||+|++
T Consensus 54 q~efiLmIlIP~VLLvlLLl~vV~lv~~~kRkr~K~~ 90 (146)
T PF15102_consen 54 QLEFILMILIPLVLLVLLLLSVVCLVIYYKRKRTKQE 90 (146)
T ss_pred CcceEEEEeHHHHHHHHHHHHHHHheeEEeecccCCC
Confidence 3344666889998888888888777777777777775
No 13
>PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known [].
Probab=95.44 E-value=0.03 Score=53.21 Aligned_cols=22 Identities=18% Similarity=0.383 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHhheeeeccc
Q psy12980 474 LVVIVIMLLCAALIACLLYRRR 495 (609)
Q Consensus 474 aVIIa~mllia~iIaC~cyRrK 495 (609)
+-||.+|..+|+|++|+|+.-|
T Consensus 68 fgiVfimgvva~i~icvCmc~k 89 (155)
T PF10873_consen 68 FGIVFIMGVVAGIAICVCMCMK 89 (155)
T ss_pred hhhHHHHHHHHHHHHHHhhhhh
Confidence 4556667788988888777754
No 14
>PF13908 Shisa: Wnt and FGF inhibitory regulator
Probab=95.30 E-value=0.069 Score=52.19 Aligned_cols=9 Identities=33% Similarity=0.017 Sum_probs=3.4
Q ss_pred EEeehHHHH
Q psy12980 468 ITFVAPLVV 476 (609)
Q Consensus 468 lv~ViPaVI 476 (609)
++.||-+||
T Consensus 81 ivgvi~~Vi 89 (179)
T PF13908_consen 81 IVGVICGVI 89 (179)
T ss_pred eeehhhHHH
Confidence 333443333
No 15
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 [].
Probab=93.81 E-value=0.031 Score=52.02 Aligned_cols=27 Identities=37% Similarity=0.536 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHHhheeeecccCCCCc
Q psy12980 474 LVVIVIMLLCAALIACLLYRRRHTGKM 500 (609)
Q Consensus 474 aVIIa~mllia~iIaC~cyRrKRkgK~ 500 (609)
+|||++++|+..+++|+-+||||||--
T Consensus 6 ~iii~~i~l~~~~~~~~~rRR~r~G~~ 32 (130)
T PF12273_consen 6 AIIIVAILLFLFLFYCHNRRRRRRGLQ 32 (130)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCC
Confidence 455555555555555555566666544
No 16
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=92.95 E-value=0.91 Score=37.47 Aligned_cols=68 Identities=16% Similarity=0.255 Sum_probs=48.8
Q ss_pred cCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEee-cCCCCc-eeEEEEEEEEcCCC--CeeEeeEEEEEEC
Q psy12980 242 DHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGI-PQLTDL-GRREYQLVCKDSNG--LMANDGLEVIVKS 314 (609)
Q Consensus 242 D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~-P~~~dv-G~~~f~LtATDs~G--~sasd~F~V~V~~ 314 (609)
|++.|....++|++...+.. .|+..|..++.+.-. ++.-.. ..|.+.|.|+|.++ .++...+.|.|..
T Consensus 2 D~D~g~n~~i~Y~i~~~~~~-----~~F~i~~~tg~i~~~~~LD~e~~~~y~l~v~a~D~~~~~~~~~~~v~I~V~D 73 (79)
T smart00112 2 DADSGENGKVTYSILSGNED-----GLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTVTVLD 73 (79)
T ss_pred CCCCCcCcEEEEEEecCCCC-----CEEEEeCCccEEEeCCccCeeCCCeEEEEEEEEECCCCCcccEEEEEEEEEE
Confidence 56666434678888644322 789999998855543 554443 68999999999987 7888888888865
No 17
>PF00028 Cadherin: Cadherin domain; InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism which modulate a wide variety of processes including cell polarisation and migration [, ,]. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins which are component of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and a N-terminal cytoplasmic domain []. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats. A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding, to confer rigidity upon the extracellular domain and is essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=92.55 E-value=1.6 Score=37.20 Aligned_cols=68 Identities=12% Similarity=0.120 Sum_probs=46.8
Q ss_pred cccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEee-cCCC-CceeEEEEEEEEcC-CCCeeEeeEEEEE
Q psy12980 240 FFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGI-PQLT-DLGRREYQLVCKDS-NGLMANDGLEVIV 312 (609)
Q Consensus 240 F~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~-P~~~-dvG~~~f~LtATDs-~G~sasd~F~V~V 312 (609)
-.|+|.|....+.+++...+. ..|...|+.+++++=. ++.- ..+.|.+.+.|+|. ++..-+.+..|.|
T Consensus 20 a~D~D~~~n~~i~y~i~~~~~-----~~~F~I~~~tg~i~~~~~LD~E~~~~y~l~v~a~D~~~~~~~~~~~~V~I 90 (93)
T PF00028_consen 20 ATDPDSGPNSQITYSILGGNP-----DGLFSIDPNTGEISLKKPLDRETQSSYQLTVRATDSGGSPPLSSTATVTI 90 (93)
T ss_dssp EEESSTSTTSSEEEEEEETTS-----TTSEEEETTTTEEEESSSSCTTTTSEEEEEEEEEETTTSSEEEEEEEEEE
T ss_pred EEeCCCCCCceEEEEEecCcc-----cCceEEeeeeeccccceecCcccCCEEEEEEEEEECCCCCCCEEEEEEEE
Confidence 356655555578888875443 3799999999886654 4433 34899999999999 5555555555554
No 18
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=91.64 E-value=0.13 Score=38.48 Aligned_cols=11 Identities=27% Similarity=0.537 Sum_probs=5.9
Q ss_pred eeecccCCCCc
Q psy12980 490 LLYRRRHTGKM 500 (609)
Q Consensus 490 ~cyRrKRkgK~ 500 (609)
|||||.|.-|+
T Consensus 28 Ccykk~~~~k~ 38 (38)
T PF02439_consen 28 CCYKKHRRQKM 38 (38)
T ss_pred HHHcccccccC
Confidence 56666553453
No 19
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=90.22 E-value=0.19 Score=48.45 Aligned_cols=31 Identities=29% Similarity=0.178 Sum_probs=23.7
Q ss_pred EEEeehHHHHHHHHHHHHHHhheeeecccCC
Q psy12980 467 LITFVAPLVVIVIMLLCAALIACLLYRRRHT 497 (609)
Q Consensus 467 Llv~ViPaVIIa~mllia~iIaC~cyRrKRk 497 (609)
.++=|+.+|-+++||++++|+.|||+||||+
T Consensus 50 IVIGvVVGVGg~ill~il~lvf~~c~r~kkt 80 (154)
T PF04478_consen 50 IVIGVVVGVGGPILLGILALVFIFCIRRKKT 80 (154)
T ss_pred EEEEEEecccHHHHHHHHHhheeEEEecccC
Confidence 3445566777777788888999999998874
No 20
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=89.92 E-value=3.2 Score=39.60 Aligned_cols=70 Identities=11% Similarity=0.158 Sum_probs=48.9
Q ss_pred cccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeec-C-CCCceeEEEEEEEEcCCCCe--eEeeEEEEEEC
Q psy12980 240 FFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIP-Q-LTDLGRREYQLVCKDSNGLM--ANDGLEVIVKS 314 (609)
Q Consensus 240 F~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P-~-~~dvG~~~f~LtATDs~G~s--asd~F~V~V~~ 314 (609)
-.|++.|....++|++...+.. .|+..|..++.++-.. + ....+.|.+.+.|+|+++.. +.-.+.|.|..
T Consensus 21 a~D~D~~~~~~~~y~i~~~~~~-----~~F~i~~~tG~l~~~~~lD~e~~~~~~l~v~a~D~g~~~~~~~~~v~I~V~d 94 (199)
T cd00031 21 ATDPDSGENGRVTYSILGGNED-----GLFSIDPNTGVITTTKPLDREEQSEYTLTVVASDGGGPPLSSTATVTVTVLD 94 (199)
T ss_pred EECCCCCCCceEEEEEeCCCCc-----ccEEEeCCCCEEEECCCCCCcCCceEEEEEEEEECCcCcceeEEEEEEEEcc
Confidence 4566555323788877644322 6999999877666553 3 33348999999999987766 77788888865
No 21
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=89.23 E-value=2.5 Score=33.83 Aligned_cols=44 Identities=18% Similarity=0.397 Sum_probs=29.9
Q ss_pred ccceeccCCceEEeecCCCCceeEEEEEEEEcCCCCeeEe--eEEEEE
Q psy12980 267 HWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMAND--GLEVIV 312 (609)
Q Consensus 267 sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtATDs~G~sasd--~F~V~V 312 (609)
.|......+..+.=.++.. |+|+|+|.|.|..|....+ ++.|+|
T Consensus 20 ~W~~~~~~~~~~~~~~L~~--G~Y~l~V~a~~~~~~~~~~~~~l~i~I 65 (66)
T PF07495_consen 20 EWITLGSYSNSISYTNLPP--GKYTLEVRAKDNNGKWSSDEKSLTITI 65 (66)
T ss_dssp SEEEESSTS-EEEEES--S--EEEEEEEEEEETTS-B-SS-EEEEEEE
T ss_pred eEEECCCCcEEEEEEeCCC--EEEEEEEEEECCCCCcCcccEEEEEEE
Confidence 7998877654555555555 9999999999999998776 455554
No 22
>cd00146 PKD polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Probab=88.60 E-value=0.96 Score=37.67 Aligned_cols=27 Identities=19% Similarity=0.241 Sum_probs=24.0
Q ss_pred CceeEEEEEEEEcCCCCeeEeeEEEEE
Q psy12980 286 DLGRREYQLVCKDSNGLMANDGLEVIV 312 (609)
Q Consensus 286 dvG~~~f~LtATDs~G~sasd~F~V~V 312 (609)
..|.|.+.|+|+|..|.+.+++..|.|
T Consensus 55 ~~G~y~v~l~v~d~~g~~~~~~~~V~V 81 (81)
T cd00146 55 KPGTYTVTLTVTNAVGSSSTKTTTVVV 81 (81)
T ss_pred CCcEEEEEEEEEeCCCCEEEEEEEEEC
Confidence 349999999999999999998888875
No 23
>PF15050 SCIMP: SCIMP protein
Probab=88.53 E-value=0.46 Score=44.10 Aligned_cols=28 Identities=18% Similarity=0.366 Sum_probs=16.9
Q ss_pred HHHHHHHHHHH-HHHhheeeecccCCCCc
Q psy12980 473 PLVVIVIMLLC-AALIACLLYRRRHTGKM 500 (609)
Q Consensus 473 PaVIIa~mlli-a~iIaC~cyRrKRkgK~ 500 (609)
-||-|+++-++ +-|..|+|+|..|.||.
T Consensus 12 LAVaII~vS~~lglIlyCvcR~~lRqGkk 40 (133)
T PF15050_consen 12 LAVAIILVSVVLGLILYCVCRWQLRQGKK 40 (133)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcccc
Confidence 34444444444 44466778877788887
No 24
>PF15418 DUF4625: Domain of unknown function (DUF4625)
Probab=87.19 E-value=4.3 Score=38.38 Aligned_cols=86 Identities=16% Similarity=0.304 Sum_probs=54.6
Q ss_pred cceeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEE------eccCCCCCC--CCccceeccCC--------ceEEe---
Q psy12980 220 IDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHL------MTMDRTTIP--PTHWLQFDAKN--------QEFYG--- 280 (609)
Q Consensus 220 i~~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl------~~~Dg~~l~--~~sWL~Fd~~t--------~tf~G--- 280 (609)
-..+.+..|.-|.|.. .|.|...=. .+++.+ -. ++.... ...|. |+..= -+|.=
T Consensus 27 ~~~~~~~~G~~ihfe~---~i~d~~~i~--si~VeIH~nfd~H~-h~~~~~~~~~~~~-~~~~~~~~~g~~~~~~h~~i~ 99 (132)
T PF15418_consen 27 ENCKVATRGDDIHFEA---DISDNSAIK--SIKVEIHNNFDHHT-HSTEAGECEKPWV-FEQDYDIYGGKKNYDFHEHID 99 (132)
T ss_pred CCCeEEecCCcEEEEE---EEEccccee--EEEEEEecCcCccc-ccccccccccCcE-EEEEEcccCCcccEeEEEeee
Confidence 4567888888776653 255533333 555555 21 221111 33676 55321 12222
Q ss_pred ecCCCCceeEEEEEEEEcCCCCeeEeeEEEEE
Q psy12980 281 IPQLTDLGRREYQLVCKDSNGLMANDGLEVIV 312 (609)
Q Consensus 281 ~P~~~dvG~~~f~LtATDs~G~sasd~F~V~V 312 (609)
+|.+...|.|.|.+++||..|......-.|.|
T Consensus 100 IPa~a~~G~YH~~i~VtD~~Gn~~~~~~~i~I 131 (132)
T PF15418_consen 100 IPADAPAGDYHFMITVTDAAGNQTEEERSIKI 131 (132)
T ss_pred CCCCCCCcceEEEEEEEECCCCEEEEEEEEEE
Confidence 58888889999999999999999988888776
No 25
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=86.36 E-value=0.38 Score=44.89 Aligned_cols=28 Identities=14% Similarity=0.128 Sum_probs=13.6
Q ss_pred ehHHHHHHHHHHHHHHhh---eeeecccCCC
Q psy12980 471 VAPLVVIVIMLLCAALIA---CLLYRRRHTG 498 (609)
Q Consensus 471 ViPaVIIa~mllia~iIa---C~cyRrKRkg 498 (609)
++-++|+++|++++++|+ .||+|||+|.
T Consensus 65 ~i~~Ii~gv~aGvIg~Illi~y~irR~~Kk~ 95 (122)
T PF01102_consen 65 AIIGIIFGVMAGVIGIILLISYCIRRLRKKS 95 (122)
T ss_dssp CHHHHHHHHHHHHHHHHHHHHHHHHHHS---
T ss_pred ceeehhHHHHHHHHHHHHHHHHHHHHHhccC
Confidence 455666666666555544 3444444443
No 26
>PF14991 MLANA: Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P.
Probab=84.23 E-value=0.32 Score=44.70 Aligned_cols=9 Identities=33% Similarity=0.744 Sum_probs=0.9
Q ss_pred CCCCCCCCC
Q psy12980 548 LPPPEYNRD 556 (609)
Q Consensus 548 lpppey~~~ 556 (609)
=.||.|-+-
T Consensus 99 nAPPAYeKi 107 (118)
T PF14991_consen 99 NAPPAYEKI 107 (118)
T ss_dssp ----B----
T ss_pred CCCchhhhc
Confidence 348888865
No 27
>PF12245 Big_3_2: Bacterial Ig-like domain (group 3); InterPro: IPR022038 This family of proteins is found in bacteria. They have two conserved sequence motifs: AGN and GMT.
Probab=81.87 E-value=3.5 Score=33.52 Aligned_cols=40 Identities=13% Similarity=0.145 Sum_probs=31.7
Q ss_pred CCceEEeecCCCCceeEEEEEEEEcCCCCeeEeeEEEEEE
Q psy12980 274 KNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVK 313 (609)
Q Consensus 274 ~t~tf~G~P~~~dvG~~~f~LtATDs~G~sasd~F~V~V~ 313 (609)
....-+++|.....|.|.+.++|+|..|...+..-...+.
T Consensus 9 ~~~~~~~~P~~~~dg~yt~~v~a~D~AGN~~~~~~~~~i~ 48 (60)
T PF12245_consen 9 SGVWSTVIPENDADGEYTLTVTATDKAGNTSSSTTQIVIV 48 (60)
T ss_pred ccceeccccCccCCccEEEEEEEEECCCCEEEeeeEEEEE
Confidence 3445678899977799999999999999988766555543
No 28
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=77.77 E-value=2.2 Score=45.31 Aligned_cols=61 Identities=23% Similarity=0.267 Sum_probs=33.1
Q ss_pred ceeeEEeeeeccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCccEEEEeehHHHHHHHH--HHHHHHhheeeecccCCCC
Q psy12980 422 YGVKHASVIPTGICEGLKTPLHTPGVERKPAYSGKNTPANASVEYLITFVAPLVVIVIM--LLCAALIACLLYRRRHTGK 499 (609)
Q Consensus 422 F~V~~Vs~~g~G~C~~~t~~~~~pe~~~p~~~~~~~~~~na~~~~Llv~ViPaVIIa~m--llia~iIaC~cyRrKRkgK 499 (609)
+.+.++.+++.+.=.+. + + ....|..|.- -.+||.+|=++| |+|..|||+++.|||++.-
T Consensus 241 l~~~~lqvQaF~~~~~~-F--------------g--~a~~C~~D~~-~~~vPIaVG~~La~lvlivLiaYli~Rrr~~~g 302 (306)
T PF01299_consen 241 LNLSDLQVQAFRVKNNT-F--------------G--TAEECSSDDT-SDLVPIAVGAALAGLVLIVLIAYLIGRRRSRAG 302 (306)
T ss_pred EEEeEEEEEEEEecCCC-C--------------C--ChhcCCcCCc-cchHHHHHHHHHHHHHHHHHHhheeEecccccc
Confidence 67888888876553222 1 1 1123543332 245555443332 3355678999999886543
Q ss_pred c
Q psy12980 500 M 500 (609)
Q Consensus 500 ~ 500 (609)
|
T Consensus 303 Y 303 (306)
T PF01299_consen 303 Y 303 (306)
T ss_pred c
Confidence 3
No 29
>PF11669 WBP-1: WW domain-binding protein 1; InterPro: IPR021684 This family of proteins represents WBP-1, a ligand of the WW domain of Yes-associated protein. This protein has a proline-rich domain. WBP-1 does not bind to the SH3 domain [].
Probab=75.30 E-value=2.6 Score=38.16 Aligned_cols=19 Identities=26% Similarity=0.670 Sum_probs=9.8
Q ss_pred CCeeccCCCCCCCCCCCCCC
Q psy12980 537 SPIILKEEKPPLPPPEYNRD 556 (609)
Q Consensus 537 ~P~IlkeEkpplpppey~~~ 556 (609)
.|+-++- -+.-+||.|...
T Consensus 72 ~p~~~~~-~~~~~pP~Y~ev 90 (102)
T PF11669_consen 72 APLDIRV-LSPYKPPSYSEV 90 (102)
T ss_pred Ccccccc-CCCCCCCCcHHh
Confidence 3444433 334467788743
No 30
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=75.12 E-value=7.4 Score=33.76 Aligned_cols=59 Identities=17% Similarity=0.278 Sum_probs=43.6
Q ss_pred eEEEEEeccCCCCCCCCccce-eccCCceEEee---cCCCCceeEEEEEEEEcCCCCeeEeeEEE
Q psy12980 250 KLKLHLMTMDRTTIPPTHWLQ-FDAKNQEFYGI---PQLTDLGRREYQLVCKDSNGLMANDGLEV 310 (609)
Q Consensus 250 ~L~lsl~~~Dg~~l~~~sWL~-Fd~~t~tf~G~---P~~~dvG~~~f~LtATDs~G~sasd~F~V 310 (609)
.+++.+...+|..+. .|.. -....+.+++. |.+...|.|.+.+...+.++...+.+|.|
T Consensus 37 ~~~v~i~dp~g~~v~--~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~~~~~~~~~~~~~~~F~V 99 (99)
T PF01835_consen 37 PVTVTIKDPSGNEVF--RWSVNTTNENGIFSGSFQLPDDAPLGTYTIRVKTDDDGGQSFSKTFQV 99 (99)
T ss_dssp EEEEEEEETTSEEEE--EEEEEETTCTTEEEEEEE--SS---EEEEEEEEETTTTCEEEEEEEEE
T ss_pred ceEEEEECCCCCEEE--EEEeeeeCCCCEEEEEEECCCCCCCEeEEEEEEEccCCCCEEEEEEEC
Confidence 677788877777765 7776 44567777774 66666799999999988889999999986
No 31
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=74.58 E-value=25 Score=34.03 Aligned_cols=34 Identities=15% Similarity=0.229 Sum_probs=28.3
Q ss_pred cCCCCceeEEEEEEEEcCCCCeeEeeEEEEEECC
Q psy12980 282 PQLTDLGRREYQLVCKDSNGLMANDGLEVIVKSP 315 (609)
Q Consensus 282 P~~~dvG~~~f~LtATDs~G~sasd~F~V~V~~~ 315 (609)
|.-..-+.|...+.|+|..|.....+......|.
T Consensus 117 psle~~~~YtLtV~a~D~aGN~~~~si~F~y~P~ 150 (158)
T PF13750_consen 117 PSLEADDSYTLTVSATDKAGNQSTKSISFSYMPP 150 (158)
T ss_pred CCcCCCCeEEEEEEEEecCCCEEEEEEEEEEeCC
Confidence 4333448999999999999999999999888864
No 32
>PF02494 HYR: HYR domain; InterPro: IPR003410 This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is uncertain it may be involved in cell adhesion. In the Sushi repeat-containing protein (SrpX), this domain is found between two sushi repeats.
Probab=73.48 E-value=4.9 Score=33.91 Aligned_cols=26 Identities=23% Similarity=0.300 Sum_probs=24.4
Q ss_pred ceeEEEEEEEEcCCCCeeEeeEEEEE
Q psy12980 287 LGRREYQLVCKDSNGLMANDGLEVIV 312 (609)
Q Consensus 287 vG~~~f~LtATDs~G~sasd~F~V~V 312 (609)
+|.|.+..+|+|..|.++.=.|.|.|
T Consensus 56 ~G~t~V~ytA~D~~GN~a~C~f~V~V 81 (81)
T PF02494_consen 56 VGTTTVTYTATDAAGNSATCSFTVTV 81 (81)
T ss_pred eceEEEEEEEEECCCCEEEEEEEEEC
Confidence 59999999999999999999999986
No 33
>PF05478 Prominin: Prominin; InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=72.83 E-value=1.4 Score=52.75 Aligned_cols=33 Identities=15% Similarity=0.208 Sum_probs=21.7
Q ss_pred eehHHHHHHHHHHHHHHhheeeecccC-CCCccc
Q psy12980 470 FVAPLVVIVIMLLCAALIACLLYRRRH-TGKMSV 502 (609)
Q Consensus 470 ~ViPaVIIa~mllia~iIaC~cyRrKR-kgK~~i 502 (609)
.++.++++++++-+.|+|.|||++++| .|||.-
T Consensus 96 ~~~i~ll~~il~P~vg~~fCcCRCc~~CGg~~~~ 129 (806)
T PF05478_consen 96 CAVIGLLFIILMPLVGLCFCCCRCCGNCGGRMHQ 129 (806)
T ss_pred HHHHHHHHHHHHHHHHHHHhccccCCCcCCcccc
Confidence 333355555555577899999976665 788843
No 34
>PF06667 PspB: Phage shock protein B; InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages []. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one [].; GO: 0006355 regulation of transcription, DNA-dependent, 0009271 phage shock
Probab=72.60 E-value=1.3 Score=38.01 Aligned_cols=31 Identities=23% Similarity=0.365 Sum_probs=21.3
Q ss_pred hHHHHHHHHHHHHHHhheeeecccCCCCccc
Q psy12980 472 APLVVIVIMLLCAALIACLLYRRRHTGKMSV 502 (609)
Q Consensus 472 iPaVIIa~mllia~iIaC~cyRrKRkgK~~i 502 (609)
+-..+|++|+++|-+|.++-||.|++.--.+
T Consensus 6 l~~plivf~ifVap~WL~lHY~sk~~~~~gL 36 (75)
T PF06667_consen 6 LFVPLIVFMIFVAPIWLILHYRSKWKSSQGL 36 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccCCCC
Confidence 3455666778888888888888776443334
No 35
>KOG4289|consensus
Probab=72.13 E-value=1.4e+02 Score=38.64 Aligned_cols=69 Identities=9% Similarity=0.042 Sum_probs=37.2
Q ss_pred cccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeec--CCCCceeEEEEEEEEcCCCCeeEeeEEEEE
Q psy12980 240 FFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIP--QLTDLGRREYQLVCKDSNGLMANDGLEVIV 312 (609)
Q Consensus 240 F~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P--~~~dvG~~~f~LtATDs~G~sasd~F~V~V 312 (609)
|.|+|-|---...+++...++.+ -=...++.++++--.- ....+..|...+.|+|.+.-..+.+.+|+|
T Consensus 807 atDaD~g~Ng~v~y~~qg~~d~p----~~F~IEptSGviRtl~rLdRE~~avy~L~a~avDrg~p~ls~~~eItv 877 (2531)
T KOG4289|consen 807 ATDADSGPNGRVYYTFQGGDDGP----GDFYIEPTSGVIRTLRRLDRENVAVYVLAAYAVDRGNPPLSAPVEITV 877 (2531)
T ss_pred EeccCCCCCceEEEEecCCCCCC----CceEEccCcceeehhhhhcchheeEEEEEEEEeeCCCCCcCCceEEEE
Confidence 66666655333444444444433 1122444444332221 223457889999999998776666665554
No 36
>PF10577 UPF0560: Uncharacterised protein family UPF0560; InterPro: IPR018890 This family of proteins has no known function.
Probab=71.69 E-value=1.7 Score=51.62 Aligned_cols=30 Identities=27% Similarity=0.153 Sum_probs=14.4
Q ss_pred EEEEeehHHHHHHHHHHHH--HHhheeeeccc
Q psy12980 466 YLITFVAPLVVIVIMLLCA--ALIACLLYRRR 495 (609)
Q Consensus 466 ~Llv~ViPaVIIa~mllia--~iIaC~cyRrK 495 (609)
..|++++-..|++.|+||. -+|.+||||||
T Consensus 268 ~~YHT~fLl~ILG~~~livl~lL~vLl~yCrr 299 (807)
T PF10577_consen 268 TTYHTVFLLAILGGTALIVLILLCVLLCYCRR 299 (807)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc
Confidence 3455665555555444422 22444555544
No 37
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=69.38 E-value=17 Score=32.86 Aligned_cols=49 Identities=20% Similarity=0.271 Sum_probs=31.3
Q ss_pred EEeecCCCCceeEEEEEEEEc------CCCCeeEeeEEEEEECCCCCCcceEEEEEecC
Q psy12980 278 FYGIPQLTDLGRREYQLVCKD------SNGLMANDGLEVIVKSPGPAYYSASFSLTLAR 330 (609)
Q Consensus 278 f~G~P~~~dvG~~~f~LtATD------s~G~sasd~F~V~V~~~p~~~~~vef~L~l~~ 330 (609)
-||.-.-+.-|.|.|.|--.+ ..|.+..++|+|.|... . ...+.+++..
T Consensus 25 ~yGtlti~~~G~wtYtl~n~~~avq~L~~Ge~~tdsFtvtv~DG--t--t~~vtItI~G 79 (99)
T TIGR01965 25 QYGTFSIDADGQWTYQADNSQTAVQALKAGETLTDTFTVTSADG--T--SQTVTITITG 79 (99)
T ss_pred CcEEEEECCCCcEEEEeCCCcHHHHhhcCCCEEEEEEEEEEeCC--C--eEEEEEEEEc
Confidence 445443333488999885332 37889999999998753 1 4455555554
No 38
>PHA03283 envelope glycoprotein E; Provisional
Probab=69.07 E-value=13 Score=42.49 Aligned_cols=25 Identities=40% Similarity=0.577 Sum_probs=15.6
Q ss_pred HHHHHHH--hheeeecccCCCCcccCC
Q psy12980 480 MLLCAAL--IACLLYRRRHTGKMSVGN 504 (609)
Q Consensus 480 mllia~i--IaC~cyRrKRkgK~~i~d 504 (609)
++++++| |+|+.|||+|+-.|.|++
T Consensus 411 ~~~~~~l~vw~c~~~r~~~~~~y~iln 437 (542)
T PHA03283 411 AALLVALVVWGCILYRRSNRKPYEVLN 437 (542)
T ss_pred HHHHHHHhhhheeeehhhcCCcccccC
Confidence 3334444 777777777777776655
No 39
>PF02480 Herpes_gE: Alphaherpesvirus glycoprotein E; InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation [].; GO: 0016020 membrane; PDB: 2GJ7_F 2GIY_B.
Probab=68.19 E-value=1.7 Score=48.79 Aligned_cols=49 Identities=18% Similarity=0.266 Sum_probs=16.3
Q ss_pred eeeCCeEEEEcCCCccccCCCCCeeeEEEEEecc--CCCC-CCCCccceeccCC
Q psy12980 225 AISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTM--DRTT-IPPTHWLQFDAKN 275 (609)
Q Consensus 225 at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~--Dg~~-l~~~sWL~Fd~~t 275 (609)
-..|-.|.++-+.---| -.|+|..|..+|... |..+ --.-.|+-|+...
T Consensus 143 ~~~~~~~~~~~~~s~vf--~~Gdtf~~~v~l~~~~~d~~~f~~~i~W~~~~~~~ 194 (439)
T PF02480_consen 143 HRHGATFHLKNYHSHVF--SPGDTFHLSVHLQSEAHDDPPFSLEIDWYYMPTDP 194 (439)
T ss_dssp ---SEEEEEE--SEEE----TT--EE---EEEEEESSS--EEEEEEEEEE---T
T ss_pred cccccEEEEeccceEEe--cCCCcEEEeEEEEeccCCCCCeeEEEEEEEecCCC
Confidence 45566777766644323 257766666665532 2111 0112577777654
No 40
>PRK09458 pspB phage shock protein B; Provisional
Probab=66.76 E-value=2.6 Score=36.24 Aligned_cols=30 Identities=20% Similarity=0.461 Sum_probs=21.4
Q ss_pred HHHHHHHHHHHHHhheeeecccCCCCcccC
Q psy12980 474 LVVIVIMLLCAALIACLLYRRRHTGKMSVG 503 (609)
Q Consensus 474 aVIIa~mllia~iIaC~cyRrKRkgK~~i~ 503 (609)
+-+|++|+++|=+|.++-||.||++.-.+.
T Consensus 8 ~PliiF~ifVaPiWL~LHY~sk~~~~~~Ls 37 (75)
T PRK09458 8 IPLTIFVLFVAPIWLWLHYRSKRQGSQGLS 37 (75)
T ss_pred HhHHHHHHHHHHHHHHHhhcccccCCCCCC
Confidence 445666788888888888998876655554
No 41
>KOG1094|consensus
Probab=64.02 E-value=15 Score=42.82 Aligned_cols=30 Identities=30% Similarity=0.564 Sum_probs=19.1
Q ss_pred ehHHHHHHHHHHHHHHhheeeecccC-CCCc
Q psy12980 471 VAPLVVIVIMLLCAALIACLLYRRRH-TGKM 500 (609)
Q Consensus 471 ViPaVIIa~mllia~iIaC~cyRrKR-kgK~ 500 (609)
++..+++++++++|.||+.|+||+|| +.|.
T Consensus 392 ~~~~~f~~if~iva~ii~~~L~R~rr~~~ka 422 (807)
T KOG1094|consen 392 ILIIIFVAIFLIVALIIALMLWRWRRLLSKA 422 (807)
T ss_pred ehHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 33345555556678888889999654 3433
No 42
>smart00089 PKD Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Probab=63.28 E-value=12 Score=30.88 Aligned_cols=25 Identities=24% Similarity=0.335 Sum_probs=21.8
Q ss_pred ceeEEEEEEEEcCCCCeeEeeEEEEE
Q psy12980 287 LGRREYQLVCKDSNGLMANDGLEVIV 312 (609)
Q Consensus 287 vG~~~f~LtATDs~G~sasd~F~V~V 312 (609)
.|.|.++|+|+|..| +++...+|.|
T Consensus 54 ~G~y~v~l~v~n~~g-~~~~~~~i~v 78 (79)
T smart00089 54 PGTYTVTLTVTNAVG-SASATVTVVV 78 (79)
T ss_pred CcEEEEEEEEEcCCC-cEEEEEEEEE
Confidence 399999999999999 7777887776
No 43
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=63.24 E-value=4.3 Score=30.59 Aligned_cols=27 Identities=15% Similarity=0.435 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHHHhheeee--cccCCCCc
Q psy12980 474 LVVIVIMLLCAALIACLLY--RRRHTGKM 500 (609)
Q Consensus 474 aVIIa~mllia~iIaC~cy--RrKRkgK~ 500 (609)
++|++++..++-++.|++| +++||.+.
T Consensus 7 aIIv~V~vg~~iiii~~~~YaCcykk~~~ 35 (38)
T PF02439_consen 7 AIIVAVVVGMAIIIICMFYYACCYKKHRR 35 (38)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHccccc
Confidence 5566666667767666555 78888764
No 44
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=62.55 E-value=9.6 Score=29.87 Aligned_cols=27 Identities=19% Similarity=0.146 Sum_probs=18.9
Q ss_pred ceecccCccC--cCcccccc-ccceeeEEE
Q psy12980 160 RYVISNTTTG--QAKPVLTR-ALGRLMTVK 186 (609)
Q Consensus 160 ~~~~~~t~T~--sgtp~~~d-~~~~~v~v~ 186 (609)
.|+.||..|+ +|+|...+ .+...++|+
T Consensus 14 ~gLs~d~~tG~isGtp~~~~~~G~y~~~vt 43 (49)
T PF05345_consen 14 SGLSLDPSTGTISGTPTSSVQPGTYTFTVT 43 (49)
T ss_pred CcEEEeCCCCEEEeecCCCccccEEEEEEE
Confidence 3999987776 69998773 355555554
No 45
>PF05568 ASFV_J13L: African swine fever virus J13L protein; InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=62.38 E-value=5.3 Score=38.51 Aligned_cols=55 Identities=11% Similarity=0.152 Sum_probs=29.9
Q ss_pred HHHHHhheeeecccCCCCcccCCCcccccccccCCCCEEcccccccccCCCCCCCCCeeccCC
Q psy12980 482 LCAALIACLLYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEERLEVSSGTKSPIILKEE 544 (609)
Q Consensus 482 lia~iIaC~cyRrKRkgK~~i~d~deE~~tF~~KGiPvIF~dElde~~~~~~~~~~P~IlkeE 544 (609)
++.|+.+.+|-|||||.-..|+ ||..-|+ ++.=.-|--+ ..|.|+.+.|.=-.+-
T Consensus 43 iiiivli~lcssRKkKaaAAi~---eediQfi----npyqDqqw~~-v~pqPgt~kPAgAtt~ 97 (189)
T PF05568_consen 43 IIIIVLIYLCSSRKKKAAAAIE---EEDIQFI----NPYQDQQWAG-VTPQPGTSKPAGATTG 97 (189)
T ss_pred HHHHHHHHHHhhhhHHHHhhhh---hhccccc----Ccccchhhcc-CCCCCCCCCCCCCccc
Confidence 3444445567777766665563 5566676 3322222222 2477777776655443
No 46
>PRK13211 N-acetylglucosamine-binding protein A; Reviewed
Probab=61.06 E-value=30 Score=39.47 Aligned_cols=64 Identities=9% Similarity=0.133 Sum_probs=44.3
Q ss_pred eEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEEEEEEcCCCCee-EeeEEEEEEC
Q psy12980 250 KLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMA-NDGLEVIVKS 314 (609)
Q Consensus 250 ~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtATDs~G~sa-sd~F~V~V~~ 314 (609)
++++.|...+|+.+...+ ...+..++.|.=.-.+...|.|.+.++|+|+.|... ++++.|.|..
T Consensus 342 ~vta~V~d~~g~~~~~~~-~~v~d~s~~vtL~Ls~~~AG~y~Lvv~~t~~dG~~~~q~~~~~~v~~ 406 (478)
T PRK13211 342 NVEATVYNHDGEALGSKS-QTVNDGSQSVSLDLSKLKAGHHMLVVKAKPKDGELIKQQTLDFMLEA 406 (478)
T ss_pred EEEEEEEcCCCCeeeeee-EEecCCceeEEEecccCCCceEEEEEEEEeCCCceeeeeeEEEEEEe
Confidence 677777888888776332 222333344443333334599999999999999988 8899999863
No 47
>PF14610 DUF4448: Protein of unknown function (DUF4448)
Probab=60.26 E-value=8.4 Score=38.04 Aligned_cols=29 Identities=24% Similarity=0.477 Sum_probs=15.0
Q ss_pred EEEEeehHHHHHHHHHHHHHHhheeeecccC
Q psy12980 466 YLITFVAPLVVIVIMLLCAALIACLLYRRRH 496 (609)
Q Consensus 466 ~Llv~ViPaVIIa~mllia~iIaC~cyRrKR 496 (609)
+.+...||.++++++++++ +.|+++||+|
T Consensus 158 ~~laI~lPvvv~~~~~~~~--~~~~~~R~~R 186 (189)
T PF14610_consen 158 YALAIALPVVVVVLALIMY--GFFFWNRKKR 186 (189)
T ss_pred eeEEEEccHHHHHHHHHHH--hhheeeccce
Confidence 3444558888777333332 3334455554
No 48
>PF00974 Rhabdo_glycop: Rhabdovirus spike glycoprotein; InterPro: IPR001903 Different families of ssRNA negative-strand viruses contain glycoproteins responsible for forming spikes on the surface of the virion. The glycoprotein spike is made up of a trimer of glycoproteins. These proteins are frequently abbreviated to G protein. Channel formed by glycoprotein spike is thought to function in a similar manner to Influenza virus M2 protein channel, thus allowing a signal to pass across the viral membrane to signal for viral uncoating [, ].; GO: 0019031 viral envelope; PDB: 2CMZ_C 2J6J_A 3EGD_D.
Probab=59.81 E-value=3 Score=47.57 Aligned_cols=17 Identities=24% Similarity=0.403 Sum_probs=7.7
Q ss_pred eEEEEcCCCccccCCCC
Q psy12980 230 LLVYHVPEDTFFDHEDG 246 (609)
Q Consensus 230 ~f~f~IP~dTF~D~eDG 246 (609)
.|....|-.-.+|+=+|
T Consensus 146 ~f~~i~ph~V~~DpYs~ 162 (501)
T PF00974_consen 146 TFVIITPHPVLLDPYSG 162 (501)
T ss_dssp EEEEEEE--EEEETTTT
T ss_pred EEEEEecccceeccccC
Confidence 55555555555554433
No 49
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=58.78 E-value=28 Score=27.59 Aligned_cols=27 Identities=11% Similarity=0.214 Sum_probs=21.0
Q ss_pred CceeEEEEEEEEcCCCCeeEe-eEEEEE
Q psy12980 286 DLGRREYQLVCKDSNGLMAND-GLEVIV 312 (609)
Q Consensus 286 dvG~~~f~LtATDs~G~sasd-~F~V~V 312 (609)
.-|.|.|.++|+|..|..... ...|.|
T Consensus 22 ~dG~y~itv~a~D~AGN~s~~~~~~~ti 49 (54)
T PF13754_consen 22 ADGTYTITVTATDAAGNTSTSSSVTFTI 49 (54)
T ss_pred CCccEEEEEEEEeCCCCCCCccceeEEE
Confidence 359999999999999987665 344544
No 50
>TIGR02976 phageshock_pspB phage shock protein B. This model describes the PspB protein of the psp (phage shock protein) operon, as found in Escherichia coli and many related species. Expression of a phage protein called secretin protein IV, and a number of other stresses including ethanol, heat shock, and defects in protein secretion trigger sigma-54-dependent expression of the phage shock regulon. PspB is both a regulator and an effector protein of the phage shock response.
Probab=58.50 E-value=3.5 Score=35.44 Aligned_cols=27 Identities=26% Similarity=0.536 Sum_probs=17.8
Q ss_pred ehHHHHHHHHHHHHHHhheeeecccCCCC
Q psy12980 471 VAPLVVIVIMLLCAALIACLLYRRRHTGK 499 (609)
Q Consensus 471 ViPaVIIa~mllia~iIaC~cyRrKRkgK 499 (609)
++|++|+ |+++|-+|-++-||+|+++.
T Consensus 7 ~~Pliif--~ifVap~wl~lHY~~k~~~~ 33 (75)
T TIGR02976 7 AIPLIIF--VIFVAPLWLILHYRSKRKTA 33 (75)
T ss_pred HHHHHHH--HHHHHHHHHHHHHHhhhccC
Confidence 4555444 67778888888888665443
No 51
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 [].
Probab=58.36 E-value=3.1 Score=38.68 Aligned_cols=26 Identities=8% Similarity=0.211 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHhheeeecccCCCCc
Q psy12980 475 VVIVIMLLCAALIACLLYRRRHTGKM 500 (609)
Q Consensus 475 VIIa~mllia~iIaC~cyRrKRkgK~ 500 (609)
+++++|++|..+|+++|+++||+-|.
T Consensus 4 l~~iii~~i~l~~~~~~~~~rRR~r~ 29 (130)
T PF12273_consen 4 LFAIIIVAILLFLFLFYCHNRRRRRR 29 (130)
T ss_pred eHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 34444444555566667777777666
No 52
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=56.96 E-value=1.7e+02 Score=40.13 Aligned_cols=61 Identities=11% Similarity=0.072 Sum_probs=39.2
Q ss_pred CCCeeeEEEEEeccCCCCCCCCccceeccCC----ceEEeecCCCCceeEEEEEEEEcC-CCCeeEeeEEEE
Q psy12980 245 DGGTRKLKLHLMTMDRTTIPPTHWLQFDAKN----QEFYGIPQLTDLGRREYQLVCKDS-NGLMANDGLEVI 311 (609)
Q Consensus 245 DGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t----~tf~G~P~~~dvG~~~f~LtATDs-~G~sasd~F~V~ 311 (609)
.|. ++.|.....+|+++ ...|- |+..+ +...=++... |.|.+.|.|++. ++.+++-.|.|+
T Consensus 1799 ag~--~v~F~a~vstGsnV-sw~W~-f~~g~s~~gk~v~~Tf~~a--G~ytV~L~AsN~vs~~~~s~~~~VQ 1864 (2740)
T TIGR00864 1799 ADS--SVCFQGELATGTNV-SWCWA-IDGGSSKMGKHACMTFPDA--GTFAIRLNASNAVSGKSASREFFAE 1864 (2740)
T ss_pred cCc--EEEEEEEccCCCee-EEEEE-eCCCCccccceeEEecCCC--eEEEEEEEEEcccCcceeeeeEEEE
Confidence 466 56666666788887 45675 55332 3333344443 999999999998 555666666654
No 53
>KOG1219|consensus
Probab=56.03 E-value=1.2e+02 Score=41.22 Aligned_cols=186 Identities=16% Similarity=0.154 Sum_probs=90.5
Q ss_pred CCCcccCcCccceeecCccE---EEEcCCCCcccCCCCCeeeEEEEEecccceecc--cCccCcCc-cccccc-cc--ee
Q psy12980 112 QVPLIRNPIDHLEAISGELL---VYHVPEDTFFDHEDGGTRKLKLHLMTMDRYVIS--NTTTGQAK-PVLTRA-LG--RL 182 (609)
Q Consensus 112 ~~P~v~~pi~~~~a~e~~~f---s~~iP~~tF~D~d~Gdt~~ltlsl~~~d~~~~~--~t~T~sgt-p~~~d~-~~--~~ 182 (609)
++|+...=+....+.|+..- -.+|-| .|.|.|-.-.|+|.+..+|||..| |..||+-. -..=|. -. .-
T Consensus 949 hpp~F~~~v~e~~V~EnapiGT~vi~i~A---~dedsgldg~l~Y~I~~gdg~g~FsId~~tG~irTl~~lDrE~ks~Yw 1025 (4289)
T KOG1219|consen 949 HPPEFISFVTEGHVLENAPIGTIVIRIQA---RDEDSGLDGELSYKIRTGDGDGIFSIDSTTGSIRTLKALDREKKSSYW 1025 (4289)
T ss_pred CCcchheeeeeeeEeecCCcceEEEEEEE---ecCCCCccceEEEEEEcCCcceeEEecCCcceEeechhhchhhcceEE
Confidence 45666555555566665432 122332 577888888899999999999999 66666521 000010 00 01
Q ss_pred eEEEeeeecccccccCC-CCCCceecccCCC-CceecCccceeeeeeCCeE---EEEcCCCccccCCCCCeeeEEEEEec
Q psy12980 183 MTVKNITWRGIGHCEDS-PPPPVVKVKLENQ-VPLIRNPIDHLEAISGELL---VYHVPEDTFFDHEDGGTRKLKLHLMT 257 (609)
Q Consensus 183 v~v~~~~~~~~~~~~~~-~~~~tv~v~~~N~-aPvv~n~i~~~~at~G~~f---~f~IP~dTF~D~eDGdT~~L~lsl~~ 257 (609)
+||.+.+. |+..-+ --...+.|..+|+ +|....+.=--.+.+...- ..++ + =+|+|.-.-.+|.|.++.
T Consensus 1026 ltveA~D~---gt~~~ssv~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsivq~--e-a~D~Dsssn~kLmykI~s 1099 (4289)
T KOG1219|consen 1026 LTVEAKDL---GTVPLSSVCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSIVQA--E-ANDPDSSSNQKLMYKITS 1099 (4289)
T ss_pred EEEEEEec---CCCccccceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEEEEe--c-cCCCCcccCcceEEEEcc
Confidence 22221111 111000 0011122333333 2222222111111111111 1112 1 235543333377777764
Q ss_pred cCCCCCCCCccceeccCCceEEeecCCCC--ceeEEEEEEEEcCCCCeeEeeEEEE
Q psy12980 258 MDRTTIPPTHWLQFDAKNQEFYGIPQLTD--LGRREYQLVCKDSNGLMANDGLEVI 311 (609)
Q Consensus 258 ~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~d--vG~~~f~LtATDs~G~sasd~F~V~ 311 (609)
.| + ....+.|+.++.+.|.-.-+. -.-|...|+.+|.+--+-...--|.
T Consensus 1100 Gn--y---q~FF~Id~~TG~iTt~r~LDRE~qdEHiLeVTi~D~gep~l~s~~rvi 1150 (4289)
T KOG1219|consen 1100 GN--Y---QGFFQIDPETGLITTIRRLDREKQDEHILEVTIQDNGEPWLCSNQRVI 1150 (4289)
T ss_pred CC--c---cceEEEccccceeeeehhhcccccccceEEEEEecCCCCccccceEEE
Confidence 43 2 368899999998887643222 2468888999997655533333333
No 54
>PF15069 FAM163: FAM163 family
Probab=55.63 E-value=9 Score=36.81 Aligned_cols=18 Identities=17% Similarity=0.231 Sum_probs=10.2
Q ss_pred EeehHHHHHHHHHHHHHH
Q psy12980 469 TFVAPLVVIVIMLLCAAL 486 (609)
Q Consensus 469 v~ViPaVIIa~mllia~i 486 (609)
++||-+.|+|.++||..|
T Consensus 5 TvVItGgILAtVILLcII 22 (143)
T PF15069_consen 5 TVVITGGILATVILLCII 22 (143)
T ss_pred eEEEechHHHHHHHHHHH
Confidence 355666666666554433
No 55
>PF08693 SKG6: Transmembrane alpha-helix domain; InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=55.07 E-value=10 Score=28.99 Aligned_cols=23 Identities=35% Similarity=0.616 Sum_probs=10.7
Q ss_pred eehHHHHHHHHHHHHHHhheeeeccc
Q psy12980 470 FVAPLVVIVIMLLCAALIACLLYRRR 495 (609)
Q Consensus 470 ~ViPaVIIa~mllia~iIaC~cyRrK 495 (609)
.++|.++|++ +.+++-++-||||
T Consensus 17 VvVPV~vI~~---vl~~~l~~~~rR~ 39 (40)
T PF08693_consen 17 VVVPVGVIII---VLGAFLFFWYRRK 39 (40)
T ss_pred EEechHHHHH---HHHHHhheEEecc
Confidence 4566655542 2223333445555
No 56
>PF14979 TMEM52: Transmembrane 52
Probab=54.75 E-value=6.1 Score=38.08 Aligned_cols=26 Identities=23% Similarity=0.313 Sum_probs=17.2
Q ss_pred CccEEEEeehHHHHHHHHHHHHHHhhe
Q psy12980 463 SVEYLITFVAPLVVIVIMLLCAALIAC 489 (609)
Q Consensus 463 ~~~~Llv~ViPaVIIa~mllia~iIaC 489 (609)
+|.-|. ||.-.+++++++|++|+.+.
T Consensus 15 ~W~~LW-yIwLill~~~llLLCG~ta~ 40 (154)
T PF14979_consen 15 RWSSLW-YIWLILLIGFLLLLCGLTAS 40 (154)
T ss_pred ceehhh-HHHHHHHHHHHHHHHHHHHH
Confidence 444443 66667777888888888543
No 57
>PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated [].
Probab=54.70 E-value=11 Score=38.54 Aligned_cols=30 Identities=7% Similarity=0.163 Sum_probs=18.7
Q ss_pred ccEEEEeehHHHHHHHHHHHHHHhheeeec
Q psy12980 464 VEYLITFVAPLVVIVIMLLCAALIACLLYR 493 (609)
Q Consensus 464 ~~~Llv~ViPaVIIa~mllia~iIaC~cyR 493 (609)
-...++-||.+++.|++++++..++++|+.
T Consensus 36 ~~~I~iaiVAG~~tVILVI~i~v~vR~CRq 65 (221)
T PF08374_consen 36 YVKIMIAIVAGIMTVILVIFIVVLVRYCRQ 65 (221)
T ss_pred ceeeeeeeecchhhhHHHHHHHHHHHHHhh
Confidence 344555667677766666666666666663
No 58
>PF14575 EphA2_TM: Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=53.63 E-value=9.7 Score=32.56 Aligned_cols=13 Identities=23% Similarity=0.340 Sum_probs=3.4
Q ss_pred heeeecccCCCCc
Q psy12980 488 ACLLYRRRHTGKM 500 (609)
Q Consensus 488 aC~cyRrKRkgK~ 500 (609)
..+|.||++.+|.
T Consensus 20 ~~~~~rr~~~~~~ 32 (75)
T PF14575_consen 20 VIVCFRRCKYSKA 32 (75)
T ss_dssp HHCCCTT------
T ss_pred EEEEEeeEcCCCC
Confidence 3355555544444
No 59
>KOG4331|consensus
Probab=52.45 E-value=3.9 Score=48.68 Aligned_cols=37 Identities=19% Similarity=0.214 Sum_probs=16.1
Q ss_pred CccEEEEeehHHHHHHHHHHHHHH--hheeeecccCCCCc
Q psy12980 463 SVEYLITFVAPLVVIVIMLLCAAL--IACLLYRRRHTGKM 500 (609)
Q Consensus 463 ~~~~Llv~ViPaVIIa~mllia~i--IaC~cyRrKRkgK~ 500 (609)
+|+|++-.|+-+++|+.+ -++++ |.|+||||+++++.
T Consensus 103 ~~g~lv~sV~~v~~iil~-p~~~~~yccc~C~~rc~~r~~ 141 (865)
T KOG4331|consen 103 EAGMLVCSVILVLYIILK-PSEGLCYCCCRCCRRCGGRIK 141 (865)
T ss_pred HHHHHHHHHHHHHHHHHH-HHHhHHhheeeeHhhhcCCCC
Confidence 445544334433333322 24455 44455556544444
No 60
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=51.89 E-value=5 Score=33.55 Aligned_cols=35 Identities=29% Similarity=0.425 Sum_probs=1.8
Q ss_pred EEeehHHHHHHHHHHHHHHhheeeecccC--CCCcccC
Q psy12980 468 ITFVAPLVVIVIMLLCAALIACLLYRRRH--TGKMSVG 503 (609)
Q Consensus 468 lv~ViPaVIIa~mllia~iIaC~cyRrKR--kgK~~i~ 503 (609)
+..+|.++|++ +++...+|.+++||-|+ .|-+.++
T Consensus 11 laavIaG~Vvg-ll~ailLIlf~iyR~rkkdEGSY~l~ 47 (64)
T PF01034_consen 11 LAAVIAGGVVG-LLFAILLILFLIYRMRKKDEGSYDLD 47 (64)
T ss_dssp ---------------------------S------SS--
T ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCCccCC
Confidence 34555555543 33333456677777443 6666663
No 61
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggests a role for this domain in adhesion.
Probab=51.72 E-value=25 Score=31.85 Aligned_cols=17 Identities=12% Similarity=0.210 Sum_probs=13.6
Q ss_pred CCceecccCCCCceecC
Q psy12980 202 PPVVKVKLENQVPLIRN 218 (609)
Q Consensus 202 ~~tv~v~~~N~aPvv~n 218 (609)
.++|+|...|++|++..
T Consensus 72 ~vtItI~GtNDapvi~~ 88 (99)
T TIGR01965 72 TVTITITGANDAAVIGG 88 (99)
T ss_pred EEEEEEEccCCCCEEec
Confidence 36777889999998765
No 62
>PF06365 CD34_antigen: CD34/Podocalyxin family; InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=50.49 E-value=13 Score=37.81 Aligned_cols=23 Identities=13% Similarity=0.159 Sum_probs=10.2
Q ss_pred CcEEEEeeecCCCCCCCcEEEEEEecC
Q psy12980 358 SKIVNIEFEDNPQEPGKSTVVRWANRT 384 (609)
Q Consensus 358 s~ItVrsI~~~~~~~~gSTvV~w~N~T 384 (609)
=.|.+-.-. .+ ..-.+++-++.+
T Consensus 38 C~l~Laq~~---~~-~q~Lll~l~~~t 60 (202)
T PF06365_consen 38 CSLSLAQSE---EN-QQCLLLVLTNET 60 (202)
T ss_pred cEEEEecCC---CC-cceEEEEeeccc
Confidence 456665433 11 233444445544
No 63
>KOG4289|consensus
Probab=49.65 E-value=1.9e+02 Score=37.50 Aligned_cols=84 Identities=12% Similarity=0.191 Sum_probs=47.9
Q ss_pred ccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEE--eecCCCCceeEEEEEEEEcCCCC--eeEeeEEEEEECCC
Q psy12980 241 FDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFY--GIPQLTDLGRREYQLVCKDSNGL--MANDGLEVIVKSPG 316 (609)
Q Consensus 241 ~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~--G~P~~~dvG~~~f~LtATDs~G~--sasd~F~V~V~~~p 316 (609)
.|++.|.--.|.|++... . =.|.|..++.+. +.-....+-.|.|.|+|+|.+-- +++.+..|.|..-.
T Consensus 503 idadsg~na~l~y~laG~---~-----pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~dvn 574 (2531)
T KOG4289|consen 503 IDADSGENARLHYSLAGV---G-----PFQINNGSGWITVTKELDRETVEHYSLGVEARDHGTPPLSTSTSISVTVLDVN 574 (2531)
T ss_pred ccCCCCcccceeeeeccC---C-----CeeEecCCceEEEeecccccccceEEEEEEEcCCCCCcccccceEEEEecccC
Confidence 467778744566665422 2 125666666444 34455567889999999998654 45556666664411
Q ss_pred ---CCCcceEEEEEecCCC
Q psy12980 317 ---PAYYSASFSLTLARPS 332 (609)
Q Consensus 317 ---~~~~~vef~L~l~~~~ 332 (609)
+.-..-|..++++.++
T Consensus 575 dndP~Ft~~eytl~inED~ 593 (2531)
T KOG4289|consen 575 DNDPTFTQKEYTLRINEDA 593 (2531)
T ss_pred CCCCccccCceEEEecCCc
Confidence 1112224556665543
No 64
>PHA03286 envelope glycoprotein E; Provisional
Probab=49.04 E-value=7.1 Score=43.78 Aligned_cols=26 Identities=38% Similarity=0.423 Sum_probs=16.7
Q ss_pred ehHHHHHHHHHHHHHHhheeeecccC
Q psy12980 471 VAPLVVIVIMLLCAALIACLLYRRRH 496 (609)
Q Consensus 471 ViPaVIIa~mllia~iIaC~cyRrKR 496 (609)
-+.++.|+++++++.+|||+++||||
T Consensus 395 s~~~~~~~~~~~~~~~~~~~~~r~~~ 420 (492)
T PHA03286 395 SMAAGAILVVLLFALCIAGLYRRRRR 420 (492)
T ss_pred HHHHHHHHHHHHHHHHhHhHhhhhhh
Confidence 34455555566677778888777664
No 65
>PHA03291 envelope glycoprotein I; Provisional
Probab=49.01 E-value=8.6 Score=41.88 Aligned_cols=29 Identities=7% Similarity=0.005 Sum_probs=24.0
Q ss_pred cCCCCceeEEEEEEEEcCCCCeeEeeEEEEEE
Q psy12980 282 PQLTDLGRREYQLVCKDSNGLMANDGLEVIVK 313 (609)
Q Consensus 282 P~~~dvG~~~f~LtATDs~G~sasd~F~V~V~ 313 (609)
|.-.|.|.|.++|.+ +|++..|-|-+-+.
T Consensus 136 P~~~DsGiYvLRV~v---dgat~adlF~lg~~ 164 (401)
T PHA03291 136 AARAVVGLYVLRVWV---EGATNASLFPLGLA 164 (401)
T ss_pred CCcccceeEEEEEEe---CCCcccceEEEeee
Confidence 666678999999998 67788888988876
No 66
>PF00558 Vpu: Vpu protein; InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=48.04 E-value=13 Score=32.51 Aligned_cols=30 Identities=20% Similarity=0.374 Sum_probs=10.3
Q ss_pred eehHHHHHHHHHHHHHHhheeeecccCCCC
Q psy12980 470 FVAPLVVIVIMLLCAALIACLLYRRRHTGK 499 (609)
Q Consensus 470 ~ViPaVIIa~mllia~iIaC~cyRrKRkgK 499 (609)
.++.++|..++..+.|.++.+.|||-++.|
T Consensus 8 ~iialiv~~iiaIvvW~iv~ieYrk~~rqr 37 (81)
T PF00558_consen 8 AIIALIVALIIAIVVWTIVYIEYRKIKRQR 37 (81)
T ss_dssp HHHHHHHHHHHHHHHHHHH-----------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334344444444577888888888654433
No 67
>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=44.72 E-value=13 Score=33.03 Aligned_cols=12 Identities=25% Similarity=0.401 Sum_probs=4.6
Q ss_pred eeeeccc-CCCCc
Q psy12980 489 CLLYRRR-HTGKM 500 (609)
Q Consensus 489 C~cyRrK-RkgK~ 500 (609)
||..||| |..=|
T Consensus 54 CC~kRkrsRrPIY 66 (94)
T PF05393_consen 54 CCKKRKRSRRPIY 66 (94)
T ss_pred HHHHhhhccCCcc
Confidence 3333433 43333
No 68
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=44.32 E-value=17 Score=34.41 Aligned_cols=10 Identities=10% Similarity=0.202 Sum_probs=6.1
Q ss_pred CCCCCCCCCH
Q psy12980 384 TLASNVECPE 393 (609)
Q Consensus 384 TLp~~~~CP~ 393 (609)
..|....||.
T Consensus 38 ~~~~i~~Cp~ 47 (139)
T PHA03099 38 DIPAIRLCGP 47 (139)
T ss_pred CCcccccCCh
Confidence 3455577876
No 69
>PF14914 LRRC37AB_C: LRRC37A/B like protein 1 C-terminal domain
Probab=44.27 E-value=10 Score=36.69 Aligned_cols=21 Identities=33% Similarity=0.491 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHHhheeeeccc
Q psy12980 475 VVIVIMLLCAALIACLLYRRR 495 (609)
Q Consensus 475 VIIa~mllia~iIaC~cyRrK 495 (609)
+|+.+++++.++|-+||+||.
T Consensus 129 vv~~iliii~CLiei~shr~a 149 (154)
T PF14914_consen 129 VVVMILIIIFCLIEICSHRRA 149 (154)
T ss_pred HHHHHHHHHHHHHHHHhcccc
Confidence 333333345556777777776
No 70
>PF13908 Shisa: Wnt and FGF inhibitory regulator
Probab=43.11 E-value=15 Score=35.90 Aligned_cols=22 Identities=18% Similarity=0.324 Sum_probs=13.4
Q ss_pred EEEEeehHHHHHHHHHHHHHHh
Q psy12980 466 YLITFVAPLVVIVIMLLCAALI 487 (609)
Q Consensus 466 ~Llv~ViPaVIIa~mllia~iI 487 (609)
++.+.++-+||+.+++++.+|+
T Consensus 75 ~~~~~iivgvi~~Vi~Iv~~Iv 96 (179)
T PF13908_consen 75 YFITGIIVGVICGVIAIVVLIV 96 (179)
T ss_pred cceeeeeeehhhHHHHHHHhHh
Confidence 4566677777776665555443
No 71
>PF15065 NCU-G1: Lysosomal transcription factor, NCU-G1
Probab=42.18 E-value=7.2 Score=42.63 Aligned_cols=29 Identities=24% Similarity=0.305 Sum_probs=23.8
Q ss_pred ceEEeecCCCCceeEEEEEEEEcCCCCee
Q psy12980 276 QEFYGIPQLTDLGRREYQLVCKDSNGLMA 304 (609)
Q Consensus 276 ~tf~G~P~~~dvG~~~f~LtATDs~G~sa 304 (609)
.+|.|.+..+.-|...|+|.|-+..|..+
T Consensus 117 ~~~~g~~~~~~ngsi~~~~~af~~~gr~~ 145 (350)
T PF15065_consen 117 ATFRGHPYSDPNGSIAFKLQAFSTSGRDA 145 (350)
T ss_pred EEEEecccCCCCCeEEEEEEEecCCCCCc
Confidence 47888887766799999999999988754
No 72
>PRK00269 zipA cell division protein ZipA; Reviewed
Probab=42.17 E-value=14 Score=39.41 Aligned_cols=27 Identities=19% Similarity=0.345 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHhheeeeccc-CCCCcc
Q psy12980 475 VVIVIMLLCAALIACLLYRRR-HTGKMS 501 (609)
Q Consensus 475 VIIa~mllia~iIaC~cyRrK-RkgK~~ 501 (609)
+||+.+++|+||+.--.+|+| ||||++
T Consensus 9 livig~i~i~~il~~~~~r~r~~~gk~~ 36 (293)
T PRK00269 9 LIVIGIIVIAGILFDGWRRMRGGKGKLK 36 (293)
T ss_pred HHHHHHHHHHHHHHHHHHHHhccccccc
Confidence 345555667777665555554 688885
No 73
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=41.48 E-value=24 Score=33.54 Aligned_cols=16 Identities=19% Similarity=0.262 Sum_probs=7.0
Q ss_pred hheeeecccCCCCccc
Q psy12980 487 IACLLYRRRHTGKMSV 502 (609)
Q Consensus 487 IaC~cyRrKRkgK~~i 502 (609)
++.+.||+-||.|+-+
T Consensus 118 ~~~~~yr~~r~~~~~~ 133 (139)
T PHA03099 118 CLLSVYRFTRRTKLPL 133 (139)
T ss_pred HHHhhheeeecccCch
Confidence 3333444444445544
No 74
>PF02038 ATP1G1_PLM_MAT8: ATP1G1/PLM/MAT8 family; InterPro: IPR000272 The FXYD protein family contains at least seven members in mammals []. Two other family members that are not obvious orthologs of any identified mammalian FXYD protein exist in zebrafish. All these proteins share a signature sequence of six conserved amino acids comprising the FXYD motif in the NH2-terminus, and two glycines and one serine residue in the transmembrane domain. FXYD proteins are widely distributed in mammalian tissues with prominent expression in tissues that perform fluid and solute transport or that are electrically excitable. Initial functional characterisation suggested that FXYD proteins act as channels or as modulators of ion channels however studies have revealed that most FXYD proteins have another specific function and act as tissue-specific regulatory subunits of the Na,K-ATPase. Each of these auxiliary subunits produces a distinct functional effect on the transport characteristics of the Na,K-ATPase that is adjusted to the specific functional demands of the tissue in which the FXYD protein is expressed. FXYD proteins appear to preferentially associate with Na,K-ATPase alpha1-beta isozymes, and affect their function in a way that render them operationally complementary or supplementary to coexisting isozymes.; GO: 0005216 ion channel activity, 0006811 ion transport, 0016020 membrane; PDB: 2JO1_A 2JP3_A 2ZXE_G 3A3Y_G 3N23_E 3B8E_H 3KDP_G 3N2F_E.
Probab=41.34 E-value=11 Score=30.14 Aligned_cols=25 Identities=8% Similarity=0.046 Sum_probs=19.3
Q ss_pred ehHHHHHHHHHHHHHHhheeeeccc
Q psy12980 471 VAPLVVIVIMLLCAALIACLLYRRR 495 (609)
Q Consensus 471 ViPaVIIa~mllia~iIaC~cyRrK 495 (609)
-+-++|+|++|+++||+...+.|-|
T Consensus 15 rigGLi~A~vlfi~Gi~iils~kck 39 (50)
T PF02038_consen 15 RIGGLIFAGVLFILGILIILSGKCK 39 (50)
T ss_dssp HHHHHHHHHHHHHHHHHHHCTTHHH
T ss_pred hccchHHHHHHHHHHHHHHHcCccc
Confidence 4678999999999999777655544
No 75
>PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=40.94 E-value=12 Score=33.58 Aligned_cols=7 Identities=43% Similarity=1.146 Sum_probs=4.0
Q ss_pred ccccCCC
Q psy12980 432 TGICEGL 438 (609)
Q Consensus 432 ~G~C~~~ 438 (609)
.|+|++.
T Consensus 6 ~g~c~g~ 12 (98)
T PF07204_consen 6 SGSCNGA 12 (98)
T ss_pred CCCcccH
Confidence 4666654
No 76
>PF09394 Inhibitor_I42: Chagasin family peptidase inhibitor I42; InterPro: IPR018990 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. Chagasin reversible inhibitor of papain-like cysteine proteases []. Chagasin has a beta-barrel structure, which is a unique variant of the immunoglobulin fold with homology to human CD8alpha [, ].; PDB: 2NQD_A 2NNR_A 2H7W_B 3E1Z_A 3CBK_B 3CBJ_B 2OUL_B 2FO8_A 2WGN_B 2C34_A ....
Probab=40.54 E-value=59 Score=27.71 Aligned_cols=74 Identities=11% Similarity=0.224 Sum_probs=38.7
Q ss_pred eeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEEEEEEcCCCC
Q psy12980 223 LEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGL 302 (609)
Q Consensus 223 ~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtATDs~G~ 302 (609)
+++..|+.|...||+|. ..|. .-.++. ..++-.+....++..+...+ +-+.-|.+.|.+.|...+
T Consensus 1 I~v~~g~~~~I~L~~np----stGY--~W~~~~-~~~~l~l~~~~~~~~~~~~~------~vG~~g~~~f~f~a~~~G-- 65 (92)
T PF09394_consen 1 ITVKVGDTFEIELPENP----STGY--SWSLSS-DSDGLQLVSEEYIPDNSPSG------LVGAPGTRTFTFKALKPG-- 65 (92)
T ss_dssp -EEETTSEEEEEEEEBC----CGTB--EEEECT-STTTEEEEEEEEEESSTSST------SSTSSEEEEEEEEESSSE--
T ss_pred CeecCCCEEEEEECCCC----CCCe--EEEEec-CCCeEEEcCCcEEeCCCCcC------CCCCCcEEEEEEEEecCe--
Confidence 46788999999999873 2233 122211 12221122223444333322 333348899999998873
Q ss_pred eeEeeEEEEEE
Q psy12980 303 MANDGLEVIVK 313 (609)
Q Consensus 303 sasd~F~V~V~ 313 (609)
...+.+.-.
T Consensus 66 --~~~i~~~y~ 74 (92)
T PF09394_consen 66 --TTTIKFEYR 74 (92)
T ss_dssp --EEEEEEEEE
T ss_pred --eEEEEEEEE
Confidence 455555543
No 77
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=39.93 E-value=2.7e+02 Score=38.42 Aligned_cols=61 Identities=18% Similarity=0.160 Sum_probs=39.7
Q ss_pred ccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEEEEEEcCCCCeeEeeEEEEEEC
Q psy12980 239 TFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVKS 314 (609)
Q Consensus 239 TF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtATDs~G~sasd~F~V~V~~ 314 (609)
+|.|..+|+ ..++.=.-.||.. . ..+.+ .......|.|.+.|+|+|..|.+ +.++.|.|..
T Consensus 1449 ~Ftd~StGt--~~Sy~WdFGDGst------s----t~qnp--tHtY~~~GtYtVtLTvtN~~Gss-t~T~~VtV~~ 1509 (2740)
T TIGR00864 1449 LFSAFGRAR--NASYLWDFGDGGL------L----EGPEI--LHAFNSPGDFNIRLAAANEVGKN-EATLNVAVKA 1509 (2740)
T ss_pred EEeecCCCC--ceEEEEECCCCCc------c----Cccce--EEEcCCCceEEEEEEEECCCCce-EEEEEEEEec
Confidence 477776776 6666666677742 1 12222 22333449999999999999985 5577787754
No 78
>PF07523 Big_3: Bacterial Ig-like domain (group 3); InterPro: IPR011080 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.; PDB: 2L7Y_A 2KPN_A.
Probab=39.39 E-value=81 Score=25.65 Aligned_cols=50 Identities=18% Similarity=0.201 Sum_probs=33.2
Q ss_pred eEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEEEEEEcCCCCeeEeeEEEEE
Q psy12980 250 KLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIV 312 (609)
Q Consensus 250 ~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtATDs~G~sasd~F~V~V 312 (609)
.|.++++..||..++ ...-+++|.......|.|.+.++..+ .+.+|.|.|
T Consensus 18 ~~~v~at~~dG~~~~--------~~~~~vs~~~d~~~~G~y~Vt~~y~~-----~t~t~~VtV 67 (67)
T PF07523_consen 18 GLFVTATYSDGTSLP--------LSDVTVSGTVDTSKAGTYTVTYTYKG-----VTATFTVTV 67 (67)
T ss_dssp CHEEEEEETTS-ES---------GCCSEEES---TTS-CCEEEEEEECT-----EEEEEEEEE
T ss_pred CCEEEEEEcCCCEec--------eeeeEEEeeeecCCCceEEEEEEECC-----EEEEEEEEC
Confidence 356777888888632 23468888555666799999999866 778888876
No 79
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=37.31 E-value=55 Score=31.20 Aligned_cols=49 Identities=16% Similarity=0.232 Sum_probs=27.4
Q ss_pred ceeEEEEEEEEcCCCCeeEeeEEEEEECCCCCCcceEEEEEecCCCCCcC
Q psy12980 287 LGRREYQLVCKDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFA 336 (609)
Q Consensus 287 vG~~~f~LtATDs~G~sasd~F~V~V~~~p~~~~~vef~L~l~~~~~~ft 336 (609)
-|+|+|++.+-|..|-+.+++.+|.|... ...|+--+.+.+....+.|.
T Consensus 82 gG~y~~~VeLCN~~GCS~S~~~~V~VaDT-DGsHl~pL~~~~~~nN~~y~ 130 (133)
T PF08329_consen 82 GGRYQMQVELCNADGCSTSAPVEVVVADT-DGSHLAPLPYNWDENNKPYT 130 (133)
T ss_dssp -EEEEEEEEEEETTEEEE---EEEEEE-T-TSTTS--------TTS--BE
T ss_pred CCEEEEEEEEECCCCcccCCCEEEEEeCC-CccccccccCcccccCCCcc
Confidence 49999999999999999999999999863 23444445555555545554
No 80
>PF15471 TMEM171: Transmembrane protein family 171
Probab=36.04 E-value=1.2e+02 Score=32.30 Aligned_cols=27 Identities=22% Similarity=0.294 Sum_probs=17.7
Q ss_pred CccEEEEeehHHHHHHHHHHHHHHhhee
Q psy12980 463 SVEYLITFVAPLVVIVIMLLCAALIACL 490 (609)
Q Consensus 463 ~~~~Llv~ViPaVIIa~mllia~iIaC~ 490 (609)
-|+.|- .-|-+-+|+++.|...+||++
T Consensus 156 ~CGFLs-lQImGPlIVl~GLCFFVVAHv 182 (319)
T PF15471_consen 156 ICGFLS-LQIMGPLIVLVGLCFFVVAHV 182 (319)
T ss_pred ccceee-hhhhhhHHHHHhhhhhheeee
Confidence 344443 455566677777777888888
No 81
>PF15298 AJAP1_PANP_C: AJAP1/PANP C-terminus
Probab=33.77 E-value=63 Score=32.80 Aligned_cols=54 Identities=17% Similarity=0.200 Sum_probs=30.5
Q ss_pred cEEEEeehHHHHHHHHHHHHHHhhe-eeec----ccCCCCcccCCCcccccc---cccCCCC
Q psy12980 465 EYLITFVAPLVVIVIMLLCAALIAC-LLYR----RRHTGKMSVGNGVDSGIL---IRSKGIP 518 (609)
Q Consensus 465 ~~Llv~ViPaVIIa~mllia~iIaC-~cyR----rKRkgK~~i~d~deE~~t---F~~KGiP 518 (609)
-.+|++|-.-+|+++..||.+||.= ||-| ||-++--++..|||.++. |.-.++|
T Consensus 98 vh~~iTITvSlImViaAliTtlvlK~C~~~s~~~r~~s~qr~~~qqeeS~Q~Ltd~~p~~~p 159 (205)
T PF15298_consen 98 VHQIITITVSLIMVIAALITTLVLKNCCAQSQNRRRNSHQRKINQQEESCQNLTDFTPARVP 159 (205)
T ss_pred ceEEEEEeeehhHHHHHhhhhhhhhhhhhhhcccCCCccccccccchhhccccccCCcccCc
Confidence 3566666545555555556666543 4433 333555567777777775 6655555
No 82
>PF05337 CSF-1: Macrophage colony stimulating factor-1 (CSF-1); InterPro: IPR008001 Colony stimulating factor 1 (CSF-1) is a homodimeric polypeptide growth factor whose primary function is to regulate the survival, proliferation, differentiation, and function of cells of the mononuclear phagocytic lineage. This lineage includes mononuclear phagocytic precursors, blood monocytes, tissue macrophages, osteoclasts, and microglia of the brain, all of which possess cell surface receptors for CSF-1. The protein has also been linked with male fertility [] and mutations in the Csf-1 gene have been found to cause osteopetrosis and failure of tooth eruption [].; GO: 0005125 cytokine activity, 0008083 growth factor activity, 0016021 integral to membrane; PDB: 3EJJ_A.
Probab=33.72 E-value=14 Score=39.16 Aligned_cols=27 Identities=26% Similarity=0.581 Sum_probs=0.0
Q ss_pred eehHHHHHHHHHHHHHHhheeeecccCCCC
Q psy12980 470 FVAPLVVIVIMLLCAALIACLLYRRRHTGK 499 (609)
Q Consensus 470 ~ViPaVIIa~mllia~iIaC~cyRrKRkgK 499 (609)
+++..-||.++|.++| .++|||||+..
T Consensus 229 ~lLVPSiILVLLaVGG---LLfYr~rrRs~ 255 (285)
T PF05337_consen 229 YLLVPSIILVLLAVGG---LLFYRRRRRSH 255 (285)
T ss_dssp ------------------------------
T ss_pred cccccchhhhhhhccc---eeeeccccccc
Confidence 3333444444444444 46788776543
No 83
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=33.57 E-value=5.9e+02 Score=26.81 Aligned_cols=207 Identities=14% Similarity=0.151 Sum_probs=96.1
Q ss_pred ccceeeeeeeeecccccCCCCCCCccccccCCCCCcccC-cCccce------eecCccEEEEcCCCCcccCCCCCeeeEE
Q psy12980 80 GRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQVPLIRN-PIDHLE------AISGELLVYHVPEDTFFDHEDGGTRKLK 152 (609)
Q Consensus 80 ~~~~~vk~~~~~~~g~~~~~~~~~i~~v~~~N~~P~v~~-pi~~~~------a~e~~~fs~~iP~~tF~D~d~Gdt~~lt 152 (609)
...+.|.....+..|+=..-+ ..+.-.+.+|.|.- ++.+.. +..+..|+.++ .+.+.|++..++
T Consensus 11 d~~~~v~vt~tD~aGN~~~~t----~~~~vDt~~P~v~i~~~~~~~~~~~~~~~~~~t~s~tv-----s~~~~g~~v~v~ 81 (317)
T PF13753_consen 11 DGTYTVSVTVTDAAGNTSTAT----QSITVDTTAPTVTITSIADDDIINGDEATNTVTFSGTV-----SGAEPGSTVTVT 81 (317)
T ss_pred CCcEEEEEEEEeCCCCeeeee----EEEEEecCCCceeeecccCCCccccceeeeeeEEEEEe-----cCCCCCCEEEEE
Confidence 567888888999998876522 22333556885533 344433 33444455443 334556653332
Q ss_pred E-------EEecccc-eecccCccCcCccc-cccccceeeEEEeeeecccccccCCCCCCceecccC-CCCceecCc-cc
Q psy12980 153 L-------HLMTMDR-YVISNTTTGQAKPV-LTRALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLE-NQVPLIRNP-ID 221 (609)
Q Consensus 153 l-------sl~~~d~-~~~~~t~T~sgtp~-~~d~~~~~v~v~~~~~~~~~~~~~~~~~~tv~v~~~-N~aPvv~n~-i~ 221 (609)
. ++. ++| |..- .++. .-..+...|+|+ ...+..|+.... .+..+++... +.+|.+.-. +.
T Consensus 82 ~~g~~~t~~~~-~~G~ws~t------~~~~~~l~~g~~ti~v~-~~tD~aGN~~t~-~s~~~~vDt~~~~~p~vti~~~~ 152 (317)
T PF13753_consen 82 INGTTGTLTAD-ADGNWSVT------VTPSDDLPDGDYTITVT-TVTDAAGNTSTA-ASQTFTVDTTAPTAPTVTITGIS 152 (317)
T ss_pred ECCEEEEEEEe-cCCcEEEe------eccccccccCcceeEEE-EEEccCCccccc-cccccccccccccccccceeccc
Confidence 2 111 122 2221 1121 112234344444 122233333322 1234445444 455655332 32
Q ss_pred eeeeeeCC--eEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCc-eEEeec--CCCC-ceeEEEEEE
Q psy12980 222 HLEAISGE--LLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQ-EFYGIP--QLTD-LGRREYQLV 295 (609)
Q Consensus 222 ~~~at~G~--~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~-tf~G~P--~~~d-vG~~~f~Lt 295 (609)
.=....+. ...+.+-.. ..+.+.|+ .+++++ +|... .+ ......+ ++...| +... -|.|.+.+.
T Consensus 153 ~~~~~~~~~~~~t~t~sg~-v~~~~~~d--~v~vt~---~G~~~---~~-~~~~~g~~t~~~~~~~~~~~~d~~~~v~v~ 222 (317)
T PF13753_consen 153 DDNIINGAESTVTVTFSGT-VTGFDAGD--TVTVTI---NGTTY---TT-TVGADGTWTVTVTPSDLAGLADGTYTVTVT 222 (317)
T ss_pred CCceeeccceeeccccccc-ceeeeece--eEEEee---ccccc---ce-eecCCCcccccccccccccccCceEEEEEE
Confidence 32222222 344444433 35566666 344444 33321 11 1111111 222233 3222 279999999
Q ss_pred EEcCCCCee-EeeEEEEEEC
Q psy12980 296 CKDSNGLMA-NDGLEVIVKS 314 (609)
Q Consensus 296 ATDs~G~sa-sd~F~V~V~~ 314 (609)
++|..|... +.+..+.|..
T Consensus 223 ~tD~AGN~~~t~s~~~tvDt 242 (317)
T PF13753_consen 223 VTDAAGNTAGTASQTFTVDT 242 (317)
T ss_pred eeecccCcccccceeEEEec
Confidence 999999988 6766776654
No 84
>PF00801 PKD: PKD domain; InterPro: IPR000601 The PKD (Polycystic Kidney Disease) domain was first identified in the Polycystic Kidney Disease protein, polycystin-1 (PDK1 gene), and contains an Ig-like fold consisting of a beta-sandwich of seven strands in two sheets with a Greek key topology, although some members have additional strands []. Polycystin-1 is a large cell-surface glycoprotein involved in adhesive protein-protein and protein-carbohydrate interactions; however it is not clear if the PKD domain mediates any of these interactions. PKD domains are also found in other proteins, usually in the extracellular parts of proteins involved in interactions with other proteins. For example, domains with a PKD-type fold are found in archaeal surface layer proteins that protect the cell from extreme environments [], and in the human VPS10 domain-containing receptor SorCS2 [].; PDB: 1B4R_A 2KZW_A 2C4X_A 2C26_A 2Y72_B 3JQU_A 3JS7_B 1WGO_A 1L0Q_A.
Probab=33.39 E-value=2.5e+02 Score=22.49 Aligned_cols=18 Identities=17% Similarity=0.187 Sum_probs=15.5
Q ss_pred ceeEEEEEEEEcCCCCee
Q psy12980 287 LGRREYQLVCKDSNGLMA 304 (609)
Q Consensus 287 vG~~~f~LtATDs~G~sa 304 (609)
.|.|.++|+|+|..|...
T Consensus 51 ~G~y~V~ltv~n~~g~~t 68 (69)
T PF00801_consen 51 PGTYTVTLTVTNGVGSAT 68 (69)
T ss_dssp SEEEEEEEEEEETTSEEE
T ss_pred CeEEEEEEEEEECCCCcC
Confidence 499999999999988653
No 85
>PF11770 GAPT: GRB2-binding adapter (GAPT); InterPro: IPR021082 This entry represents a family of transmembrane proteins which bind the growth factor receptor-bound protein 2 (GRB2) in B cells []. In contrast to other transmembrane adaptor proteins, GAPT, which this entry represents, is not phosphorylated upon BCR ligation. It associates with GRB2 constitutively through its proline-rich region [].
Probab=33.10 E-value=38 Score=32.94 Aligned_cols=17 Identities=24% Similarity=0.421 Sum_probs=11.4
Q ss_pred HHHHHHHhheeeecccC
Q psy12980 480 MLLCAALIACLLYRRRH 496 (609)
Q Consensus 480 mllia~iIaC~cyRrKR 496 (609)
+||+++.|+|+.|+++|
T Consensus 20 ~lLl~cgiGcvwhwkhr 36 (158)
T PF11770_consen 20 LLLLLCGIGCVWHWKHR 36 (158)
T ss_pred HHHHHHhcceEEEeecc
Confidence 34566668898886554
No 86
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=32.22 E-value=11 Score=40.28 Aligned_cols=20 Identities=35% Similarity=0.715 Sum_probs=8.4
Q ss_pred HHHHHHHHHhheee-ecccCC
Q psy12980 478 VIMLLCAALIACLL-YRRRHT 497 (609)
Q Consensus 478 a~mllia~iIaC~c-yRrKRk 497 (609)
++++||-.||-.|+ ||||||
T Consensus 266 liIVLIMvIIYLILRYRRKKK 286 (299)
T PF02009_consen 266 LIIVLIMVIIYLILRYRRKKK 286 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHhh
Confidence 33334444444433 444443
No 87
>KOG1219|consensus
Probab=31.96 E-value=3.6e+02 Score=37.20 Aligned_cols=185 Identities=14% Similarity=0.129 Sum_probs=109.9
Q ss_pred cccCC-CCCcccCcCccceeecCccEEEEcCCCCcccCCCCCeeeEEEEEeccc--ceecccCccCcCc------ccccc
Q psy12980 107 VKLEN-QVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMD--RYVISNTTTGQAK------PVLTR 177 (609)
Q Consensus 107 v~~~N-~~P~v~~pi~~~~a~e~~~fs~~iP~~tF~D~d~Gdt~~ltlsl~~~d--~~~~~~t~T~sgt------p~~~d 177 (609)
|..+| ++|+...|.=--+..|++.=...|=.--=.|+|...-.+|++.++.++ |.-.-+..||--| .-..|
T Consensus 1048 ieDvNDn~Pq~s~pvy~asI~enSp~~vsivq~ea~D~Dsssn~kLmykI~sGnyq~FF~Id~~TG~iTt~r~LDRE~qd 1127 (4289)
T KOG1219|consen 1048 IEDVNDNVPQFSSPVYYASISENSPETVSIVQAEANDPDSSSNQKLMYKITSGNYQGFFQIDPETGLITTIRRLDREKQD 1127 (4289)
T ss_pred EEecCCCCcccCCceEeeeeccCCCCceEEEEeccCCCCcccCcceEEEEccCCccceEEEccccceeeeehhhcccccc
Confidence 34455 478888888888888888776665444445888666778999999876 3333377777544 22334
Q ss_pred ccceeeEEEeeeecccccccCCCCCCceecccCCC-CceecCccceeeeeeCCeEEEEcCCCc--------cccCCCCCe
Q psy12980 178 ALGRLMTVKNITWRGIGHCEDSPPPPVVKVKLENQ-VPLIRNPIDHLEAISGELLVYHVPEDT--------FFDHEDGGT 248 (609)
Q Consensus 178 ~~~~~v~v~~~~~~~~~~~~~~~~~~tv~v~~~N~-aPvv~n~i~~~~at~G~~f~f~IP~dT--------F~D~eDGdT 248 (609)
.--|+||+.+.--. --.+...+.|.|..+|+ +|.....+- .|.||+.. =.|.|.|.
T Consensus 1128 EHiLeVTi~D~gep----~l~s~~rviV~IldvNdnsp~Flqk~~----------~~~v~~r~s~plyRl~a~d~DeG~- 1192 (4289)
T KOG1219|consen 1128 EHILEVTIQDNGEP----WLCSNQRVIVSILDVNDNSPRFLQKKT----------FLRVPERSSPPLYRLAAQDNDEGN- 1192 (4289)
T ss_pred cceEEEEEecCCCC----ccccceEEEEEEeeccCCchhhhhhee----------EEEeeeccCCceeEEEEEecCCCc-
Confidence 55567776633211 11111123344445544 344433333 33444332 23566777
Q ss_pred eeEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEE-EEEEcCCCCeeEeeEEEEEE
Q psy12980 249 RKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQ-LVCKDSNGLMANDGLEVIVK 313 (609)
Q Consensus 249 ~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~-LtATDs~G~sasd~F~V~V~ 313 (609)
+-+++....||.+. ...|..+...|-.-... .|.|.+. +.|||.+--.++.++.|++.
T Consensus 1193 -narityniedgde~-----FsID~~t~vVsss~~~~-~~eydi~~Ikatd~g~pq~sa~trl~lt 1251 (4289)
T KOG1219|consen 1193 -NARITYNIEDGDEV-----FSIDIATGVVSSSTLDP-AGEYDILGIKATDRGAPQASAGTRLHLT 1251 (4289)
T ss_pred -ceEEEEecccCceE-----EEEeccCceEEeeeccC-CcceeEeeEEEecCCCCcccceeEEEEE
Confidence 55555555666552 45677776666544332 3778777 89999988888888888774
No 88
>PF15347 PAG: Phosphoprotein associated with glycosphingolipid-enriched
Probab=31.95 E-value=70 Score=35.45 Aligned_cols=36 Identities=25% Similarity=0.309 Sum_probs=18.8
Q ss_pred EEeehHHHHHHHHHHHHHHhhee--eecccCCCCcccCC
Q psy12980 468 ITFVAPLVVIVIMLLCAALIACL--LYRRRHTGKMSVGN 504 (609)
Q Consensus 468 lv~ViPaVIIa~mllia~iIaC~--cyRrKRkgK~~i~d 504 (609)
++.+.-+.-|+.++||..||..| |.|.| |+|---||
T Consensus 15 ivlwgsLaav~~f~lis~LifLCsSC~reK-K~~~~~GD 52 (428)
T PF15347_consen 15 IVLWGSLAAVTTFLLISFLIFLCSSCDREK-KPKQQSGD 52 (428)
T ss_pred EEeehHHHHHHHHHHHHHHHHHhhcccccc-cCCCCCCc
Confidence 34444444455556666665554 66655 55533333
No 89
>PF10164 DUF2367: Uncharacterized conserved protein (DUF2367); InterPro: IPR019317 This is a highly conserved set of proteins which contains three pairs of cysteine residues within a length of 42 amino acids and is rich in proline residues towards the N terminus. It includes a membrane protein that has been found to be highly expressed in the mouse brain and consequently, several members have been assigned as brain protein i3 (Bri3). Their function is unknown.
Probab=31.57 E-value=33 Score=31.03 Aligned_cols=23 Identities=22% Similarity=0.527 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHHhheeeecccC
Q psy12980 474 LVVIVIMLLCAALIACLLYRRRH 496 (609)
Q Consensus 474 aVIIa~mllia~iIaC~cyRrKR 496 (609)
+++.|++++=.||+.|++.|+||
T Consensus 68 Gi~~AI~fFPiGilcc~~~r~~r 90 (98)
T PF10164_consen 68 GILCAIFFFPIGILCCLAMRERR 90 (98)
T ss_pred HHHHHHHHHhhHHHHhhhcCccc
Confidence 55555555556777777777665
No 90
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=31.13 E-value=3.4e+02 Score=25.90 Aligned_cols=31 Identities=16% Similarity=0.161 Sum_probs=26.2
Q ss_pred CceeEEEEEEEEcCCCCeeEeeEEEEEECCC
Q psy12980 286 DLGRREYQLVCKDSNGLMANDGLEVIVKSPG 316 (609)
Q Consensus 286 dvG~~~f~LtATDs~G~sasd~F~V~V~~~p 316 (609)
+.-...|.++|+|..|.+.+..|.|+|..+-
T Consensus 83 d~l~l~~~v~a~D~DGD~s~~~l~VtI~DD~ 113 (137)
T TIGR03660 83 DELTLNFPIIATDFDGDTSSITLPVTIVDDV 113 (137)
T ss_pred ceEEEeeeEEEEeCCCCccccEEEEEEECCC
Confidence 3456888999999999999999999998753
No 91
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=30.42 E-value=40 Score=30.19 Aligned_cols=11 Identities=18% Similarity=0.174 Sum_probs=4.2
Q ss_pred HHhheeeeccc
Q psy12980 485 ALIACLLYRRR 495 (609)
Q Consensus 485 ~iIaC~cyRrK 495 (609)
.++-.+++|||
T Consensus 85 ~l~w~f~~r~k 95 (96)
T PTZ00382 85 FLCWWFVCRGK 95 (96)
T ss_pred HHhheeEEeec
Confidence 33333344433
No 92
>PF04234 CopC: CopC domain; InterPro: IPR007348 CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm [].; GO: 0005507 copper ion binding, 0046688 response to copper ion, 0042597 periplasmic space; PDB: 1IX2_B 1LYQ_A 2C9P_C 2C9R_A 2C9Q_A 1M42_A 1OT4_A 1NM4_A.
Probab=29.95 E-value=3.2e+02 Score=23.94 Aligned_cols=62 Identities=10% Similarity=0.012 Sum_probs=41.1
Q ss_pred EEEEEeccCCCCCCCCccceeccCCceEEeecCC-CCceeEEEEEEEEcCCCCeeEeeEEEEEE
Q psy12980 251 LKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQL-TDLGRREYQLVCKDSNGLMANDGLEVIVK 313 (609)
Q Consensus 251 L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~-~dvG~~~f~LtATDs~G~sasd~F~V~V~ 313 (609)
-++.++..+|..+.. .....+....++...... -.-|.|.+..++.-..|-..+..|...|+
T Consensus 35 s~v~v~~~~g~~v~~-~~~~~~~~~~~~~~~l~~~l~~G~YtV~wrvvs~DGH~~~G~~~F~V~ 97 (97)
T PF04234_consen 35 SSVTVTDPDGKRVDL-GEPTVDGDGKTLTVPLPPPLPPGTYTVSWRVVSADGHPVSGSFSFTVK 97 (97)
T ss_dssp -EEEEEEEEETTSCT-CEEEEEESTTEEEEEESS---SEEEEEEEEEEETTSCEEEEEEEEEE-
T ss_pred cEEEEEcCCCceeec-CcceecCCceEEEEECCCCCCCceEEEEEEEEecCCCCcCCEEEEEEC
Confidence 455667777776542 345555544566655444 34599999999988999999988887774
No 93
>cd07912 Tweety_N N-terminal domain of the protein encoded by the Drosophila tweety gene and related proteins, a family of chloride ion channels. The protein product of the Drosophila tweety (tty) gene is thought to form a trans-membrane protein with five membrane-spanning regions and a cytoplasmic C-terminus. This N-terminal domain contains the putative transmembrane spanning regions. Tweety has been suggested as a candidate for a large conductance chloride channel, both in vertebrate and insect cells. Three human homologs have been identified and designated TTYH1-3. TTYH2 has been associated with the progression of cancer, and Drosophila melanogaster tweety has been assumed to play a role in development. TTYH2, and TTYH3 bind to and are ubiquinated by Nedd4-2, a HECT type E3 ubiquitin ligase, which most likely plays a role in controlling the cellular levels of tweety family proteins.
Probab=29.65 E-value=41 Score=37.78 Aligned_cols=11 Identities=18% Similarity=0.335 Sum_probs=5.3
Q ss_pred EEeehHHHHHH
Q psy12980 468 ITFVAPLVVIV 478 (609)
Q Consensus 468 lv~ViPaVIIa 478 (609)
++-.++++..+
T Consensus 41 ~~a~iaavwli 51 (418)
T cd07912 41 ILASIPAACLI 51 (418)
T ss_pred HHHHHHHHHHH
Confidence 33446666443
No 94
>PF05915 DUF872: Eukaryotic protein of unknown function (DUF872); InterPro: IPR008590 This entry represents several uncharacterised eukaryotic transmembrane proteins. The function of this currently unknown.
Probab=29.11 E-value=25 Score=32.61 Aligned_cols=25 Identities=20% Similarity=0.340 Sum_probs=18.1
Q ss_pred hHHHHHHHHHHHHHH----hheeeecccC
Q psy12980 472 APLVVIVIMLLCAAL----IACLLYRRRH 496 (609)
Q Consensus 472 iPaVIIa~mllia~i----IaC~cyRrKR 496 (609)
+++.|+.+|++|-|+ |++||+|++|
T Consensus 77 ~~llilG~L~fIPG~Y~~~i~y~a~rg~~ 105 (115)
T PF05915_consen 77 WALLILGILCFIPGFYHTRIAYYAWRGYK 105 (115)
T ss_pred chHHHHHHHHHhccHHHHHHHHHHHcCCC
Confidence 467777878887765 7777777764
No 95
>PF11857 DUF3377: Domain of unknown function (DUF3377); InterPro: IPR021805 This domain is functionally uncharacterised and found at the C terminus of peptidases belonging to MEROPS peptidase family M10A, membrane-type matrix metallopeptidases (clan MA). ; GO: 0004222 metalloendopeptidase activity
Probab=28.88 E-value=88 Score=27.04 Aligned_cols=31 Identities=26% Similarity=0.482 Sum_probs=21.1
Q ss_pred EEEeehHHHHHHHHHHHHHHhheeeecccCCCC
Q psy12980 467 LITFVAPLVVIVIMLLCAALIACLLYRRRHTGK 499 (609)
Q Consensus 467 Llv~ViPaVIIa~mllia~iIaC~cyRrKRkgK 499 (609)
....|||.+++.|+|.+...|.- .|||-..|
T Consensus 31 avaVviPl~L~LCiLvl~yai~~--fkrkGtPr 61 (74)
T PF11857_consen 31 AVAVVIPLVLLLCILVLIYAIFQ--FKRKGTPR 61 (74)
T ss_pred EEEEeHHHHHHHHHHHHHHHhhe--eeecCCCc
Confidence 45578999999988887766655 25553333
No 96
>PF05545 FixQ: Cbb3-type cytochrome oxidase component FixQ; InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=28.65 E-value=36 Score=26.44 Aligned_cols=18 Identities=17% Similarity=0.456 Sum_probs=9.2
Q ss_pred HHHHHhheeeecccCCCCc
Q psy12980 482 LCAALIACLLYRRRHTGKM 500 (609)
Q Consensus 482 lia~iIaC~cyRrKRkgK~ 500 (609)
++.|++.. .||+|||.++
T Consensus 21 ~F~gi~~w-~~~~~~k~~~ 38 (49)
T PF05545_consen 21 FFIGIVIW-AYRPRNKKRF 38 (49)
T ss_pred HHHHHHHH-HHcccchhhH
Confidence 33344433 3466666666
No 97
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=28.38 E-value=1.2e+02 Score=25.64 Aligned_cols=12 Identities=17% Similarity=0.517 Sum_probs=9.9
Q ss_pred eeEEEEEEEEcC
Q psy12980 288 GRREYQLVCKDS 299 (609)
Q Consensus 288 G~~~f~LtATDs 299 (609)
|.|.|.|.|+|.
T Consensus 69 G~Y~~~v~a~~~ 80 (81)
T PF13860_consen 69 GTYTFRVTATDG 80 (81)
T ss_dssp EEEEEEEEEEET
T ss_pred CCEEEEEEEEeC
Confidence 888888888875
No 98
>PRK12633 flgD flagellar basal body rod modification protein; Provisional
Probab=28.20 E-value=2.1e+02 Score=29.57 Aligned_cols=25 Identities=16% Similarity=0.252 Sum_probs=19.4
Q ss_pred EeecCCCCceeEEEEEEEEcCCCCeeE
Q psy12980 279 YGIPQLTDLGRREYQLVCKDSNGLMAN 305 (609)
Q Consensus 279 ~G~P~~~dvG~~~f~LtATDs~G~sas 305 (609)
.|.+... |.|.|.+.|+|..|....
T Consensus 166 ~G~~~~~--G~Y~~~V~a~~~~G~~~~ 190 (230)
T PRK12633 166 GGQPLAD--GKYSITVSASDADAKPVK 190 (230)
T ss_pred CCCcCCC--cceEEEEEEEeCCCcEEe
Confidence 4555544 999999999999987553
No 99
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=27.91 E-value=48 Score=38.48 Aligned_cols=38 Identities=13% Similarity=0.287 Sum_probs=20.6
Q ss_pred CccEEEE-----eehHHHHHHHHHH----HHHHhheeeecccCCCCc
Q psy12980 463 SVEYLIT-----FVAPLVVIVIMLL----CAALIACLLYRRRHTGKM 500 (609)
Q Consensus 463 ~~~~Llv-----~ViPaVIIa~mll----ia~iIaC~cyRrKRkgK~ 500 (609)
..++.++ +|++|+++++..| +.+++.|||-||||+.+.
T Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (548)
T PTZ00364 468 PEEYFLVVRCSTIILMAILLLVAGLISISIIGLYIVIRIRKKRETLK 514 (548)
T ss_pred CCceEEEEeehhhHHHHHHHHHHHHhhhhhheeeEEEEEEeecCcee
Confidence 3456565 5555555444433 344556677777655544
No 100
>COG4736 CcoQ Cbb3-type cytochrome oxidase, subunit 3 [Posttranslational modification, protein turnover, chaperones]
Probab=27.60 E-value=30 Score=28.64 Aligned_cols=28 Identities=14% Similarity=0.369 Sum_probs=16.4
Q ss_pred eeecccCCCCcccCCCcccccccccCCCCEEcccccccc
Q psy12980 490 LLYRRRHTGKMSVGNGVDSGILIRSKGIPVIFQDELEER 528 (609)
Q Consensus 490 ~cyRrKRkgK~~i~d~deE~~tF~~KGiPvIF~dElde~ 528 (609)
..||++||+.+ | ++ +.-++.+.||.+++
T Consensus 28 ~ayr~~~K~~~---d--~a------a~~~l~l~Dd~q~~ 55 (60)
T COG4736 28 FAYRPGKKGEF---D--EA------ARGILPLNDDAQDA 55 (60)
T ss_pred HHhcccchhhH---H--HH------hccCCCCCcchhhh
Confidence 45677777777 1 21 23355677777765
No 101
>PF06679 DUF1180: Protein of unknown function (DUF1180); InterPro: IPR009565 This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins. Their function is unknown.
Probab=26.62 E-value=47 Score=32.71 Aligned_cols=16 Identities=13% Similarity=0.030 Sum_probs=8.4
Q ss_pred CCcccCCCcccccccc
Q psy12980 498 GKMSVGNGVDSGILIR 513 (609)
Q Consensus 498 gK~~i~d~deE~~tF~ 513 (609)
-||-+++.-+|+....
T Consensus 127 RkYgvl~~~~~~~Em~ 142 (163)
T PF06679_consen 127 RKYGVLTTRAENVEMA 142 (163)
T ss_pred eeecccCCCcccceec
Confidence 4565655444555544
No 102
>PF15415 DUF4622: Protein of unknown function (DUF4622)
Probab=26.27 E-value=1.5e+02 Score=31.28 Aligned_cols=147 Identities=18% Similarity=0.204 Sum_probs=79.0
Q ss_pred cccCCCCceecCccceeeeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccce--eccCCc--eEEeec
Q psy12980 207 VKLENQVPLIRNPIDHLEAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQ--FDAKNQ--EFYGIP 282 (609)
Q Consensus 207 v~~~N~aPvv~n~i~~~~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~--Fd~~t~--tf~G~P 282 (609)
|...|-.-+..|+|-+|.| +|.|+|=.. +| ..+|+.. ..|-+| +=|| ||..++ --.|.-
T Consensus 148 i~~~nvq~i~LnPii~qta----Rm~Fti~~g------~g---v~tLe~l-~aGiEI---sGiQ~p~d~t~~~nw~~gD~ 210 (310)
T PF15415_consen 148 INENNVQYIYLNPIINQTA----RMQFTIHKG------EG---VHTLEML-QAGIEI---SGIQQPFDHTSQFNWSNGDN 210 (310)
T ss_pred eccCCeeEEEeChhhhhee----EEEEEEecC------Cc---eeeehhh-hcccee---ecccccCCCccccccccccc
Confidence 4445666788899988887 566666432 34 3555554 234443 3445 555443 233544
Q ss_pred CCCCceeEEEEEEE------EcCCCCeeEeeEEEEEECCCCCCcceEEEEEecCCCCCcCchHHH-HHHHHHHHHHhCCC
Q psy12980 283 QLTDLGRREYQLVC------KDSNGLMANDGLEVIVKSPGPAYYSASFSLTLARPSSTFAQPAAQ-RAFLEKLAGLFGDR 355 (609)
Q Consensus 283 ~~~dvG~~~f~LtA------TDs~G~sasd~F~V~V~~~p~~~~~vef~L~l~~~~~~ft~~~~q-~~LVekLA~llGD~ 355 (609)
+..-+|.-.=.++. ||+.|.-.-++ -|-|..-+.+.|...|.+.+. ..+-| ..+|..+-.+-|-+
T Consensus 211 lp~k~GDk~~~v~ip~~~~~t~a~g~L~g~t---GvLPTd~rStpv~Vl~N~~VN-----gvPTQy~~li~g~~ll~ghS 282 (310)
T PF15415_consen 211 LPMKMGDKSSWVRIPGSHFETNANGSLQGHT---GVLPTDCRSTPVSVLMNMKVN-----GVPTQYMMLINGKHLLHGHS 282 (310)
T ss_pred cceecccccceEEEecceeeecCCCcEEeec---ccccccccCCceEEEEEEEec-----CCchheeEeeeeeEeccccc
Confidence 44445533222332 45555443332 244432334444444444332 22344 34555566666665
Q ss_pred CCCcEEEEeeecCCCCCCCcEEEEEEecCC
Q psy12980 356 NASKIVNIEFEDNPQEPGKSTVVRWANRTL 385 (609)
Q Consensus 356 d~s~ItVrsI~~~~~~~~gSTvV~w~N~TL 385 (609)
- +.-..-+|+ +|-+++.|-|+|-
T Consensus 283 Y-Nyt~Tvki~------~GitV~tWQn~SW 305 (310)
T PF15415_consen 283 Y-NYTWTVKIS------NGITVVTWQNQSW 305 (310)
T ss_pred c-ceEEEEEec------CCEEEEEeccccc
Confidence 4 555666787 8999999999774
No 103
>PF09680 Tiny_TM_bacill: Protein of unknown function (Tiny_TM_bacill); InterPro: IPR010070 This entry represents a family of hypothetical proteins, half of which are 40 residues or less in length. Members are found only in spore-forming species. A Gly-rich variable region is followed by a strongly conserved, highly hydrophobic region, predicted to form a transmembrane helix, ending with an invariant Gly. The consensus for this stretch is FALLVVFILLIIV.
Probab=25.31 E-value=66 Score=22.06 Aligned_cols=16 Identities=25% Similarity=0.607 Sum_probs=7.2
Q ss_pred HHHHHHHHHHHHHhheee
Q psy12980 474 LVVIVIMLLCAALIACLL 491 (609)
Q Consensus 474 aVIIa~mllia~iIaC~c 491 (609)
++++.++||+ |+.|.|
T Consensus 8 livVLFILLi--IvG~s~ 23 (24)
T PF09680_consen 8 LIVVLFILLI--IVGASC 23 (24)
T ss_pred hHHHHHHHHH--Hhccee
Confidence 3444444444 445544
No 104
>PF09315 DUF1973: Domain of unknown function (DUF1973); InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels.
Probab=25.11 E-value=4.7e+02 Score=25.97 Aligned_cols=76 Identities=17% Similarity=0.263 Sum_probs=46.4
Q ss_pred eEEEEEeccCCCCCCCCccceeccCCce-EEeecCCCCceeEEEEEEEEcCCCCeeEeeEEEEEE--CCCCCCcceEEEE
Q psy12980 250 KLKLHLMTMDRTTIPPTHWLQFDAKNQE-FYGIPQLTDLGRREYQLVCKDSNGLMANDGLEVIVK--SPGPAYYSASFSL 326 (609)
Q Consensus 250 ~L~lsl~~~Dg~~l~~~sWL~Fd~~t~t-f~G~P~~~dvG~~~f~LtATDs~G~sasd~F~V~V~--~~p~~~~~vef~L 326 (609)
...+.|+.-+|..... ...|...+. -.-+|.....|.|.|.|..+.+. .+.+.|.|. +.....+.+.+..
T Consensus 42 ~p~i~L~~P~G~~~~~---~~~d~~~~~~~i~ipg~ae~G~W~y~i~~~~~~----~q~v~vtVtS~~~~~~~~Pitv~~ 114 (179)
T PF09315_consen 42 PPSITLTDPSGTVYTT---FTTDSNSKTARIQIPGTAEVGTWTYSITNTSSS----SQTVTVTVTSRASSPTVPPITVTA 114 (179)
T ss_pred CceEEEECCCCCEEee---eEEcccccEEEEECCCCcccccEEEEEecCCCC----cceEEEEEEeeECCCCCCCEEEEE
Confidence 4566677777777653 335544432 12256666789999999765533 344566554 3334566777777
Q ss_pred EecCCC
Q psy12980 327 TLARPS 332 (609)
Q Consensus 327 ~l~~~~ 332 (609)
.+..+.
T Consensus 115 ~~~~~~ 120 (179)
T PF09315_consen 115 WLSQSS 120 (179)
T ss_pred EECCCC
Confidence 776643
No 105
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=24.92 E-value=88 Score=33.26 Aligned_cols=22 Identities=27% Similarity=0.442 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHHhheeeeccc
Q psy12980 474 LVVIVIMLLCAALIACLLYRRR 495 (609)
Q Consensus 474 aVIIa~mllia~iIaC~cyRrK 495 (609)
|+=++++|.+++||....+|||
T Consensus 237 ALG~v~ll~l~Gii~~~~~r~~ 258 (281)
T PF12768_consen 237 ALGTVFLLVLIGIILAYIRRRR 258 (281)
T ss_pred HHHHHHHHHHHHHHHHHHHhhh
Confidence 3444444445555555444443
No 106
>PRK06439 NADH dehydrogenase subunit J; Provisional
Probab=24.59 E-value=66 Score=27.67 Aligned_cols=29 Identities=10% Similarity=0.311 Sum_probs=23.0
Q ss_pred EEEeehHHHHHHHHHHHHHHhheeeeccc
Q psy12980 467 LITFVAPLVVIVIMLLCAALIACLLYRRR 495 (609)
Q Consensus 467 Llv~ViPaVIIa~mllia~iIaC~cyRrK 495 (609)
+-.|++|+-++.++|++|-+-|..+-||+
T Consensus 43 ft~YllpFEl~SiLLLvAmIGAIvLarke 71 (72)
T PRK06439 43 FNTYIVPFELLSVIIVGAVIGVMYVAWGE 71 (72)
T ss_pred HhhHHHHHHHHHHHHHHHHHHHheeeecC
Confidence 34578999999999999988887776653
No 107
>TIGR03370 PEPCTERM_Roseo variant PEP-CTERM putative exosortase signal, Roseobacter type. A probable protein export sorting signal, PEP-CTERM, was described by Haft, et al. (PubMed:16930487). It is predicted to interact with a putative transpeptidase we designate exosortase. Most examples of this signal are recognized by model TIGR02595, but some unusual clades require different models. This model describes a variant with conserved motif VPLPA, rather than VPEP. This variant is found prominently in two members of the Rhodobacterales, namely Jannaschia sp. CCS1 and Roseobacter denitrificans OCh 114. One interesting member protein has a full-length duplication and therefore two copies of this putative sorting domain.
Probab=24.57 E-value=33 Score=23.92 Aligned_cols=18 Identities=39% Similarity=0.360 Sum_probs=10.8
Q ss_pred HHHHHHHhheeeecccCC
Q psy12980 480 MLLCAALIACLLYRRRHT 497 (609)
Q Consensus 480 mllia~iIaC~cyRrKRk 497 (609)
.|+++++-++-..|||||
T Consensus 8 ~LLl~gLggl~~~rRRrk 25 (26)
T TIGR03370 8 LLLLAGLGGLGAMRRRRR 25 (26)
T ss_pred HHHHHHHHHHHHHHHhhc
Confidence 356666666655566654
No 108
>COG5513 Predicted secreted protein [Function unknown]
Probab=24.48 E-value=5.9e+02 Score=23.81 Aligned_cols=90 Identities=17% Similarity=0.216 Sum_probs=47.0
Q ss_pred eeeeCCeEEEEcCCCccccCCCCCeeeEEEEEeccCCCCCCCCccceeccCCceEEeecCCCCceeEEEEEEEEcCCCCe
Q psy12980 224 EAISGELLVYHVPEDTFFDHEDGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFYGIPQLTDLGRREYQLVCKDSNGLM 303 (609)
Q Consensus 224 ~at~G~~f~f~IP~dTF~D~eDGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~G~P~~~dvG~~~f~LtATDs~G~s 303 (609)
....|+.|.+++|+|. -.|. .- .|....|..|-...--+.-+..++ -|.| |.|...+.|.+++-+.
T Consensus 22 kL~~ge~f~L~l~~NP----tTGy--sW--eL~~s~gl~l~~~d~~~~~~~e~~-~G~~-----G~~~W~i~av~~G~q~ 87 (113)
T COG5513 22 KLNNGEKFTLQLPENP----TTGY--SW--ELNTSEGLVLLREDYTVDPAEEGT-VGSP-----GIHTWEIRAVKQGTQQ 87 (113)
T ss_pred EEecCCceEEEecCCC----ccce--EE--EEecCCccEEEeccCccCCCcccc-cCCC-----ceEEEEEEeecCCceE
Confidence 5667889999999884 3344 22 222233433221111111111222 3444 8899999999987665
Q ss_pred eEeeEEEEEECCCCCCcceEEEEEec
Q psy12980 304 ANDGLEVIVKSPGPAYYSASFSLTLA 329 (609)
Q Consensus 304 asd~F~V~V~~~p~~~~~vef~L~l~ 329 (609)
+.-.....=.+. .-+...|++.++
T Consensus 88 v~g~Y~R~we~~--~g~e~~f~l~Ve 111 (113)
T COG5513 88 VEGTYRRLWEKE--AGPEINFSLIVE 111 (113)
T ss_pred EEEEEEEeeccc--cCcceeEEEEEE
Confidence 554444333222 234445666554
No 109
>PLN03150 hypothetical protein; Provisional
Probab=24.45 E-value=29 Score=40.58 Aligned_cols=7 Identities=14% Similarity=0.657 Sum_probs=3.9
Q ss_pred EEEEEEC
Q psy12980 308 LEVIVKS 314 (609)
Q Consensus 308 F~V~V~~ 314 (609)
+.|.+.+
T Consensus 340 l~isl~p 346 (623)
T PLN03150 340 LTIVLQP 346 (623)
T ss_pred EEEEEee
Confidence 5555555
No 110
>PF08496 Peptidase_S49_N: Peptidase family S49 N-terminal; InterPro: IPR013703 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found to the N terminus of bacterial signal peptidases that belong to the MEROPS peptidase family S49 (protease IV family, clan SK) (see also IPR002142 from INTERPRO) [, ]. ; GO: 0004252 serine-type endopeptidase activity, 0005886 plasma membrane
Probab=24.28 E-value=59 Score=31.65 Aligned_cols=32 Identities=19% Similarity=0.403 Sum_probs=18.9
Q ss_pred hHHHHHHHHHHHHHHhheeeecccCCCCcccC
Q psy12980 472 APLVVIVIMLLCAALIACLLYRRRHTGKMSVG 503 (609)
Q Consensus 472 iPaVIIa~mllia~iIaC~cyRrKRkgK~~i~ 503 (609)
+.-+|+++++++++|++..-+.|+++|++.+-
T Consensus 14 ~vTvVvaI~~vv~~I~~~~~k~k~~~G~L~Vt 45 (155)
T PF08496_consen 14 IVTVVVAILAVVGLIVAAAQKKKKKKGELEVT 45 (155)
T ss_pred HHHHHHHHHHHHHHHHHHhccCCCCCCeEEEe
Confidence 33455555555666666665555568888543
No 111
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=23.96 E-value=6.2e+02 Score=31.37 Aligned_cols=102 Identities=18% Similarity=0.258 Sum_probs=58.3
Q ss_pred eeeeeCCeEEEEcCCCccccCC-----CCCeeeEEEEEeccCCCCCCCCccceeccCCceEE-eecCCCCceeEEEEEEE
Q psy12980 223 LEAISGELLVYHVPEDTFFDHE-----DGGTRKLKLHLMTMDRTTIPPTHWLQFDAKNQEFY-GIPQLTDLGRREYQLVC 296 (609)
Q Consensus 223 ~~at~G~~f~f~IP~dTF~D~e-----DGdT~~L~lsl~~~Dg~~l~~~sWL~Fd~~t~tf~-G~P~~~dvG~~~f~LtA 296 (609)
+.+..|+.+.-+|+-|.+...+ -....+..+.|++.+|.... +- +.|...+.-+ =+|....+|+|+|.|++
T Consensus 495 ~~l~~~~~~~~tv~iDstvG~~T~F~vtw~~~~p~i~l~dP~G~~~~--~~-~~d~~~~~a~l~ipg~a~~G~W~y~l~~ 571 (863)
T TIGR00868 495 LTLQNNAWMNGTVPVDSTVGKDTFFLITWEFLKPEIFLQDPSGKSTS--DF-LVDKLNKMAYLQIPGTAKVGTWTYSLQA 571 (863)
T ss_pred eEecCCcEEeeeEEecCCCCCCcEEEEEcCCCCceEEEECCCCCccc--cc-eEeecceEEEEEcCCCCccceeEEEEec
Confidence 4555555565555544322111 11112566778888887632 32 3555444322 35766678999999998
Q ss_pred EcCCCCeeEeeEEEEEE--CCCCCCcceEEEEEecCCC
Q psy12980 297 KDSNGLMANDGLEVIVK--SPGPAYYSASFSLTLARPS 332 (609)
Q Consensus 297 TDs~G~sasd~F~V~V~--~~p~~~~~vef~L~l~~~~ 332 (609)
+ .. .+.++|.|. ......+.+.....|+...
T Consensus 572 ~-~~----~q~i~vtVtsra~s~~~~pi~~~a~~~~~~ 604 (863)
T TIGR00868 572 S-AN----PQTLTLTVTSRARSPTLPPVTVTAKMNKDT 604 (863)
T ss_pred c-CC----CceEEEEEEeccCCCCCCCEEEEEEEcccc
Confidence 8 22 344666665 3334567788888877644
No 112
>PF06365 CD34_antigen: CD34/Podocalyxin family; InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=23.94 E-value=73 Score=32.46 Aligned_cols=26 Identities=38% Similarity=0.382 Sum_probs=11.0
Q ss_pred EeehHHHHHH-HHHHHHHHhh-eeeeccc
Q psy12980 469 TFVAPLVVIV-IMLLCAALIA-CLLYRRR 495 (609)
Q Consensus 469 v~ViPaVIIa-~mllia~iIa-C~cyRrK 495 (609)
++|. .|+++ ++||++.+++ +||+.||
T Consensus 101 ~lI~-lv~~g~~lLla~~~~~~Y~~~~Rr 128 (202)
T PF06365_consen 101 TLIA-LVTSGSFLLLAILLGAGYCCHQRR 128 (202)
T ss_pred EEEe-hHHhhHHHHHHHHHHHHHHhhhhc
Confidence 3443 34444 3444444444 4444444
No 113
>PRK12634 flgD flagellar basal body rod modification protein; Reviewed
Probab=23.77 E-value=1.5e+02 Score=30.36 Aligned_cols=26 Identities=15% Similarity=0.139 Sum_probs=19.5
Q ss_pred eecCCCCceeEEEEEEEEcCCCCeeEee
Q psy12980 280 GIPQLTDLGRREYQLVCKDSNGLMANDG 307 (609)
Q Consensus 280 G~P~~~dvG~~~f~LtATDs~G~sasd~ 307 (609)
|.+... |.|.|.+.|+|..|....-.
T Consensus 160 G~~~~~--G~Yt~~v~a~~~~G~~~~~~ 185 (221)
T PRK12634 160 GNRMAA--GKYGVTATQTDTAGSKSKLS 185 (221)
T ss_pred CCcCCC--eeeEEEEEEEeCCCcEEeee
Confidence 444444 99999999999999855443
No 114
>PF05140 ResB: ResB-like family ; InterPro: IPR007816 This domain is found in a number of known and suspected cytochrome c biogenesis proteins, including ResB []. Mutations in ResB indicate that they are essential for growth []. ResB is predicted to be a transmembrane protein.
Probab=23.71 E-value=42 Score=37.73 Aligned_cols=33 Identities=24% Similarity=0.326 Sum_probs=23.1
Q ss_pred ccEEEEee---hHHHHHHHHHHHHHHhheeeecccC
Q psy12980 464 VEYLITFV---APLVVIVIMLLCAALIACLLYRRRH 496 (609)
Q Consensus 464 ~~~Llv~V---iPaVIIa~mllia~iIaC~cyRrKR 496 (609)
+..|-+.. +|.|.+.|+++++|++.+++.|+||
T Consensus 396 ~tGLqV~~DPG~~~V~~G~~llv~Gl~~sfy~~hRR 431 (464)
T PF05140_consen 396 YTGLQVKKDPGVPIVYLGFILLVVGLFLSFYIRHRR 431 (464)
T ss_pred EEEEEEEeCCCchHHHHHHHHHHHHHHHhheecceE
Confidence 44455554 6788888999999997775555554
No 115
>PF00974 Rhabdo_glycop: Rhabdovirus spike glycoprotein; InterPro: IPR001903 Different families of ssRNA negative-strand viruses contain glycoproteins responsible for forming spikes on the surface of the virion. The glycoprotein spike is made up of a trimer of glycoproteins. These proteins are frequently abbreviated to G protein. Channel formed by glycoprotein spike is thought to function in a similar manner to Influenza virus M2 protein channel, thus allowing a signal to pass across the viral membrane to signal for viral uncoating [, ].; GO: 0019031 viral envelope; PDB: 2CMZ_C 2J6J_A 3EGD_D.
Probab=23.71 E-value=26 Score=40.06 Aligned_cols=9 Identities=0% Similarity=0.021 Sum_probs=0.0
Q ss_pred CCccEEEEe
Q psy12980 462 ASVEYLITF 470 (609)
Q Consensus 462 a~~~~Llv~ 470 (609)
.+|+.++..
T Consensus 449 ~~W~~~~~~ 457 (501)
T PF00974_consen 449 SNWGEWLSI 457 (501)
T ss_dssp ---------
T ss_pred ccHHHHHHH
Confidence 456554443
No 116
>PF08372 PRT_C: Plant phosphoribosyltransferase C-terminal; InterPro: IPR013583 This domain is found at the C terminus of phosphoribosyltransferases and phosphoribosyltransferase-like proteins. It contains putative transmembrane regions. It often appears together with calcium-ion dependent C2 domains (IPR000008 from INTERPRO).
Probab=23.45 E-value=51 Score=32.22 Aligned_cols=24 Identities=25% Similarity=0.444 Sum_probs=17.3
Q ss_pred EcccccccccCCCCCCCCCeeccC
Q psy12980 520 IFQDELEERLEVSSGTKSPIILKE 543 (609)
Q Consensus 520 IF~dElde~~~~~~~~~~P~Ilke 543 (609)
+-.||+||+....|+++.+=+|++
T Consensus 36 ~~~deldEEfD~~ps~~~~~~lr~ 59 (156)
T PF08372_consen 36 AHPDELDEEFDTFPSSRPPDSLRM 59 (156)
T ss_pred CCcchhhhhhcccccccccHHHHH
Confidence 346899887666677777777765
No 117
>PHA03289 envelope glycoprotein I; Provisional
Probab=23.13 E-value=2.6e+02 Score=30.58 Aligned_cols=29 Identities=21% Similarity=0.230 Sum_probs=23.3
Q ss_pred cCCCCceeEEEEEEEEcCCCCeeEeeEEEEEE
Q psy12980 282 PQLTDLGRREYQLVCKDSNGLMANDGLEVIVK 313 (609)
Q Consensus 282 P~~~dvG~~~f~LtATDs~G~sasd~F~V~V~ 313 (609)
|.-.|.|.|.++|.. +|+...|-|-+.|.
T Consensus 129 Pr~~DSGiYyLRV~L---dga~~sDvF~lsv~ 157 (352)
T PHA03289 129 PRMSDTGIYYIRVSL---AGRNKSDIFRTSVF 157 (352)
T ss_pred CCcccCeeEEEEEEe---CCCccCceEEEEEE
Confidence 667788999999998 45678888888764
No 118
>KOG3521|consensus
Probab=22.80 E-value=1.5e+02 Score=35.39 Aligned_cols=38 Identities=37% Similarity=0.494 Sum_probs=26.7
Q ss_pred CCcccccccccCCCCEEcccccccccCCCCCCCCC--eeccCCCCCCCCC
Q psy12980 504 NGVDSGILIRSKGIPVIFQDELEERLEVSSGTKSP--IILKEEKPPLPPP 551 (609)
Q Consensus 504 d~deE~~tF~~KGiPvIF~dElde~~~~~~~~~~P--~IlkeEkpplppp 551 (609)
+++++-.+|.++|.++ . +-.-+.+| +-|+|+.|||.++
T Consensus 768 n~~~~~~~~~~a~th~--------s--p~s~~rSpg~~~lse~~p~l~~~ 807 (846)
T KOG3521|consen 768 NNDPSGETLTSAGTHI--------S--PISPNRSPGTLSLSEEAPPLALE 807 (846)
T ss_pred ccccccchhcccCCCC--------C--CCCcccCccchhhhhhcccccCC
Confidence 3356666888888664 1 34446677 8899999999765
No 119
>PF01601 Corona_S2: Coronavirus S2 glycoprotein; InterPro: IPR002552 The type I glycoprotein S of Coronavirus, trimers of which constitute the typical viral spikes, is assembled into virions through noncovalent interactions with the M protein. The spike glycoprotein is translated as a large polypeptide that is subsequently cleaved to S1 IPR002551 from INTERPRO and S2 []. Both chimeric S proteins appeared to cause cell fusion when expressed individually, suggesting that they were biologically fully active []. The spike is a type I membrane glycoprotein that possesses a conserved transmembrane anchor and an unusual cysteine-rich (cys) domain that bridges the putative junction of the anchor and the cytoplasmic tail [].; GO: 0006944 cellular membrane fusion, 0046813 virion attachment, binding of host cell surface receptor, 0016021 integral to membrane, 0019031 viral envelope; PDB: 2BEQ_B 2FXP_A 1ZVB_A 1WNC_D 1ZV8_H 1ZV7_B 1WYY_B 1ZVA_A 2BEZ_F 1WDG_A ....
Probab=22.35 E-value=29 Score=40.40 Aligned_cols=24 Identities=21% Similarity=0.137 Sum_probs=0.0
Q ss_pred ccceeecCccEEEEcCCCC-cccCC
Q psy12980 121 DHLEAISGELLVYHVPEDT-FFDHE 144 (609)
Q Consensus 121 ~~~~a~e~~~fs~~iP~~t-F~D~d 144 (609)
..-+|.||-.|+-.+=+|- |.|.|
T Consensus 146 ~~RS~IEDLLF~KV~t~glGtvd~d 170 (610)
T PF01601_consen 146 SGRSAIEDLLFNKVVTSGLGTVDQD 170 (610)
T ss_dssp -------------------------
T ss_pred CcchHHHHhhhceEEecCCCcchHH
Confidence 3467777777777665543 33544
No 120
>KOG4221|consensus
Probab=22.10 E-value=64 Score=40.65 Aligned_cols=34 Identities=21% Similarity=0.319 Sum_probs=16.1
Q ss_pred CCCCccEEEEeehHHHHHHHHHHHHHHhheeeecc
Q psy12980 460 ANASVEYLITFVAPLVVIVIMLLCAALIACLLYRR 494 (609)
Q Consensus 460 ~na~~~~Llv~ViPaVIIa~mllia~iIaC~cyRr 494 (609)
.+..++-|+..++.++.++ .+++.-+|++||+||
T Consensus 1052 ~~~~~~~ll~i~l~~~~vi-~i~~~~~v~v~C~~r 1085 (1381)
T KOG4221|consen 1052 TNLVDSNLLDIILVAVAVI-LILVLLLVLVTCRRR 1085 (1381)
T ss_pred ccccccceeeeeeehhhhH-HHHHHhhheeEEecc
Confidence 3444444443443333332 233444567777776
No 121
>PF05808 Podoplanin: Podoplanin; InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=22.00 E-value=30 Score=34.00 Aligned_cols=35 Identities=20% Similarity=0.375 Sum_probs=0.0
Q ss_pred cEEEEeehHHHHHHHHHH---HHHHhheeeecccCCCCcc
Q psy12980 465 EYLITFVAPLVVIVIMLL---CAALIACLLYRRRHTGKMS 501 (609)
Q Consensus 465 ~~Llv~ViPaVIIa~mll---ia~iIaC~cyRrKRkgK~~ 501 (609)
+=|-+--|.++||.+++. |+|||+.+. ||=+|||+
T Consensus 124 ~GL~T~tLVGIIVGVLlaIG~igGIIivvv--RKmSGRys 161 (162)
T PF05808_consen 124 DGLSTVTLVGIIVGVLLAIGFIGGIIIVVV--RKMSGRYS 161 (162)
T ss_dssp ----------------------------------------
T ss_pred CCcceeeeeeehhhHHHHHHHHhheeeEEe--ehhccccC
Confidence 334434444555555444 555555553 33357774
No 122
>PF12669 P12: Virus attachment protein p12 family
Probab=21.68 E-value=64 Score=26.32 Aligned_cols=8 Identities=13% Similarity=0.571 Sum_probs=4.0
Q ss_pred eecccCCC
Q psy12980 491 LYRRRHTG 498 (609)
Q Consensus 491 cyRrKRkg 498 (609)
.+|++|+|
T Consensus 20 ~~k~~K~G 27 (58)
T PF12669_consen 20 FIKDKKKG 27 (58)
T ss_pred HHHHhhcC
Confidence 34555555
No 123
>PF15102 TMEM154: TMEM154 protein family
Probab=21.55 E-value=60 Score=31.43 Aligned_cols=40 Identities=15% Similarity=0.266 Sum_probs=26.1
Q ss_pred CCCccEEEEeehHHHHHHHHHHHHHHhheeeecccCCCCc
Q psy12980 461 NASVEYLITFVAPLVVIVIMLLCAALIACLLYRRRHTGKM 500 (609)
Q Consensus 461 na~~~~Llv~ViPaVIIa~mllia~iIaC~cyRrKRkgK~ 500 (609)
+.+.+.+-..++.+|-.++++|+..+++|+.++.|||...
T Consensus 49 ~~~~~q~efiLmIlIP~VLLvlLLl~vV~lv~~~kRkr~K 88 (146)
T PF15102_consen 49 EEDSSQLEFILMILIPLVLLVLLLLSVVCLVIYYKRKRTK 88 (146)
T ss_pred CCCCCCcceEEEEeHHHHHHHHHHHHHHHheeEEeecccC
Confidence 4566766643343333455556677789999999888775
No 124
>PF14967 FAM70: FAM70 protein
Probab=21.46 E-value=2.2e+02 Score=30.80 Aligned_cols=41 Identities=15% Similarity=0.071 Sum_probs=24.8
Q ss_pred CCCCCCCCCCCCCCCc-cEEEEeehHHHHHHHHHHHHHHhhe
Q psy12980 449 RKPAYSGKNTPANASV-EYLITFVAPLVVIVIMLLCAALIAC 489 (609)
Q Consensus 449 ~p~~~~~~~~~~na~~-~~Llv~ViPaVIIa~mllia~iIaC 489 (609)
.+..|-+..+-.+|.| -.||+-.+...|+=++.|+.|||.-
T Consensus 175 ~~~~YyEf~gV~SCqDVvhLy~LLWastvLNilgLfLGIiTA 216 (327)
T PF14967_consen 175 YSGSYYEFVGVSSCQDVVHLYRLLWASTVLNILGLFLGIITA 216 (327)
T ss_pred CCCcceEEeccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444434445455633 3366666778888778888777654
No 125
>PF10577 UPF0560: Uncharacterised protein family UPF0560; InterPro: IPR018890 This family of proteins has no known function.
Probab=21.39 E-value=32 Score=41.37 Aligned_cols=30 Identities=10% Similarity=0.249 Sum_probs=20.4
Q ss_pred CCCCccEEEEeehHHHHHHHHHHHHHH-hhe
Q psy12980 460 ANASVEYLITFVAPLVVIVIMLLCAAL-IAC 489 (609)
Q Consensus 460 ~na~~~~Llv~ViPaVIIa~mllia~i-IaC 489 (609)
....-.+||..|+-++++++++|++.| |.|
T Consensus 267 i~~YHT~fLl~ILG~~~livl~lL~vLl~yC 297 (807)
T PF10577_consen 267 ITTYHTVFLLAILGGTALIVLILLCVLLCYC 297 (807)
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 344456888899887777777776665 556
No 126
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=20.54 E-value=2.1e+02 Score=27.74 Aligned_cols=36 Identities=11% Similarity=0.104 Sum_probs=27.4
Q ss_pred eeEEEEE-EEEcCCCCeeEeeEEEEEECCCCCCcceEE
Q psy12980 288 GRREYQL-VCKDSNGLMANDGLEVIVKSPGPAYYSASF 324 (609)
Q Consensus 288 G~~~f~L-tATDs~G~sasd~F~V~V~~~p~~~~~vef 324 (609)
|.|.+.+ +|+|..|...+..+...+.-+ .-.|.+.+
T Consensus 15 G~Y~l~~~~a~D~agN~~~~~~~~~~~iD-~T~Ptisi 51 (158)
T PF13750_consen 15 GSYTLTVVTATDAAGNTSTSTVSETFTID-NTPPTISI 51 (158)
T ss_pred ccEEEEEEEEEecCCCEEEEEEeeEEEEc-CCCCEEEE
Confidence 9999999 899999999998887554443 34455555
Done!