Query 010708
Match_columns 503
No_of_seqs 272 out of 2350
Neff 7.9
Searched_HMMs 46136
Date Fri Mar 29 03:39:47 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/010708.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/010708hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF01453 B_lectin: D-mannose b 100.0 6.9E-30 1.5E-34 219.7 3.7 110 70-180 2-114 (114)
2 cd00028 B_lectin Bulb-type man 99.9 3.2E-25 6.8E-30 191.9 15.2 115 30-148 2-116 (116)
3 PF00954 S_locus_glycop: S-loc 99.9 5.8E-25 1.3E-29 188.3 9.7 99 206-304 1-110 (110)
4 smart00108 B_lectin Bulb-type 99.9 8.2E-24 1.8E-28 182.5 14.5 113 30-147 2-114 (114)
5 PF08276 PAN_2: PAN-like domai 99.4 1.1E-13 2.4E-18 107.0 5.0 57 337-395 4-66 (66)
6 cd00129 PAN_APPLE PAN/APPLE-li 99.3 1.4E-12 3E-17 104.3 5.4 67 337-408 8-79 (80)
7 cd01098 PAN_AP_plant Plant PAN 99.3 6E-12 1.3E-16 101.9 6.9 72 337-410 11-84 (84)
8 smart00108 B_lectin Bulb-type 98.9 7.8E-09 1.7E-13 89.0 9.4 89 88-209 23-112 (114)
9 cd00028 B_lectin Bulb-type man 98.8 2.6E-08 5.6E-13 86.0 8.9 90 88-210 23-114 (116)
10 PF01453 B_lectin: D-mannose b 98.7 1.2E-07 2.5E-12 81.6 11.5 100 36-149 12-114 (114)
11 smart00473 PAN_AP divergent su 98.3 1.6E-06 3.4E-11 68.6 6.5 70 337-408 3-77 (78)
12 KOG1187 Serine/threonine prote 98.0 3.2E-06 7E-11 87.4 3.5 37 467-503 61-97 (361)
13 cd01100 APPLE_Factor_XI_like S 97.2 0.0003 6.5E-09 55.3 3.3 32 358-389 25-57 (73)
14 PLN00113 leucine-rich repeat r 94.9 0.036 7.8E-07 64.9 5.9 29 471-502 683-711 (968)
15 KOG3653 Transforming growth fa 92.9 0.035 7.6E-07 58.0 0.6 17 487-503 216-232 (534)
16 PF00024 PAN_1: PAN domain Thi 90.0 0.4 8.8E-06 37.3 3.8 49 340-389 4-56 (79)
17 PLN03224 probable serine/threo 89.3 0.16 3.5E-06 55.0 1.2 24 479-502 143-166 (507)
18 KOG1025 Epidermal growth facto 89.1 0.34 7.4E-06 54.2 3.5 19 484-502 699-717 (1177)
19 PF07645 EGF_CA: Calcium-bindi 86.8 0.18 3.8E-06 34.9 -0.3 32 272-303 3-36 (42)
20 PRK11138 outer membrane biogen 86.6 15 0.00032 38.3 14.0 77 67-144 86-186 (394)
21 cd00053 EGF Epidermal growth f 83.5 0.81 1.8E-05 29.3 1.8 30 274-303 2-32 (36)
22 PF14295 PAN_4: PAN domain; PD 82.4 0.96 2.1E-05 32.1 2.0 23 358-380 16-38 (51)
23 PLN03225 Serine/threonine-prot 82.2 0.53 1.2E-05 51.9 0.8 24 479-502 130-153 (566)
24 smart00223 APPLE APPLE domain. 82.2 1.4 3.1E-05 35.1 3.0 32 358-389 22-57 (79)
25 smart00179 EGF_CA Calcium-bind 79.7 1.5 3.2E-05 29.0 2.0 30 272-301 3-33 (39)
26 KOG0196 Tyrosine kinase, EPH ( 77.0 0.87 1.9E-05 50.8 0.4 34 470-503 609-651 (996)
27 PF07974 EGF_2: EGF-like domai 76.4 1.9 4.1E-05 28.0 1.7 25 278-303 6-30 (32)
28 cd01099 PAN_AP_HGF Subfamily o 76.1 4.1 8.8E-05 32.4 3.9 32 358-389 25-59 (80)
29 PF12662 cEGF: Complement Clr- 75.2 2 4.3E-05 26.0 1.4 12 292-303 2-13 (24)
30 PF12661 hEGF: Human growth fa 73.3 0.95 2.1E-05 23.2 -0.2 11 293-303 1-11 (13)
31 cd00054 EGF_CA Calcium-binding 73.0 2.9 6.3E-05 27.1 2.1 31 272-302 3-34 (38)
32 PF01683 EB: EB module; Inter 70.4 3.4 7.3E-05 29.8 2.0 33 268-303 16-48 (52)
33 KOG1219 Uncharacterized conser 68.7 6.7 0.00015 48.8 4.9 27 278-304 3870-3898(4289)
34 PTZ00284 protein kinase; Provi 68.2 1.3 2.9E-05 47.3 -0.8 30 473-502 121-150 (467)
35 PF08277 PAN_3: PAN-like domai 67.1 15 0.00032 28.0 5.2 30 358-388 20-49 (71)
36 KOG0193 Serine/threonine prote 67.0 2.5 5.5E-05 45.9 1.0 16 487-502 398-413 (678)
37 PF00008 EGF: EGF-like domain 66.4 1.3 2.8E-05 28.7 -0.8 26 278-303 4-31 (32)
38 PF13360 PQQ_2: PQQ-like domai 65.9 64 0.0014 30.3 10.6 78 67-144 53-148 (238)
39 smart00605 CW CW domain. 65.7 18 0.00039 29.5 5.8 53 358-410 22-75 (94)
40 PF12947 EGF_3: EGF domain; I 64.1 1.5 3.2E-05 29.3 -0.9 27 277-303 5-32 (36)
41 PF09064 Tme5_EGF_like: Thromb 63.7 4.2 9.2E-05 26.7 1.2 18 285-302 11-28 (34)
42 KOG0194 Protein tyrosine kinas 60.7 4.4 9.5E-05 43.4 1.5 16 487-502 163-178 (474)
43 KOG1035 eIF-2alpha kinase GCN2 59.8 3 6.6E-05 48.8 0.0 21 481-501 479-499 (1351)
44 PRK11138 outer membrane biogen 58.4 50 0.0011 34.3 9.0 18 93-110 302-320 (394)
45 TIGR03300 assembly_YfgL outer 56.7 53 0.0011 33.8 8.7 51 93-143 73-130 (377)
46 smart00181 EGF Epidermal growt 56.0 8.9 0.00019 24.6 1.8 24 278-302 6-30 (35)
47 cd05622 STKc_ROCK1 Catalytic d 54.7 4.1 9E-05 42.1 0.1 30 473-502 35-64 (371)
48 cd05106 PTKc_CSF-1R Catalytic 54.1 6 0.00013 40.9 1.2 19 483-501 40-58 (374)
49 PF13360 PQQ_2: PQQ-like domai 54.0 82 0.0018 29.5 9.0 77 68-144 11-102 (238)
50 TIGR03300 assembly_YfgL outer 53.5 74 0.0016 32.7 9.2 56 89-144 245-305 (377)
51 PTZ00036 glycogen synthase kin 53.4 5.3 0.00011 42.5 0.6 23 480-502 65-87 (440)
52 PF04478 Mid2: Mid2 like cell 51.5 8.3 0.00018 34.6 1.4 12 423-434 54-65 (154)
53 PRK09605 bifunctional UGMP fam 51.3 6 0.00013 43.2 0.6 30 473-502 325-354 (535)
54 cd05621 STKc_ROCK2 Catalytic d 50.4 5.2 0.00011 41.4 -0.0 25 478-502 40-64 (370)
55 PHA03210 serine/threonine kina 50.2 8.1 0.00018 41.8 1.4 23 480-502 147-169 (501)
56 PF01436 NHL: NHL repeat; Int 49.3 31 0.00068 21.3 3.4 21 88-108 6-26 (28)
57 PHA02887 EGF-like protein; Pro 48.7 14 0.00031 31.5 2.3 33 270-303 82-119 (126)
58 KOG0663 Protein kinase PITSLRE 47.9 5.4 0.00012 40.6 -0.4 20 482-501 77-96 (419)
59 PF01102 Glycophorin_A: Glycop 47.3 8.1 0.00018 33.5 0.7 7 425-431 71-77 (122)
60 PF08693 SKG6: Transmembrane a 47.2 12 0.00027 25.6 1.4 6 440-445 33-38 (40)
61 PHA03209 serine/threonine kina 46.5 9.8 0.00021 39.0 1.3 25 478-502 63-87 (357)
62 PF13956 Ibs_toxin: Toxin Ibs, 46.4 11 0.00024 21.0 0.9 15 1-15 2-16 (19)
63 cd05596 STKc_ROCK Catalytic do 45.8 4.7 0.0001 41.6 -1.2 23 480-502 42-64 (370)
64 KOG0600 Cdc2-related protein k 43.1 6.5 0.00014 42.0 -0.7 15 487-501 123-137 (560)
65 PLN00034 mitogen-activated pro 42.1 11 0.00023 38.5 0.8 16 487-502 80-95 (353)
66 KOG4649 PQQ (pyrrolo-quinoline 39.2 55 0.0012 32.3 5.0 46 68-113 167-217 (354)
67 KOG0192 Tyrosine kinase specif 38.9 14 0.0003 38.4 1.0 16 487-502 47-62 (362)
68 PHA03211 serine/threonine kina 38.5 15 0.00033 39.4 1.3 23 480-502 168-190 (461)
69 cd00216 PQQ_DH Dehydrogenases 38.1 1.1E+02 0.0023 33.1 7.8 76 68-144 37-136 (488)
70 KOG2052 Activin A type IB rece 34.9 20 0.00044 37.9 1.4 17 486-502 216-232 (513)
71 COG3117 Uncharacterized protei 33.9 2E+02 0.0044 26.8 7.6 73 39-136 42-114 (188)
72 PF15102 TMEM154: TMEM154 prot 33.8 19 0.00042 32.0 0.9 8 423-430 61-68 (146)
73 PF05935 Arylsulfotrans: Aryls 33.3 1E+02 0.0022 33.2 6.6 54 94-150 127-187 (477)
74 PF01011 PQQ: PQQ enzyme repea 32.9 64 0.0014 21.3 3.2 19 125-143 7-25 (38)
75 KOG0291 WD40-repeat-containing 32.9 7.1E+02 0.015 28.4 12.6 85 87-193 354-448 (893)
76 PF02009 Rifin_STEVOR: Rifin/s 32.8 33 0.00072 34.5 2.6 13 436-448 273-285 (299)
77 PHA03212 serine/threonine kina 32.1 20 0.00044 37.3 0.9 20 482-501 93-112 (391)
78 KOG1214 Nidogen and related ba 30.0 36 0.00079 38.5 2.4 32 272-304 828-860 (1289)
79 PF05454 DAG1: Dystroglycan (D 28.9 19 0.0004 36.1 0.0 10 437-446 165-174 (290)
80 KOG1095 Protein tyrosine kinas 28.9 28 0.0006 40.9 1.4 16 487-502 698-713 (1025)
81 cd05845 Ig2_L1-CAM_like Second 26.7 1.2E+02 0.0025 25.0 4.4 33 68-101 32-64 (95)
82 KOG0986 G protein-coupled rece 26.4 33 0.00072 36.5 1.3 21 481-501 185-205 (591)
83 KOG0605 NDR and related serine 26.1 35 0.00076 36.8 1.4 23 479-501 139-161 (550)
84 KOG4257 Focal adhesion tyrosin 26.1 30 0.00066 38.3 1.0 17 486-502 394-410 (974)
85 KOG0694 Serine/threonine prote 25.8 31 0.00066 38.3 1.0 23 480-502 367-389 (694)
86 KOG1214 Nidogen and related ba 25.7 61 0.0013 36.9 3.2 28 277-304 741-769 (1289)
87 PF06247 Plasmod_Pvs28: Plasmo 25.5 19 0.0004 33.5 -0.6 28 276-303 48-81 (197)
88 KOG1166 Mitotic checkpoint ser 24.8 29 0.00062 40.6 0.5 24 478-501 695-718 (974)
89 KOG1024 Receptor-like protein 24.7 1.1E+02 0.0024 32.1 4.6 28 475-502 278-305 (563)
90 PF14670 FXa_inhibition: Coagu 24.2 17 0.00036 24.3 -0.9 14 290-303 17-30 (36)
91 PF14610 DUF4448: Protein of u 24.1 48 0.001 30.9 1.9 22 423-444 162-183 (189)
92 KOG4258 Insulin/growth factor 24.0 41 0.00089 38.3 1.5 19 483-501 996-1014(1025)
93 PF02402 Lysis_col: Lysis prot 23.9 26 0.00057 24.4 0.0 21 1-21 1-21 (46)
94 PF12946 EGF_MSP1_1: MSP1 EGF 23.8 15 0.00033 24.7 -1.1 26 278-303 5-32 (37)
95 PF01034 Syndecan: Syndecan do 23.0 26 0.00057 26.6 -0.1 8 437-444 29-36 (64)
96 TIGR02513 type_III_yscB type I 22.4 1.1E+02 0.0024 26.9 3.5 17 119-135 79-95 (139)
97 PF11920 DUF3438: Protein of u 22.1 4.9E+02 0.011 26.1 8.5 90 1-101 1-97 (288)
98 smart00564 PQQ beta-propeller 21.2 1.6E+02 0.0035 18.1 3.5 17 127-143 15-31 (33)
99 COG2143 Thioredoxin-related pr 20.9 78 0.0017 28.7 2.4 20 36-55 35-54 (182)
100 TIGR01478 STEVOR variant surfa 20.6 56 0.0012 32.4 1.6 6 292-297 143-148 (295)
101 PF12690 BsuPI: Intracellular 20.4 4.1E+02 0.0089 21.1 6.3 16 96-111 27-42 (82)
102 PHA03207 serine/threonine kina 20.3 45 0.00097 34.6 0.9 19 483-501 94-112 (392)
103 KOG4289 Cadherin EGF LAG seven 20.2 54 0.0012 39.6 1.5 41 278-321 1245-1286(2531)
104 PTZ00370 STEVOR; Provisional 20.1 59 0.0013 32.3 1.6 6 292-297 143-148 (296)
No 1
>PF01453 B_lectin: D-mannose binding lectin; InterPro: IPR001480 A bulb lectin super-family (Amaryllidaceae, Orchidaceae and Aliaceae) contains a ~115-residue-long domain whose overall three dimensional fold is very similar to that of [, ]: Dictyostelium discoideum comitin, an actin binding protein Curculigo latifolia curculin, a sweet tasting and taste-modifying protein This domain generally binds mannose, but in at least one protein, curculin, it is apparently devoid of mannose-binding activity. Each bulb-type lectin domain consists of three sequential beta-sheet subdomains (I, II, III) that are inter-related by pseudo three-fold symmetry. The three subdomains are flat four-stranded, antiparrallel beta-sheets. Together they form a 12-stranded beta-barrel in which the barrel axis coincides with the pseudo 3-fold axis.; GO: 0005529 sugar binding; PDB: 3M7H_A 3M7J_B 3MEZ_D 1DLP_A 1BWU_D 1KJ1_A 1B2P_A 1XD6_A 2DPF_C 2D04_B ....
Probab=99.96 E-value=6.9e-30 Score=219.75 Aligned_cols=110 Identities=45% Similarity=0.763 Sum_probs=81.1
Q ss_pred CeEEEEeCCCCCCcC--CccEEEEecCCcEEEEeCCCcEEEee-cCcCCccccceEEEccCCCEEEEeccCCcEEEeecC
Q 010708 70 KAVIWVANRDNPLKD--SSGIITISEDGNLVLVNGQKEVLWSS-NVSNLVNNSTSAQLLDSGNLVLRDNINRAIVWESFQ 146 (503)
Q Consensus 70 ~~vvW~anr~~p~~~--~~~~l~l~~~G~Lvl~~~~~~~~Wss-~~~~~~~~~~~~~L~d~GNlVl~~~~~~~~~WqSFd 146 (503)
++|||+|||+.|+.. ...+|.|+.||||+|++..+.++|++ ++.+....+..|+|+|+|||||++.. +.++|||||
T Consensus 2 ~tvvW~an~~~p~~~~s~~~~L~l~~dGnLvl~~~~~~~iWss~~t~~~~~~~~~~~L~~~GNlvl~d~~-~~~lW~Sf~ 80 (114)
T PF01453_consen 2 RTVVWVANRNSPLTSSSGNYTLILQSDGNLVLYDSNGSVIWSSNNTSGRGNSGCYLVLQDDGNLVLYDSS-GNVLWQSFD 80 (114)
T ss_dssp --------TTEEEEECETTEEEEEETTSEEEEEETTTEEEEE--S-TTSS-SSEEEEEETTSEEEEEETT-SEEEEESTT
T ss_pred cccccccccccccccccccccceECCCCeEEEEcCCCCEEEEecccCCccccCeEEEEeCCCCEEEEeec-ceEEEeecC
Confidence 689999999999954 24799999999999999998999999 55543213789999999999999976 999999999
Q ss_pred CCCCCcCCCCeecccccCCceEEEEecCCCCCCC
Q 010708 147 EPTDSFLPGMHHGIDQRTGKKVQLTSWKSLSDPS 180 (503)
Q Consensus 147 ~PTDTlLpgq~l~~~~~tg~~~~L~Sw~s~~dps 180 (503)
|||||+||||+|+.+..+|.+..++||++.+|||
T Consensus 81 ~ptdt~L~~q~l~~~~~~~~~~~~~sw~s~~dps 114 (114)
T PF01453_consen 81 YPTDTLLPGQKLGDGNVTGKNDSLTSWSSNTDPS 114 (114)
T ss_dssp SSS-EEEEEET--TSEEEEESTSSEEEESS----
T ss_pred CCccEEEeccCcccCCCccccceEEeECCCCCCC
Confidence 9999999999999977776666799999999996
No 2
>cd00028 B_lectin Bulb-type mannose-specific lectin. The domain contains a three-fold internal repeat (beta-prism architecture). The consensus sequence motif QXDXNXVXY is involved in alpha-D-mannose recognition. Lectins are carbohydrate-binding proteins which specifically recognize diverse carbohydrates and mediate a wide variety of biological processes, such as cell-cell and host-pathogen interactions, serum glycoprotein turnover, and innate immune responses.
Probab=99.93 E-value=3.2e-25 Score=191.90 Aligned_cols=115 Identities=43% Similarity=0.717 Sum_probs=100.6
Q ss_pred CccCCCCeEEeCCCeeEEEeeCCCCCCceEEEEEecCCCCCeEEEEeCCCCCCcCCccEEEEecCCcEEEEeCCCcEEEe
Q 010708 30 QLIRDPDAILSNGSNFKLGFFNPADSPYRYMGIWYDMPSEKAVIWVANRDNPLKDSSGIITISEDGNLVLVNGQKEVLWS 109 (503)
Q Consensus 30 ~~l~~~~~L~S~~g~f~~gf~~~~~~~~~~l~i~~~~~~~~~vvW~anr~~p~~~~~~~l~l~~~G~Lvl~~~~~~~~Ws 109 (503)
+.|..|++|+|+++.|++|||........+..|||...+ +++||+||++.|. ...+.|.|+.||+|+|+|.++.++|+
T Consensus 2 ~~l~~~~~l~s~~~~f~~G~~~~~~q~~dgnlv~~~~~~-~~~vW~snt~~~~-~~~~~l~l~~dGnLvl~~~~g~~vW~ 79 (116)
T cd00028 2 NPLSSGQTLVSSGSLFELGFFKLIMQSRDYNLILYKGSS-RTVVWVANRDNPS-GSSCTLTLQSDGNLVIYDGSGTVVWS 79 (116)
T ss_pred cCcCCCCEEEeCCCcEEEecccCCCCCCeEEEEEEeCCC-CeEEEECCCCCCC-CCCEEEEEecCCCeEEEcCCCcEEEE
Confidence 568899999999999999999987543378889998765 7899999999984 35689999999999999999999999
Q ss_pred ecCcCCccccceEEEccCCCEEEEeccCCcEEEeecCCC
Q 010708 110 SNVSNLVNNSTSAQLLDSGNLVLRDNINRAIVWESFQEP 148 (503)
Q Consensus 110 s~~~~~~~~~~~~~L~d~GNlVl~~~~~~~~~WqSFd~P 148 (503)
|++.+ ......|+|+|||||||++.+ +.++|||||||
T Consensus 80 S~~~~-~~~~~~~~L~ddGnlvl~~~~-~~~~W~Sf~~P 116 (116)
T cd00028 80 SNTTR-VNGNYVLVLLDDGNLVLYDSD-GNFLWQSFDYP 116 (116)
T ss_pred ecccC-CCCceEEEEeCCCCEEEECCC-CCEEEcCCCCC
Confidence 99875 224788999999999999987 89999999999
No 3
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=99.92 E-value=5.8e-25 Score=188.34 Aligned_cols=99 Identities=49% Similarity=1.022 Sum_probs=88.1
Q ss_pred eecCCCCCeeeeecccccceeeeeEEE---------EEEEeCCe--eEEEEecCCcEEEEEEeCCCCCeeEEeccccccc
Q 010708 206 WRSGPWNGQIFIGIPELKSVYLFRHNF---------TFGFANDW--TFFALTAQGILEERIWIKWKDNWEVGFLNLRTEC 274 (503)
Q Consensus 206 w~sg~~~g~~~~~~~~~~~~~~~~~~~---------~~~~~~~~--~rl~L~~dG~l~~~~~~~~~~~W~~~~~~p~~~C 274 (503)
||+|+|||..|+++|+|.....+.+.| +|.+.+.+ +|++||++|+||+|.|++..++|.+.|.+|.|+|
T Consensus 1 wrsG~WnG~~f~g~p~~~~~~~~~~~fv~~~~e~~~t~~~~~~s~~~r~~ld~~G~l~~~~w~~~~~~W~~~~~~p~d~C 80 (110)
T PF00954_consen 1 WRSGPWNGQRFSGIPEMSSNSLYNYSFVSNNEEVYYTYSLSNSSVLSRLVLDSDGQLQRYIWNESTQSWSVFWSAPKDQC 80 (110)
T ss_pred CCccccCCeEECCcccccccceeEEEEEECCCeEEEEEecCCCceEEEEEEeeeeEEEEEEEecCCCcEEEEEEecccCC
Confidence 899999999999999998656666665 34444554 9999999999999999999999999999999999
Q ss_pred ccccccCCCcccCCCCCCccccCCCCccCC
Q 010708 275 DVYGKCGAFGICNSQEKPICSCLEGFEPKN 304 (503)
Q Consensus 275 ~v~g~CG~~giC~~~~~~~C~C~~Gf~~~~ 304 (503)
|+|+.||+||+|+.+..+.|+|++||+|++
T Consensus 81 d~y~~CG~~g~C~~~~~~~C~Cl~GF~P~n 110 (110)
T PF00954_consen 81 DVYGFCGPNGICNSNNSPKCSCLPGFEPKN 110 (110)
T ss_pred CCccccCCccEeCCCCCCceECCCCcCCCc
Confidence 999999999999887788999999999874
No 4
>smart00108 B_lectin Bulb-type mannose-specific lectin.
Probab=99.91 E-value=8.2e-24 Score=182.47 Aligned_cols=113 Identities=41% Similarity=0.686 Sum_probs=98.8
Q ss_pred CccCCCCeEEeCCCeeEEEeeCCCCCCceEEEEEecCCCCCeEEEEeCCCCCCcCCccEEEEecCCcEEEEeCCCcEEEe
Q 010708 30 QLIRDPDAILSNGSNFKLGFFNPADSPYRYMGIWYDMPSEKAVIWVANRDNPLKDSSGIITISEDGNLVLVNGQKEVLWS 109 (503)
Q Consensus 30 ~~l~~~~~L~S~~g~f~~gf~~~~~~~~~~l~i~~~~~~~~~vvW~anr~~p~~~~~~~l~l~~~G~Lvl~~~~~~~~Ws 109 (503)
+.|..|+.|+|+++.|++|||.... ...+.+|||...+ .++||+|||+.|+.. ++.|.|++||+|+|++.++.++|+
T Consensus 2 ~~l~~~~~l~s~~~~f~~G~~~~~~-q~dgnlV~~~~~~-~~~vW~snt~~~~~~-~~~l~l~~dGnLvl~~~~g~~vW~ 78 (114)
T smart00108 2 NTLSSGQTLVSGNSLFELGFFTLIM-QNDYNLILYKSSS-RTVVWVANRDNPVSD-SCTLTLQSDGNLVLYDGDGRVVWS 78 (114)
T ss_pred cccCCCCEEecCCCcEeeeccccCC-CCCEEEEEEECCC-CcEEEECCCCCCCCC-CEEEEEeCCCCEEEEeCCCCEEEE
Confidence 5678899999999999999998764 3468889998765 789999999999765 389999999999999999999999
Q ss_pred ecCcCCccccceEEEccCCCEEEEeccCCcEEEeecCC
Q 010708 110 SNVSNLVNNSTSAQLLDSGNLVLRDNINRAIVWESFQE 147 (503)
Q Consensus 110 s~~~~~~~~~~~~~L~d~GNlVl~~~~~~~~~WqSFd~ 147 (503)
|++.... ....|+|+|+|||||++.. ++++||||||
T Consensus 79 S~t~~~~-~~~~~~L~ddGnlvl~~~~-~~~~W~Sf~~ 114 (114)
T smart00108 79 SNTTGAN-GNYVLVLLDDGNLVIYDSD-GNFLWQSFDY 114 (114)
T ss_pred ecccCCC-CceEEEEeCCCCEEEECCC-CCEEeCCCCC
Confidence 9986222 4688999999999999987 8899999997
No 5
>PF08276 PAN_2: PAN-like domain; InterPro: IPR013227 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs
Probab=99.44 E-value=1.1e-13 Score=107.02 Aligned_cols=57 Identities=46% Similarity=1.011 Sum_probs=49.2
Q ss_pred CCCeEEEeeeeCCCCccccCc---cchhhHHHhhcccCceeeeeecC---CCceeeecccccccc
Q 010708 337 EDGFSKLNKMKVPDFTEWTSP---ATEDECREQCLKNCSCIAYAFDG---GIGCMVWRSINLIDI 395 (503)
Q Consensus 337 ~~~f~~~~~~~~p~~~~~~~~---~~~~~C~~~Cl~nCsC~a~~y~~---g~gC~~~~~~~l~~~ 395 (503)
+++|+++++|++|+ +...++ .++++|++.||+||||+||+|.+ +++|++|++ +|+|+
T Consensus 4 ~d~F~~l~~~~~p~-~~~~~~~~~~s~~~C~~~Cl~nCsC~Ayay~~~~~~~~C~lW~~-~L~d~ 66 (66)
T PF08276_consen 4 GDGFLKLPNMKLPD-FDNAIVDSSVSLEECEKACLSNCSCTAYAYSNLSGGGGCLLWYG-DLVDL 66 (66)
T ss_pred CCEEEEECCeeCCC-CcceeeecCCCHHHHHhhcCCCCCEeeEEeeccCCCCEEEEEcC-EeecC
Confidence 78999999999999 544442 78999999999999999999984 466999999 88874
No 6
>cd00129 PAN_APPLE PAN/APPLE-like domain; present in N-terminal (N) domains of plasminogen/ hepatocyte growth factor proteins, plasma prekallikrein/coagulation factor XI and microneme antigen proteins, plant receptor-like protein kinases, and various nematode and leech anti-platelet proteins. Common structural features include two disulfide bonds that link the alpha-helix to the central region of the protein. PAN domains have significant functional versatility, fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=99.33 E-value=1.4e-12 Score=104.30 Aligned_cols=67 Identities=27% Similarity=0.385 Sum_probs=59.7
Q ss_pred CCCeEEEeeeeCCCCccccCccchhhHHHhhcc---cCceeeeeecC-CCceeeeccccc-ccceEecCCcceeEEE
Q 010708 337 EDGFSKLNKMKVPDFTEWTSPATEDECREQCLK---NCSCIAYAFDG-GIGCMVWRSINL-IDIQRLPFGGTDLYIR 408 (503)
Q Consensus 337 ~~~f~~~~~~~~p~~~~~~~~~~~~~C~~~Cl~---nCsC~a~~y~~-g~gC~~~~~~~l-~~~~~~~~~~~~~~ir 408 (503)
+..|+++.++++|+ +.. .+.++|++.|++ ||||.||+|.+ +.||.+|++ +| +|+++....+.++|+|
T Consensus 8 ~g~fl~~~~~klpd-~~~---~s~~eC~~~Cl~~~~nCsC~Aya~~~~~~gC~~W~~-~l~~d~~~~~~~g~~Ly~r 79 (80)
T cd00129 8 AGTTLIKIALKIKT-TKA---NTADECANRCEKNGLPFSCKAFVFAKARKQCLWFPF-NSMSGVRKEFSHGFDLYEN 79 (80)
T ss_pred CCeEEEeecccCCc-ccc---cCHHHHHHHHhcCCCCCCceeeeccCCCCCeEEecC-cchhhHHhccCCCceeEeE
Confidence 67899999999999 655 579999999999 99999999975 568999999 99 9998887778999997
No 7
>cd01098 PAN_AP_plant Plant PAN/APPLE-like domain; present in plant S-receptor protein kinases and secreted glycoproteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions. S-receptor protein kinases and S-locus glycoproteins are involved in sporophytic self-incompatibility response in Brassica, one of probably many molecular mechanisms, by which hermaphrodite flowering plants avoid self-fertilization.
Probab=99.29 E-value=6e-12 Score=101.93 Aligned_cols=72 Identities=44% Similarity=0.931 Sum_probs=58.7
Q ss_pred CCCeEEEeeeeCCCCccccCc-cchhhHHHhhcccCceeeeeecC-CCceeeecccccccceEecCCcceeEEEee
Q 010708 337 EDGFSKLNKMKVPDFTEWTSP-ATEDECREQCLKNCSCIAYAFDG-GIGCMVWRSINLIDIQRLPFGGTDLYIRVA 410 (503)
Q Consensus 337 ~~~f~~~~~~~~p~~~~~~~~-~~~~~C~~~Cl~nCsC~a~~y~~-g~gC~~~~~~~l~~~~~~~~~~~~~~ir~~ 410 (503)
.++|+++.++++|+ ...... .++++|++.||+||+|.||+|.+ +++|++|.. ++.+.+.....+..+|||++
T Consensus 11 ~~~f~~~~~~~~~~-~~~~~~~~s~~~C~~~Cl~nCsC~a~~~~~~~~~C~~~~~-~~~~~~~~~~~~~~~yiKv~ 84 (84)
T cd01098 11 TDGFLKLPDVKLPD-NASAITAISLEECREACLSNCSCTAYAYNNGSGGCLLWNG-LLNNLRSLSSGGGTLYLRLA 84 (84)
T ss_pred CCEEEEeCCeeCCC-chhhhccCCHHHHHHHHhcCCCcceeeecCCCCeEEEEec-eecceEeecCCCcEEEEEeC
Confidence 36899999999998 433223 68999999999999999999986 345999998 88887765555689999985
No 8
>smart00108 B_lectin Bulb-type mannose-specific lectin.
Probab=98.90 E-value=7.8e-09 Score=88.97 Aligned_cols=89 Identities=28% Similarity=0.454 Sum_probs=67.0
Q ss_pred EEEEecCCcEEEEeCC-CcEEEeecCcCCccccceEEEccCCCEEEEeccCCcEEEeecCCCCCCcCCCCeecccccCCc
Q 010708 88 IITISEDGNLVLVNGQ-KEVLWSSNVSNLVNNSTSAQLLDSGNLVLRDNINRAIVWESFQEPTDSFLPGMHHGIDQRTGK 166 (503)
Q Consensus 88 ~l~l~~~G~Lvl~~~~-~~~~Wss~~~~~~~~~~~~~L~d~GNlVl~~~~~~~~~WqSFd~PTDTlLpgq~l~~~~~tg~ 166 (503)
++.++.||+||+++.. ..++|++++..+......+.|+++|||||++.+ +.++|+|=..
T Consensus 23 ~~~~q~dgnlV~~~~~~~~~vW~snt~~~~~~~~~l~l~~dGnLvl~~~~-g~~vW~S~t~------------------- 82 (114)
T smart00108 23 TLIMQNDYNLILYKSSSRTVVWVANRDNPVSDSCTLTLQSDGNLVLYDGD-GRVVWSSNTT------------------- 82 (114)
T ss_pred ccCCCCCEEEEEEECCCCcEEEECCCCCCCCCCEEEEEeCCCCEEEEeCC-CCEEEEeccc-------------------
Confidence 5677889999999875 579999998654323478999999999999987 8899997220
Q ss_pred eEEEEecCCCCCCCCceEEEEEeeCCCceEEEecCCeeeeecC
Q 010708 167 KVQLTSWKSLSDPSTGSFSAGLIHQNIPEIFVWNVSRPYWRSG 209 (503)
Q Consensus 167 ~~~L~Sw~s~~dps~G~y~l~~~~~g~~~~~~~~~~~~yw~sg 209 (503)
...+.|.+.|+++|+..++- ..+++.|.|.
T Consensus 83 ------------~~~~~~~~~L~ddGnlvl~~-~~~~~~W~Sf 112 (114)
T smart00108 83 ------------GANGNYVLVLLDDGNLVIYD-SDGNFLWQSF 112 (114)
T ss_pred ------------CCCCceEEEEeCCCCEEEEC-CCCCEEeCCC
Confidence 12356889999999865542 2346788764
No 9
>cd00028 B_lectin Bulb-type mannose-specific lectin. The domain contains a three-fold internal repeat (beta-prism architecture). The consensus sequence motif QXDXNXVXY is involved in alpha-D-mannose recognition. Lectins are carbohydrate-binding proteins which specifically recognize diverse carbohydrates and mediate a wide variety of biological processes, such as cell-cell and host-pathogen interactions, serum glycoprotein turnover, and innate immune responses.
Probab=98.79 E-value=2.6e-08 Score=86.01 Aligned_cols=90 Identities=26% Similarity=0.436 Sum_probs=67.7
Q ss_pred EEEEec-CCcEEEEeCC-CcEEEeecCcCCccccceEEEccCCCEEEEeccCCcEEEeecCCCCCCcCCCCeecccccCC
Q 010708 88 IITISE-DGNLVLVNGQ-KEVLWSSNVSNLVNNSTSAQLLDSGNLVLRDNINRAIVWESFQEPTDSFLPGMHHGIDQRTG 165 (503)
Q Consensus 88 ~l~l~~-~G~Lvl~~~~-~~~~Wss~~~~~~~~~~~~~L~d~GNlVl~~~~~~~~~WqSFd~PTDTlLpgq~l~~~~~tg 165 (503)
.+.++. ||+||+++.. ..++|++++..+......+.|+++|||||++.+ +.++|+|--..
T Consensus 23 ~~~~q~~dgnlv~~~~~~~~~vW~snt~~~~~~~~~l~l~~dGnLvl~~~~-g~~vW~S~~~~----------------- 84 (116)
T cd00028 23 KLIMQSRDYNLILYKGSSRTVVWVANRDNPSGSSCTLTLQSDGNLVIYDGS-GTVVWSSNTTR----------------- 84 (116)
T ss_pred cCCCCCCeEEEEEEeCCCCeEEEECCCCCCCCCCEEEEEecCCCeEEEcCC-CcEEEEecccC-----------------
Confidence 466776 9999999875 479999998653224678999999999999987 88999874310
Q ss_pred ceEEEEecCCCCCCCCceEEEEEeeCCCceEEEecCCeeeeecCC
Q 010708 166 KKVQLTSWKSLSDPSTGSFSAGLIHQNIPEIFVWNVSRPYWRSGP 210 (503)
Q Consensus 166 ~~~~L~Sw~s~~dps~G~y~l~~~~~g~~~~~~~~~~~~yw~sg~ 210 (503)
..+.+.+.|+++|+..++- ..+.+.|.|..
T Consensus 85 --------------~~~~~~~~L~ddGnlvl~~-~~~~~~W~Sf~ 114 (116)
T cd00028 85 --------------VNGNYVLVLLDDGNLVLYD-SDGNFLWQSFD 114 (116)
T ss_pred --------------CCCceEEEEeCCCCEEEEC-CCCCEEEcCCC
Confidence 2356889999999855543 23567898764
No 10
>PF01453 B_lectin: D-mannose binding lectin; InterPro: IPR001480 A bulb lectin super-family (Amaryllidaceae, Orchidaceae and Aliaceae) contains a ~115-residue-long domain whose overall three dimensional fold is very similar to that of [, ]: Dictyostelium discoideum comitin, an actin binding protein Curculigo latifolia curculin, a sweet tasting and taste-modifying protein This domain generally binds mannose, but in at least one protein, curculin, it is apparently devoid of mannose-binding activity. Each bulb-type lectin domain consists of three sequential beta-sheet subdomains (I, II, III) that are inter-related by pseudo three-fold symmetry. The three subdomains are flat four-stranded, antiparrallel beta-sheets. Together they form a 12-stranded beta-barrel in which the barrel axis coincides with the pseudo 3-fold axis.; GO: 0005529 sugar binding; PDB: 3M7H_A 3M7J_B 3MEZ_D 1DLP_A 1BWU_D 1KJ1_A 1B2P_A 1XD6_A 2DPF_C 2D04_B ....
Probab=98.74 E-value=1.2e-07 Score=81.63 Aligned_cols=100 Identities=27% Similarity=0.450 Sum_probs=70.5
Q ss_pred CeEEeCCCeeEEEeeCCCCCCceEEEEEecCCCCCeEEEEe-CCCCCCcCCccEEEEecCCcEEEEeCCCcEEEeecCcC
Q 010708 36 DAILSNGSNFKLGFFNPADSPYRYMGIWYDMPSEKAVIWVA-NRDNPLKDSSGIITISEDGNLVLVNGQKEVLWSSNVSN 114 (503)
Q Consensus 36 ~~L~S~~g~f~~gf~~~~~~~~~~l~i~~~~~~~~~vvW~a-nr~~p~~~~~~~l~l~~~G~Lvl~~~~~~~~Wss~~~~ 114 (503)
+.+.+.+|.+.|-|+.+|+ | +.|.. ...+||.+ +...... ..+.+.|++||||||+|..+.++|+|...
T Consensus 12 ~p~~~~s~~~~L~l~~dGn-----L-vl~~~--~~~~iWss~~t~~~~~-~~~~~~L~~~GNlvl~d~~~~~lW~Sf~~- 81 (114)
T PF01453_consen 12 SPLTSSSGNYTLILQSDGN-----L-VLYDS--NGSVIWSSNNTSGRGN-SGCYLVLQDDGNLVLYDSSGNVLWQSFDY- 81 (114)
T ss_dssp EEEEECETTEEEEEETTSE-----E-EEEET--TTEEEEE--S-TTSS--SSEEEEEETTSEEEEEETTSEEEEESTTS-
T ss_pred cccccccccccceECCCCe-----E-EEEcC--CCCEEEEecccCCccc-cCeEEEEeCCCCEEEEeecceEEEeecCC-
Confidence 4566656999999999985 2 33432 36789999 5454432 35899999999999999999999999532
Q ss_pred CccccceEEEcc--CCCEEEEeccCCcEEEeecCCCC
Q 010708 115 LVNNSTSAQLLD--SGNLVLRDNINRAIVWESFQEPT 149 (503)
Q Consensus 115 ~~~~~~~~~L~d--~GNlVl~~~~~~~~~WqSFd~PT 149 (503)
++ .+.+.+++ +||++ +... ..+.|.|=+.|.
T Consensus 82 pt--dt~L~~q~l~~~~~~-~~~~-~~~sw~s~~dps 114 (114)
T PF01453_consen 82 PT--DTLLPGQKLGDGNVT-GKND-SLTSWSSNTDPS 114 (114)
T ss_dssp SS---EEEEEET--TSEEE-EEST-SSEEEESS----
T ss_pred Cc--cEEEeccCcccCCCc-cccc-eEEeECCCCCCC
Confidence 33 57788888 99999 7654 678999987773
No 11
>smart00473 PAN_AP divergent subfamily of APPLE domains. Apple-like domains present in Plasminogen, C. elegans hypothetical ORFs and the extracellular portion of plant receptor-like protein kinases. Predicted to possess protein- and/or carbohydrate-binding functions.
Probab=98.31 E-value=1.6e-06 Score=68.58 Aligned_cols=70 Identities=37% Similarity=0.835 Sum_probs=52.3
Q ss_pred CCCeEEEeeeeCCCCccccCc--cchhhHHHhhcc-cCceeeeeecC-CCceeeec-ccccccceEecCCcceeEEE
Q 010708 337 EDGFSKLNKMKVPDFTEWTSP--ATEDECREQCLK-NCSCIAYAFDG-GIGCMVWR-SINLIDIQRLPFGGTDLYIR 408 (503)
Q Consensus 337 ~~~f~~~~~~~~p~~~~~~~~--~~~~~C~~~Cl~-nCsC~a~~y~~-g~gC~~~~-~~~l~~~~~~~~~~~~~~ir 408 (503)
...|.+++++.+++ ...... .+.++|++.|++ +|+|.||.|.. +.+|.+|. + ++.+.......+..+|.|
T Consensus 3 ~~~f~~~~~~~l~~-~~~~~~~~~s~~~C~~~C~~~~~~C~s~~y~~~~~~C~l~~~~-~~~~~~~~~~~~~~~y~~ 77 (78)
T smart00473 3 DDCFVRLPNTKLPG-FSRIVISVASLEECASKCLNSNCSCRSFTYNNGTKGCLLWSES-SLGDARLFPSGGVDLYEK 77 (78)
T ss_pred CceeEEecCccCCC-CcceeEcCCCHHHHHHHhCCCCCceEEEEEcCCCCEEEEeeCC-ccccceecccCCceeEEe
Confidence 45688888888876 332212 689999999999 99999999985 44599999 6 777776444455666654
No 12
>KOG1187 consensus Serine/threonine protein kinase [Signal transduction mechanisms]
Probab=98.02 E-value=3.2e-06 Score=87.42 Aligned_cols=37 Identities=46% Similarity=0.774 Sum_probs=34.8
Q ss_pred CCCcccHHHHHHHhcCCCcCCcccccCCccEEEEEcC
Q 010708 467 DLPLFQFEELATATNNFQLSSKLGQGGFGPVYWVMVH 503 (503)
Q Consensus 467 ~~~~fs~~el~~AT~nFs~~n~LG~GgfG~VYKG~L~ 503 (503)
..+.|+|.+|.+||++|+++++||+||||.||||+|+
T Consensus 61 ~~~~fs~~el~~AT~~Fs~~~~ig~Ggfg~VYkG~l~ 97 (361)
T KOG1187|consen 61 PLRSFSYDELRKATNNFSESNLIGEGGFGTVYKGVLS 97 (361)
T ss_pred CcceeeHHHHHHHHhCCchhcceecCCCeEEEEEEEC
Confidence 5677999999999999999999999999999999984
No 13
>cd01100 APPLE_Factor_XI_like Subfamily of PAN/APPLE-like domains; present in plasma prekallikrein/coagulation factor XI, microneme antigen proteins, and a few prokaryotic proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=97.20 E-value=0.0003 Score=55.29 Aligned_cols=32 Identities=19% Similarity=0.699 Sum_probs=27.7
Q ss_pred cchhhHHHhhcccCceeeeeecCCCc-eeeecc
Q 010708 358 ATEDECREQCLKNCSCIAYAFDGGIG-CMVWRS 389 (503)
Q Consensus 358 ~~~~~C~~~Cl~nCsC~a~~y~~g~g-C~~~~~ 389 (503)
.+.++|++.|+.+|+|.||.|..+.+ |+++..
T Consensus 25 ~s~~~Cq~~C~~~~~C~afT~~~~~~~C~lk~~ 57 (73)
T cd01100 25 SSAEQCQAACTADPGCLAFTYNTKSKKCFLKSS 57 (73)
T ss_pred CCHHHHHHHcCCCCCceEEEEECCCCeEEcccC
Confidence 57899999999999999999986545 999765
No 14
>PLN00113 leucine-rich repeat receptor-like protein kinase; Provisional
Probab=94.92 E-value=0.036 Score=64.94 Aligned_cols=29 Identities=10% Similarity=0.237 Sum_probs=22.6
Q ss_pred ccHHHHHHHhcCCCcCCcccccCCccEEEEEc
Q 010708 471 FQFEELATATNNFQLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 471 fs~~el~~AT~nFs~~n~LG~GgfG~VYKG~L 502 (503)
++++++. ..+.+.++||+|+||.||||+.
T Consensus 683 ~~~~~~~---~~~~~~~~ig~G~~g~Vy~~~~ 711 (968)
T PLN00113 683 ITINDIL---SSLKEENVISRGKKGASYKGKS 711 (968)
T ss_pred hhHHHHH---hhCCcccEEccCCCeeEEEEEE
Confidence 4455544 4577889999999999999974
No 15
>KOG3653 consensus Transforming growth factor beta/activin receptor subfamily of serine/threonine kinases [Signal transduction mechanisms]
Probab=92.95 E-value=0.035 Score=57.98 Aligned_cols=17 Identities=29% Similarity=0.597 Sum_probs=15.4
Q ss_pred CcccccCCccEEEEEcC
Q 010708 487 SKLGQGGFGPVYWVMVH 503 (503)
Q Consensus 487 n~LG~GgfG~VYKG~L~ 503 (503)
.+||+|+||.||||.|.
T Consensus 216 eli~~Grfg~V~KaqL~ 232 (534)
T KOG3653|consen 216 ELIGRGRFGCVWKAQLD 232 (534)
T ss_pred HHhhcCccceeehhhcc
Confidence 68999999999999873
No 16
>PF00024 PAN_1: PAN domain This Prosite entry concerns apple domains, a subset of PAN domains; InterPro: IPR003014 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs It has been shown that, the N-terminal N domains of members of the plasminogen/hepatocyte growth factor family, the apple domains of the plasma prekallikrein/coagulation factor XI family, and domains of various nematode proteins belong to the same module superfamily, the PAN module []. PAN contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge that links the N and C termini of the domain.; PDB: 1GP9_C 2QJ2_B 1GMO_H 1NK1_B 3MKP_B 1BHT_B 3HN4_A 1GMN_A 3HMS_A 3HMT_B ....
Probab=90.02 E-value=0.4 Score=37.31 Aligned_cols=49 Identities=18% Similarity=0.506 Sum_probs=35.0
Q ss_pred eEEEeeeeCCCCccccCc--cchhhHHHhhcccCc-eeeeeecCCCc-eeeecc
Q 010708 340 FSKLNKMKVPDFTEWTSP--ATEDECREQCLKNCS-CIAYAFDGGIG-CMVWRS 389 (503)
Q Consensus 340 f~~~~~~~~p~~~~~~~~--~~~~~C~~~Cl~nCs-C~a~~y~~g~g-C~~~~~ 389 (503)
|.++.+..+.. ...... .++++|.+.|+.+=. |.+|.|....+ |.+...
T Consensus 4 f~~~~~~~l~~-~~~~~~~v~s~~~C~~~C~~~~~~C~s~~y~~~~~~C~L~~~ 56 (79)
T PF00024_consen 4 FERIPGYRLSG-HSIKEINVPSLEECAQLCLNEPRRCKSFNYDPSSKTCYLSSS 56 (79)
T ss_dssp EEEEEEEEEES-CEEEEEEESSHHHHHHHHHHSTT-ESEEEEETTTTEEEEECS
T ss_pred eEEECCEEEeC-CcceEEcCCCHHHHHhhcCcCcccCCeEEEECCCCEEEEcCC
Confidence 55666665544 221111 479999999999999 99999986544 998755
No 17
>PLN03224 probable serine/threonine protein kinase; Provisional
Probab=89.27 E-value=0.16 Score=55.04 Aligned_cols=24 Identities=38% Similarity=0.696 Sum_probs=21.0
Q ss_pred HhcCCCcCCcccccCCccEEEEEc
Q 010708 479 ATNNFQLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 479 AT~nFs~~n~LG~GgfG~VYKG~L 502 (503)
..++|...++||+|+||+||||.+
T Consensus 143 ~~d~F~i~~~LG~GgFG~VYkG~~ 166 (507)
T PLN03224 143 SSDDFQLRDKLGGGNFGITFEGLR 166 (507)
T ss_pred cccCceEeeEeecCCCeEEEEEEe
Confidence 367899999999999999999963
No 18
>KOG1025 consensus Epidermal growth factor receptor EGFR and related tyrosine kinases [Signal transduction mechanisms]
Probab=89.15 E-value=0.34 Score=54.20 Aligned_cols=19 Identities=37% Similarity=0.601 Sum_probs=15.8
Q ss_pred CcCCcccccCCccEEEEEc
Q 010708 484 QLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 484 s~~n~LG~GgfG~VYKG~L 502 (503)
.+..+||+|.||+||||+.
T Consensus 699 kk~kvLGsgAfGtV~kGiw 717 (1177)
T KOG1025|consen 699 KKDKVLGSGAFGTVYKGIW 717 (1177)
T ss_pred hhhceeccccceeEEeeeE
Confidence 4447899999999999963
No 19
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=86.79 E-value=0.18 Score=34.93 Aligned_cols=32 Identities=31% Similarity=0.669 Sum_probs=25.0
Q ss_pred cccccc-cccCCCcccC-CCCCCccccCCCCccC
Q 010708 272 TECDVY-GKCGAFGICN-SQEKPICSCLEGFEPK 303 (503)
Q Consensus 272 ~~C~v~-g~CG~~giC~-~~~~~~C~C~~Gf~~~ 303 (503)
|.|... ..|..++.|. ..++..|.|++||+..
T Consensus 3 dEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~ 36 (42)
T PF07645_consen 3 DECAEGPHNCPENGTCVNTEGSYSCSCPPGYELN 36 (42)
T ss_dssp STTTTTSSSSSTTSEEEEETTEEEEEESTTEEEC
T ss_pred cccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEEC
Confidence 567764 5899999995 3467899999999843
No 20
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=86.57 E-value=15 Score=38.31 Aligned_cols=77 Identities=19% Similarity=0.359 Sum_probs=48.8
Q ss_pred CCCCeEEEEeCCCC----------------CCcCCccEEEEe-cCCcEEEEeC-CCcEEEeecCcCCc------cccceE
Q 010708 67 PSEKAVIWVANRDN----------------PLKDSSGIITIS-EDGNLVLVNG-QKEVLWSSNVSNLV------NNSTSA 122 (503)
Q Consensus 67 ~~~~~vvW~anr~~----------------p~~~~~~~l~l~-~~G~Lvl~~~-~~~~~Wss~~~~~~------~~~~~~ 122 (503)
..+..++|..+-.. |+.. ...+.+. .+|.|+-+|. +|.++|+....+.. .+....
T Consensus 86 ~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~v~-~~~v~v~~~~g~l~ald~~tG~~~W~~~~~~~~~ssP~v~~~~v~ 164 (394)
T PRK11138 86 ADTGKEIWSVDLSEKDGWFSKNKSALLSGGVTVA-GGKVYIGSEKGQVYALNAEDGEVAWQTKVAGEALSRPVVSDGLVL 164 (394)
T ss_pred CCCCcEeeEEcCCCcccccccccccccccccEEE-CCEEEEEcCCCEEEEEECCCCCCcccccCCCceecCCEEECCEEE
Confidence 34678899876433 1111 2344444 5688888885 68999998754321 012223
Q ss_pred EEccCCCEEEEeccCCcEEEee
Q 010708 123 QLLDSGNLVLRDNINRAIVWES 144 (503)
Q Consensus 123 ~L~d~GNlVl~~~~~~~~~WqS 144 (503)
....+|.|+-.|..+|+++|+-
T Consensus 165 v~~~~g~l~ald~~tG~~~W~~ 186 (394)
T PRK11138 165 VHTSNGMLQALNESDGAVKWTV 186 (394)
T ss_pred EECCCCEEEEEEccCCCEeeee
Confidence 3456788888888779999986
No 21
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=83.47 E-value=0.81 Score=29.33 Aligned_cols=30 Identities=23% Similarity=0.615 Sum_probs=22.3
Q ss_pred cccccccCCCcccCCC-CCCccccCCCCccC
Q 010708 274 CDVYGKCGAFGICNSQ-EKPICSCLEGFEPK 303 (503)
Q Consensus 274 C~v~g~CG~~giC~~~-~~~~C~C~~Gf~~~ 303 (503)
|.....|..++.|... +...|.|++||...
T Consensus 2 C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 4435678888999653 56789999999743
No 22
>PF14295 PAN_4: PAN domain; PDB: 2YIL_E 2YIP_C 2YIO_A.
Probab=82.36 E-value=0.96 Score=32.05 Aligned_cols=23 Identities=26% Similarity=0.810 Sum_probs=17.5
Q ss_pred cchhhHHHhhcccCceeeeeecC
Q 010708 358 ATEDECREQCLKNCSCIAYAFDG 380 (503)
Q Consensus 358 ~~~~~C~~~Cl~nCsC~a~~y~~ 380 (503)
.+.++|.++|..+=.|.++.|..
T Consensus 16 ~s~~~C~~~C~~~~~C~~~~~~~ 38 (51)
T PF14295_consen 16 SSPEECQAACAADPGCQAFTFNP 38 (51)
T ss_dssp --HHHHHHHHHTSTT--EEEEET
T ss_pred CCHHHHHHHccCCCCCCEEEEEC
Confidence 57899999999999999999874
No 23
>PLN03225 Serine/threonine-protein kinase SNT7; Provisional
Probab=82.22 E-value=0.53 Score=51.88 Aligned_cols=24 Identities=42% Similarity=0.787 Sum_probs=21.2
Q ss_pred HhcCCCcCCcccccCCccEEEEEc
Q 010708 479 ATNNFQLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 479 AT~nFs~~n~LG~GgfG~VYKG~L 502 (503)
..++|...++||+|+||.||||++
T Consensus 130 ~~~~y~l~~~LG~G~FG~VYka~~ 153 (566)
T PLN03225 130 KKDDFVLGKKLGEGAFGVVYKASL 153 (566)
T ss_pred ccCCeEEeEEEeeCCCeEEEEEEE
Confidence 567788889999999999999975
No 24
>smart00223 APPLE APPLE domain. Four-fold repeat in plasma kallikrein and coagulation factor XI. Factor XI apple 3 mediates binding to platelets. Factor XI apple 1 binds high-molecular-mass kininogen. Apple 4 in factor XI mediates dimer formation and binds to factor XIIa. Mutations in apple 4 cause factor XI deficiency, an inherited bleeding disorder.
Probab=82.16 E-value=1.4 Score=35.07 Aligned_cols=32 Identities=19% Similarity=0.613 Sum_probs=26.2
Q ss_pred cchhhHHHhhcccCceeeeeecC-CC---ceeeecc
Q 010708 358 ATEDECREQCLKNCSCIAYAFDG-GI---GCMVWRS 389 (503)
Q Consensus 358 ~~~~~C~~~Cl~nCsC~a~~y~~-g~---gC~~~~~ 389 (503)
.+.++|+++|..+=.|.+|.|.. .. .|+++..
T Consensus 22 ~~~~~Cq~~Ct~~~~C~~FTf~~~~~~~~~C~LK~s 57 (79)
T smart00223 22 PSAQVCQKRCTSHPRCLFFTFSTNEPPEEKCLLKDS 57 (79)
T ss_pred CCHHHHHHhhcCCCCccEEEeeCCCCCCCEeEeCcC
Confidence 56899999999999999999974 23 5888643
No 25
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=79.70 E-value=1.5 Score=29.03 Aligned_cols=30 Identities=27% Similarity=0.613 Sum_probs=22.4
Q ss_pred cccccccccCCCcccCC-CCCCccccCCCCc
Q 010708 272 TECDVYGKCGAFGICNS-QEKPICSCLEGFE 301 (503)
Q Consensus 272 ~~C~v~g~CG~~giC~~-~~~~~C~C~~Gf~ 301 (503)
+.|.....|...+.|.. .+...|.|++||.
T Consensus 3 ~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~ 33 (39)
T smart00179 3 DECASGNPCQNGGTCVNTVGSYRCECPPGYT 33 (39)
T ss_pred ccCcCCCCcCCCCEeECCCCCeEeECCCCCc
Confidence 45665467888889964 3467899999997
No 26
>KOG0196 consensus Tyrosine kinase, EPH (ephrin) receptor family [Signal transduction mechanisms]
Probab=76.96 E-value=0.87 Score=50.80 Aligned_cols=34 Identities=24% Similarity=0.426 Sum_probs=25.1
Q ss_pred cccHHHHHHHhcCCC---------cCCcccccCCccEEEEEcC
Q 010708 470 LFQFEELATATNNFQ---------LSSKLGQGGFGPVYWVMVH 503 (503)
Q Consensus 470 ~fs~~el~~AT~nFs---------~~n~LG~GgfG~VYKG~L~ 503 (503)
.++|+|=-+|...|. -+.+||.|-||-||+|+|+
T Consensus 609 P~TYEDPnqAvreFakEId~s~i~Ie~VIGaGEFGEVc~GrLk 651 (996)
T KOG0196|consen 609 PHTYEDPNQAVREFAKEIDPSCVKIEKVIGAGEFGEVCSGRLK 651 (996)
T ss_pred CccccCccHHHHHhhhhcChhheEEEEEEecccccceeccccc
Confidence 467776555555543 2378999999999999984
No 27
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=76.44 E-value=1.9 Score=28.02 Aligned_cols=25 Identities=28% Similarity=0.650 Sum_probs=19.8
Q ss_pred cccCCCcccCCCCCCccccCCCCccC
Q 010708 278 GKCGAFGICNSQEKPICSCLEGFEPK 303 (503)
Q Consensus 278 g~CG~~giC~~~~~~~C~C~~Gf~~~ 303 (503)
.+|..+|+|... ...|.|.+||...
T Consensus 6 ~~C~~~G~C~~~-~g~C~C~~g~~G~ 30 (32)
T PF07974_consen 6 NICSGHGTCVSP-CGRCVCDSGYTGP 30 (32)
T ss_pred CccCCCCEEeCC-CCEEECCCCCcCC
Confidence 479999999753 4689999999753
No 28
>cd01099 PAN_AP_HGF Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=76.06 E-value=4.1 Score=32.35 Aligned_cols=32 Identities=22% Similarity=0.663 Sum_probs=26.3
Q ss_pred cchhhHHHhhcc--cCceeeeeecCCCc-eeeecc
Q 010708 358 ATEDECREQCLK--NCSCIAYAFDGGIG-CMVWRS 389 (503)
Q Consensus 358 ~~~~~C~~~Cl~--nCsC~a~~y~~g~g-C~~~~~ 389 (503)
.+.++|.+.|++ +=.|.+|.|....+ |.+-..
T Consensus 25 ~s~~~C~~~C~~~~~f~CrSf~y~~~~~~C~L~~~ 59 (80)
T cd01099 25 ASLEECLRKCLEETEFTCRSFNYNYKSKECILSDE 59 (80)
T ss_pred CCHHHHHHHhCCCCCceEeEEEEEcCCCEEEEeCC
Confidence 578999999999 88999999975444 988543
No 29
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=75.23 E-value=2 Score=26.01 Aligned_cols=12 Identities=33% Similarity=0.830 Sum_probs=10.2
Q ss_pred CccccCCCCccC
Q 010708 292 PICSCLEGFEPK 303 (503)
Q Consensus 292 ~~C~C~~Gf~~~ 303 (503)
..|.|++||+..
T Consensus 2 y~C~C~~Gy~l~ 13 (24)
T PF12662_consen 2 YTCSCPPGYQLS 13 (24)
T ss_pred EEeeCCCCCcCC
Confidence 479999999965
No 30
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=73.30 E-value=0.95 Score=23.17 Aligned_cols=11 Identities=27% Similarity=0.839 Sum_probs=7.8
Q ss_pred ccccCCCCccC
Q 010708 293 ICSCLEGFEPK 303 (503)
Q Consensus 293 ~C~C~~Gf~~~ 303 (503)
.|.|++||...
T Consensus 1 ~C~C~~G~~G~ 11 (13)
T PF12661_consen 1 TCQCPPGWTGP 11 (13)
T ss_dssp EEEE-TTEETT
T ss_pred CccCcCCCcCC
Confidence 48999999754
No 31
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=72.97 E-value=2.9 Score=27.08 Aligned_cols=31 Identities=29% Similarity=0.613 Sum_probs=22.3
Q ss_pred cccccccccCCCcccCCC-CCCccccCCCCcc
Q 010708 272 TECDVYGKCGAFGICNSQ-EKPICSCLEGFEP 302 (503)
Q Consensus 272 ~~C~v~g~CG~~giC~~~-~~~~C~C~~Gf~~ 302 (503)
+.|.....|...+.|... +...|.|++||..
T Consensus 3 ~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g 34 (38)
T cd00054 3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34 (38)
T ss_pred ccCCCCCCcCCCCEeECCCCCeEeECCCCCcC
Confidence 456554578888899643 4568999999874
No 32
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=70.40 E-value=3.4 Score=29.79 Aligned_cols=33 Identities=18% Similarity=0.510 Sum_probs=27.2
Q ss_pred cccccccccccccCCCcccCCCCCCccccCCCCccC
Q 010708 268 LNLRTECDVYGKCGAFGICNSQEKPICSCLEGFEPK 303 (503)
Q Consensus 268 ~~p~~~C~v~g~CG~~giC~~~~~~~C~C~~Gf~~~ 303 (503)
..|.+.|....-|-.+++|.. ..|.|++||.+.
T Consensus 16 ~~~g~~C~~~~qC~~~s~C~~---g~C~C~~g~~~~ 48 (52)
T PF01683_consen 16 VQPGESCESDEQCIGGSVCVN---GRCQCPPGYVEV 48 (52)
T ss_pred CCCCCCCCCcCCCCCcCEEcC---CEeECCCCCEec
Confidence 346678999999999999953 589999999765
No 33
>KOG1219 consensus Uncharacterized conserved protein, contains laminin, cadherin and EGF domains [Signal transduction mechanisms]
Probab=68.75 E-value=6.7 Score=48.82 Aligned_cols=27 Identities=26% Similarity=0.525 Sum_probs=21.2
Q ss_pred cccCCCcccCCC--CCCccccCCCCccCC
Q 010708 278 GKCGAFGICNSQ--EKPICSCLEGFEPKN 304 (503)
Q Consensus 278 g~CG~~giC~~~--~~~~C~C~~Gf~~~~ 304 (503)
..|---|.|+.. ++..|.||+-|..+.
T Consensus 3870 npCqhgG~C~~~~~ggy~CkCpsqysG~~ 3898 (4289)
T KOG1219|consen 3870 NPCQHGGTCISQPKGGYKCKCPSQYSGNH 3898 (4289)
T ss_pred CcccCCCEecCCCCCceEEeCcccccCcc
Confidence 567788899754 577999999998764
No 34
>PTZ00284 protein kinase; Provisional
Probab=68.16 E-value=1.3 Score=47.33 Aligned_cols=30 Identities=33% Similarity=0.473 Sum_probs=23.5
Q ss_pred HHHHHHHhcCCCcCCcccccCCccEEEEEc
Q 010708 473 FEELATATNNFQLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 473 ~~el~~AT~nFs~~n~LG~GgfG~VYKG~L 502 (503)
.+++...++.|...++||+|+||+||+++.
T Consensus 121 ~~~~~~~~~~y~i~~~lG~G~fg~V~~a~~ 150 (467)
T PTZ00284 121 GEDIDVSTQRFKILSLLGEGTFGKVVEAWD 150 (467)
T ss_pred CCccccCCCcEEEEEEEEeccCEEEEEEEE
Confidence 344555677788778999999999999864
No 35
>PF08277 PAN_3: PAN-like domain; InterPro: IPR006583 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs The PAN-3 or CW is a domain associated with a number of Caenorhabditis elegans hypothetical proteins.
Probab=67.11 E-value=15 Score=28.04 Aligned_cols=30 Identities=23% Similarity=0.703 Sum_probs=25.8
Q ss_pred cchhhHHHhhcccCceeeeeecCCCceeeec
Q 010708 358 ATEDECREQCLKNCSCIAYAFDGGIGCMVWR 388 (503)
Q Consensus 358 ~~~~~C~~~Cl~nCsC~a~~y~~g~gC~~~~ 388 (503)
.+-++|-+.|.++=.|..+.+. ...|.++.
T Consensus 20 ~sw~~Cv~~C~~~~~C~la~~~-~~~C~~y~ 49 (71)
T PF08277_consen 20 TSWDDCVQKCYNDENCVLAYFD-SGKCYLYN 49 (71)
T ss_pred CCHHHHhHHhCCCCEEEEEEeC-CCCEEEEE
Confidence 5679999999999999998887 56798864
No 36
>KOG0193 consensus Serine/threonine protein kinase RAF [Signal transduction mechanisms]
Probab=67.03 E-value=2.5 Score=45.92 Aligned_cols=16 Identities=38% Similarity=0.734 Sum_probs=14.4
Q ss_pred CcccccCCccEEEEEc
Q 010708 487 SKLGQGGFGPVYWVMV 502 (503)
Q Consensus 487 n~LG~GgfG~VYKG~L 502 (503)
..||+|.||+||||.-
T Consensus 398 ~rIGsGsFGtV~Rg~w 413 (678)
T KOG0193|consen 398 ERIGSGSFGTVYRGRW 413 (678)
T ss_pred ceeccccccceeeccc
Confidence 6899999999999963
No 37
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=66.41 E-value=1.3 Score=28.65 Aligned_cols=26 Identities=27% Similarity=0.613 Sum_probs=19.3
Q ss_pred cccCCCcccCC-C-CCCccccCCCCccC
Q 010708 278 GKCGAFGICNS-Q-EKPICSCLEGFEPK 303 (503)
Q Consensus 278 g~CG~~giC~~-~-~~~~C~C~~Gf~~~ 303 (503)
..|...|.|.. . ....|.|++||...
T Consensus 4 ~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 4 NPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 36777888853 2 56789999999753
No 38
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=65.88 E-value=64 Score=30.27 Aligned_cols=78 Identities=23% Similarity=0.384 Sum_probs=48.9
Q ss_pred CCCCeEEEEeCCCCCCcCC----cc-EEEEecCCcEEEEe-CCCcEEEee-cCcC---C--cc------ccceEEEccCC
Q 010708 67 PSEKAVIWVANRDNPLKDS----SG-IITISEDGNLVLVN-GQKEVLWSS-NVSN---L--VN------NSTSAQLLDSG 128 (503)
Q Consensus 67 ~~~~~vvW~anr~~p~~~~----~~-~l~l~~~G~Lvl~~-~~~~~~Wss-~~~~---~--~~------~~~~~~L~d~G 128 (503)
....+++|....+.++... .. .+....+|.|..+| .+|.++|.. .... . .. +...+....+|
T Consensus 53 ~~tG~~~W~~~~~~~~~~~~~~~~~~v~v~~~~~~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 132 (238)
T PF13360_consen 53 AKTGKVLWRFDLPGPISGAPVVDGGRVYVGTSDGSLYALDAKTGKVLWSIYLTSSPPAGVRSSSSPAVDGDRLYVGTSSG 132 (238)
T ss_dssp TTTSEEEEEEECSSCGGSGEEEETTEEEEEETTSEEEEEETTTSCEEEEEEE-SSCTCSTB--SEEEEETTEEEEEETCS
T ss_pred CCCCCEEEEeeccccccceeeecccccccccceeeeEecccCCcceeeeeccccccccccccccCceEecCEEEEEeccC
Confidence 3568899998876654322 13 34455678888888 689999994 3221 0 00 01122233488
Q ss_pred CEEEEeccCCcEEEee
Q 010708 129 NLVLRDNINRAIVWES 144 (503)
Q Consensus 129 NlVl~~~~~~~~~WqS 144 (503)
.|+..|..+|+++|+-
T Consensus 133 ~l~~~d~~tG~~~w~~ 148 (238)
T PF13360_consen 133 KLVALDPKTGKLLWKY 148 (238)
T ss_dssp EEEEEETTTTEEEEEE
T ss_pred cEEEEecCCCcEEEEe
Confidence 9999987669999986
No 39
>smart00605 CW CW domain.
Probab=65.69 E-value=18 Score=29.47 Aligned_cols=53 Identities=17% Similarity=0.473 Sum_probs=34.2
Q ss_pred cchhhHHHhhcccCceeeeeecCCCceeeecccccccceEecC-CcceeEEEee
Q 010708 358 ATEDECREQCLKNCSCIAYAFDGGIGCMVWRSINLIDIQRLPF-GGTDLYIRVA 410 (503)
Q Consensus 358 ~~~~~C~~~Cl~nCsC~a~~y~~g~gC~~~~~~~l~~~~~~~~-~~~~~~ir~~ 410 (503)
.+-++|...|..+..|..+.......|++..-..+..+++... .+..+=+|+.
T Consensus 22 ~sw~~Ci~~C~~~~~Cvlay~~~~~~C~~f~~~~~~~v~~~~~~~~~~VAfK~~ 75 (94)
T smart00605 22 LSWDECIQKCYEDSNCVLAYGNSSETCYLFSYGTVLTVKKLSSSSGKKVAFKVS 75 (94)
T ss_pred CCHHHHHHHHhCCCceEEEecCCCCceEEEEcCCeEEEEEccCCCCcEEEEEEe
Confidence 5689999999999999987654335698753214555555443 3344555554
No 40
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=64.11 E-value=1.5 Score=29.34 Aligned_cols=27 Identities=26% Similarity=0.690 Sum_probs=19.1
Q ss_pred ccccCCCcccCCC-CCCccccCCCCccC
Q 010708 277 YGKCGAFGICNSQ-EKPICSCLEGFEPK 303 (503)
Q Consensus 277 ~g~CG~~giC~~~-~~~~C~C~~Gf~~~ 303 (503)
.+-|.++..|... +...|.|.+||...
T Consensus 5 ~~~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 5 NGGCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GGGS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCCCcEeecCCCCEEeECCCCCccC
Confidence 4678999999653 46789999999864
No 41
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=63.68 E-value=4.2 Score=26.65 Aligned_cols=18 Identities=28% Similarity=0.654 Sum_probs=13.9
Q ss_pred ccCCCCCCccccCCCCcc
Q 010708 285 ICNSQEKPICSCLEGFEP 302 (503)
Q Consensus 285 iC~~~~~~~C~C~~Gf~~ 302 (503)
.|..+....|.||.||-.
T Consensus 11 ~CDpn~~~~C~CPeGyIl 28 (34)
T PF09064_consen 11 DCDPNSPGQCFCPEGYIL 28 (34)
T ss_pred ccCCCCCCceeCCCceEe
Confidence 566555679999999974
No 42
>KOG0194 consensus Protein tyrosine kinase [Signal transduction mechanisms]
Probab=60.72 E-value=4.4 Score=43.42 Aligned_cols=16 Identities=44% Similarity=0.790 Sum_probs=15.2
Q ss_pred CcccccCCccEEEEEc
Q 010708 487 SKLGQGGFGPVYWVMV 502 (503)
Q Consensus 487 n~LG~GgfG~VYKG~L 502 (503)
.+||+|.||.||+|+|
T Consensus 163 kkLGeGaFGeV~~G~l 178 (474)
T KOG0194|consen 163 KKLGEGAFGEVFKGKL 178 (474)
T ss_pred ceeecccccEEEEEEE
Confidence 7999999999999987
No 43
>KOG1035 consensus eIF-2alpha kinase GCN2 [Translation, ribosomal structure and biogenesis]
Probab=59.75 E-value=3 Score=48.84 Aligned_cols=21 Identities=48% Similarity=0.757 Sum_probs=17.1
Q ss_pred cCCCcCCcccccCCccEEEEE
Q 010708 481 NNFQLSSKLGQGGFGPVYWVM 501 (503)
Q Consensus 481 ~nFs~~n~LG~GgfG~VYKG~ 501 (503)
+.|-+-..||+||||.|||.+
T Consensus 479 ~DFEEL~lLGkGGFG~VvkVR 499 (1351)
T KOG1035|consen 479 NDFEELELLGKGGFGSVVKVR 499 (1351)
T ss_pred hhhHHHHHhcCCCCceEEEEe
Confidence 445555789999999999985
No 44
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=58.38 E-value=50 Score=34.32 Aligned_cols=18 Identities=17% Similarity=0.398 Sum_probs=8.5
Q ss_pred cCCcEEEEeC-CCcEEEee
Q 010708 93 EDGNLVLVNG-QKEVLWSS 110 (503)
Q Consensus 93 ~~G~Lvl~~~-~~~~~Wss 110 (503)
.+|.|+..|. +|..+|+.
T Consensus 302 ~~g~l~ald~~tG~~~W~~ 320 (394)
T PRK11138 302 QNDRVYALDTRGGVELWSQ 320 (394)
T ss_pred CCCeEEEEECCCCcEEEcc
Confidence 3455554443 34455544
No 45
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=56.70 E-value=53 Score=33.77 Aligned_cols=51 Identities=16% Similarity=0.382 Sum_probs=27.8
Q ss_pred cCCcEEEEe-CCCcEEEeecCcCCc------cccceEEEccCCCEEEEeccCCcEEEe
Q 010708 93 EDGNLVLVN-GQKEVLWSSNVSNLV------NNSTSAQLLDSGNLVLRDNINRAIVWE 143 (503)
Q Consensus 93 ~~G~Lvl~~-~~~~~~Wss~~~~~~------~~~~~~~L~d~GNlVl~~~~~~~~~Wq 143 (503)
.+|.|+-+| .+|.++|........ .+.....-..+|+|+-.|..+|+++|+
T Consensus 73 ~~g~v~a~d~~tG~~~W~~~~~~~~~~~p~v~~~~v~v~~~~g~l~ald~~tG~~~W~ 130 (377)
T TIGR03300 73 ADGTVVALDAETGKRLWRVDLDERLSGGVGADGGLVFVGTEKGEVIALDAEDGKELWR 130 (377)
T ss_pred CCCeEEEEEccCCcEeeeecCCCCcccceEEcCCEEEEEcCCCEEEEEECCCCcEeee
Confidence 446676666 467777876543211 011122224466666666655777775
No 46
>smart00181 EGF Epidermal growth factor-like domain.
Probab=55.99 E-value=8.9 Score=24.62 Aligned_cols=24 Identities=29% Similarity=0.683 Sum_probs=18.1
Q ss_pred cccCCCcccCC-CCCCccccCCCCcc
Q 010708 278 GKCGAFGICNS-QEKPICSCLEGFEP 302 (503)
Q Consensus 278 g~CG~~giC~~-~~~~~C~C~~Gf~~ 302 (503)
..|... .|.. .+...|.|++||+.
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCcc
Confidence 457666 8854 35788999999975
No 47
>cd05622 STKc_ROCK1 Catalytic domain of the Protein Serine/Threonine Kinase, Rho-associated coiled-coil containing protein kinase 1. Serine/Threonine Kinases (STKs), ROCK subfamily, ROCK1 (or ROK-beta) isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The ROCK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. ROCK contains an N-terminal extension, a catalytic kinase domain, and a C-terminal extension, which contains a coiled-coil region encompassing a Rho-binding domain (RBD) and a pleckstrin homology (PH) domain. ROCK is auto-inhibited by the RBD and PH domain interacting with the catalytic domain, and is activated via interaction with Rho GTPases. ROCK1 is preferentially expressed in the liver, lung, spleen, testes, an
Probab=54.73 E-value=4.1 Score=42.08 Aligned_cols=30 Identities=23% Similarity=0.534 Sum_probs=23.6
Q ss_pred HHHHHHHhcCCCcCCcccccCCccEEEEEc
Q 010708 473 FEELATATNNFQLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 473 ~~el~~AT~nFs~~n~LG~GgfG~VYKG~L 502 (503)
+.++....++|....+||+|+||.||+++.
T Consensus 35 ~~~~~~~~~~y~i~~~lG~G~fg~Vy~~~~ 64 (371)
T cd05622 35 IRDLRMKAEDYEVVKVIGRGAFGEVQLVRH 64 (371)
T ss_pred HhhcCcchhhcEEEEEEeecCCeEEEEEEE
Confidence 445555667788778999999999999863
No 48
>cd05106 PTKc_CSF-1R Catalytic domain of the Protein Tyrosine Kinase, Colony-Stimulating Factor-1 Receptor. Protein Tyrosine Kinase (PTK) family; Colony-Stimulating Factor-1 Receptor (CSF-1R); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. CSF-1R, also called c-Fms, is a member of the Platelet Derived Growth Factor Receptor (PDGFR) subfamily of proteins, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with five immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of CSF-1R to its ligand, CSF-1, leads to receptor dimerization, trans phosphorylation and activation, and intracellular signaling. CSF-1R signaling is criti
Probab=54.06 E-value=6 Score=40.91 Aligned_cols=19 Identities=37% Similarity=0.620 Sum_probs=15.8
Q ss_pred CCcCCcccccCCccEEEEE
Q 010708 483 FQLSSKLGQGGFGPVYWVM 501 (503)
Q Consensus 483 Fs~~n~LG~GgfG~VYKG~ 501 (503)
|.-.++||+|+||.||+++
T Consensus 40 ~~~~~~LG~G~fg~V~~~~ 58 (374)
T cd05106 40 LQFGKTLGAGAFGKVVEAT 58 (374)
T ss_pred ceehheecCCCcccEEEEE
Confidence 4445889999999999875
No 49
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=53.99 E-value=82 Score=29.52 Aligned_cols=77 Identities=19% Similarity=0.401 Sum_probs=46.3
Q ss_pred CCCeEEEEeCC----CCCC--cC-CccEEEE-ecCCcEEEEeC-CCcEEEeecCcCCc------cccceEEEccCCCEEE
Q 010708 68 SEKAVIWVANR----DNPL--KD-SSGIITI-SEDGNLVLVNG-QKEVLWSSNVSNLV------NNSTSAQLLDSGNLVL 132 (503)
Q Consensus 68 ~~~~vvW~anr----~~p~--~~-~~~~l~l-~~~G~Lvl~~~-~~~~~Wss~~~~~~------~~~~~~~L~d~GNlVl 132 (503)
.....+|..+- ..++ .- ....+.+ +.+|.|+.+|. +|..+|........ .+........+|.|..
T Consensus 11 ~tG~~~W~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~v~v~~~~~~l~~ 90 (238)
T PF13360_consen 11 RTGKELWSYDLGPGIGGPVATAVPDGGRVYVASGDGNLYALDAKTGKVLWRFDLPGPISGAPVVDGGRVYVGTSDGSLYA 90 (238)
T ss_dssp TTTEEEEEEECSSSCSSEEETEEEETTEEEEEETTSEEEEEETTTSEEEEEEECSSCGGSGEEEETTEEEEEETTSEEEE
T ss_pred CCCCEEEEEECCCCCCCccceEEEeCCEEEEEcCCCEEEEEECCCCCEEEEeeccccccceeeecccccccccceeeeEe
Confidence 46778888753 2222 00 1233444 48899999996 89999998753321 0112222334556777
Q ss_pred EeccCCcEEEee
Q 010708 133 RDNINRAIVWES 144 (503)
Q Consensus 133 ~~~~~~~~~WqS 144 (503)
.|..+|+++|+.
T Consensus 91 ~d~~tG~~~W~~ 102 (238)
T PF13360_consen 91 LDAKTGKVLWSI 102 (238)
T ss_dssp EETTTSCEEEEE
T ss_pred cccCCcceeeee
Confidence 775569999995
No 50
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=53.47 E-value=74 Score=32.68 Aligned_cols=56 Identities=18% Similarity=0.328 Sum_probs=37.0
Q ss_pred EEEecCCcEEEEeC-CCcEEEeecCcCCc----cccceEEEccCCCEEEEeccCCcEEEee
Q 010708 89 ITISEDGNLVLVNG-QKEVLWSSNVSNLV----NNSTSAQLLDSGNLVLRDNINRAIVWES 144 (503)
Q Consensus 89 l~l~~~G~Lvl~~~-~~~~~Wss~~~~~~----~~~~~~~L~d~GNlVl~~~~~~~~~WqS 144 (503)
+....+|.|+.+|. +|..+|........ .+.......++|.|+..|..+++.+|+.
T Consensus 245 y~~~~~g~l~a~d~~tG~~~W~~~~~~~~~p~~~~~~vyv~~~~G~l~~~d~~tG~~~W~~ 305 (377)
T TIGR03300 245 YAVSYQGRVAALDLRSGRVLWKRDASSYQGPAVDDNRLYVTDADGVVVALDRRSGSELWKN 305 (377)
T ss_pred EEEEcCCEEEEEECCCCcEEEeeccCCccCceEeCCEEEEECCCCeEEEEECCCCcEEEcc
Confidence 33456788998886 67899987632211 0123334466788888887668899974
No 51
>PTZ00036 glycogen synthase kinase; Provisional
Probab=53.35 E-value=5.3 Score=42.47 Aligned_cols=23 Identities=30% Similarity=0.731 Sum_probs=18.6
Q ss_pred hcCCCcCCcccccCCccEEEEEc
Q 010708 480 TNNFQLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 480 T~nFs~~n~LG~GgfG~VYKG~L 502 (503)
++.|...++||+|+||.||+|+.
T Consensus 65 ~~~y~~~~~LG~G~fg~Vy~~~~ 87 (440)
T PTZ00036 65 NKSYKLGNIIGNGSFGVVYEAIC 87 (440)
T ss_pred CCeEEEeEEEEeCCCEEEEEEEE
Confidence 34566668999999999999864
No 52
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=51.51 E-value=8.3 Score=34.55 Aligned_cols=12 Identities=8% Similarity=-0.153 Sum_probs=5.1
Q ss_pred EEechhHHHHHH
Q 010708 423 FVSPLIKGMFAL 434 (503)
Q Consensus 423 i~i~~~~~~~~l 434 (503)
+++++.++++++
T Consensus 54 vVVGVGg~ill~ 65 (154)
T PF04478_consen 54 VVVGVGGPILLG 65 (154)
T ss_pred EEecccHHHHHH
Confidence 444444444333
No 53
>PRK09605 bifunctional UGMP family protein/serine/threonine protein kinase; Validated
Probab=51.29 E-value=6 Score=43.25 Aligned_cols=30 Identities=7% Similarity=-0.045 Sum_probs=21.5
Q ss_pred HHHHHHHhcCCCcCCcccccCCccEEEEEc
Q 010708 473 FEELATATNNFQLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 473 ~~el~~AT~nFs~~n~LG~GgfG~VYKG~L 502 (503)
+......+..+...++||+|+||.||||+.
T Consensus 325 ~~~~~~~~~~~~~~~~iG~G~~g~Vy~~~~ 354 (535)
T PRK09605 325 WIKEEEVKRRKIPDHLIGKGAEADIKKGEY 354 (535)
T ss_pred eccccccccccCccceeccCCcEEEEEEee
Confidence 333334445556679999999999999764
No 54
>cd05621 STKc_ROCK2 Catalytic domain of the Protein Serine/Threonine Kinase, Rho-associated coiled-coil containing protein kinase 2. Serine/Threonine Kinases (STKs), ROCK subfamily, ROCK2 (or ROK-alpha) isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The ROCK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. ROCK contains an N-terminal extension, a catalytic kinase domain, and a C-terminal extension, which contains a coiled-coil region encompassing a Rho-binding domain (RBD) and a pleckstrin homology (PH) domain. ROCK is auto-inhibited by the RBD and PH domain interacting with the catalytic domain, and is activated via interaction with Rho GTPases. ROCK2 was the first identified target of activated RhoA, and was found
Probab=50.37 E-value=5.2 Score=41.35 Aligned_cols=25 Identities=24% Similarity=0.530 Sum_probs=19.5
Q ss_pred HHhcCCCcCCcccccCCccEEEEEc
Q 010708 478 TATNNFQLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 478 ~AT~nFs~~n~LG~GgfG~VYKG~L 502 (503)
...++|....+||+|+||.||+++.
T Consensus 40 ~~~~~y~~~~~lG~G~fg~Vy~~~~ 64 (370)
T cd05621 40 MKAEDYDVVKVIGRGAFGEVQLVRH 64 (370)
T ss_pred CCHHHCeEEEEEEecCCeEEEEEEE
Confidence 3345566668999999999998863
No 55
>PHA03210 serine/threonine kinase US3; Provisional
Probab=50.17 E-value=8.1 Score=41.83 Aligned_cols=23 Identities=22% Similarity=0.564 Sum_probs=18.7
Q ss_pred hcCCCcCCcccccCCccEEEEEc
Q 010708 480 TNNFQLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 480 T~nFs~~n~LG~GgfG~VYKG~L 502 (503)
.+.|....+||+|+||.||++.+
T Consensus 147 ~~~Y~ii~~LG~G~fG~Vyl~~~ 169 (501)
T PHA03210 147 LAHFRVIDDLPAGAFGKIFICAL 169 (501)
T ss_pred hhccEEEeEecCCCCcceEEEEE
Confidence 45677678999999999998753
No 56
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=49.28 E-value=31 Score=21.29 Aligned_cols=21 Identities=14% Similarity=0.344 Sum_probs=16.0
Q ss_pred EEEEecCCcEEEEeCCCcEEE
Q 010708 88 IITISEDGNLVLVNGQKEVLW 108 (503)
Q Consensus 88 ~l~l~~~G~Lvl~~~~~~~~W 108 (503)
-+.+..+|++++.|..+..||
T Consensus 6 gvav~~~g~i~VaD~~n~rV~ 26 (28)
T PF01436_consen 6 GVAVDSDGNIYVADSGNHRVQ 26 (28)
T ss_dssp EEEEETTSEEEEEECCCTEEE
T ss_pred EEEEeCCCCEEEEECCCCEEE
Confidence 377778899999887766655
No 57
>PHA02887 EGF-like protein; Provisional
Probab=48.69 E-value=14 Score=31.45 Aligned_cols=33 Identities=21% Similarity=0.557 Sum_probs=24.4
Q ss_pred cccccc--cccccCCCcccCCC---CCCccccCCCCccC
Q 010708 270 LRTECD--VYGKCGAFGICNSQ---EKPICSCLEGFEPK 303 (503)
Q Consensus 270 p~~~C~--v~g~CG~~giC~~~---~~~~C~C~~Gf~~~ 303 (503)
-.++|. ..+.|= +|.|.+- +.+.|.|++||...
T Consensus 82 hf~pC~~eyk~YCi-HG~C~yI~dL~epsCrC~~GYtG~ 119 (126)
T PHA02887 82 FFEKCKNDFNDFCI-NGECMNIIDLDEKFCICNKGYTGI 119 (126)
T ss_pred CccccChHhhCEee-CCEEEccccCCCceeECCCCcccC
Confidence 345675 467887 7999752 57899999999753
No 58
>KOG0663 consensus Protein kinase PITSLRE and related kinases [General function prediction only]
Probab=47.91 E-value=5.4 Score=40.57 Aligned_cols=20 Identities=25% Similarity=0.607 Sum_probs=16.6
Q ss_pred CCCcCCcccccCCccEEEEE
Q 010708 482 NFQLSSKLGQGGFGPVYWVM 501 (503)
Q Consensus 482 nFs~~n~LG~GgfG~VYKG~ 501 (503)
+|-.-|+|++|.||.||||+
T Consensus 77 efe~lnrI~EGtyGiVYRak 96 (419)
T KOG0663|consen 77 EFEKLNRIEEGTYGVVYRAK 96 (419)
T ss_pred HHHHHhhcccCcceeEEEec
Confidence 34445899999999999986
No 59
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=47.26 E-value=8.1 Score=33.45 Aligned_cols=7 Identities=14% Similarity=0.368 Sum_probs=2.6
Q ss_pred echhHHH
Q 010708 425 SPLIKGM 431 (503)
Q Consensus 425 i~~~~~~ 431 (503)
+++++++
T Consensus 71 ~gv~aGv 77 (122)
T PF01102_consen 71 FGVMAGV 77 (122)
T ss_dssp HHHHHHH
T ss_pred HHHHHHH
Confidence 3333333
No 60
>PF08693 SKG6: Transmembrane alpha-helix domain; InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=47.24 E-value=12 Score=25.61 Aligned_cols=6 Identities=33% Similarity=0.966 Sum_probs=2.3
Q ss_pred hhhhhh
Q 010708 440 FLWRWI 445 (503)
Q Consensus 440 ~~~~~~ 445 (503)
++|+||
T Consensus 33 ~~~~rR 38 (40)
T PF08693_consen 33 FFWYRR 38 (40)
T ss_pred heEEec
Confidence 334443
No 61
>PHA03209 serine/threonine kinase US3; Provisional
Probab=46.54 E-value=9.8 Score=38.95 Aligned_cols=25 Identities=20% Similarity=0.292 Sum_probs=20.4
Q ss_pred HHhcCCCcCCcccccCCccEEEEEc
Q 010708 478 TATNNFQLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 478 ~AT~nFs~~n~LG~GgfG~VYKG~L 502 (503)
....+|....+||+|+||.||+|+.
T Consensus 63 ~~~~~y~~~~~lg~G~~g~Vy~~~~ 87 (357)
T PHA03209 63 VASLGYTVIKTLTPGSEGRVFVATK 87 (357)
T ss_pred hhhcCcEEEEEecCCCCeEEEEEEE
Confidence 3445687778999999999999864
No 62
>PF13956 Ibs_toxin: Toxin Ibs, type I toxin-antitoxin system
Probab=46.43 E-value=11 Score=20.97 Aligned_cols=15 Identities=27% Similarity=0.366 Sum_probs=7.8
Q ss_pred ChhHHHHHHHHHHHh
Q 010708 1 MIPIALLIILLSCFC 15 (503)
Q Consensus 1 ~~~~~~~~~~~~~~~ 15 (503)
|..+++|+++|++++
T Consensus 2 Mk~vIIlvvLLliSf 16 (19)
T PF13956_consen 2 MKLVIILVVLLLISF 16 (19)
T ss_pred ceehHHHHHHHhccc
Confidence 444555555555444
No 63
>cd05596 STKc_ROCK Catalytic domain of the Protein Serine/Threonine Kinase, Rho-associated coiled-coil containing protein kinase. Serine/Threonine Kinases (STKs), Rho-associated coiled-coil containing protein kinase (ROCK) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The ROCK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. ROCK is also referred to as Rho-associated kinase or simply as Rho kinase. It contains an N-terminal extension, a catalytic kinase domain, and a long C-terminal extension, which contains a coiled-coil region encompassing a Rho-binding domain (RBD) and a pleckstrin homology (PH) domain. ROCK is auto-inhibited by the RBD and PH domain interacting with the catalytic domain. It is activated via in
Probab=45.77 E-value=4.7 Score=41.63 Aligned_cols=23 Identities=30% Similarity=0.606 Sum_probs=18.6
Q ss_pred hcCCCcCCcccccCCccEEEEEc
Q 010708 480 TNNFQLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 480 T~nFs~~n~LG~GgfG~VYKG~L 502 (503)
.++|....+||+|+||.||+++.
T Consensus 42 ~~~y~~~~~lg~G~~g~Vy~~~~ 64 (370)
T cd05596 42 AEDFDVIKVIGRGAFGEVQLVRH 64 (370)
T ss_pred HHHcEEEEEEeeCCCEEEEEEEE
Confidence 44566668999999999999863
No 64
>KOG0600 consensus Cdc2-related protein kinase [Cell cycle control, cell division, chromosome partitioning]
Probab=43.07 E-value=6.5 Score=41.97 Aligned_cols=15 Identities=40% Similarity=0.835 Sum_probs=13.8
Q ss_pred CcccccCCccEEEEE
Q 010708 487 SKLGQGGFGPVYWVM 501 (503)
Q Consensus 487 n~LG~GgfG~VYKG~ 501 (503)
.+||+|.||.|||++
T Consensus 123 ~kIGeGTyg~VYkAr 137 (560)
T KOG0600|consen 123 EKIGEGTYGQVYKAR 137 (560)
T ss_pred HHhcCcchhheeEee
Confidence 689999999999985
No 65
>PLN00034 mitogen-activated protein kinase kinase; Provisional
Probab=42.14 E-value=11 Score=38.46 Aligned_cols=16 Identities=38% Similarity=0.696 Sum_probs=14.3
Q ss_pred CcccccCCccEEEEEc
Q 010708 487 SKLGQGGFGPVYWVMV 502 (503)
Q Consensus 487 n~LG~GgfG~VYKG~L 502 (503)
++||+|+||.||+++.
T Consensus 80 ~~lg~G~~g~V~~~~~ 95 (353)
T PLN00034 80 NRIGSGAGGTVYKVIH 95 (353)
T ss_pred hhccCCCCeEEEEEEE
Confidence 6899999999999863
No 66
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=39.22 E-value=55 Score=32.28 Aligned_cols=46 Identities=22% Similarity=0.419 Sum_probs=34.9
Q ss_pred CCCeEEEEeCCCCCCcCCc-----cEEEEecCCcEEEEeCCCcEEEeecCc
Q 010708 68 SEKAVIWVANRDNPLKDSS-----GIITISEDGNLVLVNGQKEVLWSSNVS 113 (503)
Q Consensus 68 ~~~~vvW~anr~~p~~~~~-----~~l~l~~~G~Lvl~~~~~~~~Wss~~~ 113 (503)
.+.++.|.+.|..|+-.+. ....=+.||+|.-.|+.|+.||.-.|.
T Consensus 167 ~~~~~~w~~~~~~PiF~splcv~~sv~i~~VdG~l~~f~~sG~qvwr~~t~ 217 (354)
T KOG4649|consen 167 YSSTEFWAATRFGPIFASPLCVGSSVIITTVDGVLTSFDESGRQVWRPATK 217 (354)
T ss_pred CCcceehhhhcCCccccCceeccceEEEEEeccEEEEEcCCCcEEEeecCC
Confidence 4458899999999997541 233345899999999999999976553
No 67
>KOG0192 consensus Tyrosine kinase specific for activated (GTP-bound) p21cdc42Hs [Signal transduction mechanisms]
Probab=38.87 E-value=14 Score=38.37 Aligned_cols=16 Identities=44% Similarity=0.634 Sum_probs=14.0
Q ss_pred CcccccCCccEEEEEc
Q 010708 487 SKLGQGGFGPVYWVMV 502 (503)
Q Consensus 487 n~LG~GgfG~VYKG~L 502 (503)
..||+|+||+||||.-
T Consensus 47 ~~iG~G~~g~V~~~~~ 62 (362)
T KOG0192|consen 47 EVLGSGSFGTVYKGKW 62 (362)
T ss_pred hhcccCCceeEEEEEe
Confidence 4599999999999974
No 68
>PHA03211 serine/threonine kinase US3; Provisional
Probab=38.51 E-value=15 Score=39.37 Aligned_cols=23 Identities=22% Similarity=0.208 Sum_probs=18.9
Q ss_pred hcCCCcCCcccccCCccEEEEEc
Q 010708 480 TNNFQLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 480 T~nFs~~n~LG~GgfG~VYKG~L 502 (503)
..+|.-..+||+|+||.||+++.
T Consensus 168 ~~gy~i~~~Lg~G~~G~Vy~a~~ 190 (461)
T PHA03211 168 GLGFAIHRALTPGSEGCVFESSH 190 (461)
T ss_pred cCCeEEEEEEccCCCeEEEEEEE
Confidence 44566678999999999999874
No 69
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=38.11 E-value=1.1e+02 Score=33.08 Aligned_cols=76 Identities=22% Similarity=0.310 Sum_probs=46.5
Q ss_pred CCCeEEEEeCCC-------CCCcCCccEEEEe-cCCcEEEEeC-CCcEEEeecCcCC--------cc------c-cceEE
Q 010708 68 SEKAVIWVANRD-------NPLKDSSGIITIS-EDGNLVLVNG-QKEVLWSSNVSNL--------VN------N-STSAQ 123 (503)
Q Consensus 68 ~~~~vvW~anr~-------~p~~~~~~~l~l~-~~G~Lvl~~~-~~~~~Wss~~~~~--------~~------~-~~~~~ 123 (503)
....++|..+-. .|+.. ..++.+. .+|.|+-+|. +|.++|....... .. . .....
T Consensus 37 ~~~~~~W~~~~~~~~~~~~sPvv~-~g~vy~~~~~g~l~AlD~~tG~~~W~~~~~~~~~~~~~~~~~~g~~~~~~~~V~v 115 (488)
T cd00216 37 KKLKVAWTFSTGDERGQEGTPLVV-DGDMYFTTSHSALFALDAATGKVLWRYDPKLPADRGCCDVVNRGVAYWDPRKVFF 115 (488)
T ss_pred hcceeeEEEECCCCCCcccCCEEE-CCEEEEeCCCCcEEEEECCCChhhceeCCCCCccccccccccCCcEEccCCeEEE
Confidence 446678887654 24332 2344444 5788888885 6889999764321 00 0 11122
Q ss_pred EccCCCEEEEeccCCcEEEee
Q 010708 124 LLDSGNLVLRDNINRAIVWES 144 (503)
Q Consensus 124 L~d~GNlVl~~~~~~~~~WqS 144 (503)
-..+|.|+-.|.++++++|+-
T Consensus 116 ~~~~g~v~AlD~~TG~~~W~~ 136 (488)
T cd00216 116 GTFDGRLVALDAETGKQVWKF 136 (488)
T ss_pred ecCCCeEEEEECCCCCEeeee
Confidence 235788888888779999994
No 70
>KOG2052 consensus Activin A type IB receptor, serine/threonine protein kinase [Signal transduction mechanisms]
Probab=34.88 E-value=20 Score=37.86 Aligned_cols=17 Identities=29% Similarity=0.515 Sum_probs=14.6
Q ss_pred CCcccccCCccEEEEEc
Q 010708 486 SSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 486 ~n~LG~GgfG~VYKG~L 502 (503)
...||+|.||-|.||.-
T Consensus 216 ~e~IGkGRyGEVwrG~w 232 (513)
T KOG2052|consen 216 QEIIGKGRFGEVWRGRW 232 (513)
T ss_pred EEEecCccccceeeccc
Confidence 36899999999999863
No 71
>COG3117 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=33.88 E-value=2e+02 Score=26.82 Aligned_cols=73 Identities=16% Similarity=0.223 Sum_probs=42.6
Q ss_pred EeCCCeeEEEeeCCCCCCceEEEEEecCCCCCeEEEEeCCCCCCcCCccEEEEecCCcEEEEeCCCcEEEeecCcCCccc
Q 010708 39 LSNGSNFKLGFFNPADSPYRYMGIWYDMPSEKAVIWVANRDNPLKDSSGIITISEDGNLVLVNGQKEVLWSSNVSNLVNN 118 (503)
Q Consensus 39 ~S~~g~f~~gf~~~~~~~~~~l~i~~~~~~~~~vvW~anr~~p~~~~~~~l~l~~~G~Lvl~~~~~~~~Wss~~~~~~~~ 118 (503)
....+.=...|.+.|-.+....|+-..+.+.....|+.- | -|.|++. +.+-|.-.
T Consensus 42 Y~~e~~~~~~~de~G~~~y~l~a~~~eh~~~~~~t~ft~---P--------------~l~lf~e-~~q~w~v~------- 96 (188)
T COG3117 42 YTMEGLDTTVYDEQGKLKYRLTAQHVEHYSDSSDTHFTA---P--------------VLTLFRE-NRQTWTVQ------- 96 (188)
T ss_pred eeecCcceeEECCCcceeEEeehhhhhhcccccceEEec---c--------------eEEEEcC-CCccEEEE-------
Confidence 455555566676665443345566555544443444322 1 2444433 55667753
Q ss_pred cceEEEccCCCEEEEecc
Q 010708 119 STSAQLLDSGNLVLRDNI 136 (503)
Q Consensus 119 ~~~~~L~d~GNlVl~~~~ 136 (503)
...+.|.++|.|+|+..+
T Consensus 97 Ad~a~l~~d~~l~l~~~v 114 (188)
T COG3117 97 ADRAKLTKDGKLYLLGDV 114 (188)
T ss_pred hhhheecCCceEEEEece
Confidence 467899999999999754
No 72
>PF15102 TMEM154: TMEM154 protein family
Probab=33.85 E-value=19 Score=32.03 Aligned_cols=8 Identities=25% Similarity=0.384 Sum_probs=3.2
Q ss_pred EEechhHH
Q 010708 423 FVSPLIKG 430 (503)
Q Consensus 423 i~i~~~~~ 430 (503)
|+++.+++
T Consensus 61 IlIP~VLL 68 (146)
T PF15102_consen 61 ILIPLVLL 68 (146)
T ss_pred EeHHHHHH
Confidence 34443333
No 73
>PF05935 Arylsulfotrans: Arylsulfotransferase (ASST); InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate [].; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=33.30 E-value=1e+02 Score=33.25 Aligned_cols=54 Identities=20% Similarity=0.420 Sum_probs=31.6
Q ss_pred CCcEEEEeCCCcEEEeecCcCCccccceEEEccCCCEEEEecc-------CCcEEEeecCCCCC
Q 010708 94 DGNLVLVNGQKEVLWSSNVSNLVNNSTSAQLLDSGNLVLRDNI-------NRAIVWESFQEPTD 150 (503)
Q Consensus 94 ~G~Lvl~~~~~~~~Wss~~~~~~~~~~~~~L~d~GNlVl~~~~-------~~~~~WqSFd~PTD 150 (503)
.+..+++|..|.++|.-...... ...+..+++|+|...... -|+++|+ ++.|..
T Consensus 127 ~~~~~~iD~~G~Vrw~~~~~~~~--~~~~~~l~nG~ll~~~~~~~~e~D~~G~v~~~-~~l~~~ 187 (477)
T PF05935_consen 127 SSYTYLIDNNGDVRWYLPLDSGS--DNSFKQLPNGNLLIGSGNRLYEIDLLGKVIWE-YDLPGG 187 (477)
T ss_dssp EEEEEEEETTS-EEEEE-GGGT----SSEEE-TTS-EEEEEBTEEEEE-TT--EEEE-EE--TT
T ss_pred CceEEEECCCccEEEEEccCccc--cceeeEcCCCCEEEecCCceEEEcCCCCEEEe-eecCCc
Confidence 46788899999999997754322 222778899999865441 4888888 777763
No 74
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=32.94 E-value=64 Score=21.31 Aligned_cols=19 Identities=21% Similarity=0.438 Sum_probs=16.1
Q ss_pred ccCCCEEEEeccCCcEEEe
Q 010708 125 LDSGNLVLRDNINRAIVWE 143 (503)
Q Consensus 125 ~d~GNlVl~~~~~~~~~Wq 143 (503)
..+|.|+-.|..+|+++|+
T Consensus 7 ~~~g~l~AlD~~TG~~~W~ 25 (38)
T PF01011_consen 7 TPDGYLYALDAKTGKVLWK 25 (38)
T ss_dssp TTTSEEEEEETTTTSEEEE
T ss_pred CCCCEEEEEECCCCCEEEe
Confidence 6788888888878999997
No 75
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=32.94 E-value=7.1e+02 Score=28.43 Aligned_cols=85 Identities=18% Similarity=0.406 Sum_probs=54.9
Q ss_pred cEEEEecCCcEEEEeC-CCcE-EEeecCc-------CCccccceEEEccCCCEEEEeccCCcEEEeecCCCCCCcCCCCe
Q 010708 87 GIITISEDGNLVLVNG-QKEV-LWSSNVS-------NLVNNSTSAQLLDSGNLVLRDNINRAIVWESFQEPTDSFLPGMH 157 (503)
Q Consensus 87 ~~l~l~~~G~Lvl~~~-~~~~-~Wss~~~-------~~~~~~~~~~L~d~GNlVl~~~~~~~~~WqSFd~PTDTlLpgq~ 157 (503)
..+.-..||.++.... ++.+ ||.+... ..+++++.++..-+||.+|..+--|.|
T Consensus 354 ~~l~YSpDgq~iaTG~eDgKVKvWn~~SgfC~vTFteHts~Vt~v~f~~~g~~llssSLDGtV----------------- 416 (893)
T KOG0291|consen 354 TSLAYSPDGQLIATGAEDGKVKVWNTQSGFCFVTFTEHTSGVTAVQFTARGNVLLSSSLDGTV----------------- 416 (893)
T ss_pred eeEEECCCCcEEEeccCCCcEEEEeccCceEEEEeccCCCceEEEEEEecCCEEEEeecCCeE-----------------
Confidence 4688889999988865 4555 8998641 122257788889999999986521332
Q ss_pred ecccccCCceEEEEecCCCCCCCCceEE-EEEeeCCC
Q 010708 158 HGIDQRTGKKVQLTSWKSLSDPSTGSFS-AGLIHQNI 193 (503)
Q Consensus 158 l~~~~~tg~~~~L~Sw~s~~dps~G~y~-l~~~~~g~ 193 (503)
=-+|.+ +....++.+-|.|-.|+ +..|+.|.
T Consensus 417 RAwDlk-----RYrNfRTft~P~p~QfscvavD~sGe 448 (893)
T KOG0291|consen 417 RAWDLK-----RYRNFRTFTSPEPIQFSCVAVDPSGE 448 (893)
T ss_pred Eeeeec-----ccceeeeecCCCceeeeEEEEcCCCC
Confidence 222333 23334456667888887 66677764
No 76
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=32.83 E-value=33 Score=34.52 Aligned_cols=13 Identities=15% Similarity=0.442 Sum_probs=5.7
Q ss_pred HHHhhhhhhhhhh
Q 010708 436 ICTLFLWRWIAKR 448 (503)
Q Consensus 436 ~~~~~~~~~~~~~ 448 (503)
+++++++|.||++
T Consensus 273 vIIYLILRYRRKK 285 (299)
T PF02009_consen 273 VIIYLILRYRRKK 285 (299)
T ss_pred HHHHHHHHHHHHh
Confidence 3444445444433
No 77
>PHA03212 serine/threonine kinase US3; Provisional
Probab=32.13 E-value=20 Score=37.34 Aligned_cols=20 Identities=15% Similarity=0.325 Sum_probs=16.5
Q ss_pred CCCcCCcccccCCccEEEEE
Q 010708 482 NFQLSSKLGQGGFGPVYWVM 501 (503)
Q Consensus 482 nFs~~n~LG~GgfG~VYKG~ 501 (503)
.|....+||+|+||.||+++
T Consensus 93 ~y~~~~~lg~G~~g~V~~~~ 112 (391)
T PHA03212 93 GFSILETFTPGAEGFAFACI 112 (391)
T ss_pred CcEEEEEEcCCCCeEEEEEE
Confidence 35555789999999999985
No 78
>KOG1214 consensus Nidogen and related basement membrane protein proteins [Cell wall/membrane/envelope biogenesis; Extracellular structures]
Probab=30.03 E-value=36 Score=38.52 Aligned_cols=32 Identities=22% Similarity=0.597 Sum_probs=24.1
Q ss_pred cccccccccCCCcccCCC-CCCccccCCCCccCC
Q 010708 272 TECDVYGKCGAFGICNSQ-EKPICSCLEGFEPKN 304 (503)
Q Consensus 272 ~~C~v~g~CG~~giC~~~-~~~~C~C~~Gf~~~~ 304 (503)
|+|. +..|-++..|-+. ++-.|.|.|||....
T Consensus 828 DeC~-psrChp~A~CyntpgsfsC~C~pGy~GDG 860 (1289)
T KOG1214|consen 828 DECS-PSRCHPAATCYNTPGSFSCRCQPGYYGDG 860 (1289)
T ss_pred cccC-ccccCCCceEecCCCcceeecccCccCCC
Confidence 6676 7888888889543 456799999998653
No 79
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=28.93 E-value=19 Score=36.12 Aligned_cols=10 Identities=10% Similarity=0.069 Sum_probs=0.0
Q ss_pred HHhhhhhhhh
Q 010708 437 CTLFLWRWIA 446 (503)
Q Consensus 437 ~~~~~~~~~~ 446 (503)
++++++||||
T Consensus 165 Ia~icyrrkR 174 (290)
T PF05454_consen 165 IACICYRRKR 174 (290)
T ss_dssp ----------
T ss_pred HHHHhhhhhh
Confidence 3344444433
No 80
>KOG1095 consensus Protein tyrosine kinase [Signal transduction mechanisms]
Probab=28.89 E-value=28 Score=40.87 Aligned_cols=16 Identities=44% Similarity=0.673 Sum_probs=15.0
Q ss_pred CcccccCCccEEEEEc
Q 010708 487 SKLGQGGFGPVYWVMV 502 (503)
Q Consensus 487 n~LG~GgfG~VYKG~L 502 (503)
+.||+|.||.||+|++
T Consensus 698 ~~lG~G~FG~VY~g~~ 713 (1025)
T KOG1095|consen 698 RVLGKGAFGEVYEGTY 713 (1025)
T ss_pred eeeccccccceEEEEE
Confidence 7899999999999986
No 81
>cd05845 Ig2_L1-CAM_like Second immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM) and similar proteins. Ig2_L1-CAM_like: domain similar to the second immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains, five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth.
Probab=26.71 E-value=1.2e+02 Score=25.03 Aligned_cols=33 Identities=15% Similarity=0.377 Sum_probs=19.9
Q ss_pred CCCeEEEEeCCCCCCcCCccEEEEecCCcEEEEe
Q 010708 68 SEKAVIWVANRDNPLKDSSGIITISEDGNLVLVN 101 (503)
Q Consensus 68 ~~~~vvW~anr~~p~~~~~~~l~l~~~G~Lvl~~ 101 (503)
+..++.|+-+....+. ....+.+..+|||.+.+
T Consensus 32 P~P~i~W~~~~~~~i~-~~~Ri~~~~~GnL~fs~ 64 (95)
T cd05845 32 VPLRIYWMNSDLLHIT-QDERVSMGQNGNLYFAN 64 (95)
T ss_pred CCCEEEEECCCCcccc-ccccEEECCCceEEEEE
Confidence 4566778844333333 24567777778887654
No 82
>KOG0986 consensus G protein-coupled receptor kinase [Signal transduction mechanisms]
Probab=26.45 E-value=33 Score=36.48 Aligned_cols=21 Identities=43% Similarity=0.754 Sum_probs=17.5
Q ss_pred cCCCcCCcccccCCccEEEEE
Q 010708 481 NNFQLSSKLGQGGFGPVYWVM 501 (503)
Q Consensus 481 ~nFs~~n~LG~GgfG~VYKG~ 501 (503)
+.|+.-++||+||||-||-.+
T Consensus 185 n~F~~~RvlGkGGFGEV~acq 205 (591)
T KOG0986|consen 185 NTFRVYRVLGKGGFGEVCACQ 205 (591)
T ss_pred cceeeeEEEecccccceeEEE
Confidence 448888999999999999543
No 83
>KOG0605 consensus NDR and related serine/threonine kinases [General function prediction only]
Probab=26.11 E-value=35 Score=36.81 Aligned_cols=23 Identities=30% Similarity=0.632 Sum_probs=18.0
Q ss_pred HhcCCCcCCcccccCCccEEEEE
Q 010708 479 ATNNFQLSSKLGQGGFGPVYWVM 501 (503)
Q Consensus 479 AT~nFs~~n~LG~GgfG~VYKG~ 501 (503)
..+.|..-.+||+|+||.||-++
T Consensus 139 ~~~DFe~Lk~IgkGAfGeVrLar 161 (550)
T KOG0605|consen 139 SLDDFELLKVIGKGAFGEVRLAR 161 (550)
T ss_pred CcccchhheeeccccceeEEEEE
Confidence 34556666799999999999764
No 84
>KOG4257 consensus Focal adhesion tyrosine kinase FAK, contains FERM domain [Signal transduction mechanisms]
Probab=26.07 E-value=30 Score=38.26 Aligned_cols=17 Identities=35% Similarity=0.634 Sum_probs=15.2
Q ss_pred CCcccccCCccEEEEEc
Q 010708 486 SSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 486 ~n~LG~GgfG~VYKG~L 502 (503)
..+||+|-||-||+|+.
T Consensus 394 ~r~iG~GqFGdVy~gvY 410 (974)
T KOG4257|consen 394 KRLIGEGQFGDVYKGVY 410 (974)
T ss_pred HHhhcCCcccceeeeEe
Confidence 38999999999999974
No 85
>KOG0694 consensus Serine/threonine protein kinase [Signal transduction mechanisms]
Probab=25.80 E-value=31 Score=38.34 Aligned_cols=23 Identities=35% Similarity=0.690 Sum_probs=18.8
Q ss_pred hcCCCcCCcccccCCccEEEEEc
Q 010708 480 TNNFQLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 480 T~nFs~~n~LG~GgfG~VYKG~L 502 (503)
.++|.--.+||+|.||+|+.+.+
T Consensus 367 l~~F~~l~vLGkGsFGkV~lae~ 389 (694)
T KOG0694|consen 367 LDDFRLLAVLGRGSFGKVLLAEL 389 (694)
T ss_pred ccceEEEEEeccCcCceEEEEEE
Confidence 34566668999999999998876
No 86
>KOG1214 consensus Nidogen and related basement membrane protein proteins [Cell wall/membrane/envelope biogenesis; Extracellular structures]
Probab=25.71 E-value=61 Score=36.87 Aligned_cols=28 Identities=25% Similarity=0.732 Sum_probs=23.2
Q ss_pred ccccCCCcccCCC-CCCccccCCCCccCC
Q 010708 277 YGKCGAFGICNSQ-EKPICSCLEGFEPKN 304 (503)
Q Consensus 277 ~g~CG~~giC~~~-~~~~C~C~~Gf~~~~ 304 (503)
+..||++++|.+. ....|.|-.||++.+
T Consensus 741 ~~~CGp~s~Cin~pg~~rceC~~gy~F~d 769 (1289)
T KOG1214|consen 741 FHRCGPNSVCINLPGSYRCECRSGYEFAD 769 (1289)
T ss_pred CCCCCCCceeecCCCceeEEEeecceecc
Confidence 5689999999765 467899999998764
No 87
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=25.54 E-value=19 Score=33.51 Aligned_cols=28 Identities=32% Similarity=0.783 Sum_probs=20.3
Q ss_pred cccccCCCcccCCC------CCCccccCCCCccC
Q 010708 276 VYGKCGAFGICNSQ------EKPICSCLEGFEPK 303 (503)
Q Consensus 276 v~g~CG~~giC~~~------~~~~C~C~~Gf~~~ 303 (503)
+.-.||.|+.|... ....|.|.+||...
T Consensus 48 ~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~ 81 (197)
T PF06247_consen 48 VNKPCGDYAKCINQANKGEERAYKCDCINGYILK 81 (197)
T ss_dssp TTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEES
T ss_pred cCccccchhhhhcCCCcccceeEEEecccCceee
Confidence 35689999999642 23479999999865
No 88
>KOG1166 consensus Mitotic checkpoint serine/threonine protein kinase [Cell cycle control, cell division, chromosome partitioning]
Probab=24.79 E-value=29 Score=40.62 Aligned_cols=24 Identities=29% Similarity=0.699 Sum_probs=19.1
Q ss_pred HHhcCCCcCCcccccCCccEEEEE
Q 010708 478 TATNNFQLSSKLGQGGFGPVYWVM 501 (503)
Q Consensus 478 ~AT~nFs~~n~LG~GgfG~VYKG~ 501 (503)
.-...|.-+..||+|++|+||+|+
T Consensus 695 ~~~~~~~I~~e~G~g~y~~vy~a~ 718 (974)
T KOG1166|consen 695 VGGEKFCISKEIGEGSYGSVYVAT 718 (974)
T ss_pred ecceeEEEEeeeccccceEEEEee
Confidence 334456667899999999999986
No 89
>KOG1024 consensus Receptor-like protein tyrosine kinase RYK/derailed [Signal transduction mechanisms]
Probab=24.70 E-value=1.1e+02 Score=32.12 Aligned_cols=28 Identities=32% Similarity=0.488 Sum_probs=20.0
Q ss_pred HHHHHhcCCCcCCcccccCCccEEEEEc
Q 010708 475 ELATATNNFQLSSKLGQGGFGPVYWVMV 502 (503)
Q Consensus 475 el~~AT~nFs~~n~LG~GgfG~VYKG~L 502 (503)
+|..--.-|....++=+|.||.||+|++
T Consensus 278 ~l~v~r~Rv~l~~llqEGtFGri~~gI~ 305 (563)
T KOG1024|consen 278 ELTVQRCRVRLSCLLQEGTFGRIYRGIW 305 (563)
T ss_pred hhhhhhhheechhhhhcCchhheeeeee
Confidence 3333333366667899999999999965
No 90
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=24.24 E-value=17 Score=24.30 Aligned_cols=14 Identities=29% Similarity=0.629 Sum_probs=10.1
Q ss_pred CCCccccCCCCccC
Q 010708 290 EKPICSCLEGFEPK 303 (503)
Q Consensus 290 ~~~~C~C~~Gf~~~ 303 (503)
.+..|.|++||...
T Consensus 17 g~~~C~C~~Gy~L~ 30 (36)
T PF14670_consen 17 GSYRCSCPPGYKLA 30 (36)
T ss_dssp TSEEEE-STTEEE-
T ss_pred CceEeECCCCCEEC
Confidence 46789999999865
No 91
>PF14610 DUF4448: Protein of unknown function (DUF4448)
Probab=24.08 E-value=48 Score=30.86 Aligned_cols=22 Identities=23% Similarity=0.614 Sum_probs=9.4
Q ss_pred EEechhHHHHHHHHHHhhhhhh
Q 010708 423 FVSPLIKGMFALAICTLFLWRW 444 (503)
Q Consensus 423 i~i~~~~~~~~l~~~~~~~~~~ 444 (503)
|++.++++++++++++++++.|
T Consensus 162 I~lPvvv~~~~~~~~~~~~~~R 183 (189)
T PF14610_consen 162 IALPVVVVVLALIMYGFFFWNR 183 (189)
T ss_pred EEccHHHHHHHHHHHhhheeec
Confidence 3444444444444444444433
No 92
>KOG4258 consensus Insulin/growth factor receptor (contains protein kinase domain) [Signal transduction mechanisms]
Probab=24.00 E-value=41 Score=38.32 Aligned_cols=19 Identities=47% Similarity=0.690 Sum_probs=15.8
Q ss_pred CCcCCcccccCCccEEEEE
Q 010708 483 FQLSSKLGQGGFGPVYWVM 501 (503)
Q Consensus 483 Fs~~n~LG~GgfG~VYKG~ 501 (503)
....+.||+|+||+||-|.
T Consensus 996 it~~relg~gsfg~Vy~g~ 1014 (1025)
T KOG4258|consen 996 ITLGRELGQGSFGMVYEGN 1014 (1025)
T ss_pred HhhhhhhccCccceEEEec
Confidence 3445899999999999885
No 93
>PF02402 Lysis_col: Lysis protein; InterPro: IPR003059 The DNA sequence of the entire colicin E2 operon has been determined []. The operon comprises the colicin activity gene (ceaB), the colicin immunity gene (ceiB) and the lysis gene (celB), which is essential for colicin release from producing cells []. A putative LexA binding site is located upstream from ceaB, and a rho-independent terminator structure is located downstream from celB []. Comparison of the amino acid sequences of colicin E2 and cloacin DF13 reveal extensive similarity. These colicins have different modes of action and recognise different cell surface receptors; the two major regions of heterology at the C terminus, and in the C-terminal end of the central region are thought to correspond to the catalytic and receptor-recognition domains, respectively []. Sequence similarities between colicins E2, A and E1 [] are less striking. The colicin E2 (pyocin) immunity protein does not share similarity with either the colicin E3 or cloacin DF13 [] immunity proteins. By contrast, the lysis proteins of the ColE2, ColE1 and CloDF13 plasmids are almost identical except in the N-terminal regions, which themselves are similar to lipoprotein signal peptides []. Processing of the ColE2 prolysis protein to the mature form is prevented by globomycin, a specific inhibitor of the lipoprotein signal peptidase []. The mature ColE2 lysis protein is located in the cell envelope [].; GO: 0009405 pathogenesis, 0019835 cytolysis, 0019867 outer membrane
Probab=23.94 E-value=26 Score=24.38 Aligned_cols=21 Identities=29% Similarity=0.246 Sum_probs=13.5
Q ss_pred ChhHHHHHHHHHHHhhhcccc
Q 010708 1 MIPIALLIILLSCFCLDFAVA 21 (503)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~ 21 (503)
|..|++++++++.+++++|.+
T Consensus 1 MkKi~~~~i~~~~~~L~aCQa 21 (46)
T PF02402_consen 1 MKKIIFIGIFLLTMLLAACQA 21 (46)
T ss_pred CcEEEEeHHHHHHHHHHHhhh
Confidence 556666666566566677776
No 94
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=23.76 E-value=15 Score=24.68 Aligned_cols=26 Identities=31% Similarity=0.583 Sum_probs=16.4
Q ss_pred cccCCCcccC-C-CCCCccccCCCCccC
Q 010708 278 GKCGAFGICN-S-QEKPICSCLEGFEPK 303 (503)
Q Consensus 278 g~CG~~giC~-~-~~~~~C~C~~Gf~~~ 303 (503)
..|-.++-|- + ++...|.|++||+..
T Consensus 5 ~~cP~NA~C~~~~dG~eecrCllgyk~~ 32 (37)
T PF12946_consen 5 TKCPANAGCFRYDDGSEECRCLLGYKKV 32 (37)
T ss_dssp S---TTEEEEEETTSEEEEEE-TTEEEE
T ss_pred ccCCCCcccEEcCCCCEEEEeeCCcccc
Confidence 4677888883 3 357789999999854
No 95
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=23.00 E-value=26 Score=26.58 Aligned_cols=8 Identities=13% Similarity=0.509 Sum_probs=0.0
Q ss_pred HHhhhhhh
Q 010708 437 CTLFLWRW 444 (503)
Q Consensus 437 ~~~~~~~~ 444 (503)
++++++|.
T Consensus 29 Ilf~iyR~ 36 (64)
T PF01034_consen 29 ILFLIYRM 36 (64)
T ss_dssp --------
T ss_pred HHHHHHHH
Confidence 33334443
No 96
>TIGR02513 type_III_yscB type III secretion system chaperone, YscB family. Members of this family include YscB of Yersinia and functionally equivalent (but differently named) proteins from type III secretion systems of other pathogens that affect animal cells. YscB acts, along with SycN (TIGR02503), as a chaperone for YopN, a key part of a complex that regulates type III secretion so it responds to contact with the eukaryotic target cell.
Probab=22.42 E-value=1.1e+02 Score=26.86 Aligned_cols=17 Identities=24% Similarity=0.350 Sum_probs=14.1
Q ss_pred cceEEEccCCCEEEEec
Q 010708 119 STSAQLLDSGNLVLRDN 135 (503)
Q Consensus 119 ~~~~~L~d~GNlVl~~~ 135 (503)
+.+++|.|+|||+|...
T Consensus 79 p~aLvld~~~qLiLear 95 (139)
T TIGR02513 79 PQALVLDADGQLILEAR 95 (139)
T ss_pred CceEEEcCccchhHHHH
Confidence 56799999999999743
No 97
>PF11920 DUF3438: Protein of unknown function (DUF3438); InterPro: IPR021844 Members of this protein family are found occasionally on plasmids such as the Pseudomonas putida TOL plasmid pWWO_p085. Usually, however, they are found on the bacterial main chromosome in a region flanked by markers of conjugative transfer and/or transposition.
Probab=22.05 E-value=4.9e+02 Score=26.08 Aligned_cols=90 Identities=20% Similarity=0.261 Sum_probs=43.6
Q ss_pred ChhHHHHHHHHHHHhhhccccc----cccccCCCccCCCC-eEEeCCCeeEEEeeCCCCCCceEEEEEecCCCCCeEEEE
Q 010708 1 MIPIALLIILLSCFCLDFAVAI----DSSITSSQLIRDPD-AILSNGSNFKLGFFNPADSPYRYMGIWYDMPSEKAVIWV 75 (503)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~----~~~l~~g~~l~~~~-~L~S~~g~f~~gf~~~~~~~~~~l~i~~~~~~~~~vvW~ 75 (503)
|..+++++++++ ++.+.+.|. =+.++.-=.|..|| .++.-+.+-..||=..-+.. |-| ....+++-|.
T Consensus 1 mk~~~~~~~l~~-~~~~~a~a~E~~~Wer~Pl~V~L~VgqERiV~~~~nVrVg~P~~l~~k---Lrv---qS~gGavYL~ 73 (288)
T PF11920_consen 1 MKKLLLLLLLLL-LLASAAQAVELMKWERIPLAVPLPVGQERIVFFDKNVRVGVPASLNGK---LRV---QSAGGAVYLK 73 (288)
T ss_pred ChhhHHHHHHHH-HhcccccchHhhhccCCceeeecCCCcEEEEEcCcCeEEcCChhhhhc---EEE---eeeCCEEEEe
Confidence 666555443333 333333331 13444455566664 46766666666663322111 111 1124566677
Q ss_pred eCCCCCCcCCccEEEEe--cCCcEEEEe
Q 010708 76 ANRDNPLKDSSGIITIS--EDGNLVLVN 101 (503)
Q Consensus 76 anr~~p~~~~~~~l~l~--~~G~Lvl~~ 101 (503)
|+...|. ..|.++ ++|.++|+|
T Consensus 74 A~e~f~~----tRLqlq~~~~GeiILLD 97 (288)
T PF11920_consen 74 ASEPFPP----TRLQLQDVESGEIILLD 97 (288)
T ss_pred ccCCCCc----ceEEEEEcCCCcEEEEE
Confidence 7765542 234433 567777776
No 98
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=21.24 E-value=1.6e+02 Score=18.10 Aligned_cols=17 Identities=29% Similarity=0.565 Sum_probs=10.5
Q ss_pred CCCEEEEeccCCcEEEe
Q 010708 127 SGNLVLRDNINRAIVWE 143 (503)
Q Consensus 127 ~GNlVl~~~~~~~~~Wq 143 (503)
+|.|+-.|..+|+++|+
T Consensus 15 ~g~l~a~d~~~G~~~W~ 31 (33)
T smart00564 15 DGTLYALDAKTGEILWT 31 (33)
T ss_pred CCEEEEEEcccCcEEEE
Confidence 45666666545677776
No 99
>COG2143 Thioredoxin-related protein [Posttranslational modification, protein turnover, chaperones]
Probab=20.88 E-value=78 Score=28.73 Aligned_cols=20 Identities=15% Similarity=0.305 Sum_probs=16.0
Q ss_pred CeEEeCCCeeEEEeeCCCCC
Q 010708 36 DAILSNGSNFKLGFFNPADS 55 (503)
Q Consensus 36 ~~L~S~~g~f~~gf~~~~~~ 55 (503)
-...++++.|.+-+|...++
T Consensus 35 ~ksi~~~~Kylllmfes~~C 54 (182)
T COG2143 35 NKSISPNDKYLLLMFESNGC 54 (182)
T ss_pred HHhcCccCcEEEEEEcCCCC
Confidence 35689999999999886654
No 100
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=20.65 E-value=56 Score=32.42 Aligned_cols=6 Identities=33% Similarity=1.082 Sum_probs=3.5
Q ss_pred CccccC
Q 010708 292 PICSCL 297 (503)
Q Consensus 292 ~~C~C~ 297 (503)
..|.|-
T Consensus 143 s~cect 148 (295)
T TIGR01478 143 KSCECT 148 (295)
T ss_pred Cceeee
Confidence 456664
No 101
>PF12690 BsuPI: Intracellular proteinase inhibitor; InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=20.44 E-value=4.1e+02 Score=21.06 Aligned_cols=16 Identities=13% Similarity=0.461 Sum_probs=8.7
Q ss_pred cEEEEeCCCcEEEeec
Q 010708 96 NLVLVNGQKEVLWSSN 111 (503)
Q Consensus 96 ~Lvl~~~~~~~~Wss~ 111 (503)
+++|+|..|..||.-.
T Consensus 27 D~~v~d~~g~~vwrwS 42 (82)
T PF12690_consen 27 DFVVKDKEGKEVWRWS 42 (82)
T ss_dssp EEEEE-TT--EEEETT
T ss_pred EEEEECCCCCEEEEec
Confidence 5777777777777643
No 102
>PHA03207 serine/threonine kinase US3; Provisional
Probab=20.33 E-value=45 Score=34.59 Aligned_cols=19 Identities=26% Similarity=0.457 Sum_probs=15.7
Q ss_pred CCcCCcccccCCccEEEEE
Q 010708 483 FQLSSKLGQGGFGPVYWVM 501 (503)
Q Consensus 483 Fs~~n~LG~GgfG~VYKG~ 501 (503)
|....+||+|+||.||+++
T Consensus 94 y~i~~~Lg~G~~g~Vy~~~ 112 (392)
T PHA03207 94 YNILSSLTPGSEGEVFVCT 112 (392)
T ss_pred eEEEEeecCCCCeEEEEEE
Confidence 4555789999999999875
No 103
>KOG4289 consensus Cadherin EGF LAG seven-pass G-type receptor [Signal transduction mechanisms]
Probab=20.18 E-value=54 Score=39.58 Aligned_cols=41 Identities=27% Similarity=0.653 Sum_probs=0.0
Q ss_pred cccCCCccc-CCCCCCccccCCCCccCCcccccCCCCCCCceeCC
Q 010708 278 GKCGAFGIC-NSQEKPICSCLEGFEPKNAEEWNRGNWTSGCIRRS 321 (503)
Q Consensus 278 g~CG~~giC-~~~~~~~C~C~~Gf~~~~~~~~~~~~~~~GC~~~~ 321 (503)
+.||++|-| ...+.++|.|-|||....=+ ....++-|++..
T Consensus 1245 ~pC~nng~C~srEggYtCeCrpg~tGehCE---vs~~agrCvpGv 1286 (2531)
T KOG4289|consen 1245 GPCGNNGRCRSREGGYTCECRPGFTGEHCE---VSARAGRCVPGV 1286 (2531)
T ss_pred CCCCCCCceEEecCceeEEecCCcccccee---eecccCccccce
No 104
>PTZ00370 STEVOR; Provisional
Probab=20.14 E-value=59 Score=32.34 Aligned_cols=6 Identities=33% Similarity=1.204 Sum_probs=3.6
Q ss_pred CccccC
Q 010708 292 PICSCL 297 (503)
Q Consensus 292 ~~C~C~ 297 (503)
..|.|-
T Consensus 143 s~cect 148 (296)
T PTZ00370 143 STCECT 148 (296)
T ss_pred Cceeee
Confidence 466665
Done!