Query psy10004
Match_columns 354
No_of_seqs 395 out of 1911
Neff 8.2
Searched_HMMs 46136
Date Fri Aug 16 15:25:59 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10004.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10004hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2353|consensus 100.0 3.6E-62 7.8E-67 509.7 20.4 298 1-344 152-452 (1104)
2 cd01463 vWA_VGCC_like VWA Volt 100.0 8.7E-33 1.9E-37 243.6 20.8 187 70-260 1-189 (190)
3 PF13768 VWA_3: von Willebrand 99.9 1.7E-25 3.7E-30 190.6 17.5 152 83-255 1-154 (155)
4 cd01461 vWA_interalpha_trypsin 99.9 7E-25 1.5E-29 189.2 20.8 169 82-268 2-170 (171)
5 TIGR03788 marine_srt_targ mari 99.9 4.7E-25 1E-29 225.9 21.5 183 80-280 269-451 (596)
6 cd01466 vWA_C3HC4_type VWA C3H 99.9 2.9E-24 6.4E-29 183.2 17.9 154 83-257 1-154 (155)
7 cd01465 vWA_subgroup VWA subgr 99.9 1.1E-22 2.4E-27 175.3 17.2 163 84-265 2-168 (170)
8 cd01470 vWA_complement_factors 99.9 1.7E-22 3.6E-27 179.2 18.5 175 84-265 2-195 (198)
9 cd01456 vWA_ywmD_type VWA ywmD 99.9 1.8E-21 3.8E-26 173.7 17.9 167 80-260 18-203 (206)
10 cd01480 vWA_collagen_alpha_1-V 99.9 7.7E-21 1.7E-25 167.0 16.8 160 81-262 1-174 (186)
11 PRK13685 hypothetical protein; 99.9 2.9E-20 6.2E-25 177.1 21.5 175 80-272 86-288 (326)
12 cd01472 vWA_collagen von Wille 99.9 3.1E-20 6.8E-25 159.6 17.5 155 83-257 1-160 (164)
13 TIGR00868 hCaCC calcium-activa 99.8 1.2E-19 2.6E-24 188.0 23.0 180 72-274 294-476 (863)
14 cd01474 vWA_ATR ATR (Anthrax T 99.8 2.3E-19 5E-24 157.4 18.8 174 81-275 3-182 (185)
15 cd01475 vWA_Matrilin VWA_Matri 99.8 2.7E-19 5.8E-24 161.8 18.7 178 82-276 2-185 (224)
16 cd01451 vWA_Magnesium_chelatas 99.8 2.6E-19 5.7E-24 156.1 17.0 159 85-262 3-169 (178)
17 cd01471 vWA_micronemal_protein 99.8 2.3E-19 5E-24 157.4 16.6 173 83-269 1-184 (186)
18 PF13519 VWA_2: von Willebrand 99.8 2.5E-19 5.4E-24 153.7 16.2 161 84-269 1-171 (172)
19 cd01464 vWA_subfamily VWA subf 99.8 2.6E-19 5.6E-24 155.8 15.3 150 82-250 3-161 (176)
20 cd01469 vWA_integrins_alpha_su 99.8 9.7E-19 2.1E-23 152.4 17.8 164 83-263 1-174 (177)
21 cd01453 vWA_transcription_fact 99.8 9.3E-19 2E-23 153.2 17.5 160 83-267 4-175 (183)
22 TIGR03436 acidobact_VWFA VWFA- 99.8 2E-18 4.4E-23 162.5 19.5 179 80-282 51-262 (296)
23 cd01473 vWA_CTRP CTRP for CS 99.8 4.2E-18 9.2E-23 150.2 19.0 170 84-272 2-188 (192)
24 PF08399 VWA_N: VWA N-terminal 99.8 2.6E-20 5.6E-25 150.6 4.3 47 1-47 77-123 (123)
25 cd01467 vWA_BatA_type VWA BatA 99.8 5.7E-18 1.2E-22 147.5 18.9 156 82-262 2-179 (180)
26 cd01482 vWA_collagen_alphaI-XI 99.8 3.7E-18 8E-23 146.8 17.3 156 83-257 1-161 (164)
27 cd01477 vWA_F09G8-8_type VWA F 99.8 3.6E-18 7.7E-23 150.5 17.2 152 80-250 17-185 (193)
28 cd01450 vWFA_subfamily_ECM Von 99.8 1.6E-17 3.6E-22 141.0 16.1 149 84-252 2-157 (161)
29 PTZ00441 sporozoite surface pr 99.8 5E-17 1.1E-21 160.6 21.1 185 81-280 41-236 (576)
30 cd01476 VWA_integrin_invertebr 99.7 5.9E-17 1.3E-21 138.9 16.4 154 84-254 2-161 (163)
31 cd01462 VWA_YIEM_type VWA YIEM 99.7 5.6E-17 1.2E-21 137.5 15.9 143 83-248 1-145 (152)
32 PF00092 VWA: von Willebrand f 99.7 3.4E-17 7.3E-22 141.6 14.7 169 84-268 1-176 (178)
33 cd01481 vWA_collagen_alpha3-VI 99.7 1.2E-16 2.5E-21 137.7 17.8 152 83-251 1-157 (165)
34 COG1240 ChlD Mg-chelatase subu 99.7 1.6E-16 3.5E-21 141.7 15.4 165 81-263 77-250 (261)
35 smart00327 VWA von Willebrand 99.7 9.3E-16 2E-20 132.0 19.1 156 82-256 1-164 (177)
36 PRK13406 bchD magnesium chelat 99.7 2.8E-16 6E-21 159.0 18.0 159 81-261 400-571 (584)
37 cd01454 vWA_norD_type norD typ 99.7 4.9E-16 1.1E-20 134.8 16.0 150 84-248 2-167 (174)
38 cd01455 vWA_F11C1-5a_type Von 99.7 1.8E-15 3.9E-20 131.2 18.8 170 83-273 1-188 (191)
39 TIGR02031 BchD-ChlD magnesium 99.7 1.9E-15 4E-20 154.2 18.2 162 81-262 406-585 (589)
40 cd01457 vWA_ORF176_type VWA OR 99.6 8.8E-15 1.9E-19 129.8 16.3 153 82-249 2-165 (199)
41 COG4245 TerY Uncharacterized p 99.6 8.7E-15 1.9E-19 123.8 15.1 172 84-279 5-185 (207)
42 cd00198 vWFA Von Willebrand fa 99.6 5.3E-14 1.2E-18 118.1 16.7 146 83-250 1-155 (161)
43 TIGR02442 Cob-chelat-sub cobal 99.6 1E-13 2.3E-18 142.8 19.0 160 80-257 463-632 (633)
44 cd01452 VWA_26S_proteasome_sub 99.5 8E-13 1.7E-17 115.2 16.1 160 84-262 5-177 (187)
45 PF10138 vWA-TerF-like: vWA fo 99.4 5.4E-11 1.2E-15 103.8 17.9 156 84-261 3-172 (200)
46 COG2425 Uncharacterized protei 99.4 5.2E-12 1.1E-16 122.1 12.4 156 64-249 262-418 (437)
47 cd01460 vWA_midasin VWA_Midasi 99.3 9.3E-11 2E-15 107.7 18.9 170 80-267 58-252 (266)
48 PRK10997 yieM hypothetical pro 99.2 2.2E-10 4.9E-15 112.9 15.9 137 80-237 321-459 (487)
49 cd01458 vWA_ku Ku70/Ku80 N-ter 99.2 1.1E-09 2.3E-14 98.6 14.9 144 84-237 3-174 (218)
50 PF11775 CobT_C: Cobalamin bio 99.1 7.4E-09 1.6E-13 91.1 16.2 169 82-274 12-216 (219)
51 TIGR01651 CobT cobaltochelatas 98.9 3.1E-08 6.8E-13 98.6 16.1 162 82-273 392-595 (600)
52 COG4867 Uncharacterized protei 98.7 2.8E-07 6.1E-12 87.4 13.5 163 81-271 462-647 (652)
53 PF05762 VWA_CoxE: VWA domain 98.7 2.1E-07 4.6E-12 84.0 12.1 115 81-213 56-172 (222)
54 PF04056 Ssl1: Ssl1-like; Int 98.6 1.6E-06 3.5E-11 75.8 15.6 157 88-266 1-168 (193)
55 PF09967 DUF2201: VWA-like dom 98.6 1.3E-07 2.9E-12 77.5 8.3 94 85-205 1-96 (126)
56 KOG3768|consensus 98.6 1.4E-06 3E-11 85.7 13.9 193 85-295 4-249 (888)
57 PRK05325 hypothetical protein; 98.5 2.7E-06 5.9E-11 82.0 14.8 184 64-273 203-396 (401)
58 cd01459 vWA_copine_like VWA Co 98.5 7.6E-06 1.6E-10 74.9 16.6 155 81-248 30-206 (254)
59 COG4548 NorD Nitric oxide redu 98.5 9.8E-07 2.1E-11 86.1 10.7 173 84-275 448-636 (637)
60 TIGR02877 spore_yhbH sporulati 98.4 6.5E-06 1.4E-10 78.2 14.6 180 64-271 183-371 (371)
61 cd01468 trunk_domain trunk dom 98.4 4E-05 8.6E-10 70.0 18.7 167 81-260 2-224 (239)
62 PF04811 Sec23_trunk: Sec23/Se 98.4 1.4E-05 3E-10 73.1 15.5 164 81-261 2-227 (243)
63 PF04285 DUF444: Protein of un 98.4 1.1E-05 2.5E-10 78.3 15.3 184 64-273 227-418 (421)
64 KOG2807|consensus 98.4 1.4E-05 3.1E-10 73.4 14.7 174 82-280 60-241 (378)
65 COG2304 Uncharacterized protei 98.3 1.7E-05 3.7E-10 77.4 15.8 165 77-260 32-201 (399)
66 cd01479 Sec24-like Sec24-like: 98.3 6.6E-05 1.4E-09 68.7 18.4 184 81-276 2-242 (244)
67 COG4547 CobT Cobalamin biosynt 98.2 7.7E-06 1.7E-10 78.5 9.4 141 83-237 414-590 (620)
68 PF06707 DUF1194: Protein of u 98.0 0.00093 2E-08 58.9 18.6 177 82-275 3-203 (205)
69 cd01478 Sec23-like Sec23-like: 98.0 0.00037 7.9E-09 64.6 16.8 171 81-261 2-257 (267)
70 PF11443 DUF2828: Domain of un 98.0 4.8E-05 1E-09 76.2 10.6 110 81-204 339-449 (534)
71 PF03731 Ku_N: Ku70/Ku80 N-ter 97.9 8E-05 1.7E-09 67.1 10.2 143 85-233 2-172 (224)
72 COG2718 Uncharacterized conser 97.9 0.00025 5.4E-09 67.1 12.6 183 64-273 228-417 (423)
73 PLN00162 transport protein sec 97.8 0.0015 3.3E-08 69.2 19.1 185 79-273 121-392 (761)
74 PF07002 Copine: Copine; Inte 97.8 0.00045 9.7E-09 58.0 12.2 127 98-234 11-146 (146)
75 TIGR00627 tfb4 transcription f 97.7 0.0023 5.1E-08 59.3 16.7 181 83-269 3-219 (279)
76 smart00187 INB Integrin beta s 97.7 0.0025 5.4E-08 61.9 17.4 189 76-280 93-344 (423)
77 KOG1327|consensus 97.6 0.0015 3.3E-08 64.9 14.4 157 81-249 284-464 (529)
78 KOG2884|consensus 97.5 0.007 1.5E-07 53.2 15.1 171 84-280 5-190 (259)
79 KOG1985|consensus 97.3 0.0084 1.8E-07 62.0 15.0 160 79-253 291-504 (887)
80 PF03850 Tfb4: Transcription f 97.1 0.032 7E-07 51.9 16.1 180 83-269 2-217 (276)
81 PTZ00395 Sec24-related protein 97.1 0.015 3.3E-07 63.3 15.3 166 80-258 950-1179(1560)
82 PF11265 Med25_VWA: Mediator c 97.0 0.011 2.5E-07 52.9 12.0 119 80-204 11-151 (226)
83 COG5148 RPN10 26S proteasome r 97.0 0.033 7.2E-07 47.9 14.0 150 83-251 4-162 (243)
84 COG5151 SSL1 RNA polymerase II 97.0 0.018 4E-07 53.0 12.8 171 83-282 88-274 (421)
85 COG3864 Uncharacterized protei 96.8 0.0038 8.3E-08 57.7 7.4 94 84-206 263-357 (396)
86 TIGR00578 ku70 ATP-dependent D 96.8 0.0077 1.7E-07 62.0 10.2 116 83-203 11-150 (584)
87 COG3552 CoxE Protein containin 96.6 0.015 3.3E-07 55.2 9.9 110 80-206 216-326 (395)
88 PF00362 Integrin_beta: Integr 96.2 0.0048 1E-07 60.9 4.0 187 77-281 97-348 (426)
89 KOG1984|consensus 95.6 0.32 6.9E-06 51.1 14.3 103 78-181 413-545 (1007)
90 KOG4465|consensus 95.6 0.032 7E-07 52.6 6.5 129 64-214 409-546 (598)
91 COG1721 Uncharacterized conser 95.6 0.12 2.7E-06 51.0 11.0 106 81-204 223-335 (416)
92 COG5242 TFB4 RNA polymerase II 95.5 1.7 3.7E-05 38.7 16.5 179 72-263 11-218 (296)
93 COG5028 Vesicle coat complex C 94.8 1.4 3.1E-05 45.9 15.7 157 79-252 273-479 (861)
94 KOG2487|consensus 94.0 3.2 6.9E-05 38.1 14.6 174 81-263 22-231 (314)
95 KOG1226|consensus 92.1 0.5 1.1E-05 49.0 7.5 124 75-203 125-273 (783)
96 KOG1986|consensus 92.1 11 0.00024 39.1 16.9 180 81-272 120-378 (745)
97 COG5271 MDN1 AAA ATPase contai 91.2 6.3 0.00014 45.5 14.8 123 83-214 4393-4518(4600)
98 PF02743 Cache_1: Cache domain 91.2 0.23 4.9E-06 36.8 3.1 22 329-351 10-31 (81)
99 PRK15426 putative diguanylate 84.2 1.2 2.7E-05 45.5 4.4 57 10-78 154-211 (570)
100 KOG2326|consensus 82.8 28 0.0006 35.7 12.8 143 83-235 5-166 (669)
101 PF01882 DUF58: Protein of unk 73.9 7.4 0.00016 28.9 4.7 42 80-121 38-85 (86)
102 TIGR02921 PEP_integral PEP-CTE 70.5 23 0.00049 36.4 8.3 59 189-255 725-786 (952)
103 KOG0071|consensus 61.8 13 0.00027 31.1 4.0 43 70-115 73-115 (180)
104 PF08496 Peptidase_S49_N: Pept 51.6 36 0.00079 28.8 5.3 43 82-124 96-138 (155)
105 TIGR01204 bioW 6-carboxyhexano 45.4 12 0.00025 33.8 1.4 35 1-47 165-200 (232)
106 PRK01322 6-carboxyhexanoate--C 41.9 14 0.00031 33.5 1.4 35 1-47 172-207 (242)
107 PF03744 BioW: 6-carboxyhexano 41.6 15 0.00032 33.4 1.4 35 1-47 172-207 (239)
108 PF13778 DUF4174: Domain of un 37.5 1.7E+02 0.0037 23.3 7.0 70 82-158 41-111 (118)
109 PF06415 iPGM_N: BPG-independe 37.3 2E+02 0.0043 26.0 7.9 44 191-237 29-74 (223)
110 COG5047 SEC23 Vesicle coat com 33.1 83 0.0018 32.3 5.2 48 82-131 122-169 (755)
111 KOG2327|consensus 32.8 3.3E+02 0.0072 28.1 9.4 117 82-204 18-157 (602)
112 PF10221 DUF2151: Cell cycle a 32.6 3.3E+02 0.0071 29.0 9.7 91 84-180 7-130 (695)
113 PF13829 DUF4191: Domain of un 31.2 82 0.0018 28.4 4.5 55 191-248 121-175 (224)
114 KOG0070|consensus 29.4 1.1E+02 0.0023 26.7 4.7 45 81-129 84-128 (181)
115 PF00025 Arf: ADP-ribosylation 25.1 1.6E+02 0.0034 24.9 5.2 30 81-113 81-110 (175)
116 smart00467 GS GS motif. Aa app 25.0 58 0.0013 19.5 1.6 14 91-104 11-24 (30)
117 PF09439 SRPRB: Signal recogni 21.9 1.3E+02 0.0028 26.2 3.9 23 70-92 61-86 (181)
No 1
>KOG2353|consensus
Probab=100.00 E-value=3.6e-62 Score=509.69 Aligned_cols=298 Identities=51% Similarity=0.947 Sum_probs=281.3
Q ss_pred ChhhhhhcchHHHHHHHHHhhcCCCCceeEEeeCCCCceeecCCCCcccccccCCCCCCCCCccccccccccceeeeccC
Q psy10004 1 EEVLRVAKWSEALNDIFVSNYQADPSLSWQVFGSTTGALRVFPAHQWTDFLDLFKPEGRTDNMTDLYDCRVREWYINAAA 80 (354)
Q Consensus 1 ~~v~~~i~ws~~ld~~f~~n~~~~p~l~wqyfgs~~G~~r~yP~~~~~~~~~~~~~~~~~~~~~~~yd~r~r~wy~~a~~ 80 (354)
|+|+++|+||++||.||++|+++||+|.||||||++|++|+||+..|.+ + ...+.+++||||.|+||+++++
T Consensus 152 ~~vl~~~~WS~~ld~vf~~N~~~dP~l~wQ~Fgs~tG~~R~yPa~~W~~-------~-~~~~~idl~D~R~r~Wyi~aAt 223 (1104)
T KOG2353|consen 152 PDVLNAIDWSEHLDFVFRDNRESDPTLSWQYFGSATGLLRYYPAAAWFD-------N-NTDNSIDLYDCRNRSWYIQAAT 223 (1104)
T ss_pred HHHHhhcccccchhHHHhhccccCchhhHHHhhhhhhhhhhCCCccCcc-------C-CCCCcceeeecccccccccccC
Confidence 6899999999999999999999999999999999999999999999986 2 2346799999999999999999
Q ss_pred CCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhcC
Q psy10004 81 SPKDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIGD 160 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~~ 160 (354)
+||||+||+|.||||.|.++.++|..+..+|++|+++|+|++++|++.+..++||+.+.+++|+..|++.++++++.+ .
T Consensus 224 ~pKdiviLlD~SgSm~g~~~~lak~tv~~iLdtLs~~Dfvni~tf~~~~~~v~pc~~~~lvqAt~~nk~~~~~~i~~l-~ 302 (1104)
T KOG2353|consen 224 SPKDIVILLDVSGSMSGLRLDLAKQTVNEILDTLSDNDFVNILTFNSEVNPVSPCFNGTLVQATMRNKKVFKEAIETL-D 302 (1104)
T ss_pred CccceEEEEeccccccchhhHHHHHHHHHHHHhcccCCeEEEEeeccccCcccccccCceeecchHHHHHHHHHHhhh-c
Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999999 7
Q ss_pred CCCCCcHHHHHHHHHHHHHhccccCC--CCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEEEEEEeCCCCC
Q psy10004 161 ANNIANFTVALTRAFNILENARNDKK--TGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVA 238 (354)
Q Consensus 161 ~~G~T~~~~aL~~A~~~l~~~~~~~~--~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~ 238 (354)
+.|.+++..||+.||++|.+.+.++. .++.|+++||++|||.+++..++++.++| +.+.||||||.+|.++.
T Consensus 303 ~k~~a~~~~~~e~aF~lL~~~n~s~~~~~~~~C~~~iml~tdG~~~~~~~If~~yn~------~~~~Vrvftflig~~~~ 376 (1104)
T KOG2353|consen 303 AKGIANYTAALEYAFSLLRDYNDSRANTQRSPCNQAIMLITDGVDENAKEIFEKYNW------PDKKVRVFTFLIGDEVY 376 (1104)
T ss_pred cccccchhhhHHHHHHHHHHhccccccccccccceeeEEeecCCcccHHHHHHhhcc------CCCceEEEEEEeccccc
Confidence 99999999999999999997766551 23459999999999999999999999997 45799999999999988
Q ss_pred CHHHHHHHHhcCCcEEEEeCChhHHHHHHHHHHHHhccchhcc-CCCCCCccCCcccccCCCCcchhHHHHHHHHHhhhh
Q psy10004 239 DYRDVKWMACANKGYYVHLSTLAEVRDQILSYVPVMARPLVLQ-RNDHPIVWTPIYADVTDPKLSDWLWELKECEEQRER 317 (354)
Q Consensus 239 ~~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~~~~~l~~pl~~~-~~~~~~~w~~~y~d~~~~~~~~~~~~~~~~~~~~~~ 317 (354)
+...++||||.++|+|++|.+.+++.+..++|+++++||++++ .+.+|++||.+|.|.
T Consensus 377 ~~~~~~wmac~n~gyy~~I~~~~~v~~~~~~y~~vlsRp~vl~~~~~~pv~wt~vy~~~--------------------- 435 (1104)
T KOG2353|consen 377 DLDEIQWMACANKGYYVHIISIADVRENVLEYLDVLSRPLVLQGEEAHPVTWTQVYTDL--------------------- 435 (1104)
T ss_pred ccccchhhhhhCCCceEeccchhhcChHhhhhhhhhccceeecccccCCceeeehhhhh---------------------
Confidence 8899999999999999999999999999999999999999998 788999999999998
Q ss_pred cccccccccCCCCceEEEEeeeeeecC
Q psy10004 318 SSYDQNSLRVSPYRLLTTVAMPAFDRR 344 (354)
Q Consensus 318 ~~~~~~~~~~~~~~l~~tv~~pv~~~~ 344 (354)
.+++++++|+.|||++.
T Consensus 436 ----------~~~~~i~~~~~pv~~~~ 452 (1104)
T KOG2353|consen 436 ----------LYLGLIVTVSLPVFNRT 452 (1104)
T ss_pred ----------ccCcceEecccCcccch
Confidence 36788999999999985
No 2
>cd01463 vWA_VGCC_like VWA Voltage gated Calcium channel like: Voltage-gated calcium channels are a complex of five proteins: alpha 1, beta 1, gamma, alpha 2 and delta. The alpha 2 and delta subunits result from proteolytic processing of a single gene product and carries at its N-terminus the VWA and cache domains, The alpha 2 delta gene family has orthologues in D. melanogaster and C. elegans but none have been detected in aither A. thaliana or yeast. The exact biochemical function of the VWA domain is not known but the alpha 2 delta complex has been shown to regulate various functional properties of the channel complex.
Probab=100.00 E-value=8.7e-33 Score=243.65 Aligned_cols=187 Identities=51% Similarity=0.951 Sum_probs=161.6
Q ss_pred cccceeeeccCCCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHH
Q psy10004 70 RVREWYINAAASPKDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVR 149 (354)
Q Consensus 70 r~r~wy~~a~~~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~ 149 (354)
|.||||++++..|++++||||+||||.+.+++.+|+++..+++.|.++|+|+|+.|++.++.++||+++.+.+.+..+++
T Consensus 1 ~~~~~~~~~~~~p~~vv~llD~SgSM~~~~l~~ak~~~~~ll~~l~~~d~v~lv~F~~~~~~~~~~~~~~~~~~~~~~~~ 80 (190)
T cd01463 1 RNRSWYIQAATSPKDIVILLDVSGSMTGQRLHLAKQTVSSILDTLSDNDFFNIITFSNEVNPVVPCFNDTLVQATTSNKK 80 (190)
T ss_pred CCCcceeecccCCceEEEEEECCCCCCcHHHHHHHHHHHHHHHhCCCCCEEEEEEeCCCeeEEeeecccceEecCHHHHH
Confidence 78999999999999999999999999999999999999999999999999999999999999999988777777788999
Q ss_pred HHHHHHhhhcCCCCCCcHHHHHHHHHHHHHhccccC--CCCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeE
Q psy10004 150 ELKMGVEMIGDANNIANFTVALTRAFNILENARNDK--KTGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVR 227 (354)
Q Consensus 150 ~~~~~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~--~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~ 227 (354)
.+..+|..+ .+.|+|++..||..|++.+....... ...+.+.+.|||||||.+++...++..+++... ...+++
T Consensus 81 ~~~~~l~~l-~~~G~T~~~~al~~a~~~l~~~~~~~~~~~~~~~~~~iillTDG~~~~~~~~~~~~~~~~~---~~~~v~ 156 (190)
T cd01463 81 VLKEALDML-EAKGIANYTKALEFAFSLLLKNLQSNHSGSRSQCNQAIMLITDGVPENYKEIFDKYNWDKN---SEIPVR 156 (190)
T ss_pred HHHHHHhhC-CCCCcchHHHHHHHHHHHHHHhhhcccccccCCceeEEEEEeCCCCCcHhHHHHHhccccc---CCCcEE
Confidence 999999999 99999999999999999997622110 012345689999999999988877777642111 235899
Q ss_pred EEEEEeCCCCCCHHHHHHHHhcCCcEEEEeCCh
Q psy10004 228 VFSYLVGKEVADYRDVKWMACANKGYYVHLSTL 260 (354)
Q Consensus 228 IftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~ 260 (354)
||+||+|.+..+..+|+.||..++|+|+++.+.
T Consensus 157 i~tigiG~~~~d~~~L~~lA~~~~G~~~~i~~~ 189 (190)
T cd01463 157 VFTYLIGREVTDRREIQWMACENKGYYSHIQSL 189 (190)
T ss_pred EEEEecCCccccchHHHHHHhhcCCeEEEcccC
Confidence 999999987547889999999999999999875
No 3
>PF13768 VWA_3: von Willebrand factor type A domain
Probab=99.94 E-value=1.7e-25 Score=190.56 Aligned_cols=152 Identities=28% Similarity=0.469 Sum_probs=133.0
Q ss_pred ceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhcCC-
Q psy10004 83 KDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIGDA- 161 (354)
Q Consensus 83 ~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~~~- 161 (354)
.+||||||+|+||.|.. +.+|.++..++++|+++|+|+|+.|++.+..+.| .+++++.++++.+.++|..+ .+
T Consensus 1 ~~vvilvD~S~Sm~g~~-~~~k~al~~~l~~L~~~d~fnii~f~~~~~~~~~----~~~~~~~~~~~~a~~~I~~~-~~~ 74 (155)
T PF13768_consen 1 ADVVILVDTSGSMSGEK-ELVKDALRAILRSLPPGDRFNIIAFGSSVRPLFP----GLVPATEENRQEALQWIKSL-EAN 74 (155)
T ss_pred CeEEEEEeCCCCCCCcH-HHHHHHHHHHHHhCCCCCEEEEEEeCCEeeEcch----hHHHHhHHHHHHHHHHHHHh-ccc
Confidence 48999999999999998 9999999999999999999999999999877555 57888999999999999999 88
Q ss_pred CCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCC-CchHHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCH
Q psy10004 162 NNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGAS-ENYKEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADY 240 (354)
Q Consensus 162 ~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~-~~~~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~ 240 (354)
.|+|++..||+.|+..+. .+++.+.|||+|||.+ .....+++.++. . ...++||+||+|... +.
T Consensus 75 ~G~t~l~~aL~~a~~~~~--------~~~~~~~IilltDG~~~~~~~~i~~~v~~-~-----~~~~~i~~~~~g~~~-~~ 139 (155)
T PF13768_consen 75 SGGTDLLAALRAALALLQ--------RPGCVRAIILLTDGQPVSGEEEILDLVRR-A-----RGHIRIFTFGIGSDA-DA 139 (155)
T ss_pred CCCccHHHHHHHHHHhcc--------cCCCccEEEEEEeccCCCCHHHHHHHHHh-c-----CCCceEEEEEECChh-HH
Confidence 999999999999998761 2457789999999997 445677777652 1 257999999999977 79
Q ss_pred HHHHHHHhcCCcEEE
Q psy10004 241 RDVKWMACANKGYYV 255 (354)
Q Consensus 241 ~~L~~iA~~~~G~~~ 255 (354)
..|++||+.++|+|.
T Consensus 140 ~~L~~LA~~~~G~~~ 154 (155)
T PF13768_consen 140 DFLRELARATGGSFH 154 (155)
T ss_pred HHHHHHHHcCCCEEE
Confidence 999999999999986
No 4
>cd01461 vWA_interalpha_trypsin_inhibitor vWA_interalpha trypsin inhibitor (ITI): ITI is a glycoprotein composed of three polypeptides- two heavy chains and one light chain (bikunin). Bikunin confers the protease-inhibitor function while the heavy chains are involved in rendering stability to the extracellular matrix by binding to hyaluronic acid. The heavy chains carry the VWA domain with a conserved MIDAS motif. Although the exact role of the VWA domains remains unknown, it has been speculated to be involved in mediating protein-protein interactions with the components of the extracellular matrix.
Probab=99.94 E-value=7e-25 Score=189.15 Aligned_cols=169 Identities=25% Similarity=0.401 Sum_probs=143.6
Q ss_pred CceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhcCC
Q psy10004 82 PKDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIGDA 161 (354)
Q Consensus 82 p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~~~ 161 (354)
|++++||||+||||.|.+++.+++++..++..+.+.++++|+.|++++..+.++ ....+..++..+.+.+..+ .+
T Consensus 2 ~~~v~~vlD~S~SM~~~~~~~~~~al~~~l~~l~~~~~~~l~~Fs~~~~~~~~~----~~~~~~~~~~~~~~~l~~~-~~ 76 (171)
T cd01461 2 PKEVVFVIDTSGSMSGTKIEQTKEALLTALKDLPPGDYFNIIGFSDTVEEFSPS----SVSATAENVAAAIEYVNRL-QA 76 (171)
T ss_pred CceEEEEEECCCCCCChhHHHHHHHHHHHHHhCCCCCEEEEEEeCCCceeecCc----ceeCCHHHHHHHHHHHHhc-CC
Confidence 889999999999999999999999999999999999999999999998765442 2234556778888889999 88
Q ss_pred CCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHH
Q psy10004 162 NNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYR 241 (354)
Q Consensus 162 ~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~ 241 (354)
+|+|++..||..|++.+... ..+.+.||++|||.+++..++.+.++. . ...+++||+||+|.+. +..
T Consensus 77 ~g~T~l~~al~~a~~~l~~~-------~~~~~~iillTDG~~~~~~~~~~~~~~-~----~~~~i~i~~i~~g~~~-~~~ 143 (171)
T cd01461 77 LGGTNMNDALEAALELLNSS-------PGSVPQIILLTDGEVTNESQILKNVRE-A----LSGRIRLFTFGIGSDV-NTY 143 (171)
T ss_pred CCCcCHHHHHHHHHHhhccC-------CCCccEEEEEeCCCCCCHHHHHHHHHH-h----cCCCceEEEEEeCCcc-CHH
Confidence 99999999999999988541 245689999999998877777666642 1 1248999999999876 789
Q ss_pred HHHHHHhcCCcEEEEeCChhHHHHHHH
Q psy10004 242 DVKWMACANKGYYVHLSTLAEVRDQIL 268 (354)
Q Consensus 242 ~L~~iA~~~~G~~~~i~~~~~~~~~l~ 268 (354)
.|+.||+.++|.|.++.+.+++.+++.
T Consensus 144 ~l~~ia~~~gG~~~~~~~~~~~~~~~~ 170 (171)
T cd01461 144 LLERLAREGRGIARRIYETDDIESQLL 170 (171)
T ss_pred HHHHHHHcCCCeEEEecChHHHHHHhc
Confidence 999999999999999999999988874
No 5
>TIGR03788 marine_srt_targ marine proteobacterial sortase target protein. Members of this protein family are restricted to the Proteobacteria. Each contains a C-terminal sortase-recognition motif, transmembrane domain, and basic residues cluster at the the C-terminus, and is encoded adjacent to a sortase gene. This protein is frequently the only sortase target in its genome, which is as unusual its occurrence in Gram-negative rather than Gram-positive genomes. Many bacteria with this system are marine. In addition to the LPXTG signal, members carry a vault protein inter-alpha-trypsin inhibitor domain (pfam08487) and a von Willebrand factor type A domain (pfam00092).
Probab=99.93 E-value=4.7e-25 Score=225.87 Aligned_cols=183 Identities=23% Similarity=0.318 Sum_probs=159.9
Q ss_pred CCCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhc
Q psy10004 80 ASPKDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIG 159 (354)
Q Consensus 80 ~~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~ 159 (354)
..|++++||||+||||.|.+++.+|.++..++..|.++|+|+|+.|++.+..+.+ ....++..+++.+..+|..+
T Consensus 269 ~~p~~vvfvlD~SgSM~g~~i~~ak~al~~~l~~L~~~d~~~ii~F~~~~~~~~~----~~~~~~~~~~~~a~~~i~~l- 343 (596)
T TIGR03788 269 VLPRELVFVIDTSGSMAGESIEQAKSALLLALDQLRPGDRFNIIQFDSDVTLLFP----VPVPATAHNLARARQFVAGL- 343 (596)
T ss_pred CCCceEEEEEECCCCCCCccHHHHHHHHHHHHHhCCCCCEEEEEEECCcceEecc----ccccCCHHHHHHHHHHHhhC-
Confidence 4689999999999999999999999999999999999999999999999877544 45667888999999999999
Q ss_pred CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCC
Q psy10004 160 DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVAD 239 (354)
Q Consensus 160 ~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~ 239 (354)
.++|+|++..||..|++.... ......+.|||||||...+..++++.++. . ..+++||+||+|.+. +
T Consensus 344 ~a~GgT~l~~aL~~a~~~~~~------~~~~~~~~iillTDG~~~~~~~~~~~~~~-~-----~~~~ri~tvGiG~~~-n 410 (596)
T TIGR03788 344 QADGGTEMAGALSAALRDDGP------ESSGALRQVVFLTDGAVGNEDALFQLIRT-K-----LGDSRLFTVGIGSAP-N 410 (596)
T ss_pred CCCCCccHHHHHHHHHHhhcc------cCCCceeEEEEEeCCCCCCHHHHHHHHHH-h-----cCCceEEEEEeCCCc-C
Confidence 999999999999999986321 12345678999999998887777776642 1 236899999999987 8
Q ss_pred HHHHHHHHhcCCcEEEEeCChhHHHHHHHHHHHHhccchhc
Q psy10004 240 YRDVKWMACANKGYYVHLSTLAEVRDQILSYVPVMARPLVL 280 (354)
Q Consensus 240 ~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~~~~~l~~pl~~ 280 (354)
..+|+.||..++|.|.++.+.+++..++.+++..+.+|++.
T Consensus 411 ~~lL~~lA~~g~G~~~~i~~~~~~~~~~~~~l~~~~~p~l~ 451 (596)
T TIGR03788 411 SYFMRKAAQFGRGSFTFIGSTDEVQRKMSQLFAKLEQPALT 451 (596)
T ss_pred HHHHHHHHHcCCCEEEECCCHHHHHHHHHHHHHhhcCeEEE
Confidence 99999999999999999999999999999999999999887
No 6
>cd01466 vWA_C3HC4_type VWA C3HC4-type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most,
Probab=99.92 E-value=2.9e-24 Score=183.18 Aligned_cols=154 Identities=23% Similarity=0.405 Sum_probs=128.6
Q ss_pred ceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhcCCC
Q psy10004 83 KDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIGDAN 162 (354)
Q Consensus 83 ~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~~~~ 162 (354)
.|++||||+||||.+.+++.+|+++..+++.|.++|+++|+.|++.++...| +...+..++..+...+..+ .++
T Consensus 1 ~~v~~vlD~S~SM~~~rl~~ak~a~~~l~~~l~~~~~~~li~F~~~~~~~~~-----~~~~~~~~~~~~~~~i~~~-~~~ 74 (155)
T cd01466 1 VDLVAVLDVSGSMAGDKLQLVKHALRFVISSLGDADRLSIVTFSTSAKRLSP-----LRRMTAKGKRSAKRVVDGL-QAG 74 (155)
T ss_pred CcEEEEEECCCCCCcHHHHHHHHHHHHHHHhCCCcceEEEEEecCCccccCC-----CcccCHHHHHHHHHHHHhc-cCC
Confidence 4799999999999999999999999999999999999999999998877554 3344556778899999999 999
Q ss_pred CCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHH
Q psy10004 163 NIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRD 242 (354)
Q Consensus 163 G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~ 242 (354)
|+|++..||..|++.+.... ..+..+.|||+|||.++....... . ...+++||+||+|.+. +...
T Consensus 75 g~T~~~~al~~a~~~~~~~~-----~~~~~~~iillTDG~~~~~~~~~~-----~----~~~~v~v~~igig~~~-~~~~ 139 (155)
T cd01466 75 GGTNVVGGLKKALKVLGDRR-----QKNPVASIMLLSDGQDNHGAVVLR-----A----DNAPIPIHTFGLGASH-DPAL 139 (155)
T ss_pred CCccHHHHHHHHHHHHhhcc-----cCCCceEEEEEcCCCCCcchhhhc-----c----cCCCceEEEEecCCCC-CHHH
Confidence 99999999999999986532 123457899999999876522211 1 2358999999999876 8899
Q ss_pred HHHHHhcCCcEEEEe
Q psy10004 243 VKWMACANKGYYVHL 257 (354)
Q Consensus 243 L~~iA~~~~G~~~~i 257 (354)
|++||+.++|.|+++
T Consensus 140 l~~iA~~t~G~~~~~ 154 (155)
T cd01466 140 LAFIAEITGGTFSYV 154 (155)
T ss_pred HHHHHhccCceEEEe
Confidence 999999999999876
No 7
>cd01465 vWA_subgroup VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if n
Probab=99.90 E-value=1.1e-22 Score=175.31 Aligned_cols=163 Identities=17% Similarity=0.245 Sum_probs=132.3
Q ss_pred eEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhcCCCC
Q psy10004 84 DMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIGDANN 163 (354)
Q Consensus 84 dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~~~~G 163 (354)
+++||||+||||.+.+++.+|+++..++..|.++++++|+.|++..+.+++.. ...+...+...|..+ .++|
T Consensus 2 ~~~~vlD~S~SM~~~~~~~~k~a~~~~~~~l~~~~~v~li~f~~~~~~~~~~~-------~~~~~~~l~~~l~~~-~~~g 73 (170)
T cd01465 2 NLVFVIDRSGSMDGPKLPLVKSALKLLVDQLRPDDRLAIVTYDGAAETVLPAT-------PVRDKAAILAAIDRL-TAGG 73 (170)
T ss_pred cEEEEEECCCCCCChhHHHHHHHHHHHHHhCCCCCEEEEEEecCCccEEecCc-------ccchHHHHHHHHHcC-CCCC
Confidence 78999999999999899999999999999999999999999999987755421 113457788889999 8999
Q ss_pred CCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch----HHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCC
Q psy10004 164 IANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY----KEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVAD 239 (354)
Q Consensus 164 ~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~----~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~ 239 (354)
+|++..||..|++.+.+... +.+.+.|||+|||.++.. +.+.+.++ .+ ...+++||+||+|.+. +
T Consensus 74 ~T~~~~al~~a~~~~~~~~~-----~~~~~~ivl~TDG~~~~~~~~~~~~~~~~~--~~---~~~~v~i~~i~~g~~~-~ 142 (170)
T cd01465 74 STAGGAGIQLGYQEAQKHFV-----PGGVNRILLATDGDFNVGETDPDELARLVA--QK---RESGITLSTLGFGDNY-N 142 (170)
T ss_pred CCCHHHHHHHHHHHHHhhcC-----CCCeeEEEEEeCCCCCCCCCCHHHHHHHHH--Hh---hcCCeEEEEEEeCCCc-C
Confidence 99999999999999875432 234477999999998632 33433332 11 2358999999999665 8
Q ss_pred HHHHHHHHhcCCcEEEEeCChhHHHH
Q psy10004 240 YRDVKWMACANKGYYVHLSTLAEVRD 265 (354)
Q Consensus 240 ~~~L~~iA~~~~G~~~~i~~~~~~~~ 265 (354)
...|++||+.++|.|+++.+..++.+
T Consensus 143 ~~~l~~ia~~~~g~~~~~~~~~~~~~ 168 (170)
T cd01465 143 EDLMEAIADAGNGNTAYIDNLAEARK 168 (170)
T ss_pred HHHHHHHHhcCCceEEEeCCHHHHHh
Confidence 99999999999999999999998765
No 8
>cd01470 vWA_complement_factors Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.
Probab=99.90 E-value=1.7e-22 Score=179.21 Aligned_cols=175 Identities=17% Similarity=0.266 Sum_probs=130.3
Q ss_pred eEEEEEeCCCCCCcchHHHHHHHHHHHHHHcC---CCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhcC
Q psy10004 84 DMVILLDNSGSMMGQRREIARHVINNLLDTLG---NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIGD 160 (354)
Q Consensus 84 dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~---~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~~ 160 (354)
|++||||+||||.+.+++.+|.++..+++.|. ..++++|+.|++.++.++| +......++..+..+|..+ .
T Consensus 2 di~~vlD~SgSM~~~~~~~~k~~~~~l~~~l~~~~~~~~v~li~Fs~~~~~~~~-----~~~~~~~~~~~~~~~l~~~-~ 75 (198)
T cd01470 2 NIYIALDASDSIGEEDFDEAKNAIKTLIEKISSYEVSPRYEIISYASDPKEIVS-----IRDFNSNDADDVIKRLEDF-N 75 (198)
T ss_pred cEEEEEECCCCccHHHHHHHHHHHHHHHHHccccCCCceEEEEEecCCceEEEe-----cccCCCCCHHHHHHHHHhC-C
Confidence 79999999999999999999999999999986 4799999999999887655 2233445677788888877 3
Q ss_pred -----CCCCCcHHHHHHHHHHHHHhccccC-CCCCCCcceEEEeccCCCCc---hHHHHHHhhhc-ccC----CCCCCCe
Q psy10004 161 -----ANNIANFTVALTRAFNILENARNDK-KTGADCNQAIMVVTDGASEN---YKEVFEEFNWR-GQN----DSTLWPV 226 (354)
Q Consensus 161 -----~~G~T~~~~aL~~A~~~l~~~~~~~-~~~~~~~~~IillTDG~~~~---~~~i~~~~~~~-~~~----~~~~~~v 226 (354)
+.|+|++..||..+++.+....... ......+++|||||||.++. +....+.+... ... .....+|
T Consensus 76 ~~~~~~~ggT~~~~Al~~~~~~l~~~~~~~~~~~~~~~~~iillTDG~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~v 155 (198)
T cd01470 76 YDDHGDKTGTNTAAALKKVYERMALEKVRNKEAFNETRHVIILFTDGKSNMGGSPLPTVDKIKNLVYKNNKSDNPREDYL 155 (198)
T ss_pred cccccCccchhHHHHHHHHHHHHHHHHhcCccchhhcceEEEEEcCCCcCCCCChhHHHHHHHHHHhcccccccchhcce
Confidence 3589999999999998874322111 01123468899999999873 33333333200 000 0023579
Q ss_pred EEEEEEeCCCCCCHHHHHHHHhcCCc--EEEEeCChhHHHH
Q psy10004 227 RVFSYLVGKEVADYRDVKWMACANKG--YYVHLSTLAEVRD 265 (354)
Q Consensus 227 ~IftigiG~~~~~~~~L~~iA~~~~G--~~~~i~~~~~~~~ 265 (354)
+||+||+|.+. +..+|+.||..++| +++.+.+.+++.+
T Consensus 156 ~i~~iGvG~~~-~~~~L~~iA~~~~g~~~~f~~~~~~~l~~ 195 (198)
T cd01470 156 DVYVFGVGDDV-NKEELNDLASKKDNERHFFKLKDYEDLQE 195 (198)
T ss_pred eEEEEecCccc-CHHHHHHHhcCCCCCceEEEeCCHHHHHH
Confidence 99999999877 89999999999998 6788888877654
No 9
>cd01456 vWA_ywmD_type VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=99.88 E-value=1.8e-21 Score=173.73 Aligned_cols=167 Identities=22% Similarity=0.295 Sum_probs=127.0
Q ss_pred CCCceEEEEEeCCCCCC------cchHHHHHHHHHHHHHHcCCCCeEEEEEeeccccc------ccc-cccccccccC-h
Q psy10004 80 ASPKDMVILLDNSGSMM------GQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKE------VVP-CFADILVQAN-L 145 (354)
Q Consensus 80 ~~p~dvvillD~SgSM~------g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~------~~~-c~~~~lv~~~-~ 145 (354)
..|.+++||||+||||. +.+++.+|+++..+++.+.++++|+|+.|++.... .+| |.....+... .
T Consensus 18 ~~~~~vv~vlD~SgSM~~~~~~~~~rl~~ak~a~~~~l~~l~~~~~v~lv~F~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 97 (206)
T cd01456 18 QLPPNVAIVLDNSGSMREVDGGGETRLDNAKAALDETANALPDGTRLGLWTFSGDGDNPLDVRVLVPKGCLTAPVNGFPS 97 (206)
T ss_pred CCCCcEEEEEeCCCCCcCCCCCcchHHHHHHHHHHHHHHhCCCCceEEEEEecCCCCCCccccccccccccccccCCCCc
Confidence 47899999999999998 58999999999999999999999999999996432 111 1000001111 1
Q ss_pred hhHHHHHHHHhhhcC-CCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch---HHHHHHhhhcccCCC
Q psy10004 146 ANVRELKMGVEMIGD-ANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY---KEVFEEFNWRGQNDS 221 (354)
Q Consensus 146 ~n~~~~~~~i~~l~~-~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~---~~i~~~~~~~~~~~~ 221 (354)
.++..+.+.|+.+ . +.|+|++..||..|.+++. . +..+.|||||||.++.. .+...++.. ...
T Consensus 98 ~~~~~l~~~i~~i-~~~~G~T~l~~aL~~a~~~l~----~-----~~~~~iillTDG~~~~~~~~~~~~~~~~~-~~~-- 164 (206)
T cd01456 98 AQRSALDAALNSL-QTPTGWTPLAAALAEAAAYVD----P-----GRVNVVVLITDGEDTCGPDPCEVARELAK-RRT-- 164 (206)
T ss_pred ccHHHHHHHHHhh-cCCCCcChHHHHHHHHHHHhC----C-----CCcceEEEEcCCCccCCCCHHHHHHHHHH-hcC--
Confidence 4788899999999 8 9999999999999999885 1 12368999999998754 333443321 110
Q ss_pred CCCCeEEEEEEeCCCCCCHHHHHHHHhcCCcEE-EEeCCh
Q psy10004 222 TLWPVRVFSYLVGKEVADYRDVKWMACANKGYY-VHLSTL 260 (354)
Q Consensus 222 ~~~~v~IftigiG~~~~~~~~L~~iA~~~~G~~-~~i~~~ 260 (354)
...+|+||+||||.+. +...|+.||..++|.| +.+.+.
T Consensus 165 ~~~~i~i~~igiG~~~-~~~~l~~iA~~tgG~~~~~~~~~ 203 (206)
T cd01456 165 PAPPIKVNVIDFGGDA-DRAELEAIAEATGGTYAYNQSDL 203 (206)
T ss_pred CCCCceEEEEEecCcc-cHHHHHHHHHhcCCeEecccccc
Confidence 2358999999999876 7899999999999999 655544
No 10
>cd01480 vWA_collagen_alpha_1-VI-type VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=99.86 E-value=7.7e-21 Score=166.96 Aligned_cols=160 Identities=21% Similarity=0.281 Sum_probs=124.8
Q ss_pred CCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcC---------CCCeEEEEEeecccccccccccccccccChhhHHHH
Q psy10004 81 SPKDMVILLDNSGSMMGQRREIARHVINNLLDTLG---------NNDYVNVLQFTSVCKEVVPCFADILVQANLANVREL 151 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~---------~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~ 151 (354)
||.|++||||+||||.+.+++.+|++++.+++.|. .+++++|+.|++.+...++. .. ...++..+
T Consensus 1 c~~dvv~vlD~S~Sm~~~~~~~~k~~~~~~~~~l~~~~~~~i~~~~~rvglv~fs~~~~~~~~l-----~~-~~~~~~~l 74 (186)
T cd01480 1 GPVDITFVLDSSESVGLQNFDITKNFVKRVAERFLKDYYRKDPAGSWRVGVVQYSDQQEVEAGF-----LR-DIRNYTSL 74 (186)
T ss_pred CCeeEEEEEeCCCccchhhHHHHHHHHHHHHHHHhhhhccCCCCCceEEEEEEecCCceeeEec-----cc-ccCCHHHH
Confidence 68999999999999999999999999999998883 46899999999998765441 11 13568889
Q ss_pred HHHHhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch-----HHHHHHhhhcccCCCCCCCe
Q psy10004 152 KMGVEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY-----KEVFEEFNWRGQNDSTLWPV 226 (354)
Q Consensus 152 ~~~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~-----~~i~~~~~~~~~~~~~~~~v 226 (354)
.++|+++...+|+|++..||..|.+.+... ......++|||+|||.++.. ....+.++ ..+|
T Consensus 75 ~~~i~~l~~~gg~T~~~~AL~~a~~~l~~~-----~~~~~~~~iillTDG~~~~~~~~~~~~~~~~~~--------~~gi 141 (186)
T cd01480 75 KEAVDNLEYIGGGTFTDCALKYATEQLLEG-----SHQKENKFLLVITDGHSDGSPDGGIEKAVNEAD--------HLGI 141 (186)
T ss_pred HHHHHhCccCCCCccHHHHHHHHHHHHhcc-----CCCCCceEEEEEeCCCcCCCcchhHHHHHHHHH--------HCCC
Confidence 999999833689999999999999998751 12457899999999987421 22233332 3589
Q ss_pred EEEEEEeCCCCCCHHHHHHHHhcCCcEEEEeCChhH
Q psy10004 227 RVFSYLVGKEVADYRDVKWMACANKGYYVHLSTLAE 262 (354)
Q Consensus 227 ~IftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~~~ 262 (354)
.||+||+|. . +...|++||+.+++.|+ ..+..+
T Consensus 142 ~i~~vgig~-~-~~~~L~~IA~~~~~~~~-~~~~~~ 174 (186)
T cd01480 142 KIFFVAVGS-Q-NEEPLSRIACDGKSALY-RENFAE 174 (186)
T ss_pred EEEEEecCc-c-chHHHHHHHcCCcchhh-hcchhh
Confidence 999999998 4 78899999999998644 334333
No 11
>PRK13685 hypothetical protein; Provisional
Probab=99.86 E-value=2.9e-20 Score=177.14 Aligned_cols=175 Identities=15% Similarity=0.161 Sum_probs=135.2
Q ss_pred CCCceEEEEEeCCCCCCc-----chHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHH
Q psy10004 80 ASPKDMVILLDNSGSMMG-----QRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMG 154 (354)
Q Consensus 80 ~~p~dvvillD~SgSM~g-----~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~ 154 (354)
..+.+++||||+||||.+ ++++.+|+++..+++.+.++|++++|.|++++..+.| ...+.+.+..+
T Consensus 86 ~~~~~vvlvlD~S~SM~~~D~~p~RL~~ak~~~~~~l~~l~~~d~vglv~Fa~~a~~~~p---------~t~d~~~l~~~ 156 (326)
T PRK13685 86 RNRAVVMLVIDVSQSMRATDVEPNRLAAAQEAAKQFADELTPGINLGLIAFAGTATVLVS---------PTTNREATKNA 156 (326)
T ss_pred CCCceEEEEEECCccccCCCCCCCHHHHHHHHHHHHHHhCCCCCeEEEEEEcCceeecCC---------CCCCHHHHHHH
Confidence 356789999999999986 6899999999999999999999999999999865443 23577889999
Q ss_pred HhhhcCCCCCCcHHHHHHHHHHHHHhcccc-CCCCCCCcceEEEeccCCCCch------H---HHHHHhhhcccCCCCCC
Q psy10004 155 VEMIGDANNIANFTVALTRAFNILENARND-KKTGADCNQAIMVVTDGASENY------K---EVFEEFNWRGQNDSTLW 224 (354)
Q Consensus 155 i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~-~~~~~~~~~~IillTDG~~~~~------~---~i~~~~~~~~~~~~~~~ 224 (354)
|+.+ .+.|+|+++.++..|++.+...... ........+.||++|||..+.. . ...+... ..
T Consensus 157 l~~l-~~~~~T~~g~al~~A~~~l~~~~~~~~~~~~~~~~~IILlTDG~~~~~~~~~~~~~~~~aa~~a~--------~~ 227 (326)
T PRK13685 157 IDKL-QLADRTATGEAIFTALQAIATVGAVIGGGDTPPPARIVLMSDGKETVPTNPDNPRGAYTAARTAK--------DQ 227 (326)
T ss_pred HHhC-CCCCCcchHHHHHHHHHHHHhhhcccccccCCCCCEEEEEcCCCCCCCCCCCCcccHHHHHHHHH--------Hc
Confidence 9999 9999999999999999998642110 0001223467999999997642 1 1222221 35
Q ss_pred CeEEEEEEeCCCC-------------CCHHHHHHHHhcCCcEEEEeCChhHHHHHHHHHHH
Q psy10004 225 PVRVFSYLVGKEV-------------ADYRDVKWMACANKGYYVHLSTLAEVRDQILSYVP 272 (354)
Q Consensus 225 ~v~IftigiG~~~-------------~~~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~~~~ 272 (354)
+|+||+||+|.+. .+...|++||..++|.|+++.+.+++.+.+.++-+
T Consensus 228 gi~i~~Ig~G~~~g~~~~~g~~~~~~~d~~~L~~iA~~tgG~~~~~~~~~~L~~if~~I~~ 288 (326)
T PRK13685 228 GVPISTISFGTPYGSVEINGQRQPVPVDDESLKKIAQLSGGEFYTAASLEELRAVYATLQQ 288 (326)
T ss_pred CCeEEEEEECCCCCCcCcCCceeeecCCHHHHHHHHHhcCCEEEEcCCHHHHHHHHHHHHH
Confidence 8999999999742 26789999999999999999998776655544433
No 12
>cd01472 vWA_collagen von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.
Probab=99.85 E-value=3.1e-20 Score=159.60 Aligned_cols=155 Identities=20% Similarity=0.215 Sum_probs=122.6
Q ss_pred ceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcC---CCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhc
Q psy10004 83 KDMVILLDNSGSMMGQRREIARHVINNLLDTLG---NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIG 159 (354)
Q Consensus 83 ~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~---~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~ 159 (354)
+||+||||.||||.+.+++.+|+++..++..|. .+++++|+.|+++++...++. ...+...+.+.++.+
T Consensus 1 ~Dvv~vlD~SgSm~~~~~~~~k~~~~~~~~~l~~~~~~~~~giv~Fs~~~~~~~~~~-------~~~~~~~~~~~l~~l- 72 (164)
T cd01472 1 ADIVFLVDGSESIGLSNFNLVKDFVKRVVERLDIGPDGVRVGVVQYSDDPRTEFYLN-------TYRSKDDVLEAVKNL- 72 (164)
T ss_pred CCEEEEEeCCCCCCHHHHHHHHHHHHHHHhhcccCCCCeEEEEEEEcCceeEEEecC-------CCCCHHHHHHHHHhC-
Confidence 489999999999999999999999999999886 467999999999987765521 134567888889888
Q ss_pred CC-CCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchH-HHHHHhhhcccCCCCCCCeEEEEEEeCCCC
Q psy10004 160 DA-NNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYK-EVFEEFNWRGQNDSTLWPVRVFSYLVGKEV 237 (354)
Q Consensus 160 ~~-~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~-~i~~~~~~~~~~~~~~~~v~IftigiG~~~ 237 (354)
.. +|+|++..||..|.+.+..... . ...+++++|||+|||.+++.. .....+. ..+|+||+||+|..
T Consensus 73 ~~~~g~T~~~~al~~a~~~l~~~~~-~-~~~~~~~~iiliTDG~~~~~~~~~~~~l~--------~~gv~i~~ig~g~~- 141 (164)
T cd01472 73 RYIGGGTNTGKALKYVRENLFTEAS-G-SREGVPKVLVVITDGKSQDDVEEPAVELK--------QAGIEVFAVGVKNA- 141 (164)
T ss_pred cCCCCCchHHHHHHHHHHHhCCccc-C-CCCCCCEEEEEEcCCCCCchHHHHHHHHH--------HCCCEEEEEECCcC-
Confidence 65 7889999999999998875321 1 234678999999999887532 2222222 24799999999975
Q ss_pred CCHHHHHHHHhcCCcEEEEe
Q psy10004 238 ADYRDVKWMACANKGYYVHL 257 (354)
Q Consensus 238 ~~~~~L~~iA~~~~G~~~~i 257 (354)
+...|++||+.++|.|...
T Consensus 142 -~~~~L~~ia~~~~~~~~~~ 160 (164)
T cd01472 142 -DEEELKQIASDPKELYVFN 160 (164)
T ss_pred -CHHHHHHHHCCCchheEEe
Confidence 7999999999999866653
No 13
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=99.85 E-value=1.2e-19 Score=187.95 Aligned_cols=180 Identities=17% Similarity=0.236 Sum_probs=135.4
Q ss_pred cceeeeccCCCceEEEEEeCCCCCCc-chHHHHHHHHHHHH-HHcCCCCeEEEEEeecccccccccccccccccCh-hhH
Q psy10004 72 REWYINAAASPKDMVILLDNSGSMMG-QRREIARHVINNLL-DTLGNNDYVNVLQFTSVCKEVVPCFADILVQANL-ANV 148 (354)
Q Consensus 72 r~wy~~a~~~p~dvvillD~SgSM~g-~~~~~ak~a~~~ll-~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~-~n~ 148 (354)
.|+|......+++|+||||+||||.+ .++..+++++..++ ..+.++|+++||.|++.+..+.+ +.+.+. ..+
T Consensus 294 ~P~Fs~lq~~~r~VVLVLDvSGSM~g~dRL~~lkqAA~~fL~~~l~~~DrVGLVtFsssA~vl~p-----Lt~Its~~dr 368 (863)
T TIGR00868 294 PPTFSLLKIRQRIVCLVLDKSGSMTVEDRLKRMNQAAKLFLLQTVEKGSWVGMVTFDSAAYIKNE-----LIQITSSAER 368 (863)
T ss_pred CCceeecccCCceEEEEEECCccccccCHHHHHHHHHHHHHHHhCCCCCEEEEEEECCceeEeec-----cccCCcHHHH
Confidence 35555555567889999999999986 68999999988765 56789999999999999876433 444444 334
Q ss_pred HHHHHHHhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEE
Q psy10004 149 RELKMGVEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRV 228 (354)
Q Consensus 149 ~~~~~~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~I 228 (354)
+.+...+ .+ .+.|+|++..||..|+++|.+... ....+.|||||||.++....+...+. +.+|+|
T Consensus 369 ~aL~~~L-~~-~A~GGT~I~~GL~~Alq~L~~~~~-----~~~~~~IILLTDGedn~~~~~l~~lk--------~~gVtI 433 (863)
T TIGR00868 369 DALTANL-PT-AASGGTSICSGLKAAFQVIKKSYQ-----STDGSEIVLLTDGEDNTISSCFEEVK--------QSGAII 433 (863)
T ss_pred HHHHHhh-cc-ccCCCCcHHHHHHHHHHHHHhccc-----ccCCCEEEEEeCCCCCCHHHHHHHHH--------HcCCEE
Confidence 4455544 35 689999999999999999986432 13457899999999987766665543 248999
Q ss_pred EEEEeCCCCCCHHHHHHHHhcCCcEEEEeCChhHHHHHHHHHHHHh
Q psy10004 229 FSYLVGKEVADYRDVKWMACANKGYYVHLSTLAEVRDQILSYVPVM 274 (354)
Q Consensus 229 ftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~~~~~l 274 (354)
|+||+|.+. ...|+.||..++|.|+++.+.+++. .|.+-|..+
T Consensus 434 ~TIg~G~da--d~~L~~IA~~TGG~~f~asd~~dl~-~L~dAF~~i 476 (863)
T TIGR00868 434 HTIALGPSA--AKELEELSDMTGGLRFYASDQADNN-GLIDAFGAL 476 (863)
T ss_pred EEEEeCCCh--HHHHHHHHHhcCCEEEEeCCHHHHH-HHHHHHHHH
Confidence 999999865 4569999999999999998876543 344444444
No 14
>cd01474 vWA_ATR ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.
Probab=99.84 E-value=2.3e-19 Score=157.39 Aligned_cols=174 Identities=14% Similarity=0.109 Sum_probs=120.0
Q ss_pred CCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHc-CCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhc
Q psy10004 81 SPKDMVILLDNSGSMMGQRREIARHVINNLLDTL-GNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIG 159 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L-~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~ 159 (354)
.+.|++||||+||||.+. +..++++++.++..+ .++++|+|++|++.++..++. . .. ........+.+..+
T Consensus 3 ~~~Dvv~llD~SgSm~~~-~~~~~~~~~~l~~~~~~~~~rvglv~Fs~~~~~~~~l-~----~~-~~~~~~~l~~l~~~- 74 (185)
T cd01474 3 GHFDLYFVLDKSGSVAAN-WIEIYDFVEQLVDRFNSPGLRFSFITFSTRATKILPL-T----DD-SSAIIKGLEVLKKV- 74 (185)
T ss_pred CceeEEEEEeCcCchhhh-HHHHHHHHHHHHHHcCCCCcEEEEEEecCCceEEEec-c----cc-HHHHHHHHHHHhcc-
Confidence 578999999999999874 444456777777664 467999999999998776652 1 11 11222333345666
Q ss_pred CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCC--ch---HHHHHHhhhcccCCCCCCCeEEEEEEeC
Q psy10004 160 DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASE--NY---KEVFEEFNWRGQNDSTLWPVRVFSYLVG 234 (354)
Q Consensus 160 ~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~--~~---~~i~~~~~~~~~~~~~~~~v~IftigiG 234 (354)
.++|+|++..||..|.+.+....... ....++|||+|||.++ .. ....+.++ ..++.||+||+|
T Consensus 75 ~~~g~T~~~~aL~~a~~~l~~~~~~~---r~~~~~villTDG~~~~~~~~~~~~~a~~l~--------~~gv~i~~vgv~ 143 (185)
T cd01474 75 TPSGQTYIHEGLENANEQIFNRNGGG---RETVSVIIALTDGQLLLNGHKYPEHEAKLSR--------KLGAIVYCVGVT 143 (185)
T ss_pred CCCCCCcHHHHHHHHHHHHHhhccCC---CCCCeEEEEEcCCCcCCCCCcchHHHHHHHH--------HcCCEEEEEeec
Confidence 78899999999999999885422211 1223899999999984 22 22223332 247999999995
Q ss_pred CCCCCHHHHHHHHhcCCcEEEEeCChhHHHHHHHHHHHHhc
Q psy10004 235 KEVADYRDVKWMACANKGYYVHLSTLAEVRDQILSYVPVMA 275 (354)
Q Consensus 235 ~~~~~~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~~~~~l~ 275 (354)
. . +..+|+.||..+++.|....+.+++...+.++.+.++
T Consensus 144 ~-~-~~~~L~~iA~~~~~~f~~~~~~~~l~~~~~~~~~~~C 182 (185)
T cd01474 144 D-F-LKSQLINIADSKEYVFPVTSGFQALSGIIESVVKKAC 182 (185)
T ss_pred h-h-hHHHHHHHhCCCCeeEecCccHHHHHHHHHHHHHhhc
Confidence 3 3 7889999999886655556677777666666555544
No 15
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=99.83 E-value=2.7e-19 Score=161.76 Aligned_cols=178 Identities=20% Similarity=0.292 Sum_probs=133.1
Q ss_pred CceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcC---CCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhh
Q psy10004 82 PKDMVILLDNSGSMMGQRREIARHVINNLLDTLG---NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMI 158 (354)
Q Consensus 82 p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~---~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l 158 (354)
|.||+||||+||||.+.+++.+|+++..+++.|. ..++|+||.|++.++..++. . ...+...+..+|..+
T Consensus 2 ~~DlvfllD~S~Sm~~~~~~~~k~f~~~l~~~l~~~~~~~rvglv~fs~~~~~~~~l-----~--~~~~~~~l~~~i~~i 74 (224)
T cd01475 2 PTDLVFLIDSSRSVRPENFELVKQFLNQIIDSLDVGPDATRVGLVQYSSTVKQEFPL-----G--RFKSKADLKRAVRRM 74 (224)
T ss_pred CccEEEEEeCCCCCCHHHHHHHHHHHHHHHHhcccCCCccEEEEEEecCceeEEecc-----c--ccCCHHHHHHHHHhC
Confidence 7899999999999999999999999999999985 45799999999998876552 1 123456777888888
Q ss_pred cCCCCCCcHHHHHHHHHHHHHhc-cccCCCCCCCcceEEEeccCCCCc-hHHHHHHhhhcccCCCCCCCeEEEEEEeCCC
Q psy10004 159 GDANNIANFTVALTRAFNILENA-RNDKKTGADCNQAIMVVTDGASEN-YKEVFEEFNWRGQNDSTLWPVRVFSYLVGKE 236 (354)
Q Consensus 159 ~~~~G~T~~~~aL~~A~~~l~~~-~~~~~~~~~~~~~IillTDG~~~~-~~~i~~~~~~~~~~~~~~~~v~IftigiG~~ 236 (354)
...+|+|++..||..|.+.+... ...+....+.+++|||+|||.+.+ ..+..+.++ ..+|.||+||+|.
T Consensus 75 ~~~~~~t~tg~AL~~a~~~~~~~~~g~r~~~~~~~kvvillTDG~s~~~~~~~a~~lk--------~~gv~i~~VgvG~- 145 (224)
T cd01475 75 EYLETGTMTGLAIQYAMNNAFSEAEGARPGSERVPRVGIVVTDGRPQDDVSEVAAKAR--------ALGIEMFAVGVGR- 145 (224)
T ss_pred cCCCCCChHHHHHHHHHHHhCChhcCCCCCCCCCCeEEEEEcCCCCcccHHHHHHHHH--------HCCcEEEEEeCCc-
Confidence 44567899999999999765322 222211223478999999998865 334444442 2479999999997
Q ss_pred CCCHHHHHHHHhcCC-cEEEEeCChhHHHHHHHHHHHHhcc
Q psy10004 237 VADYRDVKWMACANK-GYYVHLSTLAEVRDQILSYVPVMAR 276 (354)
Q Consensus 237 ~~~~~~L~~iA~~~~-G~~~~i~~~~~~~~~l~~~~~~l~~ 276 (354)
.+...|+.||..++ ++++.+.+.+++...+.++...++.
T Consensus 146 -~~~~~L~~ias~~~~~~~f~~~~~~~l~~~~~~l~~~~C~ 185 (224)
T cd01475 146 -ADEEELREIASEPLADHVFYVEDFSTIEELTKKFQGKICV 185 (224)
T ss_pred -CCHHHHHHHhCCCcHhcEEEeCCHHHHHHHhhhcccccCc
Confidence 37899999998765 4778888888776666555555553
No 16
>cd01451 vWA_Magnesium_chelatase Magnesium chelatase: Mg-chelatase catalyses the insertion of Mg into protoporphyrin IX (Proto). In chlorophyll biosynthesis, insertion of Mg2+ into protoporphyrin IX is catalysed by magnesium chelatase in an ATP-dependent reaction. Magnesium chelatase is a three sub-unit (BchI, BchD and BchH) enzyme with a novel arrangement of domains: the C-terminal helical domain is located behind the nucleotide binding site. The BchD domain contains a AAA domain at its N-terminus and a VWA domain at its C-terminus. The VWA domain has been speculated to be involved in mediating protein-protein interactions.
Probab=99.83 E-value=2.6e-19 Score=156.09 Aligned_cols=159 Identities=19% Similarity=0.245 Sum_probs=120.0
Q ss_pred EEEEEeCCCCCCcc-hHHHHHHHHHHHHH-HcCCCCeEEEEEeecc-cccccccccccccccChhhHHHHHHHHhhhcCC
Q psy10004 85 MVILLDNSGSMMGQ-RREIARHVINNLLD-TLGNNDYVNVLQFTSV-CKEVVPCFADILVQANLANVRELKMGVEMIGDA 161 (354)
Q Consensus 85 vvillD~SgSM~g~-~~~~ak~a~~~ll~-~L~~~d~v~vi~Fs~~-~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~~~ 161 (354)
++||||+||||.+. +++.+|.++..++. .+.++|+++|+.|+++ ....+| .+ .++..+...++.+ .+
T Consensus 3 v~lvlD~SgSM~~~~rl~~ak~a~~~~~~~~~~~~d~v~lv~F~~~~~~~~~~--------~t-~~~~~~~~~l~~l-~~ 72 (178)
T cd01451 3 VIFVVDASGSMAARHRMAAAKGAVLSLLRDAYQRRDKVALIAFRGTEAEVLLP--------PT-RSVELAKRRLARL-PT 72 (178)
T ss_pred EEEEEECCccCCCccHHHHHHHHHHHHHHHhhcCCCEEEEEEECCCCceEEeC--------CC-CCHHHHHHHHHhC-CC
Confidence 78999999999976 99999999999986 4678999999999875 443322 12 3566778889999 99
Q ss_pred CCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCc---hH-HH-HHHhhhcccCCCCCCCeEEEEEEeCCC
Q psy10004 162 NNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASEN---YK-EV-FEEFNWRGQNDSTLWPVRVFSYLVGKE 236 (354)
Q Consensus 162 ~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~---~~-~i-~~~~~~~~~~~~~~~~v~IftigiG~~ 236 (354)
+|+|++..||..|++.+.... . .....+.|||||||.++. +. .. .+..+ .. ...+|.+++|++|.+
T Consensus 73 ~G~T~l~~aL~~a~~~l~~~~-~---~~~~~~~ivliTDG~~~~g~~~~~~~~~~~~~--~l---~~~gi~v~~I~~~~~ 143 (178)
T cd01451 73 GGGTPLAAGLLAAYELAAEQA-R---DPGQRPLIVVITDGRANVGPDPTADRALAAAR--KL---RARGISALVIDTEGR 143 (178)
T ss_pred CCCCcHHHHHHHHHHHHHHHh-c---CCCCceEEEEECCCCCCCCCCchhHHHHHHHH--HH---HhcCCcEEEEeCCCC
Confidence 999999999999999983211 1 122358999999999873 21 12 11111 11 135788999999875
Q ss_pred CCCHHHHHHHHhcCCcEEEEeCChhH
Q psy10004 237 VADYRDVKWMACANKGYYVHLSTLAE 262 (354)
Q Consensus 237 ~~~~~~L~~iA~~~~G~~~~i~~~~~ 262 (354)
..+...|+.||..++|.|+++.+.+.
T Consensus 144 ~~~~~~l~~iA~~tgG~~~~~~d~~~ 169 (178)
T cd01451 144 PVRRGLAKDLARALGGQYVRLPDLSA 169 (178)
T ss_pred ccCccHHHHHHHHcCCeEEEcCcCCH
Confidence 44678999999999999999988754
No 17
>cd01471 vWA_micronemal_protein Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.
Probab=99.83 E-value=2.3e-19 Score=157.38 Aligned_cols=173 Identities=18% Similarity=0.189 Sum_probs=123.5
Q ss_pred ceEEEEEeCCCCCCcch-HHHHHHHHHHHHHHcC---CCCeEEEEEeecccccccccccccccccChhhHHH---HHHHH
Q psy10004 83 KDMVILLDNSGSMMGQR-REIARHVINNLLDTLG---NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRE---LKMGV 155 (354)
Q Consensus 83 ~dvvillD~SgSM~g~~-~~~ak~a~~~ll~~L~---~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~---~~~~i 155 (354)
.||+||||+||||.+.. ++.+|+++..+++.+. ++.+++|+.|++.++..++. ......++.. +.+.+
T Consensus 1 ~Dv~~vlD~SgSm~~~~~~~~~k~~~~~~~~~~~~~~~~~~vglv~Fs~~~~~~~~l-----~~~~~~~~~~~~~~i~~l 75 (186)
T cd01471 1 LDLYLLVDGSGSIGYSNWVTHVVPFLHTFVQNLNISPDEINLYLVTFSTNAKELIRL-----SSPNSTNKDLALNAIRAL 75 (186)
T ss_pred CcEEEEEeCCCCccchhhHHHHHHHHHHHHHhcccCCCceEEEEEEecCCceEEEEC-----CCccccchHHHHHHHHHH
Confidence 38999999999999887 8999999999999875 45699999999998876652 1112223444 33333
Q ss_pred hhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEEEEEEeCC
Q psy10004 156 EMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRVFSYLVGK 235 (354)
Q Consensus 156 ~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~IftigiG~ 235 (354)
.....++|+|++..||..|.+.+..... ...+..++|||+|||.+++.....+..+ .. ...++.|++||+|.
T Consensus 76 ~~~~~~~G~T~l~~aL~~a~~~l~~~~~---~r~~~~~~villTDG~~~~~~~~~~~a~--~l---~~~gv~v~~igiG~ 147 (186)
T cd01471 76 LSLYYPNGSTNTTSALLVVEKHLFDTRG---NRENAPQLVIIMTDGIPDSKFRTLKEAR--KL---RERGVIIAVLGVGQ 147 (186)
T ss_pred HhCcCCCCCccHHHHHHHHHHHhhccCC---CcccCceEEEEEccCCCCCCcchhHHHH--HH---HHCCCEEEEEEeeh
Confidence 3332578999999999999999976422 2245678999999999986543332221 11 12479999999998
Q ss_pred CCCCHHHHHHHHhcC--Cc--EEEEeCChhHHHHHHHH
Q psy10004 236 EVADYRDVKWMACAN--KG--YYVHLSTLAEVRDQILS 269 (354)
Q Consensus 236 ~~~~~~~L~~iA~~~--~G--~~~~i~~~~~~~~~l~~ 269 (354)
+. +.++|+.||... .| ..+...+.+++...+..
T Consensus 148 ~~-d~~~l~~ia~~~~~~~~~~~~~~~~~~~~~~~~~~ 184 (186)
T cd01471 148 GV-NHEENRSLVGCDPDDSPCPLYLQSSWSEVQNVIKP 184 (186)
T ss_pred hh-CHHHHHHhcCCCCCCCCCCeeecCCHHHHHHHhhc
Confidence 76 899999998764 21 34445666776665543
No 18
>PF13519 VWA_2: von Willebrand factor type A domain; PDB: 3IBS_B 3RAG_B 2X5N_A.
Probab=99.83 E-value=2.5e-19 Score=153.66 Aligned_cols=161 Identities=25% Similarity=0.402 Sum_probs=121.2
Q ss_pred eEEEEEeCCCCCCcc-----hHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhh
Q psy10004 84 DMVILLDNSGSMMGQ-----RREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMI 158 (354)
Q Consensus 84 dvvillD~SgSM~g~-----~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l 158 (354)
|++||+|.||||.+. +++.++.++..+++.+. +++|+|+.|++......+ ...+...+++.+.++
T Consensus 1 dvv~v~D~SgSM~~~~~~~~~~~~~~~~~~~~~~~~~-~~~v~l~~f~~~~~~~~~---------~t~~~~~~~~~l~~~ 70 (172)
T PF13519_consen 1 DVVFVLDNSGSMNGYDGNRTRIDQAKDALNELLANLP-GDRVGLVSFSDSSRTLSP---------LTSDKDELKNALNKL 70 (172)
T ss_dssp EEEEEEE-SGGGGTTTSSS-HHHHHHHHHHHHHHHHT-TSEEEEEEESTSCEEEEE---------EESSHHHHHHHHHTH
T ss_pred CEEEEEECCcccCCCCCCCcHHHHHHHHHHHHHHHCC-CCEEEEEEeccccccccc---------ccccHHHHHHHhhcc
Confidence 799999999999976 79999999999999975 789999999998654332 124667777777766
Q ss_pred c---CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch-HHHHHHhhhcccCCCCCCCeEEEEEEeC
Q psy10004 159 G---DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY-KEVFEEFNWRGQNDSTLWPVRVFSYLVG 234 (354)
Q Consensus 159 ~---~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~-~~i~~~~~~~~~~~~~~~~v~IftigiG 234 (354)
. .+.|+|++..|+..|.+++.... ...+.|||+|||.++.. .+..+.+. ..+|+||+|++|
T Consensus 71 ~~~~~~~~~t~~~~al~~a~~~~~~~~-------~~~~~iv~iTDG~~~~~~~~~~~~~~--------~~~i~i~~v~~~ 135 (172)
T PF13519_consen 71 SPQGMPGGGTNLYDALQEAAKMLASSD-------NRRRAIVLITDGEDNSSDIEAAKALK--------QQGITIYTVGIG 135 (172)
T ss_dssp HHHG--SSS--HHHHHHHHHHHHHC-S-------SEEEEEEEEES-TTHCHHHHHHHHHH--------CTTEEEEEEEES
T ss_pred cccccCccCCcHHHHHHHHHHHHHhCC-------CCceEEEEecCCCCCcchhHHHHHHH--------HcCCeEEEEEEC
Confidence 1 35789999999999999997532 35689999999988732 33444332 357999999999
Q ss_pred CCCCCHHHHHHHHhcCCcEEEEe-CChhHHHHHHHH
Q psy10004 235 KEVADYRDVKWMACANKGYYVHL-STLAEVRDQILS 269 (354)
Q Consensus 235 ~~~~~~~~L~~iA~~~~G~~~~i-~~~~~~~~~l~~ 269 (354)
........|++||..++|.|+++ .+.+++.+.+++
T Consensus 136 ~~~~~~~~l~~la~~tgG~~~~~~~~~~~l~~~~~~ 171 (172)
T PF13519_consen 136 SDSDANEFLQRLAEATGGRYFHVDNDPEDLDDAFQQ 171 (172)
T ss_dssp -TT-EHHHHHHHHHHTEEEEEEE-SSSHHHHHHHHH
T ss_pred CCccHHHHHHHHHHhcCCEEEEecCCHHHHHHHHhc
Confidence 87744689999999999999999 577887776654
No 19
>cd01464 vWA_subfamily VWA subfamily: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=99.82 E-value=2.6e-19 Score=155.79 Aligned_cols=150 Identities=20% Similarity=0.263 Sum_probs=112.1
Q ss_pred CceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCC------CCeEEEEEeecccccccccccccccccChhhHHHHHHHH
Q psy10004 82 PKDMVILLDNSGSMMGQRREIARHVINNLLDTLGN------NDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGV 155 (354)
Q Consensus 82 p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~------~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i 155 (354)
..+|+||||+||||.+.++..+++++..+++.|.+ +++++||.|++.++.++|+- .. . ...+
T Consensus 3 ~~~v~~llD~SgSM~~~~~~~~k~a~~~~~~~l~~~~~~~~~~~v~ii~F~~~a~~~~~l~-----~~--~-----~~~~ 70 (176)
T cd01464 3 RLPIYLLLDTSGSMAGEPIEALNQGLQMLQSELRQDPYALESVEISVITFDSAARVIVPLT-----PL--E-----SFQP 70 (176)
T ss_pred CCCEEEEEECCCCCCChHHHHHHHHHHHHHHHHhcChhhccccEEEEEEecCCceEecCCc-----cH--H-----hcCC
Confidence 46799999999999999999999999999999864 46999999999988766531 10 0 1124
Q ss_pred hhhcCCCCCCcHHHHHHHHHHHHHhccccC--CCCCCCcceEEEeccCCCCch-HHHHHHhhhcccCCCCCCCeEEEEEE
Q psy10004 156 EMIGDANNIANFTVALTRAFNILENARNDK--KTGADCNQAIMVVTDGASENY-KEVFEEFNWRGQNDSTLWPVRVFSYL 232 (354)
Q Consensus 156 ~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~--~~~~~~~~~IillTDG~~~~~-~~i~~~~~~~~~~~~~~~~v~Iftig 232 (354)
..+ .++|+|++..||..|++.+....... .......+.|||||||.+++. ....+.++ .. ...+++|++||
T Consensus 71 ~~l-~~~GgT~l~~aL~~a~~~l~~~~~~~~~~~~~~~~~~iillTDG~~~~~~~~~~~~~~--~~---~~~~~~i~~ig 144 (176)
T cd01464 71 PRL-TASGGTSMGAALELALDCIDRRVQRYRADQKGDWRPWVFLLTDGEPTDDLTAAIERIK--EA---RDSKGRIVACA 144 (176)
T ss_pred Ccc-cCCCCCcHHHHHHHHHHHHHHHHHHhcccCcCCcCcEEEEEcCCCCCchHHHHHHHHH--hh---cccCCcEEEEE
Confidence 466 88999999999999999997543211 011234578999999999864 32334342 22 12479999999
Q ss_pred eCCCCCCHHHHHHHHhcC
Q psy10004 233 VGKEVADYRDVKWMACAN 250 (354)
Q Consensus 233 iG~~~~~~~~L~~iA~~~ 250 (354)
+|.+. +.+.|+.||...
T Consensus 145 iG~~~-~~~~L~~ia~~~ 161 (176)
T cd01464 145 VGPKA-DLDTLKQITEGV 161 (176)
T ss_pred ecccc-CHHHHHHHHCCC
Confidence 99976 899999999643
No 20
>cd01469 vWA_integrins_alpha_subunit Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.
Probab=99.82 E-value=9.7e-19 Score=152.41 Aligned_cols=164 Identities=17% Similarity=0.260 Sum_probs=125.4
Q ss_pred ceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCC---CCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhc
Q psy10004 83 KDMVILLDNSGSMMGQRREIARHVINNLLDTLGN---NDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIG 159 (354)
Q Consensus 83 ~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~---~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~ 159 (354)
.|++||||.|+||.+..++.+|.++..+++.|.. ..+|+|+.|++.++..++ +. ...+...+.++++.+.
T Consensus 1 ~Di~fvlD~S~S~~~~~f~~~k~fi~~~i~~l~~~~~~~rvgvv~fs~~~~~~~~-l~------~~~~~~~~~~~i~~~~ 73 (177)
T cd01469 1 MDIVFVLDGSGSIYPDDFQKVKNFLSTVMKKLDIGPTKTQFGLVQYSESFRTEFT-LN------EYRTKEEPLSLVKHIS 73 (177)
T ss_pred CcEEEEEeCCCCCCHHHHHHHHHHHHHHHHHcCcCCCCcEEEEEEECCceeEEEe-cC------ccCCHHHHHHHHHhCc
Confidence 3899999999999998999999999999999874 679999999999877554 22 2233456777787774
Q ss_pred CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchH---HHHHHhhhcccCCCCCCCeEEEEEEeCCC
Q psy10004 160 DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYK---EVFEEFNWRGQNDSTLWPVRVFSYLVGKE 236 (354)
Q Consensus 160 ~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~---~i~~~~~~~~~~~~~~~~v~IftigiG~~ 236 (354)
..+|+|++..||..|.+.+...... ..++.++++||+|||..++.. ++.+.++ ..+|.||+||+|..
T Consensus 74 ~~~g~T~~~~AL~~a~~~l~~~~~g--~R~~~~kv~illTDG~~~~~~~~~~~~~~~k--------~~gv~v~~Vgvg~~ 143 (177)
T cd01469 74 QLLGLTNTATAIQYVVTELFSESNG--ARKDATKVLVVITDGESHDDPLLKDVIPQAE--------REGIIRYAIGVGGH 143 (177)
T ss_pred cCCCCccHHHHHHHHHHHhcCcccC--CCCCCCeEEEEEeCCCCCCccccHHHHHHHH--------HCCcEEEEEEeccc
Confidence 5678899999999999887432111 234578999999999987543 3344332 25899999999986
Q ss_pred CC---CHHHHHHHHhcCCc-EEEEeCChhHH
Q psy10004 237 VA---DYRDVKWMACANKG-YYVHLSTLAEV 263 (354)
Q Consensus 237 ~~---~~~~L~~iA~~~~G-~~~~i~~~~~~ 263 (354)
.. +..+|+.||+.+++ +++.+.+.+++
T Consensus 144 ~~~~~~~~~L~~ias~p~~~h~f~~~~~~~l 174 (177)
T cd01469 144 FQRENSREELKTIASKPPEEHFFNVTDFAAL 174 (177)
T ss_pred ccccccHHHHHHHhcCCcHHhEEEecCHHHh
Confidence 42 37899999999875 66777777654
No 21
>cd01453 vWA_transcription_factor_IIH_type Transcription factors IIH type: TFIIH is a multiprotein complex that is one of the five general transcription factors that binds RNA polymerase II holoenzyme. Orthologues of these genes are found in all completed eukaryotic genomes and all these proteins contain a VWA domain. The p44 subunit of TFIIH functions as a DNA helicase in RNA polymerase II transcription initiation and DNA repair, and its transcriptional activity is dependent on its C-terminal Zn-binding domains. The function of the vWA domain is unclear, but may be involved in complex assembly. The MIDAS motif is not conserved in this sub-group.
Probab=99.81 E-value=9.3e-19 Score=153.22 Aligned_cols=160 Identities=11% Similarity=0.203 Sum_probs=121.1
Q ss_pred ceEEEEEeCCCCCCc-----chHHHHHHHHHHHHHHc---CCCCeEEEEEe-ecccccccccccccccccChhhHHHHHH
Q psy10004 83 KDMVILLDNSGSMMG-----QRREIARHVINNLLDTL---GNNDYVNVLQF-TSVCKEVVPCFADILVQANLANVRELKM 153 (354)
Q Consensus 83 ~dvvillD~SgSM~g-----~~~~~ak~a~~~ll~~L---~~~d~v~vi~F-s~~~~~~~~c~~~~lv~~~~~n~~~~~~ 153 (354)
+|++|+||.|+||.. +|++.+|.++..+++.+ .+++++++|.| ++.+..+.|+- .+...+..
T Consensus 4 r~ivi~lD~S~SM~a~D~~ptRl~~ak~~~~~fi~~~~~~~~~~~vglv~f~~~~a~~~~PlT---------~D~~~~~~ 74 (183)
T cd01453 4 RHLIIVIDCSRSMEEQDLKPSRLAVVLKLLELFIEEFFDQNPISQLGIISIKNGRAEKLTDLT---------GNPRKHIQ 74 (183)
T ss_pred eEEEEEEECcHHHhcCCCCchHHHHHHHHHHHHHHHHhhcCccccEEEEEEcCCccEEEECCC---------CCHHHHHH
Confidence 589999999999985 69999999999999864 58899999999 78887766641 22334555
Q ss_pred HHhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchH---HHHHHhhhcccCCCCCCCeEEEE
Q psy10004 154 GVEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYK---EVFEEFNWRGQNDSTLWPVRVFS 230 (354)
Q Consensus 154 ~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~---~i~~~~~~~~~~~~~~~~v~Ift 230 (354)
.+..+..+.|+|++..||..|.+.|..... ....++|||+|||...++. ++.++++ ..+|+|++
T Consensus 75 ~L~~~~~~~G~t~l~~aL~~A~~~l~~~~~-----~~~~~iiil~sd~~~~~~~~~~~~~~~l~--------~~~I~v~~ 141 (183)
T cd01453 75 ALKTARECSGEPSLQNGLEMALESLKHMPS-----HGSREVLIIFSSLSTCDPGNIYETIDKLK--------KENIRVSV 141 (183)
T ss_pred HhhcccCCCCchhHHHHHHHHHHHHhcCCc-----cCceEEEEEEcCCCcCChhhHHHHHHHHH--------HcCcEEEE
Confidence 554431567889999999999999974211 1134689999998876543 3444432 24799999
Q ss_pred EEeCCCCCCHHHHHHHHhcCCcEEEEeCChhHHHHHH
Q psy10004 231 YLVGKEVADYRDVKWMACANKGYYVHLSTLAEVRDQI 267 (354)
Q Consensus 231 igiG~~~~~~~~L~~iA~~~~G~~~~i~~~~~~~~~l 267 (354)
||+|. +...|++||..++|.|+.+.+.+++.+.+
T Consensus 142 IgiG~---~~~~L~~ia~~tgG~~~~~~~~~~l~~~~ 175 (183)
T cd01453 142 IGLSA---EMHICKEICKATNGTYKVILDETHLKELL 175 (183)
T ss_pred EEech---HHHHHHHHHHHhCCeeEeeCCHHHHHHHH
Confidence 99995 45789999999999999998887665444
No 22
>TIGR03436 acidobact_VWFA VWFA-related Acidobacterial domain. Members of this family are bacterial domains that include a region related to the von Willebrand factor type A (VWFA) domain (pfam00092). These domains are restricted to, and have undergone a large paralogous family expansion in, the Acidobacteria, including Solibacter usitatus and Acidobacterium capsulatum ATCC 51196.
Probab=99.81 E-value=2e-18 Score=162.46 Aligned_cols=179 Identities=18% Similarity=0.195 Sum_probs=137.8
Q ss_pred CCCceEEEEEeCCCCCCcchHHHHHHHHHHHHHH-cCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhh
Q psy10004 80 ASPKDMVILLDNSGSMMGQRREIARHVINNLLDT-LGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMI 158 (354)
Q Consensus 80 ~~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~-L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l 158 (354)
..|.+++||||.||||.+ ++..+++++..+++. +.++|+++|+.|++.+..+.+ ...+...+.++|+.+
T Consensus 51 ~~p~~vvlvlD~SgSM~~-~~~~a~~a~~~~l~~~l~~~d~v~lv~f~~~~~~~~~---------~t~~~~~l~~~l~~l 120 (296)
T TIGR03436 51 DLPLTVGLVIDTSGSMRN-DLDRARAAAIRFLKTVLRPNDRVFVVTFNTRLRLLQD---------FTSDPRLLEAALNRL 120 (296)
T ss_pred CCCceEEEEEECCCCchH-HHHHHHHHHHHHHHhhCCCCCEEEEEEeCCceeEeec---------CCCCHHHHHHHHHhc
Confidence 359999999999999986 688899999999987 789999999999998765433 234577788888888
Q ss_pred cCC---------------CCCCcHHHHHHHHH-HHHHhccccCCCCCCCcceEEEeccCCCCchH----HHHHHhhhccc
Q psy10004 159 GDA---------------NNIANFTVALTRAF-NILENARNDKKTGADCNQAIMVVTDGASENYK----EVFEEFNWRGQ 218 (354)
Q Consensus 159 ~~~---------------~G~T~~~~aL~~A~-~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~----~i~~~~~~~~~ 218 (354)
.+ .|+|++..||..|. +++...... ....++||+||||..+... ++++.++
T Consensus 121 -~~~~~~~~~~~~~~~~~~g~T~l~~al~~aa~~~~~~~~~~----~p~rk~iIllTDG~~~~~~~~~~~~~~~~~---- 191 (296)
T TIGR03436 121 -KPPLRTDYNSSGAFVRDGGGTALYDAITLAALEQLANALAG----IPGRKALIVISDGGDNRSRDTLERAIDAAQ---- 191 (296)
T ss_pred -cCCCccccccccccccCCCcchhHHHHHHHHHHHHHHhhcC----CCCCeEEEEEecCCCcchHHHHHHHHHHHH----
Confidence 66 89999999987765 444433211 1135789999999987542 3333332
Q ss_pred CCCCCCCeEEEEEEeCCCC------------CCHHHHHHHHhcCCcEEEEeCChhHHHHHHHHHHHHhccchhccC
Q psy10004 219 NDSTLWPVRVFSYLVGKEV------------ADYRDVKWMACANKGYYVHLSTLAEVRDQILSYVPVMARPLVLQR 282 (354)
Q Consensus 219 ~~~~~~~v~IftigiG~~~------------~~~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~~~~~l~~pl~~~~ 282 (354)
..+|.||+||+|... .+...|+.||..++|.|+++ +..++...+.++...+.+..++..
T Consensus 192 ----~~~v~vy~I~~~~~~~~~~~~~~~~~~~~~~~L~~iA~~TGG~~~~~-~~~~l~~~f~~i~~~~~~~Y~l~y 262 (296)
T TIGR03436 192 ----RADVAIYSIDARGLRAPDLGAGAKAGLGGPEALERLAEETGGRAFYV-NSNDLDGAFAQIAEELRSQYLIGY 262 (296)
T ss_pred ----HcCCEEEEeccCccccCCcccccccCCCcHHHHHHHHHHhCCeEecc-cCccHHHHHHHHHHHHhheEEEEE
Confidence 247999999998421 25789999999999998888 888888888888888877766643
No 23
>cd01473 vWA_CTRP CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60 amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.
Probab=99.80 E-value=4.2e-18 Score=150.19 Aligned_cols=170 Identities=17% Similarity=0.194 Sum_probs=121.2
Q ss_pred eEEEEEeCCCCCCcchHH-HHHHHHHHHHHHcC---CCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhc
Q psy10004 84 DMVILLDNSGSMMGQRRE-IARHVINNLLDTLG---NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIG 159 (354)
Q Consensus 84 dvvillD~SgSM~g~~~~-~ak~a~~~ll~~L~---~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~ 159 (354)
|++||||.|+||....+. .++.+++.+++.|. .+.+|+|+.|++.++..+|. ......++..+.++|+.+.
T Consensus 2 Di~fllD~S~Si~~~~f~~~~~~f~~~lv~~l~i~~~~~rvgvv~fs~~~~~~~~~-----~~~~~~~~~~l~~~i~~l~ 76 (192)
T cd01473 2 DLTLILDESASIGYSNWRKDVIPFTEKIINNLNISKDKVHVGILLFAEKNRDVVPF-----SDEERYDKNELLKKINDLK 76 (192)
T ss_pred cEEEEEeCCCcccHHHHHHHHHHHHHHHHHhCccCCCccEEEEEEecCCceeEEec-----CcccccCHHHHHHHHHHHH
Confidence 899999999999888787 48999999999876 46799999999999776652 1222344667777776651
Q ss_pred ---CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch--HHH---HHHhhhcccCCCCCCCeEEEEE
Q psy10004 160 ---DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY--KEV---FEEFNWRGQNDSTLWPVRVFSY 231 (354)
Q Consensus 160 ---~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~--~~i---~~~~~~~~~~~~~~~~v~Ifti 231 (354)
..+|+|++..||..|.+.+..... .+...++++||||||..++. ..+ .+.++ ..+|.||+|
T Consensus 77 ~~~~~~g~T~~~~AL~~a~~~~~~~~~---~r~~~~kv~IllTDG~s~~~~~~~~~~~a~~lk--------~~gV~i~~v 145 (192)
T cd01473 77 NSYRSGGETYIVEALKYGLKNYTKHGN---RRKDAPKVTMLFTDGNDTSASKKELQDISLLYK--------EENVKLLVV 145 (192)
T ss_pred hccCCCCcCcHHHHHHHHHHHhccCCC---CcccCCeEEEEEecCCCCCcchhhHHHHHHHHH--------HCCCEEEEE
Confidence 257999999999999988764322 12446899999999998753 222 23332 358999999
Q ss_pred EeCCCCCCHHHHHHHHhcC-----CcEEEEeCChhHHHHHHHHHHH
Q psy10004 232 LVGKEVADYRDVKWMACAN-----KGYYVHLSTLAEVRDQILSYVP 272 (354)
Q Consensus 232 giG~~~~~~~~L~~iA~~~-----~G~~~~i~~~~~~~~~l~~~~~ 272 (354)
|+|.. +..+|+.||... ..++... +.+++......+..
T Consensus 146 GiG~~--~~~el~~ia~~~~~~~~~~~~~~~-~f~~l~~~~~~l~~ 188 (192)
T cd01473 146 GVGAA--SENKLKLLAGCDINNDNCPNVIKT-EWNNLNGISKFLTD 188 (192)
T ss_pred Eeccc--cHHHHHHhcCCCCCCCCCCeEEec-chhhHHHHHHHHHh
Confidence 99974 578899999752 2344433 35555444444333
No 24
>PF08399 VWA_N: VWA N-terminal; InterPro: IPR013608 This domain is found at the N terminus of proteins containing von Willebrand factor type A (VWA, IPR002035 from INTERPRO) and Cache (IPR004010 from INTERPRO) domains. It has been found in vertebrates, Drosophila melanogaster (Fruit fly) and Caenorhabditis elegans but has not yet been identified in other eukaryotes. It is probably involved in the function of some voltage-dependent calcium channel subunits [].
Probab=99.80 E-value=2.6e-20 Score=150.59 Aligned_cols=47 Identities=53% Similarity=1.131 Sum_probs=46.1
Q ss_pred ChhhhhhcchHHHHHHHHHhhcCCCCceeEEeeCCCCceeecCCCCc
Q psy10004 1 EEVLRVAKWSEALNDIFVSNYQADPSLSWQVFGSTTGALRVFPAHQW 47 (354)
Q Consensus 1 ~~v~~~i~ws~~ld~~f~~n~~~~p~l~wqyfgs~~G~~r~yP~~~~ 47 (354)
|+|+++|+||++||++|++|+++||+|+||||||++||+|+|||++|
T Consensus 77 ~~vl~~i~ws~~Ld~~f~~N~~~dp~l~WQYFgS~~G~~r~yPa~~W 123 (123)
T PF08399_consen 77 PDVLNAINWSEALDDVFRENYESDPTLKWQYFGSSTGFFRQYPASKW 123 (123)
T ss_pred HHHHhhccccHHHHHHHHHHhhcCCCceEEEEEcccCceeeccCCCC
Confidence 58999999999999999999999999999999999999999999998
No 25
>cd01467 vWA_BatA_type VWA BatA type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=99.80 E-value=5.7e-18 Score=147.48 Aligned_cols=156 Identities=26% Similarity=0.292 Sum_probs=115.6
Q ss_pred CceEEEEEeCCCCCCc------chHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHH
Q psy10004 82 PKDMVILLDNSGSMMG------QRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGV 155 (354)
Q Consensus 82 p~dvvillD~SgSM~g------~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i 155 (354)
..+++||||+|+||.. .+++.++.++..++.. .++++++|+.|++.....++ .+ .+...+.+.+
T Consensus 2 ~~~vv~vlD~S~SM~~~~~~~~~r~~~a~~~~~~~~~~-~~~~~v~lv~f~~~~~~~~~--------~~-~~~~~~~~~l 71 (180)
T cd01467 2 GRDIMIALDVSGSMLAQDFVKPSRLEAAKEVLSDFIDR-RENDRIGLVVFAGAAFTQAP--------LT-LDRESLKELL 71 (180)
T ss_pred CceEEEEEECCcccccccCCCCCHHHHHHHHHHHHHHh-CCCCeEEEEEEcCCeeeccC--------CC-ccHHHHHHHH
Confidence 5689999999999974 3678899988888765 47899999999998765433 11 1233444444
Q ss_pred hhhc--CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch----HHHHHHhhhcccCCCCCCCeEEE
Q psy10004 156 EMIG--DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY----KEVFEEFNWRGQNDSTLWPVRVF 229 (354)
Q Consensus 156 ~~l~--~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~----~~i~~~~~~~~~~~~~~~~v~If 229 (354)
+.+. .+.|+|++..||..|++++.... ...+.|||+|||.++.. .+..+..+ ..+|+||
T Consensus 72 ~~l~~~~~~g~T~l~~al~~a~~~l~~~~-------~~~~~iiliTDG~~~~g~~~~~~~~~~~~--------~~gi~i~ 136 (180)
T cd01467 72 EDIKIGLAGQGTAIGDAIGLAIKRLKNSE-------AKERVIVLLTDGENNAGEIDPATAAELAK--------NKGVRIY 136 (180)
T ss_pred HHhhhcccCCCCcHHHHHHHHHHHHHhcC-------CCCCEEEEEeCCCCCCCCCCHHHHHHHHH--------HCCCEEE
Confidence 4441 46889999999999999986532 23479999999987642 23332221 2479999
Q ss_pred EEEeCCC----------CCCHHHHHHHHhcCCcEEEEeCChhH
Q psy10004 230 SYLVGKE----------VADYRDVKWMACANKGYYVHLSTLAE 262 (354)
Q Consensus 230 tigiG~~----------~~~~~~L~~iA~~~~G~~~~i~~~~~ 262 (354)
+||+|.. ..+...|+.||..++|.|+.+.+..+
T Consensus 137 ~i~ig~~~~~~~~~~~~~~~~~~l~~la~~tgG~~~~~~~~~~ 179 (180)
T cd01467 137 TIGVGKSGSGPKPDGSTILDEDSLVEIADKTGGRIFRALDGFE 179 (180)
T ss_pred EEEecCCCCCcCCCCcccCCHHHHHHHHHhcCCEEEEecCccc
Confidence 9999972 13788999999999999999887654
No 26
>cd01482 vWA_collagen_alphaI-XII-like Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=99.80 E-value=3.7e-18 Score=146.82 Aligned_cols=156 Identities=15% Similarity=0.186 Sum_probs=119.7
Q ss_pred ceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcC---CCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhc
Q psy10004 83 KDMVILLDNSGSMMGQRREIARHVINNLLDTLG---NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIG 159 (354)
Q Consensus 83 ~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~---~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~ 159 (354)
.|++||||.||||.+..+..+|+++..++..|. ++++|+|+.|+++++..++ +. ...+.+.+.+.+..+.
T Consensus 1 ~Dv~~vlD~S~Sm~~~~~~~~k~~~~~l~~~~~~~~~~~rvgli~fs~~~~~~~~-l~------~~~~~~~l~~~l~~~~ 73 (164)
T cd01482 1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEFD-LN------AYTSKEDVLAAIKNLP 73 (164)
T ss_pred CCEEEEEeCCCCcChhhHHHHHHHHHHHHhheeeCCCceEEEEEEECCCeeEEEe-cC------CCCCHHHHHHHHHhCc
Confidence 389999999999998899999999999998774 6789999999999877554 11 2245677888888884
Q ss_pred CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch-HHHHHHhhhcccCCCCCCCeEEEEEEeCCCCC
Q psy10004 160 DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY-KEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVA 238 (354)
Q Consensus 160 ~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~-~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~ 238 (354)
.++|+|++..||..|.+.+...... .....+++|||+|||.+++. .++.+.++ ..+|.||+||+|..
T Consensus 74 ~~~g~T~~~~aL~~a~~~~~~~~~~--~r~~~~k~iillTDG~~~~~~~~~a~~lk--------~~gi~i~~ig~g~~-- 141 (164)
T cd01482 74 YKGGNTRTGKALTHVREKNFTPDAG--ARPGVPKVVILITDGKSQDDVELPARVLR--------NLGVNVFAVGVKDA-- 141 (164)
T ss_pred CCCCCChHHHHHHHHHHHhcccccC--CCCCCCEEEEEEcCCCCCchHHHHHHHHH--------HCCCEEEEEecCcC--
Confidence 4789999999999999766432111 23456789999999998753 44444442 24799999999973
Q ss_pred CHHHHHHHHhcCCc-EEEEe
Q psy10004 239 DYRDVKWMACANKG-YYVHL 257 (354)
Q Consensus 239 ~~~~L~~iA~~~~G-~~~~i 257 (354)
+...|+.||..+.+ +.+.+
T Consensus 142 ~~~~L~~ia~~~~~~~~~~~ 161 (164)
T cd01482 142 DESELKMIASKPSETHVFNV 161 (164)
T ss_pred CHHHHHHHhCCCchheEEEc
Confidence 68899999998765 33433
No 27
>cd01477 vWA_F09G8-8_type VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of mo
Probab=99.79 E-value=3.6e-18 Score=150.54 Aligned_cols=152 Identities=13% Similarity=0.181 Sum_probs=112.1
Q ss_pred CCCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCC---------CCeEEEEEeecccccccccccccccccChhhHHH
Q psy10004 80 ASPKDMVILLDNSGSMMGQRREIARHVINNLLDTLGN---------NDYVNVLQFTSVCKEVVPCFADILVQANLANVRE 150 (354)
Q Consensus 80 ~~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~---------~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~ 150 (354)
..+.|+|||||.|+||...+++.+|+++..++..+.. ++||+||.|++.++..++. . +..+...
T Consensus 17 ~~~~DivfvlD~S~Sm~~~~f~~~k~fi~~~~~~~~~~~~~~~~~~~~rVGlV~fs~~a~~~~~L-----~--d~~~~~~ 89 (193)
T cd01477 17 NLWLDIVFVVDNSKGMTQGGLWQVRATISSLFGSSSQIGTDYDDPRSTRVGLVTYNSNATVVADL-----N--DLQSFDD 89 (193)
T ss_pred cceeeEEEEEeCCCCcchhhHHHHHHHHHHHHhhccccccccCCCCCcEEEEEEccCceEEEEec-----c--cccCHHH
Confidence 3689999999999999988999999999998877543 4799999999999886652 1 2234455
Q ss_pred HHHHHhh----hcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCC----chHHHHHHhhhcccCCCC
Q psy10004 151 LKMGVEM----IGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASE----NYKEVFEEFNWRGQNDST 222 (354)
Q Consensus 151 ~~~~i~~----l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~----~~~~i~~~~~~~~~~~~~ 222 (354)
+.++|+. + ..+|+|++..||..|.++|...... .+++..++|||||||..+ +...+.++++
T Consensus 90 ~~~ai~~~~~~~-~~~ggT~ig~aL~~A~~~l~~~~~~--~R~~v~kvvIllTDg~~~~~~~~~~~~a~~l~-------- 158 (193)
T cd01477 90 LYSQIQGSLTDV-SSTNASYLDTGLQAAEQMLAAGKRT--SRENYKKVVIVFASDYNDEGSNDPRPIAARLK-------- 158 (193)
T ss_pred HHHHHHHHhhcc-ccCCcchHHHHHHHHHHHHHhhhcc--ccCCCCeEEEEEecCccCCCCCCHHHHHHHHH--------
Confidence 5555653 3 4668999999999999999754111 234678999999998543 3344444442
Q ss_pred CCCeEEEEEEeCCCCCCHHHHHHHHhcC
Q psy10004 223 LWPVRVFSYLVGKEVADYRDVKWMACAN 250 (354)
Q Consensus 223 ~~~v~IftigiG~~~~~~~~L~~iA~~~ 250 (354)
+.+|.||+||+|.+. |...|++|+...
T Consensus 159 ~~GI~i~tVGiG~~~-d~~~~~~L~~ia 185 (193)
T cd01477 159 STGIAIITVAFTQDE-SSNLLDKLGKIA 185 (193)
T ss_pred HCCCEEEEEEeCCCC-CHHHHHHHHHhc
Confidence 358999999999865 656666665544
No 28
>cd01450 vWFA_subfamily_ECM Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A
Probab=99.77 E-value=1.6e-17 Score=141.03 Aligned_cols=149 Identities=23% Similarity=0.275 Sum_probs=117.2
Q ss_pred eEEEEEeCCCCCCcchHHHHHHHHHHHHHHcC---CCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhcC
Q psy10004 84 DMVILLDNSGSMMGQRREIARHVINNLLDTLG---NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIGD 160 (354)
Q Consensus 84 dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~---~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~~ 160 (354)
||+||||+||||.+.+++.++.++..++..+. ++++++|+.|++.+...++.... .+...+.+.++.+..
T Consensus 2 di~~llD~S~Sm~~~~~~~~~~~~~~~~~~~~~~~~~~~~~li~f~~~~~~~~~~~~~-------~~~~~~~~~i~~~~~ 74 (161)
T cd01450 2 DIVFLLDGSESVGPENFEKVKDFIEKLVEKLDIGPDKTRVGLVQYSDDVRVEFSLNDY-------KSKDDLLKAVKNLKY 74 (161)
T ss_pred cEEEEEeCCCCcCHHHHHHHHHHHHHHHHheeeCCCceEEEEEEEcCCceEEEECCCC-------CCHHHHHHHHHhccc
Confidence 89999999999998899999999999999886 48899999999988766553221 146777888887722
Q ss_pred CCC-CCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCc---hHHHHHHhhhcccCCCCCCCeEEEEEEeCCC
Q psy10004 161 ANN-IANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASEN---YKEVFEEFNWRGQNDSTLWPVRVFSYLVGKE 236 (354)
Q Consensus 161 ~~G-~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~---~~~i~~~~~~~~~~~~~~~~v~IftigiG~~ 236 (354)
..| +|++..||..|++.+...... .....++|||+|||.++. ..++.+.++ +.+++|++||+|.
T Consensus 75 ~~~~~t~~~~al~~a~~~~~~~~~~---~~~~~~~iiliTDG~~~~~~~~~~~~~~~~--------~~~v~v~~i~~g~- 142 (161)
T cd01450 75 LGGGGTNTGKALQYALEQLFSESNA---RENVPKVIIVLTDGRSDDGGDPKEAAAKLK--------DEGIKVFVVGVGP- 142 (161)
T ss_pred CCCCCccHHHHHHHHHHHhcccccc---cCCCCeEEEEECCCCCCCCcchHHHHHHHH--------HCCCEEEEEeccc-
Confidence 333 899999999999999754321 245678999999999886 344555543 2479999999998
Q ss_pred CCCHHHHHHHHhcCCc
Q psy10004 237 VADYRDVKWMACANKG 252 (354)
Q Consensus 237 ~~~~~~L~~iA~~~~G 252 (354)
. +...|+.||..+++
T Consensus 143 ~-~~~~l~~la~~~~~ 157 (161)
T cd01450 143 A-DEEELREIASCPSE 157 (161)
T ss_pred c-CHHHHHHHhCCCCC
Confidence 4 89999999988743
No 29
>PTZ00441 sporozoite surface protein 2 (SSP2); Provisional
Probab=99.76 E-value=5e-17 Score=160.64 Aligned_cols=185 Identities=16% Similarity=0.199 Sum_probs=135.2
Q ss_pred CCceEEEEEeCCCCCCcchH-HHHHHHHHHHHHHcC-CCC--eEEEEEeecccccccccccccccccChhhHHHHHHHHh
Q psy10004 81 SPKDMVILLDNSGSMMGQRR-EIARHVINNLLDTLG-NND--YVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVE 156 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g~~~-~~ak~a~~~ll~~L~-~~d--~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~ 156 (354)
...|++||||.|+||.-..| +.++.++..|+..|. ..| +++|+.|++.++.+++. ......+++.+...|.
T Consensus 41 ~~lDIvFLLD~SgSMg~~Nfle~AK~Fa~~LV~~l~Is~D~V~VgiV~FSd~~r~vfpL-----~s~~s~Dk~~aL~~I~ 115 (576)
T PTZ00441 41 EEVDLYLLVDGSGSIGYHNWITHVIPMLMGLIQQLNLSDDAINLYMSLFSNNTTELIRL-----GSGASKDKEQALIIVK 115 (576)
T ss_pred CCceEEEEEeCCCccCCccHHHHHHHHHHHHHHHhccCCCceEEEEEEeCCCceEEEec-----CCCccccHHHHHHHHH
Confidence 57899999999999985555 789999999999885 334 56669999999877652 1112223444444444
Q ss_pred hh---cCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEEEEEEe
Q psy10004 157 MI---GDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRVFSYLV 233 (354)
Q Consensus 157 ~l---~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~Iftigi 233 (354)
.+ ..++|+|++..||..|.+.+.... .+....++|||||||.+++..++++.++ .+ ...++.||+|||
T Consensus 116 sL~~~~~pgGgTnig~AL~~Aae~L~sr~----~R~nvpKVVILLTDG~sns~~dvleaAq--~L---R~~GVeI~vIGV 186 (576)
T PTZ00441 116 SLRKTYLPYGKTNMTDALLEVRKHLNDRV----NRENAIQLVILMTDGIPNSKYRALEESR--KL---KDRNVKLAVIGI 186 (576)
T ss_pred HHHhhccCCCCccHHHHHHHHHHHHhhcc----cccCCceEEEEEecCCCCCcccHHHHHH--HH---HHCCCEEEEEEe
Confidence 33 157899999999999998886432 1245678999999999865433332221 11 135899999999
Q ss_pred CCCCCCHHHHHHHH-h---cCCcEEEEeCChhHHHHHHHHHHHHhccchhc
Q psy10004 234 GKEVADYRDVKWMA-C---ANKGYYVHLSTLAEVRDQILSYVPVMARPLVL 280 (354)
Q Consensus 234 G~~~~~~~~L~~iA-~---~~~G~~~~i~~~~~~~~~l~~~~~~l~~pl~~ 280 (354)
|... +..+|+.|| | .++|.|+...+..++...+..++..++..+-.
T Consensus 187 G~g~-n~e~LrlIAgC~p~~g~c~~Y~vadf~eL~~ivk~LikkVC~eve~ 236 (576)
T PTZ00441 187 GQGI-NHQFNRLLAGCRPREGKCKFYSDADWEEAKNLIKPFIAKVCTEVER 236 (576)
T ss_pred CCCc-CHHHHHHHhccCCCCCCCceEEeCCHHHHHHHHHHHHHHhcccccc
Confidence 9876 788899998 3 35567888889999999999999999876543
No 30
>cd01476 VWA_integrin_invertebrates VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.
Probab=99.75 E-value=5.9e-17 Score=138.89 Aligned_cols=154 Identities=19% Similarity=0.280 Sum_probs=112.8
Q ss_pred eEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCC---CCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhcC
Q psy10004 84 DMVILLDNSGSMMGQRREIARHVINNLLDTLGN---NDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIGD 160 (354)
Q Consensus 84 dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~---~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~~ 160 (354)
|++||||+|+||.+ .++.++.++..++..|.. .++++|+.|++.....+. + .+. ...+...+...|+.+..
T Consensus 2 dv~~llD~S~Sm~~-~~~~~~~~~~~~~~~l~~~~~~~~v~lv~f~~~~~~~~~-~--~l~--~~~~~~~l~~~i~~l~~ 75 (163)
T cd01476 2 DLLFVLDSSGSVRG-KFEKYKKYIERIVEGLEIGPTATRVALITYSGRGRQRVR-F--NLP--KHNDGEELLEKVDNLRF 75 (163)
T ss_pred CEEEEEeCCcchhh-hHHHHHHHHHHHHHhcCCCCCCcEEEEEEEcCCCceEEE-e--cCC--CCCCHHHHHHHHHhCcc
Confidence 79999999999986 688889999999988864 899999999995333221 1 111 12356778889988833
Q ss_pred CCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch-HHHHHHhhhcccCCCCCCCeEEEEEEeCCC--C
Q psy10004 161 ANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY-KEVFEEFNWRGQNDSTLWPVRVFSYLVGKE--V 237 (354)
Q Consensus 161 ~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~-~~i~~~~~~~~~~~~~~~~v~IftigiG~~--~ 237 (354)
.+|+|++..||..|.+++.+... .+++..++|||+|||.++.. ....+.+. . ..++.||+||+|.. .
T Consensus 76 ~gg~T~l~~aL~~a~~~l~~~~~---~r~~~~~~villTDG~~~~~~~~~~~~l~--~-----~~~v~v~~vg~g~~~~~ 145 (163)
T cd01476 76 IGGTTATGAAIEVALQQLDPSEG---RREGIPKVVVVLTDGRSHDDPEKQARILR--A-----VPNIETFAVGTGDPGTV 145 (163)
T ss_pred CCCCccHHHHHHHHHHHhccccC---CCCCCCeEEEEECCCCCCCchHHHHHHHh--h-----cCCCEEEEEECCCcccc
Confidence 47889999999999999863221 12344589999999988743 33334332 2 15799999999987 5
Q ss_pred CCHHHHHHHHhcCCcEE
Q psy10004 238 ADYRDVKWMACANKGYY 254 (354)
Q Consensus 238 ~~~~~L~~iA~~~~G~~ 254 (354)
+...|+.||......|
T Consensus 146 -~~~~L~~ia~~~~~~~ 161 (163)
T cd01476 146 -DTEELHSITGNEDHIF 161 (163)
T ss_pred -CHHHHHHHhCCCcccc
Confidence 7889999987766443
No 31
>cd01462 VWA_YIEM_type VWA YIEM type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=99.75 E-value=5.6e-17 Score=137.50 Aligned_cols=143 Identities=20% Similarity=0.256 Sum_probs=105.6
Q ss_pred ceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcC-CCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhcCC
Q psy10004 83 KDMVILLDNSGSMMGQRREIARHVINNLLDTLG-NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIGDA 161 (354)
Q Consensus 83 ~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~-~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~~~ 161 (354)
++++|+||+||||.+.++..++.++..++..+. .+++++|+.|++.... . ......+...+.+.+..+ .+
T Consensus 1 ~~v~illD~SgSM~~~k~~~a~~~~~~l~~~~~~~~~~v~li~F~~~~~~-~-------~~~~~~~~~~~~~~l~~~-~~ 71 (152)
T cd01462 1 GPVILLVDQSGSMYGAPEEVAKAVALALLRIALAENRDTYLILFDSEFQT-K-------IVDKTDDLEEPVEFLSGV-QL 71 (152)
T ss_pred CCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHcCCcEEEEEeCCCceE-E-------ecCCcccHHHHHHHHhcC-CC
Confidence 579999999999999999999999999988776 5899999999998322 1 112345667777788778 89
Q ss_pred CCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCC-CCchHHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCH
Q psy10004 162 NNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGA-SENYKEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADY 240 (354)
Q Consensus 162 ~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~-~~~~~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~ 240 (354)
+|+|++..||..+++.+.... ..+..||++|||. ......+.+..+... ..+++||+||+|... +.
T Consensus 72 ~ggT~l~~al~~a~~~l~~~~-------~~~~~ivliTDG~~~~~~~~~~~~~~~~~-----~~~~~v~~~~~g~~~-~~ 138 (152)
T cd01462 72 GGGTDINKALRYALELIERRD-------PRKADIVLITDGYEGGVSDELLREVELKR-----SRVARFVALALGDHG-NP 138 (152)
T ss_pred CCCcCHHHHHHHHHHHHHhcC-------CCCceEEEECCCCCCCCCHHHHHHHHHHH-----hcCcEEEEEEecCCC-Cc
Confidence 999999999999999987531 1246899999995 444555532222111 247999999999865 55
Q ss_pred HHHHHHHh
Q psy10004 241 RDVKWMAC 248 (354)
Q Consensus 241 ~~L~~iA~ 248 (354)
. |..+|.
T Consensus 139 ~-~~~~~~ 145 (152)
T cd01462 139 G-YDRISA 145 (152)
T ss_pred h-HHHHhh
Confidence 5 444443
No 32
>PF00092 VWA: von Willebrand factor type A domain; InterPro: IPR002035 The von Willebrand factor is a large multimeric glycoprotein found in blood plasma. Mutant forms are involved in the aetiology of bleeding disorders []. In von Willebrand factor, the type A domain (vWF) is the prototype for a protein superfamily. The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. Fold recognition algorithms were used to score sequence compatibility with a library of known structures: the vWF domain fold was predicted to be a doubly-wound, open, twisted beta-sheet flanked by alpha-helices []. 3D structures have been determined for the I-domains of integrins CD11b (with bound magnesium) [] and CD11a (with bound manganese) []. The domain adopts a classic alpha/beta Rossmann fold and contains an unusual metal ion coordination site at its surface. It has been suggested that this site represents a general metal ion-dependent adhesion site (MIDAS) for binding protein ligands []. The residues constituting the MIDAS motif in the CD11b and CD11a I-domains are completely conserved, but the manner in which the metal ion is coordinated differs slightly [].; GO: 0005515 protein binding; PDB: 2XGG_B 3ZQK_B 3GXB_A 3PPV_A 3PPX_A 3PPW_A 3PPY_A 1CQP_B 3TCX_B 2ICA_A ....
Probab=99.74 E-value=3.4e-17 Score=141.56 Aligned_cols=169 Identities=22% Similarity=0.350 Sum_probs=124.9
Q ss_pred eEEEEEeCCCCCCcchHHHHHHHHHHHHHHc---CCCCeEEEEEeecccccccccccccccccChhhHHHHHHHH-hhhc
Q psy10004 84 DMVILLDNSGSMMGQRREIARHVINNLLDTL---GNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGV-EMIG 159 (354)
Q Consensus 84 dvvillD~SgSM~g~~~~~ak~a~~~ll~~L---~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i-~~l~ 159 (354)
||+||||.|+||.+..++.+++++..+++.+ ..+.+|+|+.|++.++..++ + ....+...+...+ ..+.
T Consensus 1 DivflvD~S~sm~~~~~~~~~~~v~~~i~~~~~~~~~~rv~iv~f~~~~~~~~~-~------~~~~~~~~~~~~i~~~~~ 73 (178)
T PF00092_consen 1 DIVFLVDTSGSMSGDNFEKAKQFVKSIISRLSISNNGTRVGIVTFSDSARVLFS-L------TDYQSKNDLLNAINDSIP 73 (178)
T ss_dssp EEEEEEE-STTSCHHHHHHHHHHHHHHHHHSTBSTTSEEEEEEEESSSEEEEEE-T------TSHSSHHHHHHHHHTTGG
T ss_pred CEEEEEeCCCCCchHHHHHHHHHHHHHHHhhhccccccccceeeeecccccccc-c------cccccccccccccccccc
Confidence 7999999999999999999999999999976 37789999999999986544 1 1234456666666 5555
Q ss_pred CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHh-hhcccCCCCCCCeEEEEEEeCCCCC
Q psy10004 160 DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEF-NWRGQNDSTLWPVRVFSYLVGKEVA 238 (354)
Q Consensus 160 ~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~-~~~~~~~~~~~~v~IftigiG~~~~ 238 (354)
..+|+|++..||..|.+.+...... .++..+++||++|||.+++........ +.+. ..+|.+++||++ ..
T Consensus 74 ~~~g~t~~~~aL~~a~~~l~~~~~~--~r~~~~~~iiliTDG~~~~~~~~~~~~~~~~~-----~~~i~~~~ig~~--~~ 144 (178)
T PF00092_consen 74 SSGGGTNLGAALKFAREQLFSSNNG--GRPNSPKVIILITDGNSNDSDSPSEEAANLKK-----SNGIKVIAIGID--NA 144 (178)
T ss_dssp CCBSSB-HHHHHHHHHHHTTSGGGT--TGTTSEEEEEEEESSSSSSHSGHHHHHHHHHH-----HCTEEEEEEEES--CC
T ss_pred ccchhhhHHHHHhhhhhcccccccc--cccccccceEEEEeecccCCcchHHHHHHHHH-----hcCcEEEEEecC--cC
Confidence 7889999999999999998765221 234678999999999998662222111 1111 137888888882 23
Q ss_pred CHHHHHHHHhcC--CcEEEEeCChhHHHHHHH
Q psy10004 239 DYRDVKWMACAN--KGYYVHLSTLAEVRDQIL 268 (354)
Q Consensus 239 ~~~~L~~iA~~~--~G~~~~i~~~~~~~~~l~ 268 (354)
+...|+.||+.+ .+.+..+.+..++.+...
T Consensus 145 ~~~~l~~la~~~~~~~~~~~~~~~~~l~~~~~ 176 (178)
T PF00092_consen 145 DNEELRELASCPTSEGHVFYLADFSDLSQIIQ 176 (178)
T ss_dssp HHHHHHHHSHSSTCHHHEEEESSHHHHHHHHH
T ss_pred CHHHHHHHhCCCCCCCcEEEcCCHHHHHHHHh
Confidence 799999999664 468888999888775544
No 33
>cd01481 vWA_collagen_alpha3-VI-like VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=99.74 E-value=1.2e-16 Score=137.67 Aligned_cols=152 Identities=17% Similarity=0.182 Sum_probs=115.3
Q ss_pred ceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcC---CCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhc
Q psy10004 83 KDMVILLDNSGSMMGQRREIARHVINNLLDTLG---NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIG 159 (354)
Q Consensus 83 ~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~---~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~ 159 (354)
+||+||||.|+||....++.+|+.+..+++.|. ...+|+||.|++.++..++ |. ...++..+..+|+++.
T Consensus 1 ~DivfllD~S~Si~~~~f~~~k~fi~~lv~~f~i~~~~~rVgvv~ys~~~~~~~~-l~------~~~~~~~l~~~i~~i~ 73 (165)
T cd01481 1 KDIVFLIDGSDNVGSGNFPAIRDFIERIVQSLDVGPDKIRVAVVQFSDTPRPEFY-LN------THSTKADVLGAVRRLR 73 (165)
T ss_pred CCEEEEEeCCCCcCHHHHHHHHHHHHHHHhhccCCCCCcEEEEEEecCCeeEEEe-cc------ccCCHHHHHHHHHhcc
Confidence 489999999999998999999999999999986 4569999999999876543 21 2335678889999883
Q ss_pred CCCC-CCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch-HHHHHHhhhcccCCCCCCCeEEEEEEeCCCC
Q psy10004 160 DANN-IANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY-KEVFEEFNWRGQNDSTLWPVRVFSYLVGKEV 237 (354)
Q Consensus 160 ~~~G-~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~-~~i~~~~~~~~~~~~~~~~v~IftigiG~~~ 237 (354)
..+| +|+++.||..+.+.+.......+..++.++++|++|||...+. ....+.++ ..+|.||++|+|.
T Consensus 74 ~~~g~~t~t~~AL~~~~~~~f~~~~g~R~~~~~~kv~vviTdG~s~d~~~~~a~~lr--------~~gv~i~~vG~~~-- 143 (165)
T cd01481 74 LRGGSQLNTGSALDYVVKNLFTKSAGSRIEEGVPQFLVLITGGKSQDDVERPAVALK--------RAGIVPFAIGARN-- 143 (165)
T ss_pred cCCCCcccHHHHHHHHHHhhcCccccCCccCCCCeEEEEEeCCCCcchHHHHHHHHH--------HCCcEEEEEeCCc--
Confidence 3344 4799999999987664332221123466789999999998753 33333332 2479999999984
Q ss_pred CCHHHHHHHHhcCC
Q psy10004 238 ADYRDVKWMACANK 251 (354)
Q Consensus 238 ~~~~~L~~iA~~~~ 251 (354)
.+.++|+.||+...
T Consensus 144 ~~~~eL~~ias~p~ 157 (165)
T cd01481 144 ADLAELQQIAFDPS 157 (165)
T ss_pred CCHHHHHHHhCCCc
Confidence 38999999999874
No 34
>COG1240 ChlD Mg-chelatase subunit ChlD [Coenzyme metabolism]
Probab=99.72 E-value=1.6e-16 Score=141.66 Aligned_cols=165 Identities=22% Similarity=0.313 Sum_probs=127.1
Q ss_pred CCceEEEEEeCCCCCCcc-hHHHHHHHHHHHHH-HcCCCCeEEEEEeecc-cccccccccccccccChhhHHHHHHHHhh
Q psy10004 81 SPKDMVILLDNSGSMMGQ-RREIARHVINNLLD-TLGNNDYVNVLQFTSV-CKEVVPCFADILVQANLANVRELKMGVEM 157 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g~-~~~~ak~a~~~ll~-~L~~~d~v~vi~Fs~~-~~~~~~c~~~~lv~~~~~n~~~~~~~i~~ 157 (354)
....||||||+||||.+. ++..+|-++..||. .-...|+|+||.|..+ ++.++| .+.+.+.+.++|..
T Consensus 77 ~g~lvvfvVDASgSM~~~~Rm~aaKG~~~~lL~dAYq~RdkvavI~F~G~~A~lll~---------pT~sv~~~~~~L~~ 147 (261)
T COG1240 77 AGNLIVFVVDASGSMAARRRMAAAKGAALSLLRDAYQRRDKVAVIAFRGEKAELLLP---------PTSSVELAERALER 147 (261)
T ss_pred cCCcEEEEEeCcccchhHHHHHHHHHHHHHHHHHHHHccceEEEEEecCCcceEEeC---------CcccHHHHHHHHHh
Confidence 456799999999999987 89999999998874 4678999999999954 444332 34678999999999
Q ss_pred hcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch------HHHHHHhhhcccCCCCCCCeEEEEE
Q psy10004 158 IGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY------KEVFEEFNWRGQNDSTLWPVRVFSY 231 (354)
Q Consensus 158 l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~------~~i~~~~~~~~~~~~~~~~v~Ifti 231 (354)
+ .++|.|.+..||..|++++.+.... ++...+++|+||||.++.+ .+...... +. ...++.+.+|
T Consensus 148 l-~~GG~TPL~~aL~~a~ev~~r~~r~---~p~~~~~~vviTDGr~n~~~~~~~~~e~~~~a~-~~----~~~g~~~lvi 218 (261)
T COG1240 148 L-PTGGKTPLADALRQAYEVLAREKRR---GPDRRPVMVVITDGRANVPIPLGPKAETLEAAS-KL----RLRGIQLLVI 218 (261)
T ss_pred C-CCCCCCchHHHHHHHHHHHHHhhcc---CCCcceEEEEEeCCccCCCCCCchHHHHHHHHH-HH----hhcCCcEEEE
Confidence 9 9999999999999999999876532 3456789999999998732 22222211 11 2356777777
Q ss_pred EeCCCCCCHHHHHHHHhcCCcEEEEeCChhHH
Q psy10004 232 LVGKEVADYRDVKWMACANKGYYVHLSTLAEV 263 (354)
Q Consensus 232 giG~~~~~~~~L~~iA~~~~G~~~~i~~~~~~ 263 (354)
......-.....+.||...+|.|+++.+..+.
T Consensus 219 d~e~~~~~~g~~~~iA~~~Gg~~~~L~~l~~~ 250 (261)
T COG1240 219 DTEGSEVRLGLAEEIARASGGEYYHLDDLSDD 250 (261)
T ss_pred ecCCccccccHHHHHHHHhCCeEEecccccch
Confidence 77544324667899999999999999998664
No 35
>smart00327 VWA von Willebrand factor (vWF) type A domain. VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.
Probab=99.71 E-value=9.3e-16 Score=132.01 Aligned_cols=156 Identities=28% Similarity=0.383 Sum_probs=124.0
Q ss_pred CceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCC---CCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhh
Q psy10004 82 PKDMVILLDNSGSMMGQRREIARHVINNLLDTLGN---NDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMI 158 (354)
Q Consensus 82 p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~---~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l 158 (354)
|.+++||||.|+||.+.+++.++.++..++..+.. +++++|+.|++......+ + ....+...+...+..+
T Consensus 1 ~~~v~l~vD~S~SM~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ii~f~~~~~~~~~-~------~~~~~~~~~~~~i~~~ 73 (177)
T smart00327 1 PLDVVFLLDGSGSMGPNRFEKAKEFVLKLVEQLDIGPDGDRVGLVTFSDDATVLFP-L------NDSRSKDALLEALASL 73 (177)
T ss_pred CccEEEEEeCCCccchHHHHHHHHHHHHHHHhcCCCCCCcEEEEEEeCCCceEEEc-c------cccCCHHHHHHHHHhc
Confidence 67999999999999989999999999999999886 899999999998765433 1 1245667888889888
Q ss_pred cC--CCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCc---hHHHHHHhhhcccCCCCCCCeEEEEEEe
Q psy10004 159 GD--ANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASEN---YKEVFEEFNWRGQNDSTLWPVRVFSYLV 233 (354)
Q Consensus 159 ~~--~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~---~~~i~~~~~~~~~~~~~~~~v~Iftigi 233 (354)
. ..|+|++..||..|.+.+....... ..+..+.||++|||.+.+ ..+.++.++ ..++.+++|++
T Consensus 74 -~~~~~~~~~~~~al~~~~~~~~~~~~~~--~~~~~~~iviitDg~~~~~~~~~~~~~~~~--------~~~i~i~~i~~ 142 (177)
T smart00327 74 -SYKLGGGTNLGAALQYALENLFSKSAGS--RRGAPKVLILITDGESNDGGDLLKAAKELK--------RSGVKVFVVGV 142 (177)
T ss_pred -CCCCCCCchHHHHHHHHHHHhcCcCCCC--CCCCCeEEEEEcCCCCCCCccHHHHHHHHH--------HCCCEEEEEEc
Confidence 6 5889999999999999886322211 123367999999999985 344444443 24699999999
Q ss_pred CCCCCCHHHHHHHHhcCCcEEEE
Q psy10004 234 GKEVADYRDVKWMACANKGYYVH 256 (354)
Q Consensus 234 G~~~~~~~~L~~iA~~~~G~~~~ 256 (354)
|... +...|+.|++.++|.|.+
T Consensus 143 ~~~~-~~~~l~~~~~~~~~~~~~ 164 (177)
T smart00327 143 GNDV-DEEELKKLASAPGGVYVF 164 (177)
T ss_pred cCcc-CHHHHHHHhCCCcceEEe
Confidence 9765 799999999999998876
No 36
>PRK13406 bchD magnesium chelatase subunit D; Provisional
Probab=99.71 E-value=2.8e-16 Score=159.05 Aligned_cols=159 Identities=15% Similarity=0.176 Sum_probs=124.7
Q ss_pred CCceEEEEEeCCCCCCcchHHHHHHHHHHHHHH-cCCCCeEEEEEeecc-cccccccccccccccChhhHHHHHHHHhhh
Q psy10004 81 SPKDMVILLDNSGSMMGQRREIARHVINNLLDT-LGNNDYVNVLQFTSV-CKEVVPCFADILVQANLANVRELKMGVEMI 158 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~-L~~~d~v~vi~Fs~~-~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l 158 (354)
.+..++||||+||||.+.+|..+|.++..++.. +.+.|+|+||.|++. +..++| .+ .++..+...|+.+
T Consensus 400 ~~~~vvfvvD~SGSM~~~rl~~aK~a~~~ll~~ay~~rD~v~lI~F~g~~a~~~lp--------pT-~~~~~~~~~L~~l 470 (584)
T PRK13406 400 SETTTIFVVDASGSAALHRLAEAKGAVELLLAEAYVRRDQVALVAFRGRGAELLLP--------PT-RSLVRAKRSLAGL 470 (584)
T ss_pred CCccEEEEEECCCCCcHhHHHHHHHHHHHHHHhhcCCCCEEEEEEECCCceeEEcC--------CC-cCHHHHHHHHhcC
Confidence 578999999999999999999999999999855 689999999999765 555443 23 3677888999999
Q ss_pred cCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch-----------HHHHHHhhhcccCCCCCCCeE
Q psy10004 159 GDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY-----------KEVFEEFNWRGQNDSTLWPVR 227 (354)
Q Consensus 159 ~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~-----------~~i~~~~~~~~~~~~~~~~v~ 227 (354)
.++|+|++..||..|++++..... .+..+.|||||||..+.. .+...... .. ...++.
T Consensus 471 -~~gGgTpL~~gL~~A~~~l~~~~~-----~~~~~~iVLlTDG~~n~~~~~~~~~~~~~~~~~~~a~--~~---~~~gi~ 539 (584)
T PRK13406 471 -PGGGGTPLAAGLDAAAALALQVRR-----KGMTPTVVLLTDGRANIARDGTAGRAQAEEDALAAAR--AL---RAAGLP 539 (584)
T ss_pred -CCCCCChHHHHHHHHHHHHHHhcc-----CCCceEEEEEeCCCCCCCccccccccchhhHHHHHHH--HH---HhcCCe
Confidence 999999999999999999876432 224589999999998742 12221111 11 235789
Q ss_pred EEEEEeCCCCCCHHHHHHHHhcCCcEEEEeCChh
Q psy10004 228 VFSYLVGKEVADYRDVKWMACANKGYYVHLSTLA 261 (354)
Q Consensus 228 IftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~~ 261 (354)
+++|++|... ...+++||...+|.|+.++..+
T Consensus 540 ~~vId~g~~~--~~~~~~LA~~~gg~y~~l~~~~ 571 (584)
T PRK13406 540 ALVIDTSPRP--QPQARALAEAMGARYLPLPRAD 571 (584)
T ss_pred EEEEecCCCC--cHHHHHHHHhcCCeEEECCCCC
Confidence 9999998643 4579999999999999998753
No 37
>cd01454 vWA_norD_type norD type: Denitrifying bacteria contain both membrane bound and periplasmic nitrate reductases. Denitrification plays a major role in completing the nitrogen cycle by converting nitrate or nitrite to nitrogen gas. The pathway for microbial denitrification has been established as NO3- ------ NO2- ------ NO ------- N2O --------- N2. This reaction generally occurs under oxygen limiting conditions. Genetic and biochemical studies have shown that the first srep of the biochemical pathway is catalyzed by periplasmic nitrate reductases. This family is widely present in proteobacteria and firmicutes. This version of the domain is also present in some archaeal members. The function of the vWA domain in this sub-group is not known. Members of this subgroup have a conserved MIDAS motif.
Probab=99.70 E-value=4.9e-16 Score=134.83 Aligned_cols=150 Identities=17% Similarity=0.231 Sum_probs=104.7
Q ss_pred eEEEEEeCCCCCCc-chHHHHHHHHHHHHHHcCC-CCeEEEEEeeccc--ccccccccccccccChhhHHHHHHHHhhhc
Q psy10004 84 DMVILLDNSGSMMG-QRREIARHVINNLLDTLGN-NDYVNVLQFTSVC--KEVVPCFADILVQANLANVRELKMGVEMIG 159 (354)
Q Consensus 84 dvvillD~SgSM~g-~~~~~ak~a~~~ll~~L~~-~d~v~vi~Fs~~~--~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~ 159 (354)
.|+||||+||||.+ .+++.+|+++..++..+.. +|+++|+.|++.+ ....... .....+......+...+..+
T Consensus 2 ~v~~llD~SgSM~~~~kl~~ak~a~~~l~~~l~~~~d~~~l~~F~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~- 78 (174)
T cd01454 2 AVTLLLDLSGSMRSDRRIDVAKKAAVLLAEALEACGVPHAILGFTTDAGGRERVRWI--KIKDFDESLHERARKRLAAL- 78 (174)
T ss_pred EEEEEEECCCCCCCCcHHHHHHHHHHHHHHHHHHcCCcEEEEEecCCCCCccceEEE--EecCcccccchhHHHHHHcc-
Confidence 47899999999998 5999999999999998885 9999999999984 2111100 00011111123567788888
Q ss_pred CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch----------HHHHHHhhhcccCCCCCCCeEEE
Q psy10004 160 DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY----------KEVFEEFNWRGQNDSTLWPVRVF 229 (354)
Q Consensus 160 ~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~----------~~i~~~~~~~~~~~~~~~~v~If 229 (354)
.++|+|++..||..|.+.+.... ...++||++|||.++.. .+..+... .. ...+|.+|
T Consensus 79 ~~~g~T~~~~al~~a~~~l~~~~-------~~~~~iiliTDG~~~~~~~~~~~~~~~~~~~~~~~--~~---~~~gi~v~ 146 (174)
T cd01454 79 SPGGNTRDGAAIRHAAERLLARP-------EKRKILLVISDGEPNDLDYYEGNVFATEDALRAVI--EA---RKLGIEVF 146 (174)
T ss_pred CCCCCCcHHHHHHHHHHHHhcCC-------CcCcEEEEEeCCCcCcccccCcchhHHHHHHHHHH--HH---HhCCcEEE
Confidence 89999999999999999997532 23589999999998743 12211111 11 13589999
Q ss_pred EEEeCCCCC--CHHHHHHHHh
Q psy10004 230 SYLVGKEVA--DYRDVKWMAC 248 (354)
Q Consensus 230 tigiG~~~~--~~~~L~~iA~ 248 (354)
+||+|.+.. +.+.++.|-.
T Consensus 147 ~igig~~~~~~~~~~~~~~~~ 167 (174)
T cd01454 147 GITIDRDATTVDKEYLKNIFG 167 (174)
T ss_pred EEEecCccccchHHHHHHhhC
Confidence 999998762 2555555543
No 38
>cd01455 vWA_F11C1-5a_type Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A
Probab=99.70 E-value=1.8e-15 Score=131.20 Aligned_cols=170 Identities=16% Similarity=0.079 Sum_probs=114.4
Q ss_pred ceEEEEEeCCCCCC------c---chHHHHHHHHHHHHH--HcCCCCeEEEEEeecccccccccccccccccChhhHHHH
Q psy10004 83 KDMVILLDNSGSMM------G---QRREIARHVINNLLD--TLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVREL 151 (354)
Q Consensus 83 ~dvvillD~SgSM~------g---~~~~~ak~a~~~ll~--~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~ 151 (354)
|.++++||.||||. | .+++.+|..+..+.+ .-..+|+++ |++..-...+..-+.......+..+.+
T Consensus 1 ~~l~lavDlSgSM~~~~~~dg~~~~RL~a~k~v~~~f~~f~~~r~~DriG---~~g~~~~~~~lt~d~p~t~d~~~~~~l 77 (191)
T cd01455 1 KRLKLVVDVSGSMYRFNGYDGRLDRSLEAVVMVMEAFDGFEDKIQYDIIG---HSGDGPCVPFVKTNHPPKNNKERLETL 77 (191)
T ss_pred CceEEEEECcHhHHHHhccCCccccHHHHHHHHHHHHHHHHHhCccceee---ecCcccccCccccccCcccchhHHHHH
Confidence 56899999999992 2 567888888777764 334788888 333332211110111111122333566
Q ss_pred HHHHhhh--cCCCCCCcHHHHHHHHHHHHH-hccccCCCCCCCcceEEEeccCCCCc----hHHHHHHhhhcccCCCCCC
Q psy10004 152 KMGVEMI--GDANNIANFTVALTRAFNILE-NARNDKKTGADCNQAIMVVTDGASEN----YKEVFEEFNWRGQNDSTLW 224 (354)
Q Consensus 152 ~~~i~~l--~~~~G~T~~~~aL~~A~~~l~-~~~~~~~~~~~~~~~IillTDG~~~~----~~~i~~~~~~~~~~~~~~~ 224 (354)
...++.. +.++.+|. .||..|.+.|. +.. ...++|||||||..+. +.++...+. ...
T Consensus 78 ~~~l~~~q~g~ag~~Ta--dAi~~av~rl~~~~~-------a~~kvvILLTDG~n~~~~i~P~~aAa~lA-------~~~ 141 (191)
T cd01455 78 KMMHAHSQFCWSGDHTV--EATEFAIKELAAKED-------FDEAIVIVLSDANLERYGIQPKKLADALA-------REP 141 (191)
T ss_pred HHHHHhcccCccCccHH--HHHHHHHHHHHhcCc-------CCCcEEEEEeCCCcCCCCCChHHHHHHHH-------HhC
Confidence 6666655 22344555 99999999996 532 2358999999999763 333222332 135
Q ss_pred CeEEEEEEeCCCCCCHHHHHHHHhcCCcEEEEeCChhHHHHHHHHHHHH
Q psy10004 225 PVRVFSYLVGKEVADYRDVKWMACANKGYYVHLSTLAEVRDQILSYVPV 273 (354)
Q Consensus 225 ~v~IftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~~~~~ 273 (354)
+|+|||||+|.. +.+.|+.||..++|.|+.+.+.+++.+.+.++|..
T Consensus 142 gV~iytIgiG~~--d~~~l~~iA~~tgG~~F~A~d~~~L~~iy~~I~~~ 188 (191)
T cd01455 142 NVNAFVIFIGSL--SDEADQLQRELPAGKAFVCMDTSELPHIMQQIFTS 188 (191)
T ss_pred CCEEEEEEecCC--CHHHHHHHHhCCCCcEEEeCCHHHHHHHHHHHHHH
Confidence 899999999973 67889999999999999999999988877777754
No 39
>TIGR02031 BchD-ChlD magnesium chelatase ATPase subunit D. This model represents one of two ATPase subunits of the trimeric magnesium chelatase responsible for insertion of magnesium ion into protoporphyrin IX. This is an essential step in the biosynthesis of both chlorophyll and bacteriochlorophyll. This subunit is found in green plants, photosynthetic algae, cyanobacteria and other photosynthetic bacteria. Unlike subunit I (TIGR02030), this subunit is not found in archaea.
Probab=99.67 E-value=1.9e-15 Score=154.20 Aligned_cols=162 Identities=17% Similarity=0.207 Sum_probs=123.9
Q ss_pred CCceEEEEEeCCCCCCcchHHHHHHHHHHHHHH-cCCCCeEEEEEeeccc-ccccccccccccccChhhHHHHHHHHhhh
Q psy10004 81 SPKDMVILLDNSGSMMGQRREIARHVINNLLDT-LGNNDYVNVLQFTSVC-KEVVPCFADILVQANLANVRELKMGVEMI 158 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~-L~~~d~v~vi~Fs~~~-~~~~~c~~~~lv~~~~~n~~~~~~~i~~l 158 (354)
....++||||+||||.+.++..+|.++..++.. +...|+|+||.|++.. ..+++ . ..++..+...|..+
T Consensus 406 ~~~~v~fvvD~SGSM~~~rl~~aK~av~~Ll~~~~~~~D~v~Li~F~~~~a~~~lp--------~-t~~~~~~~~~L~~l 476 (589)
T TIGR02031 406 SGRLLIFVVDASGSAAVARMSEAKGAVELLLGEAYVHRDQVSLIAFRGTAAEVLLP--------P-SRSVEQAKRRLDVL 476 (589)
T ss_pred cCceEEEEEECCCCCChHHHHHHHHHHHHHHHhhccCCCEEEEEEECCCCceEECC--------C-CCCHHHHHHHHhcC
Confidence 456689999999999999999999999999875 5689999999998764 33332 1 23567778889999
Q ss_pred cCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch----------------HHHHHHhhhcccCCCC
Q psy10004 159 GDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY----------------KEVFEEFNWRGQNDST 222 (354)
Q Consensus 159 ~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~----------------~~i~~~~~~~~~~~~~ 222 (354)
.++|+|++..||..|++.+.+... ....+.|||||||.++.+ .++..... .. .
T Consensus 477 -~~gGgTpL~~gL~~A~~~~~~~~~-----~~~~~~ivllTDG~~nv~~~~~~~~~~~~~~~~~~~~~~~a~--~~---~ 545 (589)
T TIGR02031 477 -PGGGGTPLAAGLAAAFQTALQARS-----SGGTPTIVLITDGRGNIPLDGDPESIKADREQAAEEALALAR--KI---R 545 (589)
T ss_pred -CCCCCCcHHHHHHHHHHHHHHhcc-----cCCceEEEEECCCCCCCCCCcccccccccchhHHHHHHHHHH--HH---H
Confidence 999999999999999999875432 134579999999998721 11111111 11 2
Q ss_pred CCCeEEEEEEeCCCCCCHHHHHHHHhcCCcEEEEeCChhH
Q psy10004 223 LWPVRVFSYLVGKEVADYRDVKWMACANKGYYVHLSTLAE 262 (354)
Q Consensus 223 ~~~v~IftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~~~ 262 (354)
..+|.+++|++|.......+++.||...+|.|+++.+...
T Consensus 546 ~~gi~~~vid~~~~~~~~~~~~~lA~~~~g~y~~l~~~~a 585 (589)
T TIGR02031 546 EAGMPALVIDTAMRFVSTGFAQKLARKMGAHYIYLPNATA 585 (589)
T ss_pred hcCCeEEEEeCCCCCccchHHHHHHHhcCCcEEeCCCCCh
Confidence 3578999999987644567899999999999999987643
No 40
>cd01457 vWA_ORF176_type VWA ORF176 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most
Probab=99.64 E-value=8.8e-15 Score=129.82 Aligned_cols=153 Identities=18% Similarity=0.191 Sum_probs=109.6
Q ss_pred CceEEEEEeCCCCCCcc-------hHHHHHHHHHHHHHHcC--CCCeEEEEEeecccccccccccccccccChhhHHHHH
Q psy10004 82 PKDMVILLDNSGSMMGQ-------RREIARHVINNLLDTLG--NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELK 152 (354)
Q Consensus 82 p~dvvillD~SgSM~g~-------~~~~ak~a~~~ll~~L~--~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~ 152 (354)
++|++|+||.||||... +++.+++++..++..+. +.|.++++.|++......+ .+ ...+.
T Consensus 2 ~~dvv~~ID~SgSM~~~~~~~~~~k~~~ak~~~~~l~~~~~~~D~d~i~l~~f~~~~~~~~~--------~~---~~~v~ 70 (199)
T cd01457 2 NRDYTLLIDKSGSMAEADEAKERSRWEEAQESTRALARKCEEYDSDGITVYLFSGDFRRYDN--------VN---SSKVD 70 (199)
T ss_pred CcCEEEEEECCCcCCCCCCCCCchHHHHHHHHHHHHHHHHHhcCCCCeEEEEecCCccccCC--------cC---HHHHH
Confidence 67999999999999853 79999999999988765 5678999999888644322 12 56677
Q ss_pred HHHhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhh--cccCCCCCCCeEEEE
Q psy10004 153 MGVEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNW--RGQNDSTLWPVRVFS 230 (354)
Q Consensus 153 ~~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~--~~~~~~~~~~v~Ift 230 (354)
+.+.++ .+.|+|++..+|..|.+.+....... ........||++|||.+++...+.+.+.. +... ...++.|++
T Consensus 71 ~~~~~~-~p~G~T~l~~~l~~a~~~~~~~~~~~-~~~p~~~~vIiiTDG~~~d~~~~~~~i~~a~~~l~--~~~~i~i~~ 146 (199)
T cd01457 71 QLFAEN-SPDGGTNLAAVLQDALNNYFQRKENG-ATCPEGETFLVITDGAPDDKDAVERVIIKASDELD--ADNELAISF 146 (199)
T ss_pred HHHhcC-CCCCcCcHHHHHHHHHHHHHHHHhhc-cCCCCceEEEEEcCCCCCcHHHHHHHHHHHHHhhc--cccCceEEE
Confidence 777888 88999999999999986554332111 01112588999999999876543332210 1111 124799999
Q ss_pred EEeCCCCCCHHHHHHHHhc
Q psy10004 231 YLVGKEVADYRDVKWMACA 249 (354)
Q Consensus 231 igiG~~~~~~~~L~~iA~~ 249 (354)
|+||.+..+...|+.++..
T Consensus 147 v~vG~~~~~~~~L~~ld~~ 165 (199)
T cd01457 147 LQIGRDPAATAFLKALDDQ 165 (199)
T ss_pred EEeCCcHHHHHHHHHHhHH
Confidence 9999875567788888754
No 41
>COG4245 TerY Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [General function prediction only]
Probab=99.63 E-value=8.7e-15 Score=123.83 Aligned_cols=172 Identities=19% Similarity=0.261 Sum_probs=117.8
Q ss_pred eEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCC------CeEEEEEeecccccccccccccccccChhhHHHHHH-HHh
Q psy10004 84 DMVILLDNSGSMMGQRREIARHVINNLLDTLGNN------DYVNVLQFTSVCKEVVPCFADILVQANLANVRELKM-GVE 156 (354)
Q Consensus 84 dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~------d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~-~i~ 156 (354)
-+++|||+||||.|.+++..+..+..++++|..+ -.++||+|++.++...| |. .+.+ ...
T Consensus 5 P~~lllDtSgSM~Ge~IealN~Glq~m~~~Lkqdp~Ale~v~lsIVTF~~~a~~~~p-f~------------~~~nF~~p 71 (207)
T COG4245 5 PCYLLLDTSGSMIGEPIEALNAGLQMMIDTLKQDPYALERVELSIVTFGGPARVIQP-FT------------DAANFNPP 71 (207)
T ss_pred CEEEEEecCcccccccHHHHHHHHHHHHHHHHhChhhhheeEEEEEEecCcceEEec-hh------------hHhhcCCC
Confidence 4789999999999999999999999999999744 47999999998877655 21 1111 122
Q ss_pred hhcCCCCCCcHHHHHHHHHHHHHhccccC--CCCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEEEEEEeC
Q psy10004 157 MIGDANNIANFTVALTRAFNILENARNDK--KTGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRVFSYLVG 234 (354)
Q Consensus 157 ~l~~~~G~T~~~~aL~~A~~~l~~~~~~~--~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~IftigiG 234 (354)
.+ .+.|+|.+++||+.|.+++.+..... ......++.++|+|||.|++.-..-.. ++ ... .+...+|-.+++|
T Consensus 72 ~L-~a~GgT~lGaAl~~a~d~Ie~~~~~~~a~~kgdyrP~vfLiTDG~PtD~w~~~~~-~~-~~~--~~~~k~v~a~~~G 146 (207)
T COG4245 72 IL-TAQGGTPLGAALTLALDMIEERKRKYDANGKGDYRPWVFLITDGEPTDDWQAGAA-LV-FQG--ERRAKSVAAFSVG 146 (207)
T ss_pred ce-ecCCCCchHHHHHHHHHHHHHHHhhcccCCccccceEEEEecCCCcchHHHhHHH-Hh-hhc--ccccceEEEEEec
Confidence 45 77899999999999999998753211 123567899999999999765221111 11 111 1234557777788
Q ss_pred CCCCCHHHHHHHHhcCCcEEEEeCChhHHHHHHHHHHHHhccchh
Q psy10004 235 KEVADYRDVKWMACANKGYYVHLSTLAEVRDQILSYVPVMARPLV 279 (354)
Q Consensus 235 ~~~~~~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~~~~~l~~pl~ 279 (354)
.+.++...|++|+..-+-.+. .+ ..++.++|+=++..+-
T Consensus 147 ~~~ad~~~L~qit~~V~~~~t--~d----~~~f~~fFkW~SaSis 185 (207)
T COG4245 147 VQGADNKTLNQITEKVRQFLT--LD----GLQFREFFKWLSASIS 185 (207)
T ss_pred ccccccHHHHHHHHhhccccc--cc----hHHHHHHHHHHHHHhh
Confidence 775588899999876543322 11 1345666666655443
No 42
>cd00198 vWFA Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.
Probab=99.60 E-value=5.3e-14 Score=118.12 Aligned_cols=146 Identities=25% Similarity=0.413 Sum_probs=114.9
Q ss_pred ceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCC---CCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhc
Q psy10004 83 KDMVILLDNSGSMMGQRREIARHVINNLLDTLGN---NDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIG 159 (354)
Q Consensus 83 ~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~---~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~ 159 (354)
.+++||||.|+||...++..++.++..++..+.. .++++|+.|++......+. ....+...+.+.+..+
T Consensus 1 ~~v~~viD~S~Sm~~~~~~~~~~~~~~~~~~~~~~~~~~~i~v~~f~~~~~~~~~~-------~~~~~~~~~~~~~~~~- 72 (161)
T cd00198 1 ADIVFLLDVSGSMGGEKLDKAKEALKALVSSLSASPPGDRVGLVTFGSNARVVLPL-------TTDTDKADLLEAIDAL- 72 (161)
T ss_pred CcEEEEEeCCCCcCcchHHHHHHHHHHHHHhcccCCCCcEEEEEEecCccceeecc-------cccCCHHHHHHHHHhc-
Confidence 3789999999999778999999999999999886 8999999999876554331 1113456777788888
Q ss_pred C--CCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch----HHHHHHhhhcccCCCCCCCeEEEEEEe
Q psy10004 160 D--ANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY----KEVFEEFNWRGQNDSTLWPVRVFSYLV 233 (354)
Q Consensus 160 ~--~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~----~~i~~~~~~~~~~~~~~~~v~Iftigi 233 (354)
. ..|+|++..||..|.+.+.... .....+.||++|||.++.. ...++.++ ..+++++.|++
T Consensus 73 ~~~~~~~t~~~~al~~~~~~~~~~~-----~~~~~~~lvvitDg~~~~~~~~~~~~~~~~~--------~~~v~v~~v~~ 139 (161)
T cd00198 73 KKGLGGGTNIGAALRLALELLKSAK-----RPNARRVIILLTDGEPNDGPELLAEAARELR--------KLGITVYTIGI 139 (161)
T ss_pred ccCCCCCccHHHHHHHHHHHhcccC-----CCCCceEEEEEeCCCCCCCcchhHHHHHHHH--------HcCCEEEEEEc
Confidence 5 7899999999999999987542 1345689999999998754 34445443 24799999999
Q ss_pred CCCCCCHHHHHHHHhcC
Q psy10004 234 GKEVADYRDVKWMACAN 250 (354)
Q Consensus 234 G~~~~~~~~L~~iA~~~ 250 (354)
|.+. +...|+.++..+
T Consensus 140 g~~~-~~~~l~~l~~~~ 155 (161)
T cd00198 140 GDDA-NEDELKEIADKT 155 (161)
T ss_pred CCCC-CHHHHHHHhccc
Confidence 9854 788999998876
No 43
>TIGR02442 Cob-chelat-sub cobaltochelatase subunit. A number of genomes (actinobacteria, cyanobacteria, betaproteobacteria and pseudomonads) which apparently biosynthesize B12, encode a cobN gene but are demonstrably lacking cobS and cobT. These genomes do, however contain a homolog (modelled here) of the magnesium chelatase subunits BchI/BchD family. Aside from the cyanobacteria (which have a separate magnesium chelatase trimer), these species do not make chlorins, so do not have any use for a magnesium chelatase. Furthermore, in nearly all cases the members of this family are proximal to either CobN itself or other genes involved in cobalt transport or B12 biosynthesis.
Probab=99.57 E-value=1e-13 Score=142.84 Aligned_cols=160 Identities=19% Similarity=0.214 Sum_probs=118.4
Q ss_pred CCCceEEEEEeCCCCCCc-chHHHHHHHHHHHHH-HcCCCCeEEEEEeecc-cccccccccccccccChhhHHHHHHHHh
Q psy10004 80 ASPKDMVILLDNSGSMMG-QRREIARHVINNLLD-TLGNNDYVNVLQFTSV-CKEVVPCFADILVQANLANVRELKMGVE 156 (354)
Q Consensus 80 ~~p~dvvillD~SgSM~g-~~~~~ak~a~~~ll~-~L~~~d~v~vi~Fs~~-~~~~~~c~~~~lv~~~~~n~~~~~~~i~ 156 (354)
.....++||||.||||.+ .++..+|.++..++. .+..+|+|+||.|++. +..++| .+ .+...+...|.
T Consensus 463 r~~~~vv~vvD~SgSM~~~~rl~~ak~a~~~ll~~a~~~~D~v~lI~F~g~~a~~~~p--------~t-~~~~~~~~~L~ 533 (633)
T TIGR02442 463 RAGNLVIFVVDASGSMAARGRMAAAKGAVLSLLRDAYQKRDKVALITFRGEEAEVLLP--------PT-SSVELAARRLE 533 (633)
T ss_pred CCCceEEEEEECCccCCCccHHHHHHHHHHHHHHHhhcCCCEEEEEEECCCCceEEcC--------CC-CCHHHHHHHHH
Confidence 356789999999999987 489999999988875 5678999999999864 444332 22 44666778899
Q ss_pred hhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch-------HHHHHHhhhcccCCCCCCCeEEE
Q psy10004 157 MIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY-------KEVFEEFNWRGQNDSTLWPVRVF 229 (354)
Q Consensus 157 ~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~-------~~i~~~~~~~~~~~~~~~~v~If 229 (354)
.+ .++|+|++..||..|.+++...... .....+.|||||||..+.. .++..... .. ...++.++
T Consensus 534 ~l-~~gG~Tpl~~aL~~A~~~l~~~~~~---~~~~~~~vvliTDG~~n~~~~~~~~~~~~~~~a~--~l---~~~~i~~~ 604 (633)
T TIGR02442 534 EL-PTGGRTPLAAGLLKAAEVLSNELLR---DDDGRPLLVVITDGRANVADGGEPPTDDARTIAA--KL---AARGILFV 604 (633)
T ss_pred hC-CCCCCCCHHHHHHHHHHHHHHhhcc---CCCCceEEEEECCCCCCCCCCCCChHHHHHHHHH--HH---HhcCCeEE
Confidence 99 9999999999999999998743211 1245689999999998642 12221111 11 12468888
Q ss_pred EEEeCCCCCCHHHHHHHHhcCCcEEEEe
Q psy10004 230 SYLVGKEVADYRDVKWMACANKGYYVHL 257 (354)
Q Consensus 230 tigiG~~~~~~~~L~~iA~~~~G~~~~i 257 (354)
+|+.+........++.||...+|.|+.+
T Consensus 605 vIdt~~~~~~~~~~~~lA~~~gg~y~~l 632 (633)
T TIGR02442 605 VIDTESGFVRLGLAEDLARALGGEYVRL 632 (633)
T ss_pred EEeCCCCCcchhHHHHHHHhhCCeEEec
Confidence 8887654335788999999999999875
No 44
>cd01452 VWA_26S_proteasome_subunit 26S proteasome plays a major role in eukaryotic protein breakdown, especially for ubiquitin-tagged proteins. It is an ATP-dependent protease responsible for the bulk of non-lysosomal proteolysis in eukaryotes, often using covalent modification of proteins by ubiquitylation. It consists of a 20S proteolytic core particle (CP) and a 19S regulatory particle (RP). The CP is an ATP independent peptidase consisting of hydrolyzing activities. One or both ends of CP carry the RP that confers both ubiquitin and ATP dependence to the 26S proteosome. The RP's proposed functions include recognition of substrates and translocation of these to CP for proteolysis. The RP can dissociate into a stable lid and base subcomplexes. The base is composed of three non-ATPase subunits (Rpn 1, 2 and 10). A single residue in the vWA domain of Rpn10 has been implicated to be responsible for stabilizing the lid-base association.
Probab=99.51 E-value=8e-13 Score=115.23 Aligned_cols=160 Identities=14% Similarity=0.101 Sum_probs=117.8
Q ss_pred eEEEEEeCCCCCCc-----chHHHHHHHHHHHHHH---cCCCCeEEEEEeec-ccccccccccccccccChhhHHHHHHH
Q psy10004 84 DMVILLDNSGSMMG-----QRREIARHVINNLLDT---LGNNDYVNVLQFTS-VCKEVVPCFADILVQANLANVRELKMG 154 (354)
Q Consensus 84 dvvillD~SgSM~g-----~~~~~ak~a~~~ll~~---L~~~d~v~vi~Fs~-~~~~~~~c~~~~lv~~~~~n~~~~~~~ 154 (354)
-++|+||.|.||.. .|++.+|.++..++.. ..+.+++||+.|.+ .+..+.| -+.+...+...
T Consensus 5 a~vi~lD~S~sM~a~D~~PnRL~aak~~i~~~~~~f~~~np~~~vGlv~fag~~a~v~~p---------lT~D~~~~~~~ 75 (187)
T cd01452 5 ATMICIDNSEYMRNGDYPPTRFQAQADAVNLICQAKTRSNPENNVGLMTMAGNSPEVLVT---------LTNDQGKILSK 75 (187)
T ss_pred EEEEEEECCHHHHcCCCCCCHHHHHHHHHHHHHHHHHhcCCCccEEEEEecCCceEEEEC---------CCCCHHHHHHH
Confidence 36899999999974 6899999999888633 35788999999999 6655433 23457788889
Q ss_pred HhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEEEEEEeC
Q psy10004 155 VEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRVFSYLVG 234 (354)
Q Consensus 155 i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~IftigiG 234 (354)
++.+ .++|++++..||..|...|...... ....++|+|++++...++..+.+..+. . ...+|+|++||+|
T Consensus 76 L~~i-~~~g~~~l~~AL~~A~~~L~~~~~~----~~~~rivi~v~S~~~~d~~~i~~~~~~-l----kk~~I~v~vI~~G 145 (187)
T cd01452 76 LHDV-QPKGKANFITGIQIAQLALKHRQNK----NQKQRIVAFVGSPIEEDEKDLVKLAKR-L----KKNNVSVDIINFG 145 (187)
T ss_pred HHhC-CCCCcchHHHHHHHHHHHHhcCCCc----CCcceEEEEEecCCcCCHHHHHHHHHH-H----HHcCCeEEEEEeC
Confidence 9999 8999999999999999999754321 122366777777766666555544331 1 1358999999999
Q ss_pred CCCCCHHHHHHHHhcC----CcEEEEeCChhH
Q psy10004 235 KEVADYRDVKWMACAN----KGYYVHLSTLAE 262 (354)
Q Consensus 235 ~~~~~~~~L~~iA~~~----~G~~~~i~~~~~ 262 (354)
....+.+.|+.+...- +..+..++....
T Consensus 146 ~~~~~~~~l~~~~~~~~~~~~s~~~~~~~~~~ 177 (187)
T cd01452 146 EIDDNTEKLTAFIDAVNGKDGSHLVSVPPGEN 177 (187)
T ss_pred CCCCCHHHHHHHHHHhcCCCCceEEEeCCCCc
Confidence 9887888888885443 236777766443
No 45
>PF10138 vWA-TerF-like: vWA found in TerF C terminus ; InterPro: IPR019303 This entry represents the N-terminal domain of a family of proteins that confer resistance to the metalloid element tellurium and its salts.
Probab=99.37 E-value=5.4e-11 Score=103.77 Aligned_cols=156 Identities=14% Similarity=0.191 Sum_probs=110.0
Q ss_pred eEEEEEeCCCCCCcc----hHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHh---
Q psy10004 84 DMVILLDNSGSMMGQ----RREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVE--- 156 (354)
Q Consensus 84 dvvillD~SgSM~g~----~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~--- 156 (354)
.|++|||.||||++. ..+.+.+-+..+...|.++-.+-++.|+++.+. +...+..|.....+.+.
T Consensus 3 rV~LVLD~SGSM~~~yk~G~vQ~~~Er~lalA~~~DdDG~i~v~~Fs~~~~~--------~~~vt~~~~~~~v~~~~~~~ 74 (200)
T PF10138_consen 3 RVYLVLDISGSMRPLYKDGTVQRVVERILALAAQFDDDGEIDVWFFSTEFDR--------LPDVTLDNYEGYVDELHAGL 74 (200)
T ss_pred EEEEEEeCCCCCchhhhCccHHHHHHHHHHHHhhcCCCCceEEEEeCCCCCc--------CCCcCHHHHHHHHHHHhccc
Confidence 588999999999863 344555555556667888888999999998755 33345555544333321
Q ss_pred -hhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEEEEEEeCC
Q psy10004 157 -MIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRVFSYLVGK 235 (354)
Q Consensus 157 -~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~IftigiG~ 235 (354)
.+ ...|+|++..+++.+.+....... ...+-.|+++|||.+.+..++.+.+. ++ ...+|-.--||+|.
T Consensus 75 ~~~-~~~G~t~y~~vm~~v~~~y~~~~~-----~~~P~~VlFiTDG~~~~~~~~~~~i~--~a---s~~pifwqFVgiG~ 143 (200)
T PF10138_consen 75 PDW-GRMGGTNYAPVMEDVLDHYFKREP-----SDAPALVLFITDGGPDDRRAIEKLIR--EA---SDEPIFWQFVGIGD 143 (200)
T ss_pred ccc-CCCCCcchHHHHHHHHHHHhhcCC-----CCCCeEEEEEecCCccchHHHHHHHH--hc---cCCCeeEEEEEecC
Confidence 22 344889999999999988764332 23457899999999998888877764 22 34688888899997
Q ss_pred CCCCHHHHHHHHh------cCCcEEEEeCChh
Q psy10004 236 EVADYRDVKWMAC------ANKGYYVHLSTLA 261 (354)
Q Consensus 236 ~~~~~~~L~~iA~------~~~G~~~~i~~~~ 261 (354)
+ +..+|++|.. .|.|+ +++.+.+
T Consensus 144 ~--~f~fL~kLD~l~gR~vDNa~F-f~~~d~~ 172 (200)
T PF10138_consen 144 S--NFGFLEKLDDLAGRVVDNAGF-FAIDDID 172 (200)
T ss_pred C--cchHHHHhhccCCcccCCcCe-EecCCcc
Confidence 5 4889999988 35554 4566543
No 46
>COG2425 Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [General function prediction only]
Probab=99.36 E-value=5.2e-12 Score=122.14 Aligned_cols=156 Identities=22% Similarity=0.236 Sum_probs=111.7
Q ss_pred cccccccccceeeeccCCCceEEEEEeCCCCCCcchHHHHHHHHHHHHHH-cCCCCeEEEEEeecccccccccccccccc
Q psy10004 64 TDLYDCRVREWYINAAASPKDMVILLDNSGSMMGQRREIARHVINNLLDT-LGNNDYVNVLQFTSVCKEVVPCFADILVQ 142 (354)
Q Consensus 64 ~~~yd~r~r~wy~~a~~~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~-L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~ 142 (354)
+-.|+-+...| ..|+++||+||||.|.+...||..+.+++.. |..+-++.++.|++....+ -+.
T Consensus 262 Ll~Yr~~gk~~--------GpvilllD~SGSM~G~~e~~AKAvalAl~~~alaenR~~~~~lF~s~~~~~-------el~ 326 (437)
T COG2425 262 LLTYRLQGKSE--------GPVILLLDKSGSMSGFKEQWAKAVALALMRIALAENRDCYVILFDSEVIEY-------ELY 326 (437)
T ss_pred chhhhhhcCCC--------CCEEEEEeCCCCcCCcHHHHHHHHHHHHHHHHHHhccceEEEEecccceee-------eec
Confidence 45677776656 7899999999999999999999888888765 6678889999999943221 122
Q ss_pred cChhhHHHHHHHHhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCC
Q psy10004 143 ANLANVRELKMGVEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDST 222 (354)
Q Consensus 143 ~~~~n~~~~~~~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~ 222 (354)
....+..++.+.+... .++| ||+..||..|.+.+.+..-. .--||+||||...-..+.+..++. .. .
T Consensus 327 ~k~~~~~e~i~fL~~~-f~GG-TD~~~~l~~al~~~k~~~~~-------~adiv~ITDg~~~~~~~~~~~v~e-~~---k 393 (437)
T COG2425 327 EKKIDIEELIEFLSYV-FGGG-TDITKALRSALEDLKSRELF-------KADIVVITDGEDERLDDFLRKVKE-LK---K 393 (437)
T ss_pred CCccCHHHHHHHHhhh-cCCC-CChHHHHHHHHHHhhccccc-------CCCEEEEeccHhhhhhHHHHHHHH-HH---H
Confidence 2234678888888776 6666 99999999999998754321 134999999998755444443331 11 1
Q ss_pred CCCeEEEEEEeCCCCCCHHHHHHHHhc
Q psy10004 223 LWPVRVFSYLVGKEVADYRDVKWMACA 249 (354)
Q Consensus 223 ~~~v~IftigiG~~~~~~~~L~~iA~~ 249 (354)
..+.++|++.||... ..-|..|+..
T Consensus 394 ~~~~rl~aV~I~~~~--~~~l~~Isd~ 418 (437)
T COG2425 394 RRNARLHAVLIGGYG--KPGLMRISDH 418 (437)
T ss_pred HhhceEEEEEecCCC--Ccccceeeee
Confidence 357899999999754 3455555544
No 47
>cd01460 vWA_midasin VWA_Midasin: Midasin is a member of the AAA ATPase family. The proteins of this family are unified by their common archetectural organization that is based upon a conserved ATPase domain. The AAA domain of midasin contains six tandem AAA protomers. The AAA domains in midasin is followed by a D/E rich domain that is following by a VWA domain. The members of this subgroup have a conserved MIDAS motif. The function of this domain is not exactly known although it has been speculated to play a crucial role in midasin function.
Probab=99.35 E-value=9.3e-11 Score=107.65 Aligned_cols=170 Identities=16% Similarity=0.179 Sum_probs=107.4
Q ss_pred CCCceEEEEEeCCCCCCc-----chHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHH
Q psy10004 80 ASPKDMVILLDNSGSMMG-----QRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMG 154 (354)
Q Consensus 80 ~~p~dvvillD~SgSM~g-----~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~ 154 (354)
...-+|+|+||.|.||.. .+++ ++..+..++..| +.++++|+.|++++..+.| |. . .-.. +.+.+.
T Consensus 58 kr~~qIvlaID~S~SM~~~~~~~~ale-ak~lIs~al~~L-e~g~vgVv~Fg~~~~~v~P-lt----~-d~~~-~a~~~~ 128 (266)
T cd01460 58 KRDYQILIAIDDSKSMSENNSKKLALE-SLCLVSKALTLL-EVGQLGVCSFGEDVQILHP-FD----E-QFSS-QSGPRI 128 (266)
T ss_pred ccCceEEEEEecchhcccccccccHHH-HHHHHHHHHHhC-cCCcEEEEEeCCCceEeCC-CC----C-Cchh-hHHHHH
Confidence 356789999999999974 2344 666666666655 5699999999999877666 21 1 1111 344455
Q ss_pred HhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCC--CCchH--HHHHHhhhcccCCCCCCCeEEEE
Q psy10004 155 VEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGA--SENYK--EVFEEFNWRGQNDSTLWPVRVFS 230 (354)
Q Consensus 155 i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~--~~~~~--~i~~~~~~~~~~~~~~~~v~Ift 230 (354)
++......++||+..+|..|.+++.+.+... ......++||++|||. .+... ...+++ ...+|.++.
T Consensus 129 l~~~~f~~~~Tni~~aL~~a~~~f~~~~~~~-~s~~~~qlilLISDG~~~~~e~~~~~~~r~a--------~e~~i~l~~ 199 (266)
T cd01460 129 LNQFTFQQDKTDIANLLKFTAQIFEDARTQS-SSGSLWQLLLIISDGRGEFSEGAQKVRLREA--------REQNVFVVF 199 (266)
T ss_pred hCcccCCCCCCcHHHHHHHHHHHHHhhhccc-cccccccEEEEEECCCcccCccHHHHHHHHH--------HHcCCeEEE
Confidence 5554355689999999999999997653221 1111358999999999 44332 222322 135899999
Q ss_pred EEeCCCCCCH----------------HHHHHHHhcCCcEEEEeCChhHHHHHH
Q psy10004 231 YLVGKEVADY----------------RDVKWMACANKGYYVHLSTLAEVRDQI 267 (354)
Q Consensus 231 igiG~~~~~~----------------~~L~~iA~~~~G~~~~i~~~~~~~~~l 267 (354)
|++-...... ..-+.+-..+--||..+.+.+++.+.+
T Consensus 200 I~ld~~~~~~SI~d~~~~~~~~~~~~~l~~Yl~~fpfpYy~~~~~~~~lp~~l 252 (266)
T cd01460 200 IIIDNPDNKQSILDIKVVSFKNDKSGVITPYLDEFPFPYYVIVRDLNQLPSVL 252 (266)
T ss_pred EEEcCCCCCCCcccccccccCCCCccHHHHHHhcCCCCeEEEecChhHhHHHH
Confidence 9997641111 111333444445666666666555443
No 48
>PRK10997 yieM hypothetical protein; Provisional
Probab=99.23 E-value=2.2e-10 Score=112.87 Aligned_cols=137 Identities=12% Similarity=0.125 Sum_probs=100.3
Q ss_pred CCCceEEEEEeCCCCCCcchHHHHHHHHHHHHH-HcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhh
Q psy10004 80 ASPKDMVILLDNSGSMMGQRREIARHVINNLLD-TLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMI 158 (354)
Q Consensus 80 ~~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~-~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l 158 (354)
.....++++||+||||.|.+...|+..+..+.. .+..+|+++++.|++..... + + ....++..+...+..
T Consensus 321 ~~kGpiII~VDtSGSM~G~ke~~AkalAaAL~~iAl~q~dr~~li~Fs~~i~~~-~-----l--~~~~gl~~ll~fL~~- 391 (487)
T PRK10997 321 QPRGPFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEVVTY-E-----L--TGPDGLEQAIRFLSQ- 391 (487)
T ss_pred CCCCcEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHhcCCCEEEEEecCCceee-c-----c--CCccCHHHHHHHHHH-
Confidence 467899999999999999988888886666654 57899999999999976441 0 1 123456777777754
Q ss_pred cCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCC-chHHHHHHhhhcccCCCCCCCeEEEEEEeCCCC
Q psy10004 159 GDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASE-NYKEVFEEFNWRGQNDSTLWPVRVFSYLVGKEV 237 (354)
Q Consensus 159 ~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~-~~~~i~~~~~~~~~~~~~~~~v~IftigiG~~~ 237 (354)
..+|+|++..+|..+++.+.... -.+-.||+|||+... .++++.+.++.-. ...+.++|++.+|...
T Consensus 392 -~f~GGTDl~~aL~~al~~l~~~~-------~r~adIVVISDF~~~~~~eel~~~L~~Lk----~~~~~rf~~l~i~~~~ 459 (487)
T PRK10997 392 -SFRGGTDLAPCLRAIIEKMQGRE-------WFDADAVVISDFIAQRLPDELVAKVKELQ----RQHQHRFHAVAMSAHG 459 (487)
T ss_pred -hcCCCCcHHHHHHHHHHHHcccc-------cCCceEEEECCCCCCCChHHHHHHHHHHH----HhcCcEEEEEEeCCCC
Confidence 45899999999999999886432 123579999999764 3555655543111 1257999999999644
No 49
>cd01458 vWA_ku Ku70/Ku80 N-terminal domain. The Ku78 heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks (DSB) in a preferred orientation. DSB's are repaired by either homologues recombination or non-homologues end joining and facilitate repair by the non-homologous end-joining pathway (NHEJ). The Ku heterodimer is required for accurate process that tends to preserve the sequence at the junction. Ku78 is found in all three kingdoms of life. However, only the eukaryotic proteins have a vWA domain fused to them at their N-termini. The vWA domain is not involved in DNA binding but may very likey mediate Ku78's interactions with other proteins. Members of this subgroup lack the conserved MIDAS motif.
Probab=99.15 E-value=1.1e-09 Score=98.63 Aligned_cols=144 Identities=16% Similarity=0.166 Sum_probs=95.1
Q ss_pred eEEEEEeCCCCCC-------cchHHHHHHHHHHHHHH---cCCCCeEEEEEeecccccccccccc-----cccccChhhH
Q psy10004 84 DMVILLDNSGSMM-------GQRREIARHVINNLLDT---LGNNDYVNVLQFTSVCKEVVPCFAD-----ILVQANLANV 148 (354)
Q Consensus 84 dvvillD~SgSM~-------g~~~~~ak~a~~~ll~~---L~~~d~v~vi~Fs~~~~~~~~c~~~-----~lv~~~~~n~ 148 (354)
.++|+||.|.||. ..+++.++.++..++.. -.++|+|+|+.|++.......-|.. .+...+...+
T Consensus 3 ~ivf~iDvS~SM~~~~~~~~~s~l~~a~~~i~~~~~~ki~~~~~D~vGlilf~t~~~~~~~~~~~i~v~~~l~~~~~~~l 82 (218)
T cd01458 3 SVVFLVDVSPSMFESKDGEYESPFEEALKCIRQLMKSKIISSPKDLVGVVFYGTEESKNPVGYENIYVLLDLDTPGAERV 82 (218)
T ss_pred EEEEEEeCCHHHcCCCCCCCCChHHHHHHHHHHHHHhceeCCCCCeEEEEEEcccCCCCcCCCCceEEeecCCCCCHHHH
Confidence 5789999999994 26899999999999987 4799999999999985332111111 1222333334
Q ss_pred HHHHHHHhhhc-------CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCc------hHHHHHHhhh
Q psy10004 149 RELKMGVEMIG-------DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASEN------YKEVFEEFNW 215 (354)
Q Consensus 149 ~~~~~~i~~l~-------~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~------~~~i~~~~~~ 215 (354)
+.+.+.+..-. ...++|++..||..|.++|.... .....+.|||+|||.... ..++...++
T Consensus 83 ~~l~~~~~~~~~~~~~~~~~~~~~~l~~aL~~a~~~~~~~~-----~~~~~k~IvL~TDg~~p~~~~~~~~~~~~~~a~- 156 (218)
T cd01458 83 EDLKELIEPGGLSFAGQVGDSGQVSLSDALWVCLDLFSKGK-----KKKSHKRIFLFTNNDDPHGGDSIKDSQAAVKAE- 156 (218)
T ss_pred HHHHHHhhcchhhhcccCCCCCCccHHHHHHHHHHHHHhcc-----ccccccEEEEECCCCCCCCCCHHHHHHHHHHHH-
Confidence 44444332110 13568999999999999998621 123568899999997652 122222221
Q ss_pred cccCCCCCCCeEEEEEEeCCCC
Q psy10004 216 RGQNDSTLWPVRVFSYLVGKEV 237 (354)
Q Consensus 216 ~~~~~~~~~~v~IftigiG~~~ 237 (354)
.. ...+|.|++||+|...
T Consensus 157 -~l---~~~gI~i~~i~i~~~~ 174 (218)
T cd01458 157 -DL---KDKGIELELFPLSSPG 174 (218)
T ss_pred -HH---HhCCcEEEEEecCCCC
Confidence 11 2358999999999764
No 50
>PF11775 CobT_C: Cobalamin biosynthesis protein CobT VWA domain
Probab=99.08 E-value=7.4e-09 Score=91.13 Aligned_cols=169 Identities=21% Similarity=0.227 Sum_probs=99.5
Q ss_pred CceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcC-CCCeEEEEEeeccccc---ccccccccccccChhhHHHHHHHH--
Q psy10004 82 PKDMVILLDNSGSMMGQRREIARHVINNLLDTLG-NNDYVNVLQFTSVCKE---VVPCFADILVQANLANVRELKMGV-- 155 (354)
Q Consensus 82 p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~-~~d~v~vi~Fs~~~~~---~~~c~~~~lv~~~~~n~~~~~~~i-- 155 (354)
..-|-||||.||||.|.+++.|..++..+.++|. .+-.+-|+.|.+.+-. ...-|...--+..+.-+..+...+
T Consensus 12 d~~VtlLID~SGSMrgr~~~vA~~~adila~aL~~~gvp~EVlGFtT~aw~gg~~~~~w~~~G~p~~pgrln~l~h~vyk 91 (219)
T PF11775_consen 12 DTVVTLLIDCSGSMRGRPIEVAALCADILARALERCGVPVEVLGFTTRAWKGGRSREAWLAAGRPRYPGRLNDLRHIVYK 91 (219)
T ss_pred CeEEEEEEeCCcCCCCChHHHHHHHHHHHHHHHHhCCCCeEEEeeecCCcCCcchHHHHHhcCCCCCChHHHHHHHHHHH
Confidence 3446699999999999999999877777777776 5667889999887300 000000000011111122222211
Q ss_pred -------------hhhcCCC-CCCcH-HHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch--------------
Q psy10004 156 -------------EMIGDAN-NIANF-TVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY-------------- 206 (354)
Q Consensus 156 -------------~~l~~~~-G~T~~-~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~-------------- 206 (354)
..+-+.+ ..-|+ +.||..|.+.|.+..+ .++++|++|||.|.+.
T Consensus 92 ~a~~~wrraR~~l~~m~~~~~~~eniDGeAl~~a~~rL~~r~e-------~rkiLiViSDG~P~d~st~~~n~~~~L~~H 164 (219)
T PF11775_consen 92 DADTPWRRARRNLGLMMREGLLKENIDGEALRWAAERLLARPE-------QRKILIVISDGAPADDSTLSANDGDYLDAH 164 (219)
T ss_pred hcCChhhhHHHhHHHHhhccccccCCcHHHHHHHHHHHHcCCc-------cceEEEEEeCCCcCcccccccCChHHHHHH
Confidence 1110111 12233 7889999888876543 4589999999999731
Q ss_pred -HHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHhcCCcEEEEeCChhHHHHHHHHHHHHh
Q psy10004 207 -KEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRDVKWMACANKGYYVHLSTLAEVRDQILSYVPVM 274 (354)
Q Consensus 207 -~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~~~~~l 274 (354)
..+++++. ...+|.+.+||||.+++- . --.+..|.+.+++...+.+.+..+
T Consensus 165 Lr~vi~~ie-------~~~~Vel~aiGIg~D~~~--y--------Y~~~~~i~~~e~l~~~~~~~l~~l 216 (219)
T PF11775_consen 165 LRQVIAEIE-------TRSDVELIAIGIGHDVSR--Y--------YRRAVTIDDVEELGGALFEQLARL 216 (219)
T ss_pred HHHHHHHHh-------ccCCcEEEEEEcCCCchh--h--------cccceecCCHHHHHHHHHHHHHHH
Confidence 23333332 245899999999976520 0 012344677788777776665543
No 51
>TIGR01651 CobT cobaltochelatase, CobT subunit. This model describes the aerobic cobalamin pathway Pseudomonas denitrificans CobT gene product, which is a cobalt chelatase subunit, with a MW ~70 kDa. The aerobic pathway cobalt chelatase is a heterotrimeric, ATP-dependent enzyme that catalyzes cobalt insertion during cobalamin biosynthesis. The other two subunits are the P. denitrificans CobS (TIGR01650) and CobN (pfam02514 CobN/Magnesium Chelatase) proteins. To avoid potential confusion with the nonhomologous Salmonella typhimurium/E.coli cobT gene product, the P. denitrificans gene symbol is not used in the name of this model.
Probab=98.93 E-value=3.1e-08 Score=98.56 Aligned_cols=162 Identities=23% Similarity=0.265 Sum_probs=99.2
Q ss_pred CceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcC-CCCeEEEEEeecccccccccccccc---------cccChhhHHHH
Q psy10004 82 PKDMVILLDNSGSMMGQRREIARHVINNLLDTLG-NNDYVNVLQFTSVCKEVVPCFADIL---------VQANLANVREL 151 (354)
Q Consensus 82 p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~-~~d~v~vi~Fs~~~~~~~~c~~~~l---------v~~~~~n~~~~ 151 (354)
..-|.||||.||||.|.+..+|...+..|..+|. .+-.+-|+.|.+.+ |++.. .+..|.-++.+
T Consensus 392 D~~V~LLID~SGSM~~r~~~vA~~~a~iLa~aL~~~gIp~eVlGFtt~a------w~gg~~re~w~~~g~p~~PgRlN~l 465 (600)
T TIGR01651 392 DTVVTLLIDNSGSMRGRPITVAATCADILARTLERCGVKVEILGFTTRA------WKGGQSREKWLKAGKPAAPGRLNDL 465 (600)
T ss_pred CcEEEEEEECCccCCCCHHHHHHHHHHHHHHHHHHCCCCeEEEeecccc------cccccchHHHHhcCCCCCCcccchh
Confidence 4456799999999999988888876767777776 56788899999864 21100 00011111111
Q ss_pred HHHH---------------hhhcCCC-CCCc-HHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch--------
Q psy10004 152 KMGV---------------EMIGDAN-NIAN-FTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY-------- 206 (354)
Q Consensus 152 ~~~i---------------~~l~~~~-G~T~-~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~-------- 206 (354)
...| ..+-..+ ..-| =+.||..|.+.|....+ ..+++|+||||.|.+.
T Consensus 466 ~hiiyk~ad~~wr~~r~~l~~mm~~~~~~eN~DGeAl~wa~~rL~~R~e-------~rKiL~ViSDG~P~D~~TlsvN~~ 538 (600)
T TIGR01651 466 RHIIYKSADAPWRRARRNLGLMMREGLLKENIDGEALMWAHQRLIARPE-------QRRILMMISDGAPVDDSTLSVNPG 538 (600)
T ss_pred hhhhhhccccchhhhccchhhhhhccccccCCchHHHHHHHHHHhcCcc-------cceEEEEEeCCCcCCccccccCch
Confidence 1111 1110000 0112 27899999998876543 4599999999999742
Q ss_pred -------HHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHhcCCcEEEEeCChhHHHHHHHHHHHH
Q psy10004 207 -------KEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRDVKWMACANKGYYVHLSTLAEVRDQILSYVPV 273 (354)
Q Consensus 207 -------~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~~~~~ 273 (354)
..++..+. .+.+|.+++||||.+++ .+- ..+..|.+.+++..+|.+-+..
T Consensus 539 ~~l~~hLr~vi~~~e-------~~~~vel~aigIg~Dv~------r~Y----~~~v~i~~~~eL~~~~~~qLa~ 595 (600)
T TIGR01651 539 NYLERHLRAVIEEIE-------TRSPVELLAIGIGHDVT------RYY----RRAVTIVDAEELAGAMTEQLAA 595 (600)
T ss_pred hHHHHHHHHHHHHHh-------ccCCceEEEeeccccHH------HHc----cccceecCHHHHHHHHHHHHHH
Confidence 23444442 23589999999998652 111 2345678888887776655543
No 52
>COG4867 Uncharacterized protein with a von Willebrand factor type A (vWA) domain [General function prediction only]
Probab=98.70 E-value=2.8e-07 Score=87.38 Aligned_cols=163 Identities=20% Similarity=0.227 Sum_probs=109.4
Q ss_pred CCceEEEEEeCCCCCCc-chHHHHHHHH---HHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHh
Q psy10004 81 SPKDMVILLDNSGSMMG-QRREIARHVI---NNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVE 156 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g-~~~~~ak~a~---~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~ 156 (354)
....++++||+|-||-- -+|.-+|+.+ .-++.+--++|.+.+|.|...++.+ + +. .+.
T Consensus 462 t~aAvallvDtS~SM~~eGRw~PmKQtALALhHLv~TrfrGD~l~~i~Fgr~A~~v-~-------------v~----eLt 523 (652)
T COG4867 462 TQAAVALLVDTSFSMVMEGRWLPMKQTALALHHLVCTRFRGDALQIIAFGRYARTV-T-------------AA----ELT 523 (652)
T ss_pred cccceeeeeeccHHHHHhccCCchHHHHHHHHHHHHhcCCCcceEEEeccchhccc-C-------------HH----HHh
Confidence 55678999999999962 2566666654 4456777799999999999988653 1 11 122
Q ss_pred hh-cCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCc-----------------hHHHHHHhhhccc
Q psy10004 157 MI-GDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASEN-----------------YKEVFEEFNWRGQ 218 (354)
Q Consensus 157 ~l-~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~-----------------~~~i~~~~~~~~~ 218 (354)
.+ +...-+||+..||..|-+.|.... ...+.|+++|||+|+. +..+...+. ..
T Consensus 524 ~l~~v~eqgTNlhhaL~LA~r~l~Rh~-------~~~~~il~vTDGePtAhle~~DG~~~~f~yp~DP~t~~~Tvr--~~ 594 (652)
T COG4867 524 GLAGVYEQGTNLHHALALAGRHLRRHA-------GAQPVVLVVTDGEPTAHLEDGDGTSVFFDYPPDPRTIAHTVR--GF 594 (652)
T ss_pred cCCCccccccchHHHHHHHHHHHHhCc-------ccCceEEEEeCCCccccccCCCCceEecCCCCChhHHHHHHH--HH
Confidence 23 022347999999999999997543 3458899999999861 222222221 11
Q ss_pred CCCCCCCeEEEEEEeCCCCCCHHHHHHHHhcCCcEEEEeCChhHHHHH-HHHHH
Q psy10004 219 NDSTLWPVRVFSYLVGKEVADYRDVKWMACANKGYYVHLSTLAEVRDQ-ILSYV 271 (354)
Q Consensus 219 ~~~~~~~v~IftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~~~~~~~-l~~~~ 271 (354)
.+..+.++.|-+|-+|.+..-..+++.+|.-.+|..+ +++++.+-.. +.+|+
T Consensus 595 d~~~r~G~q~t~FrLg~DpgL~~Fv~qva~rv~G~vv-~pdldglGaaVvgdyl 647 (652)
T COG4867 595 DDMARLGAQVTIFRLGSDPGLARFIDQVARRVQGRVV-VPDLDGLGAAVVGDYL 647 (652)
T ss_pred HHHHhccceeeEEeecCCHhHHHHHHHHHHHhCCeEE-ecCcchhhHHHHHHHH
Confidence 1113457888888888776567899999999999765 4566555443 44554
No 53
>PF05762 VWA_CoxE: VWA domain containing CoxE-like protein; InterPro: IPR008912 This group of proteins contains a VWA type domain and the function of this family is unknown. It is found as part of a CO oxidising (Cox) system operon in several bacteria [].
Probab=98.69 E-value=2.1e-07 Score=83.95 Aligned_cols=115 Identities=22% Similarity=0.328 Sum_probs=70.0
Q ss_pred CCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHh-hhc
Q psy10004 81 SPKDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVE-MIG 159 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~-~l~ 159 (354)
.|..+|+++|+||||.+.. ..+-..+..+...+. ++.++.|++....+.+. +.... .......+. ...
T Consensus 56 ~~~~lvvl~DvSGSM~~~s-~~~l~~~~~l~~~~~---~~~~f~F~~~l~~vT~~----l~~~~---~~~~l~~~~~~~~ 124 (222)
T PF05762_consen 56 KPRRLVVLCDVSGSMAGYS-EFMLAFLYALQRQFR---RVRVFVFSTRLTEVTPL----LRRRD---PEEALARLSALVQ 124 (222)
T ss_pred CCccEEEEEeCCCChHHHH-HHHHHHHHHHHHhCC---CEEEEEEeeehhhhhhh----hccCC---HHHHHHHHHhhcc
Confidence 5669999999999998632 122233333333333 88999999987654332 21111 122222222 122
Q ss_pred CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCC-CCchHHHHHHh
Q psy10004 160 DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGA-SENYKEVFEEF 213 (354)
Q Consensus 160 ~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~-~~~~~~i~~~~ 213 (354)
..+|||+++.||..+.+.+.... - ....+|++|||. .+......+.+
T Consensus 125 ~~~GgTdi~~aL~~~~~~~~~~~------~-~~t~vvIiSDg~~~~~~~~~~~~l 172 (222)
T PF05762_consen 125 SFGGGTDIGQALREFLRQYARPD------L-RRTTVVIISDGWDTNDPEPLAEEL 172 (222)
T ss_pred CCCCccHHHHHHHHHHHHhhccc------c-cCcEEEEEecccccCChHHHHHHH
Confidence 47899999999999998875321 1 246899999994 44444444433
No 54
>PF04056 Ssl1: Ssl1-like; InterPro: IPR007198 Ssl1-like proteins are 40 kDa subunits of the transcription factor II H complex. This domain is often found associated with the C2H2 type Zn-finger (IPR007087 from INTERPRO).; GO: 0008270 zinc ion binding, 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent
Probab=98.64 E-value=1.6e-06 Score=75.80 Aligned_cols=157 Identities=11% Similarity=0.112 Sum_probs=104.1
Q ss_pred EEeCCCCCCc-----chHHHHHHHHHHHHHHc---CCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhc
Q psy10004 88 LLDNSGSMMG-----QRREIARHVINNLLDTL---GNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIG 159 (354)
Q Consensus 88 llD~SgSM~g-----~~~~~ak~a~~~ll~~L---~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~ 159 (354)
|||.|.+|.. +|+....+++..++..+ .|-.+++|+...+.....+. .-..|.+...+++.++.
T Consensus 1 viD~S~~m~~~D~~PtRl~~~~~~l~~Fv~eff~qNPiSqlgii~~~~~~a~~ls--------~lsgn~~~h~~~L~~~~ 72 (193)
T PF04056_consen 1 VIDMSEAMREKDLKPTRLQCVLKALEEFVREFFDQNPISQLGIIVMRDGRAERLS--------ELSGNPQEHIEALKKLR 72 (193)
T ss_pred CeechHhHHhCcCCccHHHHHHHHHHHHHHHHHhcCChhheeeeeeecceeEEee--------ecCCCHHHHHHHHHHhc
Confidence 6899999974 67888888888887764 35668999998886433222 11123344444444431
Q ss_pred --CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCC-CCchHHHHHHhhhcccCCCCCCCeEEEEEEeCCC
Q psy10004 160 --DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGA-SENYKEVFEEFNWRGQNDSTLWPVRVFSYLVGKE 236 (354)
Q Consensus 160 --~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~-~~~~~~i~~~~~~~~~~~~~~~~v~IftigiG~~ 236 (354)
.+.|...+..||+.|...|..... ...+-|+++.-.- .-|+.++.+.++.- ...+||+.+||++.
T Consensus 73 ~~~~~G~~SLqN~Le~A~~~L~~~p~------~~srEIlvi~gSl~t~Dp~di~~ti~~l-----~~~~IrvsvI~laa- 140 (193)
T PF04056_consen 73 KLEPSGEPSLQNGLEMARSSLKHMPS------HGSREILVIFGSLTTCDPGDIHETIESL-----KKENIRVSVISLAA- 140 (193)
T ss_pred cCCCCCChhHHHHHHHHHHHHhhCcc------ccceEEEEEEeecccCCchhHHHHHHHH-----HHcCCEEEEEEEhH-
Confidence 688999999999999999986532 2223444444221 12444444433210 13589999999996
Q ss_pred CCCHHHHHHHHhcCCcEEEEeCChhHHHHH
Q psy10004 237 VADYRDVKWMACANKGYYVHLSTLAEVRDQ 266 (354)
Q Consensus 237 ~~~~~~L~~iA~~~~G~~~~i~~~~~~~~~ 266 (354)
....++.|+..|+|.|.-+-+.+.+.+.
T Consensus 141 --Ev~I~k~i~~~T~G~y~V~lde~H~~~l 168 (193)
T PF04056_consen 141 --EVYICKKICKETGGTYGVILDEDHFKEL 168 (193)
T ss_pred --HHHHHHHHHHhhCCEEEEecCHHHHHHH
Confidence 4569999999999999988887664443
No 55
>PF09967 DUF2201: VWA-like domain (DUF2201); InterPro: IPR018698 This family of various hypothetical bacterial proteins has no known function.
Probab=98.64 E-value=1.3e-07 Score=77.49 Aligned_cols=94 Identities=18% Similarity=0.248 Sum_probs=66.3
Q ss_pred EEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHh--hhcCCC
Q psy10004 85 MVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVE--MIGDAN 162 (354)
Q Consensus 85 vvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~--~l~~~~ 162 (354)
++++||+||||+...+......+..++... +.++.|+.|+..++.... +. . ....+. ++ ..+
T Consensus 1 i~vaiDtSGSis~~~l~~fl~ev~~i~~~~--~~~v~vi~~D~~v~~~~~-~~-----~-------~~~~~~~~~~-~Gg 64 (126)
T PF09967_consen 1 IVVAIDTSGSISDEELRRFLSEVAGILRRF--PAEVHVIQFDAEVQDVQV-FR-----S-------LEDELRDIKL-KGG 64 (126)
T ss_pred CEEEEECCCCCCHHHHHHHHHHHHHHHHhC--CCCEEEEEECCEeeeeeE-Ee-----c-------cccccccccc-CCC
Confidence 579999999999877777777777777776 556999999998866422 11 0 111222 34 678
Q ss_pred CCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCc
Q psy10004 163 NIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASEN 205 (354)
Q Consensus 163 G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~ 205 (354)
|||++..+++.+.+.- ....++|++|||....
T Consensus 65 GGTdf~pvf~~~~~~~-----------~~~~~vi~fTDg~~~~ 96 (126)
T PF09967_consen 65 GGTDFRPVFEYLEENR-----------PRPSVVIYFTDGEGWP 96 (126)
T ss_pred CCCcchHHHHHHHhcC-----------CCCCEEEEEeCCCCCC
Confidence 9999999999976521 1246788999998743
No 56
>KOG3768|consensus
Probab=98.55 E-value=1.4e-06 Score=85.72 Aligned_cols=193 Identities=21% Similarity=0.258 Sum_probs=127.9
Q ss_pred EEEEEeCCCCCCc------chHHHHHHHHHHHHHHc-----CCCCeEEEEEeecccccccccccccccccChhhHHHHHH
Q psy10004 85 MVILLDNSGSMMG------QRREIARHVINNLLDTL-----GNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKM 153 (354)
Q Consensus 85 vvillD~SgSM~g------~~~~~ak~a~~~ll~~L-----~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~ 153 (354)
+.||||+||||.. +-+++||.|+.+++..- .-+||+-+++|..-.+.+.-.| .++-..+.+
T Consensus 4 ~lFllDTS~SM~qrah~~~tylD~AKgaVEtFiK~R~r~~~~~gdryml~TfeepP~~vk~~~--------~~~~a~~~~ 75 (888)
T KOG3768|consen 4 FLFLLDTSGSMSQRAHPQFTYLDLAKGAVETFIKQRTRVGRETGDRYMLTTFEEPPKNVKVAC--------EKLGAVVIE 75 (888)
T ss_pred EEEEEecccchhhhccCCchhhHHHHHHHHHHHHHHhccccccCceEEEEecccCchhhhhHH--------hhcccHHHH
Confidence 5799999999984 56899999999998653 2578999999988775542211 233355667
Q ss_pred HHhhhcCCCCC-CcHHHHHHHHHHHHHhccccCC-----C---C-CCCcceEEEeccCCCCc------------------
Q psy10004 154 GVEMIGDANNI-ANFTVALTRAFNILENARNDKK-----T---G-ADCNQAIMVVTDGASEN------------------ 205 (354)
Q Consensus 154 ~i~~l~~~~G~-T~~~~aL~~A~~~l~~~~~~~~-----~---~-~~~~~~IillTDG~~~~------------------ 205 (354)
.|.++ .+.++ +-...++..||++|+=.+-..+ + . .--.-+||+||||.--.
T Consensus 76 eik~l-~a~~~s~~~~~~~t~AFdlLnlnR~qtGID~yGqGR~pf~lEP~~iI~iTDG~r~s~~~GV~~e~~Lpl~~p~p 154 (888)
T KOG3768|consen 76 EIKKL-HAPYGSCQLHHAITEAFDLLNLNRVQTGIDGYGQGRLPFNLEPVTIILITDGGRYSGVAGVPIEFRLPLDPPFP 154 (888)
T ss_pred HHHhh-cCccchhhhhHHHHHHhhhhhhhhhhhcccccccccCccccCceEEEEEecCCccccccCCceeEEeccCCCCC
Confidence 78888 66554 5678888889999863221110 0 0 12246799999993210
Q ss_pred hHHH-HHHhhhcccCCCCCCCeEEEEEEe---CCC----------CCCHHHHHHHHhcCCcEEEEeCChhHHHHHHHHHH
Q psy10004 206 YKEV-FEEFNWRGQNDSTLWPVRVFSYLV---GKE----------VADYRDVKWMACANKGYYVHLSTLAEVRDQILSYV 271 (354)
Q Consensus 206 ~~~i-~~~~~~~~~~~~~~~~v~Iftigi---G~~----------~~~~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~~~ 271 (354)
..+. .+.++| .-|+|++-+ |.. .-|...++.|..-+||.-+.+-+...+.+.+..++
T Consensus 155 Gse~TkepFRW---------DQrlftlVlRiPgt~~~~~~qlt~Vp~Dds~IermCevTGGRSysV~Spr~lnqciesLv 225 (888)
T KOG3768|consen 155 GSEMTKEPFRW---------DQRLFTLVLRIPGTPYPTISQLTAVPIDDSVIERMCEVTGGRSYSVVSPRQLNQCIESLV 225 (888)
T ss_pred ccccccccchh---------hhhhheeeEecCCCCCccHhhhcCCCCCchhhHHhhhhcCCceeeeeCHHHHHHHHHHHH
Confidence 1111 122344 334566554 211 12566789999999999999999999999999999
Q ss_pred HHhccchhccCCCCCCccCCcccc
Q psy10004 272 PVMARPLVLQRNDHPIVWTPIYAD 295 (354)
Q Consensus 272 ~~l~~pl~~~~~~~~~~w~~~y~d 295 (354)
..+...++...+-.++...++-.|
T Consensus 226 qkvQ~gVvv~FE~~~p~papi~s~ 249 (888)
T KOG3768|consen 226 QKVQYGVVVRFECLPPIPAPITSD 249 (888)
T ss_pred HhhccCeEEEeeecCCCCCCcccC
Confidence 987766666554445555554433
No 57
>PRK05325 hypothetical protein; Provisional
Probab=98.52 E-value=2.7e-06 Score=82.03 Aligned_cols=184 Identities=15% Similarity=0.132 Sum_probs=119.0
Q ss_pred cccccccccceeeeccCCCceE-EEEEeCCCCCCcchHHHHHHHHHHHHHHcC-CCCeEEEEEeeccccccccccccccc
Q psy10004 64 TDLYDCRVREWYINAAASPKDM-VILLDNSGSMMGQRREIARHVINNLLDTLG-NNDYVNVLQFTSVCKEVVPCFADILV 141 (354)
Q Consensus 64 ~~~yd~r~r~wy~~a~~~p~dv-villD~SgSM~g~~~~~ak~a~~~ll~~L~-~~d~v~vi~Fs~~~~~~~~c~~~~lv 141 (354)
++.+|-|.|.|-....+....| ++|+|+||||....-++||...--|..-|. .-..|-|+.-.++...-
T Consensus 203 ~d~~DlRYr~~~~~~~p~s~AVmfclMDvSGSM~~~~K~lakrff~lly~fL~r~Y~~vEvvFI~H~t~Ak--------- 273 (401)
T PRK05325 203 IDPFDLRYRNWEKVPKPESQAVMFCLMDVSGSMDEAEKDLAKRFFFLLYLFLRRKYENVEVVFIRHHTEAK--------- 273 (401)
T ss_pred CCcccccccccccccCccCcEEEEEEEeCCCCCchHHHHHHHHHHHHHHHHHHhccCceEEEEEeecCcee---------
Confidence 4667999999987766555555 568999999999999999998766655554 44667776666654321
Q ss_pred ccChhhHHHHHHHHhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCC--chHHHHHHhhhcccC
Q psy10004 142 QANLANVRELKMGVEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASE--NYKEVFEEFNWRGQN 219 (354)
Q Consensus 142 ~~~~~n~~~~~~~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~--~~~~i~~~~~~~~~~ 219 (354)
..+. + ..=.. .-.|||-+++|++.|.+++.+... +...|-..+-.|||..- +.....+.++. ...
T Consensus 274 EVdE---e----eFF~~-~esGGT~vSSA~~l~~eIi~~rYp----p~~wNIY~f~aSDGDNw~~D~~~~~~ll~~-~ll 340 (401)
T PRK05325 274 EVDE---E----EFFYS-RESGGTIVSSAYKLALEIIEERYP----PAEWNIYAFQASDGDNWSSDNPRCVELLRE-ELL 340 (401)
T ss_pred EcCH---H----Hcccc-CCCCCeEehHHHHHHHHHHHhhCC----HhHCeeEEEEcccCCCcCCCCHHHHHHHHH-HHH
Confidence 0011 1 11123 678999999999999999987543 34688999999999854 33444443321 111
Q ss_pred CCCCCCeEEEEEE-eCCCC-CCHHHHH---HHHhcCCc-EEEEeCChhHHHHHHHHHHHH
Q psy10004 220 DSTLWPVRVFSYL-VGKEV-ADYRDVK---WMACANKG-YYVHLSTLAEVRDQILSYVPV 273 (354)
Q Consensus 220 ~~~~~~v~Iftig-iG~~~-~~~~~L~---~iA~~~~G-~~~~i~~~~~~~~~l~~~~~~ 273 (354)
.....|+|+ |+... ....++. .+...... ....|.+.+++...+.++|..
T Consensus 341 ----p~~~~f~Y~Ev~~~~~~~~~l~~~y~~i~~~~~~f~~~~I~~~~dIyp~~r~lf~k 396 (401)
T PRK05325 341 ----PVCNYFAYIEVTPRAYRHQTLWREYERLQDTFPNFAMQRIRDKEDIYPVFRELFKK 396 (401)
T ss_pred ----HHhhheEEEEecCCCCCchHHHHHHHHhhccCCCeEEEEeCCHHHHHHHHHHHhcc
Confidence 234566665 33321 1233443 44333222 456788999999999998853
No 58
>cd01459 vWA_copine_like VWA Copine: Copines are phospholipid-binding proteins originally identified in paramecium. They are found in human and orthologues have been found in C. elegans and Arabidopsis Thaliana. None have been found in D. Melanogaster or S. Cereviciae. Phylogenetic distribution suggests that copines have been lost in some eukaryotes. No functional properties have been assigned to the VWA domains present in copines. The members of this subgroup contain a functional MIDAS motif based on their preferential binding to magnesium and manganese. However, the MIDAS motif is not totally conserved, in most cases the MIDAS consists of the sequence DxTxS instead of the motif DxSxS that is found in most cases. The C2 domains present in copines mediate phospholipid binding.
Probab=98.50 E-value=7.6e-06 Score=74.94 Aligned_cols=155 Identities=14% Similarity=0.243 Sum_probs=108.8
Q ss_pred CCceEEEEEeCCCCCC---------------cchHHHHHHHHHHHHHHcCCCCeEEEEEeeccccc---cccccc----c
Q psy10004 81 SPKDMVILLDNSGSMM---------------GQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKE---VVPCFA----D 138 (354)
Q Consensus 81 ~p~dvvillD~SgSM~---------------g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~---~~~c~~----~ 138 (354)
...++++-||-++|=. ....+.|..++..++....++..+-++.|+..... +..||. +
T Consensus 30 ~~~nl~vaIDfT~SNg~p~~~~SLHy~~~~~~N~Yq~aI~~vg~il~~yD~D~~ip~~GFGa~~~~~~~v~~~f~~~~~~ 109 (254)
T cd01459 30 LESNLIVAIDFTKSNGWPGEKRSLHYISPGRLNPYQKAIRIVGEVLQPYDSDKLIPAFGFGAIVTKDQSVFSFFPGYSES 109 (254)
T ss_pred CeeeEEEEEEeCCCCCCCCCCCCcccCCCCCccHHHHHHHHHHHHHHhcCCCCceeeEeecccCCCCCccccccCCCCCC
Confidence 5569999999999853 13456677777777777888889999999986432 344441 0
Q ss_pred cccccChhhHHHHHHHHhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhhccc
Q psy10004 139 ILVQANLANVRELKMGVEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNWRGQ 218 (354)
Q Consensus 139 ~lv~~~~~n~~~~~~~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~ 218 (354)
..+..-..-++.-...+.++ ...|.|++...+++|.+...+... .+..-++++||||..++..+..+++.. +
T Consensus 110 p~~~Gi~gvl~aY~~~l~~v-~lsGpT~fapvI~~a~~~a~~~~~-----~~~Y~VLLIiTDG~i~D~~~t~~aIv~--A 181 (254)
T cd01459 110 PECQGFEGVLRAYREALPNV-SLSGPTNFAPVIRAAANIAKASNS-----QSKYHILLIITDGEITDMNETIKAIVE--A 181 (254)
T ss_pred CcccCHHHHHHHHHHHhcee-eecCcchHHHHHHHHHHHHHHhcC-----CCceEEEEEECCCCcccHHHHHHHHHH--H
Confidence 01111112233444566677 788999999999999988765421 124578999999999999888887742 2
Q ss_pred CCCCCCCeEEEEEEeCCCCCCHHHHHHHHh
Q psy10004 219 NDSTLWPVRVFSYLVGKEVADYRDVKWMAC 248 (354)
Q Consensus 219 ~~~~~~~v~IftigiG~~~~~~~~L~~iA~ 248 (354)
.+.++.|..||+|.. +...|+.+-.
T Consensus 182 ---S~~PlSIiiVGVGd~--~F~~M~~LD~ 206 (254)
T cd01459 182 ---SKYPLSIVIVGVGDG--PFDAMERLDD 206 (254)
T ss_pred ---hcCCeEEEEEEeCCC--ChHHHHHhcC
Confidence 245899999999964 6788888865
No 59
>COG4548 NorD Nitric oxide reductase activation protein [Inorganic ion transport and metabolism]
Probab=98.48 E-value=9.8e-07 Score=86.11 Aligned_cols=173 Identities=13% Similarity=0.155 Sum_probs=112.5
Q ss_pred eEEEEEeCCCCCCc------chHHHHHHHHHHHHHHcC-CCCeEEEEEeecccccccccccccccccChhhHHHHHHHHh
Q psy10004 84 DMVILLDNSGSMMG------QRREIARHVINNLLDTLG-NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVE 156 (354)
Q Consensus 84 dvvillD~SgSM~g------~~~~~ak~a~~~ll~~L~-~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~ 156 (354)
-+.+|+|+|-||.. .-+..-+++++.+...+. .+|...+..|.+.....+. |+ .+..-...--......|.
T Consensus 448 a~TLLvD~S~St~a~mdetrRvidl~~eaL~~la~~~qa~gd~~~~~~fts~rr~~vr-i~-tvk~FDes~~~~~~~RIm 525 (637)
T COG4548 448 AFTLLVDVSASTDAKMDETRRVIDLFHEALLVLAHGHQALGDSEDILDFTSRRRPWVR-IN-TVKDFDESMGETVGPRIM 525 (637)
T ss_pred eeEEEeecccchHHHhhhhhhhHHHHHHHHHHhhchhhhhCCHHHhcCchhhcCccee-ee-eeeccccccccccchhhe
Confidence 35689999999973 123445556555544443 5677788888887665322 11 111111111233455777
Q ss_pred hhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchH---------HHHHHhhhcccCCCCCCCeE
Q psy10004 157 MIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYK---------EVFEEFNWRGQNDSTLWPVR 227 (354)
Q Consensus 157 ~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~---------~i~~~~~~~~~~~~~~~~v~ 227 (354)
.+ .|+-.|.++.||+.|-+-|.+. ++..+.+||+|||+|++.. +..++.- ++ ...+|.
T Consensus 526 AL-ePg~ytR~G~AIR~As~kL~~r-------pq~qklLivlSDGkPnd~d~YEgr~gIeDTr~AV~--ea---Rk~Gi~ 592 (637)
T COG4548 526 AL-EPGYYTRDGAAIRHASAKLMER-------PQRQKLLIVLSDGKPNDFDHYEGRFGIEDTREAVI--EA---RKSGIE 592 (637)
T ss_pred ec-CccccccccHHHHHHHHHHhcC-------cccceEEEEecCCCcccccccccccchhhHHHHHH--HH---HhcCce
Confidence 88 9999999999999999988754 4566999999999998431 1111111 00 135899
Q ss_pred EEEEEeCCCCCCHHHHHHHHhcCCcEEEEeCChhHHHHHHHHHHHHhc
Q psy10004 228 VFSYLVGKEVADYRDVKWMACANKGYYVHLSTLAEVRDQILSYVPVMA 275 (354)
Q Consensus 228 IftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~~~~~l~ 275 (354)
||.|-+..+. ...+..+- ..+-|..+...+.+...+-.++..+.
T Consensus 593 VF~Vtld~ea--~~y~p~~f--gqngYa~V~~v~~LP~~L~~lyrkL~ 636 (637)
T COG4548 593 VFNVTLDREA--ISYLPALF--GQNGYAFVERVAQLPGALPPLYRKLL 636 (637)
T ss_pred EEEEEecchh--hhhhHHHh--ccCceEEccchhhcchhHHHHHHHhc
Confidence 9999988654 33444333 34456889999999999988888764
No 60
>TIGR02877 spore_yhbH sporulation protein YhbH. This protein family, typified by YhbH in Bacillus subtilis, is found in nearly every endospore-forming bacterium and in no other genome (but note that the trusted cutoff score is set high to exclude a single high-scoring sequence from Nitrosococcus oceani ATCC 19707, which is classified in the Gammaproteobacteria). The gene in Bacillus subtilis was shown to be in the regulon of the sporulation sigma factor, sigma-E, and its mutation was shown to create a sporulation defect.
Probab=98.42 E-value=6.5e-06 Score=78.18 Aligned_cols=180 Identities=14% Similarity=0.122 Sum_probs=117.0
Q ss_pred cccccccccceeeeccCCCceE-EEEEeCCCCCCcchHHHHHHHHHHHHHHcC-CCCeEEEEEeeccccccccccccccc
Q psy10004 64 TDLYDCRVREWYINAAASPKDM-VILLDNSGSMMGQRREIARHVINNLLDTLG-NNDYVNVLQFTSVCKEVVPCFADILV 141 (354)
Q Consensus 64 ~~~yd~r~r~wy~~a~~~p~dv-villD~SgSM~g~~~~~ak~a~~~ll~~L~-~~d~v~vi~Fs~~~~~~~~c~~~~lv 141 (354)
++..|-|.|.|-....+....| ++|+|+||||......+||...--|..-|. .-..|-|+.-.+++..-
T Consensus 183 i~~~DlRYr~~~~~~~P~s~AV~fc~MDvSGSM~~~~K~lak~ff~~ly~FL~~~Y~~VeivFI~H~t~Ak--------- 253 (371)
T TIGR02877 183 ITKEDLRYKTWKENEKPESNAVVIAMMDTSGSMGQFKKYIARSFFFWMVKFLRTKYENVEICFISHHTEAK--------- 253 (371)
T ss_pred CCcccccccccccccCccCcEEEEEEEeCCCCCCHHHHHHHHHHHHHHHHHHHhccCceEEEEEeecCeeE---------
Confidence 5678999999987665555554 578999999999999999998766655554 44566666666554321
Q ss_pred ccChhhHHHHHHHHhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCC--chHHHHHHhhhcccC
Q psy10004 142 QANLANVRELKMGVEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASE--NYKEVFEEFNWRGQN 219 (354)
Q Consensus 142 ~~~~~n~~~~~~~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~--~~~~i~~~~~~~~~~ 219 (354)
..+ -+. + =.. .-.|||-+++|++.|.+++.+... +...|-..+-.|||..- +.....+.+. + ..
T Consensus 254 EVd---Eee---F-F~~-~EsGGT~vSSA~~l~~eII~~rYp----p~~wNIY~f~aSDGDNw~~D~~~c~~ll~-~-ll 319 (371)
T TIGR02877 254 EVT---EEE---F-FHK-GESGGTYCSSGYKKALEIIDERYN----PARYNIYAFHFSDGDNLTSDNERAVKLVR-K-LL 319 (371)
T ss_pred EcC---HHH---h-ccc-CCCCCeEehHHHHHHHHHHHhhCC----hhhCeeEEEEcccCCCccCCcHHHHHHHH-H-HH
Confidence 011 111 1 123 678999999999999999986543 34688899999999854 3344444443 1 21
Q ss_pred CCCCCCeEEEEEE-eCCCCCCHHHH---HH-HHhcCCcEEEEeCChhHHHHHHHHHH
Q psy10004 220 DSTLWPVRVFSYL-VGKEVADYRDV---KW-MACANKGYYVHLSTLAEVRDQILSYV 271 (354)
Q Consensus 220 ~~~~~~v~Iftig-iG~~~~~~~~L---~~-iA~~~~G~~~~i~~~~~~~~~l~~~~ 271 (354)
.-...|+|+ |+.......++ +. |+. .+=....|.+.+++...++++|
T Consensus 320 ----p~~~~f~Y~Ei~~~~~~~~l~~~y~~~i~~-~~f~~~~I~~~~dIyp~~r~lf 371 (371)
T TIGR02877 320 ----EVCNLFGYGEIMPYGYSNTLKNKFKNEIKD-PNFVPLIIRDKEDLYPALKKFL 371 (371)
T ss_pred ----HhhheEEEEEecCCCCcchHHHHHHhhhcC-CCeEEEEeCCHHHHHHHHHHhC
Confidence 235577765 34322123333 23 554 3335567889999988887764
No 61
>cd01468 trunk_domain trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Some members of this family possess a partial MIDAS motif that is a characteristic feature of most vWA domain proteins.
Probab=98.39 E-value=4e-05 Score=69.96 Aligned_cols=167 Identities=16% Similarity=0.120 Sum_probs=107.8
Q ss_pred CCceEEEEEeCCCC-CCcchHHHHHHHHHHHHHHcC--CCCeEEEEEeeccccccc---------------------ccc
Q psy10004 81 SPKDMVILLDNSGS-MMGQRREIARHVINNLLDTLG--NNDYVNVLQFTSVCKEVV---------------------PCF 136 (354)
Q Consensus 81 ~p~dvvillD~SgS-M~g~~~~~ak~a~~~ll~~L~--~~d~v~vi~Fs~~~~~~~---------------------~c~ 136 (354)
.|--++||||+|.. ....-++.+++++...|+.|. ++.+|+||+|++.++..- |..
T Consensus 2 ~pp~~vFvID~s~~ai~~~~l~~~~~sl~~~l~~lp~~~~~~igiITf~~~V~~~~~~~~~~~~~~~v~~dl~d~f~p~~ 81 (239)
T cd01468 2 QPPVFVFVIDVSYEAIKEGLLQALKESLLASLDLLPGDPRARVGLITYDSTVHFYNLSSDLAQPKMYVVSDLKDVFLPLP 81 (239)
T ss_pred CCCEEEEEEEcchHhccccHHHHHHHHHHHHHHhCCCCCCcEEEEEEeCCeEEEEECCCCCCCCeEEEeCCCccCcCCCc
Confidence 46678999999974 444568999999999999998 889999999988664320 100
Q ss_pred cccccccChhhHHHHHHHHhhhcC-------CCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch---
Q psy10004 137 ADILVQANLANVRELKMGVEMIGD-------ANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY--- 206 (354)
Q Consensus 137 ~~~lv~~~~~n~~~~~~~i~~l~~-------~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~--- 206 (354)
.+.++. -.+.+..+.+.++++.. ......++.||..|..++.... +.-.|++++.|.++.+
T Consensus 82 ~~~l~~-~~e~~~~i~~~l~~l~~~~~~~~~~~~~~~~G~Al~~A~~ll~~~~--------~gGkI~~f~sg~pt~GpG~ 152 (239)
T cd01468 82 DRFLVP-LSECKKVIHDLLEQLPPMFWPVPTHRPERCLGPALQAAFLLLKGTF--------AGGRIIVFQGGLPTVGPGK 152 (239)
T ss_pred Cceeee-HHHHHHHHHHHHHhhhhhccccCCCCCcccHHHHHHHHHHHHhhcC--------CCceEEEEECCCCCCCCCc
Confidence 111111 12344555666666511 1234578999999999998642 1234777787877511
Q ss_pred ----------------------HHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHhcCCcEEEEeCCh
Q psy10004 207 ----------------------KEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRDVKWMACANKGYYVHLSTL 260 (354)
Q Consensus 207 ----------------------~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~ 260 (354)
.+..+.+.. .. ...+|.|..|..+.+..+...|..++..++|...+.++.
T Consensus 153 l~~~~~~~~~~~~~e~~~~~~a~~fY~~la~-~~---~~~~isvdlF~~~~~~~dl~~l~~l~~~TGG~v~~y~~f 224 (239)
T cd01468 153 LKSREDKEPIRSHDEAQLLKPATKFYKSLAK-EC---VKSGICVDLFAFSLDYVDVATLKQLAKSTGGQVYLYDSF 224 (239)
T ss_pred cccCcccccCCCccchhcccccHHHHHHHHH-HH---HHcCeEEEEEeccccccCHHHhhhhhhcCCceEEEeCCC
Confidence 011111110 00 134677777777766558899999999999977776654
No 62
>PF04811 Sec23_trunk: Sec23/Sec24 trunk domain; InterPro: IPR006896 COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger (IPR006895 from INTERPRO), an alpha/beta trunk domain, an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes the Sec23/24 alpha/beta trunk domain, which is formed from a single, approximately 250-residue segment plugged into the beta-barrel between strands beta-1 and beta-19. The trunk has an alpha/beta fold with a vWA topology, and it forms the dimer interface, primarily involving strand beta-14 on Sec23 and Sec24; in addition, the trunk domain of Sec23 contacts Sar1.; GO: 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EGD_A 2NUP_A 3EG9_A 3EFO_A 3EGX_A 2NUT_A 1PD0_A 1PD1_A 1M2V_B 1PCX_A ....
Probab=98.38 E-value=1.4e-05 Score=73.12 Aligned_cols=164 Identities=18% Similarity=0.205 Sum_probs=97.7
Q ss_pred CCceEEEEEeCCCC-CCcchHHHHHHHHHHHHHHcC--CCCeEEEEEeecccccc---------------------cccc
Q psy10004 81 SPKDMVILLDNSGS-MMGQRREIARHVINNLLDTLG--NNDYVNVLQFTSVCKEV---------------------VPCF 136 (354)
Q Consensus 81 ~p~dvvillD~SgS-M~g~~~~~ak~a~~~ll~~L~--~~d~v~vi~Fs~~~~~~---------------------~~c~ 136 (354)
.|-..+||||+|.. +...-++.+++++..+|+.|. ++.+|+||+|++.++.. .|+-
T Consensus 2 ~pp~y~FvID~s~~av~~g~~~~~~~sl~~~l~~l~~~~~~~vgiitfd~~V~~y~l~~~~~~~~~~v~~dl~~~~~p~~ 81 (243)
T PF04811_consen 2 QPPVYVFVIDVSYEAVQSGLLQSLIESLKSALDSLPGDERTRVGIITFDSSVHFYNLSSSLSQPQMIVVSDLDDPFIPLP 81 (243)
T ss_dssp S--EEEEEEE-SHHHHHHTHHHHHHHHHHHHGCTSSTSTT-EEEEEEESSSEEEEETTTTSSSTEEEEEHHTTSHHSSTS
T ss_pred CCCEEEEEEECchhhhhccHHHHHHHHHHHHHHhccCCCCcEEEEEEeCCEEEEEECCCCcCCCcccchHHHhhcccCCc
Confidence 46678999999854 344568899999999999999 88999999999876432 1222
Q ss_pred cccccccChhhHHHHHHHHhhhc--CCC-----CCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCc----
Q psy10004 137 ADILVQANLANVRELKMGVEMIG--DAN-----NIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASEN---- 205 (354)
Q Consensus 137 ~~~lv~~~~~n~~~~~~~i~~l~--~~~-----G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~---- 205 (354)
.+.++.. .+.+..+...|+.+. .+. ....++.||+.|..+|.....+ -.|++++-|.|+.
T Consensus 82 ~~llv~~-~e~~~~i~~ll~~L~~~~~~~~~~~~~~c~G~Al~~A~~ll~~~~~g--------GkI~~F~s~~pt~G~Gg 152 (243)
T PF04811_consen 82 DGLLVPL-SECRDAIEELLESLPSIFPETAGKRPERCLGSALSAALSLLSSRNTG--------GKILVFTSGPPTYGPGG 152 (243)
T ss_dssp SSSSEET-TTCHHHHHHHHHHHHHHSTT-TTB-----HHHHHHHHHHHHHHHTS---------EEEEEEESS---SSSTT
T ss_pred ccEEEEh-HHhHHHHHHHHHHhhhhcccccccCccccHHHHHHHHHHHHhccccC--------CEEEEEeccCCCCCCCc
Confidence 2222222 233344444444431 222 3457899999999999843211 2366666665431
Q ss_pred ---------------------------hHHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHhcCCcEEEEeC
Q psy10004 206 ---------------------------YKEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRDVKWMACANKGYYVHLS 258 (354)
Q Consensus 206 ---------------------------~~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~~~~G~~~~i~ 258 (354)
++++.... ...+|.|..|.++.+..+...|..++..++|..++.+
T Consensus 153 ~l~~~~~~~~~~~~~~~~~~~~~~~~fY~~la~~~--------~~~~isvDlf~~~~~~~~l~tl~~l~~~TGG~l~~y~ 224 (243)
T PF04811_consen 153 SLKKREDSSHYDTEKEKALLLPPANEFYKKLAEEC--------SKQGISVDLFVFSSDYVDLATLGPLARYTGGSLYYYP 224 (243)
T ss_dssp SS-SBTTSCCCCHCTTHHCHSHSSSHHHHHHHHHH--------HHCTEEEEEEEECSS--SHHHHTHHHHCTT-EEEEET
T ss_pred eecccccccccccccchhhhccccchHHHHHHHHH--------HhcCCEEEEEeecCCCCCcHhHHHHHHhCceeEEEeC
Confidence 11222221 1346777777777765588999999999999777777
Q ss_pred Chh
Q psy10004 259 TLA 261 (354)
Q Consensus 259 ~~~ 261 (354)
+..
T Consensus 225 ~f~ 227 (243)
T PF04811_consen 225 NFN 227 (243)
T ss_dssp TTT
T ss_pred CCC
Confidence 655
No 63
>PF04285 DUF444: Protein of unknown function (DUF444); InterPro: IPR006698 This entry is represented by Thermus phage phiYS40, Orf56. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches [].
Probab=98.38 E-value=1.1e-05 Score=78.34 Aligned_cols=184 Identities=15% Similarity=0.156 Sum_probs=117.7
Q ss_pred cccccccccceeeeccCCCce-EEEEEeCCCCCCcchHHHHHHHHHHHHHHcC-CCCeEEEEEeeccccccccccccccc
Q psy10004 64 TDLYDCRVREWYINAAASPKD-MVILLDNSGSMMGQRREIARHVINNLLDTLG-NNDYVNVLQFTSVCKEVVPCFADILV 141 (354)
Q Consensus 64 ~~~yd~r~r~wy~~a~~~p~d-vvillD~SgSM~g~~~~~ak~a~~~ll~~L~-~~d~v~vi~Fs~~~~~~~~c~~~~lv 141 (354)
++..|-|+|.|-....+.... |++++|+||||....-.+||...--|..-|. .-+.|-|+.-.+++..-
T Consensus 227 i~~~DlRyr~~~~~~~p~s~AVv~~lmDvSGSM~~~~K~lak~ff~~l~~fL~~~Y~~Ve~vfI~H~t~A~--------- 297 (421)
T PF04285_consen 227 IDPEDLRYRRWEEVPKPESNAVVFCLMDVSGSMGEFKKDLAKRFFFWLYLFLRRKYENVEIVFIRHHTEAK--------- 297 (421)
T ss_pred CCccccccccCccccCCcCcEEEEEEEeCCCCCchHHHHHHHHHHHHHHHHHHhccCceEEEEEeecCceE---------
Confidence 556799999997765554444 4578999999999989999998766665665 33456655554443220
Q ss_pred ccChhhHHHHHHHHhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCC--chHHHHHHhhhcccC
Q psy10004 142 QANLANVRELKMGVEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASE--NYKEVFEEFNWRGQN 219 (354)
Q Consensus 142 ~~~~~n~~~~~~~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~--~~~~i~~~~~~~~~~ 219 (354)
..+ -+ ..=.. ...|||-+++|++.|.+++.+... +...|-.++-+|||..- +.....+.+.. ...
T Consensus 298 EVd---Ee----~FF~~-~esGGT~vSSA~~l~~~ii~eryp----p~~wNiY~~~~SDGDN~~~D~~~~~~ll~~-~ll 364 (421)
T PF04285_consen 298 EVD---EE----EFFHS-RESGGTRVSSAYELALEIIEERYP----PSDWNIYVFHASDGDNWSSDNERCVELLEE-ELL 364 (421)
T ss_pred Eec---HH----Hhccc-CCCCCeEehHHHHHHHHHHHhhCC----hhhceeeeEEcccCccccCCCHHHHHHHHH-HHH
Confidence 001 01 11123 678999999999999999987443 34688999999999854 33334443321 111
Q ss_pred CCCCCCeEEEEEE-eCCCCC--CHHHHHHHHhcCCc-EEEEeCChhHHHHHHHHHHHH
Q psy10004 220 DSTLWPVRVFSYL-VGKEVA--DYRDVKWMACANKG-YYVHLSTLAEVRDQILSYVPV 273 (354)
Q Consensus 220 ~~~~~~v~Iftig-iG~~~~--~~~~L~~iA~~~~G-~~~~i~~~~~~~~~l~~~~~~ 273 (354)
..+..|+|+ |+.... ....++.+...... ....|.+.+++...+.++|..
T Consensus 365 ----p~~~~f~Y~Ei~~~~~~~~~~~~~~~~~~~~~f~~~~i~~~~di~~~~r~~f~~ 418 (421)
T PF04285_consen 365 ----PVCNYFGYGEITQPGRHSSWREYEELKESHDNFAMVRIREKEDIYPVFRELFKK 418 (421)
T ss_pred ----HhcCeEEEEEeccCccchHHHHHHHHhhcCCCeEEEEeCCHHHHHHHHHHHhcc
Confidence 245567665 331111 12345555433333 456788999999999998864
No 64
>KOG2807|consensus
Probab=98.36 E-value=1.4e-05 Score=73.43 Aligned_cols=174 Identities=12% Similarity=0.107 Sum_probs=116.8
Q ss_pred CceEEEEEeCCCCCCc-----chHHHHHHHHHHHHHHcC---CCCeEEEEEeecccccccccccccccccChhhHHHHHH
Q psy10004 82 PKDMVILLDNSGSMMG-----QRREIARHVINNLLDTLG---NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKM 153 (354)
Q Consensus 82 p~dvvillD~SgSM~g-----~~~~~ak~a~~~ll~~L~---~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~ 153 (354)
=+.++||||.|.+|.. .++....+.+..++..+- |-.+++||.--+..... +.. -..|.+.-..
T Consensus 60 iRhl~iviD~S~am~e~Df~P~r~a~~~K~le~Fv~eFFdQNPiSQigii~~k~g~A~~-------lt~-ltgnp~~hI~ 131 (378)
T KOG2807|consen 60 IRHLYIVIDCSRAMEEKDFRPSRFANVIKYLEGFVPEFFDQNPISQIGIISIKDGKADR-------LTD-LTGNPRIHIH 131 (378)
T ss_pred heeEEEEEEhhhhhhhccCCchHHHHHHHHHHHHHHHHhccCchhheeEEEEecchhhH-------HHH-hcCCHHHHHH
Confidence 3689999999999985 466666677777776653 44578888877654332 222 1235556666
Q ss_pred HHhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEEEEEEe
Q psy10004 154 GVEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRVFSYLV 233 (354)
Q Consensus 154 ~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~Iftigi 233 (354)
++..+..+.|.-.+..||+.|.+.|...... ..+-++|+++-=...|+.++.+.+..- ...+||+..||+
T Consensus 132 aL~~~~~~~g~fSLqNaLe~a~~~Lk~~p~H-----~sREVLii~sslsT~DPgdi~~tI~~l-----k~~kIRvsvIgL 201 (378)
T KOG2807|consen 132 ALKGLTECSGDFSLQNALELAREVLKHMPGH-----VSREVLIIFSSLSTCDPGDIYETIDKL-----KAYKIRVSVIGL 201 (378)
T ss_pred HHhcccccCCChHHHHHHHHHHHHhcCCCcc-----cceEEEEEEeeecccCcccHHHHHHHH-----HhhCeEEEEEee
Confidence 7776634778889999999999999865322 124566666555444554444443211 135899999999
Q ss_pred CCCCCCHHHHHHHHhcCCcEEEEeCChhHHHHHHHHHHHHhccchhc
Q psy10004 234 GKEVADYRDVKWMACANKGYYVHLSTLAEVRDQILSYVPVMARPLVL 280 (354)
Q Consensus 234 G~~~~~~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~~~~~l~~pl~~ 280 (354)
..+ ...-+.|+.+++|.|..+-|..- +++++...+.|.-.
T Consensus 202 saE---v~icK~l~kaT~G~Y~V~lDe~H----lkeLl~e~~~Pp~~ 241 (378)
T KOG2807|consen 202 SAE---VFICKELCKATGGRYSVALDEGH----LKELLLEHTHPPPA 241 (378)
T ss_pred chh---HHHHHHHHHhhCCeEEEEeCHHH----HHHHHHhcCCCCCc
Confidence 853 45779999999999988877665 45566666655433
No 65
>COG2304 Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [General function prediction only]
Probab=98.33 E-value=1.7e-05 Score=77.40 Aligned_cols=165 Identities=17% Similarity=0.265 Sum_probs=125.1
Q ss_pred eccCCCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHh
Q psy10004 77 NAAASPKDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVE 156 (354)
Q Consensus 77 ~a~~~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~ 156 (354)
.....+.+.++++|+||||.+..+..++.+...++..+.+.+.+.++.|........+ .. ...+...+..++.
T Consensus 32 ~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~------~~-~~~~~~~~~~~i~ 104 (399)
T COG2304 32 LDLLVPANLTLAIDTSGSMTGALLELAKSAAIELVNGLNPGDLLSIVTFAGSADVLIP------PT-GATNKESITAAID 104 (399)
T ss_pred cccccCcceEEEeccCCCccchhHHHHHHHHHHHhcccCCCCceEEEEecCCcceecC------cc-cccCHHHHHHHHh
Confidence 3345789999999999999998899999999999999999999999999995544332 11 2356778888888
Q ss_pred h-hcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCC----chHHHHHHhhhcccCCCCCCCeEEEEE
Q psy10004 157 M-IGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASE----NYKEVFEEFNWRGQNDSTLWPVRVFSY 231 (354)
Q Consensus 157 ~-l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~----~~~~i~~~~~~~~~~~~~~~~v~Ifti 231 (354)
. + .+.|.|....++..+++.+..... ......+.+.|||..+ +...+....+ .. ...++.+.++
T Consensus 105 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~tdg~~~~~~~d~~~~~~~~~--~~---~~~~i~~~~~ 173 (399)
T COG2304 105 QSL-QAGGATAVEASLSLAVELAAKALP-----RGTLNRILLLTDGENNLGLVDPSRLSALAK--LA---AGKGIVLDTL 173 (399)
T ss_pred hhh-ccccccHHHHHHHHHHHHhhhcCC-----ccceeeEeeeccCccccCCCCHHHHHHHhc--cc---ccCceEEEEE
Confidence 7 6 999999999999999998876432 2344669999999875 2333333322 11 2468999999
Q ss_pred EeCCCCCCHHHHHHHHhcCCcEEEEeCCh
Q psy10004 232 LVGKEVADYRDVKWMACANKGYYVHLSTL 260 (354)
Q Consensus 232 giG~~~~~~~~L~~iA~~~~G~~~~i~~~ 260 (354)
|+|.+. +..++..++....|....+...
T Consensus 174 g~~~~~-n~~~~~~~~~~~~g~l~~~~~~ 201 (399)
T COG2304 174 GLGDDV-NEDELTGIAAAANGNLAFIYLS 201 (399)
T ss_pred eccccc-chhhhhhhhhccCcccccccCc
Confidence 999986 6777878888877766655444
No 66
>cd01479 Sec24-like Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 24 is very similar to Sec23. The Sec23 and Sec24
Probab=98.32 E-value=6.6e-05 Score=68.73 Aligned_cols=184 Identities=17% Similarity=0.174 Sum_probs=110.9
Q ss_pred CCceEEEEEeCCCCCC-cchHHHHHHHHHHHHHHcCCC---CeEEEEEeecccccc---------------------ccc
Q psy10004 81 SPKDMVILLDNSGSMM-GQRREIARHVINNLLDTLGNN---DYVNVLQFTSVCKEV---------------------VPC 135 (354)
Q Consensus 81 ~p~dvvillD~SgSM~-g~~~~~ak~a~~~ll~~L~~~---d~v~vi~Fs~~~~~~---------------------~~c 135 (354)
.|--++||||+|-.-- ..-++.+++++...|+.+.++ -+|+||+|++.++.. +|.
T Consensus 2 ~pp~~~FvIDvs~~a~~~g~~~~~~~si~~~L~~lp~~~~~~~VgiITfd~~v~~y~l~~~~~~~q~~vv~dl~d~f~P~ 81 (244)
T cd01479 2 QPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPGDDPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDLDDPFLPL 81 (244)
T ss_pred CCCEEEEEEEccHHHHhhChHHHHHHHHHHHHHhcCCCCCCeEEEEEEECCeEEEEECCCCCCCCeEEEeeCcccccCCC
Confidence 4678899999976432 124788999999999999866 899999999976432 111
Q ss_pred ccccccccChhhHHHHHHHHhhhc-----CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch----
Q psy10004 136 FADILVQANLANVRELKMGVEMIG-----DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY---- 206 (354)
Q Consensus 136 ~~~~lv~~~~~n~~~~~~~i~~l~-----~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~---- 206 (354)
.++.++. -.+.+..+...|+++. ......-++.||..|..+|.... -.|++++.|.|+.+
T Consensus 82 ~~~~lv~-l~e~~~~i~~lL~~L~~~~~~~~~~~~c~G~Al~~A~~lL~~~G----------GkIi~f~s~~pt~GpG~l 150 (244)
T cd01479 82 PDGLLVN-LKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLKETG----------GKIIVFQSSLPTLGAGKL 150 (244)
T ss_pred Ccceeec-HHHHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHhcC----------CEEEEEeCCCCCcCCccc
Confidence 1111111 1344555566666551 12235678999999999998421 23777777876511
Q ss_pred ------------HH------HHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHhcCCcEEEEeC--Ch---hHH
Q psy10004 207 ------------KE------VFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRDVKWMACANKGYYVHLS--TL---AEV 263 (354)
Q Consensus 207 ------------~~------i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~~~~G~~~~i~--~~---~~~ 263 (354)
.+ ..+-|+ .-+.+....+|.|..|.+..+..+...|..++..++|...+.+ .. .+.
T Consensus 151 ~~~~~~~~~~~~~e~~~~~p~~~fY~-~la~~~~~~~isvDlF~~~~~~~dla~l~~l~~~TGG~v~~y~~~~~~~~~d~ 229 (244)
T cd01479 151 KSREDPKLLSTDKEKQLLQPQTDFYK-KLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYPSFNFSAPNDV 229 (244)
T ss_pred ccCccccccCchhhhhhcCcchHHHH-HHHHHHHHcCeEEEEEEccCcccChhhhhhhhhhcCceEEEECCccCCchhhH
Confidence 00 000111 0000012356777777766654588899999999999666655 22 233
Q ss_pred HHHHHHHHHHhcc
Q psy10004 264 RDQILSYVPVMAR 276 (354)
Q Consensus 264 ~~~l~~~~~~l~~ 276 (354)
..-..++.+.+.|
T Consensus 230 ~kl~~dl~~~ltr 242 (244)
T cd01479 230 EKLVNELARYLTR 242 (244)
T ss_pred HHHHHHHHHHhcc
Confidence 3334444555544
No 67
>COG4547 CobT Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole phosphoribosyltransferase) [Coenzyme metabolism]
Probab=98.19 E-value=7.7e-06 Score=78.48 Aligned_cols=141 Identities=25% Similarity=0.289 Sum_probs=88.1
Q ss_pred ceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcC-CCCeEEEEEeecccccccc---cccccccccChhhHHHHHHHHh--
Q psy10004 83 KDMVILLDNSGSMMGQRREIARHVINNLLDTLG-NNDYVNVLQFTSVCKEVVP---CFADILVQANLANVRELKMGVE-- 156 (354)
Q Consensus 83 ~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~-~~d~v~vi~Fs~~~~~~~~---c~~~~lv~~~~~n~~~~~~~i~-- 156 (354)
.-|-+|||.||||.|.++..|...+-.|..+|. .+-.+-|..|..++=.-.+ .|-..--++.+.-+..+..-+.
T Consensus 414 tvVtlviDnSGSMrGRpItvAatcAdilArtLeRcgVk~eIlGFTT~awkGg~sre~wlk~Gkp~~pgrlndlrhiiyks 493 (620)
T COG4547 414 TVVTLVIDNSGSMRGRPITVAATCADILARTLERCGVKVEILGFTTKAWKGGQSRETWLKRGKPAFPGRLNDLRHIIYKS 493 (620)
T ss_pred hhheeeeccCCCcCCcceehhHHHHHHHHHHHHHcCCceEEeeeeeccccCCccHHHHHhcCCCCCchhhhhHHHHHHhc
Confidence 345699999999999999999999988989986 7788889999875410000 0100011122222222222221
Q ss_pred ----------hhc--CCCC--CCcH-HHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch---------------
Q psy10004 157 ----------MIG--DANN--IANF-TVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY--------------- 206 (354)
Q Consensus 157 ----------~l~--~~~G--~T~~-~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~--------------- 206 (354)
+++ .-.| .-|+ +.||-.|.+.|-...+ -.+++++||||.|-+.
T Consensus 494 AdaPwrRARrnlGlmmreglLkeNiDGEal~wah~rl~gRpE-------qrkIlmmiSDGAPvddstlsvnpGnylerHL 566 (620)
T COG4547 494 ADAPWRRARRNLGLMMREGLLKENIDGEALMWAHQRLIGRPE-------QRKILMMISDGAPVDDSTLSVNPGNYLERHL 566 (620)
T ss_pred cCCHHHHHHhhcchhhhcchhhccCChHHHHHHHHHHhcChh-------hceEEEEecCCCcccccccccCCchHHHHHH
Confidence 120 0011 1233 6788888887765443 3489999999997521
Q ss_pred HHHHHHhhhcccCCCCCCCeEEEEEEeCCCC
Q psy10004 207 KEVFEEFNWRGQNDSTLWPVRVFSYLVGKEV 237 (354)
Q Consensus 207 ~~i~~~~~~~~~~~~~~~~v~IftigiG~~~ 237 (354)
..+++++. +..+|.+.+||||.++
T Consensus 567 RaVieeIE-------trSpveLlAIGighDv 590 (620)
T COG4547 567 RAVIEEIE-------TRSPVELLAIGIGHDV 590 (620)
T ss_pred HHHHHHHh-------cCCchhheeeeccccc
Confidence 34455553 4568999999999887
No 68
>PF06707 DUF1194: Protein of unknown function (DUF1194); InterPro: IPR010607 This family consists of several hypothetical Rhizobiales specific proteins of around 270 residues in length. The function of this family is unknown.
Probab=98.03 E-value=0.00093 Score=58.91 Aligned_cols=177 Identities=15% Similarity=0.156 Sum_probs=110.5
Q ss_pred CceEEEEEeCCCCCCcchHHHHHHHHHHHH------HHcC--C--CCeEEEEEeecc--cccccccccccccccChhhHH
Q psy10004 82 PKDMVILLDNSGSMMGQRREIARHVINNLL------DTLG--N--NDYVNVLQFTSV--CKEVVPCFADILVQANLANVR 149 (354)
Q Consensus 82 p~dvvillD~SgSM~g~~~~~ak~a~~~ll------~~L~--~--~d~v~vi~Fs~~--~~~~~~c~~~~lv~~~~~n~~ 149 (354)
...+++.+|.|+||....+.+-++.+...+ +.+. + .-.++++.|++. ...++| |. ...++...+
T Consensus 3 dlaLvLavDvS~SVD~~E~~lQ~~G~A~Al~dp~V~~Ai~~g~~g~Iav~~~eWsg~~~q~~~v~-Wt---~i~~~~da~ 78 (205)
T PF06707_consen 3 DLALVLAVDVSGSVDADEYRLQREGYAAALRDPEVIAAILSGPIGRIAVAVVEWSGPGRQRVVVP-WT---RIDSPADAE 78 (205)
T ss_pred cceeeeeeeccCCCCHHHHHHHHHHHHHHHCCHHHHHHHhcCCCCeEEEEEEEecCCCCceEEeC-CE---EeCCHHHHH
Confidence 457899999999999877666665544332 2232 2 335778888873 444555 42 334667777
Q ss_pred HHHHHHhhhc-CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch-H---HHHHHhhhcccCCCCCC
Q psy10004 150 ELKMGVEMIG-DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY-K---EVFEEFNWRGQNDSTLW 224 (354)
Q Consensus 150 ~~~~~i~~l~-~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~-~---~i~~~~~~~~~~~~~~~ 224 (354)
.+...|.... ...+.|.++.||..|..+|.+... ...+++|=+-.||..|.. . ...... ...
T Consensus 79 a~A~~l~~~~r~~~~~Taig~Al~~a~~ll~~~~~-----~~~RrVIDvSGDG~~N~G~~p~~~ard~~--------~~~ 145 (205)
T PF06707_consen 79 AFAARLRAAPRRFGGRTAIGSALDFAAALLAQNPF-----ECWRRVIDVSGDGPNNQGPRPVTSARDAA--------VAA 145 (205)
T ss_pred HHHHHHHhCCCCCCCCchHHHHHHHHHHHHHhCCC-----CCceEEEEECCCCCCCCCCCccHHHHHHH--------HHC
Confidence 7888887662 345669999999999999986422 134567778889998865 2 223322 135
Q ss_pred CeEEEEEEeCCCCC-CHHHHHHH--Hhc--CCc-EEEEeCChhHHHHHH-HHHHHHhc
Q psy10004 225 PVRVFSYLVGKEVA-DYRDVKWM--ACA--NKG-YYVHLSTLAEVRDQI-LSYVPVMA 275 (354)
Q Consensus 225 ~v~IftigiG~~~~-~~~~L~~i--A~~--~~G-~~~~i~~~~~~~~~l-~~~~~~l~ 275 (354)
+|.|..+.|+.... ....|..- .|- +.| +...+.+.++..++| ++++..|+
T Consensus 146 GitINgL~I~~~~~~~~~~L~~yy~~~VIgGpgAFV~~a~~~~df~~AirrKL~rEi~ 203 (205)
T PF06707_consen 146 GITINGLAILDDDPFGGADLDAYYRRCVIGGPGAFVETARGFEDFAEAIRRKLIREIA 203 (205)
T ss_pred CeEEeeeEecCCCCCccccHHHHHhhhcccCCCceEEEcCCHHHHHHHHHHHHHHHhc
Confidence 89999999987542 11122221 122 223 555667777776665 34444443
No 69
>cd01478 Sec23-like Sec23-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 23 is very similar to Sec24. The Sec23 and Sec24
Probab=98.02 E-value=0.00037 Score=64.63 Aligned_cols=171 Identities=15% Similarity=0.146 Sum_probs=105.1
Q ss_pred CCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeeccccccc----cc-----ccc-------------
Q psy10004 81 SPKDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVV----PC-----FAD------------- 138 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~----~c-----~~~------------- 138 (354)
.|--++||||+|- ....++.+++++...|+.|.++.+|++|+|++.++..- .| |++
T Consensus 2 ~pp~~vFviDvs~--~~~el~~l~~sl~~~L~~lP~~a~VGlITfd~~V~~~~L~~~~~~~~~vf~g~~~~~~~~~~~~l 79 (267)
T cd01478 2 SPPVFLFVVDTCM--DEEELDALKESLIMSLSLLPPNALVGLITFGTMVQVHELGFEECSKSYVFRGNKDYTAKQIQDML 79 (267)
T ss_pred CCCEEEEEEECcc--CHHHHHHHHHHHHHHHHhCCCCCEEEEEEECCEEEEEEcCCCcCceeeeccCCccCCHHHHHHHh
Confidence 4667899999976 46678999999999999999999999999999875421 00 000
Q ss_pred ---------------------------cccccChhhHHHHHHHHhhhcCCC---------CCCcHHHHHHHHHHHHHhcc
Q psy10004 139 ---------------------------ILVQANLANVRELKMGVEMIGDAN---------NIANFTVALTRAFNILENAR 182 (354)
Q Consensus 139 ---------------------------~lv~~~~~n~~~~~~~i~~l~~~~---------G~T~~~~aL~~A~~~l~~~~ 182 (354)
.+...-.+....+...|+.+ .+. ....++.||..|..+|....
T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~p~~~~~flvpl~e~~~~i~~lLe~L-~~~~~~~~~~~r~~r~~G~Al~~A~~ll~~~~ 158 (267)
T cd01478 80 GLGGPAMRPSASQHPGAGNPLPSAAASRFLLPVSQCEFTLTDLLEQL-QPDPWPVPAGHRPLRCTGVALSIAVGLLEACF 158 (267)
T ss_pred ccccccccccccCcCCccccccccccccEEEEHHHHHHHHHHHHHhC-cccccccCCCCCCCCchHHHHHHHHHHHHhhc
Confidence 11111123344555666666 332 24578999999999997431
Q ss_pred ccCCCCCCCcceEEEeccCCCCch---------H------------------HHHHHhhhcccCCCCCCCeEEEEEEeCC
Q psy10004 183 NDKKTGADCNQAIMVVTDGASENY---------K------------------EVFEEFNWRGQNDSTLWPVRVFSYLVGK 235 (354)
Q Consensus 183 ~~~~~~~~~~~~IillTDG~~~~~---------~------------------~i~~~~~~~~~~~~~~~~v~IftigiG~ 235 (354)
. +.--.|++++-|-++.+ + ...+-|+ ..+.+....++.|-.|..+.
T Consensus 159 ~------~~gGki~~F~sg~pT~GpG~l~~r~~~~~~r~~~d~~~~~~~~~~~a~~fY~-~la~~~~~~~vsvDlF~~s~ 231 (267)
T cd01478 159 P------NTGARIMLFAGGPCTVGPGAVVSTELKDPIRSHHDIDKDNAKYYKKAVKFYD-SLAKRLAANGHAVDIFAGCL 231 (267)
T ss_pred C------CCCcEEEEEECCCCCCCCceeeccccccccccccccccchhhhhhhHHHHHH-HHHHHHHhCCeEEEEEeccc
Confidence 1 11124667777755411 0 0001111 00000012345555555555
Q ss_pred CCCCHHHHHHHHhcCCcEEEEeCChh
Q psy10004 236 EVADYRDVKWMACANKGYYVHLSTLA 261 (354)
Q Consensus 236 ~~~~~~~L~~iA~~~~G~~~~i~~~~ 261 (354)
+..+...|+.++..++|..++.++..
T Consensus 232 d~vglaem~~l~~~TGG~v~~~~~f~ 257 (267)
T cd01478 232 DQVGLLEMKVLVNSTGGHVVLSDSFT 257 (267)
T ss_pred cccCHHHHHHHHHhcCcEEEEeCCcc
Confidence 44589999999999999887777653
No 70
>PF11443 DUF2828: Domain of unknown function (DUF2828); InterPro: IPR024553 This uncharacterised domain is found in eukaryotic, bacterial and viral proteins.
Probab=97.97 E-value=4.8e-05 Score=76.20 Aligned_cols=110 Identities=17% Similarity=0.204 Sum_probs=78.3
Q ss_pred CCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCe-EEEEEeecccccccccccccccccChhhHHHHHHHHhhhc
Q psy10004 81 SPKDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDY-VNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIG 159 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~-v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~ 159 (354)
+-.+.+.|.|+||||.|.+++.+..- .-++..+..+.+ =.+|+|++.++. +.....++.+-.+.+..+
T Consensus 339 ~l~n~iav~DvSGSM~~~pm~vaiaL-gll~ae~~~~pf~~~~ITFs~~P~~---------~~i~g~~l~ekv~~~~~~- 407 (534)
T PF11443_consen 339 SLENCIAVCDVSGSMSGPPMDVAIAL-GLLIAELNKGPFKGRFITFSENPQL---------HKIKGDTLREKVRFIRRM- 407 (534)
T ss_pred CccceEEEEecCCccCccHHHHHHHH-HHHHHHhcccccCCeEEeecCCceE---------EEecCCCHHHHHHHHHhC-
Confidence 34799999999999999988876654 345556654443 348999999754 333444666666677777
Q ss_pred CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCC
Q psy10004 160 DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASE 204 (354)
Q Consensus 160 ~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~ 204 (354)
.-++.||+.+++...++...+.+- .+..-.+.|+++||=+.+
T Consensus 408 ~wg~nTn~~aVFdlIL~~Av~~~l---~~e~M~k~lfV~SDMeFD 449 (534)
T PF11443_consen 408 DWGMNTNFQAVFDLILETAVKNKL---KQEDMPKRLFVFSDMEFD 449 (534)
T ss_pred CcccCCcHHHHHHHHHHHHHHcCC---ChHHCCceEEEEeccccc
Confidence 778999999999887776554332 234567889999986643
No 71
>PF03731 Ku_N: Ku70/Ku80 N-terminal alpha/beta domain; InterPro: IPR005161 The Ku heterodimer (composed of Ku70 P12956 from SWISSPROT and Ku80 P13010 from SWISSPROT) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the N-terminal alpha/beta domain. This domain only makes a small contribution to the dimer interface. The domain comprises a six stranded beta sheet of the Rossman fold [].; PDB: 1JEQ_A 1JEY_A.
Probab=97.92 E-value=8e-05 Score=67.10 Aligned_cols=143 Identities=16% Similarity=0.194 Sum_probs=78.3
Q ss_pred EEEEEeCCCCCCc------chHHHHHHHHHHHHHHc---CCCCeEEEEEeecccccc---cccccc-----cccccChhh
Q psy10004 85 MVILLDNSGSMMG------QRREIARHVINNLLDTL---GNNDYVNVLQFTSVCKEV---VPCFAD-----ILVQANLAN 147 (354)
Q Consensus 85 vvillD~SgSM~g------~~~~~ak~a~~~ll~~L---~~~d~v~vi~Fs~~~~~~---~~c~~~-----~lv~~~~~n 147 (354)
++||||.|.||.. ..++.|.+++..++... ++.|.|||+.|+...+.. ...|.+ .+.+.+.+.
T Consensus 2 ~vflID~s~sM~~~~~~~~~~l~~al~~i~~~~~~ki~~~~kD~vgvvl~gt~~t~n~~~~~~~~~i~~l~~l~~~~~~~ 81 (224)
T PF03731_consen 2 TVFLIDVSPSMFEPSSESESPLEEALKAIEDLMQQKIISSPKDEVGVVLFGTDETNNPDEDSGYENIFVLQPLDPPSAER 81 (224)
T ss_dssp EEEEEE-SCGGGS-BTTCS-HHHHHHHHHHHHHHHHHHTT---EEEEEEES-SS-BST-TTT-STTEEEEEECC--BHHH
T ss_pred EEEEEECCHHHCCCCCCcchhHHHHHHHHHHHHHHHHcCCCCCeEEEEEEcCCCCCCcccccCCCceEEeecCCccCHHH
Confidence 7899999999972 35888888888887662 477999999999876543 222332 122233333
Q ss_pred HHHHHHHHhh------hcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCc-----hHHHHHHhhhc
Q psy10004 148 VRELKMGVEM------IGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASEN-----YKEVFEEFNWR 216 (354)
Q Consensus 148 ~~~~~~~i~~------l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~-----~~~i~~~~~~~ 216 (354)
+..+.+.+.. ........++..||..|.++|.+.... .....+.|+|+||+.... ...+...++..
T Consensus 82 l~~L~~~~~~~~~~~~~~~~~~~~~l~~al~v~~~~~~~~~~~---~k~~~krI~l~Td~d~p~~~~~~~~~~~~~l~~~ 158 (224)
T PF03731_consen 82 LKELEELLKPGDKFENFFSGSDEGDLSDALWVASDMFRERTCK---KKKNKKRIFLFTDNDGPHEDDDELERIIQKLKAK 158 (224)
T ss_dssp HHHHHTTSHHHHHHHHHC-SSS---HHHHHHHHHHHHHCHCTT---S-ECEEEEEEEES-SSTTT-CCCHHHHHHHHHHH
T ss_pred HHHHHHhhcccccccccCCCCCccCHHHHHHHHHHHHHHHhhc---ccCCCcEEEEEeCCCCCCCCHHHHHHHHHhhccc
Confidence 4443333332 101234458999999999999753221 234567899999986543 24444441111
Q ss_pred ccCCCCCCCeEEEEEEe
Q psy10004 217 GQNDSTLWPVRVFSYLV 233 (354)
Q Consensus 217 ~~~~~~~~~v~Iftigi 233 (354)
+. ...+|.+-.+.+
T Consensus 159 Dl---~~~~i~~~~~~l 172 (224)
T PF03731_consen 159 DL---QDNGIEIELFFL 172 (224)
T ss_dssp HH---HHHTEEEEEEEC
T ss_pred cc---hhcCcceeEeec
Confidence 11 124677777777
No 72
>COG2718 Uncharacterized conserved protein [Function unknown]
Probab=97.86 E-value=0.00025 Score=67.08 Aligned_cols=183 Identities=13% Similarity=0.101 Sum_probs=110.6
Q ss_pred cccccccccceeeeccCCCceEE-EEEeCCCCCCcchHHHHHHHHHHHHHHcC-CCCeEEEEEeeccccccccccccccc
Q psy10004 64 TDLYDCRVREWYINAAASPKDMV-ILLDNSGSMMGQRREIARHVINNLLDTLG-NNDYVNVLQFTSVCKEVVPCFADILV 141 (354)
Q Consensus 64 ~~~yd~r~r~wy~~a~~~p~dvv-illD~SgSM~g~~~~~ak~a~~~ll~~L~-~~d~v~vi~Fs~~~~~~~~c~~~~lv 141 (354)
++ +|-|++.|-....+..+-|+ -|+|+||||....-.+||.+.--|---|. .-+.+-|+...++.... -|
T Consensus 228 ~d-~DlRf~~~~~~p~pes~AVmfclMDvSGSM~~~~KdlAkrFF~lL~~FL~~kYenveivfIrHht~A~-------EV 299 (423)
T COG2718 228 ID-EDLRYKRYEKVPKPESNAVMFCLMDVSGSMDQSEKDLAKRFFFLLYLFLRRKYENVEIVFIRHHTEAK-------EV 299 (423)
T ss_pred cc-cchhccCCccccCCccceEEEEEEecCCCcchHHHHHHHHHHHHHHHHHhcccceeEEEEEeecCcce-------ec
Confidence 45 89999988776655555554 57999999998888999998755544454 44566666666554221 00
Q ss_pred ccChhhHHHHHHHHhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCC--chHHHHHHhhhcccC
Q psy10004 142 QANLANVRELKMGVEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASE--NYKEVFEEFNWRGQN 219 (354)
Q Consensus 142 ~~~~~n~~~~~~~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~--~~~~i~~~~~~~~~~ 219 (354)
+ ... +. .. +-.|||-++.||+.+.+++.+... ....+-...-.+||..- +....+..+. +...
T Consensus 300 --d---E~d---FF-~~-~esGGTivSSAl~~m~evi~ErYp----~aeWNIY~fqaSDGDN~~dDserc~~ll~-~~im 364 (423)
T COG2718 300 --D---ETD---FF-YS-QESGGTIVSSALKLMLEVIKERYP----PAEWNIYAFQASDGDNWADDSERCVELLA-KKLM 364 (423)
T ss_pred --c---hhh---ce-ee-cCCCCeEeHHHHHHHHHHHHhhCC----hhheeeeeeeecCCccccCCCHHHHHHHH-HHHH
Confidence 1 011 11 12 678999999999999999988443 34688899999999864 2233322222 1111
Q ss_pred CCCCCCeEEEEEEeCCCCCCHHH--HHHHHhcC-CcEEEEeCChhHHHHHHHHHHHH
Q psy10004 220 DSTLWPVRVFSYLVGKEVADYRD--VKWMACAN-KGYYVHLSTLAEVRDQILSYVPV 273 (354)
Q Consensus 220 ~~~~~~v~IftigiG~~~~~~~~--L~~iA~~~-~G~~~~i~~~~~~~~~l~~~~~~ 273 (354)
..+..|+|+==..-..... .+.+-... +=.+.+|..++|+...+.++|..
T Consensus 365 ----~~~~~y~Y~Eitq~~~H~t~~y~~~~~~~dnFa~~~I~~~~Diypvfr~lf~k 417 (423)
T COG2718 365 ----PVVQYYGYIEITQRRTHQTLEYEALQGVFDNFAMQTIREPDDIYPVFRELFSK 417 (423)
T ss_pred ----HhhhheEEEeeeecccchhhhhhhhhccCcchheeeecCHHHHHHHHHHHHhc
Confidence 2344444431111001112 22221111 12456788999999999988864
No 73
>PLN00162 transport protein sec23; Provisional
Probab=97.82 E-value=0.0015 Score=69.18 Aligned_cols=185 Identities=14% Similarity=0.146 Sum_probs=111.2
Q ss_pred cCCCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeeccccccc-c---c-----ccc-----------
Q psy10004 79 AASPKDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVV-P---C-----FAD----------- 138 (354)
Q Consensus 79 ~~~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~-~---c-----~~~----------- 138 (354)
++.|--++||||+| +....++..|.++...|+.|.++.+|++|+|++.++..- . | |.+
T Consensus 121 ~~~pp~fvFvID~s--~~~~~l~~lk~sl~~~L~~LP~~a~VGlITF~s~V~~~~L~~~~~~~~~Vf~g~k~~t~~~l~~ 198 (761)
T PLN00162 121 APSPPVFVFVVDTC--MIEEELGALKSALLQAIALLPENALVGLITFGTHVHVHELGFSECSKSYVFRGNKEVSKDQILE 198 (761)
T ss_pred CCCCcEEEEEEecc--hhHHHHHHHHHHHHHHHHhCCCCCEEEEEEECCEEEEEEcCCCCCcceEEecCCccCCHHHHHH
Confidence 45677899999988 556778899999999999999999999999999874321 0 0 100
Q ss_pred -----------------------------cccccChhhHHHHHHHHhhhcCC------CC---CCcHHHHHHHHHHHHHh
Q psy10004 139 -----------------------------ILVQANLANVRELKMGVEMIGDA------NN---IANFTVALTRAFNILEN 180 (354)
Q Consensus 139 -----------------------------~lv~~~~~n~~~~~~~i~~l~~~------~G---~T~~~~aL~~A~~~l~~ 180 (354)
.+...-.+....+...|+.+ .. .+ ...++.||..|..+|..
T Consensus 199 ~l~l~~~~~~~~~~~~~~~~~~~~~p~~~~fLvpl~e~~~~i~~lLe~L-~~~~~~~~~~~rp~r~tG~AL~vA~~lL~~ 277 (761)
T PLN00162 199 QLGLGGKKRRPAGGGIAGARDGLSSSGVNRFLLPASECEFTLNSALEEL-QKDPWPVPPGHRPARCTGAALSVAAGLLGA 277 (761)
T ss_pred HhccccccccccccccccccccccCCCccceeEEHHHHHHHHHHHHHhh-hccccccCCCCCCCccHHHHHHHHHHHHhh
Confidence 00000112223444555555 22 22 35689999999999974
Q ss_pred ccccCCCCCCCcceEEEeccCCCCch---------H-------H-----------HHHHhhhcccCCCCCCCeEEEEEEe
Q psy10004 181 ARNDKKTGADCNQAIMVVTDGASENY---------K-------E-----------VFEEFNWRGQNDSTLWPVRVFSYLV 233 (354)
Q Consensus 181 ~~~~~~~~~~~~~~IillTDG~~~~~---------~-------~-----------i~~~~~~~~~~~~~~~~v~Iftigi 233 (354)
... +..-.|++++-|.++.+ + + ..+-|+ ..+.+....++.|-.|..
T Consensus 278 ~~~------~~gGrI~~F~sgppT~GpG~v~~r~~~~~~rsh~di~k~~~~~~~~a~~fY~-~la~~~~~~gisvDlF~~ 350 (761)
T PLN00162 278 CVP------GTGARIMAFVGGPCTEGPGAIVSKDLSEPIRSHKDLDKDAAPYYKKAVKFYE-GLAKQLVAQGHVLDVFAC 350 (761)
T ss_pred ccC------CCceEEEEEeCCCCCCCCceeecccccccccCccccccchhhhcchHHHHHH-HHHHHHHHcCceEEEEEc
Confidence 321 11223666666765411 0 0 000111 000000124555555555
Q ss_pred CCCCCCHHHHHHHHhcCCcEEEEeCCh--hHHHHHHHHHHHH
Q psy10004 234 GKEVADYRDVKWMACANKGYYVHLSTL--AEVRDQILSYVPV 273 (354)
Q Consensus 234 G~~~~~~~~L~~iA~~~~G~~~~i~~~--~~~~~~l~~~~~~ 273 (354)
+.+-.+...|+.++..+||..+..++. +...+.+.+++..
T Consensus 351 s~dqvglaem~~l~~~TGG~v~~~~sF~~~~f~~~l~r~~~r 392 (761)
T PLN00162 351 SLDQVGVAEMKVAVERTGGLVVLAESFGHSVFKDSLRRVFER 392 (761)
T ss_pred cccccCHHHHhhhHhhcCcEEEEeCCcChHHHHHHHHHHhcc
Confidence 555458999999999999976666654 4455666666663
No 74
>PF07002 Copine: Copine; InterPro: IPR010734 This represents a conserved region approximately 180 residues long within eukaryotic copines. Copines are Ca2+-dependent phospholipid-binding proteins that are thought to be involved in membrane-trafficking, and may also be involved in cell division and growth [].
Probab=97.81 E-value=0.00045 Score=58.02 Aligned_cols=127 Identities=14% Similarity=0.224 Sum_probs=91.3
Q ss_pred chHHHHHHHHHHHHHHcCCCCeEEEEEeecccc---cccccccccc------cccChhhHHHHHHHHhhhcCCCCCCcHH
Q psy10004 98 QRREIARHVINNLLDTLGNNDYVNVLQFTSVCK---EVVPCFADIL------VQANLANVRELKMGVEMIGDANNIANFT 168 (354)
Q Consensus 98 ~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~---~~~~c~~~~l------v~~~~~n~~~~~~~i~~l~~~~G~T~~~ 168 (354)
...+.|..++-.+|.....+..|-+..|+.... .+-.||.=.. +..-..-++.-++.+.++ ...|-|++.
T Consensus 11 N~Y~~ai~~vg~il~~Yd~dk~~p~~GFGa~~~~~~~vsh~F~ln~~~~~p~~~Gi~gvl~~Y~~~~~~v-~l~GPT~fa 89 (146)
T PF07002_consen 11 NPYQQAIRAVGEILQDYDSDKMIPAYGFGAKIPPDYSVSHCFPLNGNPQNPECQGIDGVLEAYRKALPKV-QLSGPTNFA 89 (146)
T ss_pred CHHHHHHHHHHHHHHhhccCCccceeccCCcCCCCcccccceeeecCCCCCcccCHHHHHHHHHHHhhhe-EECCCccHH
Confidence 456778888888888888888999999998743 2234543111 111122335556677788 889999999
Q ss_pred HHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEEEEEEeC
Q psy10004 169 VALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRVFSYLVG 234 (354)
Q Consensus 169 ~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~IftigiG 234 (354)
.-+++|.+...... .......+++++|||..++.++..+++.. + ...++.|..+|+|
T Consensus 90 piI~~a~~~a~~~~----~~~~~Y~iLlIlTDG~i~D~~~T~~aIv~--A---S~~PlSIIiVGVG 146 (146)
T PF07002_consen 90 PIINHAAKIAKQSN----QNGQQYFILLILTDGQITDMEETIDAIVE--A---SKLPLSIIIVGVG 146 (146)
T ss_pred HHHHHHHHHHhhhc----cCCceEEEEEEecccccccHHHHHHHHHH--H---ccCCeEEEEEEeC
Confidence 99999999887521 23455689999999999999888887642 2 2468999999998
No 75
>TIGR00627 tfb4 transcription factor tfb4. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=97.73 E-value=0.0023 Score=59.33 Aligned_cols=181 Identities=12% Similarity=0.143 Sum_probs=100.5
Q ss_pred ceEEEEEeCCCCCCc--------chHHHHHHHHHHHHHH---cCCCCeEEEEEeecccc-ccccccccc---c-------
Q psy10004 83 KDMVILLDNSGSMMG--------QRREIARHVINNLLDT---LGNNDYVNVLQFTSVCK-EVVPCFADI---L------- 140 (354)
Q Consensus 83 ~dvvillD~SgSM~g--------~~~~~ak~a~~~ll~~---L~~~d~v~vi~Fs~~~~-~~~~c~~~~---l------- 140 (354)
.=+++|||++.---| ..+..+..++..+++. +....+++||+...... .+.|--... .
T Consensus 3 slL~vvlD~np~~W~~~~~~~~~~~l~~~l~sllvF~NahL~l~~~N~vaVIAs~~~~~~~LYps~~~~~~~~~~~~~~~ 82 (279)
T TIGR00627 3 SLLVVIIEANPCSWGMLALAHGKRTISKVLRAIVVFLNAHLAFNANNKLAVIASHSQDNKYLYPSTRCEDRNASELDPKR 82 (279)
T ss_pred cEEEEEEeCCHHHHHHHhhccCCCcHHHHHHHHHHHHHHHHhcCccCCEEEEEecCCcceEEecCCcccccccccccccc
Confidence 346788888764321 1344444455455544 56888999997764432 222210000 0
Q ss_pred c---------ccChhhHHHHHHHHhhhc---CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCC--ch
Q psy10004 141 V---------QANLANVRELKMGVEMIG---DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASE--NY 206 (354)
Q Consensus 141 v---------~~~~~n~~~~~~~i~~l~---~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~--~~ 206 (354)
. .....-.+.+++.+.... ...+.+.+..||..|+-.+++..............|++++-+... .+
T Consensus 83 ~~~~~y~~f~~v~~~v~~~l~~l~~~~~~~~~~~~~s~lagals~ALcyinr~~~~~~~~~~~~~RIlii~~s~~~~~qY 162 (279)
T TIGR00627 83 LRELLYRDFRTVDETIVEEIKPLMAHADKHMKKDSRTVLAGALSDALGYINRSEQSETASEKLKSRILVISITPDMALQY 162 (279)
T ss_pred ccchhccchhHHHHHHHHHHHHHHhhchhcccccccccchhHHHhhhhhhcccccccccCcCCcceEEEEECCCCchHHH
Confidence 0 000011233333333321 122566799999999988876432211122334456666554332 23
Q ss_pred HHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHhcCCcEEEEeCChhHHHHHHHH
Q psy10004 207 KEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRDVKWMACANKGYYVHLSTLAEVRDQILS 269 (354)
Q Consensus 207 ~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~ 269 (354)
-.+.+.+... .+.+|+|-+++++.+. +...|++++..|+|.|..+.+.+.+.+-|..
T Consensus 163 i~~mn~Ifaa-----qk~~I~Idv~~L~~e~-~~~~lqQa~~~TgG~Y~~~~~~~~L~q~L~~ 219 (279)
T TIGR00627 163 IPLMNCIFSA-----QKQNIPIDVVSIGGDF-TSGFLQQAADITGGSYLHVKKPQGLLQYLMT 219 (279)
T ss_pred HHHHHHHHHH-----HHcCceEEEEEeCCcc-ccHHHHHHHHHhCCEEeccCCHhHHHHHHHH
Confidence 3333332211 1358999999998653 5789999999999999999888776555443
No 76
>smart00187 INB Integrin beta subunits (N-terminal portion of extracellular region). Portion of beta integrins that lies N-terminal to their EGF-like repeats. Integrins are cell adhesion molecules that mediate cell-extracellular matrix and cell-cell interactions. They contain both alpha and beta subunits. Beta integrins are proposed to have a von Willebrand factor type-A "insert" or "I" -like domain (although this remains to be confirmed).
Probab=97.72 E-value=0.0025 Score=61.94 Aligned_cols=189 Identities=15% Similarity=0.107 Sum_probs=114.2
Q ss_pred eeccCCCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHc---CCCCeEEEEEeeccccc----------cccccc-----
Q psy10004 76 INAAASPKDMVILLDNSGSMMGQRREIARHVINNLLDTL---GNNDYVNVLQFTSVCKE----------VVPCFA----- 137 (354)
Q Consensus 76 ~~a~~~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L---~~~d~v~vi~Fs~~~~~----------~~~c~~----- 137 (354)
.++...|.|+++|+|.|+||.. .++.+|.....|+..| ..+-++++-.|-+++.. .-||..
T Consensus 93 ~~a~~yPvDLYyLMDlS~SM~d-dl~~lk~lg~~L~~~m~~it~n~rlGfGsFVDK~v~P~~~t~p~~l~~PC~~~~~~c 171 (423)
T smart00187 93 RQAEDYPVDLYYLMDLSYSMKD-DLDNLKSLGDDLAREMKGLTSNFRLGFGSFVDKTVSPFVSTRPEKLENPCPNYNLTC 171 (423)
T ss_pred EecccCccceEEEEeCCccHHH-HHHHHHHHHHHHHHHHHhcccCceeeEEEeecCccCCcccCCHHHhcCCCcCCCCCc
Confidence 3556689999999999999974 5667777777776554 57789999888887431 124411
Q ss_pred -----ccccccChhhHHHHHHHHhhhcCCCCCCcHHH-HHHHHHHHH--HhccccCCCCCCCcceEEEeccCCCC-----
Q psy10004 138 -----DILVQANLANVRELKMGVEMIGDANNIANFTV-ALTRAFNIL--ENARNDKKTGADCNQAIMVVTDGASE----- 204 (354)
Q Consensus 138 -----~~lv~~~~~n~~~~~~~i~~l~~~~G~T~~~~-aL~~A~~~l--~~~~~~~~~~~~~~~~IillTDG~~~----- 204 (354)
=..+..-..+...+.+.|.+. ...|+-+..+ +|.+..+.. .+..+.| .+..+++|+.||+...
T Consensus 172 ~p~f~f~~~L~LT~~~~~F~~~V~~~-~iSgN~D~PEgG~DAimQaaVC~~~IGWR---~~a~rllv~~TDa~fH~AGDG 247 (423)
T smart00187 172 EPPYGFKHVLSLTDDTDEFNEEVKKQ-RISGNLDAPEGGFDAIMQAAVCTEQIGWR---EDARRLLVFSTDAGFHFAGDG 247 (423)
T ss_pred CCCcceeeeccCCCCHHHHHHHHhhc-eeecCCcCCcccHHHHHHHHhhccccccC---CCceEEEEEEcCCCccccCCc
Confidence 011223346788899999888 6666654322 232222222 1222222 4567999999998642
Q ss_pred ---------------------------ch---HHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHhcCCcEE
Q psy10004 205 ---------------------------NY---KEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRDVKWMACANKGYY 254 (354)
Q Consensus 205 ---------------------------~~---~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~~~~G~~ 254 (354)
++ ..+.+++. + ++=..||++- ... ...-++++..=.|..
T Consensus 248 kLaGIv~PNDg~CHL~~~g~Yt~s~~~DYPSi~ql~~kL~--e-----~nI~~IFAVT--~~~--~~~Y~~Ls~lipgs~ 316 (423)
T smart00187 248 KLAGIVQPNDGQCHLDNNGEYTMSTTQDYPSIGQLNQKLA--E-----NNINPIFAVT--KKQ--VSLYKELSALIPGSS 316 (423)
T ss_pred ceeeEecCCCCcceeCCCCCcCccCcCCCCCHHHHHHHHH--h-----cCceEEEEEc--ccc--hhHHHHHHHhcCcce
Confidence 11 23444442 1 1234566652 221 223455554444544
Q ss_pred E-Ee-CChhHHHHHHHHHHHHhccchhc
Q psy10004 255 V-HL-STLAEVRDQILSYVPVMARPLVL 280 (354)
Q Consensus 255 ~-~i-~~~~~~~~~l~~~~~~l~~pl~~ 280 (354)
. .+ .+...+-+-|.+-|+.|++-+.+
T Consensus 317 vg~Ls~DSsNIv~LI~~aY~~i~S~V~l 344 (423)
T smart00187 317 VGVLSEDSSNVVELIKDAYNKISSRVEL 344 (423)
T ss_pred eeecccCcchHHHHHHHHHHhhceEEEE
Confidence 3 32 56688889999999999887776
No 77
>KOG1327|consensus
Probab=97.61 E-value=0.0015 Score=64.92 Aligned_cols=157 Identities=15% Similarity=0.183 Sum_probs=109.8
Q ss_pred CCceEEEEEeCCCCCC---------------cchHHHHHHHHHHHHHHcCCCCeEEEEEeeccccc---ccccccccccc
Q psy10004 81 SPKDMVILLDNSGSMM---------------GQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKE---VVPCFADILVQ 142 (354)
Q Consensus 81 ~p~dvvillD~SgSM~---------------g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~---~~~c~~~~lv~ 142 (354)
...++.+-||-+.|=. -...+.|..++-.+|....++.+|....|+...-. +.-||--.+-+
T Consensus 284 ~~lnf~vgIDfTaSNg~p~~~sSLHyi~p~~~N~Y~~Ai~~vG~~lq~ydsdk~fpa~GFGakip~~~~vs~~f~ln~~~ 363 (529)
T KOG1327|consen 284 EQLNFTVGIDFTASNGDPRNPSSLHYIDPHQPNPYEQAIRSVGETLQDYDSDKLFPAFGFGAKIPPDGQVSHEFVLNFNP 363 (529)
T ss_pred ceeeeEEEEEEeccCCCCCCCCcceecCCCCCCHHHHHHHHHhhhhcccCCCCccccccccccCCCCcccccceeecCCC
Confidence 4578999999999843 23456677777777777778888999999888322 22222111111
Q ss_pred cC------hhhHHHHHHHHhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhhc
Q psy10004 143 AN------LANVRELKMGVEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNWR 216 (354)
Q Consensus 143 ~~------~~n~~~~~~~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~~ 216 (354)
.+ ..-+..-.+.+.++ +..|.|++..-+.+|.+...+... .....-+++++|||..++.++..+++..
T Consensus 364 ~~~~c~Gi~gVl~aY~~~lp~v-~l~GPTnFaPII~~va~~a~~~~~----~~~qY~VLlIitDG~vTdm~~T~~AIV~- 437 (529)
T KOG1327|consen 364 EDPECRGIEGVLEAYRKALPNV-QLYGPTNFSPIINHVARIAQQSGN----TAGQYHVLLIITDGVVTDMKETRDAIVS- 437 (529)
T ss_pred CCCccccHHHHHHHHHhhcccc-cccCCCccHHHHHHHHHHHHHhcc----CCcceEEEEEEeCCccccHHHHHHHHHh-
Confidence 11 22335556667777 889999999999999998876543 1345579999999999999888887642
Q ss_pred ccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHhc
Q psy10004 217 GQNDSTLWPVRVFSYLVGKEVADYRDVKWMACA 249 (354)
Q Consensus 217 ~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~~ 249 (354)
+ ..-+..|..||+|+ .+...|+.+-..
T Consensus 438 -A---S~lPlSIIiVGVGd--~df~~M~~lD~d 464 (529)
T KOG1327|consen 438 -A---SDLPLSIIIVGVGD--ADFDMMRELDGD 464 (529)
T ss_pred -h---ccCCeEEEEEEeCC--CCHHHHHHhhcC
Confidence 2 24588999999995 378888887443
No 78
>KOG2884|consensus
Probab=97.49 E-value=0.007 Score=53.15 Aligned_cols=171 Identities=18% Similarity=0.208 Sum_probs=108.9
Q ss_pred eEEEEEeCCCCCCc-----chHHHHHHHHHHHHHH-cC--CCCeEEEEEeec-ccccccccccccccccChhhHHHHHHH
Q psy10004 84 DMVILLDNSGSMMG-----QRREIARHVINNLLDT-LG--NNDYVNVLQFTS-VCKEVVPCFADILVQANLANVRELKMG 154 (354)
Q Consensus 84 dvvillD~SgSM~g-----~~~~~ak~a~~~ll~~-L~--~~d~v~vi~Fs~-~~~~~~~c~~~~lv~~~~~n~~~~~~~ 154 (354)
..+|+||.|--|.. +|+..-+.++..+... +. |...|+|++-.+ .+..+ . . .+...-.+...
T Consensus 5 atmi~iDNse~mrNgDy~PtRf~aQ~daVn~v~~~K~~snpEntvGiitla~a~~~vL-s-------T-~T~d~gkils~ 75 (259)
T KOG2884|consen 5 ATMICIDNSEYMRNGDYLPTRFQAQKDAVNLVCQAKLRSNPENTVGIITLANASVQVL-S-------T-LTSDRGKILSK 75 (259)
T ss_pred eEEEEEeChHHhhcCCCChHHHHHHHHHHHHHHHhhhcCCcccceeeEeccCCCceee-e-------e-ccccchHHHHH
Confidence 57899999998874 6888889999888654 33 677899999988 43332 1 1 12345678888
Q ss_pred HhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCC-CCchHHHHHHhhhcccCCCCCCCeEEEEEEe
Q psy10004 155 VEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGA-SENYKEVFEEFNWRGQNDSTLWPVRVFSYLV 233 (354)
Q Consensus 155 i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~-~~~~~~i~~~~~~~~~~~~~~~~v~Iftigi 233 (354)
+..+ .+.|..++..+|+-|.-.|+.+.. ++..+.||++--.. .....+++...+ +. ...+|.|-.|-|
T Consensus 76 lh~i-~~~g~~~~~~~i~iA~lalkhRqn-----k~~~~riVvFvGSpi~e~ekeLv~~ak-rl----kk~~Vaidii~F 144 (259)
T KOG2884|consen 76 LHGI-QPHGKANFMTGIQIAQLALKHRQN-----KNQKQRIVVFVGSPIEESEKELVKLAK-RL----KKNKVAIDIINF 144 (259)
T ss_pred hcCC-CcCCcccHHHHHHHHHHHHHhhcC-----CCcceEEEEEecCcchhhHHHHHHHHH-HH----HhcCeeEEEEEe
Confidence 8899 999999999999999888876532 22233344433222 234455555433 12 235788999999
Q ss_pred CCCCCCHHHHHHH-HhcCC-c---EEEEeCChhHHHHHHHHHHHHhccchhc
Q psy10004 234 GKEVADYRDVKWM-ACANK-G---YYVHLSTLAEVRDQILSYVPVMARPLVL 280 (354)
Q Consensus 234 G~~~~~~~~L~~i-A~~~~-G---~~~~i~~~~~~~~~l~~~~~~l~~pl~~ 280 (354)
|....+.+.|... ...|+ | +...++....+.. ..++.|.+.
T Consensus 145 GE~~~~~e~l~~fida~N~~~~gshlv~Vppg~~L~d------~l~ssPii~ 190 (259)
T KOG2884|consen 145 GEAENNTEKLFEFIDALNGKGDGSHLVSVPPGPLLSD------ALLSSPIIQ 190 (259)
T ss_pred ccccccHHHHHHHHHHhcCCCCCceEEEeCCCccHHH------HhhcCceec
Confidence 9876554555443 33332 3 4555655543332 234667665
No 79
>KOG1985|consensus
Probab=97.26 E-value=0.0084 Score=61.97 Aligned_cols=160 Identities=16% Similarity=0.193 Sum_probs=99.0
Q ss_pred cCCCceEEEEEeCCCCCCcc-hHHHHHHHHHHHHHHcC--CCCeEEEEEeecccccc---------------------cc
Q psy10004 79 AASPKDMVILLDNSGSMMGQ-RREIARHVINNLLDTLG--NNDYVNVLQFTSVCKEV---------------------VP 134 (354)
Q Consensus 79 ~~~p~dvvillD~SgSM~g~-~~~~ak~a~~~ll~~L~--~~d~v~vi~Fs~~~~~~---------------------~~ 134 (354)
++.|.-.+||+|+|-|-... -++.+++++..-|+.|. ++.+|++|+|++..+.. .|
T Consensus 291 ~P~Pavy~FliDVS~~a~ksG~L~~~~~slL~~LD~lpgd~Rt~igfi~fDs~ihfy~~~~~~~qp~mm~vsdl~d~flp 370 (887)
T KOG1985|consen 291 PPQPAVYVFLIDVSISAIKSGYLETVARSLLENLDALPGDPRTRIGFITFDSTIHFYSVQGDLNQPQMMIVSDLDDPFLP 370 (887)
T ss_pred CCCCceEEEEEEeehHhhhhhHHHHHHHHHHHhhhcCCCCCcceEEEEEeeceeeEEecCCCcCCCceeeeccccccccC
Confidence 45788899999999875544 57888899988899998 66799999999876432 01
Q ss_pred cccccccccCh---hhHHHHHHHHhhh--cCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch---
Q psy10004 135 CFADILVQANL---ANVRELKMGVEMI--GDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY--- 206 (354)
Q Consensus 135 c~~~~lv~~~~---~n~~~~~~~i~~l--~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~--- 206 (354)
+.+.++-.-. ++++.+.+.+..+ ....-+..++.||+.|+.++.... -+++++. -+-|+-+
T Consensus 371 -~pd~lLv~L~~ck~~i~~lL~~lp~~F~~~~~t~~alGpALkaaf~li~~~G---------Gri~vf~-s~lPnlG~G~ 439 (887)
T KOG1985|consen 371 -MPDSLLVPLKECKDLIETLLKTLPEMFQDTRSTGSALGPALKAAFNLIGSTG---------GRISVFQ-STLPNLGAGK 439 (887)
T ss_pred -CchhheeeHHHHHHHHHHHHHHHHHHHhhccCcccccCHHHHHHHHHHhhcC---------CeEEEEe-ccCCCCCccc
Confidence 0111111111 2233333333333 122335689999999999997532 1444444 3433310
Q ss_pred ------------HH----------HHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHhcCCcE
Q psy10004 207 ------------KE----------VFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRDVKWMACANKGY 253 (354)
Q Consensus 207 ------------~~----------i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~~~~G~ 253 (354)
++ ..+.+. .......|.|--|.+.....|..-|..|+..++|.
T Consensus 440 L~~rEdp~~~~s~~~~qlL~~~t~FYK~~a----~~cs~~qI~VDlFl~s~qY~DlAsLs~LskySgG~ 504 (887)
T KOG1985|consen 440 LKPREDPNVRSSDEDSQLLSPATDFYKDLA----LECSKSQICVDLFLFSEQYTDLASLSCLSKYSGGQ 504 (887)
T ss_pred cccccccccccchhhhhccCCCchHHHHHH----HHhccCceEEEEEeecccccchhhhhccccccCce
Confidence 01 111111 11134678888888888777888999999999993
No 80
>PF03850 Tfb4: Transcription factor Tfb4; InterPro: IPR004600 Members of this family are part of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. The core-TFIIH basal transcription factor complex has six subunits, this is the p34 subunit.; GO: 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent, 0000439 core TFIIH complex
Probab=97.10 E-value=0.032 Score=51.89 Aligned_cols=180 Identities=17% Similarity=0.209 Sum_probs=99.5
Q ss_pred ceEEEEEeCCCCCCc-----chHHHHHHHHHHHHHH---cCCCCeEEEEEeeccccc-cccccc-------ccc-cccCh
Q psy10004 83 KDMVILLDNSGSMMG-----QRREIARHVINNLLDT---LGNNDYVNVLQFTSVCKE-VVPCFA-------DIL-VQANL 145 (354)
Q Consensus 83 ~dvvillD~SgSM~g-----~~~~~ak~a~~~ll~~---L~~~d~v~vi~Fs~~~~~-~~~c~~-------~~l-v~~~~ 145 (354)
.=+++|||++..--+ ..+..+.+++..+++. ++...+++||+.+..... +.|... +.. .....
T Consensus 2 SLLvIILD~nP~~W~~~~~~~~l~~~l~~llvFlNahL~l~~~N~vaVIAs~~~~s~~LYP~~~~~~~~~~~~~~~~~~~ 81 (276)
T PF03850_consen 2 SLLVIILDTNPLAWGQLSDQLSLSQFLDSLLVFLNAHLALNHSNQVAVIASHSNSSKFLYPSPSSSESSNSGDVEMNSSD 81 (276)
T ss_pred cEEEEEEECCHHHHhhccccccHHHHHHHHHHHHHHHHhhCccCCEEEEEEcCCccEEEeCCCccccccCCCcccccccc
Confidence 346888998864432 2455555555555555 568889999887665433 223211 000 00000
Q ss_pred -------hh-----HHHHHHHHhhhc---CCCCCCcHHHHHHHHHHHHHhccccCCC-CCCCcceEEEeccCCCC---ch
Q psy10004 146 -------AN-----VRELKMGVEMIG---DANNIANFTVALTRAFNILENARNDKKT-GADCNQAIMVVTDGASE---NY 206 (354)
Q Consensus 146 -------~n-----~~~~~~~i~~l~---~~~G~T~~~~aL~~A~~~l~~~~~~~~~-~~~~~~~IillTDG~~~---~~ 206 (354)
.+ .+.+++.++... .....+.+..||..|+-.+++....... .......|+++.-|.++ .+
T Consensus 82 ~~~y~~f~~v~~~v~~~l~~l~~~~~~~~~~~~~s~LagALS~ALCyINR~~~~~~~~~~~~~~RILv~~s~s~d~~~QY 161 (276)
T PF03850_consen 82 SNKYRQFRNVDETVLEELKKLMSETSESSDSTTSSLLAGALSMALCYINRISRESPSGGTSLKSRILVIVSGSPDSSSQY 161 (276)
T ss_pred cchhHHHHHHHHHHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHhhhhhcccCCCCCcCccEEEEEecCCCccHHH
Confidence 00 122222333220 1111268999999999888776332210 11223334442333333 33
Q ss_pred HHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHhcCCcEEEEeCChhHHHHHHHH
Q psy10004 207 KEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRDVKWMACANKGYYVHLSTLAEVRDQILS 269 (354)
Q Consensus 207 ~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~~~~~~~l~~ 269 (354)
-.+.+.+=.. .+.+|.|-+..+|. .+..+|+..+..|+|.|.++...+.+.+-+..
T Consensus 162 i~~MN~iFaA-----qk~~v~IDv~~L~~--~~s~fLqQa~d~T~G~y~~~~~~~~l~q~L~~ 217 (276)
T PF03850_consen 162 IPLMNCIFAA-----QKQKVPIDVCKLGG--KDSTFLQQASDITGGIYLKVSKPEGLLQYLLT 217 (276)
T ss_pred HHHHHHHHHH-----hcCCceeEEEEecC--CchHHHHHHHHHhCceeeccCccccHHHHHHH
Confidence 3333332111 13589999999997 36889999999999999999997776554443
No 81
>PTZ00395 Sec24-related protein; Provisional
Probab=97.08 E-value=0.015 Score=63.32 Aligned_cols=166 Identities=11% Similarity=0.136 Sum_probs=93.3
Q ss_pred CCCceEEEEEeCCCCC-CcchHHHHHHHHHHHHHHcC-CCCeEEEEEeeccccc--------------------------
Q psy10004 80 ASPKDMVILLDNSGSM-MGQRREIARHVINNLLDTLG-NNDYVNVLQFTSVCKE-------------------------- 131 (354)
Q Consensus 80 ~~p~dvvillD~SgSM-~g~~~~~ak~a~~~ll~~L~-~~d~v~vi~Fs~~~~~-------------------------- 131 (354)
+.|-..+||||+|-.- ..--+..+.++++..|+.+. +..+|+||+|++..+.
T Consensus 950 p~PP~YvFLIDVS~~AVkSGLl~tacesIK~sLDsL~dpRTRVGIITFDSsLHFYNLks~l~~~~~~~~~~~~l~qPQML 1029 (1560)
T PTZ00395 950 MLPPYFVFVVECSYNAIYNNITYTILEGIRYAVQNVKCPQTKIAIITFNSSIYFYHCKGGKGVSGEEGDGGGGSGNHQVI 1029 (1560)
T ss_pred CCCCEEEEEEECCHHHHhhChHHHHHHHHHHHHhcCCCCCcEEEEEEecCcEEEEecCcccccccccccccccCCCceEE
Confidence 4677899999998643 22335667778888888875 5679999999987632
Q ss_pred --------ccccc-cccccccChhhHHHHHHHHhhhc-----CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEE
Q psy10004 132 --------VVPCF-ADILVQANLANVRELKMGVEMIG-----DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMV 197 (354)
Q Consensus 132 --------~~~c~-~~~lv~~~~~n~~~~~~~i~~l~-----~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~Iil 197 (354)
++|+. .+.++. -.+.++.+...|+.|. ....+.-++.||+.|+.+|..... .-+++++
T Consensus 1030 VVSDLDDPFLPlP~ddLLVn-L~ESRevIe~LLDkLPemFt~t~~~esCLGSALqAA~~aLk~~GG-------GGKIiVF 1101 (1560)
T PTZ00395 1030 VMSDVDDPFLPLPLEDLFFG-CVEEIDKINTLIDTIKSVSTTMQSYGSCGNSALKIAMDMLKERNG-------LGSICMF 1101 (1560)
T ss_pred eecCCccCcCCCCccCeeec-hHHHHHHHHHHHHHHHHHhhccCCCcccHHHHHHHHHHHHHhcCC-------CceEEEE
Confidence 12221 122221 1244455555555441 123456789999999999986431 1245555
Q ss_pred eccCCCCch--------------------HHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCC--HHHHHHHHhcCCcEEE
Q psy10004 198 VTDGASENY--------------------KEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVAD--YRDVKWMACANKGYYV 255 (354)
Q Consensus 198 lTDG~~~~~--------------------~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~--~~~L~~iA~~~~G~~~ 255 (354)
.+ ..|+-+ ....+.+. .+ .....|.|-.|.+.....+ ..-|..|+..++|.-+
T Consensus 1102 ~S-SLPniGpGaLK~Re~~~KEk~Ll~pqd~FYK~LA-~E---Csk~qISVDLFLfSsqYvDVDVATLg~Lsr~TGGqly 1176 (1560)
T PTZ00395 1102 YT-TTPNCGIGAIKELKKDLQENFLEVKQKIFYDSLL-LD---LYAFNISVDIFIISSNNVRVCVPSLQYVAQNTGGKIL 1176 (1560)
T ss_pred Ec-CCCCCCCCcccccccccccccccccchHHHHHHH-HH---HHhcCCceEEEEccCcccccccccccchhcccceeEE
Confidence 54 333210 01111111 00 0123455555555543223 4678899999999444
Q ss_pred EeC
Q psy10004 256 HLS 258 (354)
Q Consensus 256 ~i~ 258 (354)
+.+
T Consensus 1177 yYP 1179 (1560)
T PTZ00395 1177 FVE 1179 (1560)
T ss_pred EeC
Confidence 433
No 82
>PF11265 Med25_VWA: Mediator complex subunit 25 von Willebrand factor type A; InterPro: IPR021419 The overall function of the full-length Med25 is efficiently to coordinate the transcriptional activation of RAR/RXR (retinoic acid receptor/retinoic X receptor) in higher eukaryotic cells. Human Med25 consists of several domains with different binding properties, the N-terminal, VWA domain which is this one, an SD2 domain from residues 229-381, a PTOV(B) or ACID domain from 395-545, an SD2 domain from residues 564-645 and a C-terminal NR box-containing domain (646-650) from 646-747. This VWA or von Willebrand factor type A domain when bound to RAR and the histone acetyltransferase CBP is responsible for recruiting Med1 to the rest of the Mediator complex [].
Probab=97.04 E-value=0.011 Score=52.87 Aligned_cols=119 Identities=10% Similarity=0.264 Sum_probs=78.6
Q ss_pred CCCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcC-------------CCCeEEEEEeeccccccccccccccccc--C
Q psy10004 80 ASPKDMVILLDNSGSMMGQRREIARHVINNLLDTLG-------------NNDYVNVLQFTSVCKEVVPCFADILVQA--N 144 (354)
Q Consensus 80 ~~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~-------------~~d~v~vi~Fs~~~~~~~~c~~~~lv~~--~ 144 (354)
+.-++||||||.+..|...--.+-.+-+.-+++.+. ....+++|.|++.. |+.+.+++- -
T Consensus 11 ~~~~~vVfvvEgTAalgpy~~~Lkt~Yl~P~le~f~~g~~~e~~~~~~~~~t~y~LVvf~t~d-----~~~~~~v~~~g~ 85 (226)
T PF11265_consen 11 PPQAQVVFVVEGTAALGPYWNTLKTNYLDPILEYFNGGPIAERDFGGDYSNTEYGLVVFNTAD-----CYPEPIVQRSGP 85 (226)
T ss_pred CccceEEEEEecchhhhhhHHHHHHHHHHHHHHHhcCCCcccccccccCCCceEEEEEEeccC-----CCcccceeccCC
Confidence 356899999999999975433334445555665554 22468999998874 233233322 2
Q ss_pred hhhHHHHHHHHhhhcCCC-CC----CcHHHHHHHHHHHHHhccccCC--CCCCCcceEEEeccCCCC
Q psy10004 145 LANVRELKMGVEMIGDAN-NI----ANFTVALTRAFNILENARNDKK--TGADCNQAIMVVTDGASE 204 (354)
Q Consensus 145 ~~n~~~~~~~i~~l~~~~-G~----T~~~~aL~~A~~~l~~~~~~~~--~~~~~~~~IillTDG~~~ 204 (354)
..+...+.++|+++ ... || +.+.+||..|++++......+. ......+..||++--.|.
T Consensus 86 T~~~~~fl~~L~~I-~f~GGG~e~~a~iaEGLa~AL~~fd~~~~~r~~~~~~~~~khcILI~nSpP~ 151 (226)
T PF11265_consen 86 TSSPQKFLQWLDAI-QFSGGGFESCAAIAEGLAEALQCFDDFKQMRQQQQQTDVQKHCILICNSPPY 151 (226)
T ss_pred cCCHHHHHHHHHcc-CcCCCCcccchhHHHHHHHHHHHhcchhhhccccCcccccceEEEEeCCCCc
Confidence 35678899999999 544 33 3589999999999986543331 112356888999877663
No 83
>COG5148 RPN10 26S proteasome regulatory complex, subunit RPN10/PSMD4 [Posttranslational modification, protein turnover, chaperones]
Probab=97.02 E-value=0.033 Score=47.85 Aligned_cols=150 Identities=14% Similarity=0.151 Sum_probs=100.9
Q ss_pred ceEEEEEeCCCCCCc-----chHHHHHHHHHHHHHHcC---CCCeEEEEEeecccccccccccccccccChhhHHHHHHH
Q psy10004 83 KDMVILLDNSGSMMG-----QRREIARHVINNLLDTLG---NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMG 154 (354)
Q Consensus 83 ~dvvillD~SgSM~g-----~~~~~ak~a~~~ll~~L~---~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~ 154 (354)
-..|+|||.|--|.. ++++.-|.++..++..-. +...++++.-......++ ...+...-.++.+
T Consensus 4 EatvvliDNse~s~NgDy~ptRFeAQkd~ve~if~~K~ndnpEntiGli~~~~a~p~vl--------sT~T~~~gkilt~ 75 (243)
T COG5148 4 EATVVLIDNSEASQNGDYLPTRFEAQKDAVESIFSKKFNDNPENTIGLIPLVQAQPNVL--------STPTKQRGKILTF 75 (243)
T ss_pred ceEEEEEeChhhhhcCCCCcHHHHHHHHHHHHHHHHHhcCCccceeeeeecccCCcchh--------ccchhhhhHHHHH
Confidence 357899999977763 689999999999886632 556889988765532221 1223456778889
Q ss_pred HhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEEEEEEeC
Q psy10004 155 VEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRVFSYLVG 234 (354)
Q Consensus 155 i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~IftigiG 234 (354)
+..+ ...|+.++..+|..|.-.|+.+.... ...+++.++..-..++.++++...+. . ...+|.|-.|-+|
T Consensus 76 lhd~-~~~g~a~~~~~lqiaql~lkhR~nk~----q~qriVaFvgSpi~esedeLirlak~-l----kknnVAidii~fG 145 (243)
T COG5148 76 LHDI-RLHGGADIMRCLQIAQLILKHRDNKG----QRQRIVAFVGSPIQESEDELIRLAKQ-L----KKNNVAIDIIFFG 145 (243)
T ss_pred hccc-cccCcchHHHHHHHHHHHHhcccCCc----cceEEEEEecCcccccHHHHHHHHHH-H----HhcCeeEEEEehh
Confidence 9999 99999999999999998887654321 22344445443344466777765431 1 2358999999999
Q ss_pred CCCCCHHHH-HHHHhcCC
Q psy10004 235 KEVADYRDV-KWMACANK 251 (354)
Q Consensus 235 ~~~~~~~~L-~~iA~~~~ 251 (354)
.-. +...| +.|+..|.
T Consensus 146 E~~-n~~~l~efIda~N~ 162 (243)
T COG5148 146 EAA-NMAGLFEFIDATNF 162 (243)
T ss_pred hhh-hhhHHHHHHHhhcc
Confidence 765 44444 45555554
No 84
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=96.97 E-value=0.018 Score=53.01 Aligned_cols=171 Identities=11% Similarity=0.102 Sum_probs=103.0
Q ss_pred ceEEEEEeCCCCCCcchH-----HH-HHHHHHHHHHHcC--CCCeEEEEEeecccccccccccccccccChhhHHHHHHH
Q psy10004 83 KDMVILLDNSGSMMGQRR-----EI-ARHVINNLLDTLG--NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMG 154 (354)
Q Consensus 83 ~dvvillD~SgSM~g~~~-----~~-ak~a~~~ll~~L~--~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~ 154 (354)
+.++++||.|.+|....+ .. .|-+...+.+.+. |-..++|+..-+..... +..-.-|.+.....
T Consensus 88 Rhl~l~lD~Seam~e~Df~p~r~a~vikya~~Fv~eFf~qNPiSqlsii~irdg~a~~--------~s~~~gnpq~hi~~ 159 (421)
T COG5151 88 RHLHLILDVSEAMDESDFLPTRRANVIKYAEGFVPEFFSQNPISQLSIISIRDGCAKY--------TSSMDGNPQAHIGQ 159 (421)
T ss_pred heeEEEEEhhhhhhhhhccchHHHHHHHHHHHHhHHHhccCCchheeeeehhhhHHHH--------hhhcCCCHHHHHHH
Confidence 578999999999986432 22 2223323333333 44577888766654322 21223355556666
Q ss_pred HhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCC-CcceEEEeccCCCCch---HHHHHHhhhcccCCCCCCCeEEEE
Q psy10004 155 VEMIGDANNIANFTVALTRAFNILENARNDKKTGAD-CNQAIMVVTDGASENY---KEVFEEFNWRGQNDSTLWPVRVFS 230 (354)
Q Consensus 155 i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~-~~~~IillTDG~~~~~---~~i~~~~~~~~~~~~~~~~v~Ift 230 (354)
+..+..+.|.-.+..||+.|.-.|.... .. .+.++|++..-...++ .+.++++. ..+|||..
T Consensus 160 lkS~rd~~gnfSLqNaLEmar~~l~~~~------~H~trEvLiifgS~st~DPgdi~~tid~Lv--------~~~IrV~~ 225 (421)
T COG5151 160 LKSKRDCSGNFSLQNALEMARIELMKNT------MHGTREVLIIFGSTSTRDPGDIAETIDKLV--------AYNIRVHF 225 (421)
T ss_pred hhcccccCCChhHHhHHHHhhhhhcccc------cccceEEEEEEeecccCCCccHHHHHHHHH--------hhceEEEE
Confidence 6666467788889999999955554321 22 2344454433333344 44455442 35899999
Q ss_pred EEeCCCCCCHHHHHHHHhcC----CcEEEEeCChhHHHHHHHHHHHHhccchhccC
Q psy10004 231 YLVGKEVADYRDVKWMACAN----KGYYVHLSTLAEVRDQILSYVPVMARPLVLQR 282 (354)
Q Consensus 231 igiG~~~~~~~~L~~iA~~~----~G~~~~i~~~~~~~~~l~~~~~~l~~pl~~~~ 282 (354)
+|+-. ....-++|..++ .|.|...-+..- +.+++..++.|.-.++
T Consensus 226 igL~a---evaicKeickaTn~~~e~~y~v~vde~H----l~el~~E~~~P~~~n~ 274 (421)
T COG5151 226 IGLCA---EVAICKEICKATNSSTEGRYYVPVDEGH----LSELMRELSHPTDFNG 274 (421)
T ss_pred Eeehh---HHHHHHHHHhhcCcCcCceeEeeecHHH----HHHHHHhcCCCCCCCc
Confidence 99975 345778887776 677766655444 5667777887776643
No 85
>COG3864 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=96.84 E-value=0.0038 Score=57.68 Aligned_cols=94 Identities=18% Similarity=0.295 Sum_probs=59.8
Q ss_pred eEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHh-hhcCCC
Q psy10004 84 DMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVE-MIGDAN 162 (354)
Q Consensus 84 dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~-~l~~~~ 162 (354)
.+++++|+||||...-++.+..-+..++ -.++-+..++.-...+..... +.+ .. ++. .+ ..+
T Consensus 263 ~i~vaVDtSGS~~d~ei~a~~~Ei~~Il--~~~~~eltli~~D~~v~~~~~-----~r~--g~-------~~~~~~-~gg 325 (396)
T COG3864 263 KIVVAVDTSGSMTDAEIDAAMTEIFDIL--KNKNYELTLIECDNIVRRMYR-----VRK--GR-------DMKKKL-DGG 325 (396)
T ss_pred heEEEEecCCCccHHHHHHHHHHHHHHH--hCCCcEEEEEEecchhhhhhc-----cCC--cc-------cCCccc-CCC
Confidence 4789999999999776776666666665 236778888877776654211 110 00 111 23 566
Q ss_pred CCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch
Q psy10004 163 NIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY 206 (354)
Q Consensus 163 G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~ 206 (354)
|+|++..+++.. .+.. ....+|++|||.-+.+
T Consensus 326 G~Tdf~Pvfeyl----ek~~--------~~~~lIyfTDG~gd~p 357 (396)
T COG3864 326 GGTDFSPVFEYL----EKNR--------MECFLIYFTDGMGDQP 357 (396)
T ss_pred CCccccHHHHHH----Hhhc--------ccceEEEEccCCCCcc
Confidence 779988876653 3221 1157999999987654
No 86
>TIGR00578 ku70 ATP-dependent DNA helicase ii, 70 kDa subunit (ku70). Proteins in this family are involved in non-homologous end joining, a process used for the repair of double stranded DNA breaks. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). Cutoff does not detect the putative ku70 homologs in yeast.
Probab=96.80 E-value=0.0077 Score=62.00 Aligned_cols=116 Identities=15% Similarity=0.195 Sum_probs=74.2
Q ss_pred ceEEEEEeCCCCCCc--------chHHHHHHHHHHHHHHc---CCCCeEEEEEeecccccccccccc-----cccccChh
Q psy10004 83 KDMVILLDNSGSMMG--------QRREIARHVINNLLDTL---GNNDYVNVLQFTSVCKEVVPCFAD-----ILVQANLA 146 (354)
Q Consensus 83 ~dvvillD~SgSM~g--------~~~~~ak~a~~~ll~~L---~~~d~v~vi~Fs~~~~~~~~c~~~-----~lv~~~~~ 146 (354)
--|+|+||.|.||-. .++..+..++..++... +++|.|||+.|+.+.+.--.+|.+ .+...+.+
T Consensus 11 eailflIDvs~sM~~~~~~~~~~s~~~~al~~i~~l~q~kIis~~~D~vGivlfgT~~t~n~~~~~~i~v~~~L~~p~a~ 90 (584)
T TIGR00578 11 DSLIFLVDASKAMFEESQGEDELTPFDMSIQCIQSVYTSKIISSDKDLLAVVFYGTEKDKNSVNFKNIYVLQELDNPGAK 90 (584)
T ss_pred eEEEEEEECCHHHcCCCcCcCcCChHHHHHHHHHHHHHhcCCCCCCCeEEEEEEeccCCCCccCCCceEEEeeCCCCCHH
Confidence 458899999999984 46778888888887663 489999999999875431111221 23333444
Q ss_pred hHHHHHHHHhhhc------CCC-CC-CcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCC
Q psy10004 147 NVRELKMGVEMIG------DAN-NI-ANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGAS 203 (354)
Q Consensus 147 n~~~~~~~i~~l~------~~~-G~-T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~ 203 (354)
.+..+.+.+..-+ ..+ +. ..+..||..|.++|.+... .-..+-|+++||-..
T Consensus 91 ~i~~L~~l~~~~~~~~~~~~~~~~~~~~l~daL~~~~~~f~~~~~-----k~~~kRI~lfTd~D~ 150 (584)
T TIGR00578 91 RILELDQFKGDQGPKKFRDTYGHGSDYSLSEVLWVCANLFSDVQF-----RMSHKRIMLFTNEDN 150 (584)
T ss_pred HHHHHHHHhhccCccchhhccCCCCCCcHHHHHHHHHHHHHhcch-----hhcCcEEEEECCCCC
Confidence 4444444332200 011 11 3789999999999985321 223577999999854
No 87
>COG3552 CoxE Protein containing von Willebrand factor type A (vWA) domain [General function prediction only]
Probab=96.63 E-value=0.015 Score=55.20 Aligned_cols=110 Identities=18% Similarity=0.250 Sum_probs=60.6
Q ss_pred CCCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHHHh-hh
Q psy10004 80 ASPKDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVE-MI 158 (354)
Q Consensus 80 ~~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~-~l 158 (354)
..+..+|+|+|+||||++.. .+.-..+..+.. ...++-+..|++....+-+ .+- ....+.+...+. ..
T Consensus 216 ~~~~~lvvL~DVSGSm~~ys-~~~L~l~hAl~q---~~~R~~~F~F~TRLt~vT~----~l~---~rD~~~Al~~~~a~v 284 (395)
T COG3552 216 RRKPPLVVLCDVSGSMSGYS-RIFLHLLHALRQ---QRSRVHVFLFGTRLTRVTH----MLR---ERDLEDALRRLSAQV 284 (395)
T ss_pred cCCCCeEEEEecccchhhhH-HHHHHHHHHHHh---cccceeEEEeechHHHHHH----Hhc---cCCHHHHHHHHHhhc
Confidence 46778999999999998632 222222222222 3345568889988655432 122 122223333222 22
Q ss_pred cCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCch
Q psy10004 159 GDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENY 206 (354)
Q Consensus 159 ~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~ 206 (354)
..-.|||.++.++..=.+- ...+ -=...-+|+++|||-..+.
T Consensus 285 ~dw~ggTrig~tl~aF~~~----~~~~--~L~~gA~VlilsDg~drd~ 326 (395)
T COG3552 285 KDWDGGTRIGNTLAAFLRR----WHGN--VLSGGAVVLILSDGLDRDD 326 (395)
T ss_pred ccccCCcchhHHHHHHHcc----cccc--ccCCceEEEEEecccccCC
Confidence 2346999999887653221 1111 0112358999999987754
No 88
>PF00362 Integrin_beta: Integrin, beta chain; InterPro: IPR002369 Integrins are the major metazoan receptors for cell adhesion to extracellular matrix proteins and, in vertebrates, also play important roles in certain cell-cell adhesions, make transmembrane connections to the cytoskeleton and activate many intracellular signalling pathways [, ]. The integrin receptors are composed of alpha and beta subunit heterodimers. Each subunit crosses the membrane once, with most of the polypeptide residing in the extracellular space, and has two short cytoplasmic domains. Some members of this family have EGF repeats at the C terminus and also have a vWA domain inserted within the integrin domain at the N terminus. Most integrins recognise relatively short peptide motifs, and in general require an acidic amino acid to be present. Ligand specificity depends upon both the alpha and beta subunits []. There are at least 18 types of alpha and 8 types of beta subunits recognised in humans []. Each alpha subunit tends to associate only with one type of beta subunit, but there are exceptions to this rule []. Each association of alpha and beta subunits has its own binding specificity and signalling properties. Many integrins require activation on the cell surface before they can bind ligands. Integrins frequently intercommunicate, and binding at one integrin receptor activate or inhibit another. The structure of unliganded alphaV beta3 showed the molecule to be folded, with the head bent over towards the C termini of the legs which would normally be inserted into the membrane []. The head comprises a beta propeller domain at the end terminus of the alphaV subunit and an I/A domain inserted into a loop on the top of the hybrid domain in the beta subunit. The I/A domain consists of a Rossman fold with a core of beta parallel sheets surrounded by amphipathic alpha helices. Integrins are important therapeutic targets in conditions such as atherosclerosis, thrombosis, cancer and asthma []. At the N terminus of the beta subunit is a cysteine-containing domain reminiscent of that found in presenillins and semaphorins, which has hence been termed the PSI domain. C-terminal to the PSI domain is an A-domain, which has been predicted to adopt a Rossmann fold similar to that of the alpha subunit, but with additional loops between the second and third beta strands []. The murine gene Pactolus shares significant similarity with the beta subunit [], but lacks either one or both of the inserted loops. The C-terminal portion of the beta subunit extracellular domain contains an internally disulphide-bonded cysteine-rich region, while the intracellular tail contains putative sites of interaction with a variety of intracellular signalling and cytoskeletal proteins, such as focal adhesion kinase and alpha-actinin respectively []. Integrin cytoplasmic domains are normally less than 50 amino acids in length, with the beta-subunit sequences exhibiting greater homology to each other than the alpha-subunit sequences. This is consistent with current evidence that the beta subunit is the principal site for binding of cytoskeletal and signalling molecules, whereas the alpha subunit has a regulatory role. The first 20 amino acids of the beta-subunit cytoplasmic domain are also alpha helical, but the final 25 residues are disordered and, apart from a turn that follows a conserved NPxY motif, appear to lack defined structure, suggesting that this is adopted on effector binding. The two membrane-proximal helices mediate the link between the subunits via a series of hydrophobic and electrostatic contacts. This entry represents the N-terminal portion of the extracellular region of integrin beta subunits.; GO: 0005488 binding, 0007155 cell adhesion, 0007160 cell-matrix adhesion; PDB: 3VI4_B 3VI3_B 2VDQ_B 3IJE_B 1M1X_B 2VDR_B 3NIF_B 3NID_D 1TYE_F 2Q6W_F ....
Probab=96.19 E-value=0.0048 Score=60.88 Aligned_cols=187 Identities=16% Similarity=0.153 Sum_probs=100.2
Q ss_pred eccCCCceEEEEEeCCCCCCcchHHHHHHHHHHH---HHHcCCCCeEEEEEeeccccccc----------cc--------
Q psy10004 77 NAAASPKDMVILLDNSGSMMGQRREIARHVINNL---LDTLGNNDYVNVLQFTSVCKEVV----------PC-------- 135 (354)
Q Consensus 77 ~a~~~p~dvvillD~SgSM~g~~~~~ak~a~~~l---l~~L~~~d~v~vi~Fs~~~~~~~----------~c-------- 135 (354)
.+...|.|+++|+|-|+||... ++..|.....| +..+..+-++++-+|-+++...+ ||
T Consensus 97 ~a~~yPvDLYyLmDlS~Sm~dd-l~~l~~lg~~l~~~~~~it~~~~~GfGsfvdK~~~P~~~~~p~~l~~pc~~~~~~c~ 175 (426)
T PF00362_consen 97 PAEDYPVDLYYLMDLSYSMKDD-LENLKSLGQDLAEEMRNITSNFRLGFGSFVDKPVMPFVSTTPEKLKNPCPSKNPNCQ 175 (426)
T ss_dssp BSSS--EEEEEEEE-SGGGHHH-HHHHCCCCHHHHHHHHTT-SSEEEEEEEESSSSSTTTST-SSHCHHSTSCCTTS--B
T ss_pred eccccceeEEEEeechhhhhhh-HHHHHHHHHHHHHHHHhcCccceEechhhcccccCCcccCChhhhcCcccccCCCCC
Confidence 4567899999999999999853 44444433334 44566788999999988763221 33
Q ss_pred ----ccccccccChhhHHHHHHHHhhhcCCCCCCcH-HHHHHHHHHHH--HhccccCCCCCCCcceEEEeccCCCC----
Q psy10004 136 ----FADILVQANLANVRELKMGVEMIGDANNIANF-TVALTRAFNIL--ENARNDKKTGADCNQAIMVVTDGASE---- 204 (354)
Q Consensus 136 ----~~~~lv~~~~~n~~~~~~~i~~l~~~~G~T~~-~~aL~~A~~~l--~~~~~~~~~~~~~~~~IillTDG~~~---- 204 (354)
|+. +..-..+...+...|.+. ...|+-+. ..+|...++.. .+..+.+ ....++||+.||+...
T Consensus 176 ~~~~f~~--~l~Lt~~~~~F~~~v~~~-~is~n~D~PEgg~dal~Qa~vC~~~igWr---~~a~~llv~~TD~~fH~agD 249 (426)
T PF00362_consen 176 PPFSFRH--VLSLTDDITEFNEEVNKQ-KISGNLDAPEGGLDALMQAAVCQEEIGWR---NEARRLLVFSTDAGFHFAGD 249 (426)
T ss_dssp ---SEEE--EEEEES-HHHHHHHHHTS---B--SSSSBSHHHHHHHHHH-HHHHT-----STSEEEEEEEESS-B--TTG
T ss_pred CCeeeEE--eecccchHHHHHHhhhhc-cccCCCCCCccccchheeeeecccccCcc---cCceEEEEEEcCCccccccc
Confidence 111 111235788899998876 44443321 22222222211 2222222 4577999999998532
Q ss_pred ----------------------------ch---HHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHhcCCc-
Q psy10004 205 ----------------------------NY---KEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRDVKWMACANKG- 252 (354)
Q Consensus 205 ----------------------------~~---~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~~~~G- 252 (354)
++ ..+.+.+. + ++=+.||++.=. -...-+.++..-.|
T Consensus 250 g~l~gi~~pnd~~Chl~~~~~y~~~~~~DYPSv~ql~~~l~--e-----~~i~~IFAVt~~----~~~~Y~~L~~~i~~s 318 (426)
T PF00362_consen 250 GKLAGIVKPNDGKCHLDDNGMYTASTEQDYPSVGQLVRKLS--E-----NNINPIFAVTKD----VYSIYEELSNLIPGS 318 (426)
T ss_dssp GGGGT--S---SS--BSTTSBBGGGGCS----HHHHHHHHH--H-----TTEEEEEEEEGG----GHHHHHHHHHHSTTE
T ss_pred cccceeeecCCCceEECCCCcccccccccCCCHHHHHHHHH--H-----cCCEEEEEEchh----hhhHHHHHhhcCCCc
Confidence 01 23333332 1 123567776322 23344666555444
Q ss_pred EEEEeC-ChhHHHHHHHHHHHHhccchhcc
Q psy10004 253 YYVHLS-TLAEVRDQILSYVPVMARPLVLQ 281 (354)
Q Consensus 253 ~~~~i~-~~~~~~~~l~~~~~~l~~pl~~~ 281 (354)
....+. +...+-+-|.+-|..|++-+.+.
T Consensus 319 ~vg~L~~dSsNIv~LI~~aY~~i~s~V~L~ 348 (426)
T PF00362_consen 319 SVGELSSDSSNIVQLIKEAYNKISSKVELK 348 (426)
T ss_dssp EEEEESTTSHTHHHHHHHHHHHHCTEEEEE
T ss_pred eecccccCchhHHHHHHHHHHHHhheEEEE
Confidence 444444 35668888999999998776663
No 89
>KOG1984|consensus
Probab=95.64 E-value=0.32 Score=51.06 Aligned_cols=103 Identities=20% Similarity=0.259 Sum_probs=66.0
Q ss_pred ccCCCceEEEEEeCCCCCCcch-HHHHHHHHHHHHHHcC---CCCeEEEEEeeccccc---------------------c
Q psy10004 78 AAASPKDMVILLDNSGSMMGQR-REIARHVINNLLDTLG---NNDYVNVLQFTSVCKE---------------------V 132 (354)
Q Consensus 78 a~~~p~dvvillD~SgSM~g~~-~~~ak~a~~~ll~~L~---~~d~v~vi~Fs~~~~~---------------------~ 132 (354)
-.+.|-..||+||+|-.--... ..++-++++.+|..|. ++-+|+|++|++.++. .
T Consensus 413 k~p~ppafvFmIDVSy~Ai~~G~~~a~ce~ik~~l~~lp~~~p~~~Vgivtfd~tvhFfnl~s~L~qp~mliVsdv~dvf 492 (1007)
T KOG1984|consen 413 KPPKPPAFVFMIDVSYNAISNGAVKAACEAIKSVLEDLPREEPNIRVGIVTFDKTVHFFNLSSNLAQPQMLIVSDVDDVF 492 (1007)
T ss_pred CCCCCceEEEEEEeehhhhhcchHHHHHHHHHHHHhhcCccCCceEEEEEEecceeEeeccCccccCceEEEeecccccc
Confidence 3567889999999986433222 4566778888887765 6789999999988743 2
Q ss_pred cccccccccccChhhHHHHHHHHhhhc--CCC---CCCcHHHHHHHHHHHHHhc
Q psy10004 133 VPCFADILVQANLANVRELKMGVEMIG--DAN---NIANFTVALTRAFNILENA 181 (354)
Q Consensus 133 ~~c~~~~lv~~~~~n~~~~~~~i~~l~--~~~---G~T~~~~aL~~A~~~l~~~ 181 (354)
+|...+.++.. .+.+..++..|+.+. -.+ -.|-+..||+.|+..|+..
T Consensus 493 vPf~~g~~V~~-~es~~~i~~lLd~Ip~mf~~sk~pes~~g~alqaa~lalk~~ 545 (1007)
T KOG1984|consen 493 VPFLDGLFVNP-NESRKVIELLLDSIPTMFQDSKIPESVFGSALQAAKLALKAA 545 (1007)
T ss_pred cccccCeeccc-hHHHHHHHHHHHHhhhhhccCCCCchhHHHHHHHHHHHHhcc
Confidence 34333333332 233444444444441 222 2457899999999988754
No 90
>KOG4465|consensus
Probab=95.58 E-value=0.032 Score=52.62 Aligned_cols=129 Identities=20% Similarity=0.227 Sum_probs=82.5
Q ss_pred cccccccccceeeeccCCCceEEEEEeCCCCCCc----chHHHHHHHHHHH-HHHcCCCCeEEEEEeecccccccccccc
Q psy10004 64 TDLYDCRVREWYINAAASPKDMVILLDNSGSMMG----QRREIARHVINNL-LDTLGNNDYVNVLQFTSVCKEVVPCFAD 138 (354)
Q Consensus 64 ~~~yd~r~r~wy~~a~~~p~dvvillD~SgSM~g----~~~~~ak~a~~~l-l~~L~~~d~v~vi~Fs~~~~~~~~c~~~ 138 (354)
|+..|.-+-.-|..+.+..|.+.+.||.|+||.. ..+. +++++..+ +-.+.....+-.+.|.+.... .| |.
T Consensus 409 ~dalda~fykaf~na~ptgkr~~laldvs~sm~~rv~~s~ln-~reaaa~m~linlhnead~~~vaf~d~lte-~p-ft- 484 (598)
T KOG4465|consen 409 CDALDAAFYKAFKNAEPTGKRFCLALDVSASMNQRVLGSILN-AREAAAAMCLINLHNEADSRCVAFCDELTE-CP-FT- 484 (598)
T ss_pred HHHHHHHHHHHhccCCCCCceEEEEEecchhhhhhhhccccc-hHHHHhhhheeeeccccceeEEEecccccc-CC-Cc-
Confidence 6667776666678888899999999999999973 3343 33333332 334555556678999988533 12 11
Q ss_pred cccccChhhHHHHHHHHhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCC----chHHHHHHhh
Q psy10004 139 ILVQANLANVRELKMGVEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASE----NYKEVFEEFN 214 (354)
Q Consensus 139 ~lv~~~~~n~~~~~~~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~----~~~~i~~~~~ 214 (354)
..-.+.++..+++++ . .|+|+-+-++..|-+- .-...+.|++||...- .+.+.+.+|+
T Consensus 485 -----kd~kigqv~~~~nni-~-~g~tdcglpm~wa~en-----------nlk~dvfii~tdndt~ageihp~~aik~yr 546 (598)
T KOG4465|consen 485 -----KDMKIGQVLDAMNNI-D-AGGTDCGLPMIWAQEN-----------NLKADVFIIFTDNDTFAGEIHPAEAIKEYR 546 (598)
T ss_pred -----ccccHHHHHHHHhcC-C-CCCCccCCceeehhhc-----------CCCccEEEEEecCcccccccCHHHHHHHHH
Confidence 122356677777777 4 4667777777666431 1223578889998643 4566777774
No 91
>COG1721 Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]
Probab=95.57 E-value=0.12 Score=51.04 Aligned_cols=106 Identities=18% Similarity=0.258 Sum_probs=68.8
Q ss_pred CCceEEEEEeCCCCCCc-----chHHHHHHHHHHHH-HHcCCCCeEEEEEeecccccccccccccccccChhhHHHHHHH
Q psy10004 81 SPKDMVILLDNSGSMMG-----QRREIARHVINNLL-DTLGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMG 154 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g-----~~~~~ak~a~~~ll-~~L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~ 154 (354)
...+++|++|.|.||.. .+++.+..++..+. ..+..+|++++..|+......++ .....+....+.+.
T Consensus 223 r~~~v~l~lD~~~~m~~~~~~~~~~e~av~~a~~la~~~l~~gd~vg~~~~~~~~~~~~~------p~~G~~~l~~~l~~ 296 (416)
T COG1721 223 RGRTVVLVLDASRSMLFGSGVASKFEEAVRAAASLAYAALKNGDRVGLLIFGGGGPKWIP------PSRGRRHLARILKA 296 (416)
T ss_pred cCceEEEEEeCCccccCCCCCccHHHHHHHHHHHHHHHHHhCCCeeEEEEECCCcceeeC------CCcchHHHHHHHHH
Confidence 47899999999999994 68888888777764 55779999999999976433221 11234555666666
Q ss_pred HhhhcCCCCC-CcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCC
Q psy10004 155 VEMIGDANNI-ANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASE 204 (354)
Q Consensus 155 i~~l~~~~G~-T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~ 204 (354)
+..+ .+.+. ++...+... ... -....+.++++||=...
T Consensus 297 l~~~-~~~~~~~~~~~~~~~-~~~----------l~~~~~~~~~~~~l~~~ 335 (416)
T COG1721 297 LALL-RPAPEETDYIRRVSK-LDF----------LPPRRPLVILITDLARH 335 (416)
T ss_pred hhcc-CCCCcchhHHHHhhh-hhc----------cCcccceEEEeehhhcc
Confidence 6666 44443 444444222 111 12344678888887654
No 92
>COG5242 TFB4 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB4 [Transcription / DNA replication, recombination, and repair]
Probab=95.50 E-value=1.7 Score=38.69 Aligned_cols=179 Identities=12% Similarity=0.159 Sum_probs=93.0
Q ss_pred cceeeeccCCCceEEEEEeCCCCC------CcchHHHHHHHHHHHHHH---cCCCCeEEEE-Eeeccccccccccccccc
Q psy10004 72 REWYINAAASPKDMVILLDNSGSM------MGQRREIARHVINNLLDT---LGNNDYVNVL-QFTSVCKEVVPCFADILV 141 (354)
Q Consensus 72 r~wy~~a~~~p~dvvillD~SgSM------~g~~~~~ak~a~~~ll~~---L~~~d~v~vi-~Fs~~~~~~~~c~~~~lv 141 (354)
|+|-+... +|.=++++||.-.-- .|. .....+.+.-+|+. +..+.+|+|+ .|++..+.+.|.....+-
T Consensus 11 ~s~~vtee-spslL~viid~~p~~W~~~~ek~~-~~kvl~di~VFLNAhlaf~~~NrVaVva~~s~~~~yLypss~s~~k 88 (296)
T COG5242 11 RSKQVTEE-SPSLLFVIIDLEPENWELTTEKGS-RDKVLNDIVVFLNAHLAFSRNNRVAVVAGYSQGKTYLYPSSESALK 88 (296)
T ss_pred hhcccccc-CCceEEEEEecChhhccccccccc-HHHHHHHHHHHHHHHHhhccCCeEEEEEeccCceEEeccCcchhhh
Confidence 34544332 477777788864321 122 22233333334433 4577888876 566666666553211111
Q ss_pred ccC-----------------hhhHHHHHHHHhhhcCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCC
Q psy10004 142 QAN-----------------LANVRELKMGVEMIGDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASE 204 (354)
Q Consensus 142 ~~~-----------------~~n~~~~~~~i~~l~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~ 204 (354)
.+. ...+.++.+.++.-.+-.-.+.++.|+..++.......+.. +-..+++|+---|...
T Consensus 89 ~se~e~tr~sd~yrrfr~vde~~i~eiyrl~e~~~k~sqr~~v~gams~glay~n~~~~e~---slkSriliftlsG~d~ 165 (296)
T COG5242 89 ASESENTRNSDMYRRFRNVDETDITEIYRLIEHPHKNSQRYDVGGAMSLGLAYCNHRDEET---SLKSRILIFTLSGRDR 165 (296)
T ss_pred hhcccCccchhhhhhhcccchHHHHHHHHHHhCcccccceeehhhhhhhhHHHHhhhcccc---cccceEEEEEecCchh
Confidence 000 01122222222221112224688889988888876654332 2223444443346332
Q ss_pred c--hHHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHhcCCcEEEEeCChhHH
Q psy10004 205 N--YKEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRDVKWMACANKGYYVHLSTLAEV 263 (354)
Q Consensus 205 ~--~~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~~~~ 263 (354)
. +-....-+= . ....+++|-++.|+. +..+|++-+.+++|.|.++.+.+.+
T Consensus 166 ~~qYip~mnCiF--~---Aqk~~ipI~v~~i~g---~s~fl~Q~~daTgG~Yl~ve~~eGl 218 (296)
T COG5242 166 KDQYIPYMNCIF--A---AQKFGIPISVFSIFG---NSKFLLQCCDATGGDYLTVEDTEGL 218 (296)
T ss_pred hhhhchhhhhee--e---hhhcCCceEEEEecC---ccHHHHHHhhccCCeeEeecCchhH
Confidence 1 111111110 0 023578888888875 3579999999999999999998765
No 93
>COG5028 Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion]
Probab=94.75 E-value=1.4 Score=45.86 Aligned_cols=157 Identities=19% Similarity=0.179 Sum_probs=85.0
Q ss_pred cCCCceEEEEEeCCCCC-CcchHHHHHHHHHHHHHHcC---CCCeEEEEEeecccccc--------------------cc
Q psy10004 79 AASPKDMVILLDNSGSM-MGQRREIARHVINNLLDTLG---NNDYVNVLQFTSVCKEV--------------------VP 134 (354)
Q Consensus 79 ~~~p~dvvillD~SgSM-~g~~~~~ak~a~~~ll~~L~---~~d~v~vi~Fs~~~~~~--------------------~~ 134 (354)
.+.|..+||+||+|-.- ...-...+.+++...++.+. ++.+++|+.|++..+.+ +|
T Consensus 273 ~p~P~~yvFlIDVS~~a~~~g~~~a~~r~Il~~l~~~~~~dpr~kIaii~fD~sl~ffk~s~d~~~~~~~vsdld~pFlP 352 (861)
T COG5028 273 QPPPPVYVFLIDVSFEAIKNGLVKAAIRAILENLDQIPNFDPRTKIAIICFDSSLHFFKLSPDLDEQMLIVSDLDEPFLP 352 (861)
T ss_pred cCCCCEEEEEEEeehHhhhcchHHHHHHHHHhhccCCCCCCCcceEEEEEEcceeeEEecCCCCccceeeeccccccccc
Confidence 45689999999998643 22223344455555555543 66799999999886542 12
Q ss_pred cccccccccCh---hhHHHHHHHHhhhcCCCCC--CcHHHHHHHHHHHHHhccccCCCCCCCcceEEEecc------CCC
Q psy10004 135 CFADILVQANL---ANVRELKMGVEMIGDANNI--ANFTVALTRAFNILENARNDKKTGADCNQAIMVVTD------GAS 203 (354)
Q Consensus 135 c~~~~lv~~~~---~n~~~~~~~i~~l~~~~G~--T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTD------G~~ 203 (354)
-.++.++..-. .+++.+.+.+..+-...+. -.++.||+.|..++... .-++|++++- |..
T Consensus 353 f~s~~fv~pl~~~k~~~etLl~~~~~If~d~~~pk~~~G~aLk~a~~l~g~~---------GGkii~~~stlPn~G~Gkl 423 (861)
T COG5028 353 FPSGLFVLPLKSCKQIIETLLDRVPRIFQDNKSPKNALGPALKAAKSLIGGT---------GGKIIVFLSTLPNMGIGKL 423 (861)
T ss_pred CCcchhcccHHHHHHHHHHHHHHhhhhhcccCCCccccCHHHHHHHHHhhcc---------CceEEEEeecCCCcccccc
Confidence 11111221111 1223345555555122233 36899999999877542 1256666633 322
Q ss_pred C---------------chHHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHhcCCc
Q psy10004 204 E---------------NYKEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRDVKWMACANKG 252 (354)
Q Consensus 204 ~---------------~~~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~~~~G 252 (354)
. -+++...++. ..+|.|-.|....+.-|..-|-.++..++|
T Consensus 424 ~~r~d~e~~ll~c~d~fYk~~a~e~~--------k~gIsvd~Flt~~~yidvaTls~l~~~T~G 479 (861)
T COG5028 424 QLREDKESSLLSCKDSFYKEFAIECS--------KVGISVDLFLTSEDYIDVATLSHLCRYTGG 479 (861)
T ss_pred cccccchhhhccccchHHHHHHHHHH--------HhcceEEEEeccccccchhhhcchhhccCc
Confidence 2 0133333322 124444444444433367778888888999
No 94
>KOG2487|consensus
Probab=93.98 E-value=3.2 Score=38.06 Aligned_cols=174 Identities=16% Similarity=0.170 Sum_probs=90.8
Q ss_pred CCceEEEEEeCCC---CCC----c-chHHHHHHHHHHHHHH---cCCCCeEEEEEeeccccc-cccccc-------ccc-
Q psy10004 81 SPKDMVILLDNSG---SMM----G-QRREIARHVINNLLDT---LGNNDYVNVLQFTSVCKE-VVPCFA-------DIL- 140 (354)
Q Consensus 81 ~p~dvvillD~Sg---SM~----g-~~~~~ak~a~~~ll~~---L~~~d~v~vi~Fs~~~~~-~~~c~~-------~~l- 140 (354)
.|.=+++|||.+. -|. + ..+...-+++.-+++. .+.+.++.|++....... +.|.-. ..+
T Consensus 22 ~~slL~vlId~~p~~Wg~~as~~~~~ti~kvl~aivVFlNAHL~~~~~NrvaViA~~~q~~~~lyp~st~~e~~n~~~~~ 101 (314)
T KOG2487|consen 22 NPSLLVVLIDANPCSWGMLASAENWETISKVLNAIVVFLNAHLAFSRNNRVAVIASHSQVDNYLYPSSTRCEDRNASELD 101 (314)
T ss_pred CceeEEEEEecCcchhhhhhhhcCceeHHHHHHHHHHHHHHHHhhccCCcEEEEEecccccceeccccccCCccCccccC
Confidence 5777899999998 232 1 2344445555555554 457889999988544322 111000 000
Q ss_pred ----c--------ccChhhHHHHHHHHhhhc-CCCC-CCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCc-
Q psy10004 141 ----V--------QANLANVRELKMGVEMIG-DANN-IANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASEN- 205 (354)
Q Consensus 141 ----v--------~~~~~n~~~~~~~i~~l~-~~~G-~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~- 205 (354)
. .++..-.+++.+.+..-. ...| .|-+..|+..|+-.......+. ....-...|+++|=+....
T Consensus 102 ~t~~~~~~y~~~~~~d~tiv~ei~~lm~~~~~~~~~~rt~lagals~~L~yi~~~~ke~-~~~~lkSRilV~t~t~d~~~ 180 (314)
T KOG2487|consen 102 PTRLVLFDYSEFRTVDDTIVEEIYRLMEHPDKYDVGDRTVLAGALSDALGYINRLHKEE-ASEKLKSRILVFTLTRDRAL 180 (314)
T ss_pred chhhhcchhhhhcccchHHHHHHHHHHhCccccccccceeeccchhhccchHhhhhhhh-hhhhhhceEEEEEechHHHh
Confidence 0 000111233333333220 1111 3455555555554443322211 1112234577777765442
Q ss_pred -hHHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHhcCCcEEEEeCChhHH
Q psy10004 206 -YKEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRDVKWMACANKGYYVHLSTLAEV 263 (354)
Q Consensus 206 -~~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~~~~G~~~~i~~~~~~ 263 (354)
+-...+-+=.. ...+++|-++.+|++ ..+|++-+..+||.|-++..++.+
T Consensus 181 qyi~~MNciFaA-----qKq~I~Idv~~l~~~---s~~LqQa~D~TGG~YL~v~~~~gL 231 (314)
T KOG2487|consen 181 QYIPYMNCIFAA-----QKQNIPIDVVSLGGD---SGFLQQACDITGGDYLHVEKPDGL 231 (314)
T ss_pred hhhhHHHHHHHH-----HhcCceeEEEEecCC---chHHHHHHhhcCCeeEecCCcchH
Confidence 22222111000 135789999999964 579999999999999999988765
No 95
>KOG1226|consensus
Probab=92.07 E-value=0.5 Score=49.03 Aligned_cols=124 Identities=17% Similarity=0.119 Sum_probs=68.9
Q ss_pred eeeccCCCceEEEEEeCCCCCCcchHHHHHH---HHHHHHHHcCCCCeEEEEEeecccccc----------ccc------
Q psy10004 75 YINAAASPKDMVILLDNSGSMMGQRREIARH---VINNLLDTLGNNDYVNVLQFTSVCKEV----------VPC------ 135 (354)
Q Consensus 75 y~~a~~~p~dvvillD~SgSM~g~~~~~ak~---a~~~ll~~L~~~d~v~vi~Fs~~~~~~----------~~c------ 135 (354)
|.++...|.|+++|+|-|-||... ++..++ .+..-+..|..+-|++.-.|=+++... -||
T Consensus 125 ~r~a~~yPVDLYyLMDlS~SM~DD-l~~l~~LG~~L~~~m~~lT~nfrlGFGSFVDK~v~P~i~~~pekl~npc~~~~~C 203 (783)
T KOG1226|consen 125 VRQAEDYPVDLYYLMDLSYSMKDD-LENLKSLGTDLAREMRKLTSNFRLGFGSFVDKTVSPYISTTPEKLRNPCPNYKNC 203 (783)
T ss_pred EeeccCCCeeEEEEeecchhhhhh-HHHHHHHHHHHHHHHHHHhccCCccccchhccccccccccCcHHhcCCCCCcccC
Confidence 446667999999999999999864 333443 344445556677788777776654221 111
Q ss_pred ---ccccccccChhhHHHHHHHHhhhcCCCCCCcH-HHHHHHHHHHH--HhccccCCCCCCCcceEEEeccCCC
Q psy10004 136 ---FADILVQANLANVRELKMGVEMIGDANNIANF-TVALTRAFNIL--ENARNDKKTGADCNQAIMVVTDGAS 203 (354)
Q Consensus 136 ---~~~~lv~~~~~n~~~~~~~i~~l~~~~G~T~~-~~aL~~A~~~l--~~~~~~~~~~~~~~~~IillTDG~~ 203 (354)
|.=..+....++...+.+.+.+- +-.|.-+- ..+|.+..+.. ...... .....+.+||.||...
T Consensus 204 ~ppfgfkhvLsLT~~~~~F~~~V~~q-~ISgNlDaPEGGfDAimQaavC~~~IGW---R~~a~~lLVF~td~~~ 273 (783)
T KOG1226|consen 204 APPFGFKHVLSLTNDAEEFNEEVGKQ-RISGNLDAPEGGFDAIMQAAVCTEKIGW---RNDATRLLVFSTDAGF 273 (783)
T ss_pred CCCcccceeeecCCChHHHHHHHhhc-eeccCCCCCCchHHHHHhhhhccccccc---cccceeEEEEEcCcce
Confidence 11012223346778888888754 33332211 11122222222 111222 2346689999999864
No 96
>KOG1986|consensus
Probab=92.07 E-value=11 Score=39.05 Aligned_cols=180 Identities=17% Similarity=0.225 Sum_probs=104.5
Q ss_pred CCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeeccccccc----cc-----cccc------------
Q psy10004 81 SPKDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKEVV----PC-----FADI------------ 139 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~~~----~c-----~~~~------------ 139 (354)
.|--++||||+- |....+..+|.++...++.|.++..|++|+|+..++... .| |++.
T Consensus 120 ~ppvf~fVvDtc--~~eeeL~~LkssL~~~l~lLP~~alvGlItfg~~v~v~el~~~~~sk~~VF~G~ke~s~~q~~~~L 197 (745)
T KOG1986|consen 120 SPPVFVFVVDTC--MDEEELQALKSSLKQSLSLLPENALVGLITFGTMVQVHELGFEECSKSYVFSGNKEYSAKQLLDLL 197 (745)
T ss_pred CCceEEEEEeec--cChHHHHHHHHHHHHHHhhCCCcceEEEEEecceEEEEEcCCCcccceeEEeccccccHHHHHHHh
Confidence 355568999975 666889999999999999999999999999998774321 11 1100
Q ss_pred ------------------ccccChhhHHHHHHHHhhh-----cCCCCCC---cHHHHHHHHHHHHHhccccCCCCCCCcc
Q psy10004 140 ------------------LVQANLANVRELKMGVEMI-----GDANNIA---NFTVALTRAFNILENARNDKKTGADCNQ 193 (354)
Q Consensus 140 ------------------lv~~~~~n~~~~~~~i~~l-----~~~~G~T---~~~~aL~~A~~~l~~~~~~~~~~~~~~~ 193 (354)
+.....+-...+.+.+..+ ..+.|.- -.+.||..|..+|..-. ++...
T Consensus 198 ~~~~~~~~~~~~~~~~~rFL~P~~~c~~~L~~lle~L~~d~wpV~~g~Rp~RcTG~Al~iA~~Ll~~c~------p~~g~ 271 (745)
T KOG1986|consen 198 GLSGGAGKGSENQSASNRFLLPAQECEFKLTNLLEELQPDPWPVPPGHRPLRCTGVALSIASGLLEGCF------PNTGA 271 (745)
T ss_pred cCCcccccCCcccccchhhhccHHHHHHHHHHHHHHhcCCCCCCCCCCCcccchhHHHHHHHHHhcccC------CCCcc
Confidence 0000011112233333333 1123333 25677777777775422 23445
Q ss_pred eEEEeccCCCCc---------------------------hHHHHHHh---hhcccCCCCCCCeEEEEEEeCCCCCCHHHH
Q psy10004 194 AIMVVTDGASEN---------------------------YKEVFEEF---NWRGQNDSTLWPVRVFSYLVGKEVADYRDV 243 (354)
Q Consensus 194 ~IillTDG~~~~---------------------------~~~i~~~~---~~~~~~~~~~~~v~IftigiG~~~~~~~~L 243 (354)
-||+++-|..+. .+..++-| ..+-.. ...-+-||+-++.. . ...+|
T Consensus 272 rIv~f~gGPcT~GpG~vv~~el~~piRshhdi~~d~a~y~kKa~KfY~~La~r~~~--~ghvlDifa~~lDQ-v-Gi~EM 347 (745)
T KOG1986|consen 272 RIVLFAGGPCTRGPGTVVSRELKEPIRSHHDIEKDNAPYYKKAIKFYEKLAERLAN--QGHVLDIFAAALDQ-V-GILEM 347 (745)
T ss_pred eEEEeccCCCCcCCceecchhhcCCCcCcccccCcchHHHHHHHHHHHHHHHHHHh--CCceEeeeeeeccc-c-chHHH
Confidence 688888884220 01111111 101110 12345566666543 2 57799
Q ss_pred HHHHhcCCcEEEEeCChh-HH-HHHHHHHHH
Q psy10004 244 KWMACANKGYYVHLSTLA-EV-RDQILSYVP 272 (354)
Q Consensus 244 ~~iA~~~~G~~~~i~~~~-~~-~~~l~~~~~ 272 (354)
+.++..+||...--.+.+ ++ ...+++++.
T Consensus 348 k~l~~~TGG~lvl~dsF~~s~Fk~sfqR~f~ 378 (745)
T KOG1986|consen 348 KPLVESTGGVLVLGDSFNTSIFKQSFQRIFT 378 (745)
T ss_pred HHHhhcCCcEEEEecccchHHHHHHHHHHhc
Confidence 999999999888877764 33 345677776
No 97
>COG5271 MDN1 AAA ATPase containing von Willebrand factor type A (vWA) domain [General function prediction only]
Probab=91.24 E-value=6.3 Score=45.52 Aligned_cols=123 Identities=15% Similarity=0.171 Sum_probs=71.9
Q ss_pred ceEEEEEeCCCCCCcchH-HHHHHHHHHHHHHcC--CCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhhc
Q psy10004 83 KDMVILLDNSGSMMGQRR-EIARHVINNLLDTLG--NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMIG 159 (354)
Q Consensus 83 ~dvvillD~SgSM~g~~~-~~ak~a~~~ll~~L~--~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l~ 159 (354)
-.|+|-||-|.||+...- ..|-+++.-+-.+|. +--.++|+.|+..++.+.| |...+ ..+ .-.+....++
T Consensus 4393 yqvmisiddsksmses~~~~la~etl~lvtkals~le~g~iav~kfge~~~~lh~-fdkqf---s~e---sg~~~f~~f~ 4465 (4600)
T COG5271 4393 YQVMISIDDSKSMSESGSTVLALETLALVTKALSLLEVGQIAVMKFGEQPELLHP-FDKQF---SSE---SGVQMFSHFT 4465 (4600)
T ss_pred eEEEEEecccccccccCceeeehHHHHHHHHHHHHHhhccEEEEecCCChhhhCc-hhhhh---cch---HHHHHHHhhc
Confidence 468899999999986432 233444433333333 4567899999999988766 43222 111 1222233332
Q ss_pred CCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHHHHHHhh
Q psy10004 160 DANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKEVFEEFN 214 (354)
Q Consensus 160 ~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~i~~~~~ 214 (354)
-....||..+-.....+.+...+... ...-.+.=|+|+||--.+.+.+.+.++
T Consensus 4466 feqs~tnv~~l~~~s~k~f~~a~t~~--h~d~~qleiiisdgicedhdsi~kllr 4518 (4600)
T COG5271 4466 FEQSNTNVLALADASMKCFNYANTAS--HHDIRQLEIIISDGICEDHDSIRKLLR 4518 (4600)
T ss_pred hhcccccHHHHHHHHHHHHHHhhhhc--ccchheeEEEeecCcccchHHHHHHHH
Confidence 33456766544444444444333221 223457889999999998877776664
No 98
>PF02743 Cache_1: Cache domain; InterPro: IPR004010 Cache is an extracellular domain that is predicted to have a role in small-molecule recognition in a wide range of proteins, including the animal dihydropyridine-sensitive voltage-gated Ca2+ channel; alpha-2delta subunit, and various bacterial chemotaxis receptors. The name Cache comes from CAlcium channels and CHEmotaxis receptors. This domain consists of an N-terminal part with three predicted strands and an alpha-helix, and a C-terminal part with a strand dyad followed by a relatively unstructured region. The N-terminal portion of the (unpermuted) Cache domain contains three predicted strands that could form a sheet analogous to that present in the core of the PAS domain structure. Cache domains are particularly widespread in bacteria, with Vibrio cholerae. The animal calcium channel alpha-2delta subunits might have acquired a part of their extracellular domains from a bacterial source []. The Cache domain appears to have arisen from the GAF-PAS fold despite their divergent functions [].; GO: 0016020 membrane; PDB: 3C8C_A 3LIB_D 3LIA_A 3LI8_A 3LI9_A.
Probab=91.22 E-value=0.23 Score=36.80 Aligned_cols=22 Identities=18% Similarity=0.424 Sum_probs=17.2
Q ss_pred CCceEEEEeeeeeecCCceeecc
Q psy10004 329 PYRLLTTVAMPAFDRRPRIIVSI 351 (354)
Q Consensus 329 ~~~l~~tv~~pv~~~~~~~~~~~ 351 (354)
...+++|+|.||++. +...+++
T Consensus 10 ~~~~vi~~s~pi~~~-~g~~~Gv 31 (81)
T PF02743_consen 10 TGQPVITISVPIYDD-DGKIIGV 31 (81)
T ss_dssp TTEEEEEEEEEEEET-TTEEEEE
T ss_pred CCcEEEEEEEEEECC-CCCEEEE
Confidence 567999999999994 4556664
No 99
>PRK15426 putative diguanylate cyclase YedQ; Provisional
Probab=84.23 E-value=1.2 Score=45.49 Aligned_cols=57 Identities=16% Similarity=0.264 Sum_probs=41.9
Q ss_pred hHHHHHHHHHhhcCCCCceeEEeeCCCCceee-cCCCCcccccccCCCCCCCCCccccccccccceeeec
Q psy10004 10 SEALNDIFVSNYQADPSLSWQVFGSTTGALRV-FPAHQWTDFLDLFKPEGRTDNMTDLYDCRVREWYINA 78 (354)
Q Consensus 10 s~~ld~~f~~n~~~~p~l~wqyfgs~~G~~r~-yP~~~~~~~~~~~~~~~~~~~~~~~yd~r~r~wy~~a 78 (354)
...+...|..-.+.+|...+-||.+.+|.+.. ||...... . .-..||+|.||||..+
T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~~-------~-----~~~~~d~~~r~WY~~a 211 (570)
T PRK15426 154 ALELGYLLRLAHNSSSLVERAMYVSRAGFYVSTYPTLFPSD-------V-----PTRYYQYVTQPWFIGQ 211 (570)
T ss_pred HHHHHHHhccccccCCceeEEEEEeCCceEEEeCCCCCccc-------c-----ccccCCcccChHHHhh
Confidence 35577888888888888889988899998765 88533211 0 1235899999999876
No 100
>KOG2326|consensus
Probab=82.79 E-value=28 Score=35.68 Aligned_cols=143 Identities=15% Similarity=0.149 Sum_probs=77.3
Q ss_pred ceEEEEEeCCCCCCc------chHHHHHHHHHHHHHH--c--CCCCeEEEEEeeccccccc----cccccc--ccccChh
Q psy10004 83 KDMVILLDNSGSMMG------QRREIARHVINNLLDT--L--GNNDYVNVLQFTSVCKEVV----PCFADI--LVQANLA 146 (354)
Q Consensus 83 ~dvvillD~SgSM~g------~~~~~ak~a~~~ll~~--L--~~~d~v~vi~Fs~~~~~~~----~c~~~~--lv~~~~~ 146 (354)
...+|++|.+.+|+. ..++.|+..+...+.. + +..|-|+++.|+-.++..- ..|.+. +.+.+..
T Consensus 5 e~ttfilDvG~~Ms~~~~~~~S~fE~a~~y~~~~lsrK~fa~rktD~is~vlyncD~ten~legg~~fqnisvl~p~~tp 84 (669)
T KOG2326|consen 5 ESTTFILDVGPSMSKNNETGKSNFEKAMAYLEYTLSRKSFASRKTDWISCVLYNCDVTENSLEGGNVFQNISVLAPVTTP 84 (669)
T ss_pred cceEEEEecCccccccCCCccccHHHHHHHHHHHHHHHHhhccCCceEEEEEecCCCccCccccccccceeEEeecccch
Confidence 345666699999984 3688888888776644 2 2568999999997654310 111111 1111222
Q ss_pred hHHHHHHHHhhhcCCC-CCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCCchHH--HHHHhhhcccCCCCC
Q psy10004 147 NVRELKMGVEMIGDAN-NIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASENYKE--VFEEFNWRGQNDSTL 223 (354)
Q Consensus 147 n~~~~~~~i~~l~~~~-G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~~~~~--i~~~~~~~~~~~~~~ 223 (354)
.-..+...+......+ -..++..||-..+.++.....-. ..-..+.|+..++|..+..+. +++.+. .
T Consensus 85 af~~l~k~~~~~~qqns~q~Df~gal~vs~dL~~qhe~~~--k~~~kr~Il~~~~l~~dfsd~~~ive~l~--------~ 154 (669)
T KOG2326|consen 85 AFIGLIKRLKQYCQQNSHQSDFEGALSVSQDLLVQHEDIK--KQFQKRKILKQIVLFTDFSDDLFIVEDLT--------D 154 (669)
T ss_pred hhHHHHHHHHHhcCCCccccchhhhHHHHHHHHHHHHhcc--chhhceEEEEeecccccchhhHHHHHHHh--------h
Confidence 2233444444331221 23468888888888665432211 112234455555555543333 444442 3
Q ss_pred CCeEEEEEEeCC
Q psy10004 224 WPVRVFSYLVGK 235 (354)
Q Consensus 224 ~~v~IftigiG~ 235 (354)
..|-+-++|+..
T Consensus 155 ~didL~~~gldf 166 (669)
T KOG2326|consen 155 EDIDLLTEGLDF 166 (669)
T ss_pred cCcceeEeeccC
Confidence 467777777754
No 101
>PF01882 DUF58: Protein of unknown function DUF58; InterPro: IPR002881 This domain is found in a family of prokaryotic proteins that have no known function. Proteins belonging to this family include hypothetical proteins from eubacteria and archaebacteria. Some of these proteins also contain the Von Willebrand factor, type A domain (see IPR002035 from INTERPRO).
Probab=73.86 E-value=7.4 Score=28.92 Aligned_cols=42 Identities=26% Similarity=0.187 Sum_probs=31.3
Q ss_pred CCCceEEEEEeCCCCCCc-----chHHHHHHHHHHHHHHc-CCCCeEE
Q psy10004 80 ASPKDMVILLDNSGSMMG-----QRREIARHVINNLLDTL-GNNDYVN 121 (354)
Q Consensus 80 ~~p~dvvillD~SgSM~g-----~~~~~ak~a~~~ll~~L-~~~d~v~ 121 (354)
....+++|++|.+++|.. ..++.+...+..++..+ ..++.|+
T Consensus 38 ~~~~~~~i~ld~~~~~~~~~~~~~~~e~~l~~a~~l~~~~~~~g~~v~ 85 (86)
T PF01882_consen 38 ERSQPVWIVLDLSPSMYFGSNGRSKFERALSAAASLANQALRQGDPVG 85 (86)
T ss_pred ccCCcEEEEEECCCccccCcCCCCHHHHHHHHHHHHHHHHHhcCCccc
Confidence 356889999999999985 67888888777776654 3555543
No 102
>TIGR02921 PEP_integral PEP-CTERM family integral membrane protein. Members of this protein family, found in three different species so far, have a PEP-CTERM sequence at the carboxyl-terminus (see model TIGR02595), but are unusual among PEP-CTERM proteins in having multiple predicted transmembrane segments. The function is unknown. It is proposed that a member of the EpsH family, to be designated exosortase (see TIGR02602), recognizes and cleaves PEP-CTERM proteins in a manner analogous to the cleavage of LPXTG proteins by sortase (see Haft, et al., 2006).
Probab=70.52 E-value=23 Score=36.39 Aligned_cols=59 Identities=10% Similarity=0.170 Sum_probs=31.6
Q ss_pred CCCcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEEEEEEeCCCC---CCHHHHHHHHhcCCcEEE
Q psy10004 189 ADCNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRVFSYLVGKEV---ADYRDVKWMACANKGYYV 255 (354)
Q Consensus 189 ~~~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~IftigiG~~~---~~~~~L~~iA~~~~G~~~ 255 (354)
.....+++|+||...-..+.-+.. . +.....+.-+-+|... .+...++.+-..+||...
T Consensus 725 ~~~ydav~llsd~gsyel~~~~~~-~-------~~~~~p~wlvhlg~tl~~ay~d~~i~~l~~s~ggva~ 786 (952)
T TIGR02921 725 DQAYDAVLLLSDAGSYELEKNFKL-K-------AAQMAPLWLVHLGETLAGAYHDGIIDLLKDSGGGVAL 786 (952)
T ss_pred ccccceEEEeccCcchhhhhcccc-c-------cCCCCceEEEecCccccccccchHHHHHHhcCCCeEe
Confidence 345578999999875433222211 1 1234456666666531 234456666666666554
No 103
>KOG0071|consensus
Probab=61.83 E-value=13 Score=31.08 Aligned_cols=43 Identities=14% Similarity=0.367 Sum_probs=32.1
Q ss_pred cccceeeeccCCCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcC
Q psy10004 70 RVREWYINAAASPKDMVILLDNSGSMMGQRREIARHVINNLLDTLG 115 (354)
Q Consensus 70 r~r~wy~~a~~~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~ 115 (354)
+.||.+..--+....++||+|.+.| .+++.|++-+..++..-.
T Consensus 73 ~iRplWrhYy~gtqglIFV~Dsa~~---dr~eeAr~ELh~ii~~~e 115 (180)
T KOG0071|consen 73 KIRPLWRHYYTGTQGLIFVVDSADR---DRIEEARNELHRIINDRE 115 (180)
T ss_pred hhhHHHHhhccCCceEEEEEeccch---hhHHHHHHHHHHHhCCHh
Confidence 4555444333467889999999887 789999999998886533
No 104
>PF08496 Peptidase_S49_N: Peptidase family S49 N-terminal; InterPro: IPR013703 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found to the N terminus of bacterial signal peptidases that belong to the MEROPS peptidase family S49 (protease IV family, clan SK) (see also IPR002142 from INTERPRO) [, ]. ; GO: 0004252 serine-type endopeptidase activity, 0005886 plasma membrane
Probab=51.57 E-value=36 Score=28.78 Aligned_cols=43 Identities=28% Similarity=0.322 Sum_probs=38.0
Q ss_pred CceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEE
Q psy10004 82 PKDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQ 124 (354)
Q Consensus 82 p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~ 124 (354)
++.-+||||-.|+|.....+..++-+..+|..-.++|.|=|-.
T Consensus 96 ~~~r~~VldF~Gdi~A~~v~~LReeisail~~a~~~DeV~~rL 138 (155)
T PF08496_consen 96 PKPRLFVLDFKGDIKASEVESLREEISAILSVATPEDEVLVRL 138 (155)
T ss_pred CCCeEEEEecCCCccHHHHHHHHHHHHHHHHhCCCCCeEEEEE
Confidence 5677999999999999999999999999999999999975543
No 105
>TIGR01204 bioW 6-carboxyhexanoate--CoA ligase. Alternate name: pimeloyl-CoA synthase.
Probab=45.45 E-value=12 Score=33.80 Aligned_cols=35 Identities=23% Similarity=0.396 Sum_probs=30.2
Q ss_pred ChhhhhhcchHHHHHHHHHhhcCCCCceeEEeeCCC-CceeecCCCCc
Q psy10004 1 EEVLRVAKWSEALNDIFVSNYQADPSLSWQVFGSTT-GALRVFPAHQW 47 (354)
Q Consensus 1 ~~v~~~i~ws~~ld~~f~~n~~~~p~l~wqyfgs~~-G~~r~yP~~~~ 47 (354)
|.|+-.+|||+ ||.+.=-|+++.. |++|+-|--..
T Consensus 165 pgvvAElC~SD------------DP~YtTGYVA~~~~gY~RI~~lK~~ 200 (232)
T TIGR01204 165 PAVVAELCWSD------------DPDYVTGYVSGKEIGYVRITPLKEK 200 (232)
T ss_pred CCeEEEEEecC------------CCCCeeEEEEeCCCCeEeCcccccc
Confidence 67888999997 8999999999998 99998886544
No 106
>PRK01322 6-carboxyhexanoate--CoA ligase; Provisional
Probab=41.94 E-value=14 Score=33.49 Aligned_cols=35 Identities=20% Similarity=0.311 Sum_probs=29.9
Q ss_pred ChhhhhhcchHHHHHHHHHhhcCCCCceeEEeeCCC-CceeecCCCCc
Q psy10004 1 EEVLRVAKWSEALNDIFVSNYQADPSLSWQVFGSTT-GALRVFPAHQW 47 (354)
Q Consensus 1 ~~v~~~i~ws~~ld~~f~~n~~~~p~l~wqyfgs~~-G~~r~yP~~~~ 47 (354)
|.|+-.+|||+ ||.+.=-|+++.. |++|+-|--..
T Consensus 172 pgivAElC~SD------------DP~YtTGYVA~~~~gY~RI~~mK~~ 207 (242)
T PRK01322 172 PGVIAELCWSD------------DPDYTTGYVATKKLGYHRITNLKEE 207 (242)
T ss_pred CCeEEEEEecC------------CCCCeeEEEEeCCCCeEeCcccccc
Confidence 56888899997 8999999999999 99998886444
No 107
>PF03744 BioW: 6-carboxyhexanoate--CoA ligase; InterPro: IPR005499 This family contains the enzyme 6-carboxyhexanoate--CoA ligase 6.2.1.14 from EC. This enzyme is involved in the first step of biotin synthesis, where it converts pimelate into pimeloyl-CoA []. The enzyme requires magnesium as a cofactor and forms a homodimer [].; GO: 0009102 biotin biosynthetic process
Probab=41.63 E-value=15 Score=33.42 Aligned_cols=35 Identities=20% Similarity=0.326 Sum_probs=30.4
Q ss_pred ChhhhhhcchHHHHHHHHHhhcCCCCceeEEeeCCC-CceeecCCCCc
Q psy10004 1 EEVLRVAKWSEALNDIFVSNYQADPSLSWQVFGSTT-GALRVFPAHQW 47 (354)
Q Consensus 1 ~~v~~~i~ws~~ld~~f~~n~~~~p~l~wqyfgs~~-G~~r~yP~~~~ 47 (354)
|.|+-.+|||+ ||.+.=-|+++.. |++|+.|--..
T Consensus 172 PgiiaELC~SD------------DP~YtTGYVA~~~~GY~RI~~mK~~ 207 (239)
T PF03744_consen 172 PGIIAELCWSD------------DPDYTTGYVASKKFGYHRITNMKEA 207 (239)
T ss_pred CcceEEEEecC------------CCCCceEEEEeCCcceEECcccccc
Confidence 67888999997 8999999999988 99999887544
No 108
>PF13778 DUF4174: Domain of unknown function (DUF4174)
Probab=37.48 E-value=1.7e+02 Score=23.27 Aligned_cols=70 Identities=13% Similarity=0.105 Sum_probs=40.4
Q ss_pred CceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcC-CCCeEEEEEeecccccccccccccccccChhhHHHHHHHHhhh
Q psy10004 82 PKDMVILLDNSGSMMGQRREIARHVINNLLDTLG-NNDYVNVLQFTSVCKEVVPCFADILVQANLANVRELKMGVEMI 158 (354)
Q Consensus 82 p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~-~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~~~~~i~~l 158 (354)
-+||+++.++...........-...+..+...+. +.+.|+++.-+.+... .+....+-+.+++.+.|+++
T Consensus 41 eRdi~v~~i~~~~~~~~~~~~~~~~~~~lr~~l~~~~~~f~~vLiGKDG~v-------K~r~~~p~~~~~lf~~ID~M 111 (118)
T PF13778_consen 41 ERDIVVIVITGDGARSPGKPLSPEDIQALRKRLRIPPGGFTVVLIGKDGGV-------KLRWPEPIDPEELFDTIDAM 111 (118)
T ss_pred cCceEEEEEeCCccccccCcCCHHHHHHHHHHhCCCCCceEEEEEeCCCcE-------EEecCCCCCHHHHHHHHhCC
Confidence 4678777765554443222233344455556665 3455777776666543 23334455678888888877
No 109
>PF06415 iPGM_N: BPG-independent PGAM N-terminus (iPGM_N); InterPro: IPR011258 This family represents the N-terminal region of the 2,3-bisphosphoglycerate-independent phosphoglycerate mutase (or phosphoglyceromutase or BPG-independent PGAM) protein (5.4.2.1 from EC). The family is found in conjunction with Metalloenzyme (located in the C-terminal region of the protein). ; GO: 0004619 phosphoglycerate mutase activity, 0030145 manganese ion binding, 0006007 glucose catabolic process, 0005737 cytoplasm; PDB: 1EQJ_A 1EJJ_A 1O99_A 1O98_A 3IGZ_B 3IGY_B 3NVL_A 2IFY_A.
Probab=37.28 E-value=2e+02 Score=25.96 Aligned_cols=44 Identities=16% Similarity=0.279 Sum_probs=18.9
Q ss_pred CcceEEEeccCCCCch-HHHHHHhhh-cccCCCCCCCeEEEEEEeCCCC
Q psy10004 191 CNQAIMVVTDGASENY-KEVFEEFNW-RGQNDSTLWPVRVFSYLVGKEV 237 (354)
Q Consensus 191 ~~~~IillTDG~~~~~-~~i~~~~~~-~~~~~~~~~~v~IftigiG~~~ 237 (354)
+...+=|+|||..-.. ..+...++. +.. .-..|.|++|.=|.+.
T Consensus 29 ~lHl~GLlSdGGVHSh~~Hl~al~~~a~~~---gv~~V~vH~f~DGRDt 74 (223)
T PF06415_consen 29 RLHLMGLLSDGGVHSHIDHLFALIKLAKKQ---GVKKVYVHAFTDGRDT 74 (223)
T ss_dssp -EEEEEEESS-SSS--HHHHHHHHHHHHHT---T-SEEEEEEEE-SSSS
T ss_pred eEEEEEEecCCCccccHHHHHHHHHHHHHc---CCCEEEEEEecCCCCC
Confidence 4466777777776533 222222211 111 1135667777767664
No 110
>COG5047 SEC23 Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion]
Probab=33.07 E-value=83 Score=32.33 Aligned_cols=48 Identities=10% Similarity=0.151 Sum_probs=41.3
Q ss_pred CceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeeccccc
Q psy10004 82 PKDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVCKE 131 (354)
Q Consensus 82 p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~~~ 131 (354)
|.-..||+|.-. .+..+...|.++..-+..|.++--+++|+|+..++.
T Consensus 122 ppvf~fvvD~~~--D~e~l~~LkdslivslsllppeaLvglItygt~i~v 169 (755)
T COG5047 122 PPVFFFVVDACC--DEEELTALKDSLIVSLSLLPPEALVGLITYGTSIQV 169 (755)
T ss_pred CceEEEEEEeec--CHHHHHHHHHHHHHHHhcCCccceeeEEEecceeEE
Confidence 345678999876 788999999999999999999999999999987643
No 111
>KOG2327|consensus
Probab=32.78 E-value=3.3e+02 Score=28.15 Aligned_cols=117 Identities=17% Similarity=0.189 Sum_probs=58.3
Q ss_pred CceEEEEEeCCCCCCcc-hHHHHHH-------HHHHHHHH---cCCCCeEEEEEeecccccccccccccccccChhhHHH
Q psy10004 82 PKDMVILLDNSGSMMGQ-RREIARH-------VINNLLDT---LGNNDYVNVLQFTSVCKEVVPCFADILVQANLANVRE 150 (354)
Q Consensus 82 p~dvvillD~SgSM~g~-~~~~ak~-------a~~~ll~~---L~~~d~v~vi~Fs~~~~~~~~c~~~~lv~~~~~n~~~ 150 (354)
..-+.|++|.|.||.+. ..+..+. .+..+.-. .++.+.+++..++..-.....-.. .+.....-+...
T Consensus 18 ~~~ilfvi~~~~s~~~~~~~e~~lspl~~~L~~~~~l~~~~vitn~~~~~~v~~y~~~~~~~~~~~~-~l~~l~d~~~~~ 96 (602)
T KOG2327|consen 18 KEAILFVIDVNPSMKAEEPDEFKLSPLKMILDCIDRLCIQLVITNPIDSVGVLFYGTEETEGLENNT-LLFPLGDLGQEE 96 (602)
T ss_pred ccceEEEEecCHHhhccCcccchhhhHHHHHHHHHHHHhheeecCCCCccceEeecccccccCccce-EEeeccccChHH
Confidence 34578999999999872 2222222 22222222 246777888877765433221111 111111111111
Q ss_pred HHHHHh-----------hh-cCCCCCCcHHHHHHHHHHHHHhccccCCCCCCCcceEEEeccCCCC
Q psy10004 151 LKMGVE-----------MI-GDANNIANFTVALTRAFNILENARNDKKTGADCNQAIMVVTDGASE 204 (354)
Q Consensus 151 ~~~~i~-----------~l-~~~~G~T~~~~aL~~A~~~l~~~~~~~~~~~~~~~~IillTDG~~~ 204 (354)
++...+ ++ +.+.-.+.+..-|....+++..... ...++.|+++||-...
T Consensus 97 ~~k~~~~~e~~~q~~~~~~~~~~~~~s~ls~vl~~c~~~~~~~~~-----~~~~krv~l~Td~d~P 157 (602)
T KOG2327|consen 97 VKKILELFEEENQLSAVNFYGGMHQKSDLSNVLNYCKRMVFASQK-----KLSNKRVFLFTDNDNP 157 (602)
T ss_pred HHHHHHHhhhhhhhhhhhccCcccccccHHHHHHHHHHHHHHHhh-----hcccceEEEEecCCCc
Confidence 111111 11 1122234677788887776654322 3467889999998654
No 112
>PF10221 DUF2151: Cell cycle and development regulator; InterPro: IPR019355 This entry represents the cell cycle regulator Mat89b, which plays an evolutionarily conserved role as a crucial regulator of both cell cycle and development []. Mat89Bb is a PNG kinase substrate that is essential for S-M cycles of early Drosophila embryogenesis, Xenopus embryonic cell cycles and morphogenesis, and cell division in cultured mammalian cells.
Probab=32.60 E-value=3.3e+02 Score=28.96 Aligned_cols=91 Identities=11% Similarity=0.130 Sum_probs=57.1
Q ss_pred eEEEEEeCCCCCCc-----------------------chHHHHHHHHHHH---HHH-cCCCCeEEEEEeecccccccccc
Q psy10004 84 DMVILLDNSGSMMG-----------------------QRREIARHVINNL---LDT-LGNNDYVNVLQFTSVCKEVVPCF 136 (354)
Q Consensus 84 dvvillD~SgSM~g-----------------------~~~~~ak~a~~~l---l~~-L~~~d~v~vi~Fs~~~~~~~~c~ 136 (354)
-.|||||.|.-|.- .-|..+.+++... +-. ++.+-.+.++.+...++.+-. |
T Consensus 7 KTVfVLDh~p~f~~ss~~~i~~d~~~~~~~~~~~i~KSLWTc~vEa~~EYcRIV~DlFP~~k~IrfivsD~~a~~lnt-W 85 (695)
T PF10221_consen 7 KTVFVLDHSPYFAESSNQPIDFDIVKKSRQQKAPISKSLWTCAVEASIEYCRIVWDLFPDGKLIRFIVSDTAAHILNT-W 85 (695)
T ss_pred cEEEEEcCCchhhhhccCcEEEeeecCCCCCcCcccchHHHHHHHHHHHHHHHHhhccCCCceEEEEEEccccccccC-c
Confidence 35677777777751 2355555555443 333 345556777777777766433 5
Q ss_pred cccccccChhhHHHHHHHHhhhcCCC----CCC--cHHHHHHHHHHHHHh
Q psy10004 137 ADILVQANLANVRELKMGVEMIGDAN----NIA--NFTVALTRAFNILEN 180 (354)
Q Consensus 137 ~~~lv~~~~~n~~~~~~~i~~l~~~~----G~T--~~~~aL~~A~~~l~~ 180 (354)
.. ...++..+.+.+..++.|. ..+ .+..||..|++.|..
T Consensus 86 ~~-----~~Qsl~~L~~~la~vG~P~~~~~~~~d~svi~GL~~AIEaL~e 130 (695)
T PF10221_consen 86 ST-----SQQSLSHLMNALATVGPPPRSDPENSDYSVIHGLRMAIEALAE 130 (695)
T ss_pred Ch-----hhccHHHHHHHHHhcCCCCCCCcccccchhHHHHHHHHHHHhc
Confidence 42 3357788888888885552 222 578999999997753
No 113
>PF13829 DUF4191: Domain of unknown function (DUF4191)
Probab=31.22 E-value=82 Score=28.35 Aligned_cols=55 Identities=13% Similarity=0.203 Sum_probs=36.4
Q ss_pred CcceEEEeccCCCCchHHHHHHhhhcccCCCCCCCeEEEEEEeCCCCCCHHHHHHHHh
Q psy10004 191 CNQAIMVVTDGASENYKEVFEEFNWRGQNDSTLWPVRVFSYLVGKEVADYRDVKWMAC 248 (354)
Q Consensus 191 ~~~~IillTDG~~~~~~~i~~~~~~~~~~~~~~~~v~IftigiG~~~~~~~~L~~iA~ 248 (354)
..+-|||++.|.++-....+++-+ +...+ --.++.||.|-+|++. +.-.|+++..
T Consensus 121 GrpGVvLVgEG~~~Rv~~Ll~~E~-krv~R-V~~~vPV~~i~vG~ge-gQVpL~kL~~ 175 (224)
T PF13829_consen 121 GRPGVVLVGEGPPSRVKKLLAQEK-KRVAR-VVGNVPVHDIIVGNGE-GQVPLRKLQK 175 (224)
T ss_pred CCCcEEEEecCCHHHHHHHHHHHH-HHhcc-ccCCCCeEEEEecCCC-CceeHHHHHH
Confidence 346699999999987766665533 12111 2347889999999875 5555655544
No 114
>KOG0070|consensus
Probab=29.43 E-value=1.1e+02 Score=26.69 Aligned_cols=45 Identities=18% Similarity=0.293 Sum_probs=32.3
Q ss_pred CCceEEEEEeCCCCCCcchHHHHHHHHHHHHHHcCCCCeEEEEEeeccc
Q psy10004 81 SPKDMVILLDNSGSMMGQRREIARHVINNLLDTLGNNDYVNVLQFTSVC 129 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~L~~~d~v~vi~Fs~~~ 129 (354)
.-.-+|||+|.+. -.|+..+|+.+..++..-. -..+.++.|+++-
T Consensus 84 ~t~~lIfVvDS~D---r~Ri~eak~eL~~~l~~~~-l~~~~llv~aNKq 128 (181)
T KOG0070|consen 84 NTQGLIFVVDSSD---RERIEEAKEELHRMLAEPE-LRNAPLLVFANKQ 128 (181)
T ss_pred CCcEEEEEEeCCc---HHHHHHHHHHHHHHHcCcc-cCCceEEEEechh
Confidence 4567999999876 4578888888877775533 3456777887764
No 115
>PF00025 Arf: ADP-ribosylation factor family The prints entry specific to Sar1 proteins The Prosite entry specific to Sar1 proteins; InterPro: IPR006689 Small GTPases form an independent superfamily within the larger class of regulatory GTP hydrolases. This superfamily contains proteins that control a vast number of important processes and possess a common, structurally preserved GTP-binding domain [, ]. Sequence comparisons of small G proteins from various species have revealed that they are conserved in primary structures at the level of 30-55% similarity []. Crystallographic analysis of various small G proteins revealed the presence of a 20 kDa catalytic domain that is unique for the whole superfamily [, ]. The domain is built of five alpha helices (A1-A5), six beta-strands (B1-B6) and five polypeptide loops (G1-G5). A structural comparison of the GTP- and GDP-bound form, allows one to distinguish two functional loop regions: switch I and switch II that surround the gamma-phosphate group of the nucleotide. The G1 loop (also called the P-loop) that connects the B1 strand and the A1 helix is responsible for the binding of the phosphate groups. The G3 loop provides residues for Mg(2+) and phosphate binding and is located at the N terminus of the A2 helix. The G1 and G3 loops are sequentially similar to Walker A and Walker B boxes that are found in other nucleotide binding motifs. The G2 loop connects the A1 helix and the B2 strand and contains a conserved Thr residue responsible for Mg(2+) binding. The guanine base is recognised by the G4 and G5 loops. The consensus sequence NKXD of the G4 loop contains Lys and Asp residues directly interacting with the nucleotide. Part of the G5 loop located between B6 and A5 acts as a recognition site for the guanine base []. The small GTPase superfamily can be divided into at least 8 different families, including: Arf small GTPases. GTP-binding proteins involved in protein trafficking by modulating vesicle budding and uncoating within the Golgi apparatus. Ran small GTPases. GTP-binding proteins involved in nucleocytoplasmic transport. Required for the import of proteins into the nucleus and also for RNA export. Rab small GTPases. GTP-binding proteins involved in vesicular traffic. Rho small GTPases. GTP-binding proteins that control cytoskeleton reorganisation. Ras small GTPases. GTP-binding proteins involved in signalling pathways. Sar1 small GTPases. Small GTPase component of the coat protein complex II (COPII) which promotes the formation of transport vesicles from the endoplasmic reticulum (ER). Mitochondrial Rho (Miro). Small GTPase domain found in mitochondrial proteins involved in mitochondrial trafficking. Roc small GTPases domain. Small GTPase domain always found associated with the COR domain. This entry represents a branch of the small GTPase superfamily that includes the ADP ribosylation factor Arf, Arl (Arf-like), Arp (Arf-related proteins) and the remotely related Sar (Secretion-associated and Ras-related) proteins. Arf proteins are major regulators of vesicle biogenesis in intracellular traffic []. They cycle between inactive GDP-bound and active GTP-bound forms that bind selectively to effectors. The classical structural GDP/GTP switch is characterised by conformational changes at the so-called switch 1 and switch 2 regions, which bind tightly to the gamma-phosphate of GTP but poorly or not at all to the GDP nucleotide. Structural studies of Arf1 and Arf6 have revealed that although these proteins feature the switch 1 and 2 conformational changes, they depart from other small GTP-binding proteins in that they use an additional, unique switch to propagate structural information from one side of the protein to the other. The GDP/GTP structural cycles of human Arf1 and Arf6 feature a unique conformational change that affects the beta2-beta3 strands connecting switch 1 and switch 2 (interswitch) and also the amphipathic helical N terminus. In GDP-bound Arf1 and Arf6, the interswitch is retracted and forms a pocket to which the N-terminal helix binds, the latter serving as a molecular hasp to maintain the inactive conformation. In the GTP-bound form of these proteins, the interswitch undergoes a two-residue register shift that pulls switch 1 and switch 2 up, restoring an active conformation that can bind GTP. In this conformation, the interswitch projects out of the protein and extrudes the N-terminal hasp by occluding its binding pocket.; GO: 0005525 GTP binding; PDB: 2H57_B 2W83_B 3N5C_B 2J5X_A 3LVR_E 2BAO_A 3LVQ_E 2A5F_A 3PCR_B 1E0S_A ....
Probab=25.08 E-value=1.6e+02 Score=24.88 Aligned_cols=30 Identities=20% Similarity=0.443 Sum_probs=23.5
Q ss_pred CCceEEEEEeCCCCCCcchHHHHHHHHHHHHHH
Q psy10004 81 SPKDMVILLDNSGSMMGQRREIARHVINNLLDT 113 (354)
Q Consensus 81 ~p~dvvillD~SgSM~g~~~~~ak~a~~~ll~~ 113 (354)
...-++||+|.|. ..++..++..+..++..
T Consensus 81 ~~~~iIfVvDssd---~~~l~e~~~~L~~ll~~ 110 (175)
T PF00025_consen 81 NADGIIFVVDSSD---PERLQEAKEELKELLND 110 (175)
T ss_dssp TESEEEEEEETTG---GGGHHHHHHHHHHHHTS
T ss_pred ccceeEEEEeccc---ceeecccccchhhhcch
Confidence 3457999999885 34788888888888865
No 116
>smart00467 GS GS motif. Aa approx. 30 amino acid motif that precedes the kinase domain in types I and II TGF beta receptors. Mutation of two or more of the serines or threonines in the TTSGSGSG of TGF-beta type I receptor impairs phosphorylation and signaling activity.
Probab=25.04 E-value=58 Score=19.48 Aligned_cols=14 Identities=36% Similarity=0.318 Sum_probs=10.3
Q ss_pred CCCCCCcchHHHHH
Q psy10004 91 NSGSMMGQRREIAR 104 (354)
Q Consensus 91 ~SgSM~g~~~~~ak 104 (354)
+|||++|.++-.-+
T Consensus 11 tSGSGSG~p~LvqR 24 (30)
T smart00467 11 TSGSGSGLPLLVQR 24 (30)
T ss_pred cCCCCCCchhHHHH
Confidence 89999997654333
No 117
>PF09439 SRPRB: Signal recognition particle receptor beta subunit; InterPro: IPR019009 The signal recognition particle (SRP) is a multimeric protein, which along with its conjugate receptor (SR), is involved in targeting secretory proteins to the rough endoplasmic reticulum (RER) membrane in eukaryotes, or to the plasma membrane in prokaryotes [, ]. SRP recognises the signal sequence of the nascent polypeptide on the ribosome, retards its elongation, and docks the SRP-ribosome-polypeptide complex to the RER membrane via the SR receptor. Eukaryotic SRP consists of six polypeptides (SRP9, SRP14, SRP19, SRP54, SRP68 and SRP72) and a single 300 nucleotide 7S RNA molecule. The RNA component catalyses the interaction of SRP with its SR receptor []. In higher eukaryotes, the SRP complex consists of the Alu domain and the S domain linked by the SRP RNA. The Alu domain consists of a heterodimer of SRP9 and SRP14 bound to the 5' and 3' terminal sequences of SRP RNA. This domain is necessary for retarding the elongation of the nascent polypeptide chain, which gives SRP time to dock the ribosome-polypeptide complex to the RER membrane. In archaea, the SRP complex contains 7S RNA like its eukaryotic counterpart, yet only includes two of the six protein subunits found in the eukarytic complex: SRP19 and SRP54 []. The SR receptor is a monomer consisting of the loosely membrane-associated SR-alpha homologue FtsY, while the eukaryotic SR receptor is a heterodimer of SR-alpha (70 kDa) and SR-beta (25 kDa), both of which contain a GTP-binding domain []. SR-alpha regulates the targeting of SRP-ribosome-nascent polypeptide complexes to the translocon []. SR-alpha binds to the SRP54 subunit of the SRP complex. The SR-beta subunit is a transmembrane GTPase that anchors the SR-alpha subunit (a peripheral membrane GTPase) to the ER membrane []. SR-beta interacts with the N-terminal SRX-domain of SR-alpha, which is not present in the bacterial FtsY homologue. SR-beta also functions in recruiting the SRP-nascent polypeptide to the protein-conducting channel. The beta subunit of the signal recognition particle receptor (SRP) is a transmembrane GTPase, which anchors the alpha subunit to the endoplasmic reticulum membrane []. ; PDB: 2GED_B 1NRJ_B 2GO5_2 2FH5_B.
Probab=21.93 E-value=1.3e+02 Score=26.17 Aligned_cols=23 Identities=22% Similarity=0.512 Sum_probs=16.1
Q ss_pred cccceeeec---cCCCceEEEEEeCC
Q psy10004 70 RVREWYINA---AASPKDMVILLDNS 92 (354)
Q Consensus 70 r~r~wy~~a---~~~p~dvvillD~S 92 (354)
|.|..+.+. ....+-||||||.|
T Consensus 61 rlr~~~~~~~~~~~~~k~IIfvvDSs 86 (181)
T PF09439_consen 61 RLRSKLLDELKYLSNAKGIIFVVDSS 86 (181)
T ss_dssp CCCHHHHHHHHHHGGEEEEEEEEETT
T ss_pred HHHHHHHHhhhchhhCCEEEEEEeCc
Confidence 445555544 45689999999976
Done!