Query 001711
Match_columns 1021
No_of_seqs 255 out of 767
Neff 6.3
Searched_HMMs 46136
Date Fri Mar 29 07:30:09 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/001711.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/001711hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1984 Vesicle coat complex C 100.0 5E-184 1E-188 1584.2 85.7 730 277-1020 236-1005(1007)
2 KOG1985 Vesicle coat complex C 100.0 2E-165 3E-170 1430.8 74.5 711 304-1019 160-887 (887)
3 PTZ00395 Sec24-related protein 100.0 7E-152 2E-156 1365.6 69.0 721 278-1020 599-1556(1560)
4 COG5028 Vesicle coat complex C 100.0 1E-150 3E-155 1296.1 67.8 706 299-1019 132-861 (861)
5 PLN00162 transport protein sec 100.0 2E-120 3E-125 1115.3 69.7 656 312-1019 7-760 (761)
6 KOG1986 Vesicle coat complex C 100.0 4E-90 8.8E-95 791.5 53.9 653 312-1019 7-743 (745)
7 COG5047 SEC23 Vesicle coat com 100.0 9.8E-83 2.1E-87 712.4 42.5 661 311-1020 6-754 (755)
8 cd01479 Sec24-like Sec24-like: 100.0 4.5E-54 9.8E-59 466.0 25.8 241 425-666 1-244 (244)
9 cd01468 trunk_domain trunk dom 100.0 5.9E-50 1.3E-54 433.2 25.7 235 425-660 1-239 (239)
10 PF04811 Sec23_trunk: Sec23/Se 100.0 9E-50 1.9E-54 432.7 21.8 237 425-662 1-243 (243)
11 cd01478 Sec23-like Sec23-like: 100.0 2E-44 4.3E-49 394.5 20.6 225 425-654 1-265 (267)
12 PF04815 Sec23_helical: Sec23/ 99.9 2.1E-21 4.5E-26 183.8 11.6 103 763-868 1-103 (103)
13 PF08033 Sec23_BS: Sec23/Sec24 99.8 1.7E-20 3.7E-25 175.3 11.0 85 667-751 1-96 (96)
14 PF04810 zf-Sec23_Sec24: Sec23 99.2 6E-12 1.3E-16 98.6 1.8 35 354-388 6-40 (40)
15 PRK13685 hypothetical protein; 98.8 3.7E-07 8E-12 104.0 19.7 174 427-661 88-289 (326)
16 cd01453 vWA_transcription_fact 98.7 5.7E-07 1.2E-11 94.1 17.4 163 429-660 5-177 (183)
17 cd01467 vWA_BatA_type VWA BatA 98.5 3.5E-06 7.7E-11 86.9 16.7 154 429-643 4-175 (180)
18 cd01466 vWA_C3HC4_type VWA C3H 98.5 1.8E-06 4E-11 87.6 14.1 147 430-642 3-154 (155)
19 cd01465 vWA_subgroup VWA subgr 98.5 3.5E-06 7.6E-11 85.8 16.2 155 430-644 3-162 (170)
20 cd01463 vWA_VGCC_like VWA Volt 98.5 5E-06 1.1E-10 87.2 17.6 163 426-644 12-188 (190)
21 cd01451 vWA_Magnesium_chelatas 98.5 4.1E-06 8.9E-11 87.0 16.6 160 429-647 2-169 (178)
22 cd01456 vWA_ywmD_type VWA ywmD 98.5 3E-06 6.4E-11 90.0 15.3 174 423-639 16-196 (206)
23 TIGR00868 hCaCC calcium-activa 98.4 2.5E-05 5.4E-10 97.8 24.3 167 428-662 305-477 (863)
24 TIGR03788 marine_srt_targ mari 98.3 0.00045 9.7E-09 85.2 32.3 284 424-803 268-556 (596)
25 cd01474 vWA_ATR ATR (Anthrax T 98.3 2.3E-05 5E-10 81.8 17.6 167 429-662 6-181 (185)
26 PF13519 VWA_2: von Willebrand 98.3 1E-05 2.2E-10 81.7 13.2 151 430-643 2-159 (172)
27 cd01472 vWA_collagen von Wille 98.3 2.8E-05 6E-10 79.4 16.0 151 430-644 3-163 (164)
28 TIGR03436 acidobact_VWFA VWFA- 98.2 7.4E-05 1.6E-09 83.9 19.9 158 426-642 52-238 (296)
29 cd01470 vWA_complement_factors 98.2 4.4E-05 9.6E-10 80.5 15.9 167 430-645 3-190 (198)
30 cd01461 vWA_interalpha_trypsin 98.2 0.00012 2.5E-09 74.6 18.3 157 427-644 2-161 (171)
31 cd01452 VWA_26S_proteasome_sub 98.1 8E-05 1.7E-09 78.2 15.4 142 429-634 5-160 (187)
32 cd01480 vWA_collagen_alpha_1-V 98.0 0.00011 2.4E-09 76.9 14.9 157 429-646 4-173 (186)
33 PF00626 Gelsolin: Gelsolin re 98.0 6.7E-06 1.4E-10 73.0 4.5 66 892-983 4-70 (76)
34 PF13768 VWA_3: von Willebrand 98.0 0.00011 2.4E-09 74.2 13.6 150 430-641 3-155 (155)
35 cd01475 vWA_Matrilin VWA_Matri 97.9 0.0002 4.3E-09 77.2 15.5 167 429-662 4-183 (224)
36 PTZ00441 sporozoite surface pr 97.9 0.00037 8.1E-09 83.4 18.9 163 428-646 43-217 (576)
37 cd01450 vWFA_subfamily_ECM Von 97.9 0.00022 4.8E-09 71.3 14.5 145 430-635 3-155 (161)
38 cd01477 vWA_F09G8-8_type VWA F 97.9 0.00038 8.3E-09 73.6 15.9 151 429-638 21-188 (193)
39 cd01471 vWA_micronemal_protein 97.9 0.00038 8.2E-09 72.5 15.7 149 430-634 3-160 (186)
40 TIGR02442 Cob-chelat-sub cobal 97.8 0.00018 3.9E-09 89.2 14.6 160 427-642 465-632 (633)
41 cd01469 vWA_integrins_alpha_su 97.8 0.00065 1.4E-08 70.6 16.3 156 430-646 3-172 (177)
42 cd01482 vWA_collagen_alphaI-XI 97.8 0.00083 1.8E-08 68.7 15.9 150 430-643 3-162 (164)
43 TIGR02031 BchD-ChlD magnesium 97.7 0.00044 9.5E-09 84.9 16.0 174 426-647 406-585 (589)
44 COG1240 ChlD Mg-chelatase subu 97.7 0.00043 9.3E-09 75.0 13.7 166 426-647 77-249 (261)
45 PHA03247 large tegument protei 97.7 0.069 1.5E-06 72.3 35.3 14 446-459 3114-3127(3151)
46 smart00327 VWA von Willebrand 97.7 0.0012 2.6E-08 66.9 16.2 153 429-641 3-164 (177)
47 PRK13406 bchD magnesium chelat 97.7 0.00099 2.1E-08 81.5 18.1 167 426-647 400-572 (584)
48 cd00198 vWFA Von Willebrand fa 97.7 0.00096 2.1E-08 65.6 15.1 148 429-635 2-155 (161)
49 PF00092 VWA: von Willebrand f 97.6 0.00086 1.9E-08 68.3 14.0 155 430-646 2-169 (178)
50 cd01481 vWA_collagen_alpha3-VI 97.6 0.0024 5.3E-08 65.8 16.0 151 430-645 3-165 (165)
51 cd01473 vWA_CTRP CTRP for CS 97.5 0.0037 8.1E-08 66.0 16.9 150 430-634 3-161 (192)
52 cd01476 VWA_integrin_invertebr 97.4 0.0057 1.2E-07 62.1 16.3 102 430-566 3-115 (163)
53 cd01464 vWA_subfamily VWA subf 97.3 0.0012 2.6E-08 68.3 10.4 138 430-633 6-159 (176)
54 smart00262 GEL Gelsolin homolo 97.2 0.0018 4E-08 59.6 9.3 71 896-995 16-87 (90)
55 KOG1924 RhoA GTPase effector D 97.1 0.0036 7.9E-08 75.5 11.7 12 827-838 1046-1057(1102)
56 cd01454 vWA_norD_type norD typ 97.0 0.021 4.5E-07 59.0 15.4 147 429-622 2-154 (174)
57 KOG1984 Vesicle coat complex C 96.9 0.1 2.3E-06 64.5 22.2 33 667-699 717-752 (1007)
58 cd01458 vWA_ku Ku70/Ku80 N-ter 96.9 0.023 5E-07 61.0 15.1 154 429-621 3-173 (218)
59 PF04056 Ssl1: Ssl1-like; Int 96.8 0.0066 1.4E-07 64.1 9.8 163 433-662 1-173 (193)
60 KOG1924 RhoA GTPase effector D 96.7 0.011 2.3E-07 71.7 11.6 12 328-339 656-667 (1102)
61 KOG0443 Actin regulatory prote 96.6 0.0047 1E-07 75.4 8.1 91 866-985 616-706 (827)
62 COG4245 TerY Uncharacterized p 96.4 0.066 1.4E-06 55.6 13.5 158 428-661 5-180 (207)
63 KOG2884 26S proteasome regulat 96.3 0.1 2.2E-06 55.2 14.5 154 429-644 5-175 (259)
64 cd01462 VWA_YIEM_type VWA YIEM 96.2 0.13 2.8E-06 51.6 14.4 130 430-621 3-135 (152)
65 TIGR00578 ku70 ATP-dependent D 95.5 0.23 4.9E-06 61.4 15.4 162 429-626 12-190 (584)
66 COG5148 RPN10 26S proteasome r 95.1 0.69 1.5E-05 48.0 14.6 133 428-620 4-146 (243)
67 cd01457 vWA_ORF176_type VWA OR 94.6 0.42 9.2E-06 50.5 12.5 146 429-634 4-165 (199)
68 cd01460 vWA_midasin VWA_Midasi 94.4 0.53 1.2E-05 52.4 13.1 132 426-620 59-204 (266)
69 KOG0443 Actin regulatory prote 94.1 0.19 4E-06 62.1 9.4 79 898-1001 277-358 (827)
70 cd01455 vWA_F11C1-5a_type Von 93.7 3.2 6.9E-05 44.0 16.5 98 514-644 72-174 (191)
71 TIGR00627 tfb4 transcription f 93.3 5.4 0.00012 44.8 18.4 95 536-662 117-221 (279)
72 PF03731 Ku_N: Ku70/Ku80 N-ter 92.7 0.77 1.7E-05 49.4 10.6 154 429-618 1-172 (224)
73 PF03850 Tfb4: Transcription f 92.6 4.9 0.00011 45.2 16.9 184 429-644 3-207 (276)
74 KOG0444 Cytoskeletal regulator 91.2 0.31 6.6E-06 58.8 5.7 66 894-985 637-703 (1255)
75 KOG2807 RNA polymerase II tran 90.8 2.6 5.6E-05 47.4 11.9 165 427-660 60-234 (378)
76 KOG4849 mRNA cleavage factor I 90.2 8.2 0.00018 43.7 15.1 13 448-460 391-403 (498)
77 COG2425 Uncharacterized protei 89.9 2.1 4.5E-05 50.8 11.0 148 427-643 273-424 (437)
78 KOG4849 mRNA cleavage factor I 88.8 9.5 0.00021 43.3 14.3 7 354-360 412-418 (498)
79 PRK10997 yieM hypothetical pro 88.1 2 4.3E-05 51.8 9.3 149 428-644 324-475 (487)
80 PF06707 DUF1194: Protein of u 86.9 29 0.00062 37.4 16.1 119 514-666 75-202 (205)
81 smart00187 INB Integrin beta s 85.2 91 0.002 37.2 26.0 272 427-715 99-389 (423)
82 KOG2353 L-type voltage-depende 84.2 19 0.00041 47.6 15.7 116 408-553 203-322 (1104)
83 PF00362 Integrin_beta: Integr 83.7 94 0.002 37.3 20.2 275 412-715 93-392 (426)
84 KOG3768 DEAD box RNA helicase 83.1 16 0.00035 44.2 13.2 32 428-459 2-38 (888)
85 KOG0444 Cytoskeletal regulator 82.4 3.1 6.8E-05 50.7 7.2 56 867-927 731-788 (1255)
86 KOG2487 RNA polymerase II tran 76.5 46 0.00099 37.1 13.0 55 599-662 185-239 (314)
87 COG4867 Uncharacterized protei 69.4 27 0.00059 40.9 9.8 160 428-643 464-634 (652)
88 PF11265 Med25_VWA: Mediator c 67.2 2E+02 0.0043 31.6 15.4 103 516-641 89-204 (226)
89 PF09967 DUF2201: VWA-like dom 63.4 12 0.00026 37.0 5.0 93 431-566 2-94 (126)
90 COG5242 TFB4 RNA polymerase II 61.1 1.4E+02 0.003 32.5 12.4 177 426-644 19-214 (296)
91 KOG0307 Vesicle coat complex C 58.8 5.7E+02 0.012 34.0 22.2 9 354-362 960-968 (1049)
92 PF10138 vWA-TerF-like: vWA fo 54.3 2.6E+02 0.0057 30.1 13.3 144 430-634 4-155 (200)
93 PF05762 VWA_CoxE: VWA domain 44.3 32 0.00069 37.3 4.9 102 425-564 54-159 (222)
94 KOG2893 Zn finger protein [Gen 40.4 1.3E+02 0.0028 32.8 8.4 10 511-520 323-332 (341)
95 PF02905 EBV-NA1: Epstein Barr 32.5 71 0.0015 31.5 4.5 33 446-478 112-145 (146)
96 KOG1923 Rac1 GTPase effector F 31.7 1.5E+02 0.0033 37.6 8.2 6 477-482 465-470 (830)
97 KOG4672 Uncharacterized conser 31.5 2.7E+02 0.0059 32.9 9.6 6 150-155 381-386 (487)
98 PF10058 DUF2296: Predicted in 25.7 55 0.0012 27.7 2.2 13 370-382 42-54 (54)
99 KOG1985 Vesicle coat complex C 25.1 1.3E+03 0.028 30.1 14.5 24 359-383 206-230 (887)
100 PF12257 DUF3608: Protein of u 23.9 8.3E+02 0.018 27.8 11.7 28 596-623 246-273 (281)
101 COG5415 Predicted integral mem 23.5 34 0.00073 36.6 0.7 33 354-386 188-228 (251)
102 COG1580 FliL Flagellar basal b 22.8 2.5E+02 0.0053 29.2 6.8 65 721-799 76-143 (159)
103 COG1592 Rubrerythrin [Energy p 21.8 47 0.001 34.6 1.4 15 369-383 131-145 (166)
104 KOG4368 Predicted RNA binding 21.1 1.6E+03 0.035 28.0 13.7 151 81-253 291-446 (757)
105 PF13894 zf-C2H2_4: C2H2-type 20.7 47 0.001 21.9 0.8 12 373-384 1-12 (24)
106 COG3285 Predicted eukaryotic-t 20.3 4.2E+02 0.009 30.2 8.3 15 354-368 66-80 (299)
No 1
>KOG1984 consensus Vesicle coat complex COPII, subunit SFB3 [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00 E-value=5.5e-184 Score=1584.25 Aligned_cols=730 Identities=37% Similarity=0.685 Sum_probs=703.8
Q ss_pred CCCCCCCCCCCCCCCCCCCCCC--------------CCCCCCCCcccc----cCCCCceeccccccCCCHHHHhhcCCce
Q 001711 277 SIPGSIEPGIDLKSLPRPLDGD--------------VEPSSLAETYPL----NCHSRYLRLTTSAIPNSQSLVSRWHLPL 338 (1021)
Q Consensus 277 ~~~~~~dp~~~~~~ip~p~~~~--------------~~pp~~~~~~~~----N~~P~y~R~T~~~iP~t~~l~~~~~lPl 338 (1021)
..+.|+|| ++||+|.... ..||++||+|.+ ||||||||||+|+||+|.++++.++|||
T Consensus 236 ~~~~rldp----~~iPs~~qv~~~d~~~~r~~~~~~~~PPl~TTd~~~~DqGN~sPr~mr~T~Y~iP~T~Dl~~as~iPL 311 (1007)
T KOG1984|consen 236 PPPQRLDP----NAIPSPPQVSIEDDSSFRSTDTRAQPPPLVTTDFFIQDQGNCSPRFMRCTMYTIPCTNDLLKASQIPL 311 (1007)
T ss_pred CccccCCh----hhCCCchhcccchhhhhhcCCccCCCCCCcccceEEeccCCCCcchheeecccCCccHhHHHhcCCcc
Confidence 46789999 9999997651 579999999986 9999999999999999999999999999
Q ss_pred EEEEccCCCCCCCCC---------------CccceEEccceeEecCCceEEEcCCCCCCCCCcccccccCcCcccCCCCC
Q 001711 339 GAVVCPLAEPPEGNL---------------FICRTYVNPYVTFTDAGRKWRCNICALLNDVPGDYFAHLDATGRRIDIDQ 403 (1021)
Q Consensus 339 g~vv~Pfa~~~~~e~---------------~rCrAYiNPf~~f~~~g~~W~Cn~C~~~N~vP~~Y~~~l~~~g~R~D~~~ 403 (1021)
|+||+|||.+.+.|. +||||||||||+|+++||+|+||||+.+|++|++||++|+++|||.|+++
T Consensus 312 alvIqPfa~l~p~E~~~~vVd~g~sgPvRC~RCkaYinPFmqF~~~gr~f~Cn~C~~~n~vp~~yf~~L~~~grr~D~~e 391 (1007)
T KOG1984|consen 312 ALVIQPFATLTPNEAPVPVVDLGESGPVRCNRCKAYINPFMQFIDGGRKFICNFCGSKNQVPDDYFNHLGPTGRRVDVEE 391 (1007)
T ss_pred eeEecccccCCcccCCCceecCCCCCCcchhhhhhhcCcceEEecCCceEEecCCCccccCChhhcccCCCccccccccc
Confidence 999999998876553 99999999999999999999999999999999999999999999999999
Q ss_pred CCccccccEEEEccccccCC--CCCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCC-CCCCceEEEEEEcCeEEEE
Q 001711 404 RPELTKGSVEFVAPTEYMVR--PPMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELP-GFPRTQIGFITFDSTIHFY 480 (1021)
Q Consensus 404 rPEL~~gtVEfvap~eY~~r--~p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp-~~~rt~VgiITFds~V~fy 480 (1021)
||||++|+|||+|+++||++ ++++++|||+||||++|+++|++.++|++|+++|+.|+ ++++++|||||||++||||
T Consensus 392 rpEL~~Gt~dfvatk~Y~~~~k~p~ppafvFmIDVSy~Ai~~G~~~a~ce~ik~~l~~lp~~~p~~~Vgivtfd~tvhFf 471 (1007)
T KOG1984|consen 392 RPELCLGTVDFVATKDYCRKTKPPKPPAFVFMIDVSYNAISNGAVKAACEAIKSVLEDLPREEPNIRVGIVTFDKTVHFF 471 (1007)
T ss_pred CchhcccccceeeehhhhhcCCCCCCceEEEEEEeehhhhhcchHHHHHHHHHHHHhhcCccCCceEEEEEEecceeEee
Confidence 99999999999999999998 89999999999999999999999999999999999999 6789999999999999999
Q ss_pred ecCCCCCCcceeeccccccccCCCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-CCEEE
Q 001711 481 NMKSSLTQPQMMVISDLDDIFVPLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-GGKLL 559 (1021)
Q Consensus 481 nl~~~~~~p~mlVvsDldd~f~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-GGkIi 559 (1021)
|+++++++++|+||+|++|+|+|+.+++||+..|++..|+.|||+|+.||.+.+.+++|+|+||+||..+||.+ ||||+
T Consensus 472 nl~s~L~qp~mliVsdv~dvfvPf~~g~~V~~~es~~~i~~lLd~Ip~mf~~sk~pes~~g~alqaa~lalk~~~gGKl~ 551 (1007)
T KOG1984|consen 472 NLSSNLAQPQMLIVSDVDDVFVPFLDGLFVNPNESRKVIELLLDSIPTMFQDSKIPESVFGSALQAAKLALKAADGGKLF 551 (1007)
T ss_pred ccCccccCceEEEeecccccccccccCeeccchHHHHHHHHHHHHhhhhhccCCCCchhHHHHHHHHHHHHhccCCceEE
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998 99999
Q ss_pred EEecCCCCCCcc-cccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccE
Q 001711 560 IFQNSLPSLGVG-CLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQ 638 (1021)
Q Consensus 560 vF~sg~Pt~GpG-~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~ 638 (1021)
||++.+||+|.| +|+.|+| .|+++|+||++|+.+++++|++||++|++.|||||||++...|+|+|+|+.+++.|||+
T Consensus 552 vF~s~Lpt~g~g~kl~~r~D-~~l~~t~kek~l~~pq~~~y~~LA~e~v~~g~svDlF~t~~ayvDvAtlg~v~~~TgG~ 630 (1007)
T KOG1984|consen 552 VFHSVLPTAGAGGKLSNRDD-RRLIGTDKEKNLLQPQDKTYTTLAKEFVESGCSVDLFLTPNAYVDVATLGVVPALTGGQ 630 (1007)
T ss_pred EEecccccccCcccccccch-hhhhcccchhhccCcchhHHHHHHHHHHHhCceEEEEEcccceeeeeeecccccccCce
Confidence 999999999977 8877754 89999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEeCCCCCchhHHHHHHHHHHhcccccccceEEEEEeCCCeEEEeeecCcccCCCCceeeccCCCCCcEEEEEEecccc
Q 001711 639 VYYYPSFQSTTHGERLRHELSRDLTRETAWEAVMRIRCGKGVRFTNYHGNFMLRSTDLLALPAVDCDKAYAMQLSLEETL 718 (1021)
Q Consensus 639 v~~y~~F~~~~d~~kl~~dL~r~ltr~~g~~a~mrVR~S~Gl~V~~~~Gnf~~rs~~~~~l~~id~d~Sia~~~~~d~~l 718 (1021)
+|+|.+|....|+.+|.+||.|++++++||+|+||||||+||++.+|||||+++++++++|+.+|+||+++|+|+|||+|
T Consensus 631 vy~Y~~F~a~~D~~rl~nDL~~~vtk~~gf~a~mrvRtStGirv~~f~Gnf~~~~~tDiela~lD~dkt~~v~fkhDdkL 710 (1007)
T KOG1984|consen 631 VYKYYPFQALTDGPRLLNDLVRNVTKKQGFDAVMRVRTSTGIRVQDFYGNFLMRNPTDIELAALDCDKTLTVEFKHDDKL 710 (1007)
T ss_pred eEEecchhhcccHHHHHHHHHHhcccceeeeeEEEEeecCceeeeeeechhhhcCCCCccccccccCceeEEEEeccccc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCceeEEEEEEEEEecCCcEEEEEEeeeecccCCHHHHHHhcCHhHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHH
Q 001711 719 LTTQTVYFQVALLYTASCGERRIRVHTLAAPVVSNLSDMYQQADTGAIVSVFSRLAIEKTLSHKLEDARNAVQLRLVKAL 798 (1021)
Q Consensus 719 ~~~~~~~iQ~AllYTt~~GeRrIRV~Tl~lpvt~~l~~vf~s~D~eai~~~laK~a~~~~l~~~l~d~R~~l~~~lv~iL 798 (1021)
+++..++||+|||||+.+|+|||||+|+++++|+.++|+|+++|.|+++++|+|.|+..+.++.++++|+.++++|++||
T Consensus 711 q~~s~~~fQ~AlLYTti~G~RR~Rv~Nlsl~~ts~l~~lyr~~~~d~l~a~maK~a~~~i~~~~lk~vre~l~~~~~~iL 790 (1007)
T KOG1984|consen 711 QDGSDVHFQTALLYTTIDGQRRLRVLNLSLAVTSQLSELYRSADTDPLIAIMAKQAAKAILDKPLKEVREQLVSQCAQIL 790 (1007)
T ss_pred cCCcceeEEEEEEEeccCCceeEEEEecchhhhhhHHHHHHhcCccHHHHHHHHHHHHhcccccHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHhhhhhccCCCCccccccccccHHHHHHHhhhccCCCCCCCCCChhHHHHHHHHHcCCCHHHHHhhhcccEEEeecC
Q 001711 799 KEYRNLYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPIRGGYADVTLDERCAAGYTMMALPVKKLLKLLYPCLIRVDEH 878 (1021)
Q Consensus 799 ~~YRk~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~Lr~g~~~~s~DeR~~~~~~l~s~~v~~~~~~lYPrL~~lh~~ 878 (1021)
++||| +|++..+++||||||+||+||+|+++|+||.+|++ .+++.|+|+|++.+++++++++++.++||||+++|++
T Consensus 791 ~~YRk-~cas~~ssgQLILPeslKLlPly~la~lKs~~l~~--~~~~~DdRi~~~~~v~sl~v~~~~~~~YPrl~p~hdl 867 (1007)
T KOG1984|consen 791 ASYRK-NCASPASSGQLILPESLKLLPLYMLALLKSSALRP--QEIRTDDRIYQLQLVTSLSVEQLMPFFYPRLLPFHDL 867 (1007)
T ss_pred HHHHH-hhcCCCCcccEechhhhHHHHHHHHHHHHhhcccc--cccccchhHHHHHHhhcccHHhhhhhhccceeeeecc
Confidence 99999 99999999999999999999999999999999996 7899999999999999999999999999999999999
Q ss_pred CCCCCccCCcccccccccccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhh--ccccccccchHHHH
Q 001711 879 LLKPSAQLDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAEL--SKVMLREQDNEMSR 956 (1021)
Q Consensus 879 ~~~~~~~~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l--~~~~lp~~~n~~s~ 956 (1021)
..+++ ....+|.+|++|+|+|+++||||||||+++|||||+++++.|+|+||+|++.+++ ...+||++||.+|+
T Consensus 868 ~i~dt----l~~~~p~~VraS~e~l~negiYll~nG~~~ylwvg~sv~~~llQ~lf~V~s~~~i~s~~~~Lpe~dn~lS~ 943 (1007)
T KOG1984|consen 868 DIEDT----LEFVLPKAVRASSEFLSNEGIYLLDNGQKIYLWVGESVDPDLLQDLFSVSSFEQIDSQSGVLPELDNPLSR 943 (1007)
T ss_pred ccccc----cccccccceecchhhccCCceEEEecCcEEEEEecCCCCHHHHHHHhcCccccccccccccccccCcHHHH
Confidence 64432 2236799999999999999999999999999999999999999999999999999 34789999999999
Q ss_pred HHHHHHHHHHHhCCCCCceEEEeccCCCcchHHHHHhhccccCCCCCCCHHHHHHHHHHHHhcC
Q 001711 957 KLLGILKKLREQDPSYYQLCQLVRQGEQPREGFLLLANLVEDQIGGSNGYADWIMQIHRQVLQN 1020 (1021)
Q Consensus 957 ~l~~ii~~lr~~r~~~~~l~~vvrqg~~~~~e~~f~~~LVED~~~~~~SY~dFL~~lh~~I~~k 1020 (1021)
++|++|..||+.|..++++ +++|+|++.. +.+|.++||||++++++||+||||.|||+|++|
T Consensus 944 k~r~~i~~i~~~r~~~l~v-~~~k~g~~~~-~~~~~~~lved~~~~~~sY~dyL~~~H~ki~~~ 1005 (1007)
T KOG1984|consen 944 KVRNVISLIRRQRSSELPV-VLVKQGLDGS-EVEFSEYLVEDRGRNISSYVDYLCELHKKIQQK 1005 (1007)
T ss_pred HHHHHHHHHHhcccccccc-EEEecCCCch-hhhhhhhhhcccccCccccchHHHHHHHHHHhh
Confidence 9999999999999999998 9999999883 588999999999999999999999999999986
No 2
>KOG1985 consensus Vesicle coat complex COPII, subunit SEC24/subunit SFB2 [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00 E-value=1.6e-165 Score=1430.85 Aligned_cols=711 Identities=47% Similarity=0.769 Sum_probs=671.9
Q ss_pred CCCcccccCCCCceeccccccCCCHHHHhhcCCceEEEEccCCCCCCCCC------------CccceEEccceeEecCCc
Q 001711 304 LAETYPLNCHSRYLRLTTSAIPNSQSLVSRWHLPLGAVVCPLAEPPEGNL------------FICRTYVNPYVTFTDAGR 371 (1021)
Q Consensus 304 ~~~~~~~N~~P~y~R~T~~~iP~t~~l~~~~~lPlg~vv~Pfa~~~~~e~------------~rCrAYiNPf~~f~~~g~ 371 (1021)
.+.....||+|+|+|+|+++||.++++++|+|||||++|+||+++.+.++ ++||+||||||.|++.|+
T Consensus 160 ~~~~~~~nc~p~y~RsTl~~iP~t~sLl~kskLPlglvv~Pf~~~~d~~~~p~~~~~~IvRCr~CRtYiNPFV~fid~gr 239 (887)
T KOG1985|consen 160 VTPSESSNCSPSYVRSTLSAIPQTQSLLKKSKLPLGLVVHPFAHLDDIDPLPVITSTLIVRCRRCRTYINPFVEFIDQGR 239 (887)
T ss_pred cCCccccCCCHHHHHHHHHhCCccHHHHHhcCCCceEEEeecccccccCCCCcccCCceeeehhhhhhcCCeEEecCCCc
Confidence 33334569999999999999999999999999999999999997653322 999999999999999999
Q ss_pred eEEEcCCCCCCCCCcccccccCcCcccCCCCCCCccccccEEEEccccccCCCCCCCeEEEEEecchhHHhhcHHHHHHH
Q 001711 372 KWRCNICALLNDVPGDYFAHLDATGRRIDIDQRPELTKGSVEFVAPTEYMVRPPMPPLYFFLIDVSISAIRSGMLEVVAQ 451 (1021)
Q Consensus 372 ~W~Cn~C~~~N~vP~~Y~~~l~~~g~R~D~~~rPEL~~gtVEfvap~eY~~r~p~pp~yvFvIDvS~~av~sG~l~~~~~ 451 (1021)
+|+||+|+..|+||.+|+++. -++.+.|.++||||++++|||+||.|||.|+|+|++||||||||.+|+++|+|+++|+
T Consensus 240 ~WrCNlC~~~NdvP~~f~~~~-~t~~~~~~~~RpEl~~s~vE~iAP~eYmlR~P~Pavy~FliDVS~~a~ksG~L~~~~~ 318 (887)
T KOG1985|consen 240 RWRCNLCGRVNDVPDDFDWDP-LTGAYGDPYSRPELTSSVVEFIAPSEYMLRPPQPAVYVFLIDVSISAIKSGYLETVAR 318 (887)
T ss_pred eeeechhhhhcCCcHHhhcCc-cccccCCcccCccccceeEEEecCcccccCCCCCceEEEEEEeehHhhhhhHHHHHHH
Confidence 999999999999999999874 3567889999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCccceehhhhHHHHHHHHhhCCCccc
Q 001711 452 TIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDLLVNLSESRSVVDTLLDSLPSMFQ 531 (1021)
Q Consensus 452 sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~ 531 (1021)
+|+++||.||+++|++|||||||++||||+++.++.+|+|++|+|+||+|+|.+++|||+++|||+.|+.+|+.|+.||.
T Consensus 319 slL~~LD~lpgd~Rt~igfi~fDs~ihfy~~~~~~~qp~mm~vsdl~d~flp~pd~lLv~L~~ck~~i~~lL~~lp~~F~ 398 (887)
T KOG1985|consen 319 SLLENLDALPGDPRTRIGFITFDSTIHFYSVQGDLNQPQMMIVSDLDDPFLPMPDSLLVPLKECKDLIETLLKTLPEMFQ 398 (887)
T ss_pred HHHHhhhcCCCCCcceEEEEEeeceeeEEecCCCcCCCceeeeccccccccCCchhheeeHHHHHHHHHHHHHHHHHHHh
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCcccchHHHHHHHHHHHHhcCCEEEEEecCCCCCCcccccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCc
Q 001711 532 DNMNVESAFGPALKAAFMVMSRLGGKLLIFQNSLPSLGVGCLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQI 611 (1021)
Q Consensus 532 ~~~~~~~alG~AL~aA~~lL~~~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gI 611 (1021)
+++..++|+|+||++|+++|+.+||||++|++++||.|.|+|+.||+ .++.+++++.+++.+++.|||+||.+|++.||
T Consensus 399 ~~~~t~~alGpALkaaf~li~~~GGri~vf~s~lPnlG~G~L~~rEd-p~~~~s~~~~qlL~~~t~FYK~~a~~cs~~qI 477 (887)
T KOG1985|consen 399 DTRSTGSALGPALKAAFNLIGSTGGRISVFQSTLPNLGAGKLKPRED-PNVRSSDEDSQLLSPATDFYKDLALECSKSQI 477 (887)
T ss_pred hccCcccccCHHHHHHHHHHhhcCCeEEEEeccCCCCCccccccccc-cccccchhhhhccCCCchHHHHHHHHhccCce
Confidence 99999999999999999999999999999999999999999999954 78888999999999999999999999999999
Q ss_pred EEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCc--hhHHHHHHHHHHhcccccccceEEEEEeCCCeEEEeeecCc
Q 001711 612 AVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQST--THGERLRHELSRDLTRETAWEAVMRIRCGKGVRFTNYHGNF 689 (1021)
Q Consensus 612 sVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~--~d~~kl~~dL~r~ltr~~g~~a~mrVR~S~Gl~V~~~~Gnf 689 (1021)
|||+|+++.+|+|+|||+.|+++|||.+|||++|+.. .|..||.+||.|.|+|++||||+||||||+||+++.|||||
T Consensus 478 ~VDlFl~s~qY~DlAsLs~LskySgG~~y~YP~f~~s~p~~~~Kf~~el~r~Ltr~~~feaVmRiR~S~gl~~~~f~GnF 557 (887)
T KOG1985|consen 478 CVDLFLFSEQYTDLASLSCLSKYSGGQVYYYPSFDGSNPHDVLKFARELARYLTRKIGFEAVMRIRCSTGLRMSSFFGNF 557 (887)
T ss_pred EEEEEeecccccchhhhhccccccCceeEEccCCCCCCHHHHHHHHHHHHHHhhhhhhhheeEEeeccccccccceeccc
Confidence 9999999999999999999999999999999999987 57889999999999999999999999999999999999999
Q ss_pred ccCCCCceeeccCCCCCcEEEEEEeccccCCCceeEEEEEEEEEecCCcEEEEEEeeeecccCCHHHHHHhcCHhHHHHH
Q 001711 690 MLRSTDLLALPAVDCDKAYAMQLSLEETLLTTQTVYFQVALLYTASCGERRIRVHTLAAPVVSNLSDMYQQADTGAIVSV 769 (1021)
Q Consensus 690 ~~rs~~~~~l~~id~d~Sia~~~~~d~~l~~~~~~~iQ~AllYTt~~GeRrIRV~Tl~lpvt~~l~~vf~s~D~eai~~~ 769 (1021)
+.|++|++.++++++|++++|++++|+.+. ...++||+|+|||...|||||||||+++++++++.|||+++|++||+.+
T Consensus 558 F~RStDLla~~~v~~D~sy~~qisiEesl~-~~~~~fQvAlLyT~~~GERRIRV~T~~lpt~~sl~evY~saD~~AI~~l 636 (887)
T KOG1985|consen 558 FVRSTDLLALPNVNPDQSYAFQISIEESLT-TGFCVFQVALLYTLSKGERRIRVHTLCLPTVSSLNEVYASADQEAIASL 636 (887)
T ss_pred ccCcHHHhcccCCCCCccceEEEEeehhcC-CceeEEEeeeeecccCCceeEEEEEeeccccccHHHHHhhcCHHHHHHH
Confidence 999999999999999999999999999986 4667899999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCccccccccccHHHHHHHhhhccCCCCCCCCCChhHH
Q 001711 770 FSRLAIEKTLSHKLEDARNAVQLRLVKALKEYRNLYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPIRGGYADVTLDER 849 (1021)
Q Consensus 770 laK~a~~~~l~~~l~d~R~~l~~~lv~iL~~YRk~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~Lr~g~~~~s~DeR 849 (1021)
|+|+|+++.++..+.|+|+.|+++++++|.+|||++..++.....|.+|.+|++||+|+++|+||++||.| ..++.|+|
T Consensus 637 la~~Av~ksl~ssL~dardal~~~~~D~l~aYk~~~~~~~~~~~~l~~p~~LrllPllvlALlK~~~fr~g-~~~~lD~R 715 (887)
T KOG1985|consen 637 LAKKAVEKSLSSSLSDARDALTNAVVDILNAYKKLVSNQNGQGITLSLPASLRLLPLLVLALLKHPAFRPG-TGTRLDYR 715 (887)
T ss_pred HHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHhcccccCCcceecCcchhhhHHHHHHHhcCCcccCC-CCCCchHH
Confidence 99999999999999999999999999999999996665556666799999999999999999999999987 69999999
Q ss_pred HHHHHHHcCCCHHHHHhhhcccEEEeecCCCCCCcc-CCcccccccccccchhhccCCcEEEEECCceeEEEecCCCCHH
Q 001711 850 CAAGYTMMALPVKKLLKLLYPCLIRVDEHLLKPSAQ-LDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPD 928 (1021)
Q Consensus 850 ~~~~~~l~s~~v~~~~~~lYPrL~~lh~~~~~~~~~-~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ 928 (1021)
++++++|+++++..++++|||.||++|++..+...- .|+.+.+|++|+|+.+.|+.+|+||||+|..+|||||++++++
T Consensus 716 ~~a~~~~~~lpl~~L~k~IYP~Lysl~~l~~ea~~~~~d~~~~~p~~L~ltae~l~~~GlyL~D~g~~lfl~vg~~a~P~ 795 (887)
T KOG1985|consen 716 AYAMCLMSTLPLKYLMKYIYPTLYSLHDLDDEAGLPIHDQTVVLPPPLNLTAELLSRRGLYLMDTGTTLFLWVGSNADPS 795 (887)
T ss_pred HHHHHHhhcCCHHHHHhhhcccceeccccccccCcccccccccCCCccchHHHHhccCceEEEecCcEEEEEEcCCCCcc
Confidence 999999999999999999999999999984211111 3566788999999999999999999999999999999999999
Q ss_pred HHHhhcCCchhhhh--ccccccccchHHHHHHHHHHHHHHHhCCCCCceEEEeccCCCcchHHHHHhhccccCCCCCCCH
Q 001711 929 IAMNLLGSEFAAEL--SKVMLREQDNEMSRKLLGILKKLREQDPSYYQLCQLVRQGEQPREGFLLLANLVEDQIGGSNGY 1006 (1021)
Q Consensus 929 ll~~lFgv~~~~~l--~~~~lp~~~n~~s~~l~~ii~~lr~~r~~~~~l~~vvrqg~~~~~e~~f~~~LVED~~~~~~SY 1006 (1021)
++.++||++.+.++ ++.+|++.+|+.+++++++|++||..|..+..+ +|||+++.+.+..||++.||||++.+..||
T Consensus 796 ll~~vfg~~~~adi~~~~~~lp~~~n~~s~r~~~fI~~lR~d~~~~p~~-~ivr~~~~s~~k~~f~~~lvEDrs~~~~SY 874 (887)
T KOG1985|consen 796 LLFDVFGVSTLADIPIGKYTLPELDNEESDRVRRFIKKLRDDRTYFPNL-YIVRGDDNSPLKAWFFSRLVEDRSENSPSY 874 (887)
T ss_pred ccccccCcchHhhcccccccCcccccchhHHHHHHHHHhhcCCcccceE-EEEecCCCchHHHHHHHHHHhhhhcCcHHH
Confidence 99999999999999 678999999999999999999999777666665 999998777778999999999999999999
Q ss_pred HHHHHHHHHHHhc
Q 001711 1007 ADWIMQIHRQVLQ 1019 (1021)
Q Consensus 1007 ~dFL~~lh~~I~~ 1019 (1021)
+|||.+||++|++
T Consensus 875 ~efLq~lk~qv~~ 887 (887)
T KOG1985|consen 875 YEFLQHLKAQVSK 887 (887)
T ss_pred HHHHHHHHHHhcC
Confidence 9999999999974
No 3
>PTZ00395 Sec24-related protein; Provisional
Probab=100.00 E-value=7e-152 Score=1365.61 Aligned_cols=721 Identities=24% Similarity=0.420 Sum_probs=649.1
Q ss_pred CCCCCCCCCCCCCCCCCCCCC-----------------CCCCCCCCcccc----cCCCCceeccccccCCCHHHHhhcCC
Q 001711 278 IPGSIEPGIDLKSLPRPLDGD-----------------VEPSSLAETYPL----NCHSRYLRLTTSAIPNSQSLVSRWHL 336 (1021)
Q Consensus 278 ~~~~~dp~~~~~~ip~p~~~~-----------------~~pp~~~~~~~~----N~~P~y~R~T~~~iP~t~~l~~~~~l 336 (1021)
+.+|||+ ++||||+... ..||+.+++|++ ||+|+|||+|||+||.+.++++.++|
T Consensus 599 ~~~ri~~----~~ip~p~~~~~~~~~~~~~~~~~t~k~~~pp~~~~~~~~~dtgn~dP~~~r~tmY~iP~~~~~~~~~~i 674 (1560)
T PTZ00395 599 TINRIDM----NKIPRPIINTQEKKKKKNLKVFETCKYISPPSYYQPYISIDTGKADPRFLKSTLYQIPLFSETLKLSQI 674 (1560)
T ss_pred cccccCc----ccCCCcccccccccccccchhhhhccCCCCCCCCCceEEeecCCCChhhhhhhhhcCcchHHHHHhcCC
Confidence 5689999 9999998543 468999999996 99999999999999999999999999
Q ss_pred ceEEEEccCCCCCCCCC----------------------CccceEEccceeEecCCceEEEcCCCCCCCCCcc----ccc
Q 001711 337 PLGAVVCPLAEPPEGNL----------------------FICRTYVNPYVTFTDAGRKWRCNICALLNDVPGD----YFA 390 (1021)
Q Consensus 337 Plg~vv~Pfa~~~~~e~----------------------~rCrAYiNPf~~f~~~g~~W~Cn~C~~~N~vP~~----Y~~ 390 (1021)
|||+||+|||.+.+.|. .+|++|+|+++.|+.. ++++||||+..+.+... +++
T Consensus 675 P~gi~v~Pfa~~~~~e~~~~~~~~~~~~d~~~~~~~~rc~~c~~y~~~~~~~~~~-~~~~c~~c~~~~~i~e~~~~~~~~ 753 (1560)
T PTZ00395 675 PFGIIVNPFACLNEGEGIDKIDMKDIINDKEENIEILRCPKCLGYLHATILEDIS-SSVQCVFCDTDFLINENVLFDIFQ 753 (1560)
T ss_pred CceeecchhhhcCCCCCCcccchhhcccchhhccceeecchhHhhhcchheeccc-ceEEEEecCCcchhhHHHHHHHHH
Confidence 99999999999765432 7999999999999976 99999999999988542 221
Q ss_pred -ccCcCcccCCCCCC----CccccccEEEEcccccc--------------------------------------------
Q 001711 391 -HLDATGRRIDIDQR----PELTKGSVEFVAPTEYM-------------------------------------------- 421 (1021)
Q Consensus 391 -~l~~~g~R~D~~~r----PEL~~gtVEfvap~eY~-------------------------------------------- 421 (1021)
+..-.-+..|.+++ --|.+|+||+++|.-|.
T Consensus 754 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 833 (1560)
T PTZ00395 754 YNEKIGHKESDHNEHGNSLSPLLKGSVDIIIPPIYYHNVNKFKLTYTYLNKNINQTAFMITNKIMSFTKHISNSLVANDS 833 (1560)
T ss_pred HhhhhccccccccccccccchhhcCceeEEccchhhccCCccceeeehhhcchhhhhhhhhhhhhhhhhhhcchheeccc
Confidence 11101111222222 14679999999886542
Q ss_pred --------------------------------------------------------------------------------
Q 001711 422 -------------------------------------------------------------------------------- 421 (1021)
Q Consensus 422 -------------------------------------------------------------------------------- 421 (1021)
T Consensus 834 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 913 (1560)
T PTZ00395 834 KGGNKATSASAFGDSGDANFLAGGGYTNYGGAGGYNTYDNQSGYNNHDVVNNRGGSGAGNHLYGKDHDVQNFDNVMDNAN 913 (1560)
T ss_pred ccccccchhhhcccccccccccccccccccccccccccccccccccccccccccccCcCcccccCcccccchhhhccCCc
Confidence
Q ss_pred ---------------------------------CCCCCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceE
Q 001711 422 ---------------------------------VRPPMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQI 468 (1021)
Q Consensus 422 ---------------------------------~r~p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~V 468 (1021)
++.++||+||||||||+.||++|+++++|++|+++|+.|+ ++|+||
T Consensus 914 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~PP~YvFLIDVS~~AVkSGLl~tacesIK~sLDsL~-dpRTRV 992 (1560)
T PTZ00395 914 FTIHDMKNLICEKNGEPDSAKIRRNSFLAKYPQVKNMLPPYFVFVVECSYNAIYNNITYTILEGIRYAVQNVK-CPQTKI 992 (1560)
T ss_pred eeeecchhhhhcccCCchhhhhhccchhhccccccCCCCCEEEEEEECCHHHHhhChHHHHHHHHHHHHhcCC-CCCcEE
Confidence 0236889999999999999999999999999999999997 578999
Q ss_pred EEEEEcCeEEEEecCCC-------------CCCcceeeccccccccCCCC-CccceehhhhHHHHHHHHhhCCCcccCCC
Q 001711 469 GFITFDSTIHFYNMKSS-------------LTQPQMMVISDLDDIFVPLP-DDLLVNLSESRSVVDTLLDSLPSMFQDNM 534 (1021)
Q Consensus 469 giITFds~V~fynl~~~-------------~~~p~mlVvsDldd~f~Pl~-~~lLv~l~es~~~I~~lLd~Lp~~f~~~~ 534 (1021)
||||||++||||+|+.+ +++|||+||+||||+|+|++ ++|||++.|+|+.|+.|||.|+.||....
T Consensus 993 GIITFDSsLHFYNLks~l~~~~~~~~~~~~l~qPQMLVVSDLDDPFLPlP~ddLLVnL~ESRevIe~LLDkLPemFt~t~ 1072 (1560)
T PTZ00395 993 AIITFNSSIYFYHCKGGKGVSGEEGDGGGGSGNHQVIVMSDVDDPFLPLPLEDLFFGCVEEIDKINTLIDTIKSVSTTMQ 1072 (1560)
T ss_pred EEEEecCcEEEEecCcccccccccccccccCCCceEEeecCCccCcCCCCccCeeechHHHHHHHHHHHHHHHHHhhccC
Confidence 99999999999999875 47899999999999999998 89999999999999999999999999999
Q ss_pred CcccchHHHHHHHHHHHHhcC--CEEEEEecCCCCCCcccccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcE
Q 001711 535 NVESAFGPALKAAFMVMSRLG--GKLLIFQNSLPSLGVGCLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIA 612 (1021)
Q Consensus 535 ~~~~alG~AL~aA~~lL~~~G--GkIivF~sg~Pt~GpG~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIs 612 (1021)
..++|+|+||++|+++|+.+| |||++|++++|++|+|+|+.|++ +.+|+.++.++++|||+||.+|++++||
T Consensus 1073 ~~esCLGSALqAA~~aLk~~GGGGKIiVF~SSLPniGpGaLK~Re~------~~KEk~Ll~pqd~FYK~LA~ECsk~qIS 1146 (1560)
T PTZ00395 1073 SYGSCGNSALKIAMDMLKERNGLGSICMFYTTTPNCGIGAIKELKK------DLQENFLEVKQKIFYDSLLLDLYAFNIS 1146 (1560)
T ss_pred CCcccHHHHHHHHHHHHHhcCCCceEEEEEcCCCCCCCCccccccc------ccccccccccchHHHHHHHHHHHhcCCc
Confidence 999999999999999999986 99999999999999999997753 3477788999999999999999999999
Q ss_pred EEEEEecCCCcC--hhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHhccc-ccccceEEEEEeCCCeEEEeee--c
Q 001711 613 VNVYAFSDKYTD--IASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRDLTR-ETAWEAVMRIRCGKGVRFTNYH--G 687 (1021)
Q Consensus 613 VDlF~~s~~~~d--iatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~ltr-~~g~~a~mrVR~S~Gl~V~~~~--G 687 (1021)
||||+++..|+| |++|+.|+++|||+||||+.|+..+|..+|++||.+.|++ ++||+|+||||||+||+|++|| |
T Consensus 1147 VDLFLfSsqYvDVDVATLg~Lsr~TGGqlyyYPnFna~rD~~KL~~DL~r~LTre~iGyEAVMRVRCS~GLrVs~fyG~G 1226 (1560)
T PTZ00395 1147 VDIFIISSNNVRVCVPSLQYVAQNTGGKILFVENFLWQKDYKEIYMNIMDTLTSEDIAYCCELKLRYSHHMSVKKLFCCN 1226 (1560)
T ss_pred eEEEEccCcccccccccccchhcccceeEEEeCCCcccccHHHHHHHHHHHhhccceeeEEEEEEECCCCeEEEEEeccC
Confidence 999999999986 7999999999999999999999999999999999999998 6999999999999999999999 5
Q ss_pred Ccc--cCCCCceeeccCCCCCcEEEEEEeccccCCCceeEEEEEEEEEecCCcEEEEEEeeeecccCCHHHHHHhcCHhH
Q 001711 688 NFM--LRSTDLLALPAVDCDKAYAMQLSLEETLLTTQTVYFQVALLYTASCGERRIRVHTLAAPVVSNLSDMYQQADTGA 765 (1021)
Q Consensus 688 nf~--~rs~~~~~l~~id~d~Sia~~~~~d~~l~~~~~~~iQ~AllYTt~~GeRrIRV~Tl~lpvt~~l~~vf~s~D~ea 765 (1021)
+++ .++++++.|+.+++|++|+|+|+||++|.+...+|||+|||||+.+|||||||||++|+||+++.+||+++|++|
T Consensus 1227 nnF~s~rStDLLaLP~Id~DqSfaVeLk~DEkL~~~~~AYFQaALLYTSssGERRIRVHTLALPVTSsLseVFrsADqdA 1306 (1560)
T PTZ00395 1227 NNFNSIISVDTIKIPKIRHDQTFAFLLNYSDISESKKQIYFQCACIYTNLWGDRFVRLHTTHMNLTSSLSTVFRYTDAEA 1306 (1560)
T ss_pred CccccccccccccccccCCCceEEEEEEeccccCCCCcEEEEEEEeeccCCCcEEEEEEeeeecccCCHHHHHHhhcHHH
Confidence 555 468899999999999999999999999987889999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCccccccccccHHHHHHHhhhccCCCCCCCCCC
Q 001711 766 IVSVFSRLAIEKTLSHKLEDARNAVQLRLVKALKEYRNLYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPIRGGYADVT 845 (1021)
Q Consensus 766 i~~~laK~a~~~~l~~~l~d~R~~l~~~lv~iL~~YRk~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~Lr~g~~~~s 845 (1021)
++++|+|+|+++++++ .++|+.|.++|+++|++||| +|+...+.+||||||+||+||+|+++|+||.+|+ .+++
T Consensus 1307 IvslLAK~AV~~aLss--sdARe~L~dklVdILtaYRK-~CAsssssgQLILPESLKLLPLYILSLLKS~AfR---t~I~ 1380 (1560)
T PTZ00395 1307 LMNILIKQLCTNILHN--DNYSKIIIDNLAAILFSYRI-NCASSAHSGQLILPDTLKLLPLFTSSLLKHNVTK---KEIL 1380 (1560)
T ss_pred HHHHHHHHHHHHhccc--HHHHHHHHHHHHHHHHHHHH-HhhccCCCccccchhHHHHHHHHHHHHhcccccc---CCCC
Confidence 9999999999999987 49999999999999999999 9998888999999999999999999999999998 5789
Q ss_pred hhHHHHHHHHHcCCCHHHHHhhhcccEEEeecCCCCCC---ccCCcccccccccccchhhccCCcEEEEECCceeEEEec
Q 001711 846 LDERCAAGYTMMALPVKKLLKLLYPCLIRVDEHLLKPS---AQLDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFG 922 (1021)
Q Consensus 846 ~DeR~~~~~~l~s~~v~~~~~~lYPrL~~lh~~~~~~~---~~~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG 922 (1021)
.|+|++++++|+++++..++.+||||||+||++..+.. ...++.+.+|..|+||.++|+++||||||+|+.||||||
T Consensus 1381 sDeRVyaL~rL~SmPI~~Li~yLYPRLYpLHdL~~e~e~d~~d~d~~ivLPp~LrLS~ErLesdGIYLLDNGe~IyLWVG 1460 (1560)
T PTZ00395 1381 HDLKVYSLIKLLSMPIISSLLYVYPVMYVIHIKGKTNEIDSMDVDDDLFIPKTIPSSAEKIYSNGIYLLDACTHFYLYFG 1460 (1560)
T ss_pred ccHHHHHHHHHhCCCHHHHHhhhcCceEEcccccccccCCccCCCCccccCCcccchHHHhcCCcEEEEECCCEEEEEEC
Confidence 99999999999999999999999999999999721111 112345678999999999999999999999999999999
Q ss_pred CCCCHHHHHhhcCCchhhhhccccccccchHHHHHHHHHHHHHHHhC--CCCCceEEEeccCCCcchHHHHHhhccccCC
Q 001711 923 RMLSPDIAMNLLGSEFAAELSKVMLREQDNEMSRKLLGILKKLREQD--PSYYQLCQLVRQGEQPREGFLLLANLVEDQI 1000 (1021)
Q Consensus 923 ~~v~~~ll~~lFgv~~~~~l~~~~lp~~~n~~s~~l~~ii~~lr~~r--~~~~~l~~vvrqg~~~~~e~~f~~~LVED~~ 1000 (1021)
++++++|++||||+.... ....+||++++++++||++||+.||++| ..|+++ +|||++++. |.||+++|||||+
T Consensus 1461 ~~V~PqLLqDLFGv~~~~-~~~~eLPelDT~iS~RVrnII~~LR~~r~~~~Y~pL-~IVRqgDp~--E~~F~s~LVEDRs 1536 (1560)
T PTZ00395 1461 FHSDANFAKEIVGDIPTE-KNAHELNLTDTPNAQKVQRIIKNLSRIHHFNKYVPL-VMVAPKSNE--EEHLISLCVEDKA 1536 (1560)
T ss_pred CCCCHHHHHHHcCCCccc-cccccccCCCCHHHHHHHHHHHHHHHhccCCCcceE-EEEeCCCch--HHHHHHhCeecCC
Confidence 999999999999974222 2234689999999999999999999986 488998 999999877 8999999999999
Q ss_pred CCCCCHHHHHHHHHHHHhcC
Q 001711 1001 GGSNGYADWIMQIHRQVLQN 1020 (1021)
Q Consensus 1001 ~~~~SY~dFL~~lh~~I~~k 1020 (1021)
.+++||+||||+|||+|++|
T Consensus 1537 ~g~~SYvDFLc~LHKqIq~k 1556 (1560)
T PTZ00395 1537 DKEYSYVNFLCFIHKLVHKR 1556 (1560)
T ss_pred CCCCCHHHHHHHHHHHHHHh
Confidence 99999999999999999987
No 4
>COG5028 Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion]
Probab=100.00 E-value=1.3e-150 Score=1296.13 Aligned_cols=706 Identities=37% Similarity=0.689 Sum_probs=669.9
Q ss_pred CCCCCCCCcccc----cCCCCceeccccccCCCHHHHhhcCCceEEEEccCCCCCCCCC-------------CccceEEc
Q 001711 299 VEPSSLAETYPL----NCHSRYLRLTTSAIPNSQSLVSRWHLPLGAVVCPLAEPPEGNL-------------FICRTYVN 361 (1021)
Q Consensus 299 ~~pp~~~~~~~~----N~~P~y~R~T~~~iP~t~~l~~~~~lPlg~vv~Pfa~~~~~e~-------------~rCrAYiN 361 (1021)
..||. ++.++. ||+|+|+|+|+|+||.+.+++++++||||+||+||.++.+.+. +|||+|||
T Consensus 132 ~~ppl-tt~~~~~e~~n~~p~yvrsT~yaiP~t~dl~~~skiPfgLVI~Pf~~l~~e~~~vpl~~d~~ivRCrrCrsYiN 210 (861)
T COG5028 132 IVPPL-TTNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKIPFGLVIRPFLELYPEEDPVPLVEDGSIVRCRRCRSYIN 210 (861)
T ss_pred CCCCc-ccceeeeccCCCCHHHHHHHHhhCCCchhHHHhcCCCceEEeehhhhcCccCCCCccCCCCcchhhhhhHhhcC
Confidence 34555 777764 9999999999999999999999999999999999999876432 99999999
Q ss_pred cceeEecCCceEEEcCCCCCCCCCcccccccCcCcccCCCCCCCccccccEEEEccccccCCCCCCCeEEEEEecchhHH
Q 001711 362 PYVTFTDAGRKWRCNICALLNDVPGDYFAHLDATGRRIDIDQRPELTKGSVEFVAPTEYMVRPPMPPLYFFLIDVSISAI 441 (1021)
Q Consensus 362 Pf~~f~~~g~~W~Cn~C~~~N~vP~~Y~~~l~~~g~R~D~~~rPEL~~gtVEfvap~eY~~r~p~pp~yvFvIDvS~~av 441 (1021)
||++|+++|++|+||+|+..|++|.++++...+++.|.|+++|+||.+|+|||+||++|+.|.+.|++|||+||||.+++
T Consensus 211 Pfv~fi~~g~kw~CNiC~~kN~vp~~~~~~~~~~~~r~d~~~r~El~~~vvdf~ap~~Y~~~~p~P~~yvFlIDVS~~a~ 290 (861)
T COG5028 211 PFVQFIEQGRKWRCNICRSKNDVPEGFDNPSGPNDPRSDRYSRPELKSGVVDFLAPKEYSLRQPPPPVYVFLIDVSFEAI 290 (861)
T ss_pred ceEEEecCCcEEEEeeccccccCcccccCcCCCCCccccccccchhhceeeEEecccceeeccCCCCEEEEEEEeehHhh
Confidence 99999999999999999999999999999899999999999999999999999999999999999999999999999999
Q ss_pred hhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCC-CccceehhhhHHHH
Q 001711 442 RSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLP-DDLLVNLSESRSVV 519 (1021)
Q Consensus 442 ~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~-~~lLv~l~es~~~I 519 (1021)
++|++.+++++|++.|+.+++ ++|+||+||.||++|||++++.+++ .+|++|+|+||+|+|.+ .+|++++.+++..+
T Consensus 291 ~~g~~~a~~r~Il~~l~~~~~~dpr~kIaii~fD~sl~ffk~s~d~~-~~~~~vsdld~pFlPf~s~~fv~pl~~~k~~~ 369 (861)
T COG5028 291 KNGLVKAAIRAILENLDQIPNFDPRTKIAIICFDSSLHFFKLSPDLD-EQMLIVSDLDEPFLPFPSGLFVLPLKSCKQII 369 (861)
T ss_pred hcchHHHHHHHHHhhccCCCCCCCcceEEEEEEcceeeEEecCCCCc-cceeeecccccccccCCcchhcccHHHHHHHH
Confidence 999999999999999999975 7899999999999999999998874 38999999999999998 67899999999999
Q ss_pred HHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcCCEEEEEecCCCCCCcccccccCCcCcccCCCccccCCCCCcHHH
Q 001711 520 DTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLGGKLLIFQNSLPSLGVGCLKLRGDDLRVYGTDKEHSLRIPEDPFY 599 (1021)
Q Consensus 520 ~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~gt~~e~~l~~pa~~fY 599 (1021)
+.||+.++.+|.+++.++.|+|+||++|..+++.+||||++|.+++||.|.|+|..|+| +|+.++.+.+.||
T Consensus 370 etLl~~~~~If~d~~~pk~~~G~aLk~a~~l~g~~GGkii~~~stlPn~G~Gkl~~r~d--------~e~~ll~c~d~fY 441 (861)
T COG5028 370 ETLLDRVPRIFQDNKSPKNALGPALKAAKSLIGGTGGKIIVFLSTLPNMGIGKLQLRED--------KESSLLSCKDSFY 441 (861)
T ss_pred HHHHHHhhhhhcccCCCccccCHHHHHHHHHhhccCceEEEEeecCCCccccccccccc--------chhhhccccchHH
Confidence 99999999999999999999999999999999999999999999999999999999865 6777999999999
Q ss_pred HHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCch--hHHHHHHHHHHhcccccccceEEEEEeC
Q 001711 600 KQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTT--HGERLRHELSRDLTRETAWEAVMRIRCG 677 (1021)
Q Consensus 600 ~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~--d~~kl~~dL~r~ltr~~g~~a~mrVR~S 677 (1021)
|++|.+|++.||+||+|+++.+|+|+||++.|+++|||++|||++|+..+ |..||.+||.+++++++||+++||||||
T Consensus 442 k~~a~e~~k~gIsvd~Flt~~~yidvaTls~l~~~T~G~~~~Yp~f~~~~~~d~~kl~~dL~~~ls~~~gy~~~~rvR~S 521 (861)
T COG5028 442 KEFAIECSKVGISVDLFLTSEDYIDVATLSHLCRYTGGQTYFYPNFSATRPNDATKLANDLVSHLSMEIGYEAVMRVRCS 521 (861)
T ss_pred HHHHHHHHHhcceEEEEeccccccchhhhcchhhccCcceEEcCCcccCCchhHHHHHHHHHHhhhhhhhhheeeEeecc
Confidence 99999999999999999999999999999999999999999999999998 9999999999999999999999999999
Q ss_pred CCeEEEeeecCcccCCCCceeeccCCCCCcEEEEEEeccccCCCceeEEEEEEEEEecCCcEEEEEEeeeecccCCHHHH
Q 001711 678 KGVRFTNYHGNFMLRSTDLLALPAVDCDKAYAMQLSLEETLLTTQTVYFQVALLYTASCGERRIRVHTLAAPVVSNLSDM 757 (1021)
Q Consensus 678 ~Gl~V~~~~Gnf~~rs~~~~~l~~id~d~Sia~~~~~d~~l~~~~~~~iQ~AllYTt~~GeRrIRV~Tl~lpvt~~l~~v 757 (1021)
+|+++++|||||+.|+.++++|+.++.|+|+.|+|++|+++.. ..+|||+|+|||+.+|||||||.|+++++++++.|+
T Consensus 522 ~glr~s~fyGnf~~rs~dl~~F~tm~rd~Sl~~~~sid~~l~~-~~v~fQvAlL~T~~~GeRRiRVvn~s~~~ss~~~ev 600 (861)
T COG5028 522 TGLRVSSFYGNFFNRSSDLCAFSTMPRDTSLLVEFSIDEKLMT-SDVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREV 600 (861)
T ss_pred CceehhhhhccccccCcccccccccCCCceEEEEEEecccccC-CceEEEEEEEeeccCCceEEEEEEeccccchhHHHH
Confidence 9999999999999999999999999999999999999999976 899999999999999999999999999999999999
Q ss_pred HHhcCHhHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCccccccccccHHHHHHHhhhccCC
Q 001711 758 YQQADTGAIVSVFSRLAIEKTLSHKLEDARNAVQLRLVKALKEYRNLYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPI 837 (1021)
Q Consensus 758 f~s~D~eai~~~laK~a~~~~l~~~l~d~R~~l~~~lv~iL~~YRk~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~L 837 (1021)
|+++|+++|+.+|+|+|+.++....++++|+.|.+++++||++||| .|+....++||+||++||+||+++++|+||.+|
T Consensus 601 yasadq~aIa~~lak~a~~~~~~~s~~~~r~~i~~s~~~IL~~Ykk-~~~~snt~tql~Lp~nL~lLPll~lal~Ks~~~ 679 (861)
T COG5028 601 YASADQLAIACILAKKASTKALNSSLKEARVLINKSMVDILKAYKK-ELVKSNTSTQLPLPANLKLLPLLMLALLKSSAF 679 (861)
T ss_pred HHhccHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHH-HHhhccCCccccchhhhHHHHHHHHHHhhhccc
Confidence 9999999999999999999999999999999999999999999999 888888899999999999999999999999999
Q ss_pred CCCCCCCChhHHHHHHHHHcCCCHHHHHhhhcccEEEeecCCCCCCccCCcccccccccccchhhccCCcEEEEECCcee
Q 001711 838 RGGYADVTLDERCAAGYTMMALPVKKLLKLLYPCLIRVDEHLLKPSAQLDEYKNIMKRLPLVAESLDSRGLYIFDDGFRF 917 (1021)
Q Consensus 838 r~g~~~~s~DeR~~~~~~l~s~~v~~~~~~lYPrL~~lh~~~~~~~~~~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i 917 (1021)
|. ..++.|.|+++++++.+++++++++.|||+||++|++..+....+++...++.+|++|.+.|+++|+||||+|.++
T Consensus 680 rs--~~~~sD~r~~~L~~l~~~p~~~l~~~iYP~lyalHdm~~e~~l~~~~~~~~~~piNaT~s~le~~GlYLidtg~~i 757 (861)
T COG5028 680 RS--GSTPSDIRISALNRLTSLPLKQLMRNIYPTLYALHDMPIEAGLPDEGLLVLPSPINATSSLLESGGLYLIDTGQKI 757 (861)
T ss_pred cc--CCCccchhHHHHHHhhcCCHHHHHHhhccceeeecccccccCCCcccccccccchhhhHHHHhcCCeEEEEcCCEE
Confidence 95 6789999999999999999999999999999999999643322123456789999999999999999999999999
Q ss_pred EEEecCCCCHHHHHhhcCCchhhhh--ccccccccchHHHHHHHHHHHHHHH-hCCCCCceEEEeccCCCcchHHHHHhh
Q 001711 918 VLWFGRMLSPDIAMNLLGSEFAAEL--SKVMLREQDNEMSRKLLGILKKLRE-QDPSYYQLCQLVRQGEQPREGFLLLAN 994 (1021)
Q Consensus 918 ~lwvG~~v~~~ll~~lFgv~~~~~l--~~~~lp~~~n~~s~~l~~ii~~lr~-~r~~~~~l~~vvrqg~~~~~e~~f~~~ 994 (1021)
|||+|+++++.|++|+||++++.+| .+.++|+.+|++++++++||++||+ .+...+++ ++||+|.++..+.||.++
T Consensus 758 flw~g~d~~p~Ll~dlf~~~~~~~I~~~k~~~p~~~n~~n~~v~~iI~~lrs~~~~~tl~l-vlVR~~~d~s~~~~~~s~ 836 (861)
T COG5028 758 FLWFGKDAVPSLLQDLFGVDSLSDIPSGKFTLPPTGNEFNERVRNIIGELRSVNDDSTLPL-VLVRGGGDPSLRLWFFST 836 (861)
T ss_pred EEEecCCCCHHHHHHhcCcchhhhccccccccCCcCCHHHHHHHHHHHHHHhhCCCCccce-EEEecCCCcchhhheehh
Confidence 9999999999999999999999999 7889999999999999999999999 56777887 999998777668999999
Q ss_pred ccccCCCCCCCHHHHHHHHHHHHhc
Q 001711 995 LVEDQIGGSNGYADWIMQIHRQVLQ 1019 (1021)
Q Consensus 995 LVED~~~~~~SY~dFL~~lh~~I~~ 1019 (1021)
||||++.+..||.|||+.||++|+.
T Consensus 837 lVEDk~~n~~SY~~yL~~lh~ki~~ 861 (861)
T COG5028 837 LVEDKTLNIPSYLDYLQILHEKIKS 861 (861)
T ss_pred eecccccCCccHHHHHHHHHHHhcC
Confidence 9999999999999999999999974
No 5
>PLN00162 transport protein sec23; Provisional
Probab=100.00 E-value=1.6e-120 Score=1115.31 Aligned_cols=656 Identities=20% Similarity=0.283 Sum_probs=584.1
Q ss_pred CCCCceeccccccCCCHHHHhhcCCceEEEEccCCCCCCC-----CC---CccceEEccceeEecCCceEEEcCCCCCCC
Q 001711 312 CHSRYLRLTTSAIPNSQSLVSRWHLPLGAVVCPLAEPPEG-----NL---FICRTYVNPYVTFTDAGRKWRCNICALLND 383 (1021)
Q Consensus 312 ~~P~y~R~T~~~iP~t~~l~~~~~lPlg~vv~Pfa~~~~~-----e~---~rCrAYiNPf~~f~~~g~~W~Cn~C~~~N~ 383 (1021)
-+-++||+|||+||+|+.++++++|||||+|+||++..+. +. ++|||||||||+|+++|++|+||||+..|+
T Consensus 7 e~~~gvR~s~n~~P~t~~~~~~~~iPlg~v~tPl~~~~~vp~v~~~pvRC~~CraylNPf~~~d~~~~~W~C~~C~~~N~ 86 (761)
T PLN00162 7 EAIDGVRMSWNVWPSSKIEASKCVIPLAALYTPLKPLPELPVLPYDPLRCRTCRAVLNPYCRVDFQAKIWICPFCFQRNH 86 (761)
T ss_pred cccCceEeeeecCCCCHHHHhcCCCCeEEEEecCCcCCCCCcCCCCCCccCCCcCEECCceEEecCCCEEEccCCCCCCC
Confidence 3457999999999999999999999999999999875432 11 899999999999999999999999999999
Q ss_pred CCcccccccCcCcccCCCCCCCcc--ccccEEEEccccccCCCCCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCC
Q 001711 384 VPGDYFAHLDATGRRIDIDQRPEL--TKGSVEFVAPTEYMVRPPMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELP 461 (1021)
Q Consensus 384 vP~~Y~~~l~~~g~R~D~~~rPEL--~~gtVEfvap~eY~~r~p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp 461 (1021)
+|.+|+ +++++ +.+||| .++||||++|+ |+.+++.||+|+||||+|..+++ ++.++++|+.+|+.||
T Consensus 87 ~P~~Y~-~~~~~------~~p~EL~p~~~TvEY~~p~-~~~~~~~pp~fvFvID~s~~~~~---l~~lk~sl~~~L~~LP 155 (761)
T PLN00162 87 FPPHYS-SISET------NLPAELFPQYTTVEYTLPP-GSGGAPSPPVFVFVVDTCMIEEE---LGALKSALLQAIALLP 155 (761)
T ss_pred CchHhc-ccCcc------CCChhhcCCceeEEEECCC-CCCCCCCCcEEEEEEecchhHHH---HHHHHHHHHHHHHhCC
Confidence 999997 44433 478999 89999999998 99999999999999999999987 6667899999999999
Q ss_pred CCCCceEEEEEEcCeEEEEecCCCCCCcceeecc--------cccc----------------------ccCCCCCcccee
Q 001711 462 GFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVIS--------DLDD----------------------IFVPLPDDLLVN 511 (1021)
Q Consensus 462 ~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvs--------Dldd----------------------~f~Pl~~~lLv~ 511 (1021)
++ ++|||||||++||||+|+.+. .++++|+. |++| .|+|..++||++
T Consensus 156 ~~--a~VGlITF~s~V~~~~L~~~~-~~~~~Vf~g~k~~t~~~l~~~l~l~~~~~~~~~~~~~~~~~~~~~p~~~~fLvp 232 (761)
T PLN00162 156 EN--ALVGLITFGTHVHVHELGFSE-CSKSYVFRGNKEVSKDQILEQLGLGGKKRRPAGGGIAGARDGLSSSGVNRFLLP 232 (761)
T ss_pred CC--CEEEEEEECCEEEEEEcCCCC-CcceEEecCCccCCHHHHHHHhccccccccccccccccccccccCCCccceeEE
Confidence 76 999999999999999998653 67777775 2322 234567899999
Q ss_pred hhhhHHHHHHHHhhCCCcc---cCCCCcccchHHHHHHHHHHHH----hcCCEEEEEecCCCCCCcccccccC--CcCcc
Q 001711 512 LSESRSVVDTLLDSLPSMF---QDNMNVESAFGPALKAAFMVMS----RLGGKLLIFQNSLPSLGVGCLKLRG--DDLRV 582 (1021)
Q Consensus 512 l~es~~~I~~lLd~Lp~~f---~~~~~~~~alG~AL~aA~~lL~----~~GGkIivF~sg~Pt~GpG~L~~r~--~~~r~ 582 (1021)
++||+..|+++||+|+.++ .+++++++|+|+||++|..+|+ .+||||++|++|+||.|||+|+.|+ +..|.
T Consensus 233 l~e~~~~i~~lLe~L~~~~~~~~~~~rp~r~tG~AL~vA~~lL~~~~~~~gGrI~~F~sgppT~GpG~v~~r~~~~~~rs 312 (761)
T PLN00162 233 ASECEFTLNSALEELQKDPWPVPPGHRPARCTGAALSVAAGLLGACVPGTGARIMAFVGGPCTEGPGAIVSKDLSEPIRS 312 (761)
T ss_pred HHHHHHHHHHHHHhhhccccccCCCCCCCccHHHHHHHHHHHHhhccCCCceEEEEEeCCCCCCCCceeecccccccccC
Confidence 9999999999999998763 6778899999999999999998 5799999999999999999999885 34555
Q ss_pred cCC--CccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHH
Q 001711 583 YGT--DKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSR 660 (1021)
Q Consensus 583 ~gt--~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r 660 (1021)
+.+ +++.++++++.+||++||.+|+++||+||||+++.+|+||++|+.|++.|||.+++|++|+. ++|.++|+|
T Consensus 313 h~di~k~~~~~~~~a~~fY~~la~~~~~~gisvDlF~~s~dqvglaem~~l~~~TGG~v~~~~sF~~----~~f~~~l~r 388 (761)
T PLN00162 313 HKDLDKDAAPYYKKAVKFYEGLAKQLVAQGHVLDVFACSLDQVGVAEMKVAVERTGGLVVLAESFGH----SVFKDSLRR 388 (761)
T ss_pred ccccccchhhhcchHHHHHHHHHHHHHHcCceEEEEEccccccCHHHHhhhHhhcCcEEEEeCCcCh----HHHHHHHHH
Confidence 542 45567999999999999999999999999999999999999999999999999999999976 578888898
Q ss_pred hcccc------cccceEEEEEeCCCeEEEeeecCccc---------------CCCCceeeccCCCCCcEEEEEEecccc-
Q 001711 661 DLTRE------TAWEAVMRIRCGKGVRFTNYHGNFML---------------RSTDLLALPAVDCDKAYAMQLSLEETL- 718 (1021)
Q Consensus 661 ~ltr~------~g~~a~mrVR~S~Gl~V~~~~Gnf~~---------------rs~~~~~l~~id~d~Sia~~~~~d~~l- 718 (1021)
.++|+ +||+|+||||||+||+|+++||||+. +++++|+++++++|+||+|+|+++++.
T Consensus 389 ~~~r~~~~~~~~gf~a~~~VrtS~glkv~g~~G~~~s~~~~~~~vsd~~iG~g~T~~w~l~~l~~~~t~av~f~~~~~~~ 468 (761)
T PLN00162 389 VFERDGEGSLGLSFNGTFEVNCSKDVKVQGAIGPCASLEKKGPSVSDTEIGEGGTTAWKLCGLDKKTSLAVFFEVANSGQ 468 (761)
T ss_pred HhcccccccccccceeEEEEEecCCeEEeeeEcCcccccccCCccccccccCCCCceeeecCcCcCCEEEEEEEEccccc
Confidence 88864 79999999999999999999999862 457889999999999999999998765
Q ss_pred ----CCCceeEEEEEEEEEecCCcEEEEEEeeeecccC--CHHHHHHhcCHhHHHHHHHHHHHHHHhcCCHHHHHHHHHH
Q 001711 719 ----LTTQTVYFQVALLYTASCGERRIRVHTLAAPVVS--NLSDMYQQADTGAIVSVFSRLAIEKTLSHKLEDARNAVQL 792 (1021)
Q Consensus 719 ----~~~~~~~iQ~AllYTt~~GeRrIRV~Tl~lpvt~--~l~~vf~s~D~eai~~~laK~a~~~~l~~~l~d~R~~l~~ 792 (1021)
.++..+|||+|++||+.+|||||||||++++++. ++.++|+++|+||++++|+|+|+.+++++++.|+|++|++
T Consensus 469 ~~~~~~~~~~~iQ~a~lYt~~~G~rRiRV~T~~~~~~~~~~~~~v~~~fDqeA~a~llaR~av~k~~~~~~~d~~r~ld~ 548 (761)
T PLN00162 469 SNPQPPGQQFFLQFLTRYQHSNGQTRLRVTTVTRRWVEGSSSEELVAGFDQEAAAVVMARLASHKMETEEEFDATRWLDR 548 (761)
T ss_pred cCCCCCCceEEEEEEEEEEcCCCCEEEEEEccccCccCCCCHHHHHHhcCHHHHHHHHHHHHHHHHhhCCHHHHHHHHHH
Confidence 4557899999999999999999999999999654 8899999999999999999999999999999999999999
Q ss_pred HHHHHH---HHHHhhhhhccCCCCccccccccccHHHHHHHhhhccCCCCCCCCCChhHHHHHHHHHcCCCHHHHHhhhc
Q 001711 793 RLVKAL---KEYRNLYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPIRGGYADVTLDERCAAGYTMMALPVKKLLKLLY 869 (1021)
Q Consensus 793 ~lv~iL---~~YRk~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~Lr~g~~~~s~DeR~~~~~~l~s~~v~~~~~~lY 869 (1021)
+|++++ ..||| .+ +.+|+||++||+||+|||+|+||.+|+. .++++|||++++++++++++.+++.|||
T Consensus 549 ~li~~~~~f~~Yrk-~~-----~~s~~Lp~~~~~lP~f~~~LrRS~~l~~--~n~spDera~~r~~l~~~~~~~sl~mI~ 620 (761)
T PLN00162 549 ALIRLCSKFGDYRK-DD-----PSSFRLSPNFSLYPQFMFNLRRSQFVQV--FNNSPDETAYFRMMLNRENVTNSLVMIQ 620 (761)
T ss_pred HHHHHHHHHhhhcc-cC-----CccccCCHHHHHHHHHHHHHhhhhhccC--CCCCchHHHHHHHHHhcCCHHHHHHhhC
Confidence 999874 67888 44 3469999999999999999999999995 7899999999999999999999999999
Q ss_pred ccEEEeecCCCCCCccCCcccccccccccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhhccccccc
Q 001711 870 PCLIRVDEHLLKPSAQLDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAELSKVMLRE 949 (1021)
Q Consensus 870 PrL~~lh~~~~~~~~~~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~~lp~ 949 (1021)
|+||++|.- .+|+++.|+.++|++|||||||+|++++||+|+.+.+|..+.+... |+
T Consensus 621 P~L~sy~~~------------~~P~pv~Ld~~si~~d~ilLLD~~f~vvi~~G~~ia~w~~~~~~~~-----------~~ 677 (761)
T PLN00162 621 PTLISYSFN------------GPPEPVLLDVASIAADRILLLDSYFSVVIFHGSTIAQWRKAGYHNQ-----------PE 677 (761)
T ss_pred CeEEEecCC------------CCCcceecchhhccCCceEEEeCCCEEEEEecCcccchhhcCCCCC-----------cc
Confidence 999999831 1377899999999999999999999999999999999999888876 44
Q ss_pred cch--HHHHHHHHHHHHHHHhCCCCCceEEEeccCCCcchHHHHHhhccccCC--------------CCCCCHHHHHHHH
Q 001711 950 QDN--EMSRKLLGILKKLREQDPSYYQLCQLVRQGEQPREGFLLLANLVEDQI--------------GGSNGYADWIMQI 1013 (1021)
Q Consensus 950 ~~n--~~s~~l~~ii~~lr~~r~~~~~l~~vvrqg~~~~~e~~f~~~LVED~~--------------~~~~SY~dFL~~l 1013 (1021)
+++ ++.+..++.+++|.+.|.+.+++ +++.||.++ .++++++|---.+ -++.|+..|+.||
T Consensus 678 ~~~~~~~l~~p~~~a~~~~~~Rfp~Pr~-i~~~~~~Sq--aRfl~~klnPs~~~~~~~~~~~~~~~~tdd~sl~~f~~~l 754 (761)
T PLN00162 678 HEAFAQLLEAPQADAQAIIKERFPVPRL-VVCDQHGSQ--ARFLLAKLNPSATYNSANAMGGSDIIFTDDVSLQVFMEHL 754 (761)
T ss_pred hhhHHHHHHhHHHHHHHHHhcCCCCCeE-EEeCCCCcH--HHHHHHhcCCcccccCCCCCCCCCeeecCCcCHHHHHHHH
Confidence 442 67778888999999999999998 999999988 8888888875411 1579999999999
Q ss_pred HHHHhc
Q 001711 1014 HRQVLQ 1019 (1021)
Q Consensus 1014 h~~I~~ 1019 (1021)
+|.+.+
T Consensus 755 ~~~~v~ 760 (761)
T PLN00162 755 QRLAVQ 760 (761)
T ss_pred HHHhcC
Confidence 998764
No 6
>KOG1986 consensus Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00 E-value=4e-90 Score=791.54 Aligned_cols=653 Identities=19% Similarity=0.290 Sum_probs=563.8
Q ss_pred CCCCceeccccccCCCHHHHhhcCCceEEEEccCCCCCCC-----CC---CccceEEccceeEecCCceEEEcCCCCCCC
Q 001711 312 CHSRYLRLTTSAIPNSQSLVSRWHLPLGAVVCPLAEPPEG-----NL---FICRTYVNPYVTFTDAGRKWRCNICALLND 383 (1021)
Q Consensus 312 ~~P~y~R~T~~~iP~t~~l~~~~~lPlg~vv~Pfa~~~~~-----e~---~rCrAYiNPf~~f~~~g~~W~Cn~C~~~N~ 383 (1021)
-.-+.+|+|||.+|.++....++.+|++++++||.+.+.. +. ++|+||+||||.++.+.+.|.|+||+..|.
T Consensus 7 e~~dGvR~twnvwPs~~~~~~~~vvPla~lytPl~e~~~~~~~~y~P~~C~~C~AvlNPyc~vd~~a~~W~CpfC~qrN~ 86 (745)
T KOG1986|consen 7 EEIDGVRFTWNVWPSTRAEASRTVVPLACLYTPLKERPDLPPIQYDPLRCSKCGAVLNPYCSVDFRAKSWICPFCNQRNP 86 (745)
T ss_pred ccCCCcccccccCCCcccccccccccHHHhccccccCCCCCccCCCCchhccchhhcCcceeecccCceEeccccccCCC
Confidence 3446899999999999999999999999999999975541 12 889999999999999999999999999999
Q ss_pred CCcccccccCcCcccCCCCCCCcc--ccccEEEEccccccCCCCCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCC
Q 001711 384 VPGDYFAHLDATGRRIDIDQRPEL--TKGSVEFVAPTEYMVRPPMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELP 461 (1021)
Q Consensus 384 vP~~Y~~~l~~~g~R~D~~~rPEL--~~gtVEfvap~eY~~r~p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp 461 (1021)
+|.+|-. +.++ +..+|| ...+|||+.++.. ..||+|+||||+|....+ ++.++++|+.+|+.||
T Consensus 87 ~p~~Y~~-is~~------n~P~el~Pq~stvEy~l~~~~----~~ppvf~fVvDtc~~eee---L~~LkssL~~~l~lLP 152 (745)
T KOG1986|consen 87 FPPHYSG-ISEN------NLPPELLPQYSTVEYTLSPGR----VSPPVFVFVVDTCMDEEE---LQALKSSLKQSLSLLP 152 (745)
T ss_pred CChhhcc-cCcc------CCChhhcCCcceeEEecCCCC----CCCceEEEEEeeccChHH---HHHHHHHHHHHHhhCC
Confidence 9999853 3332 466688 7999999998653 358999999999999866 8999999999999999
Q ss_pred CCCCceEEEEEEcCeEEEEecCCCCCCcceeecc---c-----ccccc------------CCCCCccceehhhhHHHHHH
Q 001711 462 GFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVIS---D-----LDDIF------------VPLPDDLLVNLSESRSVVDT 521 (1021)
Q Consensus 462 ~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvs---D-----ldd~f------------~Pl~~~lLv~l~es~~~I~~ 521 (1021)
++ +.||||||++.||+|+|+... ..+..|.. | +.|.. -.....||.++.+|...+.+
T Consensus 153 ~~--alvGlItfg~~v~v~el~~~~-~sk~~VF~G~ke~s~~q~~~~L~~~~~~~~~~~~~~~~~rFL~P~~~c~~~L~~ 229 (745)
T KOG1986|consen 153 EN--ALVGLITFGTMVQVHELGFEE-CSKSYVFSGNKEYSAKQLLDLLGLSGGAGKGSENQSASNRFLLPAQECEFKLTN 229 (745)
T ss_pred Cc--ceEEEEEecceEEEEEcCCCc-ccceeEEeccccccHHHHHHHhcCCcccccCCcccccchhhhccHHHHHHHHHH
Confidence 87 999999999999999998642 22333432 1 11111 00124799999999999999
Q ss_pred HHhhCC---CcccCCCCcccchHHHHHHHHHHHHh----cCCEEEEEecCCCCCCcccccccC--CcCcccC--CCcccc
Q 001711 522 LLDSLP---SMFQDNMNVESAFGPALKAAFMVMSR----LGGKLLIFQNSLPSLGVGCLKLRG--DDLRVYG--TDKEHS 590 (1021)
Q Consensus 522 lLd~Lp---~~f~~~~~~~~alG~AL~aA~~lL~~----~GGkIivF~sg~Pt~GpG~L~~r~--~~~r~~g--t~~e~~ 590 (1021)
+|++|. +.....+++.||+|.||.+|+.+|+. +|+||++|++|+||.|||++..+| +.+|.+. .++...
T Consensus 230 lle~L~~d~wpV~~g~Rp~RcTG~Al~iA~~Ll~~c~p~~g~rIv~f~gGPcT~GpG~vv~~el~~piRshhdi~~d~a~ 309 (745)
T KOG1986|consen 230 LLEELQPDPWPVPPGHRPLRCTGVALSIASGLLEGCFPNTGARIVLFAGGPCTRGPGTVVSRELKEPIRSHHDIEKDNAP 309 (745)
T ss_pred HHHHhcCCCCCCCCCCCcccchhHHHHHHHHHhcccCCCCcceEEEeccCCCCcCCceecchhhcCCCcCcccccCcchH
Confidence 999994 56677899999999999999999986 699999999999999999999885 5677776 455667
Q ss_pred CCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHhcc------c
Q 001711 591 LRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRDLT------R 664 (1021)
Q Consensus 591 l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~lt------r 664 (1021)
+++.+.+||++||++++.+|++||+|+++.++++|++|..|++.|||.+...++|+.+.++. .++|.++ .
T Consensus 310 y~kKa~KfY~~La~r~~~~ghvlDifa~~lDQvGi~EMk~l~~~TGG~lvl~dsF~~s~Fk~----sfqR~f~~d~~~~l 385 (745)
T KOG1986|consen 310 YYKKAIKFYEKLAERLANQGHVLDIFAAALDQVGILEMKPLVESTGGVLVLGDSFNTSIFKQ----SFQRIFTRDGEGDL 385 (745)
T ss_pred HHHHHHHHHHHHHHHHHhCCceEeeeeeeccccchHHHHHHhhcCCcEEEEecccchHHHHH----HHHHHhccccccch
Confidence 88999999999999999999999999999999999999999999999999999998865544 4555555 4
Q ss_pred ccccceEEEEEeCCCeEEEeeecCccc---------------CCCCceeeccCCCCCcEEEEEEeccc--cCCCceeEEE
Q 001711 665 ETAWEAVMRIRCGKGVRFTNYHGNFML---------------RSTDLLALPAVDCDKAYAMQLSLEET--LLTTQTVYFQ 727 (1021)
Q Consensus 665 ~~g~~a~mrVR~S~Gl~V~~~~Gnf~~---------------rs~~~~~l~~id~d~Sia~~~~~d~~--l~~~~~~~iQ 727 (1021)
..||+|.|+|+||++|+|++.+|++.. +++..|++..++..+++++.|++..+ +.....+|||
T Consensus 386 ~~~fn~~leV~tSkdlkI~g~IGp~~Sl~~k~~~vsdt~ig~g~t~~wkm~~ls~~t~~s~~fei~~~~~~~~~~~~~iQ 465 (745)
T KOG1986|consen 386 KMGFNGTLEVKTSKDLKIQGVIGPCVSLNKKGPNVSDTEIGEGNTSAWKMCGLSPSTTLSLFFEISNQHNIPQSGQGYIQ 465 (745)
T ss_pred hhhcCceEEEEecCCcEEEecccccccccCCCCccccceeccccccceeeeccCCCceEEEEEEeccccCCCCCCeeEEE
Confidence 689999999999999999999998651 35678999999999999999998643 3345789999
Q ss_pred EEEEEEecCCcEEEEEEeeeecccCCH-HHHHHhcCHhHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHH---HHHh
Q 001711 728 VALLYTASCGERRIRVHTLAAPVVSNL-SDMYQQADTGAIVSVFSRLAIEKTLSHKLEDARNAVQLRLVKALK---EYRN 803 (1021)
Q Consensus 728 ~AllYTt~~GeRrIRV~Tl~lpvt~~l-~~vf~s~D~eai~~~laK~a~~~~l~~~l~d~R~~l~~~lv~iL~---~YRk 803 (1021)
|++.|.+.+|++||||+|++.+.++.. .++-.++|+||.++++||+++.++.++...|++.++++.++++.. .|+|
T Consensus 466 FiT~Yq~s~g~~riRVtT~~r~~~d~~~~~i~~~FDqEaaAV~mAR~~~~kae~e~~~d~~rwlDr~Lirlc~kFg~y~k 545 (745)
T KOG1986|consen 466 FITQYQHSSGQKRIRVTTLARPWADSGSPEISQSFDQEAAAVLMARLALLKAETEDGPDVLRWLDRNLIRLCQKFGDYRK 545 (745)
T ss_pred EEEEEEcCCCcEEEEEEEeehhhccccchHhhhccchHHHHHHHHHHHHHhhhccccchHHHHHHHHHHHHHHHHhccCC
Confidence 999999999999999999999999987 588899999999999999999999999888999999999988854 5666
Q ss_pred hhhhccCCCCccccccccccHHHHHHHhhhccCCCCCCCCCChhHHHHHHHHHcCCCHHHHHhhhcccEEEeecCCCCCC
Q 001711 804 LYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPIRGGYADVTLDERCAAGYTMMALPVKKLLKLLYPCLIRVDEHLLKPS 883 (1021)
Q Consensus 804 ~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~Lr~g~~~~s~DeR~~~~~~l~s~~v~~~~~~lYPrL~~lh~~~~~~~ 883 (1021)
..+..+.|+++|.++|.|||+|+||+.|.- .+.|+|||+|++|+|.+.++.+++.||.|+|++++..
T Consensus 546 ------~dPssf~l~~~fsl~PQfmfhLRRS~fLqv--fNnSPDEt~~yrhll~~e~v~~sliMIqP~L~sySf~----- 612 (745)
T KOG1986|consen 546 ------DDPSSFRLSPNFSLYPQFMFHLRRSPFLQV--FNNSPDETAYYRHLLNREDVDNSLIMIQPTLLSYSFN----- 612 (745)
T ss_pred ------CCchhhcCChhhhhhHHHHHhhccchhhhc--cCCCcchHHHHHHHHhhccchhhhheecceeeeeecC-----
Confidence 455679999999999999999999999994 8999999999999999999999999999999999853
Q ss_pred ccCCcccccccccccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhhccccccccch--HHHHHHHHH
Q 001711 884 AQLDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAELSKVMLREQDN--EMSRKLLGI 961 (1021)
Q Consensus 884 ~~~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~~lp~~~n--~~s~~l~~i 961 (1021)
. -|+++.|+..+|.+|.|+|||+++.|+||.|..+..|...++... ||+++ ++.+..++.
T Consensus 613 ---g----~~epvlLD~~Si~~D~iLLlDt~f~i~i~hG~tIaqWR~~gy~~~-----------pe~~~f~~LL~ap~~d 674 (745)
T KOG1986|consen 613 ---G----PPEPVLLDVASILADRILLLDTYFTIVIFHGSTIAQWRKAGYHEQ-----------PEYENFKELLEAPRED 674 (745)
T ss_pred ---C----CCceeEecccccCCceEEEeecceEEEEECCchHHHHHhcccccC-----------hhhHHHHHHHHhHHHH
Confidence 1 156789999999999999999999999999999999999888876 55653 788888999
Q ss_pred HHHHHHhCCCCCceEEEeccCCCcchHHHHHhhccccC--C------------CCCCCHHHHHHHHHHHHhc
Q 001711 962 LKKLREQDPSYYQLCQLVRQGEQPREGFLLLANLVEDQ--I------------GGSNGYADWIMQIHRQVLQ 1019 (1021)
Q Consensus 962 i~~lr~~r~~~~~l~~vvrqg~~~~~e~~f~~~LVED~--~------------~~~~SY~dFL~~lh~~I~~ 1019 (1021)
+++|-..|.+.+++ ++++||.++ ..++++++.--. + -+++||.+|+.||+|.+..
T Consensus 675 A~el~~~RFP~PR~-v~~~q~GSQ--ARFLlsklnPS~t~~~~~~~~~s~~I~TDDvSlq~fm~hLkklav~ 743 (745)
T KOG1986|consen 675 AQELLLERFPMPRY-VVTDQGGSQ--ARFLLSKLNPSETHNNLTAHGGSSIILTDDVSLQVFMEHLKKLAVS 743 (745)
T ss_pred HHHHHHhhCCCCeE-EEecCCccH--HHhhhhhcCcchhccchhhccCCCeeeeccccHHHHHHHHHhhcCC
Confidence 99999999999998 999999877 677778877521 1 1579999999999987654
No 7
>COG5047 SEC23 Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion]
Probab=100.00 E-value=9.8e-83 Score=712.42 Aligned_cols=661 Identities=17% Similarity=0.279 Sum_probs=553.7
Q ss_pred cCCCCceeccccccCCCHHHHhhcCCceEEEEccCCCCCCC-----CC---Cc-cceEEccceeEecCCceEEEcCCCCC
Q 001711 311 NCHSRYLRLTTSAIPNSQSLVSRWHLPLGAVVCPLAEPPEG-----NL---FI-CRTYVNPYVTFTDAGRKWRCNICALL 381 (1021)
Q Consensus 311 N~~P~y~R~T~~~iP~t~~l~~~~~lPlg~vv~Pfa~~~~~-----e~---~r-CrAYiNPf~~f~~~g~~W~Cn~C~~~ 381 (1021)
+-+-+.||+|||++|.|+...+++.+|++|+|+||.+.+.- +. .. |+||+||||.++.+.+.|+|.||+..
T Consensus 6 iee~dgir~twnvfpat~~da~~~~iPia~lY~Pl~e~~~~~v~~yepv~C~~pC~avlnpyC~id~r~~~W~CpfCnqr 85 (755)
T COG5047 6 IEENDGIRLTWNVFPATRGDATRTVIPIACLYTPLHEDDALTVNYYEPVKCTAPCKAVLNPYCHIDERNQSWICPFCNQR 85 (755)
T ss_pred hccccceEEEEecccCCccccccccccHHHhccccccccccCcccCCCceecccchhhcCcceeeccCCceEecceecCC
Confidence 34557899999999999999999999999999999987432 12 44 99999999999999999999999999
Q ss_pred CCCCcccccccCcCcccCCCCCCCcc--ccccEEEEccccccCCCCCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhc
Q 001711 382 NDVPGDYFAHLDATGRRIDIDQRPEL--TKGSVEFVAPTEYMVRPPMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDE 459 (1021)
Q Consensus 382 N~vP~~Y~~~l~~~g~R~D~~~rPEL--~~gtVEfvap~eY~~r~p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~ 459 (1021)
|.+|..|- .+.+ .+..+|| ++.||||+.+++ .-.||+|+||||++++..+ +.+++++|+.+|..
T Consensus 86 n~lp~qy~-~iS~------~~LplellpqssTiey~lskp----~~~ppvf~fvvD~~~D~e~---l~~Lkdslivslsl 151 (755)
T COG5047 86 NTLPPQYR-DISN------ANLPLELLPQSSTIEYTLSKP----VILPPVFFFVVDACCDEEE---LTALKDSLIVSLSL 151 (755)
T ss_pred CCCChhhc-CCCc------ccCCccccCCCceEEEEccCC----ccCCceEEEEEEeecCHHH---HHHHHHHHHHHHhc
Confidence 99999884 3332 2566798 799999999875 3578999999999997766 89999999999999
Q ss_pred CCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccc--------ccccc------CC-------------CCCccceeh
Q 001711 460 LPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISD--------LDDIF------VP-------------LPDDLLVNL 512 (1021)
Q Consensus 460 Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsD--------ldd~f------~P-------------l~~~lLv~l 512 (1021)
||.+ +.||||||++.||+|.++... ..+-.|.+- |+++. .+ .+..|+.++
T Consensus 152 lppe--aLvglItygt~i~v~el~ae~-~~r~~VF~g~~eyt~~~L~~ll~~~~~~~~~~~es~is~~~~~~~~rFl~p~ 228 (755)
T COG5047 152 LPPE--ALVGLITYGTSIQVHELNAEN-HRRSYVFSGNKEYTKENLQELLALSKPTKSGGFESKISGIGQFASSRFLLPT 228 (755)
T ss_pred CCcc--ceeeEEEecceeEEEeccccc-cCcceeecchHHHHHHHHHHHhcccCCCCcchhhhhcccccccchhhhhccH
Confidence 9977 999999999999999997642 222233211 22211 11 123589999
Q ss_pred hhhHHHHHHHHhhCC---CcccCCCCcccchHHHHHHHHHHHHh----cCCEEEEEecCCCCCCcccccccC--CcCccc
Q 001711 513 SESRSVVDTLLDSLP---SMFQDNMNVESAFGPALKAAFMVMSR----LGGKLLIFQNSLPSLGVGCLKLRG--DDLRVY 583 (1021)
Q Consensus 513 ~es~~~I~~lLd~Lp---~~f~~~~~~~~alG~AL~aA~~lL~~----~GGkIivF~sg~Pt~GpG~L~~r~--~~~r~~ 583 (1021)
.+|...+.++||+|. +.....+++.||+|+||.+|..+|+. .|+||++|.+|+||.|||.|..+| +.+|.+
T Consensus 229 q~ce~~L~n~le~L~pd~~~v~~~~Rp~RCTGsAl~ias~Ll~~~~p~~~~~i~lF~~GPcTvGpG~Vvs~elkEpmRsh 308 (755)
T COG5047 229 QQCEFKLLNILEQLQPDPWPVPAGKRPLRCTGSALNIASSLLEQCFPNAGCHIVLFAGGPCTVGPGTVVSTELKEPMRSH 308 (755)
T ss_pred HHHHHHHHHHHHHhCCCCccCCCCCCCccccchhHHHHHHHHHhhccCcceeEEEEcCCCccccCceeeehhhccccccc
Confidence 999999999999994 45677899999999999999999986 699999999999999999999874 567766
Q ss_pred C--CCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHh
Q 001711 584 G--TDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRD 661 (1021)
Q Consensus 584 g--t~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~ 661 (1021)
. +.+..++.+++.+||+.||++.+.+|.++|+|+.+.++++|.+|..|...|||.+...++|+.+++...|.+-|.+.
T Consensus 309 H~ie~d~aqh~kka~KFY~~laeR~a~~gh~~DifagcldqIGI~eM~~L~~sTgg~lvlsdsF~t~ifkqSfqrif~~d 388 (755)
T COG5047 309 HDIESDSAQHSKKATKFYKGLAERVANQGHALDIFAGCLDQIGIMEMEPLTTSTGGALVLSDSFTTSIFKQSFQRIFNRD 388 (755)
T ss_pred ccccccchhhccchHHHHHHHHHHHhccchhHHHHHHHHHhhhhhcchhhccCCcceEEEeccccHHHHHHHHHHHhCcC
Confidence 5 34446889999999999999999999999999999999999999999999999999999999887777766655543
Q ss_pred ccc--ccccceEEEEEeCCCeEEEeeecCccc---------------CCCCceeeccCCCCCcEEEEEEeccccCC----
Q 001711 662 LTR--ETAWEAVMRIRCGKGVRFTNYHGNFML---------------RSTDLLALPAVDCDKAYAMQLSLEETLLT---- 720 (1021)
Q Consensus 662 ltr--~~g~~a~mrVR~S~Gl~V~~~~Gnf~~---------------rs~~~~~l~~id~d~Sia~~~~~d~~l~~---- 720 (1021)
-.. ..||+|.|+|.|||+|+|++.+|+... ..++.|.++.+.+.+++++.|++...-..
T Consensus 389 ~~g~l~~gfNa~m~V~TsKnl~~~g~ig~a~~~~k~~~ni~~~eigi~~t~swkm~slsPk~nyal~fei~~~~~~~~~~ 468 (755)
T COG5047 389 SEGYLKMGFNANMEVKTSKNLKIKGLIGHAVSVKKKANNISDSEIGIGATNSWKMASLSPKSNYALYFEIALGAASGSAQ 468 (755)
T ss_pred cccchhhhhccceeEeeccCceeeeeecceeeecccccccccccccccccccccccccCCCcceEEEEEeccccCCCccC
Confidence 222 479999999999999999999998541 24567999999999999999998643322
Q ss_pred -CceeEEEEEEEEEecCCcEEEEEEeeeecccCC-HHHHHHhcCHhHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHH-
Q 001711 721 -TQTVYFQVALLYTASCGERRIRVHTLAAPVVSN-LSDMYQQADTGAIVSVFSRLAIEKTLSHKLEDARNAVQLRLVKA- 797 (1021)
Q Consensus 721 -~~~~~iQ~AllYTt~~GeRrIRV~Tl~lpvt~~-l~~vf~s~D~eai~~~laK~a~~~~l~~~l~d~R~~l~~~lv~i- 797 (1021)
...+|+|+...|.+++|.-||||.|++...++. ...+++++|+||.++++||+|+.++......|+-++++..++++
T Consensus 469 ~~~~a~iQfiT~yQhss~t~riRVtTvar~f~~~~~p~i~~SFdqEaaaV~~aR~a~~K~~~ed~~Dv~rw~dr~lirlc 548 (755)
T COG5047 469 RPAEAYIQFITTYQHSSGTYRIRVTTVARMFTDGGLPKINRSFDQEAAAVFMARIAAFKAETEDIIDVFRWIDRNLIRLC 548 (755)
T ss_pred CcccchhhhhhhhhccCCcEEEEEeehhhhhccCCChhhhhcchhhHHHHHHHHHHHhhcccccchhHHHHHHHHHHHHH
Confidence 268999999999999999999999999777764 56688899999999999999999999888889888888876665
Q ss_pred --HHHHHhhhhhccCCCCccccccccccHHHHHHHhhhccCCCCCCCCCChhHHHHHHHHHcCCCHHHHHhhhcccEEEe
Q 001711 798 --LKEYRNLYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPIRGGYADVTLDERCAAGYTMMALPVKKLLKLLYPCLIRV 875 (1021)
Q Consensus 798 --L~~YRk~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~Lr~g~~~~s~DeR~~~~~~l~s~~v~~~~~~lYPrL~~l 875 (1021)
++.||| ..+..+.|+.++.++|.|||+|+||+.|.- .+.++|||++++|.+.+.++.+++.|+.|.|.++
T Consensus 549 q~fa~y~k------~dpssfrl~~~f~lypqf~y~lrRSpfL~v--fNnSPDEt~fyrh~l~~~dv~~sLimiqPtL~Sy 620 (755)
T COG5047 549 QKFADYRK------DDPSSFRLDPNFTLYPQFMYHLRRSPFLSV--FNNSPDETAFYRHMLNNADVNDSLIMIQPTLQSY 620 (755)
T ss_pred HHHHhcCC------CCchhhcCCcchhhhhHHHhhhhccceeec--cCCCcchHHHHHHHHhcccccchhhhhcchheee
Confidence 567777 456679999999999999999999999994 8999999999999999999999999999999999
Q ss_pred ecCCCCCCccCCcccccccccccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhhccccccccchHHH
Q 001711 876 DEHLLKPSAQLDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAELSKVMLREQDNEMS 955 (1021)
Q Consensus 876 h~~~~~~~~~~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~~lp~~~n~~s 955 (1021)
|... + ..++-|++-++++|-|+|+|++++|+||-|+.+..|.-..+.....+..+ .++.
T Consensus 621 s~~~--------~----~~pVlLDs~svkpdviLLlDtff~Ili~hG~~iaqwr~agyq~qpey~~l---------K~Ll 679 (755)
T COG5047 621 SFEK--------G----GVPVLLDSVSVKPDVILLLDTFFHILIFHGSYIAQWRNAGYQEQPEYLNL---------KELL 679 (755)
T ss_pred eccC--------C----CceEEEeccccCCCeEEEeeceeEEEEECChHHHHHHhhhhhcCchhhhH---------HHHh
Confidence 9641 1 23578899999999999999999999999999999988877766333222 1455
Q ss_pred HHHHHHHHHHHHhCCCCCceEEEeccCCCcchHHHHHhhccc-cCCC------------CCCCHHHHHHHHHHHHhcC
Q 001711 956 RKLLGILKKLREQDPSYYQLCQLVRQGEQPREGFLLLANLVE-DQIG------------GSNGYADWIMQIHRQVLQN 1020 (1021)
Q Consensus 956 ~~l~~ii~~lr~~r~~~~~l~~vvrqg~~~~~e~~f~~~LVE-D~~~------------~~~SY~dFL~~lh~~I~~k 1020 (1021)
+.-+-.+.++-..|.+.+++ ++++||.++ ..++++++.- |..+ +.++|.+|+.+|+|....|
T Consensus 680 ~~p~~ea~ell~dRfP~Prf-i~teqggSQ--aRfLlskinPsd~~~~~~~~~s~tilTddv~lq~fm~hl~~lav~~ 754 (755)
T COG5047 680 EAPRLEAAELLQDRFPIPRF-IVTEQGGSQ--ARFLLSKINPSDITNKMSGGGSETILTDDVNLQKFMNHLRKLAVSK 754 (755)
T ss_pred hchhhHHHHHHHhhCCCCeE-EEecCCccH--HHHHHhhcCccccccccccCccceeeecccCHHHHHHHHHHHhccC
Confidence 55555667777889999998 999999888 7778888875 2221 4699999999999976544
No 8
>cd01479 Sec24-like Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 24 is very similar to Sec23. The Sec23 and Sec24
Probab=100.00 E-value=4.5e-54 Score=466.04 Aligned_cols=241 Identities=56% Similarity=0.965 Sum_probs=231.4
Q ss_pred CCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCC-CCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCC
Q 001711 425 PMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGF-PRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVP 503 (1021)
Q Consensus 425 p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~-~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~P 503 (1021)
|+||+||||||||..++++|+++++|++|+++|+.||++ +|++|||||||+.||||+++...++++|++++|++|+|+|
T Consensus 1 p~pp~~~FvIDvs~~a~~~g~~~~~~~si~~~L~~lp~~~~~~~VgiITfd~~v~~y~l~~~~~~~q~~vv~dl~d~f~P 80 (244)
T cd01479 1 PQPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPGDDPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDLDDPFLP 80 (244)
T ss_pred CCCCEEEEEEEccHHHHhhChHHHHHHHHHHHHHhcCCCCCCeEEEEEEECCeEEEEECCCCCCCCeEEEeeCcccccCC
Confidence 579999999999999999999999999999999999987 8999999999999999999998889999999999999999
Q ss_pred CCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcCCEEEEEecCCCCCCcccccccCCcCccc
Q 001711 504 LPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLGGKLLIFQNSLPSLGVGCLKLRGDDLRVY 583 (1021)
Q Consensus 504 l~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~ 583 (1021)
++++||++++|+++.|+++||+|+++|.+++++++|+|+||++|..+|+.+||||++|++|+||+|+|+|+.|++ .+..
T Consensus 81 ~~~~~lv~l~e~~~~i~~lL~~L~~~~~~~~~~~~c~G~Al~~A~~lL~~~GGkIi~f~s~~pt~GpG~l~~~~~-~~~~ 159 (244)
T cd01479 81 LPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLKETGGKIIVFQSSLPTLGAGKLKSRED-PKLL 159 (244)
T ss_pred CCcceeecHHHHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHhcCCEEEEEeCCCCCcCCcccccCcc-cccc
Confidence 999999999999999999999999999999999999999999999999999999999999999999999999875 4567
Q ss_pred CCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeC--CCCCchhHHHHHHHHHHh
Q 001711 584 GTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYP--SFQSTTHGERLRHELSRD 661 (1021)
Q Consensus 584 gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~--~F~~~~d~~kl~~dL~r~ 661 (1021)
++++|+++++++++||++||.+|+++||+||+|+++.+|+|+++|+.|+++|||.+++|+ +|+..+|.+||++||+|+
T Consensus 160 ~~~~e~~~~~p~~~fY~~la~~~~~~~isvDlF~~~~~~~dla~l~~l~~~TGG~v~~y~~~~~~~~~d~~kl~~dl~~~ 239 (244)
T cd01479 160 STDKEKQLLQPQTDFYKKLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYPSFNFSAPNDVEKLVNELARY 239 (244)
T ss_pred CchhhhhhcCcchHHHHHHHHHHHHcCeEEEEEEccCcccChhhhhhhhhhcCceEEEECCccCCchhhHHHHHHHHHHH
Confidence 778888999999999999999999999999999999999999999999999999999999 788889999999999999
Q ss_pred ccccc
Q 001711 662 LTRET 666 (1021)
Q Consensus 662 ltr~~ 666 (1021)
++|++
T Consensus 240 ltr~~ 244 (244)
T cd01479 240 LTRKI 244 (244)
T ss_pred hcccC
Confidence 99864
No 9
>cd01468 trunk_domain trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Some members of this family possess a partial MIDAS motif that is a characteristic feature of most vWA domain proteins.
Probab=100.00 E-value=5.9e-50 Score=433.17 Aligned_cols=235 Identities=46% Similarity=0.848 Sum_probs=224.2
Q ss_pred CCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCC
Q 001711 425 PMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPL 504 (1021)
Q Consensus 425 p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl 504 (1021)
|+||+||||||+|++|+++|++++++++|+++|+.||++++++|||||||++||||++++...+++|+|++|++|+|+|.
T Consensus 1 p~pp~~vFvID~s~~ai~~~~l~~~~~sl~~~l~~lp~~~~~~igiITf~~~V~~~~~~~~~~~~~~~v~~dl~d~f~p~ 80 (239)
T cd01468 1 PQPPVFVFVIDVSYEAIKEGLLQALKESLLASLDLLPGDPRARVGLITYDSTVHFYNLSSDLAQPKMYVVSDLKDVFLPL 80 (239)
T ss_pred CCCCEEEEEEEcchHhccccHHHHHHHHHHHHHHhCCCCCCcEEEEEEeCCeEEEEECCCCCCCCeEEEeCCCccCcCCC
Confidence 68999999999999999999999999999999999997677999999999999999999887779999999999999999
Q ss_pred CCccceehhhhHHHHHHHHhhCCCcccC--CCCcccchHHHHHHHHHHHHhc--CCEEEEEecCCCCCCcccccccCCcC
Q 001711 505 PDDLLVNLSESRSVVDTLLDSLPSMFQD--NMNVESAFGPALKAAFMVMSRL--GGKLLIFQNSLPSLGVGCLKLRGDDL 580 (1021)
Q Consensus 505 ~~~lLv~l~es~~~I~~lLd~Lp~~f~~--~~~~~~alG~AL~aA~~lL~~~--GGkIivF~sg~Pt~GpG~L~~r~~~~ 580 (1021)
++++|++++|+++.|+++|++|+.++.. +++.++|+|+||++|..+|+.. ||||++|++|+||+|||+|+.|++ .
T Consensus 81 ~~~~l~~~~e~~~~i~~~l~~l~~~~~~~~~~~~~~~~G~Al~~A~~ll~~~~~gGkI~~f~sg~pt~GpG~l~~~~~-~ 159 (239)
T cd01468 81 PDRFLVPLSECKKVIHDLLEQLPPMFWPVPTHRPERCLGPALQAAFLLLKGTFAGGRIIVFQGGLPTVGPGKLKSRED-K 159 (239)
T ss_pred cCceeeeHHHHHHHHHHHHHhhhhhccccCCCCCcccHHHHHHHHHHHHhhcCCCceEEEEECCCCCCCCCccccCcc-c
Confidence 9999999999999999999999999987 8899999999999999999998 999999999999999999999854 4
Q ss_pred cccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHH
Q 001711 581 RVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSR 660 (1021)
Q Consensus 581 r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r 660 (1021)
+..++++|+++++++++||++||++|++++|+||+|+++.+++|+++|+.|++.|||.+++|++|+..+|.++|.+||+|
T Consensus 160 ~~~~~~~e~~~~~~a~~fY~~la~~~~~~~isvdlF~~~~~~~dl~~l~~l~~~TGG~v~~y~~f~~~~~~~~~~~~l~r 239 (239)
T cd01468 160 EPIRSHDEAQLLKPATKFYKSLAKECVKSGICVDLFAFSLDYVDVATLKQLAKSTGGQVYLYDSFQAPNDGSKFKQDLQR 239 (239)
T ss_pred ccCCCccchhcccccHHHHHHHHHHHHHcCeEEEEEeccccccCHHHhhhhhhcCCceEEEeCCCCCcccHHHHHHHhcC
Confidence 56667889999999999999999999999999999999999999999999999999999999999999999999999975
No 10
>PF04811 Sec23_trunk: Sec23/Sec24 trunk domain; InterPro: IPR006896 COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger (IPR006895 from INTERPRO), an alpha/beta trunk domain, an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes the Sec23/24 alpha/beta trunk domain, which is formed from a single, approximately 250-residue segment plugged into the beta-barrel between strands beta-1 and beta-19. The trunk has an alpha/beta fold with a vWA topology, and it forms the dimer interface, primarily involving strand beta-14 on Sec23 and Sec24; in addition, the trunk domain of Sec23 contacts Sar1.; GO: 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EGD_A 2NUP_A 3EG9_A 3EFO_A 3EGX_A 2NUT_A 1PD0_A 1PD1_A 1M2V_B 1PCX_A ....
Probab=100.00 E-value=9e-50 Score=432.72 Aligned_cols=237 Identities=51% Similarity=0.915 Sum_probs=205.8
Q ss_pred CCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCC
Q 001711 425 PMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPL 504 (1021)
Q Consensus 425 p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl 504 (1021)
|+||+|+||||+|.+|+++|++++++++|+++|+.|+.+++++|||||||++||||+++.+..+++|+|++|+||+|+|.
T Consensus 1 P~pp~y~FvID~s~~av~~g~~~~~~~sl~~~l~~l~~~~~~~vgiitfd~~V~~y~l~~~~~~~~~~v~~dl~~~~~p~ 80 (243)
T PF04811_consen 1 PQPPVYVFVIDVSYEAVQSGLLQSLIESLKSALDSLPGDERTRVGIITFDSSVHFYNLSSSLSQPQMIVVSDLDDPFIPL 80 (243)
T ss_dssp -S--EEEEEEE-SHHHHHHTHHHHHHHHHHHHGCTSSTSTT-EEEEEEESSSEEEEETTTTSSSTEEEEEHHTTSHHSST
T ss_pred CCCCEEEEEEECchhhhhccHHHHHHHHHHHHHHhccCCCCcEEEEEEeCCEEEEEECCCCcCCCcccchHHHhhcccCC
Confidence 68999999999999999999999999999999999997778999999999999999999988889999999999999999
Q ss_pred CCccceehhhhHHHHHHHHhhCCCcccCC--CCcccchHHHHHHHHHHHH--hcCCEEEEEecCCCCCCc-ccccccCCc
Q 001711 505 PDDLLVNLSESRSVVDTLLDSLPSMFQDN--MNVESAFGPALKAAFMVMS--RLGGKLLIFQNSLPSLGV-GCLKLRGDD 579 (1021)
Q Consensus 505 ~~~lLv~l~es~~~I~~lLd~Lp~~f~~~--~~~~~alG~AL~aA~~lL~--~~GGkIivF~sg~Pt~Gp-G~L~~r~~~ 579 (1021)
+++||+++.|+++.|+++|++|+.++..+ +++++|+|+||++|..+|+ ..||||++|++|+||+|+ |+|+.+++
T Consensus 81 ~~~llv~~~e~~~~i~~ll~~L~~~~~~~~~~~~~~c~G~Al~~A~~ll~~~~~gGkI~~F~s~~pt~G~Gg~l~~~~~- 159 (243)
T PF04811_consen 81 PDGLLVPLSECRDAIEELLESLPSIFPETAGKRPERCLGSALSAALSLLSSRNTGGKILVFTSGPPTYGPGGSLKKRED- 159 (243)
T ss_dssp SSSSSEETTTCHHHHHHHHHHHHHHSTT-TTB-----HHHHHHHHHHHHHHHTS-EEEEEEESS---SSSTTSS-SBTT-
T ss_pred cccEEEEhHHhHHHHHHHHHHhhhhcccccccCccccHHHHHHHHHHHHhccccCCEEEEEeccCCCCCCCceeccccc-
Confidence 99999999999999999999999988887 8899999999999999999 899999999999999999 77777754
Q ss_pred CcccCCCcc-ccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHH
Q 001711 580 LRVYGTDKE-HSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHEL 658 (1021)
Q Consensus 580 ~r~~gt~~e-~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL 658 (1021)
.+.+++++| ..++.++++||++||++|+++||+||+|+++.+++|+++|+.|++.|||.+++|++|+.++|.++|++||
T Consensus 160 ~~~~~~~~~~~~~~~~~~~fY~~la~~~~~~~isvDlf~~~~~~~~l~tl~~l~~~TGG~l~~y~~f~~~~~~~~l~~dl 239 (243)
T PF04811_consen 160 SSHYDTEKEKALLLPPANEFYKKLAEECSKQGISVDLFVFSSDYVDLATLGPLARYTGGSLYYYPNFNAERDGEKLRQDL 239 (243)
T ss_dssp SCCCCHCTTHHCHSHSSSHHHHHHHHHHHHCTEEEEEEEECSS--SHHHHTHHHHCTT-EEEEETTTTCHHHHHHHHHHH
T ss_pred ccccccccchhhhccccchHHHHHHHHHHhcCCEEEEEeecCCCCCcHhHHHHHHhCceeEEEeCCCCCchhHHHHHHHH
Confidence 456666666 6778888999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHhc
Q 001711 659 SRDL 662 (1021)
Q Consensus 659 ~r~l 662 (1021)
+|++
T Consensus 240 ~r~~ 243 (243)
T PF04811_consen 240 KRLV 243 (243)
T ss_dssp HHHH
T ss_pred HHhC
Confidence 9874
No 11
>cd01478 Sec23-like Sec23-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 23 is very similar to Sec24. The Sec23 and Sec24
Probab=100.00 E-value=2e-44 Score=394.49 Aligned_cols=225 Identities=20% Similarity=0.330 Sum_probs=195.5
Q ss_pred CCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCC---------------CCc
Q 001711 425 PMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSL---------------TQP 489 (1021)
Q Consensus 425 p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~---------------~~p 489 (1021)
|.||+|+||||+|.++++ +++++++|+++|+.||++ ++|||||||++||||||+... ++.
T Consensus 1 p~pp~~vFviDvs~~~~e---l~~l~~sl~~~L~~lP~~--a~VGlITfd~~V~~~~L~~~~~~~~~vf~g~~~~~~~~~ 75 (267)
T cd01478 1 TSPPVFLFVVDTCMDEEE---LDALKESLIMSLSLLPPN--ALVGLITFGTMVQVHELGFEECSKSYVFRGNKDYTAKQI 75 (267)
T ss_pred CCCCEEEEEEECccCHHH---HHHHHHHHHHHHHhCCCC--CEEEEEEECCEEEEEEcCCCcCceeeeccCCccCCHHHH
Confidence 578999999999999998 889999999999999976 899999999999999998541 111
Q ss_pred -cee------------eccccccccCCCC-CccceehhhhHHHHHHHHhhCCCc---ccCCCCcccchHHHHHHHHHHHH
Q 001711 490 -QMM------------VISDLDDIFVPLP-DDLLVNLSESRSVVDTLLDSLPSM---FQDNMNVESAFGPALKAAFMVMS 552 (1021)
Q Consensus 490 -~ml------------VvsDldd~f~Pl~-~~lLv~l~es~~~I~~lLd~Lp~~---f~~~~~~~~alG~AL~aA~~lL~ 552 (1021)
+|+ +.+|++|.|+|.+ ++||++++||++.|+++||+|+.+ +.+++++++|+|+||++|..+|+
T Consensus 76 ~~~l~~~~~~~~~~~~~~~~~~~~~~p~~~~~flvpl~e~~~~i~~lLe~L~~~~~~~~~~~r~~r~~G~Al~~A~~ll~ 155 (267)
T cd01478 76 QDMLGLGGPAMRPSASQHPGAGNPLPSAAASRFLLPVSQCEFTLTDLLEQLQPDPWPVPAGHRPLRCTGVALSIAVGLLE 155 (267)
T ss_pred HHHhccccccccccccCcCCccccccccccccEEEEHHHHHHHHHHHHHhCcccccccCCCCCCCCchHHHHHHHHHHHH
Confidence 222 2245788999876 699999999999999999999875 46678899999999999999998
Q ss_pred ----hcCCEEEEEecCCCCCCcccccccC--CcCcccC-CCcc-ccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcC
Q 001711 553 ----RLGGKLLIFQNSLPSLGVGCLKLRG--DDLRVYG-TDKE-HSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTD 624 (1021)
Q Consensus 553 ----~~GGkIivF~sg~Pt~GpG~L~~r~--~~~r~~g-t~~e-~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~d 624 (1021)
.+||||++|++|+||+|||+|+.|+ +..|.+. .+++ .++++++++||++||.+|+++||+||+|+++.+|+|
T Consensus 156 ~~~~~~gGki~~F~sg~pT~GpG~l~~r~~~~~~r~~~d~~~~~~~~~~~a~~fY~~la~~~~~~~vsvDlF~~s~d~vg 235 (267)
T cd01478 156 ACFPNTGARIMLFAGGPCTVGPGAVVSTELKDPIRSHHDIDKDNAKYYKKAVKFYDSLAKRLAANGHAVDIFAGCLDQVG 235 (267)
T ss_pred hhcCCCCcEEEEEECCCCCCCCceeeccccccccccccccccchhhhhhhHHHHHHHHHHHHHhCCeEEEEEeccccccC
Confidence 5899999999999999999999885 3455544 4444 469999999999999999999999999999999999
Q ss_pred hhhhhhhccccccEEEEeCCCCCchhHHHH
Q 001711 625 IASLGTLAKYTGGQVYYYPSFQSTTHGERL 654 (1021)
Q Consensus 625 iatl~~L~~~TGG~v~~y~~F~~~~d~~kl 654 (1021)
|++|+.|++.|||.+|+|++|+.+.+.+.|
T Consensus 236 laem~~l~~~TGG~v~~~~~f~~~~f~~s~ 265 (267)
T cd01478 236 LLEMKVLVNSTGGHVVLSDSFTTSIFKQSF 265 (267)
T ss_pred HHHHHHHHHhcCcEEEEeCCcchHHHHHHh
Confidence 999999999999999999999886544443
No 12
>PF04815 Sec23_helical: Sec23/Sec24 helical domain; InterPro: IPR006900 COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger (IPR006895 from INTERPRO), an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region, and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes the all-helical domain, which forms an approximately 105-residue segment with the C-terminal 30 residues. The linker between alpha-M and alpha-N contacts Sar1.; GO: 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EGD_B 2NUP_B 2NUT_B 3EGX_B 3EH2_C 3EH1_A 3EFO_B 3EG9_B 2QTV_A 1M2O_C ....
Probab=99.86 E-value=2.1e-21 Score=183.76 Aligned_cols=103 Identities=41% Similarity=0.650 Sum_probs=96.9
Q ss_pred HhHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCccccccccccHHHHHHHhhhccCCCCCCC
Q 001711 763 TGAIVSVFSRLAIEKTLSHKLEDARNAVQLRLVKALKEYRNLYAVQHRLGSRMIYPESLKFLPLYCLAICKSTPIRGGYA 842 (1021)
Q Consensus 763 ~eai~~~laK~a~~~~l~~~l~d~R~~l~~~lv~iL~~YRk~~~a~~~~~~qLiLPesLklLPlyil~LlKS~~Lr~g~~ 842 (1021)
|||++++++|++++++.+++++|+|+.++++|+++|++||+ +|+..++++||+|||+||+||+|+++|+||++|++ .
T Consensus 1 Qda~~~llak~ai~~~~~~~l~~~r~~l~~~~v~il~~Yr~-~~~~~~~~~qLilPe~lklLPly~l~llKs~alr~--~ 77 (103)
T PF04815_consen 1 QDAITSLLAKQAIDKALSSSLKDARESLDNRLVDILAAYRK-NCASSSSSGQLILPESLKLLPLYILALLKSPALRP--T 77 (103)
T ss_dssp HHHHHHHHHHHHHHHHCCS-HHHHHHHHHHHHHHHHHHHHH-HCTTECCCTEEEEEGGGTTHHHHHHHHHTSTTTSC--S
T ss_pred CHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHh-hccCCCCchhhhCCHHHHHHHHHHHHHHcchhhcC--C
Confidence 79999999999999999999999999999999999999999 99998888999999999999999999999999996 7
Q ss_pred CCChhHHHHHHHHHcCCCHHHHHhhh
Q 001711 843 DVTLDERCAAGYTMMALPVKKLLKLL 868 (1021)
Q Consensus 843 ~~s~DeR~~~~~~l~s~~v~~~~~~l 868 (1021)
++++|||+|+++++++++++.++.||
T Consensus 78 ~v~~D~R~~~~~~~~~~~~~~~~~~i 103 (103)
T PF04815_consen 78 NVSPDERAYAMHLLLSMPVDSLLRMI 103 (103)
T ss_dssp TS-HHHHHHHHHHHHHS-HHHHHHHH
T ss_pred CCCCcHHHHHHHHHHCCCHHHHHhhC
Confidence 99999999999999999999999875
No 13
>PF08033 Sec23_BS: Sec23/Sec24 beta-sandwich domain; InterPro: IPR012990 COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger (IPR006895 from INTERPRO), an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes part of the Sec23/24 beta-barrel domain, which is formed from approximately 180 residues from three segments of the polypeptide. The strands of the barrel are oriented roughly parallel to the membrane such that one end of the barrel forms part of the inner surface of the coat and the other end part of the membrane-distal surface. The barrel is constructed from two opposed sheets: a six-stranded beta sheet facing partly towards the zinc finger domain and partly towards the solvent, and a five-stranded beta sheet facing the helical domain.; PDB: 3EFO_B 3EG9_B 1PD0_A 1PD1_A 1M2V_B 1PCX_A 3EH2_C 3EGD_A 2NUP_A 3EGX_A ....
Probab=99.83 E-value=1.7e-20 Score=175.26 Aligned_cols=85 Identities=44% Similarity=0.742 Sum_probs=77.2
Q ss_pred ccceEEEEEeCCCeEEEeeecCcccCC---------CCc--eeeccCCCCCcEEEEEEeccccCCCceeEEEEEEEEEec
Q 001711 667 AWEAVMRIRCGKGVRFTNYHGNFMLRS---------TDL--LALPAVDCDKAYAMQLSLEETLLTTQTVYFQVALLYTAS 735 (1021)
Q Consensus 667 g~~a~mrVR~S~Gl~V~~~~Gnf~~rs---------~~~--~~l~~id~d~Sia~~~~~d~~l~~~~~~~iQ~AllYTt~ 735 (1021)
||+|+||||||+||+|++++||+..++ .+. |.++++++|++|+|+|++++++...+.+|||+|++||+.
T Consensus 1 g~~~~l~vr~S~gl~v~~~~G~~~~~~~~s~~~~g~~~~~~~~~~~l~~~~s~~~~~~~~~~~~~~~~~~iQ~~~~Yt~~ 80 (96)
T PF08033_consen 1 GFNAVLRVRCSKGLKVSGVIGPCFNRSSVSDNEIGEGDTTRWKLPSLDPDTSFAFEFEIDEDLPNGSQAYIQFALLYTDS 80 (96)
T ss_dssp EEEEEEEEEE-TTEEEEEEESSSEESSTBESSECSBSSCSEEEEEEEETT--EEEEEEESSBTBTTSEEEEEEEEEEEET
T ss_pred CceEEEEEEECCCeEEEEEEcCccccccccceeeccCCccEEEecccCCCCEEEEEEEECCCCCCCCeEEEEEEEEEECC
Confidence 799999999999999999999998766 455 999999999999999999999877899999999999999
Q ss_pred CCcEEEEEEeeeeccc
Q 001711 736 CGERRIRVHTLAAPVV 751 (1021)
Q Consensus 736 ~GeRrIRV~Tl~lpvt 751 (1021)
+|+|||||+|+++++|
T Consensus 81 ~G~r~iRV~T~~l~vt 96 (96)
T PF08033_consen 81 NGERRIRVTTLSLPVT 96 (96)
T ss_dssp TSEEEEEEEEEEEEEE
T ss_pred CCCEEEEEEeeccccC
Confidence 9999999999999986
No 14
>PF04810 zf-Sec23_Sec24: Sec23/Sec24 zinc finger; InterPro: IPR006895 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger, an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes an approximately 55-residue Sec23/24 zinc-binding domain, which lies against the beta-barrel at the periphery of the complex. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EFO_B 3EG9_B 3EGD_A 2YRC_A 2NUP_A 2YRD_A 3EGX_A 2NUT_A 3EH1_A 1PD0_A ....
Probab=99.19 E-value=6e-12 Score=98.55 Aligned_cols=35 Identities=43% Similarity=1.091 Sum_probs=26.9
Q ss_pred CccceEEccceeEecCCceEEEcCCCCCCCCCccc
Q 001711 354 FICRTYVNPYVTFTDAGRKWRCNICALLNDVPGDY 388 (1021)
Q Consensus 354 ~rCrAYiNPf~~f~~~g~~W~Cn~C~~~N~vP~~Y 388 (1021)
++|+||||||++|+++|++|+|+||++.|++|.+|
T Consensus 6 ~~C~aylNp~~~~~~~~~~w~C~~C~~~N~lp~~Y 40 (40)
T PF04810_consen 6 RRCRAYLNPFCQFDDGGKTWICNFCGTKNPLPPHY 40 (40)
T ss_dssp TTT--BS-TTSEEETTTTEEEETTT--EEE--GGG
T ss_pred CCCCCEECCcceEcCCCCEEECcCCCCcCCCCCCC
Confidence 68999999999999999999999999999999887
No 15
>PRK13685 hypothetical protein; Provisional
Probab=98.76 E-value=3.7e-07 Score=103.96 Aligned_cols=174 Identities=20% Similarity=0.282 Sum_probs=122.0
Q ss_pred CCeEEEEEecchhHHhh----cHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccC
Q 001711 427 PPLYFFLIDVSISAIRS----GMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFV 502 (1021)
Q Consensus 427 pp~yvFvIDvS~~av~s----G~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~ 502 (1021)
.-.+|||||+|.++-.. ..++.+++.++..|+.+.++ .+||+|+|++..++. .
T Consensus 88 ~~~vvlvlD~S~SM~~~D~~p~RL~~ak~~~~~~l~~l~~~--d~vglv~Fa~~a~~~---------------------~ 144 (326)
T PRK13685 88 RAVVMLVIDVSQSMRATDVEPNRLAAAQEAAKQFADELTPG--INLGLIAFAGTATVL---------------------V 144 (326)
T ss_pred CceEEEEEECCccccCCCCCCCHHHHHHHHHHHHHHhCCCC--CeEEEEEEcCceeec---------------------C
Confidence 34689999999998532 46889999999999998654 689999999765421 0
Q ss_pred CCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-----------CCEEEEEecCCCCCCcc
Q 001711 503 PLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-----------GGKLLIFQNSLPSLGVG 571 (1021)
Q Consensus 503 Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-----------GGkIivF~sg~Pt~GpG 571 (1021)
| +.+.++.+.+.|+.|.. ...+++|.||..|++.++.. .++|+++++|.-|.|..
T Consensus 145 p--------~t~d~~~l~~~l~~l~~------~~~T~~g~al~~A~~~l~~~~~~~~~~~~~~~~~IILlTDG~~~~~~~ 210 (326)
T PRK13685 145 S--------PTTNREATKNAIDKLQL------ADRTATGEAIFTALQAIATVGAVIGGGDTPPPARIVLMSDGKETVPTN 210 (326)
T ss_pred C--------CCCCHHHHHHHHHhCCC------CCCcchHHHHHHHHHHHHhhhcccccccCCCCCEEEEEcCCCCCCCCC
Confidence 1 22456778888888853 24577899999999888631 36799999987665421
Q ss_pred cccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCC-------------CcChhhhhhhccccccE
Q 001711 572 CLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDK-------------YTDIASLGTLAKYTGGQ 638 (1021)
Q Consensus 572 ~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~-------------~~diatl~~L~~~TGG~ 638 (1021)
.. + +... .+.+..+.+.||.|.++.++.+ ..|-..|..+++.|||+
T Consensus 211 ~~----~---------------~~~~--~~aa~~a~~~gi~i~~Ig~G~~~g~~~~~g~~~~~~~d~~~L~~iA~~tgG~ 269 (326)
T PRK13685 211 PD----N---------------PRGA--YTAARTAKDQGVPISTISFGTPYGSVEINGQRQPVPVDDESLKKIAQLSGGE 269 (326)
T ss_pred CC----C---------------cccH--HHHHHHHHHcCCeEEEEEECCCCCCcCcCCceeeecCCHHHHHHHHHhcCCE
Confidence 10 0 0001 2456777889999999998864 26778999999999998
Q ss_pred EEEeCCCCCchhHHHHHHHHHHh
Q 001711 639 VYYYPSFQSTTHGERLRHELSRD 661 (1021)
Q Consensus 639 v~~y~~F~~~~d~~kl~~dL~r~ 661 (1021)
.|+..+ ..+-++.+.++.+.
T Consensus 270 ~~~~~~---~~~L~~if~~I~~~ 289 (326)
T PRK13685 270 FYTAAS---LEELRAVYATLQQQ 289 (326)
T ss_pred EEEcCC---HHHHHHHHHHHHHH
Confidence 887654 22334455555443
No 16
>cd01453 vWA_transcription_factor_IIH_type Transcription factors IIH type: TFIIH is a multiprotein complex that is one of the five general transcription factors that binds RNA polymerase II holoenzyme. Orthologues of these genes are found in all completed eukaryotic genomes and all these proteins contain a VWA domain. The p44 subunit of TFIIH functions as a DNA helicase in RNA polymerase II transcription initiation and DNA repair, and its transcriptional activity is dependent on its C-terminal Zn-binding domains. The function of the vWA domain is unclear, but may be involved in complex assembly. The MIDAS motif is not conserved in this sub-group.
Probab=98.70 E-value=5.7e-07 Score=94.10 Aligned_cols=163 Identities=20% Similarity=0.198 Sum_probs=109.2
Q ss_pred eEEEEEecchhHHhh----cHHHHHHHHHHHHHhcCC-CCCCceEEEEEE-cCeEEEEecCCCCCCcceeeccccccccC
Q 001711 429 LYFFLIDVSISAIRS----GMLEVVAQTIKSCLDELP-GFPRTQIGFITF-DSTIHFYNMKSSLTQPQMMVISDLDDIFV 502 (1021)
Q Consensus 429 ~yvFvIDvS~~av~s----G~l~~~~~sI~~~L~~Lp-~~~rt~VgiITF-ds~V~fynl~~~~~~p~mlVvsDldd~f~ 502 (1021)
-.+|+||+|.++.++ ..++.+++.+...++.+. .++..+||||+| ++.-|+. +
T Consensus 5 ~ivi~lD~S~SM~a~D~~ptRl~~ak~~~~~fi~~~~~~~~~~~vglv~f~~~~a~~~---------------------~ 63 (183)
T cd01453 5 HLIIVIDCSRSMEEQDLKPSRLAVVLKLLELFIEEFFDQNPISQLGIISIKNGRAEKL---------------------T 63 (183)
T ss_pred EEEEEEECcHHHhcCCCCchHHHHHHHHHHHHHHHHhhcCccccEEEEEEcCCccEEE---------------------E
Confidence 368999999998643 368888998888887642 234478999999 5543321 1
Q ss_pred CCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc----CCEEEEEecCCCCCCcccccccCC
Q 001711 503 PLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL----GGKLLIFQNSLPSLGVGCLKLRGD 578 (1021)
Q Consensus 503 Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~----GGkIivF~sg~Pt~GpG~L~~r~~ 578 (1021)
|+ +...+.+...|+.+ +. ...+++++.||+.|...|+.. .++|+++.++.-+.++
T Consensus 64 Pl--------T~D~~~~~~~L~~~--~~---~~G~t~l~~aL~~A~~~l~~~~~~~~~~iiil~sd~~~~~~-------- 122 (183)
T cd01453 64 DL--------TGNPRKHIQALKTA--RE---CSGEPSLQNGLEMALESLKHMPSHGSREVLIIFSSLSTCDP-------- 122 (183)
T ss_pred CC--------CCCHHHHHHHhhcc--cC---CCCchhHHHHHHHHHHHHhcCCccCceEEEEEEcCCCcCCh--------
Confidence 22 12223444455554 11 234589999999999999752 3568888774211100
Q ss_pred cCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHH
Q 001711 579 DLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHEL 658 (1021)
Q Consensus 579 ~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL 658 (1021)
.-+.++++++.+.+|.|++..++. ++..|..+|+.|||+.|.-. |.+.|...+
T Consensus 123 ------------------~~~~~~~~~l~~~~I~v~~IgiG~---~~~~L~~ia~~tgG~~~~~~------~~~~l~~~~ 175 (183)
T cd01453 123 ------------------GNIYETIDKLKKENIRVSVIGLSA---EMHICKEICKATNGTYKVIL------DETHLKELL 175 (183)
T ss_pred ------------------hhHHHHHHHHHHcCcEEEEEEech---HHHHHHHHHHHhCCeeEeeC------CHHHHHHHH
Confidence 112567888999999999999974 56789999999999998754 345565555
Q ss_pred HH
Q 001711 659 SR 660 (1021)
Q Consensus 659 ~r 660 (1021)
.+
T Consensus 176 ~~ 177 (183)
T cd01453 176 LE 177 (183)
T ss_pred Hh
Confidence 44
No 17
>cd01467 vWA_BatA_type VWA BatA type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=98.50 E-value=3.5e-06 Score=86.93 Aligned_cols=154 Identities=18% Similarity=0.244 Sum_probs=104.1
Q ss_pred eEEEEEecchhHHhh-----cHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCC
Q 001711 429 LYFFLIDVSISAIRS-----GMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVP 503 (1021)
Q Consensus 429 ~yvFvIDvS~~av~s-----G~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~P 503 (1021)
-++||||+|.++-.. ..++.+++.+...+...+ +.+||+|+|++.++.. +|
T Consensus 4 ~vv~vlD~S~SM~~~~~~~~~r~~~a~~~~~~~~~~~~---~~~v~lv~f~~~~~~~---------------------~~ 59 (180)
T cd01467 4 DIMIALDVSGSMLAQDFVKPSRLEAAKEVLSDFIDRRE---NDRIGLVVFAGAAFTQ---------------------AP 59 (180)
T ss_pred eEEEEEECCcccccccCCCCCHHHHHHHHHHHHHHhCC---CCeEEEEEEcCCeeec---------------------cC
Confidence 478999999987422 135667777777666544 3689999998765421 01
Q ss_pred CCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---CCEEEEEecCCCCCCcccccccCCcC
Q 001711 504 LPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---GGKLLIFQNSLPSLGVGCLKLRGDDL 580 (1021)
Q Consensus 504 l~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---GGkIivF~sg~Pt~GpG~L~~r~~~~ 580 (1021)
+...+..+.++|+.|.... ...++.++.||..|...+... ...|++++.|.++.|. .
T Consensus 60 --------~~~~~~~~~~~l~~l~~~~---~~g~T~l~~al~~a~~~l~~~~~~~~~iiliTDG~~~~g~--~------- 119 (180)
T cd01467 60 --------LTLDRESLKELLEDIKIGL---AGQGTAIGDAIGLAIKRLKNSEAKERVIVLLTDGENNAGE--I------- 119 (180)
T ss_pred --------CCccHHHHHHHHHHhhhcc---cCCCCcHHHHHHHHHHHHHhcCCCCCEEEEEeCCCCCCCC--C-------
Confidence 1123445566666665211 234577999999999998653 2458888887655431 0
Q ss_pred cccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCC----------CcChhhhhhhccccccEEEEeC
Q 001711 581 RVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDK----------YTDIASLGTLAKYTGGQVYYYP 643 (1021)
Q Consensus 581 r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~----------~~diatl~~L~~~TGG~v~~y~ 643 (1021)
...+.+..+.+.||.|+.+.+... ..|...|..|++.|||.+|+..
T Consensus 120 -----------------~~~~~~~~~~~~gi~i~~i~ig~~~~~~~~~~~~~~~~~~l~~la~~tgG~~~~~~ 175 (180)
T cd01467 120 -----------------DPATAAELAKNKGVRIYTIGVGKSGSGPKPDGSTILDEDSLVEIADKTGGRIFRAL 175 (180)
T ss_pred -----------------CHHHHHHHHHHCCCEEEEEEecCCCCCcCCCCcccCCHHHHHHHHHhcCCEEEEec
Confidence 012334556678999999998862 4788889999999999999865
No 18
>cd01466 vWA_C3HC4_type VWA C3HC4-type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most,
Probab=98.50 E-value=1.8e-06 Score=87.60 Aligned_cols=147 Identities=17% Similarity=0.268 Sum_probs=104.3
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCccc
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDLL 509 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~lL 509 (1021)
.+||||+|.++-. .-++.+.++|+..++.|+++ .+||||+|++..+.+- .+.+.
T Consensus 3 v~~vlD~S~SM~~-~rl~~ak~a~~~l~~~l~~~--~~~~li~F~~~~~~~~------------------~~~~~----- 56 (155)
T cd01466 3 LVAVLDVSGSMAG-DKLQLVKHALRFVISSLGDA--DRLSIVTFSTSAKRLS------------------PLRRM----- 56 (155)
T ss_pred EEEEEECCCCCCc-HHHHHHHHHHHHHHHhCCCc--ceEEEEEecCCccccC------------------CCccc-----
Confidence 5799999998743 24777889999999998865 6899999998754320 00000
Q ss_pred eehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-----CCEEEEEecCCCCCCcccccccCCcCcccC
Q 001711 510 VNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-----GGKLLIFQNSLPSLGVGCLKLRGDDLRVYG 584 (1021)
Q Consensus 510 v~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-----GGkIivF~sg~Pt~GpG~L~~r~~~~r~~g 584 (1021)
-.+.++.+.++|+.+. ....++++.||+.|..+++.. ...|++++.|.++.|..
T Consensus 57 --~~~~~~~~~~~i~~~~------~~g~T~~~~al~~a~~~~~~~~~~~~~~~iillTDG~~~~~~~------------- 115 (155)
T cd01466 57 --TAKGKRSAKRVVDGLQ------AGGGTNVVGGLKKALKVLGDRRQKNPVASIMLLSDGQDNHGAV------------- 115 (155)
T ss_pred --CHHHHHHHHHHHHhcc------CCCCccHHHHHHHHHHHHhhcccCCCceEEEEEcCCCCCcchh-------------
Confidence 0134566777777763 245689999999999998643 25788888888765500
Q ss_pred CCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEe
Q 001711 585 TDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYY 642 (1021)
Q Consensus 585 t~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y 642 (1021)
..++.+.+|.|..+.++. ..|..+|..|+..|||+.||.
T Consensus 116 ------------------~~~~~~~~v~v~~igig~-~~~~~~l~~iA~~t~G~~~~~ 154 (155)
T cd01466 116 ------------------VLRADNAPIPIHTFGLGA-SHDPALLAFIAEITGGTFSYV 154 (155)
T ss_pred ------------------hhcccCCCceEEEEecCC-CCCHHHHHHHHhccCceEEEe
Confidence 001224678888888764 468899999999999999874
No 19
>cd01465 vWA_subgroup VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if n
Probab=98.50 E-value=3.5e-06 Score=85.85 Aligned_cols=155 Identities=17% Similarity=0.240 Sum_probs=110.6
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCccc
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDLL 509 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~lL 509 (1021)
++||||+|.++-... ++.+++++...+..+..+ .+|++|+|++..+.+- +. . .
T Consensus 3 ~~~vlD~S~SM~~~~-~~~~k~a~~~~~~~l~~~--~~v~li~f~~~~~~~~--------------~~----~--~---- 55 (170)
T cd01465 3 LVFVIDRSGSMDGPK-LPLVKSALKLLVDQLRPD--DRLAIVTYDGAAETVL--------------PA----T--P---- 55 (170)
T ss_pred EEEEEECCCCCCChh-HHHHHHHHHHHHHhCCCC--CEEEEEEecCCccEEe--------------cC----c--c----
Confidence 789999999885433 778888999999988754 6899999997644320 00 0 0
Q ss_pred eehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---C--CEEEEEecCCCCCCcccccccCCcCcccC
Q 001711 510 VNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---G--GKLLIFQNSLPSLGVGCLKLRGDDLRVYG 584 (1021)
Q Consensus 510 v~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---G--GkIivF~sg~Pt~GpG~L~~r~~~~r~~g 584 (1021)
...++.+...|+.+.. ...+.++.||+.|+..++.. + .+|++|+.|.++.|...
T Consensus 56 ---~~~~~~l~~~l~~~~~------~g~T~~~~al~~a~~~~~~~~~~~~~~~ivl~TDG~~~~~~~~------------ 114 (170)
T cd01465 56 ---VRDKAAILAAIDRLTA------GGSTAGGAGIQLGYQEAQKHFVPGGVNRILLATDGDFNVGETD------------ 114 (170)
T ss_pred ---cchHHHHHHHHHcCCC------CCCCCHHHHHHHHHHHHHhhcCCCCeeEEEEEeCCCCCCCCCC------------
Confidence 0123445555665541 34567899999999988652 2 57999999988765311
Q ss_pred CCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCC
Q 001711 585 TDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPS 644 (1021)
Q Consensus 585 t~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~ 644 (1021)
.+-+++....+.+.+|.|+++.++ ...|...|..+++.++|..++-++
T Consensus 115 -----------~~~~~~~~~~~~~~~v~i~~i~~g-~~~~~~~l~~ia~~~~g~~~~~~~ 162 (170)
T cd01465 115 -----------PDELARLVAQKRESGITLSTLGFG-DNYNEDLMEAIADAGNGNTAYIDN 162 (170)
T ss_pred -----------HHHHHHHHHHhhcCCeEEEEEEeC-CCcCHHHHHHHHhcCCceEEEeCC
Confidence 122345555667889999999998 678999999999999999887654
No 20
>cd01463 vWA_VGCC_like VWA Voltage gated Calcium channel like: Voltage-gated calcium channels are a complex of five proteins: alpha 1, beta 1, gamma, alpha 2 and delta. The alpha 2 and delta subunits result from proteolytic processing of a single gene product and carries at its N-terminus the VWA and cache domains, The alpha 2 delta gene family has orthologues in D. melanogaster and C. elegans but none have been detected in aither A. thaliana or yeast. The exact biochemical function of the VWA domain is not known but the alpha 2 delta complex has been shown to regulate various functional properties of the channel complex.
Probab=98.49 E-value=5e-06 Score=87.17 Aligned_cols=163 Identities=21% Similarity=0.250 Sum_probs=107.0
Q ss_pred CCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEe-cCCCCCCcceeeccccccccCCC
Q 001711 426 MPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYN-MKSSLTQPQMMVISDLDDIFVPL 504 (1021)
Q Consensus 426 ~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fyn-l~~~~~~p~mlVvsDldd~f~Pl 504 (1021)
.|-..+||||+|.++-.+ -++.++++++..|+.|+++ .+||||+|++.++.+- +..
T Consensus 12 ~p~~vv~llD~SgSM~~~-~l~~ak~~~~~ll~~l~~~--d~v~lv~F~~~~~~~~~~~~-------------------- 68 (190)
T cd01463 12 SPKDIVILLDVSGSMTGQ-RLHLAKQTVSSILDTLSDN--DFFNIITFSNEVNPVVPCFN-------------------- 68 (190)
T ss_pred CCceEEEEEECCCCCCcH-HHHHHHHHHHHHHHhCCCC--CEEEEEEeCCCeeEEeeecc--------------------
Confidence 456789999999988543 4678899999999999765 7899999999877431 100
Q ss_pred CCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh---c---------CCEEEEEecCCCCCCccc
Q 001711 505 PDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR---L---------GGKLLIFQNSLPSLGVGC 572 (1021)
Q Consensus 505 ~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~---~---------GGkIivF~sg~Pt~GpG~ 572 (1021)
..++....+.++.+...|+.|.. ...+.++.||+.|+..|+. . ...|++++.|.++.+.
T Consensus 69 -~~~~~~~~~~~~~~~~~l~~l~~------~G~T~~~~al~~a~~~l~~~~~~~~~~~~~~~~~~iillTDG~~~~~~-- 139 (190)
T cd01463 69 -DTLVQATTSNKKVLKEALDMLEA------KGIANYTKALEFAFSLLLKNLQSNHSGSRSQCNQAIMLITDGVPENYK-- 139 (190)
T ss_pred -cceEecCHHHHHHHHHHHhhCCC------CCcchHHHHHHHHHHHHHHhhhcccccccCCceeEEEEEeCCCCCcHh--
Confidence 11111122445556666666652 3357899999999998875 1 1358888888765311
Q ss_pred ccccCCcCcccCCCccccCCCCCcHHHHHHHH-HHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCC
Q 001711 573 LKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAA-DLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPS 644 (1021)
Q Consensus 573 L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~-~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~ 644 (1021)
..+.++.. ...+.+|.|..|.++.+..|...|..|+..+||..++.++
T Consensus 140 ------------------------~~~~~~~~~~~~~~~v~i~tigiG~~~~d~~~L~~lA~~~~G~~~~i~~ 188 (190)
T cd01463 140 ------------------------EIFDKYNWDKNSEIPVRVFTYLIGREVTDRREIQWMACENKGYYSHIQS 188 (190)
T ss_pred ------------------------HHHHHhcccccCCCcEEEEEEecCCccccchHHHHHHhhcCCeEEEccc
Confidence 01111110 1112245566666666656889999999999999998764
No 21
>cd01451 vWA_Magnesium_chelatase Magnesium chelatase: Mg-chelatase catalyses the insertion of Mg into protoporphyrin IX (Proto). In chlorophyll biosynthesis, insertion of Mg2+ into protoporphyrin IX is catalysed by magnesium chelatase in an ATP-dependent reaction. Magnesium chelatase is a three sub-unit (BchI, BchD and BchH) enzyme with a novel arrangement of domains: the C-terminal helical domain is located behind the nucleotide binding site. The BchD domain contains a AAA domain at its N-terminus and a VWA domain at its C-terminus. The VWA domain has been speculated to be involved in mediating protein-protein interactions.
Probab=98.48 E-value=4.1e-06 Score=86.96 Aligned_cols=160 Identities=19% Similarity=0.246 Sum_probs=109.6
Q ss_pred eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCe-EEEEecCCCCCCcceeeccccccccCCCCCc
Q 001711 429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDST-IHFYNMKSSLTQPQMMVISDLDDIFVPLPDD 507 (1021)
Q Consensus 429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~-V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~ 507 (1021)
.++||||+|.++-...-++.+++++...+..+.. ++.+||||+|++. .++. +|
T Consensus 2 ~v~lvlD~SgSM~~~~rl~~ak~a~~~~~~~~~~-~~d~v~lv~F~~~~~~~~---------------------~~---- 55 (178)
T cd01451 2 LVIFVVDASGSMAARHRMAAAKGAVLSLLRDAYQ-RRDKVALIAFRGTEAEVL---------------------LP---- 55 (178)
T ss_pred eEEEEEECCccCCCccHHHHHHHHHHHHHHHhhc-CCCEEEEEEECCCCceEE---------------------eC----
Confidence 3689999999885432577788888887765322 2378999999864 2211 01
Q ss_pred cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHH-h---cC--CEEEEEecCCCCCCcccccccCCcCc
Q 001711 508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMS-R---LG--GKLLIFQNSLPSLGVGCLKLRGDDLR 581 (1021)
Q Consensus 508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~-~---~G--GkIivF~sg~Pt~GpG~L~~r~~~~r 581 (1021)
+...++.+...|+.++. ...+.++.||..|...++ . .+ ..|++++.|.++.|...
T Consensus 56 ----~t~~~~~~~~~l~~l~~------~G~T~l~~aL~~a~~~l~~~~~~~~~~~~ivliTDG~~~~g~~~--------- 116 (178)
T cd01451 56 ----PTRSVELAKRRLARLPT------GGGTPLAAGLLAAYELAAEQARDPGQRPLIVVITDGRANVGPDP--------- 116 (178)
T ss_pred ----CCCCHHHHHHHHHhCCC------CCCCcHHHHHHHHHHHHHHHhcCCCCceEEEEECCCCCCCCCCc---------
Confidence 11223344556666642 456789999999999982 1 12 46888888877765210
Q ss_pred ccCCCccccCCCCCcHHH-HHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCC
Q 001711 582 VYGTDKEHSLRIPEDPFY-KQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQS 647 (1021)
Q Consensus 582 ~~gt~~e~~l~~pa~~fY-~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~ 647 (1021)
...- .+++.++.+.+|.|.++.+...+.|-..|..|++.|||+.|+.++.+.
T Consensus 117 --------------~~~~~~~~~~~l~~~gi~v~~I~~~~~~~~~~~l~~iA~~tgG~~~~~~d~~~ 169 (178)
T cd01451 117 --------------TADRALAAARKLRARGISALVIDTEGRPVRRGLAKDLARALGGQYVRLPDLSA 169 (178)
T ss_pred --------------hhHHHHHHHHHHHhcCCcEEEEeCCCCccCccHHHHHHHHcCCeEEEcCcCCH
Confidence 0111 567788889999887776666667888899999999999999887543
No 22
>cd01456 vWA_ywmD_type VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=98.46 E-value=3e-06 Score=90.03 Aligned_cols=174 Identities=22% Similarity=0.228 Sum_probs=111.3
Q ss_pred CCCCCCeEEEEEecchhHHh-----hcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeecccc
Q 001711 423 RPPMPPLYFFLIDVSISAIR-----SGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDL 497 (1021)
Q Consensus 423 r~p~pp~yvFvIDvS~~av~-----sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDl 497 (1021)
....+..++||||+|.++.. ...++.+++++...|+.++++ .+|||++|++.++-. .. .. .+++
T Consensus 16 ~~~~~~~vv~vlD~SgSM~~~~~~~~~rl~~ak~a~~~~l~~l~~~--~~v~lv~F~~~~~~~---~~---~~-~~~p-- 84 (206)
T cd01456 16 EPQLPPNVAIVLDNSGSMREVDGGGETRLDNAKAALDETANALPDG--TRLGLWTFSGDGDNP---LD---VR-VLVP-- 84 (206)
T ss_pred ccCCCCcEEEEEeCCCCCcCCCCCcchHHHHHHHHHHHHHHhCCCC--ceEEEEEecCCCCCC---cc---cc-cccc--
Confidence 34567789999999999862 135888999999999998755 789999999854210 00 00 0000
Q ss_pred ccccCCCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC-CEEEEEecCCCCCCccccccc
Q 001711 498 DDIFVPLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG-GKLLIFQNSLPSLGVGCLKLR 576 (1021)
Q Consensus 498 dd~f~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G-GkIivF~sg~Pt~GpG~L~~r 576 (1021)
..+.....--.....++.+.+.|+.|.. ...++.++.||+.|...++... ..||+++.|..+.|...+
T Consensus 85 ---~~~~~~~~~~~~~~~~~~l~~~i~~i~~-----~~G~T~l~~aL~~a~~~l~~~~~~~iillTDG~~~~~~~~~--- 153 (206)
T cd01456 85 ---KGCLTAPVNGFPSAQRSALDAALNSLQT-----PTGWTPLAAALAEAAAYVDPGRVNVVVLITDGEDTCGPDPC--- 153 (206)
T ss_pred ---ccccccccCCCCcccHHHHHHHHHhhcC-----CCCcChHHHHHHHHHHHhCCCCcceEEEEcCCCccCCCCHH---
Confidence 0011000000001356677777888751 2456889999999999996222 578888888766542000
Q ss_pred CCcCcccCCCccccCCCCCcHHHHHHHHHH-hhCCcEEEEEEecCCCcChhhhhhhccccccEE
Q 001711 577 GDDLRVYGTDKEHSLRIPEDPFYKQMAADL-TKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQV 639 (1021)
Q Consensus 577 ~~~~r~~gt~~e~~l~~pa~~fY~~La~~~-~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v 639 (1021)
+..++++.+. .+.+|.|+++.++.+ .|...|..|++.|||..
T Consensus 154 --------------------~~~~~~~~~~~~~~~i~i~~igiG~~-~~~~~l~~iA~~tgG~~ 196 (206)
T cd01456 154 --------------------EVARELAKRRTPAPPIKVNVIDFGGD-ADRAELEAIAEATGGTY 196 (206)
T ss_pred --------------------HHHHHHHHhcCCCCCceEEEEEecCc-ccHHHHHHHHHhcCCeE
Confidence 1112222211 225899999999865 67889999999999988
No 23
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=98.43 E-value=2.5e-05 Score=97.83 Aligned_cols=167 Identities=19% Similarity=0.260 Sum_probs=109.4
Q ss_pred CeEEEEEecchhHHhhcHHHHHHHHHHHHHhc-CCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCC
Q 001711 428 PLYFFLIDVSISAIRSGMLEVVAQTIKSCLDE-LPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPD 506 (1021)
Q Consensus 428 p~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~-Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~ 506 (1021)
...+||||+|.++-....++.+.++++..|.. ++.+ .+||||+||+..++.. + +.++.+
T Consensus 305 r~VVLVLDvSGSM~g~dRL~~lkqAA~~fL~~~l~~~--DrVGLVtFsssA~vl~--------------p----Lt~Its 364 (863)
T TIGR00868 305 RIVCLVLDKSGSMTVEDRLKRMNQAAKLFLLQTVEKG--SWVGMVTFDSAAYIKN--------------E----LIQITS 364 (863)
T ss_pred ceEEEEEECCccccccCHHHHHHHHHHHHHHHhCCCC--CEEEEEEECCceeEee--------------c----cccCCc
Confidence 56899999999985433577777777776654 4433 7999999998765421 0 111111
Q ss_pred ccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-----CCEEEEEecCCCCCCcccccccCCcCc
Q 001711 507 DLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-----GGKLLIFQNSLPSLGVGCLKLRGDDLR 581 (1021)
Q Consensus 507 ~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-----GGkIivF~sg~Pt~GpG~L~~r~~~~r 581 (1021)
...++.|...|... ...+++++.||+.|+++|+.. +..||+++.|..+.+
T Consensus 365 ------~~dr~aL~~~L~~~-------A~GGT~I~~GL~~Alq~L~~~~~~~~~~~IILLTDGedn~~------------ 419 (863)
T TIGR00868 365 ------SAERDALTANLPTA-------ASGGTSICSGLKAAFQVIKKSYQSTDGSEIVLLTDGEDNTI------------ 419 (863)
T ss_pred ------HHHHHHHHHhhccc-------cCCCCcHHHHHHHHHHHHHhcccccCCCEEEEEeCCCCCCH------------
Confidence 12344444333311 245789999999999999763 467777777643210
Q ss_pred ccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHh
Q 001711 582 VYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRD 661 (1021)
Q Consensus 582 ~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~ 661 (1021)
.+++.++.+.||.|..+.++.+. | ..|..||+.|||..|+..+ ..+...|...|.++
T Consensus 420 ------------------~~~l~~lk~~gVtI~TIg~G~da-d-~~L~~IA~~TGG~~f~asd---~~dl~~L~dAF~~i 476 (863)
T TIGR00868 420 ------------------SSCFEEVKQSGAIIHTIALGPSA-A-KELEELSDMTGGLRFYASD---QADNNGLIDAFGAL 476 (863)
T ss_pred ------------------HHHHHHHHHcCCEEEEEEeCCCh-H-HHHHHHHHhcCCEEEEeCC---HHHHHHHHHHHHHH
Confidence 23445567789999999998764 2 4589999999999998864 22334565555554
Q ss_pred c
Q 001711 662 L 662 (1021)
Q Consensus 662 l 662 (1021)
.
T Consensus 477 s 477 (863)
T TIGR00868 477 S 477 (863)
T ss_pred h
Confidence 3
No 24
>TIGR03788 marine_srt_targ marine proteobacterial sortase target protein. Members of this protein family are restricted to the Proteobacteria. Each contains a C-terminal sortase-recognition motif, transmembrane domain, and basic residues cluster at the the C-terminus, and is encoded adjacent to a sortase gene. This protein is frequently the only sortase target in its genome, which is as unusual its occurrence in Gram-negative rather than Gram-positive genomes. Many bacteria with this system are marine. In addition to the LPXTG signal, members carry a vault protein inter-alpha-trypsin inhibitor domain (pfam08487) and a von Willebrand factor type A domain (pfam00092).
Probab=98.34 E-value=0.00045 Score=85.22 Aligned_cols=284 Identities=13% Similarity=0.156 Sum_probs=161.6
Q ss_pred CCCCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCC
Q 001711 424 PPMPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVP 503 (1021)
Q Consensus 424 ~p~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~P 503 (1021)
...+..++||||+|.++-. .-++.+++++..+|+.|.++ .+|+||+||+.++.+.-.. . +
T Consensus 268 ~~~p~~vvfvlD~SgSM~g-~~i~~ak~al~~~l~~L~~~--d~~~ii~F~~~~~~~~~~~-------~----------~ 327 (596)
T TIGR03788 268 QVLPRELVFVIDTSGSMAG-ESIEQAKSALLLALDQLRPG--DRFNIIQFDSDVTLLFPVP-------V----------P 327 (596)
T ss_pred cCCCceEEEEEECCCCCCC-ccHHHHHHHHHHHHHhCCCC--CEEEEEEECCcceEecccc-------c----------c
Confidence 3556689999999998843 23677889999999999865 7899999999877542100 0 0
Q ss_pred CCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-C---CEEEEEecCCCCCCcccccccCCc
Q 001711 504 LPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-G---GKLLIFQNSLPSLGVGCLKLRGDD 579 (1021)
Q Consensus 504 l~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-G---GkIivF~sg~Pt~GpG~L~~r~~~ 579 (1021)
. -.+.++.+...|+.|.. ..++.+..||+.|+...... . -.|+++++|..+ +
T Consensus 328 ~-------~~~~~~~a~~~i~~l~a------~GgT~l~~aL~~a~~~~~~~~~~~~~~iillTDG~~~----------~- 383 (596)
T TIGR03788 328 A-------TAHNLARARQFVAGLQA------DGGTEMAGALSAALRDDGPESSGALRQVVFLTDGAVG----------N- 383 (596)
T ss_pred C-------CHHHHHHHHHHHhhCCC------CCCccHHHHHHHHHHhhcccCCCceeEEEEEeCCCCC----------C-
Confidence 0 02334445556666642 35678999999998775332 1 258888887421 0
Q ss_pred CcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHH
Q 001711 580 LRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELS 659 (1021)
Q Consensus 580 ~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~ 659 (1021)
+ ...++.+. ....++.|..|.++.+ .|-..|..|++.+||..++... .+...+++.+.+.
T Consensus 384 --------~-------~~~~~~~~--~~~~~~ri~tvGiG~~-~n~~lL~~lA~~g~G~~~~i~~--~~~~~~~~~~~l~ 443 (596)
T TIGR03788 384 --------E-------DALFQLIR--TKLGDSRLFTVGIGSA-PNSYFMRKAAQFGRGSFTFIGS--TDEVQRKMSQLFA 443 (596)
T ss_pred --------H-------HHHHHHHH--HhcCCceEEEEEeCCC-cCHHHHHHHHHcCCCEEEECCC--HHHHHHHHHHHHH
Confidence 0 11222332 1234567777776654 6778899999999998776543 2222334444444
Q ss_pred HhcccccccceEEEEEeCCCeEEEeeecCcccCCCCceeeccCCCCCcEEEEEEeccccCCCceeEEEEEEEEEecCCcE
Q 001711 660 RDLTRETAWEAVMRIRCGKGVRFTNYHGNFMLRSTDLLALPAVDCDKAYAMQLSLEETLLTTQTVYFQVALLYTASCGER 739 (1021)
Q Consensus 660 r~ltr~~g~~a~mrVR~S~Gl~V~~~~Gnf~~rs~~~~~l~~id~d~Sia~~~~~d~~l~~~~~~~iQ~AllYTt~~GeR 739 (1021)
+ +..+..-+..+++.... +..++- -.++.+-....+.|.-++... ...+ .+.....++.
T Consensus 444 ~-~~~p~l~~v~v~~~~~~---~~~v~P---------~~~p~L~~g~~l~v~g~~~~~---~~~i----~v~g~~~~~~- 502 (596)
T TIGR03788 444 K-LEQPALTDIALTFDNGN---AADVYP---------SPIPDLYRGEPLQIAIKLQQA---AGEL----QLTGRTGSQP- 502 (596)
T ss_pred h-hcCeEEEEEEEEEcCCc---cceecc---------CCCccccCCCEEEEEEEecCC---CCeE----EEEEEcCCce-
Confidence 4 55566666666664322 222221 234556666666666664321 1222 2222322222
Q ss_pred EEEEEeeeecccCCHHHHHHhcCHhHHHHHHHHHHHHHHhcCCH-HHHHHHHHHHHHHHHHHHHh
Q 001711 740 RIRVHTLAAPVVSNLSDMYQQADTGAIVSVFSRLAIEKTLSHKL-EDARNAVQLRLVKALKEYRN 803 (1021)
Q Consensus 740 rIRV~Tl~lpvt~~l~~vf~s~D~eai~~~laK~a~~~~l~~~l-~d~R~~l~~~lv~iL~~YRk 803 (1021)
.+..+.+... .+-..+-.+.||+-+..+..... ..-++.+.++++++-.+|+-
T Consensus 503 ----~~~~~~~~~~-------~~~~~l~~lwA~~~I~~L~~~~~~~~~~~~~~~~Ii~Lsl~y~l 556 (596)
T TIGR03788 503 ----WSQQLDLDSA-------APGKGIDKLWARRKIDSLEDSLRYGANEEKVKDQVTALALNHHL 556 (596)
T ss_pred ----EEEEEecCCC-------CCcchHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHhCC
Confidence 1222333321 13344677788877776653211 01124466677777777765
No 25
>cd01474 vWA_ATR ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.
Probab=98.32 E-value=2.3e-05 Score=81.85 Aligned_cols=167 Identities=16% Similarity=0.156 Sum_probs=97.9
Q ss_pred eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEE-EecCCCCCCcceeeccccccccCCCCCc
Q 001711 429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHF-YNMKSSLTQPQMMVISDLDDIFVPLPDD 507 (1021)
Q Consensus 429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~f-ynl~~~~~~p~mlVvsDldd~f~Pl~~~ 507 (1021)
-.+||||+|.++-. . .....+.++..++.+.. ++.+||||+|++..+. +.+.
T Consensus 6 Dvv~llD~SgSm~~-~-~~~~~~~~~~l~~~~~~-~~~rvglv~Fs~~~~~~~~l~------------------------ 58 (185)
T cd01474 6 DLYFVLDKSGSVAA-N-WIEIYDFVEQLVDRFNS-PGLRFSFITFSTRATKILPLT------------------------ 58 (185)
T ss_pred eEEEEEeCcCchhh-h-HHHHHHHHHHHHHHcCC-CCcEEEEEEecCCceEEEecc------------------------
Confidence 47999999998743 2 33344667777766532 4589999999876432 1111
Q ss_pred cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHH--hcCC----E-EEEEecCCCCCCcccccccCCcC
Q 001711 508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMS--RLGG----K-LLIFQNSLPSLGVGCLKLRGDDL 580 (1021)
Q Consensus 508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~--~~GG----k-IivF~sg~Pt~GpG~L~~r~~~~ 580 (1021)
+..+.+.+.|+.|..+.. ...+++|.||+.|...|. ..|| | |++++.|..+-..+
T Consensus 59 ------~~~~~~~~~l~~l~~~~~---~g~T~~~~aL~~a~~~l~~~~~~~r~~~~~villTDG~~~~~~~--------- 120 (185)
T cd01474 59 ------DDSSAIIKGLEVLKKVTP---SGQTYIHEGLENANEQIFNRNGGGRETVSVIIALTDGQLLLNGH--------- 120 (185)
T ss_pred ------ccHHHHHHHHHHHhccCC---CCCCcHHHHHHHHHHHHHhhccCCCCCCeEEEEEcCCCcCCCCC---------
Confidence 111123344444443322 357899999999998773 3444 2 67777765431000
Q ss_pred cccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEE-EeCCCCCchhHHHHHHHHH
Q 001711 581 RVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVY-YYPSFQSTTHGERLRHELS 659 (1021)
Q Consensus 581 r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~-~y~~F~~~~d~~kl~~dL~ 659 (1021)
..-...+.++.+.||.|..+.+ ...|..+|..++..++ .+| ...+|+. -..+.++|.
T Consensus 121 ----------------~~~~~~a~~l~~~gv~i~~vgv--~~~~~~~L~~iA~~~~-~~f~~~~~~~~---l~~~~~~~~ 178 (185)
T cd01474 121 ----------------KYPEHEAKLSRKLGAIVYCVGV--TDFLKSQLINIADSKE-YVFPVTSGFQA---LSGIIESVV 178 (185)
T ss_pred ----------------cchHHHHHHHHHcCCEEEEEee--chhhHHHHHHHhCCCC-eeEecCccHHH---HHHHHHHHH
Confidence 0002335567778886666555 5678899999998774 455 3334432 234455555
Q ss_pred Hhc
Q 001711 660 RDL 662 (1021)
Q Consensus 660 r~l 662 (1021)
+.+
T Consensus 179 ~~~ 181 (185)
T cd01474 179 KKA 181 (185)
T ss_pred Hhh
Confidence 444
No 26
>PF13519 VWA_2: von Willebrand factor type A domain; PDB: 3IBS_B 3RAG_B 2X5N_A.
Probab=98.28 E-value=1e-05 Score=81.70 Aligned_cols=151 Identities=17% Similarity=0.235 Sum_probs=101.0
Q ss_pred EEEEEecchhHHhhc----HHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCC
Q 001711 430 YFFLIDVSISAIRSG----MLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLP 505 (1021)
Q Consensus 430 yvFvIDvS~~av~sG----~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~ 505 (1021)
+|||||+|.++-..+ .++.+++++...++.+++ .+|+|++|++..+.
T Consensus 2 vv~v~D~SgSM~~~~~~~~~~~~~~~~~~~~~~~~~~---~~v~l~~f~~~~~~-------------------------- 52 (172)
T PF13519_consen 2 VVFVLDNSGSMNGYDGNRTRIDQAKDALNELLANLPG---DRVGLVSFSDSSRT-------------------------- 52 (172)
T ss_dssp EEEEEE-SGGGGTTTSSS-HHHHHHHHHHHHHHHHTT---SEEEEEEESTSCEE--------------------------
T ss_pred EEEEEECCcccCCCCCCCcHHHHHHHHHHHHHHHCCC---CEEEEEEecccccc--------------------------
Confidence 589999999986542 578889999999988763 48999999875311
Q ss_pred CccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC---CEEEEEecCCCCCCcccccccCCcCcc
Q 001711 506 DDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG---GKLLIFQNSLPSLGVGCLKLRGDDLRV 582 (1021)
Q Consensus 506 ~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G---GkIivF~sg~Pt~GpG~L~~r~~~~r~ 582 (1021)
+.++...++.+.+.|+.+.... ......+++.||..|.+++.... ..|++|+.|.++
T Consensus 53 ---~~~~t~~~~~~~~~l~~~~~~~--~~~~~t~~~~al~~a~~~~~~~~~~~~~iv~iTDG~~~--------------- 112 (172)
T PF13519_consen 53 ---LSPLTSDKDELKNALNKLSPQG--MPGGGTNLYDALQEAAKMLASSDNRRRAIVLITDGEDN--------------- 112 (172)
T ss_dssp ---EEEEESSHHHHHHHHHTHHHHG----SSS--HHHHHHHHHHHHHC-SSEEEEEEEEES-TTH---------------
T ss_pred ---cccccccHHHHHHHhhcccccc--cCccCCcHHHHHHHHHHHHHhCCCCceEEEEecCCCCC---------------
Confidence 0112234555566666664321 12455789999999999998653 355555554222
Q ss_pred cCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeC
Q 001711 583 YGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYP 643 (1021)
Q Consensus 583 ~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~ 643 (1021)
.-..+.+..+.+.+|.|.++.+..+...-..|..|++.|||..+...
T Consensus 113 --------------~~~~~~~~~~~~~~i~i~~v~~~~~~~~~~~l~~la~~tgG~~~~~~ 159 (172)
T PF13519_consen 113 --------------SSDIEAAKALKQQGITIYTVGIGSDSDANEFLQRLAEATGGRYFHVD 159 (172)
T ss_dssp --------------CHHHHHHHHHHCTTEEEEEEEES-TT-EHHHHHHHHHHTEEEEEEE-
T ss_pred --------------cchhHHHHHHHHcCCeEEEEEECCCccHHHHHHHHHHhcCCEEEEec
Confidence 00113667788999999999998887766789999999999988873
No 27
>cd01472 vWA_collagen von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.
Probab=98.26 E-value=2.8e-05 Score=79.37 Aligned_cols=151 Identities=18% Similarity=0.146 Sum_probs=96.5
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCC-CCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCcc
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGF-PRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDL 508 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~-~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~l 508 (1021)
.+||||+|.++-.. -++.++++++..+..|... .+.+||||+|++..+..- .+..
T Consensus 3 vv~vlD~SgSm~~~-~~~~~k~~~~~~~~~l~~~~~~~~~giv~Fs~~~~~~~--------------~~~~--------- 58 (164)
T cd01472 3 IVFLVDGSESIGLS-NFNLVKDFVKRVVERLDIGPDGVRVGVVQYSDDPRTEF--------------YLNT--------- 58 (164)
T ss_pred EEEEEeCCCCCCHH-HHHHHHHHHHHHHhhcccCCCCeEEEEEEEcCceeEEE--------------ecCC---------
Confidence 58999999987543 4677888888888877532 347999999998765421 0000
Q ss_pred ceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc--------CCEEEEEecCCCCCCcccccccCCcC
Q 001711 509 LVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL--------GGKLLIFQNSLPSLGVGCLKLRGDDL 580 (1021)
Q Consensus 509 Lv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~--------GGkIivF~sg~Pt~GpG~L~~r~~~~ 580 (1021)
...++.+.+.|+.|... ...+.+|.||..|.+.|... ...|++++.|.++.+
T Consensus 59 ----~~~~~~~~~~l~~l~~~-----~g~T~~~~al~~a~~~l~~~~~~~~~~~~~~iiliTDG~~~~~----------- 118 (164)
T cd01472 59 ----YRSKDDVLEAVKNLRYI-----GGGTNTGKALKYVRENLFTEASGSREGVPKVLVVITDGKSQDD----------- 118 (164)
T ss_pred ----CCCHHHHHHHHHhCcCC-----CCCchHHHHHHHHHHHhCCcccCCCCCCCEEEEEEcCCCCCch-----------
Confidence 02244556667777642 34578999999999988641 123556655532110
Q ss_pred cccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhcccccc-EEEEeCC
Q 001711 581 RVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGG-QVYYYPS 644 (1021)
Q Consensus 581 r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG-~v~~y~~ 644 (1021)
. ...+.++.+.||.|..+.++. .|...|..++..++| .+|.+..
T Consensus 119 -----------------~-~~~~~~l~~~gv~i~~ig~g~--~~~~~L~~ia~~~~~~~~~~~~~ 163 (164)
T cd01472 119 -----------------V-EEPAVELKQAGIEVFAVGVKN--ADEEELKQIASDPKELYVFNVAD 163 (164)
T ss_pred -----------------H-HHHHHHHHHCCCEEEEEECCc--CCHHHHHHHHCCCchheEEeccC
Confidence 0 123344556777655554444 499999999999987 5665544
No 28
>TIGR03436 acidobact_VWFA VWFA-related Acidobacterial domain. Members of this family are bacterial domains that include a region related to the von Willebrand factor type A (VWFA) domain (pfam00092). These domains are restricted to, and have undergone a large paralogous family expansion in, the Acidobacteria, including Solibacter usitatus and Acidobacterium capsulatum ATCC 51196.
Probab=98.22 E-value=7.4e-05 Score=83.85 Aligned_cols=158 Identities=17% Similarity=0.231 Sum_probs=101.8
Q ss_pred CCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhc-CCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCC
Q 001711 426 MPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDE-LPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPL 504 (1021)
Q Consensus 426 ~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~-Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl 504 (1021)
.|...+||||+|.++.. .+..++++++..|+. +.. +.+|+||+|++.+++.. +
T Consensus 52 ~p~~vvlvlD~SgSM~~--~~~~a~~a~~~~l~~~l~~--~d~v~lv~f~~~~~~~~--------------~-------- 105 (296)
T TIGR03436 52 LPLTVGLVIDTSGSMRN--DLDRARAAAIRFLKTVLRP--NDRVFVVTFNTRLRLLQ--------------D-------- 105 (296)
T ss_pred CCceEEEEEECCCCchH--HHHHHHHHHHHHHHhhCCC--CCEEEEEEeCCceeEee--------------c--------
Confidence 47789999999998753 467788888888877 543 47999999998765421 1
Q ss_pred CCccceehhhhHHHHHHHHhhCCCccc---------CCCCcccchHHHHHHH-HHHHHhc-----CCE-EEEEecCCCCC
Q 001711 505 PDDLLVNLSESRSVVDTLLDSLPSMFQ---------DNMNVESAFGPALKAA-FMVMSRL-----GGK-LLIFQNSLPSL 568 (1021)
Q Consensus 505 ~~~lLv~l~es~~~I~~lLd~Lp~~f~---------~~~~~~~alG~AL~aA-~~lL~~~-----GGk-IivF~sg~Pt~ 568 (1021)
....++.|...|+.|..... .....++++..||..| ..++... |-| ||+|+.|..+
T Consensus 106 -------~t~~~~~l~~~l~~l~~~~~~~~~~~~~~~~~~g~T~l~~al~~aa~~~~~~~~~~~p~rk~iIllTDG~~~- 177 (296)
T TIGR03436 106 -------FTSDPRLLEAALNRLKPPLRTDYNSSGAFVRDGGGTALYDAITLAALEQLANALAGIPGRKALIVISDGGDN- 177 (296)
T ss_pred -------CCCCHHHHHHHHHhccCCCccccccccccccCCCcchhHHHHHHHHHHHHHHhhcCCCCCeEEEEEecCCCc-
Confidence 01224556666666643110 0124567788887544 4555442 334 4555544211
Q ss_pred CcccccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCC------------cChhhhhhhccccc
Q 001711 569 GVGCLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKY------------TDIASLGTLAKYTG 636 (1021)
Q Consensus 569 GpG~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~------------~diatl~~L~~~TG 636 (1021)
....-++++...|.+.+|.|..+.++... .+-..|..||+.||
T Consensus 178 -------------------------~~~~~~~~~~~~~~~~~v~vy~I~~~~~~~~~~~~~~~~~~~~~~~L~~iA~~TG 232 (296)
T TIGR03436 178 -------------------------RSRDTLERAIDAAQRADVAIYSIDARGLRAPDLGAGAKAGLGGPEALERLAEETG 232 (296)
T ss_pred -------------------------chHHHHHHHHHHHHHcCCEEEEeccCccccCCcccccccCCCcHHHHHHHHHHhC
Confidence 01234577888888999998888775321 24568999999999
Q ss_pred cEEEEe
Q 001711 637 GQVYYY 642 (1021)
Q Consensus 637 G~v~~y 642 (1021)
|+.|+-
T Consensus 233 G~~~~~ 238 (296)
T TIGR03436 233 GRAFYV 238 (296)
T ss_pred CeEecc
Confidence 997654
No 29
>cd01470 vWA_complement_factors Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.
Probab=98.17 E-value=4.4e-05 Score=80.51 Aligned_cols=167 Identities=14% Similarity=0.181 Sum_probs=101.6
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCC-CCceEEEEEEcCeEEE-EecCCCCCCcceeeccccccccCCCCCc
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGF-PRTQIGFITFDSTIHF-YNMKSSLTQPQMMVISDLDDIFVPLPDD 507 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~-~rt~VgiITFds~V~f-ynl~~~~~~p~mlVvsDldd~f~Pl~~~ 507 (1021)
++||||+|.++-.+ -++.++++|+..++.|... .+.+||||+|++..+. +.+...
T Consensus 3 i~~vlD~SgSM~~~-~~~~~k~~~~~l~~~l~~~~~~~~v~li~Fs~~~~~~~~~~~~---------------------- 59 (198)
T cd01470 3 IYIALDASDSIGEE-DFDEAKNAIKTLIEKISSYEVSPRYEIISYASDPKEIVSIRDF---------------------- 59 (198)
T ss_pred EEEEEECCCCccHH-HHHHHHHHHHHHHHHccccCCCceEEEEEecCCceEEEecccC----------------------
Confidence 68999999987544 3678899999999988642 3579999999987653 222110
Q ss_pred cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---------CC--EEEEEecCCCCCCccccccc
Q 001711 508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---------GG--KLLIFQNSLPSLGVGCLKLR 576 (1021)
Q Consensus 508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---------GG--kIivF~sg~Pt~GpG~L~~r 576 (1021)
....++.+...|+.+..... ....++.++.||+.+...|... ++ .|++|+.|.+|.|.....
T Consensus 60 ----~~~~~~~~~~~l~~~~~~~~-~~~ggT~~~~Al~~~~~~l~~~~~~~~~~~~~~~~~iillTDG~~~~g~~~~~-- 132 (198)
T cd01470 60 ----NSNDADDVIKRLEDFNYDDH-GDKTGTNTAAALKKVYERMALEKVRNKEAFNETRHVIILFTDGKSNMGGSPLP-- 132 (198)
T ss_pred ----CCCCHHHHHHHHHhCCcccc-cCccchhHHHHHHHHHHHHHHHHhcCccchhhcceEEEEEcCCCcCCCCChhH--
Confidence 01123344555666643211 1234678999999988776311 12 378899998886521100
Q ss_pred CCcCcccCCCccccCCCCCcHHHHHHHH------HHhhCCcEEEEEEecCCCcChhhhhhhcccccc--EEEEeCCC
Q 001711 577 GDDLRVYGTDKEHSLRIPEDPFYKQMAA------DLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGG--QVYYYPSF 645 (1021)
Q Consensus 577 ~~~~r~~gt~~e~~l~~pa~~fY~~La~------~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG--~v~~y~~F 645 (1021)
..+.++++.. .+.+.+|+|..+.++. ..|..+|..|+..||| ++|+..+|
T Consensus 133 ------------------~~~~~~~~~~~~~~~~~~~~~~v~i~~iGvG~-~~~~~~L~~iA~~~~g~~~~f~~~~~ 190 (198)
T cd01470 133 ------------------TVDKIKNLVYKNNKSDNPREDYLDVYVFGVGD-DVNKEELNDLASKKDNERHFFKLKDY 190 (198)
T ss_pred ------------------HHHHHHHHHhcccccccchhcceeEEEEecCc-ccCHHHHHHHhcCCCCCceEEEeCCH
Confidence 0111222211 1234456665555543 4789999999999999 46665544
No 30
>cd01461 vWA_interalpha_trypsin_inhibitor vWA_interalpha trypsin inhibitor (ITI): ITI is a glycoprotein composed of three polypeptides- two heavy chains and one light chain (bikunin). Bikunin confers the protease-inhibitor function while the heavy chains are involved in rendering stability to the extracellular matrix by binding to hyaluronic acid. The heavy chains carry the VWA domain with a conserved MIDAS motif. Although the exact role of the VWA domains remains unknown, it has been speculated to be involved in mediating protein-protein interactions with the components of the extracellular matrix.
Probab=98.16 E-value=0.00012 Score=74.65 Aligned_cols=157 Identities=17% Similarity=0.204 Sum_probs=102.1
Q ss_pred CCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCC
Q 001711 427 PPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPD 506 (1021)
Q Consensus 427 pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~ 506 (1021)
|.-++||+|+|.++.. .-++.+.++|...+..++.+ .+|+|++|++.++.+- .. +.+ .
T Consensus 2 ~~~v~~vlD~S~SM~~-~~~~~~~~al~~~l~~l~~~--~~~~l~~Fs~~~~~~~-~~----------------~~~--~ 59 (171)
T cd01461 2 PKEVVFVIDTSGSMSG-TKIEQTKEALLTALKDLPPG--DYFNIIGFSDTVEEFS-PS----------------SVS--A 59 (171)
T ss_pred CceEEEEEECCCCCCC-hhHHHHHHHHHHHHHhCCCC--CEEEEEEeCCCceeec-Cc----------------cee--C
Confidence 4568999999999843 23778888999999988755 6899999998765431 00 000 0
Q ss_pred ccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh---cCCEEEEEecCCCCCCcccccccCCcCccc
Q 001711 507 DLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR---LGGKLLIFQNSLPSLGVGCLKLRGDDLRVY 583 (1021)
Q Consensus 507 ~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~---~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~ 583 (1021)
+ .+.++.+.+.|+.+.. ...+.+..||..|...++. ....|++|+.|..+ +
T Consensus 60 ----~-~~~~~~~~~~l~~~~~------~g~T~l~~al~~a~~~l~~~~~~~~~iillTDG~~~----------~----- 113 (171)
T cd01461 60 ----T-AENVAAAIEYVNRLQA------LGGTNMNDALEAALELLNSSPGSVPQIILLTDGEVT----------N----- 113 (171)
T ss_pred ----C-HHHHHHHHHHHHhcCC------CCCcCHHHHHHHHHHhhccCCCCccEEEEEeCCCCC----------C-----
Confidence 0 1223333445555432 4457799999999998874 23456666665411 0
Q ss_pred CCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCC
Q 001711 584 GTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPS 644 (1021)
Q Consensus 584 gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~ 644 (1021)
.... .+.+.++.+.+|.|..+.++. ..|-..|..+++.|||..++..+
T Consensus 114 -----------~~~~-~~~~~~~~~~~i~i~~i~~g~-~~~~~~l~~ia~~~gG~~~~~~~ 161 (171)
T cd01461 114 -----------ESQI-LKNVREALSGRIRLFTFGIGS-DVNTYLLERLAREGRGIARRIYE 161 (171)
T ss_pred -----------HHHH-HHHHHHhcCCCceEEEEEeCC-ccCHHHHHHHHHcCCCeEEEecC
Confidence 0122 234445555678777777764 35678899999999999998875
No 31
>cd01452 VWA_26S_proteasome_subunit 26S proteasome plays a major role in eukaryotic protein breakdown, especially for ubiquitin-tagged proteins. It is an ATP-dependent protease responsible for the bulk of non-lysosomal proteolysis in eukaryotes, often using covalent modification of proteins by ubiquitylation. It consists of a 20S proteolytic core particle (CP) and a 19S regulatory particle (RP). The CP is an ATP independent peptidase consisting of hydrolyzing activities. One or both ends of CP carry the RP that confers both ubiquitin and ATP dependence to the 26S proteosome. The RP's proposed functions include recognition of substrates and translocation of these to CP for proteolysis. The RP can dissociate into a stable lid and base subcomplexes. The base is composed of three non-ATPase subunits (Rpn 1, 2 and 10). A single residue in the vWA domain of Rpn10 has been implicated to be responsible for stabilizing the lid-base association.
Probab=98.08 E-value=8e-05 Score=78.21 Aligned_cols=142 Identities=15% Similarity=0.217 Sum_probs=95.8
Q ss_pred eEEEEEecchhHHhh----cHHHHHHHHHHHHH----hcCCCCCCceEEEEEEcC-eEEEEecCCCCCCcceeecccccc
Q 001711 429 LYFFLIDVSISAIRS----GMLEVVAQTIKSCL----DELPGFPRTQIGFITFDS-TIHFYNMKSSLTQPQMMVISDLDD 499 (1021)
Q Consensus 429 ~yvFvIDvS~~av~s----G~l~~~~~sI~~~L----~~Lp~~~rt~VgiITFds-~V~fynl~~~~~~p~mlVvsDldd 499 (1021)
+.+++||+|..+.+. ..+++.++.+...+ +..+ ..+||||+|.. .-++
T Consensus 5 a~vi~lD~S~sM~a~D~~PnRL~aak~~i~~~~~~f~~~np---~~~vGlv~fag~~a~v-------------------- 61 (187)
T cd01452 5 ATMICIDNSEYMRNGDYPPTRFQAQADAVNLICQAKTRSNP---ENNVGLMTMAGNSPEV-------------------- 61 (187)
T ss_pred EEEEEEECCHHHHcCCCCCCHHHHHHHHHHHHHHHHHhcCC---CccEEEEEecCCceEE--------------------
Confidence 568999999987432 35778888877664 4444 36899999975 2221
Q ss_pred ccCCCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-----CCEEEEEecCCCCCCccccc
Q 001711 500 IFVPLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-----GGKLLIFQNSLPSLGVGCLK 574 (1021)
Q Consensus 500 ~f~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-----GGkIivF~sg~Pt~GpG~L~ 574 (1021)
+++++.....+...|+.+.. ..+..+|.||+.|..+|++. ..||++|.+++-+.
T Consensus 62 ---------~~plT~D~~~~~~~L~~i~~------~g~~~l~~AL~~A~~~L~~~~~~~~~~rivi~v~S~~~~------ 120 (187)
T cd01452 62 ---------LVTLTNDQGKILSKLHDVQP------KGKANFITGIQIAQLALKHRQNKNQKQRIVAFVGSPIEE------ 120 (187)
T ss_pred ---------EECCCCCHHHHHHHHHhCCC------CCcchHHHHHHHHHHHHhcCCCcCCcceEEEEEecCCcC------
Confidence 22333446667777777641 25567999999999999752 24889998865221
Q ss_pred ccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccc
Q 001711 575 LRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKY 634 (1021)
Q Consensus 575 ~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~ 634 (1021)
.+ +-..++++++.++||.||+..++...-+..-|..+.+.
T Consensus 121 ------------d~--------~~i~~~~~~lkk~~I~v~vI~~G~~~~~~~~l~~~~~~ 160 (187)
T cd01452 121 ------------DE--------KDLVKLAKRLKKNNVSVDIINFGEIDDNTEKLTAFIDA 160 (187)
T ss_pred ------------CH--------HHHHHHHHHHHHcCCeEEEEEeCCCCCCHHHHHHHHHH
Confidence 11 11347899999999999999998664444444444433
No 32
>cd01480 vWA_collagen_alpha_1-VI-type VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=98.01 E-value=0.00011 Score=76.93 Aligned_cols=157 Identities=14% Similarity=0.131 Sum_probs=100.8
Q ss_pred eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-------CCCceEEEEEEcCeEEEE-ecCCCCCCcceeeccccccc
Q 001711 429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-------FPRTQIGFITFDSTIHFY-NMKSSLTQPQMMVISDLDDI 500 (1021)
Q Consensus 429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-------~~rt~VgiITFds~V~fy-nl~~~~~~p~mlVvsDldd~ 500 (1021)
-.+||||.|.+.-.+. ++.+++.++..++.|.. ....+||+|+|++..++. .+.
T Consensus 4 dvv~vlD~S~Sm~~~~-~~~~k~~~~~~~~~l~~~~~~~i~~~~~rvglv~fs~~~~~~~~l~----------------- 65 (186)
T cd01480 4 DITFVLDSSESVGLQN-FDITKNFVKRVAERFLKDYYRKDPAGSWRVGVVQYSDQQEVEAGFL----------------- 65 (186)
T ss_pred eEEEEEeCCCccchhh-HHHHHHHHHHHHHHHhhhhccCCCCCceEEEEEEecCCceeeEecc-----------------
Confidence 3689999999875444 56667777777777621 234799999999764421 110
Q ss_pred cCCCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh----cC-CEEEEEecCCCCCCcccccc
Q 001711 501 FVPLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR----LG-GKLLIFQNSLPSLGVGCLKL 575 (1021)
Q Consensus 501 f~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~----~G-GkIivF~sg~Pt~GpG~L~~ 575 (1021)
+. ...++.+.+.|+.|... ...+++|.||..|...+.. .. ..|++++.|..+.+
T Consensus 66 -----~~-----~~~~~~l~~~i~~l~~~-----gg~T~~~~AL~~a~~~l~~~~~~~~~~~iillTDG~~~~~------ 124 (186)
T cd01480 66 -----RD-----IRNYTSLKEAVDNLEYI-----GGGTFTDCALKYATEQLLEGSHQKENKFLLVITDGHSDGS------ 124 (186)
T ss_pred -----cc-----cCCHHHHHHHHHhCccC-----CCCccHHHHHHHHHHHHhccCCCCCceEEEEEeCCCcCCC------
Confidence 00 12356667777777531 3468899999999999864 12 34555555543210
Q ss_pred cCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCC
Q 001711 576 RGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQ 646 (1021)
Q Consensus 576 r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~ 646 (1021)
...-..+.+.++.+.||.|-.+.++. .|...|..++...+|. |+-.+|.
T Consensus 125 -------------------~~~~~~~~~~~~~~~gi~i~~vgig~--~~~~~L~~IA~~~~~~-~~~~~~~ 173 (186)
T cd01480 125 -------------------PDGGIEKAVNEADHLGIKIFFVAVGS--QNEEPLSRIACDGKSA-LYRENFA 173 (186)
T ss_pred -------------------cchhHHHHHHHHHHCCCEEEEEecCc--cchHHHHHHHcCCcch-hhhcchh
Confidence 00122456677888888866666654 7888899999887776 5555553
No 33
>PF00626 Gelsolin: Gelsolin repeat; InterPro: IPR007123 Gelsolin is a cytoplasmic, calcium-regulated, actin-modulating protein that binds to the barbed ends of actin filaments, preventing monomer exchange (end-blocking or capping) []. It can promote nucleation (the assembly of monomers into filaments), as well as sever existing filaments. In addition, this protein binds with high affinity to fibronectin. Plasma gelsolin and cytoplasmic gelsolin are derived from a single gene by alternate initiation sites and differential splicing. Sequence comparisons indicate an evolutionary relationship between gelsolin, villin, fragmin and severin []. Six large repeating segments occur in gelsolin and villin, and 3 similar segments in severin and fragmin. While the multiple repeats have yet to be related to any known function of the actin-severing proteins, the superfamily appears to have evolved from an ancestral sequence of 120 to 130 amino acid residues [].; PDB: 3FG6_F 1RGI_G 2FGH_A 1D0N_B 3EGD_B 2NUP_B 2NUT_B 3EGX_B 1JHW_A 1J72_A ....
Probab=97.99 E-value=6.7e-06 Score=72.99 Aligned_cols=66 Identities=24% Similarity=0.488 Sum_probs=50.1
Q ss_pred cccccccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhhccccccccchHHHHHHHHHHHHHH-HhCC
Q 001711 892 IMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAELSKVMLREQDNEMSRKLLGILKKLR-EQDP 970 (1021)
Q Consensus 892 lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~~lp~~~n~~s~~l~~ii~~lr-~~r~ 970 (1021)
++..++++.+.|.++++||||+|..||+|+|+.. ...++.++. .+++++. ..|.
T Consensus 4 ~~~~~~~s~~~L~s~~~yIld~~~~i~vW~G~~~--~~~e~~~a~-----------------------~~a~~~~~~~~~ 58 (76)
T PF00626_consen 4 RPEQVPLSQSSLNSDDCYILDCGYEIFVWVGKKS--SPEEKAFAA-----------------------QLAQELLSEERP 58 (76)
T ss_dssp EEEEESSSGGGEETTSEEEEEESSEEEEEEHTTS--HHHHHHHHH-----------------------HHHHHHHHHHTT
T ss_pred cCCcCCCCHHHcCCCCEEEEEeCCCcEEEEeccC--CHHHHHHHH-----------------------HHHHHhhhhcCC
Confidence 4677899999999999999999999999999994 344444433 3445555 6677
Q ss_pred CCCceEEEeccCC
Q 001711 971 SYYQLCQLVRQGE 983 (1021)
Q Consensus 971 ~~~~l~~vvrqg~ 983 (1021)
...++ .++.+|.
T Consensus 59 ~~~~~-~~~~eg~ 70 (76)
T PF00626_consen 59 PLPEV-IRVEEGK 70 (76)
T ss_dssp TTSEE-EEEETTH
T ss_pred CCCEE-EEecCCC
Confidence 77776 7778874
No 34
>PF13768 VWA_3: von Willebrand factor type A domain
Probab=97.97 E-value=0.00011 Score=74.18 Aligned_cols=150 Identities=23% Similarity=0.305 Sum_probs=99.8
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCccc
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDLL 509 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~lL 509 (1021)
.|||||+|.++. |..+.++++|+..|+.|+++ .++.||+||+.++.|.- . +
T Consensus 3 vvilvD~S~Sm~--g~~~~~k~al~~~l~~L~~~--d~fnii~f~~~~~~~~~--~-----------------------~ 53 (155)
T PF13768_consen 3 VVILVDTSGSMS--GEKELVKDALRAILRSLPPG--DRFNIIAFGSSVRPLFP--G-----------------------L 53 (155)
T ss_pred EEEEEeCCCCCC--CcHHHHHHHHHHHHHhCCCC--CEEEEEEeCCEeeEcch--h-----------------------H
Confidence 689999999984 33388999999999999865 79999999998775431 1 1
Q ss_pred eeh-hhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh--cCCEEEEEecCCCCCCcccccccCCcCcccCCC
Q 001711 510 VNL-SESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR--LGGKLLIFQNSLPSLGVGCLKLRGDDLRVYGTD 586 (1021)
Q Consensus 510 v~l-~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~--~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~gt~ 586 (1021)
+.. .+.++...+.++.+.. ....+.+..||+.|+..+.. .--.|++++.|.++.+.
T Consensus 54 ~~~~~~~~~~a~~~I~~~~~-----~~G~t~l~~aL~~a~~~~~~~~~~~~IilltDG~~~~~~---------------- 112 (155)
T PF13768_consen 54 VPATEENRQEALQWIKSLEA-----NSGGTDLLAALRAALALLQRPGCVRAIILLTDGQPVSGE---------------- 112 (155)
T ss_pred HHHhHHHHHHHHHHHHHhcc-----cCCCccHHHHHHHHHHhcccCCCccEEEEEEeccCCCCH----------------
Confidence 111 1344444555555432 25667899999999988632 34577888777653221
Q ss_pred ccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEE
Q 001711 587 KEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYY 641 (1021)
Q Consensus 587 ~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~ 641 (1021)
....+.+ .++. ..+.|+.|.++. ..+-..|..|++.|||...+
T Consensus 113 ---------~~i~~~v-~~~~-~~~~i~~~~~g~-~~~~~~L~~LA~~~~G~~~f 155 (155)
T PF13768_consen 113 ---------EEILDLV-RRAR-GHIRIFTFGIGS-DADADFLRELARATGGSFHF 155 (155)
T ss_pred ---------HHHHHHH-HhcC-CCceEEEEEECC-hhHHHHHHHHHHcCCCEEEC
Confidence 1122222 2222 457777777765 46678899999999998763
No 35
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=97.93 E-value=0.0002 Score=77.24 Aligned_cols=167 Identities=21% Similarity=0.272 Sum_probs=104.2
Q ss_pred eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCC-CCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCc
Q 001711 429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGF-PRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDD 507 (1021)
Q Consensus 429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~-~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~ 507 (1021)
-.+||||.|.+.-... ++.+++.++..++.|.-. ..++||||+|++.+++.- ++.+
T Consensus 4 DlvfllD~S~Sm~~~~-~~~~k~f~~~l~~~l~~~~~~~rvglv~fs~~~~~~~--------------~l~~-------- 60 (224)
T cd01475 4 DLVFLIDSSRSVRPEN-FELVKQFLNQIIDSLDVGPDATRVGLVQYSSTVKQEF--------------PLGR-------- 60 (224)
T ss_pred cEEEEEeCCCCCCHHH-HHHHHHHHHHHHHhcccCCCccEEEEEEecCceeEEe--------------cccc--------
Confidence 4799999999864333 678888899888877432 358999999998765420 1110
Q ss_pred cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHH-Hh-cC--------CE-EEEEecCCCCCCccccccc
Q 001711 508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVM-SR-LG--------GK-LLIFQNSLPSLGVGCLKLR 576 (1021)
Q Consensus 508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL-~~-~G--------Gk-IivF~sg~Pt~GpG~L~~r 576 (1021)
..+++.|.+.|+.|..+ ...+.+|.||+.|...+ .. .| -| |++|+.|.++
T Consensus 61 -----~~~~~~l~~~i~~i~~~-----~~~t~tg~AL~~a~~~~~~~~~g~r~~~~~~~kvvillTDG~s~--------- 121 (224)
T cd01475 61 -----FKSKADLKRAVRRMEYL-----ETGTMTGLAIQYAMNNAFSEAEGARPGSERVPRVGIVVTDGRPQ--------- 121 (224)
T ss_pred -----cCCHHHHHHHHHhCcCC-----CCCChHHHHHHHHHHHhCChhcCCCCCCCCCCeEEEEEcCCCCc---------
Confidence 01344556667777543 23467899999888653 21 11 13 4566555321
Q ss_pred CCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhcccccc-EEEEeCCCCCchhHHHHH
Q 001711 577 GDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGG-QVYYYPSFQSTTHGERLR 655 (1021)
Q Consensus 577 ~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG-~v~~y~~F~~~~d~~kl~ 655 (1021)
+-+++.+.++.+.||.| |+++-...|...|..|+..+++ .+|+-.+|+. -+++.
T Consensus 122 --------------------~~~~~~a~~lk~~gv~i--~~VgvG~~~~~~L~~ias~~~~~~~f~~~~~~~---l~~~~ 176 (224)
T cd01475 122 --------------------DDVSEVAAKARALGIEM--FAVGVGRADEEELREIASEPLADHVFYVEDFST---IEELT 176 (224)
T ss_pred --------------------ccHHHHHHHHHHCCcEE--EEEeCCcCCHHHHHHHhCCCcHhcEEEeCCHHH---HHHHh
Confidence 01356778888888655 5544445788999999987754 6666666542 34455
Q ss_pred HHHHHhc
Q 001711 656 HELSRDL 662 (1021)
Q Consensus 656 ~dL~r~l 662 (1021)
.+|...+
T Consensus 177 ~~l~~~~ 183 (224)
T cd01475 177 KKFQGKI 183 (224)
T ss_pred hhccccc
Confidence 5554443
No 36
>PTZ00441 sporozoite surface protein 2 (SSP2); Provisional
Probab=97.93 E-value=0.00037 Score=83.43 Aligned_cols=163 Identities=11% Similarity=0.064 Sum_probs=100.5
Q ss_pred CeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEE-EEecCCCCCCcceeeccccccccCCCC
Q 001711 428 PLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIH-FYNMKSSLTQPQMMVISDLDDIFVPLP 505 (1021)
Q Consensus 428 p~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~-fynl~~~~~~p~mlVvsDldd~f~Pl~ 505 (1021)
.-++||||+|.+.-...+++.++..++..++.+.. ..+++||+|+|++..+ ++.+....
T Consensus 43 lDIvFLLD~SgSMg~~Nfle~AK~Fa~~LV~~l~Is~D~V~VgiV~FSd~~r~vfpL~s~~------------------- 103 (576)
T PTZ00441 43 VDLYLLVDGSGSIGYHNWITHVIPMLMGLIQQLNLSDDAINLYMSLFSNNTTELIRLGSGA------------------- 103 (576)
T ss_pred ceEEEEEeCCCccCCccHHHHHHHHHHHHHHHhccCCCceEEEEEEeCCCceEEEecCCCc-------------------
Confidence 35799999999886656667788888888887753 3458899999987654 33332211
Q ss_pred CccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC------CEEEEEecCCCCCCcccccccCCc
Q 001711 506 DDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG------GKLLIFQNSLPSLGVGCLKLRGDD 579 (1021)
Q Consensus 506 ~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G------GkIivF~sg~Pt~GpG~L~~r~~~ 579 (1021)
-.+.......|..++..+. ....+.+|.||..|...+...+ +.||||+.|.++-+
T Consensus 104 ---s~Dk~~aL~~I~sL~~~~~------pgGgTnig~AL~~Aae~L~sr~~R~nvpKVVILLTDG~sns~---------- 164 (576)
T PTZ00441 104 ---SKDKEQALIIVKSLRKTYL------PYGKTNMTDALLEVRKHLNDRVNRENAIQLVILMTDGIPNSK---------- 164 (576)
T ss_pred ---cccHHHHHHHHHHHHhhcc------CCCCccHHHHHHHHHHHHhhcccccCCceEEEEEecCCCCCc----------
Confidence 0011122333333333321 1245779999999988887543 56778877664311
Q ss_pred CcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhc----cccccEEEEeCCCC
Q 001711 580 LRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLA----KYTGGQVYYYPSFQ 646 (1021)
Q Consensus 580 ~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~----~~TGG~v~~y~~F~ 646 (1021)
.+. .+.+.++.+.||.|-+|.++. ..|...+..|+ ..++|.+|.+.+|+
T Consensus 165 ----------------~dv-leaAq~LR~~GVeI~vIGVG~-g~n~e~LrlIAgC~p~~g~c~~Y~vadf~ 217 (576)
T PTZ00441 165 ----------------YRA-LEESRKLKDRNVKLAVIGIGQ-GINHQFNRLLAGCRPREGKCKFYSDADWE 217 (576)
T ss_pred ----------------ccH-HHHHHHHHHCCCEEEEEEeCC-CcCHHHHHHHhccCCCCCCCceEEeCCHH
Confidence 001 134566777888766666643 46666555555 33556788877774
No 37
>cd01450 vWFA_subfamily_ECM Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A
Probab=97.91 E-value=0.00022 Score=71.32 Aligned_cols=145 Identities=21% Similarity=0.198 Sum_probs=98.8
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCcc
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDL 508 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~l 508 (1021)
++||||+|.++-. .-++.+++.+...++.+.. +.+.+|+||+|++..+... ++. +.
T Consensus 3 i~~llD~S~Sm~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~li~f~~~~~~~~--------------~~~-------~~- 59 (161)
T cd01450 3 IVFLLDGSESVGP-ENFEKVKDFIEKLVEKLDIGPDKTRVGLVQYSDDVRVEF--------------SLN-------DY- 59 (161)
T ss_pred EEEEEeCCCCcCH-HHHHHHHHHHHHHHHheeeCCCceEEEEEEEcCCceEEE--------------ECC-------CC-
Confidence 5799999998743 2567788888888887763 2468999999997543210 100 00
Q ss_pred ceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC-------CEEEEEecCCCCCCcccccccCCcCc
Q 001711 509 LVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG-------GKLLIFQNSLPSLGVGCLKLRGDDLR 581 (1021)
Q Consensus 509 Lv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G-------GkIivF~sg~Pt~GpG~L~~r~~~~r 581 (1021)
..++.+.+.|+.+..... ..+.++.||+.|...+.... ..|++|++|.++.+.
T Consensus 60 -----~~~~~~~~~i~~~~~~~~----~~t~~~~al~~a~~~~~~~~~~~~~~~~~iiliTDG~~~~~~----------- 119 (161)
T cd01450 60 -----KSKDDLLKAVKNLKYLGG----GGTNTGKALQYALEQLFSESNARENVPKVIIVLTDGRSDDGG----------- 119 (161)
T ss_pred -----CCHHHHHHHHHhcccCCC----CCccHHHHHHHHHHHhcccccccCCCCeEEEEECCCCCCCCc-----------
Confidence 024455556666543211 46889999999999986542 257777777655431
Q ss_pred ccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhcccc
Q 001711 582 VYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYT 635 (1021)
Q Consensus 582 ~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~T 635 (1021)
-..++..++.+.+|.|..+.++. .|...|..|+..|
T Consensus 120 ----------------~~~~~~~~~~~~~v~v~~i~~g~--~~~~~l~~la~~~ 155 (161)
T cd01450 120 ----------------DPKEAAAKLKDEGIKVFVVGVGP--ADEEELREIASCP 155 (161)
T ss_pred ----------------chHHHHHHHHHCCCEEEEEeccc--cCHHHHHHHhCCC
Confidence 12566777788888888887766 7888899999888
No 38
>cd01477 vWA_F09G8-8_type VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of mo
Probab=97.87 E-value=0.00038 Score=73.64 Aligned_cols=151 Identities=23% Similarity=0.265 Sum_probs=90.2
Q ss_pred eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-------CCCceEEEEEEcCeEEE-EecCCCCCCcceeeccccccc
Q 001711 429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-------FPRTQIGFITFDSTIHF-YNMKSSLTQPQMMVISDLDDI 500 (1021)
Q Consensus 429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-------~~rt~VgiITFds~V~f-ynl~~~~~~p~mlVvsDldd~ 500 (1021)
=.|||||.|.+.-..+ ++.+++.|+..+..+.. ...+|||+|+|++..++ ++|. |.
T Consensus 21 DivfvlD~S~Sm~~~~-f~~~k~fi~~~~~~~~~~~~~~~~~~~~rVGlV~fs~~a~~~~~L~------------d~--- 84 (193)
T cd01477 21 DIVFVVDNSKGMTQGG-LWQVRATISSLFGSSSQIGTDYDDPRSTRVGLVTYNSNATVVADLN------------DL--- 84 (193)
T ss_pred eEEEEEeCCCCcchhh-HHHHHHHHHHHHhhccccccccCCCCCcEEEEEEccCceEEEEecc------------cc---
Confidence 4799999999875433 67788888887776543 13489999999987653 2221 10
Q ss_pred cCCCCCccceehhhhHHHHHHHHhh-CCCcccCCCCcccchHHHHHHHHHHHHhc--C-----CE-EEEEecCCCCCCcc
Q 001711 501 FVPLPDDLLVNLSESRSVVDTLLDS-LPSMFQDNMNVESAFGPALKAAFMVMSRL--G-----GK-LLIFQNSLPSLGVG 571 (1021)
Q Consensus 501 f~Pl~~~lLv~l~es~~~I~~lLd~-Lp~~f~~~~~~~~alG~AL~aA~~lL~~~--G-----Gk-IivF~sg~Pt~GpG 571 (1021)
...+.+.+.|+. +..+. ...++.+|.||+.|.+++... + -| ||+++++--+.+
T Consensus 85 -------------~~~~~~~~ai~~~~~~~~---~~ggT~ig~aL~~A~~~l~~~~~~~R~~v~kvvIllTDg~~~~~-- 146 (193)
T cd01477 85 -------------QSFDDLYSQIQGSLTDVS---STNASYLDTGLQAAEQMLAAGKRTSRENYKKVVIVFASDYNDEG-- 146 (193)
T ss_pred -------------cCHHHHHHHHHHHhhccc---cCCcchHHHHHHHHHHHHHhhhccccCCCCeEEEEEecCccCCC--
Confidence 011222222332 21111 123678999999999999742 3 46 455544421100
Q ss_pred cccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccE
Q 001711 572 CLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQ 638 (1021)
Q Consensus 572 ~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~ 638 (1021)
+ . -..+.|.++.+.||.|..+.++. +.|...+..|++..++.
T Consensus 147 ----------------~-------~-~~~~~a~~l~~~GI~i~tVGiG~-~~d~~~~~~L~~ias~~ 188 (193)
T cd01477 147 ----------------S-------N-DPRPIAARLKSTGIAIITVAFTQ-DESSNLLDKLGKIASPG 188 (193)
T ss_pred ----------------C-------C-CHHHHHHHHHHCCCEEEEEEeCC-CCCHHHHHHHHHhcCCC
Confidence 0 0 02467888999999998888875 45544455555554443
No 39
>cd01471 vWA_micronemal_protein Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.
Probab=97.86 E-value=0.00038 Score=72.53 Aligned_cols=149 Identities=15% Similarity=0.153 Sum_probs=92.5
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEE-EecCCCCCCcceeeccccccccCCCCCc
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHF-YNMKSSLTQPQMMVISDLDDIFVPLPDD 507 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~f-ynl~~~~~~p~mlVvsDldd~f~Pl~~~ 507 (1021)
++||||+|.++-....++.+++.++..++.+.- ..+++||+|+|++..+. +++...
T Consensus 3 v~~vlD~SgSm~~~~~~~~~k~~~~~~~~~~~~~~~~~~vglv~Fs~~~~~~~~l~~~---------------------- 60 (186)
T cd01471 3 LYLLVDGSGSIGYSNWVTHVVPFLHTFVQNLNISPDEINLYLVTFSTNAKELIRLSSP---------------------- 60 (186)
T ss_pred EEEEEeCCCCccchhhHHHHHHHHHHHHHhcccCCCceEEEEEEecCCceEEEECCCc----------------------
Confidence 689999999986655477888888888887752 23589999999987652 323211
Q ss_pred cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-C------CEEEEEecCCCCCCcccccccCCcC
Q 001711 508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-G------GKLLIFQNSLPSLGVGCLKLRGDDL 580 (1021)
Q Consensus 508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-G------GkIivF~sg~Pt~GpG~L~~r~~~~ 580 (1021)
....++.+.++++.|..+. .....++++.||+.|.+.+... + ..|+++++|.++-+..
T Consensus 61 ----~~~~~~~~~~~i~~l~~~~--~~~G~T~l~~aL~~a~~~l~~~~~~r~~~~~~villTDG~~~~~~~--------- 125 (186)
T cd01471 61 ----NSTNKDLALNAIRALLSLY--YPNGSTNTTSALLVVEKHLFDTRGNRENAPQLVIIMTDGIPDSKFR--------- 125 (186)
T ss_pred ----cccchHHHHHHHHHHHhCc--CCCCCccHHHHHHHHHHHhhccCCCcccCceEEEEEccCCCCCCcc---------
Confidence 0112222223333332211 1235678999999999999652 1 2477777766432100
Q ss_pred cccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccc
Q 001711 581 RVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKY 634 (1021)
Q Consensus 581 r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~ 634 (1021)
. .+.+.++.+.||.|-++.++ ...|...|..|+..
T Consensus 126 ----------------~--~~~a~~l~~~gv~v~~igiG-~~~d~~~l~~ia~~ 160 (186)
T cd01471 126 ----------------T--LKEARKLRERGVIIAVLGVG-QGVNHEENRSLVGC 160 (186)
T ss_pred ----------------h--hHHHHHHHHCCCEEEEEEee-hhhCHHHHHHhcCC
Confidence 0 13456677788776666665 35777777777664
No 40
>TIGR02442 Cob-chelat-sub cobaltochelatase subunit. A number of genomes (actinobacteria, cyanobacteria, betaproteobacteria and pseudomonads) which apparently biosynthesize B12, encode a cobN gene but are demonstrably lacking cobS and cobT. These genomes do, however contain a homolog (modelled here) of the magnesium chelatase subunits BchI/BchD family. Aside from the cyanobacteria (which have a separate magnesium chelatase trimer), these species do not make chlorins, so do not have any use for a magnesium chelatase. Furthermore, in nearly all cases the members of this family are proximal to either CobN itself or other genes involved in cobalt transport or B12 biosynthesis.
Probab=97.83 E-value=0.00018 Score=89.15 Aligned_cols=160 Identities=21% Similarity=0.273 Sum_probs=109.7
Q ss_pred CCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCe-EEEEecCCCCCCcceeeccccccccCCCC
Q 001711 427 PPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDST-IHFYNMKSSLTQPQMMVISDLDDIFVPLP 505 (1021)
Q Consensus 427 pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~-V~fynl~~~~~~p~mlVvsDldd~f~Pl~ 505 (1021)
.-.++||||+|.++...+-++.++.++...|..... .+.+||||+|++. ..+
T Consensus 465 ~~~vv~vvD~SgSM~~~~rl~~ak~a~~~ll~~a~~-~~D~v~lI~F~g~~a~~-------------------------- 517 (633)
T TIGR02442 465 GNLVIFVVDASGSMAARGRMAAAKGAVLSLLRDAYQ-KRDKVALITFRGEEAEV-------------------------- 517 (633)
T ss_pred CceEEEEEECCccCCCccHHHHHHHHHHHHHHHhhc-CCCEEEEEEECCCCceE--------------------------
Confidence 457889999999985444577778777777764322 2478999999743 111
Q ss_pred CccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh-------cCCEEEEEecCCCCCCcccccccCC
Q 001711 506 DDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR-------LGGKLLIFQNSLPSLGVGCLKLRGD 578 (1021)
Q Consensus 506 ~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~-------~GGkIivF~sg~Pt~GpG~L~~r~~ 578 (1021)
++++..+++.+...|+.|+. ...+.++.||..|..+++. ..+.|++++.|..|.|.+. ++
T Consensus 518 ---~~p~t~~~~~~~~~L~~l~~------gG~Tpl~~aL~~A~~~l~~~~~~~~~~~~~vvliTDG~~n~~~~~----~~ 584 (633)
T TIGR02442 518 ---LLPPTSSVELAARRLEELPT------GGRTPLAAGLLKAAEVLSNELLRDDDGRPLLVVITDGRANVADGG----EP 584 (633)
T ss_pred ---EcCCCCCHHHHHHHHHhCCC------CCCCCHHHHHHHHHHHHHHhhccCCCCceEEEEECCCCCCCCCCC----CC
Confidence 11122344555667777753 4567899999999999883 2367999999998875110 00
Q ss_pred cCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEe
Q 001711 579 DLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYY 642 (1021)
Q Consensus 579 ~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y 642 (1021)
+..+ -..+|.++.+.+|.+.++-+...+++...+..||+.+||+.|+.
T Consensus 585 ---------------~~~~-~~~~a~~l~~~~i~~~vIdt~~~~~~~~~~~~lA~~~gg~y~~l 632 (633)
T TIGR02442 585 ---------------PTDD-ARTIAAKLAARGILFVVIDTESGFVRLGLAEDLARALGGEYVRL 632 (633)
T ss_pred ---------------hHHH-HHHHHHHHHhcCCeEEEEeCCCCCcchhHHHHHHHhhCCeEEec
Confidence 0011 24567777778887766666667777888999999999999864
No 41
>cd01469 vWA_integrins_alpha_subunit Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.
Probab=97.81 E-value=0.00065 Score=70.58 Aligned_cols=156 Identities=12% Similarity=0.183 Sum_probs=100.0
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCC-CCceEEEEEEcCeEEEE-ecCCCCCCcceeeccccccccCCCCCc
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGF-PRTQIGFITFDSTIHFY-NMKSSLTQPQMMVISDLDDIFVPLPDD 507 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~-~rt~VgiITFds~V~fy-nl~~~~~~p~mlVvsDldd~f~Pl~~~ 507 (1021)
++|+||.|.+.-.. -++.+++.++..++.+..+ ..+|||+|+|++..++. ++. |.
T Consensus 3 i~fvlD~S~S~~~~-~f~~~k~fi~~~i~~l~~~~~~~rvgvv~fs~~~~~~~~l~------------~~---------- 59 (177)
T cd01469 3 IVFVLDGSGSIYPD-DFQKVKNFLSTVMKKLDIGPTKTQFGLVQYSESFRTEFTLN------------EY---------- 59 (177)
T ss_pred EEEEEeCCCCCCHH-HHHHHHHHHHHHHHHcCcCCCCcEEEEEEECCceeEEEecC------------cc----------
Confidence 68999999886432 3677888899988887643 35899999999876531 221 10
Q ss_pred cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHH--HhcCC------EEEEEecCCCCCCcccccccCCc
Q 001711 508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVM--SRLGG------KLLIFQNSLPSLGVGCLKLRGDD 579 (1021)
Q Consensus 508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL--~~~GG------kIivF~sg~Pt~GpG~L~~r~~~ 579 (1021)
.+.+.+.+.++.+... ...+.+|.||+.|...+ ...|. -+++++.|..+-+.
T Consensus 60 ------~~~~~~~~~i~~~~~~-----~g~T~~~~AL~~a~~~l~~~~~g~R~~~~kv~illTDG~~~~~~--------- 119 (177)
T cd01469 60 ------RTKEEPLSLVKHISQL-----LGLTNTATAIQYVVTELFSESNGARKDATKVLVVITDGESHDDP--------- 119 (177)
T ss_pred ------CCHHHHHHHHHhCccC-----CCCccHHHHHHHHHHHhcCcccCCCCCCCeEEEEEeCCCCCCcc---------
Confidence 1122344455666532 22378999999998876 22332 36666665533211
Q ss_pred CcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCC---cChhhhhhhcccccc-EEEEeCCCC
Q 001711 580 LRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKY---TDIASLGTLAKYTGG-QVYYYPSFQ 646 (1021)
Q Consensus 580 ~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~---~diatl~~L~~~TGG-~v~~y~~F~ 646 (1021)
..+..+.++.+.||.|-.+..+..+ .+..+|..++..+++ ++|...+|+
T Consensus 120 ------------------~~~~~~~~~k~~gv~v~~Vgvg~~~~~~~~~~~L~~ias~p~~~h~f~~~~~~ 172 (177)
T cd01469 120 ------------------LLKDVIPQAEREGIIRYAIGVGGHFQRENSREELKTIASKPPEEHFFNVTDFA 172 (177)
T ss_pred ------------------ccHHHHHHHHHCCcEEEEEEecccccccccHHHHHHHhcCCcHHhEEEecCHH
Confidence 0044566677788877777766543 347889999998874 666666653
No 42
>cd01482 vWA_collagen_alphaI-XII-like Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=97.76 E-value=0.00083 Score=68.69 Aligned_cols=150 Identities=19% Similarity=0.185 Sum_probs=93.6
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCcc
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDL 508 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~l 508 (1021)
.+||||.|.+.-+.+ ++.+++.++..+..+.- .++++||||+|++..+..- ++++
T Consensus 3 v~~vlD~S~Sm~~~~-~~~~k~~~~~l~~~~~~~~~~~rvgli~fs~~~~~~~--------------~l~~--------- 58 (164)
T cd01482 3 IVFLVDGSWSIGRSN-FNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEF--------------DLNA--------- 58 (164)
T ss_pred EEEEEeCCCCcChhh-HHHHHHHHHHHHhheeeCCCceEEEEEEECCCeeEEE--------------ecCC---------
Confidence 689999999886544 57788888888887642 2458999999998654310 0110
Q ss_pred ceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHH-Hh-cC------CEEEEEecCCCCCCcccccccCCcC
Q 001711 509 LVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVM-SR-LG------GKLLIFQNSLPSLGVGCLKLRGDDL 580 (1021)
Q Consensus 509 Lv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL-~~-~G------GkIivF~sg~Pt~GpG~L~~r~~~~ 580 (1021)
..+++.+.+.|+++.. ....+.+|.||..+...+ +. .| ..|++|+.|.++-
T Consensus 59 ----~~~~~~l~~~l~~~~~-----~~g~T~~~~aL~~a~~~~~~~~~~~r~~~~k~iillTDG~~~~------------ 117 (164)
T cd01482 59 ----YTSKEDVLAAIKNLPY-----KGGNTRTGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQD------------ 117 (164)
T ss_pred ----CCCHHHHHHHHHhCcC-----CCCCChHHHHHHHHHHHhcccccCCCCCCCEEEEEEcCCCCCc------------
Confidence 0123445555666653 234567999999877644 32 11 2366776654320
Q ss_pred cccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhcccccc-EEEEeC
Q 001711 581 RVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGG-QVYYYP 643 (1021)
Q Consensus 581 r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG-~v~~y~ 643 (1021)
-.++.+.++.+.||.+-.+ +-+..+...|..|+..+.. +++...
T Consensus 118 -----------------~~~~~a~~lk~~gi~i~~i--g~g~~~~~~L~~ia~~~~~~~~~~~~ 162 (164)
T cd01482 118 -----------------DVELPARVLRNLGVNVFAV--GVKDADESELKMIASKPSETHVFNVA 162 (164)
T ss_pred -----------------hHHHHHHHHHHCCCEEEEE--ecCcCCHHHHHHHhCCCchheEEEcC
Confidence 1245677888888754444 4444668889999888654 455443
No 43
>TIGR02031 BchD-ChlD magnesium chelatase ATPase subunit D. This model represents one of two ATPase subunits of the trimeric magnesium chelatase responsible for insertion of magnesium ion into protoporphyrin IX. This is an essential step in the biosynthesis of both chlorophyll and bacteriochlorophyll. This subunit is found in green plants, photosynthetic algae, cyanobacteria and other photosynthetic bacteria. Unlike subunit I (TIGR02030), this subunit is not found in archaea.
Probab=97.75 E-value=0.00044 Score=84.93 Aligned_cols=174 Identities=20% Similarity=0.242 Sum_probs=117.6
Q ss_pred CCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCC
Q 001711 426 MPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLP 505 (1021)
Q Consensus 426 ~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~ 505 (1021)
..-.++||||+|.++-. .-++.+++++...|..+-. .+-+||||+|++...-+ + +|
T Consensus 406 ~~~~v~fvvD~SGSM~~-~rl~~aK~av~~Ll~~~~~-~~D~v~Li~F~~~~a~~------------~--------lp-- 461 (589)
T TIGR02031 406 SGRLLIFVVDASGSAAV-ARMSEAKGAVELLLGEAYV-HRDQVSLIAFRGTAAEV------------L--------LP-- 461 (589)
T ss_pred cCceEEEEEECCCCCCh-HHHHHHHHHHHHHHHhhcc-CCCEEEEEEECCCCceE------------E--------CC--
Confidence 45568899999998832 3578888888888875422 23589999997542110 0 11
Q ss_pred CccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh---cCC--EEEEEecCCCCCCccc-ccccCCc
Q 001711 506 DDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR---LGG--KLLIFQNSLPSLGVGC-LKLRGDD 579 (1021)
Q Consensus 506 ~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~---~GG--kIivF~sg~Pt~GpG~-L~~r~~~ 579 (1021)
...+++.+...|+.|+. ..++.++.||..|...++. .++ .|++++.|.+|+|.+. ......
T Consensus 462 ------~t~~~~~~~~~L~~l~~------gGgTpL~~gL~~A~~~~~~~~~~~~~~~ivllTDG~~nv~~~~~~~~~~~- 528 (589)
T TIGR02031 462 ------PSRSVEQAKRRLDVLPG------GGGTPLAAGLAAAFQTALQARSSGGTPTIVLITDGRGNIPLDGDPESIKA- 528 (589)
T ss_pred ------CCCCHHHHHHHHhcCCC------CCCCcHHHHHHHHHHHHHHhcccCCceEEEEECCCCCCCCCCcccccccc-
Confidence 11233444556777752 4567899999999999864 233 6999999999987531 110000
Q ss_pred CcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCC
Q 001711 580 LRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQS 647 (1021)
Q Consensus 580 ~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~ 647 (1021)
. .....+-...++.++.+.||.+-++-+...+.+..-+..|++..||..|+.++-++
T Consensus 529 ------~-----~~~~~~~~~~~a~~~~~~gi~~~vid~~~~~~~~~~~~~lA~~~~g~y~~l~~~~a 585 (589)
T TIGR02031 529 ------D-----REQAAEEALALARKIREAGMPALVIDTAMRFVSTGFAQKLARKMGAHYIYLPNATA 585 (589)
T ss_pred ------c-----chhHHHHHHHHHHHHHhcCCeEEEEeCCCCCccchHHHHHHHhcCCcEEeCCCCCh
Confidence 0 11223344677888999998877777777777777789999999999999887543
No 44
>COG1240 ChlD Mg-chelatase subunit ChlD [Coenzyme metabolism]
Probab=97.73 E-value=0.00043 Score=75.00 Aligned_cols=166 Identities=17% Similarity=0.236 Sum_probs=119.5
Q ss_pred CCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCC
Q 001711 426 MPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLP 505 (1021)
Q Consensus 426 ~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~ 505 (1021)
....+|||||.|.++-...-.++++-++...|.+--. .|-||++|+|... +
T Consensus 77 ~g~lvvfvVDASgSM~~~~Rm~aaKG~~~~lL~dAYq-~RdkvavI~F~G~-----------~----------------- 127 (261)
T COG1240 77 AGNLIVFVVDASGSMAARRRMAAAKGAALSLLRDAYQ-RRDKVAVIAFRGE-----------K----------------- 127 (261)
T ss_pred cCCcEEEEEeCcccchhHHHHHHHHHHHHHHHHHHHH-ccceEEEEEecCC-----------c-----------------
Confidence 4457899999999986655688888888888875332 3578999999632 1
Q ss_pred CccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC-------CEEEEEecCCCCCCcccccccCC
Q 001711 506 DDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG-------GKLLIFQNSLPSLGVGCLKLRGD 578 (1021)
Q Consensus 506 ~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G-------GkIivF~sg~Pt~GpG~L~~r~~ 578 (1021)
-.++++...+.+.+++.|+.|+. ...+=+..||+.|.+++.... -.+++.+.|.+|.+.+.=..
T Consensus 128 A~lll~pT~sv~~~~~~L~~l~~------GG~TPL~~aL~~a~ev~~r~~r~~p~~~~~~vviTDGr~n~~~~~~~~--- 198 (261)
T COG1240 128 AELLLPPTSSVELAERALERLPT------GGKTPLADALRQAYEVLAREKRRGPDRRPVMVVITDGRANVPIPLGPK--- 198 (261)
T ss_pred ceEEeCCcccHHHHHHHHHhCCC------CCCCchHHHHHHHHHHHHHhhccCCCcceEEEEEeCCccCCCCCCchH---
Confidence 13455566677888889999984 344559999999999997532 47888999998876431100
Q ss_pred cCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCC
Q 001711 579 DLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQS 647 (1021)
Q Consensus 579 ~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~ 647 (1021)
.--...+.++...|+-+=+.=+...++.+.-...||+..||++|+.+..+.
T Consensus 199 ------------------~e~~~~a~~~~~~g~~~lvid~e~~~~~~g~~~~iA~~~Gg~~~~L~~l~~ 249 (261)
T COG1240 199 ------------------AETLEAASKLRLRGIQLLVIDTEGSEVRLGLAEEIARASGGEYYHLDDLSD 249 (261)
T ss_pred ------------------HHHHHHHHHHhhcCCcEEEEecCCccccccHHHHHHHHhCCeEEecccccc
Confidence 001345666667777666666677777777789999999999999987654
No 45
>PHA03247 large tegument protein UL36; Provisional
Probab=97.72 E-value=0.069 Score=72.32 Aligned_cols=14 Identities=21% Similarity=0.228 Sum_probs=8.6
Q ss_pred HHHHHHHHHHHHhc
Q 001711 446 LEVVAQTIKSCLDE 459 (1021)
Q Consensus 446 l~~~~~sI~~~L~~ 459 (1021)
|-.+|+.|...|..
T Consensus 3114 Li~ACr~i~r~lr~ 3127 (3151)
T PHA03247 3114 LIEACRRIRRQLRR 3127 (3151)
T ss_pred HHHHHHHHHHHHHH
Confidence 45566667666653
No 46
>smart00327 VWA von Willebrand factor (vWF) type A domain. VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.
Probab=97.71 E-value=0.0012 Score=66.89 Aligned_cols=153 Identities=22% Similarity=0.217 Sum_probs=104.8
Q ss_pred eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCc
Q 001711 429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDD 507 (1021)
Q Consensus 429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~ 507 (1021)
-++||||+|.++-. ..++.+.+.+...+..+.. .+..+||||+|++..+.+. +..
T Consensus 3 ~v~l~vD~S~SM~~-~~~~~~~~~~~~~~~~~~~~~~~~~i~ii~f~~~~~~~~---------------------~~~-- 58 (177)
T smart00327 3 DVVFLLDGSGSMGP-NRFEKAKEFVLKLVEQLDIGPDGDRVGLVTFSDDATVLF---------------------PLN-- 58 (177)
T ss_pred cEEEEEeCCCccch-HHHHHHHHHHHHHHHhcCCCCCCcEEEEEEeCCCceEEE---------------------ccc--
Confidence 47899999998842 4577888888888888764 2358999999998443321 000
Q ss_pred cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh---c-----CCEEEEEecCCCCCCcccccccCCc
Q 001711 508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR---L-----GGKLLIFQNSLPSLGVGCLKLRGDD 579 (1021)
Q Consensus 508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~---~-----GGkIivF~sg~Pt~GpG~L~~r~~~ 579 (1021)
....++.+...++.+... .....-++.||+.+...++. . .-.|++|++|.++.+
T Consensus 59 ----~~~~~~~~~~~i~~~~~~----~~~~~~~~~al~~~~~~~~~~~~~~~~~~~~~iviitDg~~~~~---------- 120 (177)
T smart00327 59 ----DSRSKDALLEALASLSYK----LGGGTNLGAALQYALENLFSKSAGSRRGAPKVLILITDGESNDG---------- 120 (177)
T ss_pred ----ccCCHHHHHHHHHhcCCC----CCCCchHHHHHHHHHHHhcCcCCCCCCCCCeEEEEEcCCCCCCC----------
Confidence 123345566677766532 33456789999999998852 1 125666666554422
Q ss_pred CcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEE
Q 001711 580 LRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYY 641 (1021)
Q Consensus 580 ~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~ 641 (1021)
..+++...++.+.+|.+..+.++... +...+..++..++|...+
T Consensus 121 -----------------~~~~~~~~~~~~~~i~i~~i~~~~~~-~~~~l~~~~~~~~~~~~~ 164 (177)
T smart00327 121 -----------------GDLLKAAKELKRSGVKVFVVGVGNDV-DEEELKKLASAPGGVYVF 164 (177)
T ss_pred -----------------ccHHHHHHHHHHCCCEEEEEEccCcc-CHHHHHHHhCCCcceEEe
Confidence 23467778888889888888887653 778899999999987765
No 47
>PRK13406 bchD magnesium chelatase subunit D; Provisional
Probab=97.71 E-value=0.00099 Score=81.47 Aligned_cols=167 Identities=18% Similarity=0.179 Sum_probs=111.8
Q ss_pred CCCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCe-EEEEecCCCCCCcceeeccccccccCCC
Q 001711 426 MPPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDST-IHFYNMKSSLTQPQMMVISDLDDIFVPL 504 (1021)
Q Consensus 426 ~pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~-V~fynl~~~~~~p~mlVvsDldd~f~Pl 504 (1021)
..-.++||||+|.++.. .-+..++.+++..|+..-. .|-+|++|+|++. ..+
T Consensus 400 ~~~~vvfvvD~SGSM~~-~rl~~aK~a~~~ll~~ay~-~rD~v~lI~F~g~~a~~------------------------- 452 (584)
T PRK13406 400 SETTTIFVVDASGSAAL-HRLAEAKGAVELLLAEAYV-RRDQVALVAFRGRGAEL------------------------- 452 (584)
T ss_pred CCccEEEEEECCCCCcH-hHHHHHHHHHHHHHHhhcC-CCCEEEEEEECCCceeE-------------------------
Confidence 34688999999999843 3578888888888876422 3468999999754 211
Q ss_pred CCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---C--CEEEEEecCCCCCCcccccccCCc
Q 001711 505 PDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---G--GKLLIFQNSLPSLGVGCLKLRGDD 579 (1021)
Q Consensus 505 ~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---G--GkIivF~sg~Pt~GpG~L~~r~~~ 579 (1021)
+++...+.+.+...|+.|+ ...++.++.||..|..+++.. | -.|++++.|-.|.|.+.-..+++
T Consensus 453 ----~lppT~~~~~~~~~L~~l~------~gGgTpL~~gL~~A~~~l~~~~~~~~~~~iVLlTDG~~n~~~~~~~~~~~- 521 (584)
T PRK13406 453 ----LLPPTRSLVRAKRSLAGLP------GGGGTPLAAGLDAAAALALQVRRKGMTPTVVLLTDGRANIARDGTAGRAQ- 521 (584)
T ss_pred ----EcCCCcCHHHHHHHHhcCC------CCCCChHHHHHHHHHHHHHHhccCCCceEEEEEeCCCCCCCccccccccc-
Confidence 1111123344556667775 246788999999999988642 2 47888999998886532111110
Q ss_pred CcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCC
Q 001711 580 LRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQS 647 (1021)
Q Consensus 580 ~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~ 647 (1021)
+..+ =..++..+.+.+|.+-++-+.... ...+..|++.+||..|..++-+.
T Consensus 522 --------------~~~~-~~~~a~~~~~~gi~~~vId~g~~~--~~~~~~LA~~~gg~y~~l~~~~a 572 (584)
T PRK13406 522 --------------AEED-ALAAARALRAAGLPALVIDTSPRP--QPQARALAEAMGARYLPLPRADA 572 (584)
T ss_pred --------------hhhH-HHHHHHHHHhcCCeEEEEecCCCC--cHHHHHHHHhcCCeEEECCCCCH
Confidence 0001 145678888888876666665444 34478999999999999997544
No 48
>cd00198 vWFA Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.
Probab=97.71 E-value=0.00096 Score=65.57 Aligned_cols=148 Identities=22% Similarity=0.320 Sum_probs=98.2
Q ss_pred eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCc
Q 001711 429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDD 507 (1021)
Q Consensus 429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~ 507 (1021)
.++|+||+|.++ ....++.+++.+...+..+.. ....+|++++|+...+.+- ++.+.
T Consensus 2 ~v~~viD~S~Sm-~~~~~~~~~~~~~~~~~~~~~~~~~~~i~v~~f~~~~~~~~--------------~~~~~------- 59 (161)
T cd00198 2 DIVFLLDVSGSM-GGEKLDKAKEALKALVSSLSASPPGDRVGLVTFGSNARVVL--------------PLTTD------- 59 (161)
T ss_pred cEEEEEeCCCCc-CcchHHHHHHHHHHHHHhcccCCCCcEEEEEEecCccceee--------------ccccc-------
Confidence 378999999987 345678888889999988875 2348999999997433211 00000
Q ss_pred cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-----CCEEEEEecCCCCCCcccccccCCcCcc
Q 001711 508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-----GGKLLIFQNSLPSLGVGCLKLRGDDLRV 582 (1021)
Q Consensus 508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-----GGkIivF~sg~Pt~GpG~L~~r~~~~r~ 582 (1021)
..++.+...++.+.. .......+..|+..+.+.+... ...|++|+.|..+.+.
T Consensus 60 ------~~~~~~~~~~~~~~~----~~~~~t~~~~al~~~~~~~~~~~~~~~~~~lvvitDg~~~~~~------------ 117 (161)
T cd00198 60 ------TDKADLLEAIDALKK----GLGGGTNIGAALRLALELLKSAKRPNARRVIILLTDGEPNDGP------------ 117 (161)
T ss_pred ------CCHHHHHHHHHhccc----CCCCCccHHHHHHHHHHHhcccCCCCCceEEEEEeCCCCCCCc------------
Confidence 134445556666643 2345677889999999999753 4567777776543321
Q ss_pred cCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhcccc
Q 001711 583 YGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYT 635 (1021)
Q Consensus 583 ~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~T 635 (1021)
.-.++...++.+.+|.|.++.++. ..+-..+..|+..|
T Consensus 118 --------------~~~~~~~~~~~~~~v~v~~v~~g~-~~~~~~l~~l~~~~ 155 (161)
T cd00198 118 --------------ELLAEAARELRKLGITVYTIGIGD-DANEDELKEIADKT 155 (161)
T ss_pred --------------chhHHHHHHHHHcCCEEEEEEcCC-CCCHHHHHHHhccc
Confidence 011345666777799998888776 45666788888887
No 49
>PF00092 VWA: von Willebrand factor type A domain; InterPro: IPR002035 The von Willebrand factor is a large multimeric glycoprotein found in blood plasma. Mutant forms are involved in the aetiology of bleeding disorders []. In von Willebrand factor, the type A domain (vWF) is the prototype for a protein superfamily. The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. Fold recognition algorithms were used to score sequence compatibility with a library of known structures: the vWF domain fold was predicted to be a doubly-wound, open, twisted beta-sheet flanked by alpha-helices []. 3D structures have been determined for the I-domains of integrins CD11b (with bound magnesium) [] and CD11a (with bound manganese) []. The domain adopts a classic alpha/beta Rossmann fold and contains an unusual metal ion coordination site at its surface. It has been suggested that this site represents a general metal ion-dependent adhesion site (MIDAS) for binding protein ligands []. The residues constituting the MIDAS motif in the CD11b and CD11a I-domains are completely conserved, but the manner in which the metal ion is coordinated differs slightly [].; GO: 0005515 protein binding; PDB: 2XGG_B 3ZQK_B 3GXB_A 3PPV_A 3PPX_A 3PPW_A 3PPY_A 1CQP_B 3TCX_B 2ICA_A ....
Probab=97.64 E-value=0.00086 Score=68.30 Aligned_cols=155 Identities=25% Similarity=0.331 Sum_probs=95.2
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCC-CCCCceEEEEEEcCeEEEE-ecCCCCCCcceeeccccccccCCCCCc
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELP-GFPRTQIGFITFDSTIHFY-NMKSSLTQPQMMVISDLDDIFVPLPDD 507 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp-~~~rt~VgiITFds~V~fy-nl~~~~~~p~mlVvsDldd~f~Pl~~~ 507 (1021)
.+|+||.|.++-..+ ++.+++.|...++.+. ...++|||||+|++..+.+ ++..
T Consensus 2 ivflvD~S~sm~~~~-~~~~~~~v~~~i~~~~~~~~~~rv~iv~f~~~~~~~~~~~~----------------------- 57 (178)
T PF00092_consen 2 IVFLVDTSGSMSGDN-FEKAKQFVKSIISRLSISNNGTRVGIVTFSDSARVLFSLTD----------------------- 57 (178)
T ss_dssp EEEEEE-STTSCHHH-HHHHHHHHHHHHHHSTBSTTSEEEEEEEESSSEEEEEETTS-----------------------
T ss_pred EEEEEeCCCCCchHH-HHHHHHHHHHHHHhhhccccccccceeeeeccccccccccc-----------------------
Confidence 589999999875433 6678888999988773 3456999999999887622 2211
Q ss_pred cceehhhhHHHHHHHH-hhCCCcccCCCCcccchHHHHHHHHHHHHhc--C------CEEEEEecCCCCCCcccccccCC
Q 001711 508 LLVNLSESRSVVDTLL-DSLPSMFQDNMNVESAFGPALKAAFMVMSRL--G------GKLLIFQNSLPSLGVGCLKLRGD 578 (1021)
Q Consensus 508 lLv~l~es~~~I~~lL-d~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~--G------GkIivF~sg~Pt~GpG~L~~r~~ 578 (1021)
.++.+.+.+.+ +.++. ....+.+|.||+.|...+... | .-|+++++|.++.+.
T Consensus 58 -----~~~~~~~~~~i~~~~~~-----~~g~t~~~~aL~~a~~~l~~~~~~~r~~~~~~iiliTDG~~~~~~-------- 119 (178)
T PF00092_consen 58 -----YQSKNDLLNAINDSIPS-----SGGGTNLGAALKFAREQLFSSNNGGRPNSPKVIILITDGNSNDSD-------- 119 (178)
T ss_dssp -----HSSHHHHHHHHHTTGGC-----CBSSB-HHHHHHHHHHHTTSGGGTTGTTSEEEEEEEESSSSSSHS--------
T ss_pred -----ccccccccccccccccc-----cchhhhHHHHHhhhhhcccccccccccccccceEEEEeecccCCc--------
Confidence 01222222222 33332 345677999999999998643 2 235666665543221
Q ss_pred cCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccc--cccEEEEeCCCC
Q 001711 579 DLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKY--TGGQVYYYPSFQ 646 (1021)
Q Consensus 579 ~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~--TGG~v~~y~~F~ 646 (1021)
.....+..+.+. ..|.+|+++.+..|...|..|+.. .+|++++..+|+
T Consensus 120 -------------------~~~~~~~~~~~~-~~i~~~~ig~~~~~~~~l~~la~~~~~~~~~~~~~~~~ 169 (178)
T PF00092_consen 120 -------------------SPSEEAANLKKS-NGIKVIAIGIDNADNEELRELASCPTSEGHVFYLADFS 169 (178)
T ss_dssp -------------------GHHHHHHHHHHH-CTEEEEEEEESCCHHHHHHHHSHSSTCHHHEEEESSHH
T ss_pred -------------------chHHHHHHHHHh-cCcEEEEEecCcCCHHHHHHHhCCCCCCCcEEEcCCHH
Confidence 011122222222 567777777777889999999965 447888877654
No 50
>cd01481 vWA_collagen_alpha3-VI-like VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=97.56 E-value=0.0024 Score=65.76 Aligned_cols=151 Identities=18% Similarity=0.232 Sum_probs=93.8
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEE-EecCCCCCCcceeeccccccccCCCCCc
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHF-YNMKSSLTQPQMMVISDLDDIFVPLPDD 507 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~f-ynl~~~~~~p~mlVvsDldd~f~Pl~~~ 507 (1021)
.+|+||.|.+.-+ .-++.+++.|+..++.+.- ...+|||+|+|++..+. ++|. +
T Consensus 3 ivfllD~S~Si~~-~~f~~~k~fi~~lv~~f~i~~~~~rVgvv~ys~~~~~~~~l~---------------~-------- 58 (165)
T cd01481 3 IVFLIDGSDNVGS-GNFPAIRDFIERIVQSLDVGPDKIRVAVVQFSDTPRPEFYLN---------------T-------- 58 (165)
T ss_pred EEEEEeCCCCcCH-HHHHHHHHHHHHHHhhccCCCCCcEEEEEEecCCeeEEEecc---------------c--------
Confidence 5899999987543 3477888889999988763 24589999999876542 1221 1
Q ss_pred cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHH-Hh-cCC-------EE-EEEecCCCCCCcccccccC
Q 001711 508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVM-SR-LGG-------KL-LIFQNSLPSLGVGCLKLRG 577 (1021)
Q Consensus 508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL-~~-~GG-------kI-ivF~sg~Pt~GpG~L~~r~ 577 (1021)
..+++.+.+.++.|+.+ ....+.+|.||+.+.+.+ .. .|+ |+ ++++.|..+
T Consensus 59 -----~~~~~~l~~~i~~i~~~----~g~~t~t~~AL~~~~~~~f~~~~g~R~~~~~~kv~vviTdG~s~---------- 119 (165)
T cd01481 59 -----HSTKADVLGAVRRLRLR----GGSQLNTGSALDYVVKNLFTKSAGSRIEEGVPQFLVLITGGKSQ---------- 119 (165)
T ss_pred -----cCCHHHHHHHHHhcccC----CCCcccHHHHHHHHHHhhcCccccCCccCCCCeEEEEEeCCCCc----------
Confidence 01233455566666532 112356899999887654 32 232 34 455544211
Q ss_pred CcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCC
Q 001711 578 DDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSF 645 (1021)
Q Consensus 578 ~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F 645 (1021)
+ -+++-|.++.+.|| .+|..+....|..+|..++..- -.+|...+|
T Consensus 120 ------------------d-~~~~~a~~lr~~gv--~i~~vG~~~~~~~eL~~ias~p-~~vf~v~~f 165 (165)
T cd01481 120 ------------------D-DVERPAVALKRAGI--VPFAIGARNADLAELQQIAFDP-SFVFQVSDF 165 (165)
T ss_pred ------------------c-hHHHHHHHHHHCCc--EEEEEeCCcCCHHHHHHHhCCC-ccEEEecCC
Confidence 1 13566778888875 5677776668888988888665 355555443
No 51
>cd01473 vWA_CTRP CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60 amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.
Probab=97.51 E-value=0.0037 Score=66.04 Aligned_cols=150 Identities=13% Similarity=0.127 Sum_probs=91.9
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEEEE-ecCCCCCCcceeeccccccccCCCCCc
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIHFY-NMKSSLTQPQMMVISDLDDIFVPLPDD 507 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~fy-nl~~~~~~p~mlVvsDldd~f~Pl~~~ 507 (1021)
.+|+||.|.+.-+..+-..+++.++..++.+.- ..++|||+|+|++..+++ .+...
T Consensus 3 i~fllD~S~Si~~~~f~~~~~~f~~~lv~~l~i~~~~~rvgvv~fs~~~~~~~~~~~~---------------------- 60 (192)
T cd01473 3 LTLILDESASIGYSNWRKDVIPFTEKIINNLNISKDKVHVGILLFAEKNRDVVPFSDE---------------------- 60 (192)
T ss_pred EEEEEeCCCcccHHHHHHHHHHHHHHHHHhCccCCCccEEEEEEecCCceeEEecCcc----------------------
Confidence 589999999875544433567778888887653 245899999999866532 22110
Q ss_pred cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcCC------E-EEEEecCCCCCCcccccccCCcC
Q 001711 508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLGG------K-LLIFQNSLPSLGVGCLKLRGDDL 580 (1021)
Q Consensus 508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~GG------k-IivF~sg~Pt~GpG~L~~r~~~~ 580 (1021)
....++.+.+.++.|..... ....+.+|.||+.|.+.+...+| | +|+++.|-.+-+
T Consensus 61 ----~~~~~~~l~~~i~~l~~~~~--~~g~T~~~~AL~~a~~~~~~~~~~r~~~~kv~IllTDG~s~~~----------- 123 (192)
T cd01473 61 ----ERYDKNELLKKINDLKNSYR--SGGETYIVEALKYGLKNYTKHGNRRKDAPKVTMLFTDGNDTSA----------- 123 (192)
T ss_pred ----cccCHHHHHHHHHHHHhccC--CCCcCcHHHHHHHHHHHhccCCCCcccCCeEEEEEecCCCCCc-----------
Confidence 01123444555566543221 13467799999999888754322 3 555555432110
Q ss_pred cccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccc
Q 001711 581 RVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKY 634 (1021)
Q Consensus 581 r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~ 634 (1021)
.+ .--.+.++++.+.||.|-.+..+. .+..+|..|+..
T Consensus 124 ------~~--------~~~~~~a~~lk~~gV~i~~vGiG~--~~~~el~~ia~~ 161 (192)
T cd01473 124 ------SK--------KELQDISLLYKEENVKLLVVGVGA--ASENKLKLLAGC 161 (192)
T ss_pred ------ch--------hhHHHHHHHHHHCCCEEEEEEecc--ccHHHHHHhcCC
Confidence 00 112466788888998877777664 467788888764
No 52
>cd01476 VWA_integrin_invertebrates VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.
Probab=97.41 E-value=0.0057 Score=62.07 Aligned_cols=102 Identities=18% Similarity=0.265 Sum_probs=66.5
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCC-CCCceEEEEEEcC--eEEE-EecCCCCCCcceeeccccccccCCCC
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPG-FPRTQIGFITFDS--TIHF-YNMKSSLTQPQMMVISDLDDIFVPLP 505 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds--~V~f-ynl~~~~~~p~mlVvsDldd~f~Pl~ 505 (1021)
++|+||+|.+.-. -++..++.+++.++.|.. ..+.+||+|+|++ ..++ +.+..
T Consensus 3 v~~llD~S~Sm~~--~~~~~~~~~~~~~~~l~~~~~~~~v~lv~f~~~~~~~~~~~l~~--------------------- 59 (163)
T cd01476 3 LLFVLDSSGSVRG--KFEKYKKYIERIVEGLEIGPTATRVALITYSGRGRQRVRFNLPK--------------------- 59 (163)
T ss_pred EEEEEeCCcchhh--hHHHHHHHHHHHHHhcCCCCCCcEEEEEEEcCCCceEEEecCCC---------------------
Confidence 6899999998743 366778888888888753 2358999999987 3332 11110
Q ss_pred CccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-C------CEEEEEecCCC
Q 001711 506 DDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-G------GKLLIFQNSLP 566 (1021)
Q Consensus 506 ~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-G------GkIivF~sg~P 566 (1021)
...++.+...|+.|.. ....+.+|.||+.|.+++... + ..|++++.|.+
T Consensus 60 -------~~~~~~l~~~i~~l~~-----~gg~T~l~~aL~~a~~~l~~~~~~r~~~~~~villTDG~~ 115 (163)
T cd01476 60 -------HNDGEELLEKVDNLRF-----IGGTTATGAAIEVALQQLDPSEGRREGIPKVVVVLTDGRS 115 (163)
T ss_pred -------CCCHHHHHHHHHhCcc-----CCCCccHHHHHHHHHHHhccccCCCCCCCeEEEEECCCCC
Confidence 1123455556666652 134578999999999999521 1 34666666543
No 53
>cd01464 vWA_subfamily VWA subfamily: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=97.33 E-value=0.0012 Score=68.30 Aligned_cols=138 Identities=18% Similarity=0.243 Sum_probs=84.3
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCC----CCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCC
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGF----PRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLP 505 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~----~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~ 505 (1021)
++||||+|.++-.. -++.++++++..++.|..+ ++.+|+||+|++..+..- . +.++++.
T Consensus 6 v~~llD~SgSM~~~-~~~~~k~a~~~~~~~l~~~~~~~~~~~v~ii~F~~~a~~~~---~--------l~~~~~~----- 68 (176)
T cd01464 6 IYLLLDTSGSMAGE-PIEALNQGLQMLQSELRQDPYALESVEISVITFDSAARVIV---P--------LTPLESF----- 68 (176)
T ss_pred EEEEEECCCCCCCh-HHHHHHHHHHHHHHHHhcChhhccccEEEEEEecCCceEec---C--------CccHHhc-----
Confidence 58999999987432 3567778888888777543 467999999998765421 0 0010000
Q ss_pred CccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-----C-------CEEEEEecCCCCCCcccc
Q 001711 506 DDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-----G-------GKLLIFQNSLPSLGVGCL 573 (1021)
Q Consensus 506 ~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-----G-------GkIivF~sg~Pt~GpG~L 573 (1021)
.++.| ....+++++.||+.|.+.|+.. + ..|++++.|.++-+...
T Consensus 69 ----------------~~~~l------~~~GgT~l~~aL~~a~~~l~~~~~~~~~~~~~~~~~~iillTDG~~~~~~~~- 125 (176)
T cd01464 69 ----------------QPPRL------TASGGTSMGAALELALDCIDRRVQRYRADQKGDWRPWVFLLTDGEPTDDLTA- 125 (176)
T ss_pred ----------------CCCcc------cCCCCCcHHHHHHHHHHHHHHHHHHhcccCcCCcCcEEEEEcCCCCCchHHH-
Confidence 00111 1235689999999999998542 0 15888888776422100
Q ss_pred cccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhcc
Q 001711 574 KLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAK 633 (1021)
Q Consensus 574 ~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~ 633 (1021)
. .+...++.+.++.|..|.++. .+|...|..|+.
T Consensus 126 ---------------------~----~~~~~~~~~~~~~i~~igiG~-~~~~~~L~~ia~ 159 (176)
T cd01464 126 ---------------------A----IERIKEARDSKGRIVACAVGP-KADLDTLKQITE 159 (176)
T ss_pred ---------------------H----HHHHHhhcccCCcEEEEEecc-ccCHHHHHHHHC
Confidence 0 122233344567777777766 578777777774
No 54
>smart00262 GEL Gelsolin homology domain. Gelsolin/severin/villin homology domain. Calcium-binding and actin-binding. Both intra- and extracellular domains.
Probab=97.23 E-value=0.0018 Score=59.55 Aligned_cols=71 Identities=25% Similarity=0.453 Sum_probs=49.8
Q ss_pred cccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhhccccccccchHHHHHHHHHHHHHHH-hCCCCCc
Q 001711 896 LPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAELSKVMLREQDNEMSRKLLGILKKLRE-QDPSYYQ 974 (1021)
Q Consensus 896 l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~~lp~~~n~~s~~l~~ii~~lr~-~r~~~~~ 974 (1021)
++++.+.|.++.+||||+|..||+|+|+.++...... ...+.+.+.+ .+....+
T Consensus 16 ~~~~~~~L~s~d~fild~~~~iyvW~G~~as~~ek~~-------------------------A~~~a~~~~~~~~~~~~~ 70 (90)
T smart00262 16 VPFSQGSLNSGDCYILDTGSEIYVWVGKKSSQDEKKK-------------------------AAELAVELDDTLGPGPVQ 70 (90)
T ss_pred cCCCHHHCCCCCEEEEECCCEEEEEECCCCCHHHHHH-------------------------HHHHHHHHHHhcCCCCce
Confidence 5678899999999999999999999999997765421 1222333332 2345567
Q ss_pred eEEEeccCCCcchHHHHHhhc
Q 001711 975 LCQLVRQGEQPREGFLLLANL 995 (1021)
Q Consensus 975 l~~vvrqg~~~~~e~~f~~~L 995 (1021)
+ .+++||... ..|..+|
T Consensus 71 i-~~v~eg~E~---~~F~~~f 87 (90)
T smart00262 71 V-RVVDEGKEP---PEFWSLF 87 (90)
T ss_pred E-EEEeCCCCC---HHHHHHh
Confidence 7 889998754 3565554
No 55
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton]
Probab=97.06 E-value=0.0036 Score=75.52 Aligned_cols=12 Identities=17% Similarity=0.158 Sum_probs=6.7
Q ss_pred HHHHhhhccCCC
Q 001711 827 YCLAICKSTPIR 838 (1021)
Q Consensus 827 yil~LlKS~~Lr 838 (1021)
++-+|+-..+||
T Consensus 1046 lLeaLqsgaafr 1057 (1102)
T KOG1924|consen 1046 LLEALQSGAAFR 1057 (1102)
T ss_pred HHHHHHhhcccc
Confidence 455555555555
No 56
>cd01454 vWA_norD_type norD type: Denitrifying bacteria contain both membrane bound and periplasmic nitrate reductases. Denitrification plays a major role in completing the nitrogen cycle by converting nitrate or nitrite to nitrogen gas. The pathway for microbial denitrification has been established as NO3- ------ NO2- ------ NO ------- N2O --------- N2. This reaction generally occurs under oxygen limiting conditions. Genetic and biochemical studies have shown that the first srep of the biochemical pathway is catalyzed by periplasmic nitrate reductases. This family is widely present in proteobacteria and firmicutes. This version of the domain is also present in some archaeal members. The function of the vWA domain in this sub-group is not known. Members of this subgroup have a conserved MIDAS motif.
Probab=96.99 E-value=0.021 Score=58.97 Aligned_cols=147 Identities=16% Similarity=0.126 Sum_probs=87.0
Q ss_pred eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCcc
Q 001711 429 LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDL 508 (1021)
Q Consensus 429 ~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~l 508 (1021)
.++|+||+|.++....-++.+++++...++.|.. .+.+++|++|++.. . .......+...+.++ .+
T Consensus 2 ~v~~llD~SgSM~~~~kl~~ak~a~~~l~~~l~~-~~d~~~l~~F~~~~-----~-~~~~~~~~~~~~~~~-------~~ 67 (174)
T cd01454 2 AVTLLLDLSGSMRSDRRIDVAKKAAVLLAEALEA-CGVPHAILGFTTDA-----G-GRERVRWIKIKDFDE-------SL 67 (174)
T ss_pred EEEEEEECCCCCCCCcHHHHHHHHHHHHHHHHHH-cCCcEEEEEecCCC-----C-CccceEEEEecCccc-------cc
Confidence 4789999999985433677788877777766654 23689999998752 0 000001111111111 00
Q ss_pred ceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh---cCCEEEEEecCCCCCCcccccccCCcCcccCC
Q 001711 509 LVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR---LGGKLLIFQNSLPSLGVGCLKLRGDDLRVYGT 585 (1021)
Q Consensus 509 Lv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~---~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~gt 585 (1021)
...+...|+.+.. ...+.+|.||..|...+.. ....|++++.|.|+.+...- +
T Consensus 68 -------~~~~~~~l~~~~~------~g~T~~~~al~~a~~~l~~~~~~~~~iiliTDG~~~~~~~~~----------~- 123 (174)
T cd01454 68 -------HERARKRLAALSP------GGNTRDGAAIRHAAERLLARPEKRKILLVISDGEPNDLDYYE----------G- 123 (174)
T ss_pred -------chhHHHHHHccCC------CCCCcHHHHHHHHHHHHhcCCCcCcEEEEEeCCCcCcccccC----------c-
Confidence 1122334444431 2357899999999999874 34568888899887653100 0
Q ss_pred CccccCCCCCcHHHHHH---HHHHhhCCcEEEEEEecCCC
Q 001711 586 DKEHSLRIPEDPFYKQM---AADLTKFQIAVNVYAFSDKY 622 (1021)
Q Consensus 586 ~~e~~l~~pa~~fY~~L---a~~~~~~gIsVDlF~~s~~~ 622 (1021)
.+ ...++. +.++.+.||.|..+.++.+.
T Consensus 124 ----~~-----~~~~~~~~~~~~~~~~gi~v~~igig~~~ 154 (174)
T cd01454 124 ----NV-----FATEDALRAVIEARKLGIEVFGITIDRDA 154 (174)
T ss_pred ----ch-----hHHHHHHHHHHHHHhCCcEEEEEEecCcc
Confidence 00 012233 77888899998877776553
No 57
>KOG1984 consensus Vesicle coat complex COPII, subunit SFB3 [Intracellular trafficking, secretion, and vesicular transport]
Probab=96.90 E-value=0.1 Score=64.51 Aligned_cols=33 Identities=12% Similarity=0.138 Sum_probs=14.3
Q ss_pred ccceEEEEEeCCC---eEEEeeecCcccCCCCceee
Q 001711 667 AWEAVMRIRCGKG---VRFTNYHGNFMLRSTDLLAL 699 (1021)
Q Consensus 667 g~~a~mrVR~S~G---l~V~~~~Gnf~~rs~~~~~l 699 (1021)
.|.|.+---|-+| ++|.++-+++..+..+++++
T Consensus 717 ~fQ~AlLYTti~G~RR~Rv~Nlsl~~ts~l~~lyr~ 752 (1007)
T KOG1984|consen 717 HFQTALLYTTIDGQRRLRVLNLSLAVTSQLSELYRS 752 (1007)
T ss_pred eEEEEEEEeccCCceeEEEEecchhhhhhHHHHHHh
Confidence 3444443334444 44555555544433344333
No 58
>cd01458 vWA_ku Ku70/Ku80 N-terminal domain. The Ku78 heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks (DSB) in a preferred orientation. DSB's are repaired by either homologues recombination or non-homologues end joining and facilitate repair by the non-homologous end-joining pathway (NHEJ). The Ku heterodimer is required for accurate process that tends to preserve the sequence at the junction. Ku78 is found in all three kingdoms of life. However, only the eukaryotic proteins have a vWA domain fused to them at their N-termini. The vWA domain is not involved in DNA binding but may very likey mediate Ku78's interactions with other proteins. Members of this subgroup lack the conserved MIDAS motif.
Probab=96.87 E-value=0.023 Score=61.03 Aligned_cols=154 Identities=21% Similarity=0.282 Sum_probs=90.6
Q ss_pred eEEEEEecchhHHhh------cHHHHHHHHHHHHHhcC-CCCCCceEEEEEEcCeEEEEecCCCCCCcceeecccccccc
Q 001711 429 LYFFLIDVSISAIRS------GMLEVVAQTIKSCLDEL-PGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIF 501 (1021)
Q Consensus 429 ~yvFvIDvS~~av~s------G~l~~~~~sI~~~L~~L-p~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f 501 (1021)
..+|+||+|.++.+. ..++.+++.|...+... -..+..+||+|.|++.-+-- ...-..+.|+.++..+
T Consensus 3 ~ivf~iDvS~SM~~~~~~~~~s~l~~a~~~i~~~~~~ki~~~~~D~vGlilf~t~~~~~----~~~~~~i~v~~~l~~~- 77 (218)
T cd01458 3 SVVFLVDVSPSMFESKDGEYESPFEEALKCIRQLMKSKIISSPKDLVGVVFYGTEESKN----PVGYENIYVLLDLDTP- 77 (218)
T ss_pred EEEEEEeCCHHHcCCCCCCCCChHHHHHHHHHHHHHhceeCCCCCeEEEEEEcccCCCC----cCCCCceEEeecCCCC-
Confidence 479999999988522 35778888888888852 11233689999997653210 0011123333333211
Q ss_pred CCCCCccceehhhhHHHHHHHHhhCCCc-c----cCCCCcccchHHHHHHHHHHHHh-----cCCEEEEEecCCCCCCcc
Q 001711 502 VPLPDDLLVNLSESRSVVDTLLDSLPSM-F----QDNMNVESAFGPALKAAFMVMSR-----LGGKLLIFQNSLPSLGVG 571 (1021)
Q Consensus 502 ~Pl~~~lLv~l~es~~~I~~lLd~Lp~~-f----~~~~~~~~alG~AL~aA~~lL~~-----~GGkIivF~sg~Pt~GpG 571 (1021)
..+.|+.+++.+..- . ......+..++.||..|..+++. ..-+|++|+++--..| |
T Consensus 78 -------------~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~l~~aL~~a~~~~~~~~~~~~~k~IvL~TDg~~p~~-~ 143 (218)
T cd01458 78 -------------GAERVEDLKELIEPGGLSFAGQVGDSGQVSLSDALWVCLDLFSKGKKKKSHKRIFLFTNNDDPHG-G 143 (218)
T ss_pred -------------CHHHHHHHHHHhhcchhhhcccCCCCCCccHHHHHHHHHHHHHhccccccccEEEEECCCCCCCC-C
Confidence 123334444433211 0 01123577899999999999985 2346888888643222 0
Q ss_pred cccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCC
Q 001711 572 CLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDK 621 (1021)
Q Consensus 572 ~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~ 621 (1021)
+ . -...-+.+++.++.+.||.|.+|.+...
T Consensus 144 ------~--------~------~~~~~~~~~a~~l~~~gI~i~~i~i~~~ 173 (218)
T cd01458 144 ------D--------S------IKDSQAAVKAEDLKDKGIELELFPLSSP 173 (218)
T ss_pred ------C--------H------HHHHHHHHHHHHHHhCCcEEEEEecCCC
Confidence 0 0 0123356788899999999999887543
No 59
>PF04056 Ssl1: Ssl1-like; InterPro: IPR007198 Ssl1-like proteins are 40 kDa subunits of the transcription factor II H complex. This domain is often found associated with the C2H2 type Zn-finger (IPR007087 from INTERPRO).; GO: 0008270 zinc ion binding, 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent
Probab=96.80 E-value=0.0066 Score=64.10 Aligned_cols=163 Identities=20% Similarity=0.263 Sum_probs=103.1
Q ss_pred EEecchhHHhhc----HHHHHHHHHHHHHhcC-CCCCCceEEEEEEcCe-EEEEecCCCCCCcceeeccccccccCCCCC
Q 001711 433 LIDVSISAIRSG----MLEVVAQTIKSCLDEL-PGFPRTQIGFITFDST-IHFYNMKSSLTQPQMMVISDLDDIFVPLPD 506 (1021)
Q Consensus 433 vIDvS~~av~sG----~l~~~~~sI~~~L~~L-p~~~rt~VgiITFds~-V~fynl~~~~~~p~mlVvsDldd~f~Pl~~ 506 (1021)
|||.|..+.+.- .++++++.+..-+++. ..+|-.++|||+.-+. .+. ++++
T Consensus 1 viD~S~~m~~~D~~PtRl~~~~~~l~~Fv~eff~qNPiSqlgii~~~~~~a~~--------------ls~l--------- 57 (193)
T PF04056_consen 1 VIDMSEAMREKDLKPTRLQCVLKALEEFVREFFDQNPISQLGIIVMRDGRAER--------------LSEL--------- 57 (193)
T ss_pred CeechHhHHhCcCCccHHHHHHHHHHHHHHHHHhcCChhheeeeeeecceeEE--------------eeec---------
Confidence 589998875432 4666777766666653 3467789999987432 221 1221
Q ss_pred ccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---CC-EEEEEecCCCCCCcccccccCCcCcc
Q 001711 507 DLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---GG-KLLIFQNSLPSLGVGCLKLRGDDLRV 582 (1021)
Q Consensus 507 ~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---GG-kIivF~sg~Pt~GpG~L~~r~~~~r~ 582 (1021)
+-+-....+.|+++.+ ..-..+..+-.||+.|...|++. |. .|+++.+++-|..||.
T Consensus 58 ------sgn~~~h~~~L~~~~~---~~~~G~~SLqN~Le~A~~~L~~~p~~~srEIlvi~gSl~t~Dp~d---------- 118 (193)
T PF04056_consen 58 ------SGNPQEHIEALKKLRK---LEPSGEPSLQNGLEMARSSLKHMPSHGSREILVIFGSLTTCDPGD---------- 118 (193)
T ss_pred ------CCCHHHHHHHHHHhcc---CCCCCChhHHHHHHHHHHHHhhCccccceEEEEEEeecccCCchh----------
Confidence 1111122223333322 22356678999999999999864 33 5666666665555442
Q ss_pred cCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHhc
Q 001711 583 YGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRDL 662 (1021)
Q Consensus 583 ~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~l 662 (1021)
+.+..+.+.+.+|-||+..++. .+..+..||+.|||..... .|.+.|..-|....
T Consensus 119 ----------------i~~ti~~l~~~~IrvsvI~laa---Ev~I~k~i~~~T~G~y~V~------lde~H~~~lL~~~~ 173 (193)
T PF04056_consen 119 ----------------IHETIESLKKENIRVSVISLAA---EVYICKKICKETGGTYGVI------LDEDHFKELLMEHV 173 (193)
T ss_pred ----------------HHHHHHHHHHcCCEEEEEEEhH---HHHHHHHHHHhhCCEEEEe------cCHHHHHHHHHhhC
Confidence 2366788999999999999986 4777899999999954433 34455655555544
No 60
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton]
Probab=96.70 E-value=0.011 Score=71.72 Aligned_cols=12 Identities=17% Similarity=0.382 Sum_probs=5.9
Q ss_pred HHHHhhcCCceE
Q 001711 328 QSLVSRWHLPLG 339 (1021)
Q Consensus 328 ~~l~~~~~lPlg 339 (1021)
.+++.+..+=|+
T Consensus 656 ~dlfakL~~~Fa 667 (1102)
T KOG1924|consen 656 DDLFAKLALKFA 667 (1102)
T ss_pred hHHHHHHHHHhh
Confidence 455555444443
No 61
>KOG0443 consensus Actin regulatory proteins (gelsolin/villin family) [Cytoskeleton]
Probab=96.61 E-value=0.0047 Score=75.39 Aligned_cols=91 Identities=16% Similarity=0.227 Sum_probs=61.0
Q ss_pred hhhcccEEEeecCCCCCCccCCcccccccccccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhhccc
Q 001711 866 KLLYPCLIRVDEHLLKPSAQLDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAELSKV 945 (1021)
Q Consensus 866 ~~lYPrL~~lh~~~~~~~~~~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~ 945 (1021)
.-.-||||..+.-. +.+.+-+....+.+.|..+.|||||++..+|||||+.++++.....+..
T Consensus 616 ~~~~PrLF~Cs~~~--------g~f~~~EI~~F~QdDL~tdDi~lLDt~~evfvWvG~~a~~~eK~~Al~~--------- 678 (827)
T KOG0443|consen 616 PERDPRLFSCSNKT--------GSFVVEEIYNFTQDDLMTDDIMLLDTWSEVFVWVGQEANEKEKEEALTI--------- 678 (827)
T ss_pred CCCCCcEEEEEecC--------CcEEEEEecCcchhhccccceEEEecCceEEEEecCCCChhHHHHHHHH---------
Confidence 45678999988531 1122223346788999999999999999999999999988877554421
Q ss_pred cccccchHHHHHHHHHHHHHHHhCCCCCceEEEeccCCCc
Q 001711 946 MLREQDNEMSRKLLGILKKLREQDPSYYQLCQLVRQGEQP 985 (1021)
Q Consensus 946 ~lp~~~n~~s~~l~~ii~~lr~~r~~~~~l~~vvrqg~~~ 985 (1021)
.++-.+. + +-+.|.+.-|+ +||+||...
T Consensus 679 ---------~~~yl~~-~-~p~gr~~~TPI-~vV~qG~EP 706 (827)
T KOG0443|consen 679 ---------GQKYLET-D-LPEGRDPRTPI-YVVKQGHEP 706 (827)
T ss_pred ---------HHHHHhc-c-CcccCCCCCce-EEecCCCCC
Confidence 1111111 1 23345566788 999998544
No 62
>COG4245 TerY Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [General function prediction only]
Probab=96.38 E-value=0.066 Score=55.64 Aligned_cols=158 Identities=19% Similarity=0.313 Sum_probs=92.1
Q ss_pred CeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCC----CCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCC
Q 001711 428 PLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGF----PRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVP 503 (1021)
Q Consensus 428 p~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~----~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~P 503 (1021)
|+ +|++|+|.+++-. -++++-.+|+..++.|..+ .+.+++|||||+.++.|.- ..|++. |-|
T Consensus 5 P~-~lllDtSgSM~Ge-~IealN~Glq~m~~~Lkqdp~Ale~v~lsIVTF~~~a~~~~p-----------f~~~~n-F~~ 70 (207)
T COG4245 5 PC-YLLLDTSGSMIGE-PIEALNAGLQMMIDTLKQDPYALERVELSIVTFGGPARVIQP-----------FTDAAN-FNP 70 (207)
T ss_pred CE-EEEEecCcccccc-cHHHHHHHHHHHHHHHHhChhhhheeEEEEEEecCcceEEec-----------hhhHhh-cCC
Confidence 44 4699999988643 3677778888888877654 4689999999987666531 122221 111
Q ss_pred CCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc------CC------EEEEEecCCCCCCcc
Q 001711 504 LPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL------GG------KLLIFQNSLPSLGVG 571 (1021)
Q Consensus 504 l~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~------GG------kIivF~sg~Pt~GpG 571 (1021)
|..+ ...++.+|+||+.|.++++.. .| -|++.+.|-||
T Consensus 71 -----------------------p~L~---a~GgT~lGaAl~~a~d~Ie~~~~~~~a~~kgdyrP~vfLiTDG~Pt---- 120 (207)
T COG4245 71 -----------------------PILT---AQGGTPLGAALTLALDMIEERKRKYDANGKGDYRPWVFLITDGEPT---- 120 (207)
T ss_pred -----------------------Ccee---cCCCCchHHHHHHHHHHHHHHHhhcccCCccccceEEEEecCCCcc----
Confidence 1111 236788999999999999642 11 34555555442
Q ss_pred cccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhh--CCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCch
Q 001711 572 CLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTK--FQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTT 649 (1021)
Q Consensus 572 ~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~--~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~ 649 (1021)
+++=+.++..... ...+|=.|.+..+..|...|..+.+ ++..+.. .
T Consensus 121 ------------------------D~w~~~~~~~~~~~~~~k~v~a~~~G~~~ad~~~L~qit~----~V~~~~t----~ 168 (207)
T COG4245 121 ------------------------DDWQAGAALVFQGERRAKSVAAFSVGVQGADNKTLNQITE----KVRQFLT----L 168 (207)
T ss_pred ------------------------hHHHhHHHHhhhcccccceEEEEEecccccccHHHHHHHH----hhccccc----c
Confidence 2222222222211 2234555666666678777777653 3333332 3
Q ss_pred hHHHHHHHHHHh
Q 001711 650 HGERLRHELSRD 661 (1021)
Q Consensus 650 d~~kl~~dL~r~ 661 (1021)
|..+|...+.+.
T Consensus 169 d~~~f~~fFkW~ 180 (207)
T COG4245 169 DGLQFREFFKWL 180 (207)
T ss_pred chHHHHHHHHHH
Confidence 556676666553
No 63
>KOG2884 consensus 26S proteasome regulatory complex, subunit RPN10/PSMD4 [Posttranslational modification, protein turnover, chaperones]
Probab=96.30 E-value=0.1 Score=55.21 Aligned_cols=154 Identities=16% Similarity=0.277 Sum_probs=96.5
Q ss_pred eEEEEEecchhHHhhc-----HHHHHHHHHHHHHh-cCCCCCCceEEEEEEcC-eEEEEecCCCCCCcceeecccccccc
Q 001711 429 LYFFLIDVSISAIRSG-----MLEVVAQTIKSCLD-ELPGFPRTQIGFITFDS-TIHFYNMKSSLTQPQMMVISDLDDIF 501 (1021)
Q Consensus 429 ~yvFvIDvS~~av~sG-----~l~~~~~sI~~~L~-~Lp~~~rt~VgiITFds-~V~fynl~~~~~~p~mlVvsDldd~f 501 (1021)
+.+.|||-|.-+.+ | .+++=+++|..... .+..++...|||||... .+.+..
T Consensus 5 atmi~iDNse~mrN-gDy~PtRf~aQ~daVn~v~~~K~~snpEntvGiitla~a~~~vLs-------------------- 63 (259)
T KOG2884|consen 5 ATMICIDNSEYMRN-GDYLPTRFQAQKDAVNLVCQAKLRSNPENTVGIITLANASVQVLS-------------------- 63 (259)
T ss_pred eEEEEEeChHHhhc-CCCChHHHHHHHHHHHHHHHhhhcCCcccceeeEeccCCCceeee--------------------
Confidence 56889999887643 4 35555555554443 34445556799999864 333321
Q ss_pred CCCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC-----CEEEEEecCCCCCCccccccc
Q 001711 502 VPLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG-----GKLLIFQNSLPSLGVGCLKLR 576 (1021)
Q Consensus 502 ~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G-----GkIivF~sg~Pt~GpG~L~~r 576 (1021)
.+...+-.|...|..|. ...+.-++.+|+.|..+||++- -||++|.+++-.
T Consensus 64 ---------T~T~d~gkils~lh~i~------~~g~~~~~~~i~iA~lalkhRqnk~~~~riVvFvGSpi~--------- 119 (259)
T KOG2884|consen 64 ---------TLTSDRGKILSKLHGIQ------PHGKANFMTGIQIAQLALKHRQNKNQKQRIVVFVGSPIE--------- 119 (259)
T ss_pred ---------eccccchHHHHHhcCCC------cCCcccHHHHHHHHHHHHHhhcCCCcceEEEEEecCcch---------
Confidence 11222334444555554 2345568999999999999853 588999987621
Q ss_pred CCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhcccccc-----EEEEeCC
Q 001711 577 GDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGG-----QVYYYPS 644 (1021)
Q Consensus 577 ~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG-----~v~~y~~ 644 (1021)
+.| +-.-++|.++.+.+|.|||.-|+....+-.-+......++| ++...+.
T Consensus 120 ---------e~e--------keLv~~akrlkk~~Vaidii~FGE~~~~~e~l~~fida~N~~~~gshlv~Vpp 175 (259)
T KOG2884|consen 120 ---------ESE--------KELVKLAKRLKKNKVAIDIINFGEAENNTEKLFEFIDALNGKGDGSHLVSVPP 175 (259)
T ss_pred ---------hhH--------HHHHHHHHHHHhcCeeEEEEEeccccccHHHHHHHHHHhcCCCCCceEEEeCC
Confidence 112 22357999999999999999998766664444444444444 3555554
No 64
>cd01462 VWA_YIEM_type VWA YIEM type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=96.16 E-value=0.13 Score=51.61 Aligned_cols=130 Identities=15% Similarity=0.142 Sum_probs=75.0
Q ss_pred EEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCccc
Q 001711 430 YFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDLL 509 (1021)
Q Consensus 430 yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~lL 509 (1021)
++|+||+|.++-.+ -++.++..+...++.+.. .+.+|+||+|++..+.+.+..
T Consensus 3 v~illD~SgSM~~~-k~~~a~~~~~~l~~~~~~-~~~~v~li~F~~~~~~~~~~~------------------------- 55 (152)
T cd01462 3 VILLVDQSGSMYGA-PEEVAKAVALALLRIALA-ENRDTYLILFDSEFQTKIVDK------------------------- 55 (152)
T ss_pred EEEEEECCCCCCCC-HHHHHHHHHHHHHHHHHH-cCCcEEEEEeCCCceEEecCC-------------------------
Confidence 68999999988532 244455555555555432 125799999998733221110
Q ss_pred eehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---CCEEEEEecCCCCCCcccccccCCcCcccCCC
Q 001711 510 VNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---GGKLLIFQNSLPSLGVGCLKLRGDDLRVYGTD 586 (1021)
Q Consensus 510 v~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---GGkIivF~sg~Pt~GpG~L~~r~~~~r~~gt~ 586 (1021)
... +..+++.|..+. ...++.++.||..+.+.++.. .+.|++++.|..+.
T Consensus 56 ---~~~---~~~~~~~l~~~~---~~ggT~l~~al~~a~~~l~~~~~~~~~ivliTDG~~~~------------------ 108 (152)
T cd01462 56 ---TDD---LEEPVEFLSGVQ---LGGGTDINKALRYALELIERRDPRKADIVLITDGYEGG------------------ 108 (152)
T ss_pred ---ccc---HHHHHHHHhcCC---CCCCcCHHHHHHHHHHHHHhcCCCCceEEEECCCCCCC------------------
Confidence 011 122233332221 245678999999999998763 46777777764110
Q ss_pred ccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCC
Q 001711 587 KEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDK 621 (1021)
Q Consensus 587 ~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~ 621 (1021)
...+.. +.+....+.++.|..+.++.+
T Consensus 109 -------~~~~~~-~~~~~~~~~~~~v~~~~~g~~ 135 (152)
T cd01462 109 -------VSDELL-REVELKRSRVARFVALALGDH 135 (152)
T ss_pred -------CCHHHH-HHHHHHHhcCcEEEEEEecCC
Confidence 011222 334444566789999988764
No 65
>TIGR00578 ku70 ATP-dependent DNA helicase ii, 70 kDa subunit (ku70). Proteins in this family are involved in non-homologous end joining, a process used for the repair of double stranded DNA breaks. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). Cutoff does not detect the putative ku70 homologs in yeast.
Probab=95.48 E-value=0.23 Score=61.39 Aligned_cols=162 Identities=17% Similarity=0.260 Sum_probs=90.2
Q ss_pred eEEEEEecchhHHh-------hcHHHHHHHHHHHHHhc-CCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccc
Q 001711 429 LYFFLIDVSISAIR-------SGMLEVVAQTIKSCLDE-LPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDI 500 (1021)
Q Consensus 429 ~yvFvIDvS~~av~-------sG~l~~~~~sI~~~L~~-Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~ 500 (1021)
..|||||+|.++.+ ..-+..++++|...+.. +-.+++..|||+.|++.=+ ++.+.-....|+.||+.+
T Consensus 12 ailflIDvs~sM~~~~~~~~~~s~~~~al~~i~~l~q~kIis~~~D~vGivlfgT~~t----~n~~~~~~i~v~~~L~~p 87 (584)
T TIGR00578 12 SLIFLVDASKAMFEESQGEDELTPFDMSIQCIQSVYTSKIISSDKDLLAVVFYGTEKD----KNSVNFKNIYVLQELDNP 87 (584)
T ss_pred EEEEEEECCHHHcCCCcCcCcCChHHHHHHHHHHHHHhcCCCCCCCeEEEEEEeccCC----CCccCCCceEEEeeCCCC
Confidence 68999999999864 12355666777777764 2234568999999976422 122223355666666542
Q ss_pred cCCCCCccceehhhhHHHHHHHHhh-CCCcccC--CCCcccchHHHHHHHHHHHHh----cCC-EEEEEecCCCCCCccc
Q 001711 501 FVPLPDDLLVNLSESRSVVDTLLDS-LPSMFQD--NMNVESAFGPALKAAFMVMSR----LGG-KLLIFQNSLPSLGVGC 572 (1021)
Q Consensus 501 f~Pl~~~lLv~l~es~~~I~~lLd~-Lp~~f~~--~~~~~~alG~AL~aA~~lL~~----~GG-kIivF~sg~Pt~GpG~ 572 (1021)
- .+....|++|++. -...|.. .......+..||.+|..++.. .+. ||++||+.---
T Consensus 88 ~-----------a~~i~~L~~l~~~~~~~~~~~~~~~~~~~~l~daL~~~~~~f~~~~~k~~~kRI~lfTd~D~P----- 151 (584)
T TIGR00578 88 G-----------AKRILELDQFKGDQGPKKFRDTYGHGSDYSLSEVLWVCANLFSDVQFRMSHKRIMLFTNEDNP----- 151 (584)
T ss_pred C-----------HHHHHHHHHHhhccCccchhhccCCCCCCcHHHHHHHHHHHHHhcchhhcCcEEEEECCCCCC-----
Confidence 1 1122223333332 1111111 112234789999999999965 233 58998863211
Q ss_pred ccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecC-CCcChh
Q 001711 573 LKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSD-KYTDIA 626 (1021)
Q Consensus 573 L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~-~~~dia 626 (1021)
++.++. ...-=...|.++.+.||.+++|.++. +.+|+.
T Consensus 152 ----------~~~~~~------~~~~a~~~a~dl~~~gi~ielf~l~~~~~Fd~s 190 (584)
T TIGR00578 152 ----------HGNDSA------KASRARTKAGDLRDTGIFLDLMHLKKPGGFDIS 190 (584)
T ss_pred ----------CCCchh------HHHHHHHHHHHHHhcCeEEEEEecCCCCCCChh
Confidence 111100 00111346888999999999996542 224444
No 66
>COG5148 RPN10 26S proteasome regulatory complex, subunit RPN10/PSMD4 [Posttranslational modification, protein turnover, chaperones]
Probab=95.06 E-value=0.69 Score=48.04 Aligned_cols=133 Identities=20% Similarity=0.320 Sum_probs=89.4
Q ss_pred CeEEEEEecchhHHhhc----HHHHHHHHHHHHHhc-CCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccC
Q 001711 428 PLYFFLIDVSISAIRSG----MLEVVAQTIKSCLDE-LPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFV 502 (1021)
Q Consensus 428 p~yvFvIDvS~~av~sG----~l~~~~~sI~~~L~~-Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~ 502 (1021)
-+.|.+||-|..+.+.- .+++-++++...+.. ..+++...||||+... .+|+.
T Consensus 4 EatvvliDNse~s~NgDy~ptRFeAQkd~ve~if~~K~ndnpEntiGli~~~~-----------a~p~v----------- 61 (243)
T COG5148 4 EATVVLIDNSEASQNGDYLPTRFEAQKDAVESIFSKKFNDNPENTIGLIPLVQ-----------AQPNV----------- 61 (243)
T ss_pred ceEEEEEeChhhhhcCCCCcHHHHHHHHHHHHHHHHHhcCCccceeeeeeccc-----------CCcch-----------
Confidence 46789999998775422 366777777777763 4455666799998542 12321
Q ss_pred CCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---C--CEEEEEecCCCCCCcccccccC
Q 001711 503 PLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---G--GKLLIFQNSLPSLGVGCLKLRG 577 (1021)
Q Consensus 503 Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---G--GkIivF~sg~Pt~GpG~L~~r~ 577 (1021)
|..+...+-.|...|..++- +.+--++-+|+.|..+|++. | -+|++|.+++-.
T Consensus 62 ------lsT~T~~~gkilt~lhd~~~------~g~a~~~~~lqiaql~lkhR~nk~q~qriVaFvgSpi~---------- 119 (243)
T COG5148 62 ------LSTPTKQRGKILTFLHDIRL------HGGADIMRCLQIAQLILKHRDNKGQRQRIVAFVGSPIQ---------- 119 (243)
T ss_pred ------hccchhhhhHHHHHhccccc------cCcchHHHHHHHHHHHHhcccCCccceEEEEEecCccc----------
Confidence 22234456667777777752 34445889999999999984 3 689999987521
Q ss_pred CcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecC
Q 001711 578 DDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSD 620 (1021)
Q Consensus 578 ~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~ 620 (1021)
+.| +-.-.+|..+.+++|.||+.-|+.
T Consensus 120 --------ese--------deLirlak~lkknnVAidii~fGE 146 (243)
T COG5148 120 --------ESE--------DELIRLAKQLKKNNVAIDIIFFGE 146 (243)
T ss_pred --------ccH--------HHHHHHHHHHHhcCeeEEEEehhh
Confidence 111 223468999999999999998763
No 67
>cd01457 vWA_ORF176_type VWA ORF176 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most
Probab=94.58 E-value=0.42 Score=50.53 Aligned_cols=146 Identities=17% Similarity=0.221 Sum_probs=80.3
Q ss_pred eEEEEEecchhHHhh----c--HHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccC
Q 001711 429 LYFFLIDVSISAIRS----G--MLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFV 502 (1021)
Q Consensus 429 ~yvFvIDvS~~av~s----G--~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~ 502 (1021)
-++|+||+|.++-.. + -++.+++++...+..+......+|++++|++..+-+ .
T Consensus 4 dvv~~ID~SgSM~~~~~~~~~~k~~~ak~~~~~l~~~~~~~D~d~i~l~~f~~~~~~~---------------------~ 62 (199)
T cd01457 4 DYTLLIDKSGSMAEADEAKERSRWEEAQESTRALARKCEEYDSDGITVYLFSGDFRRY---------------------D 62 (199)
T ss_pred CEEEEEECCCcCCCCCCCCCchHHHHHHHHHHHHHHHHHhcCCCCeEEEEecCCcccc---------------------C
Confidence 379999999998532 1 256666666666665443223568888886542110 0
Q ss_pred CCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHH-HHHhc--------CCEEEEEecCCCCCCcccc
Q 001711 503 PLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFM-VMSRL--------GGKLLIFQNSLPSLGVGCL 573 (1021)
Q Consensus 503 Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~-lL~~~--------GGkIivF~sg~Pt~GpG~L 573 (1021)
+ +. ++.+.++++.+.. ...+.++.||+.++. +++.. +..||+++.|.++- ...+
T Consensus 63 ~--------~~--~~~v~~~~~~~~p------~G~T~l~~~l~~a~~~~~~~~~~~~~~p~~~~vIiiTDG~~~d-~~~~ 125 (199)
T cd01457 63 N--------VN--SSKVDQLFAENSP------DGGTNLAAVLQDALNNYFQRKENGATCPEGETFLVITDGAPDD-KDAV 125 (199)
T ss_pred C--------cC--HHHHHHHHhcCCC------CCcCcHHHHHHHHHHHHHHHHhhccCCCCceEEEEEcCCCCCc-HHHH
Confidence 1 11 4555666655432 245789999998874 33321 34566677766541 1100
Q ss_pred cccCCcCcccCCCccccCCCCCcHHHHHHHHHHhh-CCcEEEEEEecCCCcChhhhhhhccc
Q 001711 574 KLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTK-FQIAVNVYAFSDKYTDIASLGTLAKY 634 (1021)
Q Consensus 574 ~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~-~gIsVDlF~~s~~~~diatl~~L~~~ 634 (1021)
. +.-.+.+.++.+ .+|++.++.++.+.-+...|..|...
T Consensus 126 ~----------------------~~i~~a~~~l~~~~~i~i~~v~vG~~~~~~~~L~~ld~~ 165 (199)
T cd01457 126 E----------------------RVIIKASDELDADNELAISFLQIGRDPAATAFLKALDDQ 165 (199)
T ss_pred H----------------------HHHHHHHHhhccccCceEEEEEeCCcHHHHHHHHHHhHH
Confidence 0 000111111111 47888888887766665556665543
No 68
>cd01460 vWA_midasin VWA_Midasin: Midasin is a member of the AAA ATPase family. The proteins of this family are unified by their common archetectural organization that is based upon a conserved ATPase domain. The AAA domain of midasin contains six tandem AAA protomers. The AAA domains in midasin is followed by a D/E rich domain that is following by a VWA domain. The members of this subgroup have a conserved MIDAS motif. The function of this domain is not exactly known although it has been speculated to play a crucial role in midasin function.
Probab=94.41 E-value=0.53 Score=52.42 Aligned_cols=132 Identities=19% Similarity=0.201 Sum_probs=77.2
Q ss_pred CCCeEEEEEecchhHHhhcH----HHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeecccccccc
Q 001711 426 MPPLYFFLIDVSISAIRSGM----LEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIF 501 (1021)
Q Consensus 426 ~pp~yvFvIDvS~~av~sG~----l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f 501 (1021)
...-++|+||+|.++.++.. ++ .+..|.++|+.+.. -+|||+.|+.++.+ +.++++.|
T Consensus 59 r~~qIvlaID~S~SM~~~~~~~~ale-ak~lIs~al~~Le~---g~vgVv~Fg~~~~~--------------v~Plt~d~ 120 (266)
T cd01460 59 RDYQILIAIDDSKSMSENNSKKLALE-SLCLVSKALTLLEV---GQLGVCSFGEDVQI--------------LHPFDEQF 120 (266)
T ss_pred cCceEEEEEecchhcccccccccHHH-HHHHHHHHHHhCcC---CcEEEEEeCCCceE--------------eCCCCCCc
Confidence 45678999999999865443 33 45567777777765 47999999976431 22222211
Q ss_pred CCCCCccceehhhhHHHHHHHHhhCCC-cccCCCCcccchHHHHHHHHHHHHhc-----CC---EEEEEec-CCCCCCcc
Q 001711 502 VPLPDDLLVNLSESRSVVDTLLDSLPS-MFQDNMNVESAFGPALKAAFMVMSRL-----GG---KLLIFQN-SLPSLGVG 571 (1021)
Q Consensus 502 ~Pl~~~lLv~l~es~~~I~~lLd~Lp~-~f~~~~~~~~alG~AL~aA~~lL~~~-----GG---kIivF~s-g~Pt~GpG 571 (1021)
.. +..++.+.. .|. ..++.++.||..|..+++.. +| ++++..| |-+.
T Consensus 121 ---------------~~-~a~~~~l~~~~f~---~~~Tni~~aL~~a~~~f~~~~~~~~s~~~~qlilLISDG~~~---- 177 (266)
T cd01460 121 ---------------SS-QSGPRILNQFTFQ---QDKTDIANLLKFTAQIFEDARTQSSSGSLWQLLLIISDGRGE---- 177 (266)
T ss_pred ---------------hh-hHHHHHhCcccCC---CCCCcHHHHHHHHHHHHHhhhccccccccccEEEEEECCCcc----
Confidence 11 222333321 222 23456999999999998754 32 5555544 2211
Q ss_pred cccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecC
Q 001711 572 CLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSD 620 (1021)
Q Consensus 572 ~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~ 620 (1021)
.. | .--+..+.++.+.+|.|-..+.-.
T Consensus 178 -~~-------------e--------~~~~~~~r~a~e~~i~l~~I~ld~ 204 (266)
T cd01460 178 -FS-------------E--------GAQKVRLREAREQNVFVVFIIIDN 204 (266)
T ss_pred -cC-------------c--------cHHHHHHHHHHHcCCeEEEEEEcC
Confidence 00 0 001345788889998887776544
No 69
>KOG0443 consensus Actin regulatory proteins (gelsolin/villin family) [Cytoskeleton]
Probab=94.13 E-value=0.19 Score=62.08 Aligned_cols=79 Identities=25% Similarity=0.284 Sum_probs=53.6
Q ss_pred cchhhccCCcEEEEECC-ceeEEEecCCCCHHHHHhhcCCchhhhhccccccccchHHHHHHHHHHHHHHHh-CCCCCce
Q 001711 898 LVAESLDSRGLYIFDDG-FRFVLWFGRMLSPDIAMNLLGSEFAAELSKVMLREQDNEMSRKLLGILKKLREQ-DPSYYQL 975 (1021)
Q Consensus 898 LS~~~L~~~giyLlD~G-~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~~lp~~~n~~s~~l~~ii~~lr~~-r~~~~~l 975 (1021)
|+.+-|+.+++||||+| ..||||+|+.++.+-.+..+ ...+++| |.. +..+-.+
T Consensus 277 l~qdlLd~~dCYILD~g~~~IfVW~Gr~as~~ERkaAm---------------------~~AeeFl---k~k~yP~~TqV 332 (827)
T KOG0443|consen 277 LTKDLLDTEDCYILDCGGGEIFVWKGRQASLDERKAAM---------------------SSAEEFL---KKKKYPPNTQV 332 (827)
T ss_pred hhHHhhccCCeEEEecCCceEEEEeCCCCCHHHHHHHH---------------------HHHHHHH---HhccCCCCceE
Confidence 88899999999999999 99999999999776543222 2233344 443 4566666
Q ss_pred EEEeccCC-CcchHHHHHhhccccCCC
Q 001711 976 CQLVRQGE-QPREGFLLLANLVEDQIG 1001 (1021)
Q Consensus 976 ~~vvrqg~-~~~~e~~f~~~LVED~~~ 1001 (1021)
.+|-+|- +.....+|.+..-+|+++
T Consensus 333 -~rv~EG~Esa~FKq~F~~W~~~~~t~ 358 (827)
T KOG0443|consen 333 -VRVLEGAESAPFKQLFDSWPDKDQTN 358 (827)
T ss_pred -EEecCCCcchhHHHHHhhCccccccc
Confidence 6666653 332234666777777765
No 70
>cd01455 vWA_F11C1-5a_type Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A
Probab=93.70 E-value=3.2 Score=44.05 Aligned_cols=98 Identities=10% Similarity=0.068 Sum_probs=61.2
Q ss_pred hhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHH-h--cCCEEEEEec-CCCCCCcccccccCCcCcccCCCccc
Q 001711 514 ESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMS-R--LGGKLLIFQN-SLPSLGVGCLKLRGDDLRVYGTDKEH 589 (1021)
Q Consensus 514 es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~-~--~GGkIivF~s-g~Pt~GpG~L~~r~~~~r~~gt~~e~ 589 (1021)
+..+.+..+|+.+.--+.. ..++ .||..|++.|+ . ...|+++..+ |-=|.| +
T Consensus 72 ~~~~~l~~~l~~~q~g~ag---~~Ta--dAi~~av~rl~~~~~a~~kvvILLTDG~n~~~--------------~----- 127 (191)
T cd01455 72 ERLETLKMMHAHSQFCWSG---DHTV--EATEFAIKELAAKEDFDEAIVIVLSDANLERY--------------G----- 127 (191)
T ss_pred hHHHHHHHHHHhcccCccC---ccHH--HHHHHHHHHHHhcCcCCCcEEEEEeCCCcCCC--------------C-----
Confidence 4456788888887543322 1233 88888888886 4 2355555544 321110 0
Q ss_pred cCCCCCcHHHHHH-HHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCC
Q 001711 590 SLRIPEDPFYKQM-AADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPS 644 (1021)
Q Consensus 590 ~l~~pa~~fY~~L-a~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~ 644 (1021)
..| .+. |+.+.+.||-|..+.++. .|-.++..+++.|||+.|.-.+
T Consensus 128 --i~P-----~~aAa~lA~~~gV~iytIgiG~--~d~~~l~~iA~~tgG~~F~A~d 174 (191)
T cd01455 128 --IQP-----KKLADALAREPNVNAFVIFIGS--LSDEADQLQRELPAGKAFVCMD 174 (191)
T ss_pred --CCh-----HHHHHHHHHhCCCEEEEEEecC--CCHHHHHHHHhCCCCcEEEeCC
Confidence 011 344 355667888887777765 3677899999999999998754
No 71
>TIGR00627 tfb4 transcription factor tfb4. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=93.29 E-value=5.4 Score=44.82 Aligned_cols=95 Identities=18% Similarity=0.169 Sum_probs=62.8
Q ss_pred cccchHHHHHHHHHHHHh----------cCCEEEEEecCCCCCCcccccccCCcCcccCCCccccCCCCCcHHHHHHHHH
Q 001711 536 VESAFGPALKAAFMVMSR----------LGGKLLIFQNSLPSLGVGCLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAAD 605 (1021)
Q Consensus 536 ~~~alG~AL~aA~~lL~~----------~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~ 605 (1021)
.++.+..||..|+-.+.. ..+||+++..+. |. ..+.-=+-+....
T Consensus 117 ~~s~lagals~ALcyinr~~~~~~~~~~~~~RIlii~~s~------------~~-------------~~qYi~~mn~Ifa 171 (279)
T TIGR00627 117 SRTVLAGALSDALGYINRSEQSETASEKLKSRILVISITP------------DM-------------ALQYIPLMNCIFS 171 (279)
T ss_pred ccccchhHHHhhhhhhcccccccccCcCCcceEEEEECCC------------Cc-------------hHHHHHHHHHHHH
Confidence 466788888888877743 247888887631 10 1112223477788
Q ss_pred HhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHhc
Q 001711 606 LTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRDL 662 (1021)
Q Consensus 606 ~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~l 662 (1021)
|.+.+|.||++..+.+ .|..-+..+++.|||...... |.+.|...|...+
T Consensus 172 aqk~~I~Idv~~L~~e-~~~~~lqQa~~~TgG~Y~~~~------~~~~L~q~L~~~~ 221 (279)
T TIGR00627 172 AQKQNIPIDVVSIGGD-FTSGFLQQAADITGGSYLHVK------KPQGLLQYLMTNM 221 (279)
T ss_pred HHHcCceEEEEEeCCc-cccHHHHHHHHHhCCEEeccC------CHhHHHHHHHHhc
Confidence 9999999999988643 467889999999999544443 2344555554433
No 72
>PF03731 Ku_N: Ku70/Ku80 N-terminal alpha/beta domain; InterPro: IPR005161 The Ku heterodimer (composed of Ku70 P12956 from SWISSPROT and Ku80 P13010 from SWISSPROT) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the N-terminal alpha/beta domain. This domain only makes a small contribution to the dimer interface. The domain comprises a six stranded beta sheet of the Rossman fold [].; PDB: 1JEQ_A 1JEY_A.
Probab=92.72 E-value=0.77 Score=49.37 Aligned_cols=154 Identities=20% Similarity=0.242 Sum_probs=74.3
Q ss_pred eEEEEEecchhHHhh-----cHHHHHHHHHHHHHhcC-CCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccC
Q 001711 429 LYFFLIDVSISAIRS-----GMLEVVAQTIKSCLDEL-PGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFV 502 (1021)
Q Consensus 429 ~yvFvIDvS~~av~s-----G~l~~~~~sI~~~L~~L-p~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~ 502 (1021)
+.|||||+|.++.+. .-++.++++|...+.+. -..+...||||.|++.-.=-. .....-..+.++.+|+-
T Consensus 1 ~~vflID~s~sM~~~~~~~~~~l~~al~~i~~~~~~ki~~~~kD~vgvvl~gt~~t~n~-~~~~~~~~i~~l~~l~~--- 76 (224)
T PF03731_consen 1 ATVFLIDVSPSMFEPSSESESPLEEALKAIEDLMQQKIISSPKDEVGVVLFGTDETNNP-DEDSGYENIFVLQPLDP--- 76 (224)
T ss_dssp EEEEEEE-SCGGGS-BTTCS-HHHHHHHHHHHHHHHHHHTT---EEEEEEES-SS-BST--TTT-STTEEEEEECC----
T ss_pred CEEEEEECCHHHCCCCCCcchhHHHHHHHHHHHHHHHHcCCCCCeEEEEEEcCCCCCCc-ccccCCCceEEeecCCc---
Confidence 469999999988532 23666777777777642 122337899999975421000 00111123333333321
Q ss_pred CCCCccceehhhhHHHHHHHHhhCCC----cccCCCCcccchHHHHHHHHHHHHh--c-----CCEEEEEecCCCCCCcc
Q 001711 503 PLPDDLLVNLSESRSVVDTLLDSLPS----MFQDNMNVESAFGPALKAAFMVMSR--L-----GGKLLIFQNSLPSLGVG 571 (1021)
Q Consensus 503 Pl~~~lLv~l~es~~~I~~lLd~Lp~----~f~~~~~~~~alG~AL~aA~~lL~~--~-----GGkIivF~sg~Pt~GpG 571 (1021)
-+.+.|..|.+.+.. ........+..+..||.+|..+++. . .-||++|++.- +|-
T Consensus 77 -----------~~~~~l~~L~~~~~~~~~~~~~~~~~~~~~l~~al~v~~~~~~~~~~~~k~~~krI~l~Td~d---~p~ 142 (224)
T PF03731_consen 77 -----------PSAERLKELEELLKPGDKFENFFSGSDEGDLSDALWVASDMFRERTCKKKKNKKRIFLFTDND---GPH 142 (224)
T ss_dssp ------------BHHHHHHHHTTSHHHHHHHHHC-SSS---HHHHHHHHHHHHHCHCTTS-ECEEEEEEEES-S---STT
T ss_pred -----------cCHHHHHHHHHhhcccccccccCCCCCccCHHHHHHHHHHHHHHHhhcccCCCcEEEEEeCCC---CCC
Confidence 122333333333322 0011233456799999999999975 1 23677777621 111
Q ss_pred cccccCCcCcccCCCccccCCCCCcHHHHH-HHHHHhhCCcEEEEEEe
Q 001711 572 CLKLRGDDLRVYGTDKEHSLRIPEDPFYKQ-MAADLTKFQIAVNVYAF 618 (1021)
Q Consensus 572 ~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~-La~~~~~~gIsVDlF~~ 618 (1021)
. +.+ + -..-.++ .+.++...+|.+++|..
T Consensus 143 ~-----------~~~-~------~~~~~~~l~~~Dl~~~~i~~~~~~l 172 (224)
T PF03731_consen 143 E-----------DDD-E------LERIIQKLKAKDLQDNGIEIELFFL 172 (224)
T ss_dssp T------------CC-C------HHHHHHHHHHHHHHHHTEEEEEEEC
T ss_pred C-----------CHH-H------HHHHHHhhccccchhcCcceeEeec
Confidence 0 000 0 0011111 26779999999999987
No 73
>PF03850 Tfb4: Transcription factor Tfb4; InterPro: IPR004600 Members of this family are part of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. The core-TFIIH basal transcription factor complex has six subunits, this is the p34 subunit.; GO: 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent, 0000439 core TFIIH complex
Probab=92.62 E-value=4.9 Score=45.17 Aligned_cols=184 Identities=17% Similarity=0.167 Sum_probs=96.4
Q ss_pred eEEEEEecchhHHhh----cHHHHHHHHHHHHHhc-CCCCCCceEEEEEEcC--eEEEEecCCCC--CCcceeecccccc
Q 001711 429 LYFFLIDVSISAIRS----GMLEVVAQTIKSCLDE-LPGFPRTQIGFITFDS--TIHFYNMKSSL--TQPQMMVISDLDD 499 (1021)
Q Consensus 429 ~yvFvIDvS~~av~s----G~l~~~~~sI~~~L~~-Lp~~~rt~VgiITFds--~V~fynl~~~~--~~p~mlVvsDldd 499 (1021)
..+.|||++-.+... ..+..++++|.--++. |--+..-+|+||.... .-.+|.-.... ....-.-..+.++
T Consensus 3 LLvIILD~nP~~W~~~~~~~~l~~~l~~llvFlNahL~l~~~N~vaVIAs~~~~s~~LYP~~~~~~~~~~~~~~~~~~~~ 82 (276)
T PF03850_consen 3 LLVIILDTNPLAWGQLSDQLSLSQFLDSLLVFLNAHLALNHSNQVAVIASHSNSSKFLYPSPSSSESSNSGDVEMNSSDS 82 (276)
T ss_pred EEEEEEECCHHHHhhccccccHHHHHHHHHHHHHHHHhhCccCCEEEEEEcCCccEEEeCCCccccccCCCccccccccc
Confidence 468899999877432 2344555555555542 2222235799988743 33455543310 0000000111110
Q ss_pred ccCCCCCccceehhhh-HHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh-----------cCCEEEEEecCCCC
Q 001711 500 IFVPLPDDLLVNLSES-RSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR-----------LGGKLLIFQNSLPS 567 (1021)
Q Consensus 500 ~f~Pl~~~lLv~l~es-~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~-----------~GGkIivF~sg~Pt 567 (1021)
. -.+.+-.++|. .+.+.+++++....- .....+.+..||..|+-.+.. ..+||+++.++-
T Consensus 83 ~----~y~~f~~v~~~v~~~l~~l~~~~~~~~--~~~~~s~LagALS~ALCyINR~~~~~~~~~~~~~~RILv~~s~s-- 154 (276)
T PF03850_consen 83 N----KYRQFRNVDETVLEELKKLMSETSESS--DSTTSSLLAGALSMALCYINRISRESPSGGTSLKSRILVIVSGS-- 154 (276)
T ss_pred c----hhHHHHHHHHHHHHHHHHHHhhccccc--ccccchhhHHHHHHHHHHHhhhhhcccCCCCCcCccEEEEEecC--
Confidence 0 00111112221 233333333332211 111226788899888876643 235888853321
Q ss_pred CCcccccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCC
Q 001711 568 LGVGCLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPS 644 (1021)
Q Consensus 568 ~GpG~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~ 644 (1021)
+| . ..+.-=+-+..-.+.+.+|.||++..+. .|-.-|...+..|||.-+..+.
T Consensus 155 ---------~d--------~-----~~QYi~~MN~iFaAqk~~v~IDv~~L~~--~~s~fLqQa~d~T~G~y~~~~~ 207 (276)
T PF03850_consen 155 ---------PD--------S-----SSQYIPLMNCIFAAQKQKVPIDVCKLGG--KDSTFLQQASDITGGIYLKVSK 207 (276)
T ss_pred ---------CC--------c-----cHHHHHHHHHHHHHhcCCceeEEEEecC--CchHHHHHHHHHhCceeeccCc
Confidence 11 0 0112223466677889999999999887 5666789999999999887765
No 74
>KOG0444 consensus Cytoskeletal regulator Flightless-I (contains leucine-rich and gelsolin repeats) [Cytoskeleton]
Probab=91.25 E-value=0.31 Score=58.81 Aligned_cols=66 Identities=27% Similarity=0.456 Sum_probs=49.4
Q ss_pred cccccchhhccCCcEEEEECCceeEEEecCCCCHHHHHhhcCCchhhhhccccccccchHHHHHHHHHHHHHHHh-CCCC
Q 001711 894 KRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSPDIAMNLLGSEFAAELSKVMLREQDNEMSRKLLGILKKLREQ-DPSY 972 (1021)
Q Consensus 894 ~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~~ll~~lFgv~~~~~l~~~~lp~~~n~~s~~l~~ii~~lr~~-r~~~ 972 (1021)
++++|+..+|++.-+||||-|.+||||-|...- +..+.+.|-+.++|.+. |.--
T Consensus 637 EPVpl~~tSLDPRf~FlLD~G~~IyiW~G~~s~-------------------------~t~~~KARLfAEkinK~eRKgK 691 (1255)
T KOG0444|consen 637 EPVPLSVTSLDPRFCFLLDAGETIYIWSGYKSR-------------------------ITVSNKARLFAEKINKRERKGK 691 (1255)
T ss_pred eccCccccccCcceEEEEeCCceEEEEeccchh-------------------------cccchHHHHHHHHhhhhhccCc
Confidence 468999999999999999999999999997641 13445667677777544 3333
Q ss_pred CceEEEeccCCCc
Q 001711 973 YQLCQLVRQGEQP 985 (1021)
Q Consensus 973 ~~l~~vvrqg~~~ 985 (1021)
..+ .++|||...
T Consensus 692 ~EI-~l~rQg~e~ 703 (1255)
T KOG0444|consen 692 SEI-ELCRQGREP 703 (1255)
T ss_pred eee-ehhhhcCCC
Confidence 455 788998654
No 75
>KOG2807 consensus RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription; Replication, recombination and repair]
Probab=90.85 E-value=2.6 Score=47.40 Aligned_cols=165 Identities=23% Similarity=0.311 Sum_probs=99.3
Q ss_pred CCeEEEEEecchhHHhhc----HHHHHHHHHHHHHhcCC-CCCCceEEEEEEcCeEEEEecCCCCCCcceeecccccccc
Q 001711 427 PPLYFFLIDVSISAIRSG----MLEVVAQTIKSCLDELP-GFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIF 501 (1021)
Q Consensus 427 pp~yvFvIDvS~~av~sG----~l~~~~~sI~~~L~~Lp-~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f 501 (1021)
-...+.|||+|-.+.++- .++.+++.+..-+.+.- .+|-..||||+.-+. .. -+++|
T Consensus 60 iRhl~iviD~S~am~e~Df~P~r~a~~~K~le~Fv~eFFdQNPiSQigii~~k~g---------~A----~~lt~----- 121 (378)
T KOG2807|consen 60 IRHLYIVIDCSRAMEEKDFRPSRFANVIKYLEGFVPEFFDQNPISQIGIISIKDG---------KA----DRLTD----- 121 (378)
T ss_pred heeEEEEEEhhhhhhhccCCchHHHHHHHHHHHHHHHHhccCchhheeEEEEecc---------hh----hHHHH-----
Confidence 346678999999886654 34555555555555432 356678999875321 10 01122
Q ss_pred CCCCCccceehhhh-HHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcCC----EEEEEecCCCCCCccccccc
Q 001711 502 VPLPDDLLVNLSES-RSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLGG----KLLIFQNSLPSLGVGCLKLR 576 (1021)
Q Consensus 502 ~Pl~~~lLv~l~es-~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~GG----kIivF~sg~Pt~GpG~L~~r 576 (1021)
++.+ +..|+.|.... .-.....+-.||+.|...|++.-| .|++..+++.|.-||-+
T Consensus 122 ----------ltgnp~~hI~aL~~~~------~~~g~fSLqNaLe~a~~~Lk~~p~H~sREVLii~sslsT~DPgdi--- 182 (378)
T KOG2807|consen 122 ----------LTGNPRIHIHALKGLT------ECSGDFSLQNALELAREVLKHMPGHVSREVLIIFSSLSTCDPGDI--- 182 (378)
T ss_pred ----------hcCCHHHHHHHHhccc------ccCCChHHHHHHHHHHHHhcCCCcccceEEEEEEeeecccCcccH---
Confidence 2222 22333332222 123455688899999999998633 45666677777766633
Q ss_pred CCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHH
Q 001711 577 GDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRH 656 (1021)
Q Consensus 577 ~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~ 656 (1021)
| +.-..+.+..|-|.++-.+.+ ++.-..||+.|||. |+. ..|...|..
T Consensus 183 ----------------------~-~tI~~lk~~kIRvsvIgLsaE---v~icK~l~kaT~G~-Y~V-----~lDe~Hlke 230 (378)
T KOG2807|consen 183 ----------------------Y-ETIDKLKAYKIRVSVIGLSAE---VFICKELCKATGGR-YSV-----ALDEGHLKE 230 (378)
T ss_pred ----------------------H-HHHHHHHhhCeEEEEEeechh---HHHHHHHHHhhCCe-EEE-----EeCHHHHHH
Confidence 3 334567888899999887744 66678899999993 222 245555544
Q ss_pred HHHH
Q 001711 657 ELSR 660 (1021)
Q Consensus 657 dL~r 660 (1021)
-|..
T Consensus 231 Ll~e 234 (378)
T KOG2807|consen 231 LLLE 234 (378)
T ss_pred HHHh
Confidence 4443
No 76
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification]
Probab=90.19 E-value=8.2 Score=43.74 Aligned_cols=13 Identities=8% Similarity=0.171 Sum_probs=6.3
Q ss_pred HHHHHHHHHHhcC
Q 001711 448 VVAQTIKSCLDEL 460 (1021)
Q Consensus 448 ~~~~sI~~~L~~L 460 (1021)
.++|+|..+|.-+
T Consensus 391 ~AiETllTAI~lI 403 (498)
T KOG4849|consen 391 GAIETLLTAIQLI 403 (498)
T ss_pred hHHHHHHHHHHHH
Confidence 3445555555444
No 77
>COG2425 Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [General function prediction only]
Probab=89.92 E-value=2.1 Score=50.78 Aligned_cols=148 Identities=16% Similarity=0.216 Sum_probs=94.0
Q ss_pred CCeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCC
Q 001711 427 PPLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPD 506 (1021)
Q Consensus 427 pp~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~ 506 (1021)
.| ++.|||.|.++ .|..+...+++..+|-.+.--.+.++.++.||+.++=|.+....
T Consensus 273 Gp-villlD~SGSM--~G~~e~~AKAvalAl~~~alaenR~~~~~lF~s~~~~~el~~k~-------------------- 329 (437)
T COG2425 273 GP-VILLLDKSGSM--SGFKEQWAKAVALALMRIALAENRDCYVILFDSEVIEYELYEKK-------------------- 329 (437)
T ss_pred CC-EEEEEeCCCCc--CCcHHHHHHHHHHHHHHHHHHhccceEEEEecccceeeeecCCc--------------------
Confidence 44 45699999998 57777777777777765432233789999999954444433210
Q ss_pred ccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh---cCCEEEEEecCCCCCCcccccccCCcCccc
Q 001711 507 DLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR---LGGKLLIFQNSLPSLGVGCLKLRGDDLRVY 583 (1021)
Q Consensus 507 ~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~---~GGkIivF~sg~Pt~GpG~L~~r~~~~r~~ 583 (1021)
-.++++++.|...|.. ++-+-.||..|++.++. .++.|++.|.|-.
T Consensus 330 ----------~~~~e~i~fL~~~f~G----GTD~~~~l~~al~~~k~~~~~~adiv~ITDg~~----------------- 378 (437)
T COG2425 330 ----------IDIEELIEFLSYVFGG----GTDITKALRSALEDLKSRELFKADIVVITDGED----------------- 378 (437)
T ss_pred ----------cCHHHHHHHHhhhcCC----CCChHHHHHHHHHHhhcccccCCCEEEEeccHh-----------------
Confidence 0134456666555543 35678899999999985 4678888776421
Q ss_pred CCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCC-cChhhhhhhccccccEEEEeC
Q 001711 584 GTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKY-TDIASLGTLAKYTGGQVYYYP 643 (1021)
Q Consensus 584 gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~-~diatl~~L~~~TGG~v~~y~ 643 (1021)
.. .+.|-++..+...+.+.=|.-.+++... -++..+.. ++ +|.++
T Consensus 379 ------~~---~~~~~~~v~e~~k~~~~rl~aV~I~~~~~~~l~~Isd---~~---i~~~~ 424 (437)
T COG2425 379 ------ER---LDDFLRKVKELKKRRNARLHAVLIGGYGKPGLMRISD---HI---IYRVE 424 (437)
T ss_pred ------hh---hhHHHHHHHHHHHHhhceEEEEEecCCCCcccceeee---ee---EEeeC
Confidence 11 1567777777776777777777766544 55554444 33 66655
No 78
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification]
Probab=88.82 E-value=9.5 Score=43.26 Aligned_cols=7 Identities=29% Similarity=0.591 Sum_probs=2.7
Q ss_pred CccceEE
Q 001711 354 FICRTYV 360 (1021)
Q Consensus 354 ~rCrAYi 360 (1021)
.|||-.|
T Consensus 412 dRCrvLi 418 (498)
T KOG4849|consen 412 DRCRVLI 418 (498)
T ss_pred hHHHHHH
Confidence 3444333
No 79
>PRK10997 yieM hypothetical protein; Provisional
Probab=88.12 E-value=2 Score=51.76 Aligned_cols=149 Identities=13% Similarity=0.169 Sum_probs=85.8
Q ss_pred CeEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCc
Q 001711 428 PLYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDD 507 (1021)
Q Consensus 428 p~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~ 507 (1021)
--+++|||+|.++. |.-+..+.++..+|-.+.-..+.++++|.|++.+..|.+...
T Consensus 324 GpiII~VDtSGSM~--G~ke~~AkalAaAL~~iAl~q~dr~~li~Fs~~i~~~~l~~~---------------------- 379 (487)
T PRK10997 324 GPFIVCVDTSGSMG--GFNEQCAKAFCLALMRIALAENRRCYIMLFSTEVVTYELTGP---------------------- 379 (487)
T ss_pred CcEEEEEECCCCCC--CCHHHHHHHHHHHHHHHHHhcCCCEEEEEecCCceeeccCCc----------------------
Confidence 45788999999984 554455556666665543323367999999988776644321
Q ss_pred cceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---CCEEEEEecCCCCCCcccccccCCcCcccC
Q 001711 508 LLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---GGKLLIFQNSLPSLGVGCLKLRGDDLRVYG 584 (1021)
Q Consensus 508 lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---GGkIivF~sg~Pt~GpG~L~~r~~~~r~~g 584 (1021)
..+..+..+|+.. + ..++.+..||+.++..++.. .|-|+++++.....
T Consensus 380 ------~gl~~ll~fL~~~---f----~GGTDl~~aL~~al~~l~~~~~r~adIVVISDF~~~~---------------- 430 (487)
T PRK10997 380 ------DGLEQAIRFLSQS---F----RGGTDLAPCLRAIIEKMQGREWFDADAVVISDFIAQR---------------- 430 (487)
T ss_pred ------cCHHHHHHHHHHh---c----CCCCcHHHHHHHHHHHHcccccCCceEEEECCCCCCC----------------
Confidence 1112222233321 2 34677899999999888652 46677665543110
Q ss_pred CCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCC
Q 001711 585 TDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPS 644 (1021)
Q Consensus 585 t~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~ 644 (1021)
..+.+.+.+...-.+.+.-+...+++.. +-..+..++. +++.|+.
T Consensus 431 ---------~~eel~~~L~~Lk~~~~~rf~~l~i~~~--~~p~l~~ifD----~~W~~d~ 475 (487)
T PRK10997 431 ---------LPDELVAKVKELQRQHQHRFHAVAMSAH--GKPGIMRIFD----HIWRFDT 475 (487)
T ss_pred ---------ChHHHHHHHHHHHHhcCcEEEEEEeCCC--CCchHHHhcC----eeeEecC
Confidence 0123444444333347777777777642 2233444443 4677664
No 80
>PF06707 DUF1194: Protein of unknown function (DUF1194); InterPro: IPR010607 This family consists of several hypothetical Rhizobiales specific proteins of around 270 residues in length. The function of this family is unknown.
Probab=86.90 E-value=29 Score=37.40 Aligned_cols=119 Identities=18% Similarity=0.171 Sum_probs=64.5
Q ss_pred hhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---CCEEEEEecC--CCCCCcccccccCCcCcccCCCcc
Q 001711 514 ESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---GGKLLIFQNS--LPSLGVGCLKLRGDDLRVYGTDKE 588 (1021)
Q Consensus 514 es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---GGkIivF~sg--~Pt~GpG~L~~r~~~~r~~gt~~e 588 (1021)
+..+.+-.-|+..+..+ ...+|+|.||..+..+|... +.|-++=.|| .-|.|+
T Consensus 75 ~da~a~A~~l~~~~r~~----~~~Taig~Al~~a~~ll~~~~~~~~RrVIDvSGDG~~N~G~------------------ 132 (205)
T PF06707_consen 75 ADAEAFAARLRAAPRRF----GGRTAIGSALDFAAALLAQNPFECWRRVIDVSGDGPNNQGP------------------ 132 (205)
T ss_pred HHHHHHHHHHHhCCCCC----CCCchHHHHHHHHHHHHHhCCCCCceEEEEECCCCCCCCCC------------------
Confidence 34444455555555432 22389999999999999874 3444444442 222221
Q ss_pred ccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCc----ChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHhccc
Q 001711 589 HSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYT----DIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRDLTR 664 (1021)
Q Consensus 589 ~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~----diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~ltr 664 (1021)
.|. +..-..+...||.||=+.+....- +|...-.=+-.+|---|.... .+.+.|.+-++|-|.|
T Consensus 133 ----~p~----~~ard~~~~~GitINgL~I~~~~~~~~~~L~~yy~~~VIgGpgAFV~~a----~~~~df~~AirrKL~r 200 (205)
T PF06707_consen 133 ----RPV----TSARDAAVAAGITINGLAILDDDPFGGADLDAYYRRCVIGGPGAFVETA----RGFEDFAEAIRRKLIR 200 (205)
T ss_pred ----Ccc----HHHHHHHHHCCeEEeeeEecCCCCCccccHHHHHhhhcccCCCceEEEc----CCHHHHHHHHHHHHHH
Confidence 122 122234556899999998877655 565544333333322232222 2345566666666655
Q ss_pred cc
Q 001711 665 ET 666 (1021)
Q Consensus 665 ~~ 666 (1021)
|+
T Consensus 201 Ei 202 (205)
T PF06707_consen 201 EI 202 (205)
T ss_pred Hh
Confidence 53
No 81
>smart00187 INB Integrin beta subunits (N-terminal portion of extracellular region). Portion of beta integrins that lies N-terminal to their EGF-like repeats. Integrins are cell adhesion molecules that mediate cell-extracellular matrix and cell-cell interactions. They contain both alpha and beta subunits. Beta integrins are proposed to have a von Willebrand factor type-A "insert" or "I" -like domain (although this remains to be confirmed).
Probab=85.16 E-value=91 Score=37.16 Aligned_cols=272 Identities=15% Similarity=0.192 Sum_probs=140.4
Q ss_pred CCeEEEEEecchhHHhh-cHHHHHHHHHHHHHhcCCCCCCceEEEEEE-cCeEEEEec--CCCCCCcceeeccccccccC
Q 001711 427 PPLYFFLIDVSISAIRS-GMLEVVAQTIKSCLDELPGFPRTQIGFITF-DSTIHFYNM--KSSLTQPQMMVISDLDDIFV 502 (1021)
Q Consensus 427 pp~yvFvIDvS~~av~s-G~l~~~~~sI~~~L~~Lp~~~rt~VgiITF-ds~V~fynl--~~~~~~p~mlVvsDldd~f~ 502 (1021)
|-=..|+.|+|+++... .-++.+...|.+.|..+-.+ .|+||=+| |+.|.=|-. ...+..|-.-.-...+-.|
T Consensus 99 PvDLYyLMDlS~SM~ddl~~lk~lg~~L~~~m~~it~n--~rlGfGsFVDK~v~P~~~t~p~~l~~PC~~~~~~c~p~f- 175 (423)
T smart00187 99 PVDLYYLMDLSYSMKDDLDNLKSLGDDLAREMKGLTSN--FRLGFGSFVDKTVSPFVSTRPEKLENPCPNYNLTCEPPY- 175 (423)
T ss_pred ccceEEEEeCCccHHHHHHHHHHHHHHHHHHHHhcccC--ceeeEEEeecCccCCcccCCHHHhcCCCcCCCCCcCCCc-
Confidence 33467899999988431 12445555666666666544 88999988 665532221 1111111000000001111
Q ss_pred CCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC-----CEEEEEecCCCC--CCcccccc
Q 001711 503 PLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG-----GKLLIFQNSLPS--LGVGCLKL 575 (1021)
Q Consensus 503 Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G-----GkIivF~sg~Pt--~GpG~L~~ 575 (1021)
.-.-.++|.+..+.+.+.+.... ...+...+|-.|-+-+++|+ .-+.+| -||+||.+--.- .|-|+|-.
T Consensus 176 --~f~~~L~LT~~~~~F~~~V~~~~-iSgN~D~PEgG~DAimQaaV-C~~~IGWR~~a~rllv~~TDa~fH~AGDGkLaG 251 (423)
T smart00187 176 --GFKHVLSLTDDTDEFNEEVKKQR-ISGNLDAPEGGFDAIMQAAV-CTEQIGWREDARRLLVFSTDAGFHFAGDGKLAG 251 (423)
T ss_pred --ceeeeccCCCCHHHHHHHHhhce-eecCCcCCcccHHHHHHHHh-hccccccCCCceEEEEEEcCCCccccCCcceee
Confidence 11224667777776776666643 23344567777777777774 112233 489999987775 38888765
Q ss_pred c--CCcCcccC-CCccccC-CCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEE-EeCCCCCchh
Q 001711 576 R--GDDLRVYG-TDKEHSL-RIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVY-YYPSFQSTTH 650 (1021)
Q Consensus 576 r--~~~~r~~g-t~~e~~l-~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~-~y~~F~~~~d 650 (1021)
. .++.+-|= .+.+..- ..-...--.+|++++.+++|-+ ||+.+....++. ..|+.+-.|... ... ..+.+
T Consensus 252 Iv~PNDg~CHL~~~g~Yt~s~~~DYPSi~ql~~kL~e~nI~~-IFAVT~~~~~~Y--~~Ls~lipgs~vg~Ls--~DSsN 326 (423)
T smart00187 252 IVQPNDGQCHLDNNGEYTMSTTQDYPSIGQLNQKLAENNINP-IFAVTKKQVSLY--KELSALIPGSSVGVLS--EDSSN 326 (423)
T ss_pred EecCCCCcceeCCCCCcCccCcCCCCCHHHHHHHHHhcCceE-EEEEcccchhHH--HHHHHhcCcceeeecc--cCcch
Confidence 4 12233221 1101110 0112234578899999999865 788777776653 344444444332 211 12233
Q ss_pred HHHHHHHHHHhcccccccceEEEEE-eCCCeEEEeeecCccc--CCCCceeeccCCCCCcEEEEEEec
Q 001711 651 GERLRHELSRDLTRETAWEAVMRIR-CGKGVRFTNYHGNFML--RSTDLLALPAVDCDKAYAMQLSLE 715 (1021)
Q Consensus 651 ~~kl~~dL~r~ltr~~g~~a~mrVR-~S~Gl~V~~~~Gnf~~--rs~~~~~l~~id~d~Sia~~~~~d 715 (1021)
.-+|..+-++.|. -.++|+.. ..++++++-.- .+-. .....-...++.-.+.+.|++++.
T Consensus 327 Iv~LI~~aY~~i~----S~V~l~~~~~p~~v~~~y~s-~C~~g~~~~~~~~C~~v~iG~~V~F~v~vt 389 (423)
T smart00187 327 VVELIKDAYNKIS----SRVELEDNSLPEGVSVTYTS-SCPGGVVGPGTRKCEGVKIGDTVSFEVTVT 389 (423)
T ss_pred HHHHHHHHHHhhc----eEEEEecCCCCCcEEEEEEe-eCCCCCcccCCcccCCcccCCEEEEEEEEE
Confidence 4455555555443 34455544 36677766332 2111 011112345666667777777654
No 82
>KOG2353 consensus L-type voltage-dependent Ca2+ channel, alpha2/delta subunit [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=84.19 E-value=19 Score=47.62 Aligned_cols=116 Identities=23% Similarity=0.344 Sum_probs=73.4
Q ss_pred ccccEEEEcc---ccccCCCCCCCeEEEEEecchhHHhhc-HHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecC
Q 001711 408 TKGSVEFVAP---TEYMVRPPMPPLYFFLIDVSISAIRSG-MLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMK 483 (1021)
Q Consensus 408 ~~gtVEfvap---~eY~~r~p~pp~yvFvIDvS~~av~sG-~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~ 483 (1021)
...++|+... +-|+.....+--.+|++|+|.+. +| .+..++.++.++|+.|.++ ..|-|+||++.++.-
T Consensus 203 ~~~~idl~D~R~r~Wyi~aAt~pKdiviLlD~SgSm--~g~~~~lak~tv~~iLdtLs~~--Dfvni~tf~~~~~~v--- 275 (1104)
T KOG2353|consen 203 TDNSIDLYDCRNRSWYIQAATSPKDIVILLDVSGSM--SGLRLDLAKQTVNEILDTLSDN--DFVNILTFNSEVNPV--- 275 (1104)
T ss_pred CCCcceeeecccccccccccCCccceEEEEeccccc--cchhhHHHHHHHHHHHHhcccC--CeEEEEeeccccCcc---
Confidence 3444444433 33565667778899999999987 34 3677888899999999876 789999999876532
Q ss_pred CCCCCcceeeccccccccCCCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHh
Q 001711 484 SSLTQPQMMVISDLDDIFVPLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSR 553 (1021)
Q Consensus 484 ~~~~~p~mlVvsDldd~f~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~ 553 (1021)
+++.. .+|+----..++.+.++++.|. .+.. .-+-.|++.|+.+|..
T Consensus 276 ----------~pc~~-------~~lvqAt~~nk~~~~~~i~~l~--~k~~----a~~~~~~e~aF~lL~~ 322 (1104)
T KOG2353|consen 276 ----------SPCFN-------GTLVQATMRNKKVFKEAIETLD--AKGI----ANYTAALEYAFSLLRD 322 (1104)
T ss_pred ----------ccccc-------CceeecchHHHHHHHHHHhhhc--cccc----cchhhhHHHHHHHHHH
Confidence 22211 1222222244566666666665 1111 2245688888888865
No 83
>PF00362 Integrin_beta: Integrin, beta chain; InterPro: IPR002369 Integrins are the major metazoan receptors for cell adhesion to extracellular matrix proteins and, in vertebrates, also play important roles in certain cell-cell adhesions, make transmembrane connections to the cytoskeleton and activate many intracellular signalling pathways [, ]. The integrin receptors are composed of alpha and beta subunit heterodimers. Each subunit crosses the membrane once, with most of the polypeptide residing in the extracellular space, and has two short cytoplasmic domains. Some members of this family have EGF repeats at the C terminus and also have a vWA domain inserted within the integrin domain at the N terminus. Most integrins recognise relatively short peptide motifs, and in general require an acidic amino acid to be present. Ligand specificity depends upon both the alpha and beta subunits []. There are at least 18 types of alpha and 8 types of beta subunits recognised in humans []. Each alpha subunit tends to associate only with one type of beta subunit, but there are exceptions to this rule []. Each association of alpha and beta subunits has its own binding specificity and signalling properties. Many integrins require activation on the cell surface before they can bind ligands. Integrins frequently intercommunicate, and binding at one integrin receptor activate or inhibit another. The structure of unliganded alphaV beta3 showed the molecule to be folded, with the head bent over towards the C termini of the legs which would normally be inserted into the membrane []. The head comprises a beta propeller domain at the end terminus of the alphaV subunit and an I/A domain inserted into a loop on the top of the hybrid domain in the beta subunit. The I/A domain consists of a Rossman fold with a core of beta parallel sheets surrounded by amphipathic alpha helices. Integrins are important therapeutic targets in conditions such as atherosclerosis, thrombosis, cancer and asthma []. At the N terminus of the beta subunit is a cysteine-containing domain reminiscent of that found in presenillins and semaphorins, which has hence been termed the PSI domain. C-terminal to the PSI domain is an A-domain, which has been predicted to adopt a Rossmann fold similar to that of the alpha subunit, but with additional loops between the second and third beta strands []. The murine gene Pactolus shares significant similarity with the beta subunit [], but lacks either one or both of the inserted loops. The C-terminal portion of the beta subunit extracellular domain contains an internally disulphide-bonded cysteine-rich region, while the intracellular tail contains putative sites of interaction with a variety of intracellular signalling and cytoskeletal proteins, such as focal adhesion kinase and alpha-actinin respectively []. Integrin cytoplasmic domains are normally less than 50 amino acids in length, with the beta-subunit sequences exhibiting greater homology to each other than the alpha-subunit sequences. This is consistent with current evidence that the beta subunit is the principal site for binding of cytoskeletal and signalling molecules, whereas the alpha subunit has a regulatory role. The first 20 amino acids of the beta-subunit cytoplasmic domain are also alpha helical, but the final 25 residues are disordered and, apart from a turn that follows a conserved NPxY motif, appear to lack defined structure, suggesting that this is adopted on effector binding. The two membrane-proximal helices mediate the link between the subunits via a series of hydrophobic and electrostatic contacts. This entry represents the N-terminal portion of the extracellular region of integrin beta subunits.; GO: 0005488 binding, 0007155 cell adhesion, 0007160 cell-matrix adhesion; PDB: 3VI4_B 3VI3_B 2VDQ_B 3IJE_B 1M1X_B 2VDR_B 3NIF_B 3NID_D 1TYE_F 2Q6W_F ....
Probab=83.68 E-value=94 Score=37.25 Aligned_cols=275 Identities=17% Similarity=0.238 Sum_probs=131.8
Q ss_pred EEEEccccccCCCCCCCeEEEEEecchhHHhh-cHHHHHHHHHHHHHhcCCCCCCceEEEEEE-cCeEEEEecCCCCCCc
Q 001711 412 VEFVAPTEYMVRPPMPPLYFFLIDVSISAIRS-GMLEVVAQTIKSCLDELPGFPRTQIGFITF-DSTIHFYNMKSSLTQP 489 (1021)
Q Consensus 412 VEfvap~eY~~r~p~pp~yvFvIDvS~~av~s-G~l~~~~~sI~~~L~~Lp~~~rt~VgiITF-ds~V~fynl~~~~~~p 489 (1021)
++|..+++| |-=.-|++|+|+++... .-++.+-..|...|.++-.+ .|+||=+| |+.|.=|-- ..|
T Consensus 93 v~~~~a~~y------PvDLYyLmDlS~Sm~ddl~~l~~lg~~l~~~~~~it~~--~~~GfGsfvdK~~~P~~~----~~p 160 (426)
T PF00362_consen 93 VTVRPAEDY------PVDLYYLMDLSYSMKDDLENLKSLGQDLAEEMRNITSN--FRLGFGSFVDKPVMPFVS----TTP 160 (426)
T ss_dssp EEEEBSSS--------EEEEEEEE-SGGGHHHHHHHCCCCHHHHHHHHTT-SS--EEEEEEEESSSSSTTTST-----SS
T ss_pred EEEeecccc------ceeEEEEeechhhhhhhHHHHHHHHHHHHHHHHhcCcc--ceEechhhcccccCCccc----CCh
Confidence 444445555 33467899999987321 11344556677777777655 88999999 554321110 001
Q ss_pred ceeecccccccc--------CCCCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-----CC
Q 001711 490 QMMVISDLDDIF--------VPLPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-----GG 556 (1021)
Q Consensus 490 ~mlVvsDldd~f--------~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-----GG 556 (1021)
. .+.++. -|..-.-.++|.+..+.+...+.+.. +-.+...++..|-+-++||+= -+.+ .-
T Consensus 161 ~-----~l~~pc~~~~~~c~~~~~f~~~l~Lt~~~~~F~~~v~~~~-is~n~D~PEgg~dal~Qa~vC-~~~igWr~~a~ 233 (426)
T PF00362_consen 161 E-----KLKNPCPSKNPNCQPPFSFRHVLSLTDDITEFNEEVNKQK-ISGNLDAPEGGLDALMQAAVC-QEEIGWRNEAR 233 (426)
T ss_dssp H-----CHHSTSCCTTS--B---SEEEEEEEES-HHHHHHHHHTS---B--SSSSBSHHHHHHHHHH--HHHHT--STSE
T ss_pred h-----hhcCcccccCCCCCCCeeeEEeecccchHHHHHHhhhhcc-ccCCCCCCccccchheeeeec-ccccCcccCce
Confidence 0 111111 01111234677777777888887753 344556777777777777651 1222 35
Q ss_pred EEEEEecCCCC--CCcccccccC--CcCccc-CCCcccc-CCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhh
Q 001711 557 KLLIFQNSLPS--LGVGCLKLRG--DDLRVY-GTDKEHS-LRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGT 630 (1021)
Q Consensus 557 kIivF~sg~Pt--~GpG~L~~r~--~~~r~~-gt~~e~~-l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~ 630 (1021)
||+||.+--.- .|-|+|...- ++.+-| ..+.+.. -..-...-..+|.+.+.+++|.+ ||+......++. ..
T Consensus 234 ~llv~~TD~~fH~agDg~l~gi~~pnd~~Chl~~~~~y~~~~~~DYPSv~ql~~~l~e~~i~~-IFAVt~~~~~~Y--~~ 310 (426)
T PF00362_consen 234 RLLVFSTDAGFHFAGDGKLAGIVKPNDGKCHLDDNGMYTASTEQDYPSVGQLVRKLSENNINP-IFAVTKDVYSIY--EE 310 (426)
T ss_dssp EEEEEEESS-B--TTGGGGGT--S---SS--BSTTSBBGGGGCS----HHHHHHHHHHTTEEE-EEEEEGGGHHHH--HH
T ss_pred EEEEEEcCCccccccccccceeeecCCCceEECCCCcccccccccCCCHHHHHHHHHHcCCEE-EEEEchhhhhHH--HH
Confidence 89999887664 4888887542 223322 1111100 01124466778888899988765 777776655543 23
Q ss_pred hcccc-ccEEEEeCCCCCchhHHHHHHHHHHhcccccccceEEEEE-eCCCeEEEeeecCcccCC--CCceeeccCCCCC
Q 001711 631 LAKYT-GGQVYYYPSFQSTTHGERLRHELSRDLTRETAWEAVMRIR-CGKGVRFTNYHGNFMLRS--TDLLALPAVDCDK 706 (1021)
Q Consensus 631 L~~~T-GG~v~~y~~F~~~~d~~kl~~dL~r~ltr~~g~~a~mrVR-~S~Gl~V~~~~Gnf~~rs--~~~~~l~~id~d~ 706 (1021)
|+.+- |+.+-.... .+....+|..+-++.++. .+.|+.. ..++++|+ |..++..+. ...-...++..++
T Consensus 311 L~~~i~~s~vg~L~~--dSsNIv~LI~~aY~~i~s----~V~L~~~~~p~~v~v~-y~s~C~~~~~~~~~~~C~~V~iG~ 383 (426)
T PF00362_consen 311 LSNLIPGSSVGELSS--DSSNIVQLIKEAYNKISS----KVELKHDNAPDGVKVS-YTSNCPNGSTVPGTNECSNVKIGD 383 (426)
T ss_dssp HHHHSTTEEEEEEST--TSHTHHHHHHHHHHHHCT----EEEEEECS--TTEEEE-EEEEESSSEEEECCEEECSE-TT-
T ss_pred HhhcCCCceeccccc--CchhHHHHHHHHHHHHhh----eEEEEecCCCCcEEEE-EEEEccCCcccCcCccccCEecCC
Confidence 33332 444444432 223344555555554432 2333322 23456553 222222110 1224455566666
Q ss_pred cEEEEEEec
Q 001711 707 AYAMQLSLE 715 (1021)
Q Consensus 707 Sia~~~~~d 715 (1021)
++.|++.+.
T Consensus 384 ~V~F~VtVt 392 (426)
T PF00362_consen 384 TVTFNVTVT 392 (426)
T ss_dssp EEEEEEEEE
T ss_pred EEEEEEEEE
Confidence 666666554
No 84
>KOG3768 consensus DEAD box RNA helicase [General function prediction only]
Probab=83.14 E-value=16 Score=44.20 Aligned_cols=32 Identities=22% Similarity=0.507 Sum_probs=24.0
Q ss_pred CeEEEEEecchhHHh-----hcHHHHHHHHHHHHHhc
Q 001711 428 PLYFFLIDVSISAIR-----SGMLEVVAQTIKSCLDE 459 (1021)
Q Consensus 428 p~yvFvIDvS~~av~-----sG~l~~~~~sI~~~L~~ 459 (1021)
|+|+|+||+|.++-+ ..+|+.++.+|..-|+.
T Consensus 2 pi~lFllDTS~SM~qrah~~~tylD~AKgaVEtFiK~ 38 (888)
T KOG3768|consen 2 PIFLFLLDTSGSMSQRAHPQFTYLDLAKGAVETFIKQ 38 (888)
T ss_pred ceEEEEEecccchhhhccCCchhhHHHHHHHHHHHHH
Confidence 689999999998743 34677777777777764
No 85
>KOG0444 consensus Cytoskeletal regulator Flightless-I (contains leucine-rich and gelsolin repeats) [Cytoskeleton]
Probab=82.43 E-value=3.1 Score=50.67 Aligned_cols=56 Identities=23% Similarity=0.335 Sum_probs=37.5
Q ss_pred hhcccEEEeecCCCCCCcc--CCcccccccccccchhhccCCcEEEEECCceeEEEecCCCCH
Q 001711 867 LLYPCLIRVDEHLLKPSAQ--LDEYKNIMKRLPLVAESLDSRGLYIFDDGFRFVLWFGRMLSP 927 (1021)
Q Consensus 867 ~lYPrL~~lh~~~~~~~~~--~~~~~~lP~~l~LS~~~L~~~giyLlD~G~~i~lwvG~~v~~ 927 (1021)
-..|+||.+. + ..+ +--.+.+-+...|-.+-|.+.|+|+||+..+||||+|+....
T Consensus 731 p~qpkLYkV~-l----GmGyLELPQvel~P~~~l~q~lL~sk~VyiLDc~sDiF~W~GkKs~R 788 (1255)
T KOG0444|consen 731 PEQPKLYKVN-L----GMGYLELPQVELLPKGILKQDLLGSKGVYILDCNSDIFLWIGKKSNR 788 (1255)
T ss_pred CCCcceEEEc-c----ccceeecchhhhchhhHHHHHhhcCCeEEEEecCCceEEEecccchH
Confidence 4578999874 2 010 111121212345667778999999999999999999998644
No 86
>KOG2487 consensus RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB4 [Transcription; Replication, recombination and repair]
Probab=76.54 E-value=46 Score=37.07 Aligned_cols=55 Identities=20% Similarity=0.189 Sum_probs=40.7
Q ss_pred HHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCCCCCchhHHHHHHHHHHhc
Q 001711 599 YKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPSFQSTTHGERLRHELSRDL 662 (1021)
Q Consensus 599 Y~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~F~~~~d~~kl~~dL~r~l 662 (1021)
|-+.--.+.+++|.||++....+ -..|...|..|||...+.+. .+.|.+.|...+
T Consensus 185 ~MNciFaAqKq~I~Idv~~l~~~---s~~LqQa~D~TGG~YL~v~~------~~gLLqyLlt~~ 239 (314)
T KOG2487|consen 185 YMNCIFAAQKQNIPIDVVSLGGD---SGFLQQACDITGGDYLHVEK------PDGLLQYLLTLL 239 (314)
T ss_pred HHHHHHHHHhcCceeEEEEecCC---chHHHHHHhhcCCeeEecCC------cchHHHHHHHHh
Confidence 45666678899999999998877 34588999999999888764 234555555443
No 87
>COG4867 Uncharacterized protein with a von Willebrand factor type A (vWA) domain [General function prediction only]
Probab=69.43 E-value=27 Score=40.88 Aligned_cols=160 Identities=16% Similarity=0.248 Sum_probs=95.7
Q ss_pred CeEEEEEecchhHHhhcHHHHHHH---HHHHHHhc-CCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCC
Q 001711 428 PLYFFLIDVSISAIRSGMLEVVAQ---TIKSCLDE-LPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVP 503 (1021)
Q Consensus 428 p~yvFvIDvS~~av~sG~l~~~~~---sI~~~L~~-Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~P 503 (1021)
...+.++|+|++++-.|..--+++ +|...+.. .++ --+.||+|...- +.+-+++
T Consensus 464 aAvallvDtS~SM~~eGRw~PmKQtALALhHLv~TrfrG---D~l~~i~Fgr~A------------~~v~v~e------- 521 (652)
T COG4867 464 AAVALLVDTSFSMVMEGRWLPMKQTALALHHLVCTRFRG---DALQIIAFGRYA------------RTVTAAE------- 521 (652)
T ss_pred cceeeeeeccHHHHHhccCCchHHHHHHHHHHHHhcCCC---cceEEEeccchh------------cccCHHH-------
Confidence 467889999999988885433333 33333332 333 358899986421 1111111
Q ss_pred CCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcC---CEEEEEecCCCCC----Cccccccc
Q 001711 504 LPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLG---GKLLIFQNSLPSL----GVGCLKLR 576 (1021)
Q Consensus 504 l~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~G---GkIivF~sg~Pt~----GpG~L~~r 576 (1021)
|..++... ..++.+--||..|-.+++... -.|++.+.|-||. |-|...--
T Consensus 522 -------------------Lt~l~~v~----eqgTNlhhaL~LA~r~l~Rh~~~~~~il~vTDGePtAhle~~DG~~~~f 578 (652)
T COG4867 522 -------------------LTGLAGVY----EQGTNLHHALALAGRHLRRHAGAQPVVLVVTDGEPTAHLEDGDGTSVFF 578 (652)
T ss_pred -------------------HhcCCCcc----ccccchHHHHHHHHHHHHhCcccCceEEEEeCCCccccccCCCCceEec
Confidence 12233222 223456678888888887643 4788899999874 33322211
Q ss_pred CCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeC
Q 001711 577 GDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYP 643 (1021)
Q Consensus 577 ~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~ 643 (1021)
-|++|-+ .+. ...+++ ..|.+.|+-|++|....+.-=..-+..+++.|+|.+|+-+
T Consensus 579 -----~yp~DP~-t~~----~Tvr~~-d~~~r~G~q~t~FrLg~DpgL~~Fv~qva~rv~G~vv~pd 634 (652)
T COG4867 579 -----DYPPDPR-TIA----HTVRGF-DDMARLGAQVTIFRLGSDPGLARFIDQVARRVQGRVVVPD 634 (652)
T ss_pred -----CCCCChh-HHH----HHHHHH-HHHHhccceeeEEeecCCHhHHHHHHHHHHHhCCeEEecC
Confidence 2333322 111 112233 4589999999999998876545557889999999999643
No 88
>PF11265 Med25_VWA: Mediator complex subunit 25 von Willebrand factor type A; InterPro: IPR021419 The overall function of the full-length Med25 is efficiently to coordinate the transcriptional activation of RAR/RXR (retinoic acid receptor/retinoic X receptor) in higher eukaryotic cells. Human Med25 consists of several domains with different binding properties, the N-terminal, VWA domain which is this one, an SD2 domain from residues 229-381, a PTOV(B) or ACID domain from 395-545, an SD2 domain from residues 564-645 and a C-terminal NR box-containing domain (646-650) from 646-747. This VWA or von Willebrand factor type A domain when bound to RAR and the histone acetyltransferase CBP is responsible for recruiting Med1 to the rest of the Mediator complex [].
Probab=67.17 E-value=2e+02 Score=31.61 Aligned_cols=103 Identities=16% Similarity=0.138 Sum_probs=62.9
Q ss_pred HHHHHHHHhhCCCcccCCCCcccc-hHHHHHHHHHHHHhc-------C-----CEEEEEecCCCCCCcccccccCCcCcc
Q 001711 516 RSVVDTLLDSLPSMFQDNMNVESA-FGPALKAAFMVMSRL-------G-----GKLLIFQNSLPSLGVGCLKLRGDDLRV 582 (1021)
Q Consensus 516 ~~~I~~lLd~Lp~~f~~~~~~~~a-lG~AL~aA~~lL~~~-------G-----GkIivF~sg~Pt~GpG~L~~r~~~~r~ 582 (1021)
-+.+.+.|++|+ |..+.-.+.| +.-+|.+|+.++... + -+.|+..+++|..=| ..
T Consensus 89 ~~~fl~~L~~I~--f~GGG~e~~a~iaEGLa~AL~~fd~~~~~r~~~~~~~~~khcILI~nSpP~~~p----~~------ 156 (226)
T PF11265_consen 89 PQKFLQWLDAIQ--FSGGGFESCAAIAEGLAEALQCFDDFKQMRQQQQQTDVQKHCILICNSPPYRLP----VN------ 156 (226)
T ss_pred HHHHHHHHHccC--cCCCCcccchhHHHHHHHHHHHhcchhhhccccCcccccceEEEEeCCCCcccc----cc------
Confidence 345566778887 3344444444 777888888877631 1 234555555553211 11
Q ss_pred cCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEE
Q 001711 583 YGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYY 641 (1021)
Q Consensus 583 ~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~ 641 (1021)
+..+ -....++++|..+.+++|.+.++.- --+..+..|-+..+|....
T Consensus 157 ----~~~~---~~~~~~d~la~~~~~~~I~LSiisP----rklP~l~~Lfeka~~~~~~ 204 (226)
T PF11265_consen 157 ----ECPQ---YSGKTCDQLAVLISERNISLSIISP----RKLPSLRSLFEKAKGNPRA 204 (226)
T ss_pred ----CCCc---ccCCCHHHHHHHHHhcCceEEEEcC----ccCHHHHHHHHhcCCCccc
Confidence 1111 1335678999999999999999863 2356677777777776654
No 89
>PF09967 DUF2201: VWA-like domain (DUF2201); InterPro: IPR018698 This family of various hypothetical bacterial proteins has no known function.
Probab=63.40 E-value=12 Score=36.96 Aligned_cols=93 Identities=18% Similarity=0.212 Sum_probs=58.5
Q ss_pred EEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCCccce
Q 001711 431 FFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPDDLLV 510 (1021)
Q Consensus 431 vFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~~lLv 510 (1021)
+++||+|.+.-+ ..++.++..|...++... .+|-+|.||..|+--. .+.+.++
T Consensus 2 ~vaiDtSGSis~-~~l~~fl~ev~~i~~~~~----~~v~vi~~D~~v~~~~-----------~~~~~~~----------- 54 (126)
T PF09967_consen 2 VVAIDTSGSISD-EELRRFLSEVAGILRRFP----AEVHVIQFDAEVQDVQ-----------VFRSLED----------- 54 (126)
T ss_pred EEEEECCCCCCH-HHHHHHHHHHHHHHHhCC----CCEEEEEECCEeeeee-----------EEecccc-----------
Confidence 689999997633 357777888888887762 5699999999887321 1111000
Q ss_pred ehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhcCCEEEEEecCCC
Q 001711 511 NLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRLGGKLLIFQNSLP 566 (1021)
Q Consensus 511 ~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~GGkIivF~sg~P 566 (1021)
.+..+ .-....++++.++++.+.+.. ....-|++|+.|-.
T Consensus 55 -----------~~~~~----~~~GgGGTdf~pvf~~~~~~~-~~~~~vi~fTDg~~ 94 (126)
T PF09967_consen 55 -----------ELRDI----KLKGGGGTDFRPVFEYLEENR-PRPSVVIYFTDGEG 94 (126)
T ss_pred -----------ccccc----ccCCCCCCcchHHHHHHHhcC-CCCCEEEEEeCCCC
Confidence 00011 112456788888888876543 34566778998654
No 90
>COG5242 TFB4 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB4 [Transcription / DNA replication, recombination, and repair]
Probab=61.13 E-value=1.4e+02 Score=32.46 Aligned_cols=177 Identities=19% Similarity=0.272 Sum_probs=93.0
Q ss_pred CCCeEEEEEecchhHH----hhcHHHHHHHHHHHHHhc-CCCCCCceEEEEE-EcCeEEEEecCCCCCCcceeeccccc-
Q 001711 426 MPPLYFFLIDVSISAI----RSGMLEVVAQTIKSCLDE-LPGFPRTQIGFIT-FDSTIHFYNMKSSLTQPQMMVISDLD- 498 (1021)
Q Consensus 426 ~pp~yvFvIDvS~~av----~sG~l~~~~~sI~~~L~~-Lp~~~rt~VgiIT-Fds~V~fynl~~~~~~p~mlVvsDld- 498 (1021)
.|...+.+||.--... +.|-..-+.+.|.--|.. |.-..+-||++|. |+..+.+.--+... .+.+++.|
T Consensus 19 spslL~viid~~p~~W~~~~ek~~~~kvl~di~VFLNAhlaf~~~NrVaVva~~s~~~~yLypss~s----~~k~se~e~ 94 (296)
T COG5242 19 SPSLLFVIIDLEPENWELTTEKGSRDKVLNDIVVFLNAHLAFSRNNRVAVVAGYSQGKTYLYPSSES----ALKASESEN 94 (296)
T ss_pred CCceEEEEEecChhhcccccccccHHHHHHHHHHHHHHHHhhccCCeEEEEEeccCceEEeccCcch----hhhhhcccC
Confidence 3555666788755442 345455555555555553 3322335787764 66666543222211 12233332
Q ss_pred ----cccCCCCCccceehhhhHHHHHHHHhhCCCcccCC--CCcccchHHHHHHHHHHHHh------cCCEEEEEecCCC
Q 001711 499 ----DIFVPLPDDLLVNLSESRSVVDTLLDSLPSMFQDN--MNVESAFGPALKAAFMVMSR------LGGKLLIFQNSLP 566 (1021)
Q Consensus 499 ----d~f~Pl~~~lLv~l~es~~~I~~lLd~Lp~~f~~~--~~~~~alG~AL~aA~~lL~~------~GGkIivF~sg~P 566 (1021)
|+|.- +-++ =+.+++.|-..++.. .....-+|-|+.+++.+..+ .-.||++|+.+
T Consensus 95 tr~sd~yrr-----fr~v------de~~i~eiyrl~e~~~k~sqr~~v~gams~glay~n~~~~e~slkSriliftls-- 161 (296)
T COG5242 95 TRNSDMYRR-----FRNV------DETDITEIYRLIEHPHKNSQRYDVGGAMSLGLAYCNHRDEETSLKSRILIFTLS-- 161 (296)
T ss_pred ccchhhhhh-----hccc------chHHHHHHHHHHhCcccccceeehhhhhhhhHHHHhhhcccccccceEEEEEec--
Confidence 12211 1111 122333333322222 22334678899999888755 34899999872
Q ss_pred CCCcccccccCCcCcccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccccccEEEEeCC
Q 001711 567 SLGVGCLKLRGDDLRVYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKYTGGQVYYYPS 644 (1021)
Q Consensus 567 t~GpG~L~~r~~~~r~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~TGG~v~~y~~ 644 (1021)
| |+. ..+|. =|-+-.-.+.+.||-||+|-+... -..|...+..|||.....++
T Consensus 162 --G------~d~---------~~qYi-----p~mnCiF~Aqk~~ipI~v~~i~g~---s~fl~Q~~daTgG~Yl~ve~ 214 (296)
T COG5242 162 --G------RDR---------KDQYI-----PYMNCIFAAQKFGIPISVFSIFGN---SKFLLQCCDATGGDYLTVED 214 (296)
T ss_pred --C------chh---------hhhhc-----hhhhheeehhhcCCceEEEEecCc---cHHHHHHhhccCCeeEeecC
Confidence 2 211 11111 122223345678999999976654 34578899999998777664
No 91
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=58.82 E-value=5.7e+02 Score=33.99 Aligned_cols=9 Identities=22% Similarity=0.552 Sum_probs=3.9
Q ss_pred CccceEEcc
Q 001711 354 FICRTYVNP 362 (1021)
Q Consensus 354 ~rCrAYiNP 362 (1021)
.||.+-.++
T Consensus 960 ~r~~a~~~~ 968 (1049)
T KOG0307|consen 960 QRCSARTDP 968 (1049)
T ss_pred HHhhccCCH
Confidence 444444443
No 92
>PF10138 vWA-TerF-like: vWA found in TerF C terminus ; InterPro: IPR019303 This entry represents the N-terminal domain of a family of proteins that confer resistance to the metalloid element tellurium and its salts.
Probab=54.31 E-value=2.6e+02 Score=30.10 Aligned_cols=144 Identities=17% Similarity=0.247 Sum_probs=84.8
Q ss_pred EEEEEecchhH---HhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCCCCC
Q 001711 430 YFFLIDVSISA---IRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVPLPD 506 (1021)
Q Consensus 430 yvFvIDvS~~a---v~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~Pl~~ 506 (1021)
..+|||.|.++ -++|.++.+.+.|...=..+-++ ..|=+.+|++..+= +.+
T Consensus 4 V~LVLD~SGSM~~~yk~G~vQ~~~Er~lalA~~~DdD--G~i~v~~Fs~~~~~--------------~~~---------- 57 (200)
T PF10138_consen 4 VYLVLDISGSMRPLYKDGTVQRVVERILALAAQFDDD--GEIDVWFFSTEFDR--------------LPD---------- 57 (200)
T ss_pred EEEEEeCCCCCchhhhCccHHHHHHHHHHHHhhcCCC--CceEEEEeCCCCCc--------------CCC----------
Confidence 56899999987 67788888888888776666544 44555555543221 111
Q ss_pred ccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc-C---CEEEEEec-CCCCCCcccccccCCcCc
Q 001711 507 DLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL-G---GKLLIFQN-SLPSLGVGCLKLRGDDLR 581 (1021)
Q Consensus 507 ~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~-G---GkIivF~s-g~Pt~GpG~L~~r~~~~r 581 (1021)
+.+.+....|..+...+..+ .....+...+||+.++.--... + .-+++|++ |-|+ .+
T Consensus 58 ---vt~~~~~~~v~~~~~~~~~~---~~~G~t~y~~vm~~v~~~y~~~~~~~~P~~VlFiTDG~~~-------~~----- 119 (200)
T PF10138_consen 58 ---VTLDNYEGYVDELHAGLPDW---GRMGGTNYAPVMEDVLDHYFKREPSDAPALVLFITDGGPD-------DR----- 119 (200)
T ss_pred ---cCHHHHHHHHHHHhcccccc---CCCCCcchHHHHHHHHHHHhhcCCCCCCeEEEEEecCCcc-------ch-----
Confidence 12334445555555444222 2234477889999988776532 1 24555554 3221 11
Q ss_pred ccCCCccccCCCCCcHHHHHHHHHHhhCCcEEEEEEecCCCcChhhhhhhccc
Q 001711 582 VYGTDKEHSLRIPEDPFYKQMAADLTKFQIAVNVYAFSDKYTDIASLGTLAKY 634 (1021)
Q Consensus 582 ~~gt~~e~~l~~pa~~fY~~La~~~~~~gIsVDlF~~s~~~~diatl~~L~~~ 634 (1021)
.--+++-.+++...|-.-..-++.+..++ |..|-.+
T Consensus 120 ---------------~~~~~~i~~as~~pifwqFVgiG~~~f~f--L~kLD~l 155 (200)
T PF10138_consen 120 ---------------RAIEKLIREASDEPIFWQFVGIGDSNFGF--LEKLDDL 155 (200)
T ss_pred ---------------HHHHHHHHhccCCCeeEEEEEecCCcchH--HHHhhcc
Confidence 11245566667777888877777776554 6666664
No 93
>PF05762 VWA_CoxE: VWA domain containing CoxE-like protein; InterPro: IPR008912 This group of proteins contains a VWA type domain and the function of this family is unknown. It is found as part of a CO oxidising (Cox) system operon in several bacteria [].
Probab=44.34 E-value=32 Score=37.27 Aligned_cols=102 Identities=16% Similarity=0.244 Sum_probs=52.8
Q ss_pred CCCC-eEEEEEecchhHHhhcHHHHHHHHHHHHHhcCCCCCCceEEEEEEcCeEEEEecCCCCCCcceeeccccccccCC
Q 001711 425 PMPP-LYFFLIDVSISAIRSGMLEVVAQTIKSCLDELPGFPRTQIGFITFDSTIHFYNMKSSLTQPQMMVISDLDDIFVP 503 (1021)
Q Consensus 425 p~pp-~yvFvIDvS~~av~sG~l~~~~~sI~~~L~~Lp~~~rt~VgiITFds~V~fynl~~~~~~p~mlVvsDldd~f~P 503 (1021)
+..+ -+|+|+|||.++.. +...++..+..+.... .+|.++.|++.|. .+. +.+.
T Consensus 54 ~~~~~~lvvl~DvSGSM~~--~s~~~l~~~~~l~~~~-----~~~~~f~F~~~l~--~vT---------------~~l~- 108 (222)
T PF05762_consen 54 PRKPRRLVVLCDVSGSMAG--YSEFMLAFLYALQRQF-----RRVRVFVFSTRLT--EVT---------------PLLR- 108 (222)
T ss_pred cCCCccEEEEEeCCCChHH--HHHHHHHHHHHHHHhC-----CCEEEEEEeeehh--hhh---------------hhhc-
Confidence 3444 89999999998853 3333333333333222 2577778876543 111 1110
Q ss_pred CCCccceehhhhHHHHHHHHhhCCCcccCCCCcccchHHHHHHHHHHHHhc---CCEEEEEecC
Q 001711 504 LPDDLLVNLSESRSVVDTLLDSLPSMFQDNMNVESAFGPALKAAFMVMSRL---GGKLLIFQNS 564 (1021)
Q Consensus 504 l~~~lLv~l~es~~~I~~lLd~Lp~~f~~~~~~~~alG~AL~aA~~lL~~~---GGkIivF~sg 564 (1021)
. .+-.+.+..+...... -..++.+|.||+.+...+... +-.|+++.++
T Consensus 109 --~------~~~~~~l~~~~~~~~~-----~~GgTdi~~aL~~~~~~~~~~~~~~t~vvIiSDg 159 (222)
T PF05762_consen 109 --R------RDPEEALARLSALVQS-----FGGGTDIGQALREFLRQYARPDLRRTTVVIISDG 159 (222)
T ss_pred --c------CCHHHHHHHHHhhccC-----CCCccHHHHHHHHHHHHhhcccccCcEEEEEecc
Confidence 0 0111223333222221 345677899999888877632 3456666554
No 94
>KOG2893 consensus Zn finger protein [General function prediction only]
Probab=40.44 E-value=1.3e+02 Score=32.81 Aligned_cols=10 Identities=30% Similarity=0.464 Sum_probs=4.7
Q ss_pred ehhhhHHHHH
Q 001711 511 NLSESRSVVD 520 (1021)
Q Consensus 511 ~l~es~~~I~ 520 (1021)
.|+|.|..+-
T Consensus 323 sleerraqlp 332 (341)
T KOG2893|consen 323 SLEERRAQLP 332 (341)
T ss_pred cHHHHhhhhh
Confidence 3455555443
No 95
>PF02905 EBV-NA1: Epstein Barr virus nuclear antigen-1, DNA-binding domain; InterPro: IPR004186 The Epstein-Barr virus (strain GD1) nuclear antigen 1 (EBNA1) binds to and activates DNA replication from the latent origin of replication. The crystal structure of the DNA-binding and dimerization domains were solved [], and it was found that EBNA1 appears to bind DNA via two independent regions, the core and the flanking DNA-binding domains. This DNA-binding domain has a ferredoxin-like fold.; GO: 0003677 DNA binding, 0003688 DNA replication origin binding, 0006260 DNA replication, 0006275 regulation of DNA replication, 0045893 positive regulation of transcription, DNA-dependent, 0042025 host cell nucleus; PDB: 1B3T_B 1VHI_B.
Probab=32.45 E-value=71 Score=31.52 Aligned_cols=33 Identities=24% Similarity=0.338 Sum_probs=24.5
Q ss_pred HHHHHHHHHHHHhcCCC-CCCceEEEEEEcCeEE
Q 001711 446 LEVVAQTIKSCLDELPG-FPRTQIGFITFDSTIH 478 (1021)
Q Consensus 446 l~~~~~sI~~~L~~Lp~-~~rt~VgiITFds~V~ 478 (1021)
.+.++++|+.-+..-|. ..+++|.+++||+.|-
T Consensus 112 Ae~vkDAi~Dyi~T~P~PT~~~~Vt~~~Fd~~V~ 145 (146)
T PF02905_consen 112 AECVKDAIRDYIMTRPQPTCNTQVTVCSFDDGVM 145 (146)
T ss_dssp HHHHHHHHHHHHCTS-TTGGGEEEEEEEEEEEE-
T ss_pred HHHHHHHHHHHhcCCCCCCcceEEEEEeCCCCCc
Confidence 36788899888877653 3458999999998764
No 96
>KOG1923 consensus Rac1 GTPase effector FRL [Signal transduction mechanisms; Cytoskeleton]
Probab=31.72 E-value=1.5e+02 Score=37.59 Aligned_cols=6 Identities=33% Similarity=0.622 Sum_probs=2.6
Q ss_pred EEEEec
Q 001711 477 IHFYNM 482 (1021)
Q Consensus 477 V~fynl 482 (1021)
||-++|
T Consensus 465 ih~~dL 470 (830)
T KOG1923|consen 465 IHPLDL 470 (830)
T ss_pred hhhccc
Confidence 444444
No 97
>KOG4672 consensus Uncharacterized conserved low complexity protein [Function unknown]
Probab=31.48 E-value=2.7e+02 Score=32.87 Aligned_cols=6 Identities=67% Similarity=1.370 Sum_probs=2.3
Q ss_pred CCCCCC
Q 001711 150 PMGSPV 155 (1021)
Q Consensus 150 ~~~~~~ 155 (1021)
+||++|
T Consensus 381 p~Gp~p 386 (487)
T KOG4672|consen 381 PMGPPP 386 (487)
T ss_pred CCCCCC
Confidence 344333
No 98
>PF10058 DUF2296: Predicted integral membrane metal-binding protein (DUF2296); InterPro: IPR019273 This domain, found mainly in the eukaryotic lunapark proteins, has no known function [].
Probab=25.72 E-value=55 Score=27.74 Aligned_cols=13 Identities=38% Similarity=0.912 Sum_probs=11.0
Q ss_pred CceEEEcCCCCCC
Q 001711 370 GRKWRCNICALLN 382 (1021)
Q Consensus 370 g~~W~Cn~C~~~N 382 (1021)
.-+|+|..|+..|
T Consensus 42 ~i~y~C~~Cg~~N 54 (54)
T PF10058_consen 42 EIQYRCPYCGALN 54 (54)
T ss_pred ceEEEcCCCCCcC
Confidence 4589999999887
No 99
>KOG1985 consensus Vesicle coat complex COPII, subunit SEC24/subunit SFB2 [Intracellular trafficking, secretion, and vesicular transport]
Probab=25.13 E-value=1.3e+03 Score=30.11 Aligned_cols=24 Identities=25% Similarity=0.433 Sum_probs=15.6
Q ss_pred EEccceeEecCCceEEEcCCCCC-CC
Q 001711 359 YVNPYVTFTDAGRKWRCNICALL-ND 383 (1021)
Q Consensus 359 YiNPf~~f~~~g~~W~Cn~C~~~-N~ 383 (1021)
++++-+.+.. +.--+|.-|.+. |.
T Consensus 206 d~~~~p~~~~-~~IvRCr~CRtYiNP 230 (887)
T KOG1985|consen 206 DIDPLPVITS-TLIVRCRRCRTYINP 230 (887)
T ss_pred ccCCCCcccC-CceeeehhhhhhcCC
Confidence 5555555544 568889999864 53
No 100
>PF12257 DUF3608: Protein of unknown function (DUF3608); InterPro: IPR022046 This domain family is found in eukaryotes, and is approximately 280 amino acids in length. The family is found in association with PF00610 from PFAM.
Probab=23.95 E-value=8.3e+02 Score=27.77 Aligned_cols=28 Identities=11% Similarity=0.113 Sum_probs=22.4
Q ss_pred cHHHHHHHHHHhhCCcEEEEEEecCCCc
Q 001711 596 DPFYKQMAADLTKFQIAVNVYAFSDKYT 623 (1021)
Q Consensus 596 ~~fY~~La~~~~~~gIsVDlF~~s~~~~ 623 (1021)
.+.++-..+++...||++|+.+.+..-.
T Consensus 246 ~~ll~~T~~rl~~~gi~~DlIcL~~~PL 273 (281)
T PF12257_consen 246 YDLLRLTTQRLLDNGIGIDLICLSKPPL 273 (281)
T ss_pred HHHHHHHHHHHHhcCccEEEEEcCCCCc
Confidence 3456778889999999999999876543
No 101
>COG5415 Predicted integral membrane metal-binding protein [General function prediction only]
Probab=23.51 E-value=34 Score=36.58 Aligned_cols=33 Identities=15% Similarity=0.215 Sum_probs=25.4
Q ss_pred CccceEEccceeEecC--------CceEEEcCCCCCCCCCc
Q 001711 354 FICRTYVNPYVTFTDA--------GRKWRCNICALLNDVPG 386 (1021)
Q Consensus 354 ~rCrAYiNPf~~f~~~--------g~~W~Cn~C~~~N~vP~ 386 (1021)
..=.|.|+|.|.+-.| -..|+|.+|++.|+.+.
T Consensus 188 ~~~~alIC~~C~hhngl~~~~ek~~~efiC~~Cn~~n~~~~ 228 (251)
T COG5415 188 SPFKALICPQCHHHNGLYRLAEKPIIEFICPHCNHKNDEVK 228 (251)
T ss_pred CchhhhccccccccccccccccccchheecccchhhcCccc
Confidence 5566888888877654 34799999999997664
No 102
>COG1580 FliL Flagellar basal body-associated protein [Cell motility and secretion]
Probab=22.77 E-value=2.5e+02 Score=29.19 Aligned_cols=65 Identities=15% Similarity=0.253 Sum_probs=42.7
Q ss_pred CceeEEEEEEEEEecCCcEEEEEEeeeecccCCHHHHHHhcCH--hHHHHHHHHHHHHHHhc-CCHHHHHHHHHHHHHHH
Q 001711 721 TQTVYFQVALLYTASCGERRIRVHTLAAPVVSNLSDMYQQADT--GAIVSVFSRLAIEKTLS-HKLEDARNAVQLRLVKA 797 (1021)
Q Consensus 721 ~~~~~iQ~AllYTt~~GeRrIRV~Tl~lpvt~~l~~vf~s~D~--eai~~~laK~a~~~~l~-~~l~d~R~~l~~~lv~i 797 (1021)
....|+|+++.|--.+ .....++=+.-.. ++++.+|+++.++.+.. .+.++.|+++.++|-.+
T Consensus 76 ~~~~~v~i~i~l~~~n--------------~~~~~el~~~~p~vrd~li~lfsskt~~eL~t~~Gke~Lk~ei~~~in~~ 141 (159)
T COG1580 76 PKDRYVKIAITLEVAN--------------KALLEELEEKKPEVRDALLMLFSSKTAAELSTPEGKEKLKAEIKDRINTI 141 (159)
T ss_pred CCcEEEEEEEEEeeCC--------------HHHHHHHHHhhHHHHHHHHHHHHhCCHHHhcCchhHHHHHHHHHHHHHHH
Confidence 3557777777765332 1112333333332 79999999999988877 67777888888887776
Q ss_pred HH
Q 001711 798 LK 799 (1021)
Q Consensus 798 L~ 799 (1021)
|.
T Consensus 142 L~ 143 (159)
T COG1580 142 LK 143 (159)
T ss_pred Hh
Confidence 63
No 103
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=21.81 E-value=47 Score=34.59 Aligned_cols=15 Identities=27% Similarity=1.068 Sum_probs=11.7
Q ss_pred CCceEEEcCCCCCCC
Q 001711 369 AGRKWRCNICALLND 383 (1021)
Q Consensus 369 ~g~~W~Cn~C~~~N~ 383 (1021)
+|+.|+|..||+.-.
T Consensus 131 ~~~~~vC~vCGy~~~ 145 (166)
T COG1592 131 EGKVWVCPVCGYTHE 145 (166)
T ss_pred cCCEEEcCCCCCccc
Confidence 466899999998653
No 104
>KOG4368 consensus Predicted RNA binding protein, contains SWAP, RPR and G-patch domains [General function prediction only]
Probab=21.09 E-value=1.6e+03 Score=28.04 Aligned_cols=151 Identities=17% Similarity=0.121 Sum_probs=0.0
Q ss_pred CCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccC
Q 001711 81 FNDPSVSSSPITYVPPTSGPF-QRFPTPQFPPVAQAPPVRGPPVGLPPVSHPIGQVPNPPVPLRAQPPPVPMGSPVQRAN 159 (1021)
Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (1021)
+..||...|-.-..++.++++ |.+|..+ +.+...+-+++.+.+.++-..++.+++..-.-. .|+
T Consensus 291 ~~~~p~~GPgdH~h~~~~~p~dq~hpqA~-------~~~~~~prqpp~p~~~~~~P~~p~~~~~h~~~~----~pg---- 355 (757)
T KOG4368|consen 291 TPPPPAPGPGPHDQIPPNKPFDQPHPVAP-------WGQQQPPEQPPYPHHQGGPPHCPPWNNSHEGRG----DPG---- 355 (757)
T ss_pred cCCCCCCCCCcccccCCCCCCCCCCCCCC-------CCCCCCccCCCCCCcccCCCCCCCCCcccccCC----CCC----
Q ss_pred CCCCCCCCCCCCCCCCccCCCCCCCCCCCCCCCCCCCCCCCCCCccCCC----CCCCCCCCCCCCCCCCCCCCCCCCCCC
Q 001711 160 FAPSGVNVPQPLSDSSFSASRPNSPPDSSYPFARPTPQQPLPGYVTTQP----NAVSQGPTMPSSFPSHPRSYVPPPPTS 235 (1021)
Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (1021)
+.|+..+++.+ ....-+++...++...+.-++.++++..+.+ +.-++.++.+++|..-+.....+--+.
T Consensus 356 ~pGp~~~n~g~-------a~g~q~~~p~~~~~~q~p~~g~epp~~~q~~~~~~qq~~Q~~qp~hp~n~~ppgq~q~d~s~ 428 (757)
T KOG4368|consen 356 WNGPWNNNPDA-------AWGSQFEGPWNSQHEQPPWGGGEPPFRMQGPFPPHQQHPQFNQPPHPFNRFPPRFMQDDFPP 428 (757)
T ss_pred CCCCCCCCCCC-------CcccccCCccccccccCcccCCCCchhhcCcCchhhhccccCCCCCccccCChhhcccccCc
Q ss_pred CCCCCCCCCCCCCCCCCC
Q 001711 236 ASSFPAHQGGYVPPGVQS 253 (1021)
Q Consensus 236 ~~~~~~~~~~~~~~~~~~ 253 (1021)
..++..+......+++..
T Consensus 429 ~~~~~~~p~~~~~~~p~~ 446 (757)
T KOG4368|consen 429 RHPFERPPYPHRFDYPQG 446 (757)
T ss_pred ccccccCccccccCCCCC
No 105
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=20.69 E-value=47 Score=21.86 Aligned_cols=12 Identities=25% Similarity=0.642 Sum_probs=7.5
Q ss_pred EEEcCCCCCCCC
Q 001711 373 WRCNICALLNDV 384 (1021)
Q Consensus 373 W~Cn~C~~~N~v 384 (1021)
|+|.+|+....-
T Consensus 1 ~~C~~C~~~~~~ 12 (24)
T PF13894_consen 1 FQCPICGKSFRS 12 (24)
T ss_dssp EE-SSTS-EESS
T ss_pred CCCcCCCCcCCc
Confidence 789999886543
No 106
>COG3285 Predicted eukaryotic-type DNA primase [DNA replication, recombination, and repair]
Probab=20.28 E-value=4.2e+02 Score=30.21 Aligned_cols=15 Identities=13% Similarity=0.040 Sum_probs=12.4
Q ss_pred CccceEEccceeEec
Q 001711 354 FICRTYVNPYVTFTD 368 (1021)
Q Consensus 354 ~rCrAYiNPf~~f~~ 368 (1021)
++|-.++.++++-.+
T Consensus 66 Kha~~~~p~~v~~~~ 80 (299)
T COG3285 66 KHAPRGAPPWVQTVR 80 (299)
T ss_pred ccCCCCCCchheeee
Confidence 899999999987654
Done!