Query 007491
Match_columns 601
No_of_seqs 91 out of 93
Neff 3.5
Searched_HMMs 46136
Date Thu Mar 28 11:19:33 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/007491.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/007491hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF11265 Med25_VWA: Mediator c 100.0 2.4E-69 5.2E-74 531.0 21.9 210 3-228 13-226 (226)
2 cd01480 vWA_collagen_alpha_1-V 99.1 2.8E-09 6.1E-14 100.2 17.4 153 3-198 2-160 (186)
3 PF13519 VWA_2: von Willebrand 99.1 1.1E-09 2.4E-14 96.7 12.9 144 5-204 1-154 (172)
4 cd01450 vWFA_subfamily_ECM Von 99.0 9.7E-09 2.1E-13 90.3 15.6 147 4-198 1-152 (161)
5 cd01482 vWA_collagen_alphaI-XI 99.0 2.1E-08 4.5E-13 92.1 18.0 135 4-187 1-137 (164)
6 cd01481 vWA_collagen_alpha3-VI 99.0 7.7E-09 1.7E-13 96.8 15.2 140 4-189 1-142 (165)
7 cd01472 vWA_collagen von Wille 99.0 2.6E-08 5.6E-13 90.8 17.3 150 4-201 1-153 (164)
8 cd01469 vWA_integrins_alpha_su 99.0 2.2E-08 4.7E-13 93.7 16.0 138 4-187 1-139 (177)
9 cd01476 VWA_integrin_invertebr 98.8 2.5E-07 5.5E-12 83.7 17.4 138 4-189 1-140 (163)
10 cd01474 vWA_ATR ATR (Anthrax T 98.8 8.8E-08 1.9E-12 89.7 14.8 136 2-187 3-141 (185)
11 cd01473 vWA_CTRP CTRP for CS 98.8 1.2E-07 2.6E-12 91.0 14.7 152 4-202 1-160 (192)
12 cd01475 vWA_Matrilin VWA_Matri 98.8 2.1E-07 4.5E-12 90.2 16.0 138 3-188 2-143 (224)
13 PF00092 VWA: von Willebrand f 98.7 3.9E-07 8.4E-12 81.7 14.6 148 5-199 1-154 (178)
14 cd01453 vWA_transcription_fact 98.7 6.7E-07 1.5E-11 85.2 16.7 150 4-204 4-161 (183)
15 cd01477 vWA_F09G8-8_type VWA F 98.7 4.1E-07 9E-12 87.9 15.2 145 3-186 19-167 (193)
16 cd00198 vWFA Von Willebrand fa 98.7 1.3E-06 2.9E-11 74.7 16.6 146 4-198 1-152 (161)
17 cd01471 vWA_micronemal_protein 98.7 7.7E-07 1.7E-11 82.8 15.6 139 4-188 1-145 (186)
18 smart00187 INB Integrin beta s 98.6 1E-06 2.3E-11 95.1 16.7 172 4-198 100-309 (423)
19 cd01467 vWA_BatA_type VWA BatA 98.5 5E-06 1.1E-10 76.3 14.6 133 3-186 2-138 (180)
20 smart00327 VWA von Willebrand 98.5 1.1E-05 2.5E-10 71.6 16.4 149 3-199 1-156 (177)
21 cd01454 vWA_norD_type norD typ 98.4 1.2E-05 2.7E-10 74.4 15.2 157 5-201 2-169 (174)
22 cd01456 vWA_ywmD_type VWA ywmD 98.4 9.6E-06 2.1E-10 77.3 14.9 150 3-204 20-195 (206)
23 cd01470 vWA_complement_factors 98.4 1E-05 2.3E-10 76.5 14.2 123 4-139 1-126 (198)
24 PF13768 VWA_3: von Willebrand 98.3 9.9E-06 2.2E-10 73.2 13.0 107 4-137 1-108 (155)
25 cd01463 vWA_VGCC_like VWA Volt 98.3 1.8E-05 3.9E-10 74.6 15.0 116 3-137 13-135 (190)
26 cd01452 VWA_26S_proteasome_sub 98.3 2.7E-05 5.9E-10 76.0 15.2 145 6-202 6-162 (187)
27 cd01465 vWA_subgroup VWA subgr 98.3 8.4E-05 1.8E-09 67.2 17.1 150 4-204 1-156 (170)
28 COG1240 ChlD Mg-chelatase subu 98.2 2.7E-05 5.8E-10 79.8 14.5 157 4-205 79-241 (261)
29 TIGR03436 acidobact_VWFA VWFA- 98.2 5.7E-05 1.2E-09 75.8 16.5 154 3-204 53-234 (296)
30 PTZ00441 sporozoite surface pr 98.2 3.2E-05 6.9E-10 86.5 15.1 139 2-186 41-184 (576)
31 PRK13685 hypothetical protein; 98.2 4.2E-05 9.2E-10 78.9 15.0 141 4-187 89-235 (326)
32 cd01451 vWA_Magnesium_chelatas 98.1 0.00016 3.5E-09 67.6 16.6 154 6-205 3-161 (178)
33 cd01466 vWA_C3HC4_type VWA C3H 98.1 7.8E-05 1.7E-09 68.5 13.8 144 4-203 1-149 (155)
34 cd01461 vWA_interalpha_trypsin 98.1 0.0001 2.3E-09 66.4 14.0 148 3-204 2-155 (171)
35 PRK13406 bchD magnesium chelat 98.1 0.00012 2.6E-09 82.1 17.0 160 3-204 401-563 (584)
36 cd01464 vWA_subfamily VWA subf 98.1 5E-05 1.1E-09 70.6 11.8 154 1-198 1-158 (176)
37 cd01458 vWA_ku Ku70/Ku80 N-ter 98.0 4E-05 8.6E-10 74.2 10.8 152 5-188 3-170 (218)
38 TIGR02031 BchD-ChlD magnesium 98.0 0.00013 2.9E-09 81.5 15.5 165 3-204 407-576 (589)
39 PF00362 Integrin_beta: Integr 97.9 0.00013 2.8E-09 79.2 13.2 180 4-198 103-312 (426)
40 cd01462 VWA_YIEM_type VWA YIEM 97.9 0.00056 1.2E-08 61.7 14.4 104 4-135 1-104 (152)
41 TIGR02442 Cob-chelat-sub cobal 97.7 0.00067 1.5E-08 76.4 15.8 158 3-204 465-628 (633)
42 TIGR00868 hCaCC calcium-activa 97.2 0.0081 1.7E-07 70.6 16.7 106 3-137 304-416 (863)
43 cd01460 vWA_midasin VWA_Midasi 97.2 0.0051 1.1E-07 63.4 13.5 133 3-186 60-200 (266)
44 PF03731 Ku_N: Ku70/Ku80 N-ter 97.1 0.016 3.5E-07 56.0 15.1 141 6-186 2-170 (224)
45 TIGR03788 marine_srt_targ mari 97.1 0.0027 5.9E-08 70.8 11.0 107 3-136 271-380 (596)
46 cd01455 vWA_F11C1-5a_type Von 96.8 0.026 5.6E-07 56.0 13.7 151 4-201 1-165 (191)
47 cd01457 vWA_ORF176_type VWA OR 96.7 0.043 9.4E-07 52.5 13.6 111 3-139 2-121 (199)
48 PF04056 Ssl1: Ssl1-like; Int 96.5 0.048 1E-06 54.2 12.8 160 9-231 1-167 (193)
49 PF04811 Sec23_trunk: Sec23/Se 96.4 0.032 6.9E-07 55.2 11.1 178 5-205 5-220 (243)
50 COG4245 TerY Uncharacterized p 96.4 0.023 4.9E-07 56.9 10.0 155 1-202 1-163 (207)
51 TIGR00578 ku70 ATP-dependent D 95.2 0.3 6.4E-06 55.4 13.9 148 6-186 13-179 (584)
52 cd01479 Sec24-like Sec24-like: 93.6 1.1 2.4E-05 45.2 12.3 177 5-205 5-215 (244)
53 PF05762 VWA_CoxE: VWA domain 92.9 0.87 1.9E-05 45.1 10.2 129 5-189 59-188 (222)
54 cd01468 trunk_domain trunk dom 92.8 3.2 6.8E-05 41.4 14.1 178 6-205 6-218 (239)
55 KOG1226 Integrin beta subunit 91.9 0.9 2E-05 53.2 10.1 179 4-197 133-340 (783)
56 PRK10997 yieM hypothetical pro 91.8 2.9 6.2E-05 47.1 13.6 88 3-110 323-410 (487)
57 PF06707 DUF1194: Protein of u 91.6 4.1 9E-05 41.3 13.2 157 3-201 3-172 (205)
58 PF11235 Med25_SD1: Mediator c 89.0 0.7 1.5E-05 45.1 5.1 83 238-323 7-97 (168)
59 KOG2807 RNA polymerase II tran 88.0 13 0.00028 40.6 14.0 150 4-203 61-217 (378)
60 KOG2884 26S proteasome regulat 84.1 32 0.00068 36.0 14.0 179 6-238 6-187 (259)
61 KOG2353 L-type voltage-depende 81.7 7.7 0.00017 47.7 10.1 190 2-242 224-419 (1104)
62 cd01478 Sec23-like Sec23-like: 81.6 33 0.00071 35.6 13.3 117 80-205 119-250 (267)
63 PF10138 vWA-TerF-like: vWA fo 81.2 41 0.0009 34.1 13.5 145 3-199 1-154 (200)
64 COG5148 RPN10 26S proteasome r 79.9 36 0.00078 35.0 12.5 164 6-224 6-171 (243)
65 KOG1984 Vesicle coat complex C 79.7 24 0.00051 42.8 12.7 163 4-190 418-613 (1007)
66 TIGR00627 tfb4 transcription f 79.1 38 0.00083 35.8 13.0 84 95-204 117-203 (279)
67 PF09967 DUF2201: VWA-like dom 77.8 7.8 0.00017 35.6 6.7 77 6-105 1-78 (126)
68 COG2425 Uncharacterized protei 75.0 23 0.00051 39.7 10.6 83 5-103 274-357 (437)
69 KOG3768 DEAD box RNA helicase 72.7 4.1 8.9E-05 47.2 4.2 87 6-104 4-98 (888)
70 PTZ00395 Sec24-related protein 70.1 34 0.00073 43.4 11.1 160 5-190 954-1155(1560)
71 PLN00162 transport protein sec 69.6 72 0.0016 38.0 13.5 38 169-206 331-372 (761)
72 TIGR00670 asp_carb_tr aspartat 61.8 35 0.00076 36.1 8.2 93 62-191 94-187 (301)
73 cd01459 vWA_copine_like VWA Co 60.8 89 0.0019 32.6 10.8 91 3-110 31-148 (254)
74 KOG1781 Small Nuclear ribonucl 60.4 13 0.00029 34.2 4.2 49 11-61 39-87 (108)
75 PRK02102 ornithine carbamoyltr 56.4 44 0.00096 36.0 8.0 93 62-192 100-192 (331)
76 PRK13814 pyrB aspartate carbam 50.1 54 0.0012 35.0 7.3 92 64-191 102-195 (310)
77 PRK14804 ornithine carbamoyltr 49.6 94 0.002 33.1 9.0 103 62-201 96-200 (311)
78 PRK00779 ornithine carbamoyltr 49.3 72 0.0016 33.8 8.1 92 62-192 97-188 (304)
79 PRK00856 pyrB aspartate carbam 49.2 61 0.0013 34.4 7.6 93 62-191 99-193 (305)
80 PLN02342 ornithine carbamoyltr 47.4 69 0.0015 34.8 7.8 101 62-201 139-239 (348)
81 PRK14805 ornithine carbamoyltr 46.9 83 0.0018 33.4 8.1 97 62-197 92-188 (302)
82 COG1721 Uncharacterized conser 45.4 2E+02 0.0044 31.3 11.0 74 3-93 224-303 (416)
83 PRK04284 ornithine carbamoyltr 45.3 90 0.002 33.6 8.2 100 62-198 99-198 (332)
84 TIGR01651 CobT cobaltochelatas 43.7 2.6E+02 0.0057 32.9 11.9 64 100-186 499-563 (600)
85 TIGR00658 orni_carb_tr ornithi 43.1 96 0.0021 32.8 7.9 92 62-192 93-184 (304)
86 PRK01713 ornithine carbamoyltr 41.2 1E+02 0.0022 33.2 7.8 94 62-192 100-193 (334)
87 TIGR03316 ygeW probable carbam 37.6 1.5E+02 0.0032 32.5 8.4 111 62-201 96-222 (357)
88 PRK03515 ornithine carbamoyltr 36.7 72 0.0016 34.4 5.9 95 62-192 99-193 (336)
89 PLN02527 aspartate carbamoyltr 35.8 1.7E+02 0.0036 31.2 8.3 93 62-191 95-189 (306)
90 PRK12562 ornithine carbamoyltr 34.4 88 0.0019 33.8 6.1 95 62-192 99-193 (334)
91 PRK02255 putrescine carbamoylt 33.5 97 0.0021 33.5 6.2 94 62-191 96-189 (338)
92 PF03850 Tfb4: Transcription f 31.2 6.6E+02 0.014 26.6 13.9 83 97-204 116-201 (276)
93 COG5151 SSL1 RNA polymerase II 29.9 8.2E+02 0.018 27.3 13.4 188 4-254 88-286 (421)
94 PRK07200 aspartate/ornithine c 29.2 2.2E+02 0.0047 31.7 8.1 108 62-199 113-237 (395)
95 TIGR01285 nifN nitrogenase mol 24.5 3.2E+02 0.007 30.2 8.4 88 73-201 55-156 (432)
96 TIGR02886 spore_II_AA anti-sig 21.5 1E+02 0.0022 26.4 3.2 41 162-204 53-93 (106)
97 PRK04523 N-acetylornithine car 20.5 3.4E+02 0.0073 29.4 7.4 82 75-191 127-211 (335)
98 cd06844 STAS Sulphate Transpor 20.4 1E+02 0.0023 26.3 3.0 42 162-205 53-94 (100)
99 PF12637 TSCPD: TSCPD domain; 20.3 1.3E+02 0.0028 26.8 3.6 32 75-106 55-94 (95)
100 PF07611 DUF1574: Protein of u 20.1 1.1E+02 0.0024 33.5 3.8 35 169-203 254-288 (345)
No 1
>PF11265 Med25_VWA: Mediator complex subunit 25 von Willebrand factor type A; InterPro: IPR021419 The overall function of the full-length Med25 is efficiently to coordinate the transcriptional activation of RAR/RXR (retinoic acid receptor/retinoic X receptor) in higher eukaryotic cells. Human Med25 consists of several domains with different binding properties, the N-terminal, VWA domain which is this one, an SD2 domain from residues 229-381, a PTOV(B) or ACID domain from 395-545, an SD2 domain from residues 564-645 and a C-terminal NR box-containing domain (646-650) from 646-747. This VWA or von Willebrand factor type A domain when bound to RAR and the histone acetyltransferase CBP is responsible for recruiting Med1 to the rest of the Mediator complex [].
Probab=100.00 E-value=2.4e-69 Score=531.03 Aligned_cols=210 Identities=43% Similarity=0.699 Sum_probs=194.2
Q ss_pred cceEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCC--CcceEEEEEeecCCCccccceeeccCCcCHHHH
Q 007491 3 EKQLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSA--SNVELSLVTFNTHGSYCACLVQRSGWTKDVDIF 80 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~--~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~F 80 (601)
++|||||||+||+||+||++||+|||+|+||||++|++.|+++++ .+++||||+||+||||++++|++++||+|+++|
T Consensus 13 ~~~vVfvvEgTAalgpy~~~Lkt~Yl~P~le~f~~g~~~e~~~~~~~~~t~y~LVvf~t~d~~~~~~v~~~g~T~~~~~f 92 (226)
T PF11265_consen 13 QAQVVFVVEGTAALGPYWNTLKTNYLDPILEYFNGGPIAERDFGGDYSNTEYGLVVFNTADCYPEPIVQRSGPTSSPQKF 92 (226)
T ss_pred cceEEEEEecchhhhhhHHHHHHHHHHHHHHHhcCCCcccccccccCCCceEEEEEEeccCCCcccceeccCCcCCHHHH
Confidence 689999999999999999999999999999999999999999743 779999999999999999999999999999999
Q ss_pred HHHhcCcccCCCCCc-hhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhh
Q 007491 81 LHWLSTIPFAGGGFN-DAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNE 159 (601)
Q Consensus 81 lqwLdsI~FsGGGfs-paAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~ 159 (601)
++|||+|+|.|||++ +++|+|||++||+|||+++..++..++.+.|||||||||||||++|+... +.
T Consensus 93 l~~L~~I~f~GGG~e~~a~iaEGLa~AL~~fd~~~~~r~~~~~~~~~khcILI~nSpP~~~p~~~~------------~~ 160 (226)
T PF11265_consen 93 LQWLDAIQFSGGGFESCAAIAEGLAEALQCFDDFKQMRQQQQQTDVQKHCILICNSPPYRLPVNEC------------PQ 160 (226)
T ss_pred HHHHHccCcCCCCcccchhHHHHHHHHHHHhcchhhhccccCcccccceEEEEeCCCCccccccCC------------Cc
Confidence 999999999999987 56699999999999998877777777778999999999999999997321 11
Q ss_pred hhhcccCCCHHHHHHHhhhcCcEEEEeCCCcchHHHHHHHHhcCCCCC-CCCCCCCCCCCceEEEecccc
Q 007491 160 AQAESRLSDAETVAKSFVQCSVSLSVICPKQLPKLTAIYNAAKRNPRA-ADPPVDNSKNPHFLVLISENF 228 (601)
Q Consensus 160 ~~~~s~laDaEqVAk~faEk~IsLSIIsPRklP~Lk~Lfeka~~n~~~-a~~s~dyAKdP~HLVLL~g~~ 228 (601)
.+..++|++|+++.||+|+||||||||+|+|++||||+++|+++ +++++||||||||||||||+|
T Consensus 161 ----~~~~~~d~la~~~~~~~I~LSiisPrklP~l~~Lfeka~~~~~~~~~~~~~~ak~p~hlVLl~g~~ 226 (226)
T PF11265_consen 161 ----YSGKTCDQLAVLISERNISLSIISPRKLPSLRSLFEKAKGNPRAAADPSKDYAKDPRHLVLLRGYF 226 (226)
T ss_pred ----ccCCCHHHHHHHHHhcCceEEEEcCccCHHHHHHHHhcCCCcccccccccccccCCCeEEEeecCC
Confidence 23358999999999999999999999999999999999999998 799999999999999999986
No 2
>cd01480 vWA_collagen_alpha_1-VI-type VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=99.13 E-value=2.8e-09 Score=100.19 Aligned_cols=153 Identities=10% Similarity=0.227 Sum_probs=108.7
Q ss_pred cceEEEEEecccccc-hhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCC---cCHH
Q 007491 3 EKQLIVAVEGTAAMG-PYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWT---KDVD 78 (601)
Q Consensus 3 ~~DIVFVIDgTAsMG-pYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfT---sD~~ 78 (601)
.+|||||||++++|+ ..|+.+| +++..+++.|... . ..+......++|||.|.+.. . ..++|+ .|..
T Consensus 2 ~~dvv~vlD~S~Sm~~~~~~~~k-~~~~~~~~~l~~~-~-~~~i~~~~~rvglv~fs~~~---~---~~~~l~~~~~~~~ 72 (186)
T cd01480 2 PVDITFVLDSSESVGLQNFDITK-NFVKRVAERFLKD-Y-YRKDPAGSWRVGVVQYSDQQ---E---VEAGFLRDIRNYT 72 (186)
T ss_pred CeeEEEEEeCCCccchhhHHHHH-HHHHHHHHHHhhh-h-ccCCCCCceEEEEEEecCCc---e---eeEecccccCCHH
Confidence 379999999999999 7788776 5577778877431 0 00111245899999999765 1 234555 6889
Q ss_pred HHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchh
Q 007491 79 IFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENN 158 (601)
Q Consensus 79 ~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~ 158 (601)
.+++.|++|++.||| -++.+||..|++.+.+. ......|++||++|-.+.....
T Consensus 73 ~l~~~i~~l~~~gg~---T~~~~AL~~a~~~l~~~-------~~~~~~~~iillTDG~~~~~~~---------------- 126 (186)
T cd01480 73 SLKEAVDNLEYIGGG---TFTDCALKYATEQLLEG-------SHQKENKFLLVITDGHSDGSPD---------------- 126 (186)
T ss_pred HHHHHHHhCccCCCC---ccHHHHHHHHHHHHhcc-------CCCCCceEEEEEeCCCcCCCcc----------------
Confidence 999999999987776 57889999999877531 1235789999999887643211
Q ss_pred hhhhcccCCCHHHHHHHhhhcCcEEEEeCCCc--chHHHHHH
Q 007491 159 EAQAESRLSDAETVAKSFVQCSVSLSVICPKQ--LPKLTAIY 198 (601)
Q Consensus 159 ~~~~~s~laDaEqVAk~faEk~IsLSIIsPRk--lP~Lk~Lf 198 (601)
.+.+..++.+.++||.++.|.-.. ...|+.|=
T Consensus 127 --------~~~~~~~~~~~~~gi~i~~vgig~~~~~~L~~IA 160 (186)
T cd01480 127 --------GGIEKAVNEADHLGIKIFFVAVGSQNEEPLSRIA 160 (186)
T ss_pred --------hhHHHHHHHHHHCCCEEEEEecCccchHHHHHHH
Confidence 145778888999999999997654 33444443
No 3
>PF13519 VWA_2: von Willebrand factor type A domain; PDB: 3IBS_B 3RAG_B 2X5N_A.
Probab=99.11 E-value=1.1e-09 Score=96.70 Aligned_cols=144 Identities=16% Similarity=0.232 Sum_probs=103.9
Q ss_pred eEEEEEecccccchh------HHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHH
Q 007491 5 QLIVAVEGTAAMGPY------WQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVD 78 (601)
Q Consensus 5 DIVFVIDgTAsMGpY------i~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~ 78 (601)
|||||||.|++|..+ |+.+|.- +..+++.+. ..+++||.|.+.. . ..++||.|..
T Consensus 1 dvv~v~D~SgSM~~~~~~~~~~~~~~~~-~~~~~~~~~------------~~~v~l~~f~~~~---~---~~~~~t~~~~ 61 (172)
T PF13519_consen 1 DVVFVLDNSGSMNGYDGNRTRIDQAKDA-LNELLANLP------------GDRVGLVSFSDSS---R---TLSPLTSDKD 61 (172)
T ss_dssp EEEEEEE-SGGGGTTTSSS-HHHHHHHH-HHHHHHHHT------------TSEEEEEEESTSC---E---EEEEEESSHH
T ss_pred CEEEEEECCcccCCCCCCCcHHHHHHHH-HHHHHHHCC------------CCEEEEEEecccc---c---ccccccccHH
Confidence 899999999999999 7777644 566666641 2399999999854 1 2446889999
Q ss_pred HHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchh
Q 007491 79 IFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENN 158 (601)
Q Consensus 79 ~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~ 158 (601)
.|++.|+++...+.......+.+||.+|+.+|++.+ ...|++|||+|-.|..
T Consensus 62 ~~~~~l~~~~~~~~~~~~t~~~~al~~a~~~~~~~~---------~~~~~iv~iTDG~~~~------------------- 113 (172)
T PF13519_consen 62 ELKNALNKLSPQGMPGGGTNLYDALQEAAKMLASSD---------NRRRAIVLITDGEDNS------------------- 113 (172)
T ss_dssp HHHHHHHTHHHHG--SSS--HHHHHHHHHHHHHC-S---------SEEEEEEEEES-TTHC-------------------
T ss_pred HHHHHhhcccccccCccCCcHHHHHHHHHHHHHhCC---------CCceEEEEecCCCCCc-------------------
Confidence 999999999976444556889999999999997521 4678999999964331
Q ss_pred hhhhcccCCCHHHHHHHhhhcCcEEEEe--CCCc--chHHHHHHHHhcCC
Q 007491 159 EAQAESRLSDAETVAKSFVQCSVSLSVI--CPKQ--LPKLTAIYNAAKRN 204 (601)
Q Consensus 159 ~~~~~s~laDaEqVAk~faEk~IsLSII--sPRk--lP~Lk~Lfeka~~n 204 (601)
+..++++.+.+++|++.+| .+.. ...|+.|.+..++.
T Consensus 114 ---------~~~~~~~~~~~~~i~i~~v~~~~~~~~~~~l~~la~~tgG~ 154 (172)
T PF13519_consen 114 ---------SDIEAAKALKQQGITIYTVGIGSDSDANEFLQRLAEATGGR 154 (172)
T ss_dssp ---------HHHHHHHHHHCTTEEEEEEEES-TT-EHHHHHHHHHHTEEE
T ss_pred ---------chhHHHHHHHHcCCeEEEEEECCCccHHHHHHHHHHhcCCE
Confidence 2345888899999999999 2222 24688888766543
No 4
>cd01450 vWFA_subfamily_ECM Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A
Probab=99.04 E-value=9.7e-09 Score=90.32 Aligned_cols=147 Identities=15% Similarity=0.282 Sum_probs=105.3
Q ss_pred ceEEEEEecccccch-hHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCc--CHHHH
Q 007491 4 KQLIVAVEGTAAMGP-YWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTK--DVDIF 80 (601)
Q Consensus 4 ~DIVFVIDgTAsMGp-Yi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTs--D~~~F 80 (601)
.||+||||++++|.. +|+.++ ..+..+++.|...+ ...+++||.|.+.. .. .+.|++ +..++
T Consensus 1 ~di~~llD~S~Sm~~~~~~~~~-~~~~~~~~~~~~~~--------~~~~~~li~f~~~~---~~---~~~~~~~~~~~~~ 65 (161)
T cd01450 1 LDIVFLLDGSESVGPENFEKVK-DFIEKLVEKLDIGP--------DKTRVGLVQYSDDV---RV---EFSLNDYKSKDDL 65 (161)
T ss_pred CcEEEEEeCCCCcCHHHHHHHH-HHHHHHHHheeeCC--------CceEEEEEEEcCCc---eE---EEECCCCCCHHHH
Confidence 499999999999999 666665 44677777775311 36799999999854 22 234444 48999
Q ss_pred HHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhh
Q 007491 81 LHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEA 160 (601)
Q Consensus 81 lqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~ 160 (601)
.++|+++...+| +...+.|||..|+..+..... ......|++|||+|..++.-+
T Consensus 66 ~~~i~~~~~~~~--~~t~~~~al~~a~~~~~~~~~-----~~~~~~~~iiliTDG~~~~~~------------------- 119 (161)
T cd01450 66 LKAVKNLKYLGG--GGTNTGKALQYALEQLFSESN-----ARENVPKVIIVLTDGRSDDGG------------------- 119 (161)
T ss_pred HHHHHhcccCCC--CCccHHHHHHHHHHHhccccc-----ccCCCCeEEEEECCCCCCCCc-------------------
Confidence 999999988776 457788999999987754211 123567899999998776521
Q ss_pred hhcccCCCHHHHHHHhhhcCcEEEEeCCC--cchHHHHHH
Q 007491 161 QAESRLSDAETVAKSFVQCSVSLSVICPK--QLPKLTAIY 198 (601)
Q Consensus 161 ~~~s~laDaEqVAk~faEk~IsLSIIsPR--klP~Lk~Lf 198 (601)
+.+++++.+.+++|.+++|.=. -...|++|=
T Consensus 120 -------~~~~~~~~~~~~~v~v~~i~~g~~~~~~l~~la 152 (161)
T cd01450 120 -------DPKEAAAKLKDEGIKVFVVGVGPADEEELREIA 152 (161)
T ss_pred -------chHHHHHHHHHCCCEEEEEeccccCHHHHHHHh
Confidence 3578889999999999999332 233444443
No 5
>cd01482 vWA_collagen_alphaI-XII-like Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=99.03 E-value=2.1e-08 Score=92.06 Aligned_cols=135 Identities=16% Similarity=0.323 Sum_probs=96.9
Q ss_pred ceEEEEEecccccch-hHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHHH
Q 007491 4 KQLIVAVEGTAAMGP-YWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFLH 82 (601)
Q Consensus 4 ~DIVFVIDgTAsMGp-Yi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Flq 82 (601)
+|||||||++++|+. .|+.+|+ ++..+++.|.- +....++|||.|.+.. ...+ .+.-..+..++++
T Consensus 1 ~Dv~~vlD~S~Sm~~~~~~~~k~-~~~~l~~~~~~--------~~~~~rvgli~fs~~~---~~~~-~l~~~~~~~~l~~ 67 (164)
T cd01482 1 ADIVFLVDGSWSIGRSNFNLVRS-FLSSVVEAFEI--------GPDGVQVGLVQYSDDP---RTEF-DLNAYTSKEDVLA 67 (164)
T ss_pred CCEEEEEeCCCCcChhhHHHHHH-HHHHHHhheee--------CCCceEEEEEEECCCe---eEEE-ecCCCCCHHHHHH
Confidence 599999999999984 7777764 46777777722 1246899999999975 2211 1222467889999
Q ss_pred HhcCcccCCCCCchhhHHHHHHHHHhc-cccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhh
Q 007491 83 WLSTIPFAGGGFNDAAIAEGLSEALMM-FSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQ 161 (601)
Q Consensus 83 wLdsI~FsGGGfspaAVAEGLAeAL~~-fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~ 161 (601)
.|+++.+.||+ -.+.+||..|++. |.... + ......|.+||++|-.|..
T Consensus 68 ~l~~~~~~~g~---T~~~~aL~~a~~~~~~~~~-~----~r~~~~k~iillTDG~~~~---------------------- 117 (164)
T cd01482 68 AIKNLPYKGGN---TRTGKALTHVREKNFTPDA-G----ARPGVPKVVILITDGKSQD---------------------- 117 (164)
T ss_pred HHHhCcCCCCC---ChHHHHHHHHHHHhccccc-C----CCCCCCEEEEEEcCCCCCc----------------------
Confidence 99999988876 3688999888864 43211 1 1234678889988765431
Q ss_pred hcccCCCHHHHHHHhhhcCcEEEEeC
Q 007491 162 AESRLSDAETVAKSFVQCSVSLSVIC 187 (601)
Q Consensus 162 ~~s~laDaEqVAk~faEk~IsLSIIs 187 (601)
|.+..|+.+.+.||.+++|.
T Consensus 118 ------~~~~~a~~lk~~gi~i~~ig 137 (164)
T cd01482 118 ------DVELPARVLRNLGVNVFAVG 137 (164)
T ss_pred ------hHHHHHHHHHHCCCEEEEEe
Confidence 45778899999999999994
No 6
>cd01481 vWA_collagen_alpha3-VI-like VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=99.03 E-value=7.7e-09 Score=96.75 Aligned_cols=140 Identities=16% Similarity=0.289 Sum_probs=102.2
Q ss_pred ceEEEEEecccccc-hhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHHH
Q 007491 4 KQLIVAVEGTAAMG-PYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFLH 82 (601)
Q Consensus 4 ~DIVFVIDgTAsMG-pYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Flq 82 (601)
+||+|+||++++++ ..|+.+| ++|..+++.|.- +...+++|||.|.+.. ..++ .+.-..|.+.+++
T Consensus 1 ~DivfllD~S~Si~~~~f~~~k-~fi~~lv~~f~i--------~~~~~rVgvv~ys~~~-~~~~---~l~~~~~~~~l~~ 67 (165)
T cd01481 1 KDIVFLIDGSDNVGSGNFPAIR-DFIERIVQSLDV--------GPDKIRVAVVQFSDTP-RPEF---YLNTHSTKADVLG 67 (165)
T ss_pred CCEEEEEeCCCCcCHHHHHHHH-HHHHHHHhhccC--------CCCCcEEEEEEecCCe-eEEE---eccccCCHHHHHH
Confidence 59999999999999 8888887 678999999943 1246899999998765 2222 1233447789999
Q ss_pred HhcCcccCCCCCchhhHHHHHHHHHh-ccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhh
Q 007491 83 WLSTIPFAGGGFNDAAIAEGLSEALM-MFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQ 161 (601)
Q Consensus 83 wLdsI~FsGGGfspaAVAEGLAeAL~-~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~ 161 (601)
.+++|++.||+. ..+.+||..+.+ +|.. ..+.+ ......|..|||+|-..+.
T Consensus 68 ~i~~i~~~~g~~--t~t~~AL~~~~~~~f~~-~~g~R--~~~~~~kv~vviTdG~s~d---------------------- 120 (165)
T cd01481 68 AVRRLRLRGGSQ--LNTGSALDYVVKNLFTK-SAGSR--IEEGVPQFLVLITGGKSQD---------------------- 120 (165)
T ss_pred HHHhcccCCCCc--ccHHHHHHHHHHhhcCc-cccCC--ccCCCCeEEEEEeCCCCcc----------------------
Confidence 999999998742 357788888775 5542 11211 1234678899998764332
Q ss_pred hcccCCCHHHHHHHhhhcCcEEEEeCCC
Q 007491 162 AESRLSDAETVAKSFVQCSVSLSVICPK 189 (601)
Q Consensus 162 ~~s~laDaEqVAk~faEk~IsLSIIsPR 189 (601)
|.+..|+.+.+.||.+..|.-.
T Consensus 121 ------~~~~~a~~lr~~gv~i~~vG~~ 142 (165)
T cd01481 121 ------DVERPAVALKRAGIVPFAIGAR 142 (165)
T ss_pred ------hHHHHHHHHHHCCcEEEEEeCC
Confidence 4577899999999999999665
No 7
>cd01472 vWA_collagen von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.
Probab=99.00 E-value=2.6e-08 Score=90.82 Aligned_cols=150 Identities=13% Similarity=0.242 Sum_probs=103.6
Q ss_pred ceEEEEEecccccch-hHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHHH
Q 007491 4 KQLIVAVEGTAAMGP-YWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFLH 82 (601)
Q Consensus 4 ~DIVFVIDgTAsMGp-Yi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Flq 82 (601)
+|||||||.+++|.. .|+.+| +.+..+++.|.. .....++|||+|.+.. ... ..+....|...++.
T Consensus 1 ~Dvv~vlD~SgSm~~~~~~~~k-~~~~~~~~~l~~--------~~~~~~~giv~Fs~~~---~~~-~~~~~~~~~~~~~~ 67 (164)
T cd01472 1 ADIVFLVDGSESIGLSNFNLVK-DFVKRVVERLDI--------GPDGVRVGVVQYSDDP---RTE-FYLNTYRSKDDVLE 67 (164)
T ss_pred CCEEEEEeCCCCCCHHHHHHHH-HHHHHHHhhccc--------CCCCeEEEEEEEcCce---eEE-EecCCCCCHHHHHH
Confidence 599999999999999 777776 457788887742 1135799999999765 221 22222488999999
Q ss_pred HhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhhh
Q 007491 83 WLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQA 162 (601)
Q Consensus 83 wLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~~ 162 (601)
.|+++++.||+ -.+.+||..|.+.+.+.. ........|++|||+|--+..
T Consensus 68 ~l~~l~~~~g~---T~~~~al~~a~~~l~~~~----~~~~~~~~~~iiliTDG~~~~----------------------- 117 (164)
T cd01472 68 AVKNLRYIGGG---TNTGKALKYVRENLFTEA----SGSREGVPKVLVVITDGKSQD----------------------- 117 (164)
T ss_pred HHHhCcCCCCC---chHHHHHHHHHHHhCCcc----cCCCCCCCEEEEEEcCCCCCc-----------------------
Confidence 99999987654 468899999998775311 011234678999998873221
Q ss_pred cccCCCHHHHHHHhhhcCcEEEEeCCC--cchHHHHHHHHh
Q 007491 163 ESRLSDAETVAKSFVQCSVSLSVICPK--QLPKLTAIYNAA 201 (601)
Q Consensus 163 ~s~laDaEqVAk~faEk~IsLSIIsPR--klP~Lk~Lfeka 201 (601)
+.+..++.+.+++|.++.|.-. ....|+.|-++.
T Consensus 118 -----~~~~~~~~l~~~gv~i~~ig~g~~~~~~L~~ia~~~ 153 (164)
T cd01472 118 -----DVEEPAVELKQAGIEVFAVGVKNADEEELKQIASDP 153 (164)
T ss_pred -----hHHHHHHHHHHCCCEEEEEECCcCCHHHHHHHHCCC
Confidence 1234456677899999999433 455666665543
No 8
>cd01469 vWA_integrins_alpha_subunit Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.
Probab=98.97 E-value=2.2e-08 Score=93.71 Aligned_cols=138 Identities=12% Similarity=0.192 Sum_probs=100.2
Q ss_pred ceEEEEEecccccch-hHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHHH
Q 007491 4 KQLIVAVEGTAAMGP-YWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFLH 82 (601)
Q Consensus 4 ~DIVFVIDgTAsMGp-Yi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Flq 82 (601)
.||+||||++++|++ .|+.+| .++..+++.|.-+ ....|+|||.|.+.. ...+. +.-..+...+++
T Consensus 1 ~Di~fvlD~S~S~~~~~f~~~k-~fi~~~i~~l~~~--------~~~~rvgvv~fs~~~---~~~~~-l~~~~~~~~~~~ 67 (177)
T cd01469 1 MDIVFVLDGSGSIYPDDFQKVK-NFLSTVMKKLDIG--------PTKTQFGLVQYSESF---RTEFT-LNEYRTKEEPLS 67 (177)
T ss_pred CcEEEEEeCCCCCCHHHHHHHH-HHHHHHHHHcCcC--------CCCcEEEEEEECCce---eEEEe-cCccCCHHHHHH
Confidence 599999999999997 888877 5578888888431 135899999999875 21111 222345678899
Q ss_pred HhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhhh
Q 007491 83 WLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQA 162 (601)
Q Consensus 83 wLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~~ 162 (601)
.++.+++.||+ ..+.+||..|+..+.....+ ......|.+||++|-..+.-+
T Consensus 68 ~i~~~~~~~g~---T~~~~AL~~a~~~l~~~~~g----~R~~~~kv~illTDG~~~~~~--------------------- 119 (177)
T cd01469 68 LVKHISQLLGL---TNTATAIQYVVTELFSESNG----ARKDATKVLVVITDGESHDDP--------------------- 119 (177)
T ss_pred HHHhCccCCCC---ccHHHHHHHHHHHhcCcccC----CCCCCCeEEEEEeCCCCCCcc---------------------
Confidence 99999888876 88899999999764211111 123478899999988776522
Q ss_pred cccCCCHHHHHHHhhhcCcEEEEeC
Q 007491 163 ESRLSDAETVAKSFVQCSVSLSVIC 187 (601)
Q Consensus 163 ~s~laDaEqVAk~faEk~IsLSIIs 187 (601)
+.+.+++.+.+.||.++.|.
T Consensus 120 -----~~~~~~~~~k~~gv~v~~Vg 139 (177)
T cd01469 120 -----LLKDVIPQAEREGIIRYAIG 139 (177)
T ss_pred -----ccHHHHHHHHHCCcEEEEEE
Confidence 12667888899999999993
No 9
>cd01476 VWA_integrin_invertebrates VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.
Probab=98.84 E-value=2.5e-07 Score=83.72 Aligned_cols=138 Identities=15% Similarity=0.285 Sum_probs=96.2
Q ss_pred ceEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccce-eeccCCcCHHHHHH
Q 007491 4 KQLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLV-QRSGWTKDVDIFLH 82 (601)
Q Consensus 4 ~DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~V-qr~GfTsD~~~Flq 82 (601)
+||||+||++++|...|+..| +++..+++.|..+ ....++|||.|.+.. ...+ -...+.++.+.+++
T Consensus 1 ldv~~llD~S~Sm~~~~~~~~-~~~~~~~~~l~~~--------~~~~~v~lv~f~~~~---~~~~~~~l~~~~~~~~l~~ 68 (163)
T cd01476 1 LDLLFVLDSSGSVRGKFEKYK-KYIERIVEGLEIG--------PTATRVALITYSGRG---RQRVRFNLPKHNDGEELLE 68 (163)
T ss_pred CCEEEEEeCCcchhhhHHHHH-HHHHHHHHhcCCC--------CCCcEEEEEEEcCCC---ceEEEecCCCCCCHHHHHH
Confidence 599999999999999887665 5678888887432 136899999998753 2222 23445678889999
Q ss_pred HhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhhh
Q 007491 83 WLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQA 162 (601)
Q Consensus 83 wLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~~ 162 (601)
.++.++..||+ .++.+||..|...+... .+ ......|.+||++|=-+..
T Consensus 69 ~i~~l~~~gg~---T~l~~aL~~a~~~l~~~-~~----~r~~~~~~villTDG~~~~----------------------- 117 (163)
T cd01476 69 KVDNLRFIGGT---TATGAAIEVALQQLDPS-EG----RREGIPKVVVVLTDGRSHD----------------------- 117 (163)
T ss_pred HHHhCccCCCC---ccHHHHHHHHHHHhccc-cC----CCCCCCeEEEEECCCCCCC-----------------------
Confidence 99999876654 47889999999877421 11 1123457889998743221
Q ss_pred cccCCCHHHHHHHhhh-cCcEEEEeCCC
Q 007491 163 ESRLSDAETVAKSFVQ-CSVSLSVICPK 189 (601)
Q Consensus 163 ~s~laDaEqVAk~faE-k~IsLSIIsPR 189 (601)
+....++.+.+ .+|.++.|.-.
T Consensus 118 -----~~~~~~~~l~~~~~v~v~~vg~g 140 (163)
T cd01476 118 -----DPEKQARILRAVPNIETFAVGTG 140 (163)
T ss_pred -----chHHHHHHHhhcCCCEEEEEECC
Confidence 23455677777 89999999543
No 10
>cd01474 vWA_ATR ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.
Probab=98.83 E-value=8.8e-08 Score=89.73 Aligned_cols=136 Identities=18% Similarity=0.223 Sum_probs=93.0
Q ss_pred CcceEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHH
Q 007491 2 SEKQLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFL 81 (601)
Q Consensus 2 ~~~DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Fl 81 (601)
..+|||||||+++||+..|...+ +++..+++.|.. ...|+|||.|.+.. -..++|+++.+...
T Consensus 3 ~~~Dvv~llD~SgSm~~~~~~~~-~~~~~l~~~~~~----------~~~rvglv~Fs~~~------~~~~~l~~~~~~~~ 65 (185)
T cd01474 3 GHFDLYFVLDKSGSVAANWIEIY-DFVEQLVDRFNS----------PGLRFSFITFSTRA------TKILPLTDDSSAII 65 (185)
T ss_pred CceeEEEEEeCcCchhhhHHHHH-HHHHHHHHHcCC----------CCcEEEEEEecCCc------eEEEeccccHHHHH
Confidence 35899999999999999887765 667888888843 24799999998764 12457777766666
Q ss_pred HHh---cCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchh
Q 007491 82 HWL---STIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENN 158 (601)
Q Consensus 82 qwL---dsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~ 158 (601)
+.| +.+.. || .-++.+||..|++.+.....+ .....|++|||+|-.++.-..
T Consensus 66 ~~l~~l~~~~~-~g---~T~~~~aL~~a~~~l~~~~~~-----~r~~~~~villTDG~~~~~~~---------------- 120 (185)
T cd01474 66 KGLEVLKKVTP-SG---QTYIHEGLENANEQIFNRNGG-----GRETVSVIIALTDGQLLLNGH---------------- 120 (185)
T ss_pred HHHHHHhccCC-CC---CCcHHHHHHHHHHHHHhhccC-----CCCCCeEEEEEcCCCcCCCCC----------------
Confidence 665 54433 33 357789999998654211111 112338899999887653110
Q ss_pred hhhhcccCCCHHHHHHHhhhcCcEEEEeC
Q 007491 159 EAQAESRLSDAETVAKSFVQCSVSLSVIC 187 (601)
Q Consensus 159 ~~~~~s~laDaEqVAk~faEk~IsLSIIs 187 (601)
.++...|+.+.+.||.++.|.
T Consensus 121 --------~~~~~~a~~l~~~gv~i~~vg 141 (185)
T cd01474 121 --------KYPEHEAKLSRKLGAIVYCVG 141 (185)
T ss_pred --------cchHHHHHHHHHcCCEEEEEe
Confidence 134566788889999999985
No 11
>cd01473 vWA_CTRP CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60 amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.
Probab=98.80 E-value=1.2e-07 Score=90.96 Aligned_cols=152 Identities=17% Similarity=0.278 Sum_probs=103.3
Q ss_pred ceEEEEEecccccchh-HHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCC----cCHH
Q 007491 4 KQLIVAVEGTAAMGPY-WQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWT----KDVD 78 (601)
Q Consensus 4 ~DIVFVIDgTAsMGpY-i~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfT----sD~~ 78 (601)
.||+|+||++.+|+.. |+.-..+++..+++.|.-+. ..+++|||.|-+.. . ..++|+ .|-.
T Consensus 1 ~Di~fllD~S~Si~~~~f~~~~~~f~~~lv~~l~i~~--------~~~rvgvv~fs~~~---~---~~~~~~~~~~~~~~ 66 (192)
T cd01473 1 YDLTLILDESASIGYSNWRKDVIPFTEKIINNLNISK--------DKVHVGILLFAEKN---R---DVVPFSDEERYDKN 66 (192)
T ss_pred CcEEEEEeCCCcccHHHHHHHHHHHHHHHHHhCccCC--------CccEEEEEEecCCc---e---eEEecCcccccCHH
Confidence 4999999999999976 87544567888899885422 46899999999877 1 122333 2456
Q ss_pred HHHHHhcCcc---cCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCcccccccccccc
Q 007491 79 IFLHWLSTIP---FAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQN 155 (601)
Q Consensus 79 ~FlqwLdsI~---FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~g 155 (601)
++++.+++|. +.||| -.+.+||..|++.|... .++ .....|..||++|...+.--.
T Consensus 67 ~l~~~i~~l~~~~~~~g~---T~~~~AL~~a~~~~~~~-~~~----r~~~~kv~IllTDG~s~~~~~------------- 125 (192)
T cd01473 67 ELLKKINDLKNSYRSGGE---TYIVEALKYGLKNYTKH-GNR----RKDAPKVTMLFTDGNDTSASK------------- 125 (192)
T ss_pred HHHHHHHHHHhccCCCCc---CcHHHHHHHHHHHhccC-CCC----cccCCeEEEEEecCCCCCcch-------------
Confidence 7778787774 33433 35788888888876421 111 123588999999987765211
Q ss_pred chhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcchHHHHHHHHhc
Q 007491 156 ENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQLPKLTAIYNAAK 202 (601)
Q Consensus 156 e~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRklP~Lk~Lfeka~ 202 (601)
.+....|+.+.++||.+++|-=.. .....|..-++
T Consensus 126 -----------~~~~~~a~~lk~~gV~i~~vGiG~-~~~~el~~ia~ 160 (192)
T cd01473 126 -----------KELQDISLLYKEENVKLLVVGVGA-ASENKLKLLAG 160 (192)
T ss_pred -----------hhHHHHHHHHHHCCCEEEEEEecc-ccHHHHHHhcC
Confidence 145788999999999999994443 23445555443
No 12
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=98.79 E-value=2.1e-07 Score=90.21 Aligned_cols=138 Identities=17% Similarity=0.248 Sum_probs=97.4
Q ss_pred cceEEEEEecccccch-hHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccC--CcCHHH
Q 007491 3 EKQLIVAVEGTAAMGP-YWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGW--TKDVDI 79 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGp-Yi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~Gf--TsD~~~ 79 (601)
.+|||||||++++|++ .|+.+| +++..+++.|.-+ ....++|||.|.+-. . ..+++ ..|.++
T Consensus 2 ~~DlvfllD~S~Sm~~~~~~~~k-~f~~~l~~~l~~~--------~~~~rvglv~fs~~~---~---~~~~l~~~~~~~~ 66 (224)
T cd01475 2 PTDLVFLIDSSRSVRPENFELVK-QFLNQIIDSLDVG--------PDATRVGLVQYSSTV---K---QEFPLGRFKSKAD 66 (224)
T ss_pred CccEEEEEeCCCCCCHHHHHHHH-HHHHHHHHhcccC--------CCccEEEEEEecCce---e---EEecccccCCHHH
Confidence 4799999999999996 577776 6678888888431 136899999998865 1 12333 467788
Q ss_pred HHHHhcCcccCCCCCchhhHHHHHHHHHh-ccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchh
Q 007491 80 FLHWLSTIPFAGGGFNDAAIAEGLSEALM-MFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENN 158 (601)
Q Consensus 80 FlqwLdsI~FsGGGfspaAVAEGLAeAL~-~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~ 158 (601)
+++.|++|++.|||- .+.+||..|+. .|... .+. ++...+..|.+||++|-.+..
T Consensus 67 l~~~i~~i~~~~~~t---~tg~AL~~a~~~~~~~~-~g~-r~~~~~~~kvvillTDG~s~~------------------- 122 (224)
T cd01475 67 LKRAVRRMEYLETGT---MTGLAIQYAMNNAFSEA-EGA-RPGSERVPRVGIVVTDGRPQD------------------- 122 (224)
T ss_pred HHHHHHhCcCCCCCC---hHHHHHHHHHHHhCChh-cCC-CCCCCCCCeEEEEEcCCCCcc-------------------
Confidence 999999999988763 45677777775 34321 111 122223578889998775421
Q ss_pred hhhhcccCCCHHHHHHHhhhcCcEEEEeCC
Q 007491 159 EAQAESRLSDAETVAKSFVQCSVSLSVICP 188 (601)
Q Consensus 159 ~~~~~s~laDaEqVAk~faEk~IsLSIIsP 188 (601)
+.+..++.+.+.||.++.|.=
T Consensus 123 ---------~~~~~a~~lk~~gv~i~~Vgv 143 (224)
T cd01475 123 ---------DVSEVAAKARALGIEMFAVGV 143 (224)
T ss_pred ---------cHHHHHHHHHHCCcEEEEEeC
Confidence 456788899999999999953
No 13
>PF00092 VWA: von Willebrand factor type A domain; InterPro: IPR002035 The von Willebrand factor is a large multimeric glycoprotein found in blood plasma. Mutant forms are involved in the aetiology of bleeding disorders []. In von Willebrand factor, the type A domain (vWF) is the prototype for a protein superfamily. The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. Fold recognition algorithms were used to score sequence compatibility with a library of known structures: the vWF domain fold was predicted to be a doubly-wound, open, twisted beta-sheet flanked by alpha-helices []. 3D structures have been determined for the I-domains of integrins CD11b (with bound magnesium) [] and CD11a (with bound manganese) []. The domain adopts a classic alpha/beta Rossmann fold and contains an unusual metal ion coordination site at its surface. It has been suggested that this site represents a general metal ion-dependent adhesion site (MIDAS) for binding protein ligands []. The residues constituting the MIDAS motif in the CD11b and CD11a I-domains are completely conserved, but the manner in which the metal ion is coordinated differs slightly [].; GO: 0005515 protein binding; PDB: 2XGG_B 3ZQK_B 3GXB_A 3PPV_A 3PPX_A 3PPW_A 3PPY_A 1CQP_B 3TCX_B 2ICA_A ....
Probab=98.72 E-value=3.9e-07 Score=81.71 Aligned_cols=148 Identities=16% Similarity=0.245 Sum_probs=103.6
Q ss_pred eEEEEEecccccch-hHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccC--CcCHHHHH
Q 007491 5 QLIVAVEGTAAMGP-YWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGW--TKDVDIFL 81 (601)
Q Consensus 5 DIVFVIDgTAsMGp-Yi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~Gf--TsD~~~Fl 81 (601)
||||+||.+++|+. -|+.+| .+|..+++.|.. .....+++||.|.+.. . ..+.+ ..+..+++
T Consensus 1 DivflvD~S~sm~~~~~~~~~-~~v~~~i~~~~~--------~~~~~rv~iv~f~~~~---~---~~~~~~~~~~~~~~~ 65 (178)
T PF00092_consen 1 DIVFLVDTSGSMSGDNFEKAK-QFVKSIISRLSI--------SNNGTRVGIVTFSDSA---R---VLFSLTDYQSKNDLL 65 (178)
T ss_dssp EEEEEEE-STTSCHHHHHHHH-HHHHHHHHHSTB--------STTSEEEEEEEESSSE---E---EEEETTSHSSHHHHH
T ss_pred CEEEEEeCCCCCchHHHHHHH-HHHHHHHHhhhc--------cccccccceeeeeccc---c---ccccccccccccccc
Confidence 89999999999997 778866 557999998852 1257899999999887 2 23334 35678999
Q ss_pred HHh-cCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhh
Q 007491 82 HWL-STIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEA 160 (601)
Q Consensus 82 qwL-dsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~ 160 (601)
..+ +.+.+.+|+- .+++||..|++.+.... ........|..|||++..++.-+.
T Consensus 66 ~~i~~~~~~~~g~t---~~~~aL~~a~~~l~~~~----~~~r~~~~~~iiliTDG~~~~~~~------------------ 120 (178)
T PF00092_consen 66 NAINDSIPSSGGGT---NLGAALKFAREQLFSSN----NGGRPNSPKVIILITDGNSNDSDS------------------ 120 (178)
T ss_dssp HHHHTTGGCCBSSB----HHHHHHHHHHHTTSGG----GTTGTTSEEEEEEEESSSSSSHSG------------------
T ss_pred ccccccccccchhh---hHHHHHhhhhhcccccc----cccccccccceEEEEeecccCCcc------------------
Confidence 988 8888777553 38899999998875320 011235788899999998877321
Q ss_pred hhcccCCCHHHHHHHhhhcCcEEEEeCC--CcchHHHHHHH
Q 007491 161 QAESRLSDAETVAKSFVQCSVSLSVICP--KQLPKLTAIYN 199 (601)
Q Consensus 161 ~~~s~laDaEqVAk~faEk~IsLSIIsP--RklP~Lk~Lfe 199 (601)
..+..++...+.+|.+..|.. .....|+.|..
T Consensus 121 -------~~~~~~~~~~~~~i~~~~ig~~~~~~~~l~~la~ 154 (178)
T PF00092_consen 121 -------PSEEAANLKKSNGIKVIAIGIDNADNEELRELAS 154 (178)
T ss_dssp -------HHHHHHHHHHHCTEEEEEEEESCCHHHHHHHHSH
T ss_pred -------hHHHHHHHHHhcCcEEEEEecCcCCHHHHHHHhC
Confidence 123333333336999999977 34667777764
No 14
>cd01453 vWA_transcription_factor_IIH_type Transcription factors IIH type: TFIIH is a multiprotein complex that is one of the five general transcription factors that binds RNA polymerase II holoenzyme. Orthologues of these genes are found in all completed eukaryotic genomes and all these proteins contain a VWA domain. The p44 subunit of TFIIH functions as a DNA helicase in RNA polymerase II transcription initiation and DNA repair, and its transcriptional activity is dependent on its C-terminal Zn-binding domains. The function of the vWA domain is unclear, but may be involved in complex assembly. The MIDAS motif is not conserved in this sub-group.
Probab=98.71 E-value=6.7e-07 Score=85.21 Aligned_cols=150 Identities=17% Similarity=0.241 Sum_probs=99.3
Q ss_pred ceEEEEEecccccc------hhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCH
Q 007491 4 KQLIVAVEGTAAMG------PYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDV 77 (601)
Q Consensus 4 ~DIVFVIDgTAsMG------pYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~ 77 (601)
+|||||||.+.+|. .=++..|.. +..+++.+-. .. ..-++|||+|++..+| ...++|.|.
T Consensus 4 r~ivi~lD~S~SM~a~D~~ptRl~~ak~~-~~~fi~~~~~----~~----~~~~vglv~f~~~~a~-----~~~PlT~D~ 69 (183)
T cd01453 4 RHLIIVIDCSRSMEEQDLKPSRLAVVLKL-LELFIEEFFD----QN----PISQLGIISIKNGRAE-----KLTDLTGNP 69 (183)
T ss_pred eEEEEEEECcHHHhcCCCCchHHHHHHHH-HHHHHHHHhh----cC----ccccEEEEEEcCCccE-----EEECCCCCH
Confidence 69999999999998 455555533 4445544411 10 2358999999765521 355889999
Q ss_pred HHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCcc-EEEEeecCCCCCCCCCccccccccccccc
Q 007491 78 DIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQR-HCILVAASNPHPLPTPVYRPQMQNLDQNE 156 (601)
Q Consensus 78 ~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qK-hCILI~nSPPhplP~pVy~P~~~~~~~ge 156 (601)
+.|+..|+.+...+| ..++.+||..|+..|.... ....| +.||+++-. .
T Consensus 70 ~~~~~~L~~~~~~~G---~t~l~~aL~~A~~~l~~~~--------~~~~~~iiil~sd~~-----~-------------- 119 (183)
T cd01453 70 RKHIQALKTARECSG---EPSLQNGLEMALESLKHMP--------SHGSREVLIIFSSLS-----T-------------- 119 (183)
T ss_pred HHHHHHhhcccCCCC---chhHHHHHHHHHHHHhcCC--------ccCceEEEEEEcCCC-----c--------------
Confidence 999999998822222 3689999999999885311 11234 445554310 0
Q ss_pred hhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCC-cchHHHHHHHHhcCC
Q 007491 157 NNEAQAESRLSDAETVAKSFVQCSVSLSVICPK-QLPKLTAIYNAAKRN 204 (601)
Q Consensus 157 ~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPR-klP~Lk~Lfeka~~n 204 (601)
....+.+.+++.+.+.+|.+++|.=- ..+.|++|-++.++.
T Consensus 120 -------~~~~~~~~~~~~l~~~~I~v~~IgiG~~~~~L~~ia~~tgG~ 161 (183)
T cd01453 120 -------CDPGNIYETIDKLKKENIRVSVIGLSAEMHICKEICKATNGT 161 (183)
T ss_pred -------CChhhHHHHHHHHHHcCcEEEEEEechHHHHHHHHHHHhCCe
Confidence 00013467899999999999999322 256899999988765
No 15
>cd01477 vWA_F09G8-8_type VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of mo
Probab=98.71 E-value=4.1e-07 Score=87.94 Aligned_cols=145 Identities=10% Similarity=0.166 Sum_probs=91.4
Q ss_pred cceEEEEEecccccchh-HHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHH
Q 007491 3 EKQLIVAVEGTAAMGPY-WQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFL 81 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGpY-i~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Fl 81 (601)
.+|||||||++.+|++. |+.+|.. |..++..|..+.. .......+|+|||+|.+.. +-++.++ -..+...++
T Consensus 19 ~~DivfvlD~S~Sm~~~~f~~~k~f-i~~~~~~~~~~~~--~~~~~~~~rVGlV~fs~~a-~~~~~L~---d~~~~~~~~ 91 (193)
T cd01477 19 WLDIVFVVDNSKGMTQGGLWQVRAT-ISSLFGSSSQIGT--DYDDPRSTRVGLVTYNSNA-TVVADLN---DLQSFDDLY 91 (193)
T ss_pred eeeEEEEEeCCCCcchhhHHHHHHH-HHHHHhhcccccc--ccCCCCCcEEEEEEccCce-EEEEecc---cccCHHHHH
Confidence 48999999999999854 6666643 4556666632100 0000145899999998865 2222222 123667888
Q ss_pred HHhcC-c-cc-CCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchh
Q 007491 82 HWLST-I-PF-AGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENN 158 (601)
Q Consensus 82 qwLds-I-~F-sGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~ 158 (601)
+.|+. + .. .||| -.+..||..|+++|.... +..+....|+.|||++-.-..
T Consensus 92 ~ai~~~~~~~~~~gg---T~ig~aL~~A~~~l~~~~----~~~R~~v~kvvIllTDg~~~~------------------- 145 (193)
T cd01477 92 SQIQGSLTDVSSTNA---SYLDTGLQAAEQMLAAGK----RTSRENYKKVVIVFASDYNDE------------------- 145 (193)
T ss_pred HHHHHHhhccccCCc---chHHHHHHHHHHHHHhhh----ccccCCCCeEEEEEecCccCC-------------------
Confidence 87875 1 22 2332 357888999998885311 111235789999997631100
Q ss_pred hhhhcccCCCHHHHHHHhhhcCcEEEEe
Q 007491 159 EAQAESRLSDAETVAKSFVQCSVSLSVI 186 (601)
Q Consensus 159 ~~~~~s~laDaEqVAk~faEk~IsLSII 186 (601)
.. .++..+|+.+.+.||.++.|
T Consensus 146 -----~~-~~~~~~a~~l~~~GI~i~tV 167 (193)
T cd01477 146 -----GS-NDPRPIAARLKSTGIAIITV 167 (193)
T ss_pred -----CC-CCHHHHHHHHHHCCCEEEEE
Confidence 00 25678899999999999999
No 16
>cd00198 vWFA Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.
Probab=98.70 E-value=1.3e-06 Score=74.73 Aligned_cols=146 Identities=17% Similarity=0.229 Sum_probs=98.2
Q ss_pred ceEEEEEeccccc-chhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCc--CHHHH
Q 007491 4 KQLIVAVEGTAAM-GPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTK--DVDIF 80 (601)
Q Consensus 4 ~DIVFVIDgTAsM-GpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTs--D~~~F 80 (601)
+||+||||.+++| ..+|+.+| .++.-+++.+... ....+++|+.|.+.. .....|+. +.+.+
T Consensus 1 ~~v~~viD~S~Sm~~~~~~~~~-~~~~~~~~~~~~~--------~~~~~i~v~~f~~~~------~~~~~~~~~~~~~~~ 65 (161)
T cd00198 1 ADIVFLLDVSGSMGGEKLDKAK-EALKALVSSLSAS--------PPGDRVGLVTFGSNA------RVVLPLTTDTDKADL 65 (161)
T ss_pred CcEEEEEeCCCCcCcchHHHHH-HHHHHHHHhcccC--------CCCcEEEEEEecCcc------ceeecccccCCHHHH
Confidence 4899999999999 67777776 4467888887442 135799999999843 12234443 67888
Q ss_pred HHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhh
Q 007491 81 LHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEA 160 (601)
Q Consensus 81 lqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~ 160 (601)
.+.++.+.. +......+.++|..++..+.+.. .....|..|+|+|..++.-..
T Consensus 66 ~~~~~~~~~--~~~~~t~~~~al~~~~~~~~~~~-------~~~~~~~lvvitDg~~~~~~~------------------ 118 (161)
T cd00198 66 LEAIDALKK--GLGGGTNIGAALRLALELLKSAK-------RPNARRVIILLTDGEPNDGPE------------------ 118 (161)
T ss_pred HHHHHhccc--CCCCCccHHHHHHHHHHHhcccC-------CCCCceEEEEEeCCCCCCCcc------------------
Confidence 888888876 11223455678888888775421 124578889999987655321
Q ss_pred hhcccCCCHHHHHHHhhhcCcEEEEe--CC-CcchHHHHHH
Q 007491 161 QAESRLSDAETVAKSFVQCSVSLSVI--CP-KQLPKLTAIY 198 (601)
Q Consensus 161 ~~~s~laDaEqVAk~faEk~IsLSII--sP-RklP~Lk~Lf 198 (601)
+.+.+.+.+.+++|.+.+| .+ ..-+.|++|-
T Consensus 119 -------~~~~~~~~~~~~~v~v~~v~~g~~~~~~~l~~l~ 152 (161)
T cd00198 119 -------LLAEAARELRKLGITVYTIGIGDDANEDELKEIA 152 (161)
T ss_pred -------hhHHHHHHHHHcCCEEEEEEcCCCCCHHHHHHHh
Confidence 3466788888899999999 33 2334455443
No 17
>cd01471 vWA_micronemal_protein Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.
Probab=98.68 E-value=7.7e-07 Score=82.80 Aligned_cols=139 Identities=17% Similarity=0.261 Sum_probs=90.2
Q ss_pred ceEEEEEecccccchh--HHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceee-ccCCcCHHHH
Q 007491 4 KQLIVAVEGTAAMGPY--WQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQR-SGWTKDVDIF 80 (601)
Q Consensus 4 ~DIVFVIDgTAsMGpY--i~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr-~GfTsD~~~F 80 (601)
+|||||||.+++|+.. |+.+| ..+..+++.|.-. ....++|||.|.+.. ...+.- ..++.+...+
T Consensus 1 ~Dv~~vlD~SgSm~~~~~~~~~k-~~~~~~~~~~~~~--------~~~~~vglv~Fs~~~---~~~~~l~~~~~~~~~~~ 68 (186)
T cd01471 1 LDLYLLVDGSGSIGYSNWVTHVV-PFLHTFVQNLNIS--------PDEINLYLVTFSTNA---KELIRLSSPNSTNKDLA 68 (186)
T ss_pred CcEEEEEeCCCCccchhhHHHHH-HHHHHHHHhcccC--------CCceEEEEEEecCCc---eEEEECCCccccchHHH
Confidence 5999999999999955 55555 4467777777421 135799999998765 322221 1234456664
Q ss_pred HHHhcC---cccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccch
Q 007491 81 LHWLST---IPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNEN 157 (601)
Q Consensus 81 lqwLds---I~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~ 157 (601)
+.++++ +...||| -.+++||..|...+... . .......|++||++|-.+..-+
T Consensus 69 ~~~i~~l~~~~~~~G~---T~l~~aL~~a~~~l~~~-~----~~r~~~~~~villTDG~~~~~~---------------- 124 (186)
T cd01471 69 LNAIRALLSLYYPNGS---TNTTSALLVVEKHLFDT-R----GNRENAPQLVIIMTDGIPDSKF---------------- 124 (186)
T ss_pred HHHHHHHHhCcCCCCC---ccHHHHHHHHHHHhhcc-C----CCcccCceEEEEEccCCCCCCc----------------
Confidence 444444 4444544 46888999998766421 1 1123467899999998764311
Q ss_pred hhhhhcccCCCHHHHHHHhhhcCcEEEEeCC
Q 007491 158 NEAQAESRLSDAETVAKSFVQCSVSLSVICP 188 (601)
Q Consensus 158 ~~~~~~s~laDaEqVAk~faEk~IsLSIIsP 188 (601)
.+...++.+.+++|.+++|.=
T Consensus 125 ----------~~~~~a~~l~~~gv~v~~igi 145 (186)
T cd01471 125 ----------RTLKEARKLRERGVIIAVLGV 145 (186)
T ss_pred ----------chhHHHHHHHHCCCEEEEEEe
Confidence 223567788899999999943
No 18
>smart00187 INB Integrin beta subunits (N-terminal portion of extracellular region). Portion of beta integrins that lies N-terminal to their EGF-like repeats. Integrins are cell adhesion molecules that mediate cell-extracellular matrix and cell-cell interactions. They contain both alpha and beta subunits. Beta integrins are proposed to have a von Willebrand factor type-A "insert" or "I" -like domain (although this remains to be confirmed).
Probab=98.62 E-value=1e-06 Score=95.07 Aligned_cols=172 Identities=12% Similarity=0.076 Sum_probs=124.9
Q ss_pred ceEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCC--Ccccc---------------
Q 007491 4 KQLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHG--SYCAC--------------- 66 (601)
Q Consensus 4 ~DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~--P~~e~--------------- 66 (601)
.||.|++|-|.+|+++++.||+. ...+.+.+.. ...++|+|+=.|-|.+ ||-.+
T Consensus 100 vDLYyLMDlS~SM~ddl~~lk~l-g~~L~~~m~~--------it~n~rlGfGsFVDK~v~P~~~t~p~~l~~PC~~~~~~ 170 (423)
T smart00187 100 VDLYYLMDLSYSMKDDLDNLKSL-GDDLAREMKG--------LTSNFRLGFGSFVDKTVSPFVSTRPEKLENPCPNYNLT 170 (423)
T ss_pred cceEEEEeCCccHHHHHHHHHHH-HHHHHHHHHh--------cccCceeeEEEeecCccCCcccCCHHHhcCCCcCCCCC
Confidence 79999999999999999999977 5788777754 2357899998888863 33221
Q ss_pred ------ceeeccCCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCC
Q 007491 67 ------LVQRSGWTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPL 140 (601)
Q Consensus 67 ------~Vqr~GfTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhpl 140 (601)
|-.....|+|.++|.+.+++...+|+.|.||.=.+||--|..|-+.+. || .+..|..|+++|+++|-.
T Consensus 171 c~p~f~f~~~L~LT~~~~~F~~~V~~~~iSgN~D~PEgG~DAimQaaVC~~~IG-WR-----~~a~rllv~~TDa~fH~A 244 (423)
T smart00187 171 CEPPYGFKHVLSLTDDTDEFNEEVKKQRISGNLDAPEGGFDAIMQAAVCTEQIG-WR-----EDARRLLVFSTDAGFHFA 244 (423)
T ss_pred cCCCcceeeeccCCCCHHHHHHHHhhceeecCCcCCcccHHHHHHHHhhccccc-cC-----CCceEEEEEEcCCCcccc
Confidence 112235599999999999999999999999988888877775553321 22 346788899999999965
Q ss_pred CC------------Cccccccccccccchhhhhh--cccCCCHHHHHHHhhhcCcEEEE-eCCCcchHHHHHH
Q 007491 141 PT------------PVYRPQMQNLDQNENNEAQA--ESRLSDAETVAKSFVQCSVSLSV-ICPKQLPKLTAIY 198 (601)
Q Consensus 141 P~------------pVy~P~~~~~~~ge~~~~~~--~s~laDaEqVAk~faEk~IsLSI-IsPRklP~Lk~Lf 198 (601)
.. .|++- ..++|.. +.++-...||++.+.|++|++=+ |.-+..+.+++|-
T Consensus 245 GDGkLaGIv~PNDg~CHL~--------~~g~Yt~s~~~DYPSi~ql~~kL~e~nI~~IFAVT~~~~~~Y~~Ls 309 (423)
T smart00187 245 GDGKLAGIVQPNDGQCHLD--------NNGEYTMSTTQDYPSIGQLNQKLAENNINPIFAVTKKQVSLYKELS 309 (423)
T ss_pred CCcceeeEecCCCCcceeC--------CCCCcCccCcCCCCCHHHHHHHHHhcCceEEEEEcccchhHHHHHH
Confidence 53 34321 1223332 23444789999999999998644 6777776665553
No 19
>cd01467 vWA_BatA_type VWA BatA type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=98.46 E-value=5e-06 Score=76.28 Aligned_cols=133 Identities=18% Similarity=0.246 Sum_probs=86.9
Q ss_pred cceEEEEEecccccchh-H-HHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHH
Q 007491 3 EKQLIVAVEGTAAMGPY-W-QSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIF 80 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGpY-i-~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~F 80 (601)
.+|||||||.+++|... | ..-|-++...++..|... . ...+++||.|.+.. .....|+.|...+
T Consensus 2 ~~~vv~vlD~S~SM~~~~~~~~~r~~~a~~~~~~~~~~-~-------~~~~v~lv~f~~~~------~~~~~~~~~~~~~ 67 (180)
T cd01467 2 GRDIMIALDVSGSMLAQDFVKPSRLEAAKEVLSDFIDR-R-------ENDRIGLVVFAGAA------FTQAPLTLDRESL 67 (180)
T ss_pred CceEEEEEECCcccccccCCCCCHHHHHHHHHHHHHHh-C-------CCCeEEEEEEcCCe------eeccCCCccHHHH
Confidence 47999999999999753 2 011222333334344221 1 24599999997654 1235788888888
Q ss_pred HHHhcCccc--CCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchh
Q 007491 81 LHWLSTIPF--AGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENN 158 (601)
Q Consensus 81 lqwLdsI~F--sGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~ 158 (601)
.+.|+.|.. .+|+ ..+.+||..|+.++.... ...+++|||+|-.+..-+
T Consensus 68 ~~~l~~l~~~~~~g~---T~l~~al~~a~~~l~~~~---------~~~~~iiliTDG~~~~g~----------------- 118 (180)
T cd01467 68 KELLEDIKIGLAGQG---TAIGDAIGLAIKRLKNSE---------AKERVIVLLTDGENNAGE----------------- 118 (180)
T ss_pred HHHHHHhhhcccCCC---CcHHHHHHHHHHHHHhcC---------CCCCEEEEEeCCCCCCCC-----------------
Confidence 888888873 2222 467889999998875321 234788999887654311
Q ss_pred hhhhcccCCCHHHHHHHhhhcCcEEEEe
Q 007491 159 EAQAESRLSDAETVAKSFVQCSVSLSVI 186 (601)
Q Consensus 159 ~~~~~s~laDaEqVAk~faEk~IsLSII 186 (601)
.+.+++++.+.+++|.+++|
T Consensus 119 --------~~~~~~~~~~~~~gi~i~~i 138 (180)
T cd01467 119 --------IDPATAAELAKNKGVRIYTI 138 (180)
T ss_pred --------CCHHHHHHHHHHCCCEEEEE
Confidence 13456677788899999988
No 20
>smart00327 VWA von Willebrand factor (vWF) type A domain. VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.
Probab=98.46 E-value=1.1e-05 Score=71.62 Aligned_cols=149 Identities=16% Similarity=0.241 Sum_probs=99.8
Q ss_pred cceEEEEEeccccc-chhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccC--CcCHHH
Q 007491 3 EKQLIVAVEGTAAM-GPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGW--TKDVDI 79 (601)
Q Consensus 3 ~~DIVFVIDgTAsM-GpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~Gf--TsD~~~ 79 (601)
++||+||||.|.+| +.+|+.+|.- +..+++.+... ....+++++.|.+.. ...+.+ +++.+.
T Consensus 1 ~~~v~l~vD~S~SM~~~~~~~~~~~-~~~~~~~~~~~--------~~~~~i~ii~f~~~~------~~~~~~~~~~~~~~ 65 (177)
T smart00327 1 PLDVVFLLDGSGSMGPNRFEKAKEF-VLKLVEQLDIG--------PDGDRVGLVTFSDDA------TVLFPLNDSRSKDA 65 (177)
T ss_pred CccEEEEEeCCCccchHHHHHHHHH-HHHHHHhcCCC--------CCCcEEEEEEeCCCc------eEEEcccccCCHHH
Confidence 47999999999999 7777777744 56667666331 135799999999844 122344 899999
Q ss_pred HHHHhcCccc-CCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchh
Q 007491 80 FLHWLSTIPF-AGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENN 158 (601)
Q Consensus 80 FlqwLdsI~F-sGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~ 158 (601)
+...++.+.. .+|+ ..+.++|..|+..+..... .......+..|+|+|..++.-
T Consensus 66 ~~~~i~~~~~~~~~~---~~~~~al~~~~~~~~~~~~----~~~~~~~~~iviitDg~~~~~------------------ 120 (177)
T smart00327 66 LLEALASLSYKLGGG---TNLGAALQYALENLFSKSA----GSRRGAPKVLILITDGESNDG------------------ 120 (177)
T ss_pred HHHHHHhcCCCCCCC---chHHHHHHHHHHHhcCcCC----CCCCCCCeEEEEEcCCCCCCC------------------
Confidence 9999999987 3332 4455666666665531100 011123677888888777652
Q ss_pred hhhhcccCCCHHHHHHHhhhcCcEEEEeCCC---cchHHHHHHH
Q 007491 159 EAQAESRLSDAETVAKSFVQCSVSLSVICPK---QLPKLTAIYN 199 (601)
Q Consensus 159 ~~~~~s~laDaEqVAk~faEk~IsLSIIsPR---klP~Lk~Lfe 199 (601)
.+.+.+++.+.+++|.+.+|.-- ....|+.|..
T Consensus 121 --------~~~~~~~~~~~~~~i~i~~i~~~~~~~~~~l~~~~~ 156 (177)
T smart00327 121 --------GDLLKAAKELKRSGVKVFVVGVGNDVDEEELKKLAS 156 (177)
T ss_pred --------ccHHHHHHHHHHCCCEEEEEEccCccCHHHHHHHhC
Confidence 14578888889999999999443 3445555544
No 21
>cd01454 vWA_norD_type norD type: Denitrifying bacteria contain both membrane bound and periplasmic nitrate reductases. Denitrification plays a major role in completing the nitrogen cycle by converting nitrate or nitrite to nitrogen gas. The pathway for microbial denitrification has been established as NO3- ------ NO2- ------ NO ------- N2O --------- N2. This reaction generally occurs under oxygen limiting conditions. Genetic and biochemical studies have shown that the first srep of the biochemical pathway is catalyzed by periplasmic nitrate reductases. This family is widely present in proteobacteria and firmicutes. This version of the domain is also present in some archaeal members. The function of the vWA domain in this sub-group is not known. Members of this subgroup have a conserved MIDAS motif.
Probab=98.39 E-value=1.2e-05 Score=74.40 Aligned_cols=157 Identities=10% Similarity=0.114 Sum_probs=99.7
Q ss_pred eEEEEEecccccch--hHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCC--ccccceeeccCCcCH-HH
Q 007491 5 QLIVAVEGTAAMGP--YWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGS--YCACLVQRSGWTKDV-DI 79 (601)
Q Consensus 5 DIVFVIDgTAsMGp--Yi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P--~~e~~Vqr~GfTsD~-~~ 79 (601)
-|+|+||.++||.. .|+.+|.- +.-+++.+.. ...+++|+.|.+..+ +...++....|..+. ..
T Consensus 2 ~v~~llD~SgSM~~~~kl~~ak~a-~~~l~~~l~~----------~~d~~~l~~F~~~~~~~~~~~~~~~~~~~~~~~~~ 70 (174)
T cd01454 2 AVTLLLDLSGSMRSDRRIDVAKKA-AVLLAEALEA----------CGVPHAILGFTTDAGGRERVRWIKIKDFDESLHER 70 (174)
T ss_pred EEEEEEECCCCCCCCcHHHHHHHH-HHHHHHHHHH----------cCCcEEEEEecCCCCCccceEEEEecCcccccchh
Confidence 48999999999998 67777754 3444444432 246899999998731 111223333555543 35
Q ss_pred HHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhh
Q 007491 80 FLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNE 159 (601)
Q Consensus 80 FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~ 159 (601)
....|+.+...| ...+.+||..|++.+... ....|++|||+|--|..-.. |. ++..
T Consensus 71 ~~~~l~~~~~~g----~T~~~~al~~a~~~l~~~---------~~~~~~iiliTDG~~~~~~~--~~--------~~~~- 126 (174)
T cd01454 71 ARKRLAALSPGG----NTRDGAAIRHAAERLLAR---------PEKRKILLVISDGEPNDLDY--YE--------GNVF- 126 (174)
T ss_pred HHHHHHccCCCC----CCcHHHHHHHHHHHHhcC---------CCcCcEEEEEeCCCcCcccc--cC--------cchh-
Confidence 667788776422 346788888898877531 13468999999998765332 11 1100
Q ss_pred hhhcccCCCHHHHHHHhhhcCcEEEEe--CCCc----chHHHHHHHHh
Q 007491 160 AQAESRLSDAETVAKSFVQCSVSLSVI--CPKQ----LPKLTAIYNAA 201 (601)
Q Consensus 160 ~~~~s~laDaEqVAk~faEk~IsLSII--sPRk----lP~Lk~Lfeka 201 (601)
...|+...++.+.+.||.++.| -... .+.++++|++.
T Consensus 127 -----~~~~~~~~~~~~~~~gi~v~~igig~~~~~~~~~~~~~~~~~~ 169 (174)
T cd01454 127 -----ATEDALRAVIEARKLGIEVFGITIDRDATTVDKEYLKNIFGEE 169 (174)
T ss_pred -----HHHHHHHHHHHHHhCCcEEEEEEecCccccchHHHHHHhhCcc
Confidence 1112223378888999999988 3333 78999999854
No 22
>cd01456 vWA_ywmD_type VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=98.38 E-value=9.6e-06 Score=77.32 Aligned_cols=150 Identities=19% Similarity=0.282 Sum_probs=94.7
Q ss_pred cceEEEEEecccccc-------hhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCC----Cccc-----c
Q 007491 3 EKQLIVAVEGTAAMG-------PYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHG----SYCA-----C 66 (601)
Q Consensus 3 ~~DIVFVIDgTAsMG-------pYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~----P~~e-----~ 66 (601)
..++|||||.++||. ..|+.+|.- +..+++.+. ...+++||+|.+.. .+.. +
T Consensus 20 ~~~vv~vlD~SgSM~~~~~~~~~rl~~ak~a-~~~~l~~l~-----------~~~~v~lv~F~~~~~~~~~~~~~~p~~~ 87 (206)
T cd01456 20 PPNVAIVLDNSGSMREVDGGGETRLDNAKAA-LDETANALP-----------DGTRLGLWTFSGDGDNPLDVRVLVPKGC 87 (206)
T ss_pred CCcEEEEEeCCCCCcCCCCCcchHHHHHHHH-HHHHHHhCC-----------CCceEEEEEecCCCCCCccccccccccc
Confidence 378999999999997 466666644 344454442 24699999999832 1111 1
Q ss_pred cee-eccCC-cCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCc
Q 007491 67 LVQ-RSGWT-KDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPV 144 (601)
Q Consensus 67 ~Vq-r~GfT-sD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pV 144 (601)
... ..++. .+...+.+.|++|...+|| -.+.+||..|+.++. + ...+++|||+|--+..-.
T Consensus 88 ~~~~~~~~~~~~~~~l~~~i~~i~~~~G~---T~l~~aL~~a~~~l~--~---------~~~~~iillTDG~~~~~~--- 150 (206)
T cd01456 88 LTAPVNGFPSAQRSALDAALNSLQTPTGW---TPLAAALAEAAAYVD--P---------GRVNVVVLITDGEDTCGP--- 150 (206)
T ss_pred cccccCCCCcccHHHHHHHHHhhcCCCCc---ChHHHHHHHHHHHhC--C---------CCcceEEEEcCCCccCCC---
Confidence 111 22433 5789999999999844443 578899999998874 1 123889999886443210
Q ss_pred cccccccccccchhhhhhcccCCCHHHHHHHhhh-----cCcEEEEeC--CC-cchHHHHHHHHhcCC
Q 007491 145 YRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQ-----CSVSLSVIC--PK-QLPKLTAIYNAAKRN 204 (601)
Q Consensus 145 y~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faE-----k~IsLSIIs--PR-klP~Lk~Lfeka~~n 204 (601)
+..+.++.+.+ ++|.+++|. .. ....|+.|=+..++.
T Consensus 151 -----------------------~~~~~~~~~~~~~~~~~~i~i~~igiG~~~~~~~l~~iA~~tgG~ 195 (206)
T cd01456 151 -----------------------DPCEVARELAKRRTPAPPIKVNVIDFGGDADRAELEAIAEATGGT 195 (206)
T ss_pred -----------------------CHHHHHHHHHHhcCCCCCceEEEEEecCcccHHHHHHHHHhcCCe
Confidence 22334444443 599999983 22 245677777666554
No 23
>cd01470 vWA_complement_factors Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.
Probab=98.35 E-value=1e-05 Score=76.47 Aligned_cols=123 Identities=11% Similarity=0.212 Sum_probs=79.8
Q ss_pred ceEEEEEecccccc-hhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCcccccee-eccCCcCHHHHH
Q 007491 4 KQLIVAVEGTAAMG-PYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQ-RSGWTKDVDIFL 81 (601)
Q Consensus 4 ~DIVFVIDgTAsMG-pYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vq-r~GfTsD~~~Fl 81 (601)
+|||||||+++||+ .-|+.+| .-|.-+++.+.... ...+++||.|.+.. ...+. +..++.+...++
T Consensus 1 ~di~~vlD~SgSM~~~~~~~~k-~~~~~l~~~l~~~~--------~~~~v~li~Fs~~~---~~~~~~~~~~~~~~~~~~ 68 (198)
T cd01470 1 LNIYIALDASDSIGEEDFDEAK-NAIKTLIEKISSYE--------VSPRYEIISYASDP---KEIVSIRDFNSNDADDVI 68 (198)
T ss_pred CcEEEEEECCCCccHHHHHHHH-HHHHHHHHHccccC--------CCceEEEEEecCCc---eEEEecccCCCCCHHHHH
Confidence 59999999999997 5666666 33566666663211 24689999998875 22222 124566889999
Q ss_pred HHhcCcccCCCC-CchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCC
Q 007491 82 HWLSTIPFAGGG-FNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHP 139 (601)
Q Consensus 82 qwLdsI~FsGGG-fspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhp 139 (601)
.+|+++++.+++ .+.-.+++||..+++.+..... +........++.+|||+|=-|..
T Consensus 69 ~~l~~~~~~~~~~~ggT~~~~Al~~~~~~l~~~~~-~~~~~~~~~~~~iillTDG~~~~ 126 (198)
T cd01470 69 KRLEDFNYDDHGDKTGTNTAAALKKVYERMALEKV-RNKEAFNETRHVIILFTDGKSNM 126 (198)
T ss_pred HHHHhCCcccccCccchhHHHHHHHHHHHHHHHHh-cCccchhhcceEEEEEcCCCcCC
Confidence 999999876543 2346788999888876532111 10011123467789998876553
No 24
>PF13768 VWA_3: von Willebrand factor type A domain
Probab=98.33 E-value=9.9e-06 Score=73.17 Aligned_cols=107 Identities=20% Similarity=0.375 Sum_probs=74.2
Q ss_pred ceEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCC-ccccceeeccCCcCHHHHHH
Q 007491 4 KQLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGS-YCACLVQRSGWTKDVDIFLH 82 (601)
Q Consensus 4 ~DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P-~~e~~Vqr~GfTsD~~~Flq 82 (601)
.|||||||.+++|...+ .+.++-|.-+|+.+. ...+|.||.|.+..- +...++... .......++
T Consensus 1 ~~vvilvD~S~Sm~g~~-~~~k~al~~~l~~L~-----------~~d~fnii~f~~~~~~~~~~~~~~~--~~~~~~a~~ 66 (155)
T PF13768_consen 1 ADVVILVDTSGSMSGEK-ELVKDALRAILRSLP-----------PGDRFNIIAFGSSVRPLFPGLVPAT--EENRQEALQ 66 (155)
T ss_pred CeEEEEEeCCCCCCCcH-HHHHHHHHHHHHhCC-----------CCCEEEEEEeCCEeeEcchhHHHHh--HHHHHHHHH
Confidence 58999999999999998 555555666666653 245999999999751 111111111 255688899
Q ss_pred HhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCC
Q 007491 83 WLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNP 137 (601)
Q Consensus 83 wLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPP 137 (601)
|++.++..+|| ..+.+||.+|+..+.. ....+++||++|-.|
T Consensus 67 ~I~~~~~~~G~---t~l~~aL~~a~~~~~~----------~~~~~~IilltDG~~ 108 (155)
T PF13768_consen 67 WIKSLEANSGG---TDLLAALRAALALLQR----------PGCVRAIILLTDGQP 108 (155)
T ss_pred HHHHhcccCCC---ccHHHHHHHHHHhccc----------CCCccEEEEEEeccC
Confidence 99999984443 4466777777766521 246799999998877
No 25
>cd01463 vWA_VGCC_like VWA Voltage gated Calcium channel like: Voltage-gated calcium channels are a complex of five proteins: alpha 1, beta 1, gamma, alpha 2 and delta. The alpha 2 and delta subunits result from proteolytic processing of a single gene product and carries at its N-terminus the VWA and cache domains, The alpha 2 delta gene family has orthologues in D. melanogaster and C. elegans but none have been detected in aither A. thaliana or yeast. The exact biochemical function of the VWA domain is not known but the alpha 2 delta complex has been shown to regulate various functional properties of the channel complex.
Probab=98.32 E-value=1.8e-05 Score=74.55 Aligned_cols=116 Identities=13% Similarity=0.176 Sum_probs=71.4
Q ss_pred cceEEEEEecccccchh-HHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeec------cCCc
Q 007491 3 EKQLIVAVEGTAAMGPY-WQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRS------GWTK 75 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGpY-i~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~------GfTs 75 (601)
.+|||||||+++||... |+.+| +-+.-+++.+. ..-+++||.|.+.. ...+-++ .+..
T Consensus 13 p~~vv~llD~SgSM~~~~l~~ak-~~~~~ll~~l~-----------~~d~v~lv~F~~~~---~~~~~~~~~~~~~~~~~ 77 (190)
T cd01463 13 PKDIVILLDVSGSMTGQRLHLAK-QTVSSILDTLS-----------DNDFFNIITFSNEV---NPVVPCFNDTLVQATTS 77 (190)
T ss_pred CceEEEEEECCCCCCcHHHHHHH-HHHHHHHHhCC-----------CCCEEEEEEeCCCe---eEEeeecccceEecCHH
Confidence 37999999999999743 33333 22344444442 23589999999986 3332221 1345
Q ss_pred CHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCC
Q 007491 76 DVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNP 137 (601)
Q Consensus 76 D~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPP 137 (601)
+.+.++.+|+.|+..|| ..+.+||.+|+..+................+.+|||+|=-+
T Consensus 78 ~~~~~~~~l~~l~~~G~----T~~~~al~~a~~~l~~~~~~~~~~~~~~~~~~iillTDG~~ 135 (190)
T cd01463 78 NKKVLKEALDMLEAKGI----ANYTKALEFAFSLLLKNLQSNHSGSRSQCNQAIMLITDGVP 135 (190)
T ss_pred HHHHHHHHHhhCCCCCc----chHHHHHHHHHHHHHHhhhcccccccCCceeEEEEEeCCCC
Confidence 78899999999998553 34667777777655321001111112235678999987644
No 26
>cd01452 VWA_26S_proteasome_subunit 26S proteasome plays a major role in eukaryotic protein breakdown, especially for ubiquitin-tagged proteins. It is an ATP-dependent protease responsible for the bulk of non-lysosomal proteolysis in eukaryotes, often using covalent modification of proteins by ubiquitylation. It consists of a 20S proteolytic core particle (CP) and a 19S regulatory particle (RP). The CP is an ATP independent peptidase consisting of hydrolyzing activities. One or both ends of CP carry the RP that confers both ubiquitin and ATP dependence to the 26S proteosome. The RP's proposed functions include recognition of substrates and translocation of these to CP for proteolysis. The RP can dissociate into a stable lid and base subcomplexes. The base is composed of three non-ATPase subunits (Rpn 1, 2 and 10). A single residue in the vWA domain of Rpn10 has been implicated to be responsible for stabilizing the lid-base association.
Probab=98.26 E-value=2.7e-05 Score=76.01 Aligned_cols=145 Identities=20% Similarity=0.228 Sum_probs=96.4
Q ss_pred EEEEEecccccc--hh----HHHHHHHHHHHHH-HHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHH
Q 007491 6 LIVAVEGTAAMG--PY----WQSIVSDYLEKII-RCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVD 78 (601)
Q Consensus 6 IVFVIDgTAsMG--pY----i~~LktnYI~PiL-EyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~ 78 (601)
+||+||.+.+|- +| ++..| +++.-++ +||.+. ...++|||+|++..+ -...++|.|..
T Consensus 6 ~vi~lD~S~sM~a~D~~PnRL~aak-~~i~~~~~~f~~~n---------p~~~vGlv~fag~~a-----~v~~plT~D~~ 70 (187)
T cd01452 6 TMICIDNSEYMRNGDYPPTRFQAQA-DAVNLICQAKTRSN---------PENNVGLMTMAGNSP-----EVLVTLTNDQG 70 (187)
T ss_pred EEEEEECCHHHHcCCCCCCHHHHHH-HHHHHHHHHHHhcC---------CCccEEEEEecCCce-----EEEECCCCCHH
Confidence 789999999974 23 33333 4456666 444331 235999999998542 13457899999
Q ss_pred HHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCcc-EEEEeecCCCCCCCCCccccccccccccch
Q 007491 79 IFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQR-HCILVAASNPHPLPTPVYRPQMQNLDQNEN 157 (601)
Q Consensus 79 ~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qK-hCILI~nSPPhplP~pVy~P~~~~~~~ge~ 157 (601)
.++..|++++..| ..++.+||..|+..|...+ .+.+| .+|++..|+=-
T Consensus 71 ~~~~~L~~i~~~g----~~~l~~AL~~A~~~L~~~~--------~~~~~~rivi~v~S~~~------------------- 119 (187)
T cd01452 71 KILSKLHDVQPKG----KANFITGIQIAQLALKHRQ--------NKNQKQRIVAFVGSPIE------------------- 119 (187)
T ss_pred HHHHHHHhCCCCC----cchHHHHHHHHHHHHhcCC--------CcCCcceEEEEEecCCc-------------------
Confidence 9999999998643 3578899999998885322 11232 43333322211
Q ss_pred hhhhhcccCCCHHHHHHHhhhcCcEEEEeC---CCc-chHHHHHHHHhc
Q 007491 158 NEAQAESRLSDAETVAKSFVQCSVSLSVIC---PKQ-LPKLTAIYNAAK 202 (601)
Q Consensus 158 ~~~~~~s~laDaEqVAk~faEk~IsLSIIs---PRk-lP~Lk~Lfeka~ 202 (601)
+.-.+...+++.++++||.++||+ +.. .+.|+++++...
T Consensus 120 ------~d~~~i~~~~~~lkk~~I~v~vI~~G~~~~~~~~l~~~~~~~~ 162 (187)
T cd01452 120 ------EDEKDLVKLAKRLKKNNVSVDIINFGEIDDNTEKLTAFIDAVN 162 (187)
T ss_pred ------CCHHHHHHHHHHHHHcCCeEEEEEeCCCCCCHHHHHHHHHHhc
Confidence 111256789999999999999994 333 678999998663
No 27
>cd01465 vWA_subgroup VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if n
Probab=98.25 E-value=8.4e-05 Score=67.15 Aligned_cols=150 Identities=15% Similarity=0.158 Sum_probs=89.8
Q ss_pred ceEEEEEecccccch-hHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCC--cCHHHH
Q 007491 4 KQLIVAVEGTAAMGP-YWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWT--KDVDIF 80 (601)
Q Consensus 4 ~DIVFVIDgTAsMGp-Yi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfT--sD~~~F 80 (601)
.+||||||++++|+. .|+.+|.- +.-+++.+. ...+++||.|.+.. ..+ +.|+ .|...+
T Consensus 1 ~~~~~vlD~S~SM~~~~~~~~k~a-~~~~~~~l~-----------~~~~v~li~f~~~~---~~~---~~~~~~~~~~~l 62 (170)
T cd01465 1 LNLVFVIDRSGSMDGPKLPLVKSA-LKLLVDQLR-----------PDDRLAIVTYDGAA---ETV---LPATPVRDKAAI 62 (170)
T ss_pred CcEEEEEECCCCCCChhHHHHHHH-HHHHHHhCC-----------CCCEEEEEEecCCc---cEE---ecCcccchHHHH
Confidence 479999999999964 36666643 344454442 23589999999864 222 2333 467888
Q ss_pred HHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhh
Q 007491 81 LHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEA 160 (601)
Q Consensus 81 lqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~ 160 (601)
+..|+++...||- .+.+||..|+..+.... + ....+++||++|-.+..-+.. +
T Consensus 63 ~~~l~~~~~~g~T----~~~~al~~a~~~~~~~~--~-----~~~~~~ivl~TDG~~~~~~~~--------------~-- 115 (170)
T cd01465 63 LAAIDRLTAGGST----AGGAGIQLGYQEAQKHF--V-----PGGVNRILLATDGDFNVGETD--------------P-- 115 (170)
T ss_pred HHHHHcCCCCCCC----CHHHHHHHHHHHHHhhc--C-----CCCeeEEEEEeCCCCCCCCCC--------------H--
Confidence 8889998864332 34455555555443210 0 112378899999876531110 0
Q ss_pred hhcccCCCHHHHHHHhhhcCcEEEEeC---CCcchHHHHHHHHhcCC
Q 007491 161 QAESRLSDAETVAKSFVQCSVSLSVIC---PKQLPKLTAIYNAAKRN 204 (601)
Q Consensus 161 ~~~s~laDaEqVAk~faEk~IsLSIIs---PRklP~Lk~Lfeka~~n 204 (601)
.+..++++.+.+++|.+++|. ..-...|++|-...++.
T Consensus 116 ------~~~~~~~~~~~~~~v~i~~i~~g~~~~~~~l~~ia~~~~g~ 156 (170)
T cd01465 116 ------DELARLVAQKRESGITLSTLGFGDNYNEDLMEAIADAGNGN 156 (170)
T ss_pred ------HHHHHHHHHhhcCCeEEEEEEeCCCcCHHHHHHHHhcCCce
Confidence 023445555667899999982 22345677776544443
No 28
>COG1240 ChlD Mg-chelatase subunit ChlD [Coenzyme metabolism]
Probab=98.22 E-value=2.7e-05 Score=79.82 Aligned_cols=157 Identities=14% Similarity=0.159 Sum_probs=108.3
Q ss_pred ceEEEEEecccccchh--HHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHH
Q 007491 4 KQLIVAVEGTAAMGPY--WQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFL 81 (601)
Q Consensus 4 ~DIVFVIDgTAsMGpY--i~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Fl 81 (601)
..||||||+++||..+ +...|--- .-+|+.-.+ .+.++++|.|+.++ .. -...+|+|++.+.
T Consensus 79 ~lvvfvVDASgSM~~~~Rm~aaKG~~-~~lL~dAYq----------~RdkvavI~F~G~~---A~--lll~pT~sv~~~~ 142 (261)
T COG1240 79 NLIVFVVDASGSMAARRRMAAAKGAA-LSLLRDAYQ----------RRDKVAVIAFRGEK---AE--LLLPPTSSVELAE 142 (261)
T ss_pred CcEEEEEeCcccchhHHHHHHHHHHH-HHHHHHHHH----------ccceEEEEEecCCc---ce--EEeCCcccHHHHH
Confidence 6799999999999999 88888664 444443322 47899999999887 33 2346799999999
Q ss_pred HHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhh
Q 007491 82 HWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQ 161 (601)
Q Consensus 82 qwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~ 161 (601)
+||..+..-|+ . =+++||..|...+.+.. + ...+.+.+.|+|+|=-|-- + .+..
T Consensus 143 ~~L~~l~~GG~--T--PL~~aL~~a~ev~~r~~--r---~~p~~~~~~vviTDGr~n~---~----------~~~~---- 196 (261)
T COG1240 143 RALERLPTGGK--T--PLADALRQAYEVLAREK--R---RGPDRRPVMVVITDGRANV---P----------IPLG---- 196 (261)
T ss_pred HHHHhCCCCCC--C--chHHHHHHHHHHHHHhh--c---cCCCcceEEEEEeCCccCC---C----------CCCc----
Confidence 99999985442 2 26678888888775422 1 1234677888888864211 0 0110
Q ss_pred hcccCCCHHHHHHHhhhcCcEEEEe-CC---CcchHHHHHHHHhcCCC
Q 007491 162 AESRLSDAETVAKSFVQCSVSLSVI-CP---KQLPKLTAIYNAAKRNP 205 (601)
Q Consensus 162 ~~s~laDaEqVAk~faEk~IsLSII-sP---RklP~Lk~Lfeka~~n~ 205 (601)
-.++....+..+.++++++.|| |- -++-..+.|-...+++-
T Consensus 197 ---~~~e~~~~a~~~~~~g~~~lvid~e~~~~~~g~~~~iA~~~Gg~~ 241 (261)
T COG1240 197 ---PKAETLEAASKLRLRGIQLLVIDTEGSEVRLGLAEEIARASGGEY 241 (261)
T ss_pred ---hHHHHHHHHHHHhhcCCcEEEEecCCccccccHHHHHHHHhCCeE
Confidence 0125567788899999999999 32 23556777777776664
No 29
>TIGR03436 acidobact_VWFA VWFA-related Acidobacterial domain. Members of this family are bacterial domains that include a region related to the von Willebrand factor type A (VWFA) domain (pfam00092). These domains are restricted to, and have undergone a large paralogous family expansion in, the Acidobacteria, including Solibacter usitatus and Acidobacterium capsulatum ATCC 51196.
Probab=98.21 E-value=5.7e-05 Score=75.85 Aligned_cols=154 Identities=14% Similarity=0.147 Sum_probs=100.4
Q ss_pred cceEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHHH
Q 007491 3 EKQLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFLH 82 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Flq 82 (601)
.++||||||++++|...+..+|.- +..+++.+.. ...+++||.|.+.. .. ...||.|.+.+.+
T Consensus 53 p~~vvlvlD~SgSM~~~~~~a~~a-~~~~l~~~l~----------~~d~v~lv~f~~~~---~~---~~~~t~~~~~l~~ 115 (296)
T TIGR03436 53 PLTVGLVIDTSGSMRNDLDRARAA-AIRFLKTVLR----------PNDRVFVVTFNTRL---RL---LQDFTSDPRLLEA 115 (296)
T ss_pred CceEEEEEECCCCchHHHHHHHHH-HHHHHHhhCC----------CCCEEEEEEeCCce---eE---eecCCCCHHHHHH
Confidence 478999999999999988777744 4555555311 35699999999754 11 2589999999999
Q ss_pred HhcCcccCC-------CC----CchhhHHHHHHHH-HhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccc
Q 007491 83 WLSTIPFAG-------GG----FNDAAIAEGLSEA-LMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQ 150 (601)
Q Consensus 83 wLdsI~FsG-------GG----fspaAVAEGLAeA-L~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~ 150 (601)
.|+++.... |. .+..++.+||..| ++++.... . .....|.+|||+|--...
T Consensus 116 ~l~~l~~~~~~~~~~~~~~~~~~g~T~l~~al~~aa~~~~~~~~-~-----~~p~rk~iIllTDG~~~~----------- 178 (296)
T TIGR03436 116 ALNRLKPPLRTDYNSSGAFVRDGGGTALYDAITLAALEQLANAL-A-----GIPGRKALIVISDGGDNR----------- 178 (296)
T ss_pred HHHhccCCCccccccccccccCCCcchhHHHHHHHHHHHHHHhh-c-----CCCCCeEEEEEecCCCcc-----------
Confidence 999998621 00 1234567776444 44443210 0 012457889998752211
Q ss_pred cccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCc----------------chHHHHHHHHhcCC
Q 007491 151 NLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQ----------------LPKLTAIYNAAKRN 204 (601)
Q Consensus 151 ~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRk----------------lP~Lk~Lfeka~~n 204 (601)
...+.+++++.+.+.+|.++.|.-.. -..|+.|=+..++.
T Consensus 179 --------------~~~~~~~~~~~~~~~~v~vy~I~~~~~~~~~~~~~~~~~~~~~~~L~~iA~~TGG~ 234 (296)
T TIGR03436 179 --------------SRDTLERAIDAAQRADVAIYSIDARGLRAPDLGAGAKAGLGGPEALERLAEETGGR 234 (296)
T ss_pred --------------hHHHHHHHHHHHHHcCCEEEEeccCccccCCcccccccCCCcHHHHHHHHHHhCCe
Confidence 01145788888899999999996542 23577776666654
No 30
>PTZ00441 sporozoite surface protein 2 (SSP2); Provisional
Probab=98.17 E-value=3.2e-05 Score=86.51 Aligned_cols=139 Identities=20% Similarity=0.277 Sum_probs=81.9
Q ss_pred CcceEEEEEecccccc-hhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeec-cCCcCHHH
Q 007491 2 SEKQLIVAVEGTAAMG-PYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRS-GWTKDVDI 79 (601)
Q Consensus 2 ~~~DIVFVIDgTAsMG-pYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~-GfTsD~~~ 79 (601)
.++||+||||+++||+ +.|...-+..+..++..|.. +...+++++|+|-+.. ..++... +...|...
T Consensus 41 ~~lDIvFLLD~SgSMg~~Nfle~AK~Fa~~LV~~l~I--------s~D~V~VgiV~FSd~~---r~vfpL~s~~s~Dk~~ 109 (576)
T PTZ00441 41 EEVDLYLLVDGSGSIGYHNWITHVIPMLMGLIQQLNL--------SDDAINLYMSLFSNNT---TELIRLGSGASKDKEQ 109 (576)
T ss_pred CCceEEEEEeCCCccCCccHHHHHHHHHHHHHHHhcc--------CCCceEEEEEEeCCCc---eEEEecCCCccccHHH
Confidence 3689999999999998 44522222345566666632 1246899999998775 3222211 12233334
Q ss_pred HHHHhc---CcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccc
Q 007491 80 FLHWLS---TIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNE 156 (601)
Q Consensus 80 FlqwLd---sI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge 156 (601)
.+..|. .....||| -.+.+||..|...+... + ......|++|||+|--+....
T Consensus 110 aL~~I~sL~~~~~pgGg---Tnig~AL~~Aae~L~sr--~----~R~nvpKVVILLTDG~sns~~--------------- 165 (576)
T PTZ00441 110 ALIIVKSLRKTYLPYGK---TNMTDALLEVRKHLNDR--V----NRENAIQLVILMTDGIPNSKY--------------- 165 (576)
T ss_pred HHHHHHHHHhhccCCCC---ccHHHHHHHHHHHHhhc--c----cccCCceEEEEEecCCCCCcc---------------
Confidence 444443 33344554 44667777777665321 1 123567999999887764311
Q ss_pred hhhhhhcccCCCHHHHHHHhhhcCcEEEEe
Q 007491 157 NNEAQAESRLSDAETVAKSFVQCSVSLSVI 186 (601)
Q Consensus 157 ~~~~~~~s~laDaEqVAk~faEk~IsLSII 186 (601)
+....|+.+.++||.|++|
T Consensus 166 -----------dvleaAq~LR~~GVeI~vI 184 (576)
T PTZ00441 166 -----------RALEESRKLKDRNVKLAVI 184 (576)
T ss_pred -----------cHHHHHHHHHHCCCEEEEE
Confidence 3345566677777777776
No 31
>PRK13685 hypothetical protein; Provisional
Probab=98.17 E-value=4.2e-05 Score=78.92 Aligned_cols=141 Identities=13% Similarity=0.144 Sum_probs=89.0
Q ss_pred ceEEEEEecccccch------hHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCH
Q 007491 4 KQLIVAVEGTAAMGP------YWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDV 77 (601)
Q Consensus 4 ~DIVFVIDgTAsMGp------Yi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~ 77 (601)
.|||||||.++||.. -++..|.. +..+++.+. ...++|||+|.+.. . ...++|.|.
T Consensus 89 ~~vvlvlD~S~SM~~~D~~p~RL~~ak~~-~~~~l~~l~-----------~~d~vglv~Fa~~a---~---~~~p~t~d~ 150 (326)
T PRK13685 89 AVVMLVIDVSQSMRATDVEPNRLAAAQEA-AKQFADELT-----------PGINLGLIAFAGTA---T---VLVSPTTNR 150 (326)
T ss_pred ceEEEEEECCccccCCCCCCCHHHHHHHH-HHHHHHhCC-----------CCCeEEEEEEcCce---e---ecCCCCCCH
Confidence 589999999999984 46666644 344444442 23589999999864 1 224789999
Q ss_pred HHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccch
Q 007491 78 DIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNEN 157 (601)
Q Consensus 78 ~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~ 157 (601)
+.+...|+++...||- ++.+||..|++.+..... ..........+.+||++|-....-..++
T Consensus 151 ~~l~~~l~~l~~~~~T----~~g~al~~A~~~l~~~~~-~~~~~~~~~~~~IILlTDG~~~~~~~~~------------- 212 (326)
T PRK13685 151 EATKNAIDKLQLADRT----ATGEAIFTALQAIATVGA-VIGGGDTPPPARIVLMSDGKETVPTNPD------------- 212 (326)
T ss_pred HHHHHHHHhCCCCCCc----chHHHHHHHHHHHHhhhc-ccccccCCCCCEEEEEcCCCCCCCCCCC-------------
Confidence 9999999999975432 345666666665532100 0000011235789999886433211100
Q ss_pred hhhhhcccCCCHHHHHHHhhhcCcEEEEeC
Q 007491 158 NEAQAESRLSDAETVAKSFVQCSVSLSVIC 187 (601)
Q Consensus 158 ~~~~~~s~laDaEqVAk~faEk~IsLSIIs 187 (601)
...+..+.++.+.+++|.+++|.
T Consensus 213 -------~~~~~~~aa~~a~~~gi~i~~Ig 235 (326)
T PRK13685 213 -------NPRGAYTAARTAKDQGVPISTIS 235 (326)
T ss_pred -------CcccHHHHHHHHHHcCCeEEEEE
Confidence 01134567888899999999993
No 32
>cd01451 vWA_Magnesium_chelatase Magnesium chelatase: Mg-chelatase catalyses the insertion of Mg into protoporphyrin IX (Proto). In chlorophyll biosynthesis, insertion of Mg2+ into protoporphyrin IX is catalysed by magnesium chelatase in an ATP-dependent reaction. Magnesium chelatase is a three sub-unit (BchI, BchD and BchH) enzyme with a novel arrangement of domains: the C-terminal helical domain is located behind the nucleotide binding site. The BchD domain contains a AAA domain at its N-terminus and a VWA domain at its C-terminus. The VWA domain has been speculated to be involved in mediating protein-protein interactions.
Probab=98.12 E-value=0.00016 Score=67.64 Aligned_cols=154 Identities=15% Similarity=0.182 Sum_probs=96.2
Q ss_pred EEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHHHHhc
Q 007491 6 LIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFLHWLS 85 (601)
Q Consensus 6 IVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~FlqwLd 85 (601)
||||||+++||+.. -|-++...+++.|...-+ ....+++||.|.+... . ...++|.+...+..+|+
T Consensus 3 v~lvlD~SgSM~~~---~rl~~ak~a~~~~~~~~~------~~~d~v~lv~F~~~~~--~---~~~~~t~~~~~~~~~l~ 68 (178)
T cd01451 3 VIFVVDASGSMAAR---HRMAAAKGAVLSLLRDAY------QRRDKVALIAFRGTEA--E---VLLPPTRSVELAKRRLA 68 (178)
T ss_pred EEEEEECCccCCCc---cHHHHHHHHHHHHHHHhh------cCCCEEEEEEECCCCc--e---EEeCCCCCHHHHHHHHH
Confidence 79999999999853 122233344454432111 1346999999986531 1 23467889999999999
Q ss_pred CcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhhhccc
Q 007491 86 TIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQAESR 165 (601)
Q Consensus 86 sI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~~~s~ 165 (601)
.++..| + -.+.+||..|+..+.... + .....+.+|||+|--+..-..+ .
T Consensus 69 ~l~~~G-~---T~l~~aL~~a~~~l~~~~--~----~~~~~~~ivliTDG~~~~g~~~---------------------~ 117 (178)
T cd01451 69 RLPTGG-G---TPLAAGLLAAYELAAEQA--R----DPGQRPLIVVITDGRANVGPDP---------------------T 117 (178)
T ss_pred hCCCCC-C---CcHHHHHHHHHHHHHHHh--c----CCCCceEEEEECCCCCCCCCCc---------------------h
Confidence 998643 2 356677777776551100 0 0113578999998654421110 0
Q ss_pred CCCH-HHHHHHhhhcCcEEEEeCCCc----chHHHHHHHHhcCCC
Q 007491 166 LSDA-ETVAKSFVQCSVSLSVICPKQ----LPKLTAIYNAAKRNP 205 (601)
Q Consensus 166 laDa-EqVAk~faEk~IsLSIIsPRk----lP~Lk~Lfeka~~n~ 205 (601)
... ..+++.+.+.+|.+.+|.-.. -..|+.|=+++++.-
T Consensus 118 -~~~~~~~~~~l~~~gi~v~~I~~~~~~~~~~~l~~iA~~tgG~~ 161 (178)
T cd01451 118 -ADRALAAARKLRARGISALVIDTEGRPVRRGLAKDLARALGGQY 161 (178)
T ss_pred -hHHHHHHHHHHHhcCCcEEEEeCCCCccCccHHHHHHHHcCCeE
Confidence 012 677888999999999996543 346888888776653
No 33
>cd01466 vWA_C3HC4_type VWA C3HC4-type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most,
Probab=98.10 E-value=7.8e-05 Score=68.45 Aligned_cols=144 Identities=17% Similarity=0.179 Sum_probs=86.3
Q ss_pred ceEEEEEecccccchh-HHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCC-cCHHHHH
Q 007491 4 KQLIVAVEGTAAMGPY-WQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWT-KDVDIFL 81 (601)
Q Consensus 4 ~DIVFVIDgTAsMGpY-i~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfT-sD~~~Fl 81 (601)
.|||||||+++||... |+.+|.- |.-+++.+. ...+++||+|.+.. ...+.-..++ .+-+.+.
T Consensus 1 ~~v~~vlD~S~SM~~~rl~~ak~a-~~~l~~~l~-----------~~~~~~li~F~~~~---~~~~~~~~~~~~~~~~~~ 65 (155)
T cd01466 1 VDLVAVLDVSGSMAGDKLQLVKHA-LRFVISSLG-----------DADRLSIVTFSTSA---KRLSPLRRMTAKGKRSAK 65 (155)
T ss_pred CcEEEEEECCCCCCcHHHHHHHHH-HHHHHHhCC-----------CcceEEEEEecCCc---cccCCCcccCHHHHHHHH
Confidence 5899999999999864 4444322 333333332 23589999998864 2211111222 2457888
Q ss_pred HHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhh
Q 007491 82 HWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQ 161 (601)
Q Consensus 82 qwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~ 161 (601)
.+|++|...|| -.+.+||..|+..+.... .....+++|||+|--+-.-.
T Consensus 66 ~~i~~~~~~g~----T~~~~al~~a~~~~~~~~-------~~~~~~~iillTDG~~~~~~-------------------- 114 (155)
T cd01466 66 RVVDGLQAGGG----TNVVGGLKKALKVLGDRR-------QKNPVASIMLLSDGQDNHGA-------------------- 114 (155)
T ss_pred HHHHhccCCCC----ccHHHHHHHHHHHHhhcc-------cCCCceEEEEEcCCCCCcch--------------------
Confidence 89999876443 456677777777764210 11245789999886442210
Q ss_pred hcccCCCHHHHHHHhhhcCcEEEEe---CCCcchHHHHHHHHhcC
Q 007491 162 AESRLSDAETVAKSFVQCSVSLSVI---CPKQLPKLTAIYNAAKR 203 (601)
Q Consensus 162 ~~s~laDaEqVAk~faEk~IsLSII---sPRklP~Lk~Lfeka~~ 203 (601)
.+....+.+|.+++| .+.....|+.|=+..++
T Consensus 115 ----------~~~~~~~~~v~v~~igig~~~~~~~l~~iA~~t~G 149 (155)
T cd01466 115 ----------VVLRADNAPIPIHTFGLGASHDPALLAFIAEITGG 149 (155)
T ss_pred ----------hhhcccCCCceEEEEecCCCCCHHHHHHHHhccCc
Confidence 011234578888888 34556677777665544
No 34
>cd01461 vWA_interalpha_trypsin_inhibitor vWA_interalpha trypsin inhibitor (ITI): ITI is a glycoprotein composed of three polypeptides- two heavy chains and one light chain (bikunin). Bikunin confers the protease-inhibitor function while the heavy chains are involved in rendering stability to the extracellular matrix by binding to hyaluronic acid. The heavy chains carry the VWA domain with a conserved MIDAS motif. Although the exact role of the VWA domains remains unknown, it has been speculated to be involved in mediating protein-protein interactions with the components of the extracellular matrix.
Probab=98.08 E-value=0.0001 Score=66.45 Aligned_cols=148 Identities=11% Similarity=0.135 Sum_probs=93.7
Q ss_pred cceEEEEEecccccchh-HHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeec-cC-CcCHHH
Q 007491 3 EKQLIVAVEGTAAMGPY-WQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRS-GW-TKDVDI 79 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGpY-i~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~-Gf-TsD~~~ 79 (601)
.+||+||+|+++||..+ |+.+|.. |..+++.+.. ..+++|+.|.+.. ..+.... .. ..+...
T Consensus 2 ~~~v~~vlD~S~SM~~~~~~~~~~a-l~~~l~~l~~-----------~~~~~l~~Fs~~~---~~~~~~~~~~~~~~~~~ 66 (171)
T cd01461 2 PKEVVFVIDTSGSMSGTKIEQTKEA-LLTALKDLPP-----------GDYFNIIGFSDTV---EEFSPSSVSATAENVAA 66 (171)
T ss_pred CceEEEEEECCCCCCChhHHHHHHH-HHHHHHhCCC-----------CCEEEEEEeCCCc---eeecCcceeCCHHHHHH
Confidence 47999999999999765 6666643 5666665532 2479999997753 1111111 11 235677
Q ss_pred HHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhh
Q 007491 80 FLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNE 159 (601)
Q Consensus 80 FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~ 159 (601)
++.+|+++.. ||| ..+.+||..|+..+.+. ....+.+|||+|-.+..-
T Consensus 67 ~~~~l~~~~~-~g~---T~l~~al~~a~~~l~~~---------~~~~~~iillTDG~~~~~------------------- 114 (171)
T cd01461 67 AIEYVNRLQA-LGG---TNMNDALEAALELLNSS---------PGSVPQIILLTDGEVTNE------------------- 114 (171)
T ss_pred HHHHHHhcCC-CCC---cCHHHHHHHHHHhhccC---------CCCccEEEEEeCCCCCCH-------------------
Confidence 8888898887 333 45778888888776431 235689999988763210
Q ss_pred hhhcccCCCHHHHHHHhhhcCcEEEEeCC---CcchHHHHHHHHhcCC
Q 007491 160 AQAESRLSDAETVAKSFVQCSVSLSVICP---KQLPKLTAIYNAAKRN 204 (601)
Q Consensus 160 ~~~~s~laDaEqVAk~faEk~IsLSIIsP---RklP~Lk~Lfeka~~n 204 (601)
.+....++...+.+|.++.|.= -..+.|+.|-++.++.
T Consensus 115 -------~~~~~~~~~~~~~~i~i~~i~~g~~~~~~~l~~ia~~~gG~ 155 (171)
T cd01461 115 -------SQILKNVREALSGRIRLFTFGIGSDVNTYLLERLAREGRGI 155 (171)
T ss_pred -------HHHHHHHHHhcCCCceEEEEEeCCccCHHHHHHHHHcCCCe
Confidence 0223444444556899988832 2357788887776654
No 35
>PRK13406 bchD magnesium chelatase subunit D; Provisional
Probab=98.06 E-value=0.00012 Score=82.13 Aligned_cols=160 Identities=17% Similarity=0.182 Sum_probs=102.4
Q ss_pred cceEEEEEecccccc-hhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHH
Q 007491 3 EKQLIVAVEGTAAMG-PYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFL 81 (601)
Q Consensus 3 ~~DIVFVIDgTAsMG-pYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Fl 81 (601)
...||||||+++||. .=|...|. +++.|-..-+ ..+-+++||.|+... .. ...++|++.....
T Consensus 401 ~~~vvfvvD~SGSM~~~rl~~aK~-----a~~~ll~~ay------~~rD~v~lI~F~g~~---a~--~~lppT~~~~~~~ 464 (584)
T PRK13406 401 ETTTIFVVDASGSAALHRLAEAKG-----AVELLLAEAY------VRRDQVALVAFRGRG---AE--LLLPPTRSLVRAK 464 (584)
T ss_pred CccEEEEEECCCCCcHhHHHHHHH-----HHHHHHHhhc------CCCCEEEEEEECCCc---ee--EEcCCCcCHHHHH
Confidence 478999999999994 34455553 3344321111 135689999998764 11 2357899999999
Q ss_pred HHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhh
Q 007491 82 HWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQ 161 (601)
Q Consensus 82 qwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~ 161 (601)
.+|+.+...| .-.+++||..|++.+..... ....+++|||+|=-+.. +. +.. ..
T Consensus 465 ~~L~~l~~gG----gTpL~~gL~~A~~~l~~~~~-------~~~~~~iVLlTDG~~n~-~~------------~~~--~~ 518 (584)
T PRK13406 465 RSLAGLPGGG----GTPLAAGLDAAAALALQVRR-------KGMTPTVVLLTDGRANI-AR------------DGT--AG 518 (584)
T ss_pred HHHhcCCCCC----CChHHHHHHHHHHHHHHhcc-------CCCceEEEEEeCCCCCC-Cc------------ccc--cc
Confidence 9999987332 25678888888877643210 12458999999865432 11 000 00
Q ss_pred hcccCCCHHHHHHHhhhcCcEEEEeC--CCcchHHHHHHHHhcCC
Q 007491 162 AESRLSDAETVAKSFVQCSVSLSVIC--PKQLPKLTAIYNAAKRN 204 (601)
Q Consensus 162 ~~s~laDaEqVAk~faEk~IsLSIIs--PRklP~Lk~Lfeka~~n 204 (601)
......|+..+|+.+.+.+|.+++|. .+....+++|-++.++.
T Consensus 519 ~~~~~~~~~~~a~~~~~~gi~~~vId~g~~~~~~~~~LA~~~gg~ 563 (584)
T PRK13406 519 RAQAEEDALAAARALRAAGLPALVIDTSPRPQPQARALAEAMGAR 563 (584)
T ss_pred ccchhhHHHHHHHHHHhcCCeEEEEecCCCCcHHHHHHHHhcCCe
Confidence 01122367888999999999999993 33345677777665544
No 36
>cd01464 vWA_subfamily VWA subfamily: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=98.06 E-value=5e-05 Score=70.65 Aligned_cols=154 Identities=15% Similarity=0.138 Sum_probs=87.7
Q ss_pred CCcceEEEEEecccccch-hHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHH
Q 007491 1 MSEKQLIVAVEGTAAMGP-YWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDI 79 (601)
Q Consensus 1 ~~~~DIVFVIDgTAsMGp-Yi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~ 79 (601)
|-.++|+||||+++||+. .|+.+|.- +..+++.+...+. ...+.+++||.|.+.. .. .++++...+
T Consensus 1 ~~~~~v~~llD~SgSM~~~~~~~~k~a-~~~~~~~l~~~~~-----~~~~~~v~ii~F~~~a---~~---~~~l~~~~~- 67 (176)
T cd01464 1 MRRLPIYLLLDTSGSMAGEPIEALNQG-LQMLQSELRQDPY-----ALESVEISVITFDSAA---RV---IVPLTPLES- 67 (176)
T ss_pred CCCCCEEEEEECCCCCCChHHHHHHHH-HHHHHHHHhcChh-----hccccEEEEEEecCCc---eE---ecCCccHHh-
Confidence 678999999999999964 56666643 4555555543221 1246799999999854 21 234444221
Q ss_pred HHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhh
Q 007491 80 FLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNE 159 (601)
Q Consensus 80 FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~ 159 (601)
..++.++. ||| -++.+||..|++.+................+++|||+|--|..-+
T Consensus 68 --~~~~~l~~-~Gg---T~l~~aL~~a~~~l~~~~~~~~~~~~~~~~~~iillTDG~~~~~~------------------ 123 (176)
T cd01464 68 --FQPPRLTA-SGG---TSMGAALELALDCIDRRVQRYRADQKGDWRPWVFLLTDGEPTDDL------------------ 123 (176)
T ss_pred --cCCCcccC-CCC---CcHHHHHHHHHHHHHHHHHHhcccCcCCcCcEEEEEcCCCCCchH------------------
Confidence 12344444 333 567888888887764211000001112345688999887553210
Q ss_pred hhhcccCCCHHHHHHHhhhcCcEEEEe---CCCcchHHHHHH
Q 007491 160 AQAESRLSDAETVAKSFVQCSVSLSVI---CPKQLPKLTAIY 198 (601)
Q Consensus 160 ~~~~s~laDaEqVAk~faEk~IsLSII---sPRklP~Lk~Lf 198 (601)
.++.+..+.+.+.++.++.| ..-....|+.|=
T Consensus 124 -------~~~~~~~~~~~~~~~~i~~igiG~~~~~~~L~~ia 158 (176)
T cd01464 124 -------TAAIERIKEARDSKGRIVACAVGPKADLDTLKQIT 158 (176)
T ss_pred -------HHHHHHHHhhcccCCcEEEEEeccccCHHHHHHHH
Confidence 02235556666667888876 223355566654
No 37
>cd01458 vWA_ku Ku70/Ku80 N-terminal domain. The Ku78 heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks (DSB) in a preferred orientation. DSB's are repaired by either homologues recombination or non-homologues end joining and facilitate repair by the non-homologous end-joining pathway (NHEJ). The Ku heterodimer is required for accurate process that tends to preserve the sequence at the junction. Ku78 is found in all three kingdoms of life. However, only the eukaryotic proteins have a vWA domain fused to them at their N-termini. The vWA domain is not involved in DNA binding but may very likey mediate Ku78's interactions with other proteins. Members of this subgroup lack the conserved MIDAS motif.
Probab=98.02 E-value=4e-05 Score=74.18 Aligned_cols=152 Identities=15% Similarity=0.123 Sum_probs=93.4
Q ss_pred eEEEEEecccccchhHH---HHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCC-----CccccceeeccCCcC
Q 007491 5 QLIVAVEGTAAMGPYWQ---SIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHG-----SYCACLVQRSGWTKD 76 (601)
Q Consensus 5 DIVFVIDgTAsMGpYi~---~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~-----P~~e~~Vqr~GfTsD 76 (601)
-++|+||.+.+|..--+ .-+=+....++..|-...+.. ...-++|||+|++.. .|...++-+-.-+.|
T Consensus 3 ~ivf~iDvS~SM~~~~~~~~~s~l~~a~~~i~~~~~~ki~~----~~~D~vGlilf~t~~~~~~~~~~~i~v~~~l~~~~ 78 (218)
T cd01458 3 SVVFLVDVSPSMFESKDGEYESPFEEALKCIRQLMKSKIIS----SPKDLVGVVFYGTEESKNPVGYENIYVLLDLDTPG 78 (218)
T ss_pred EEEEEEeCCHHHcCCCCCCCCChHHHHHHHHHHHHHhceeC----CCCCeEEEEEEcccCCCCcCCCCceEEeecCCCCC
Confidence 48999999999962210 011111223344442221211 145799999999975 344554444333777
Q ss_pred HHHHHHHhcCcccC------CCC-CchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecC-CCCCCCCCccccc
Q 007491 77 VDIFLHWLSTIPFA------GGG-FNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAAS-NPHPLPTPVYRPQ 148 (601)
Q Consensus 77 ~~~FlqwLdsI~Fs------GGG-fspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nS-PPhplP~pVy~P~ 148 (601)
.+.+..+++-+.-. +.| ....++.+||..|+.+|.... .....|++|||+|- .|+. ..
T Consensus 79 ~~~l~~l~~~~~~~~~~~~~~~~~~~~~~l~~aL~~a~~~~~~~~-------~~~~~k~IvL~TDg~~p~~-~~------ 144 (218)
T cd01458 79 AERVEDLKELIEPGGLSFAGQVGDSGQVSLSDALWVCLDLFSKGK-------KKKSHKRIFLFTNNDDPHG-GD------ 144 (218)
T ss_pred HHHHHHHHHHhhcchhhhcccCCCCCCccHHHHHHHHHHHHHhcc-------ccccccEEEEECCCCCCCC-CC------
Confidence 77777776655321 112 235789999999999996421 12467999999986 4544 10
Q ss_pred cccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCC
Q 007491 149 MQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICP 188 (601)
Q Consensus 149 ~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsP 188 (601)
+. + ..+++.+|+.+.+++|.|.+|.-
T Consensus 145 ------~~--~------~~~~~~~a~~l~~~gI~i~~i~i 170 (218)
T cd01458 145 ------SI--K------DSQAAVKAEDLKDKGIELELFPL 170 (218)
T ss_pred ------HH--H------HHHHHHHHHHHHhCCcEEEEEec
Confidence 00 0 01467899999999999999943
No 38
>TIGR02031 BchD-ChlD magnesium chelatase ATPase subunit D. This model represents one of two ATPase subunits of the trimeric magnesium chelatase responsible for insertion of magnesium ion into protoporphyrin IX. This is an essential step in the biosynthesis of both chlorophyll and bacteriochlorophyll. This subunit is found in green plants, photosynthetic algae, cyanobacteria and other photosynthetic bacteria. Unlike subunit I (TIGR02030), this subunit is not found in archaea.
Probab=97.99 E-value=0.00013 Score=81.54 Aligned_cols=165 Identities=15% Similarity=0.170 Sum_probs=100.3
Q ss_pred cceEEEEEecccccch-hHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHH
Q 007491 3 EKQLIVAVEGTAAMGP-YWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFL 81 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGp-Yi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Fl 81 (601)
...++||||+++||.. -|+..| -+++.|...-. ..+-+++||.|++.. .. ...+||++...+.
T Consensus 407 ~~~v~fvvD~SGSM~~~rl~~aK-----~av~~Ll~~~~------~~~D~v~Li~F~~~~---a~--~~lp~t~~~~~~~ 470 (589)
T TIGR02031 407 GRLLIFVVDASGSAAVARMSEAK-----GAVELLLGEAY------VHRDQVSLIAFRGTA---AE--VLLPPSRSVEQAK 470 (589)
T ss_pred CceEEEEEECCCCCChHHHHHHH-----HHHHHHHHhhc------cCCCEEEEEEECCCC---ce--EECCCCCCHHHHH
Confidence 3568999999999963 244444 33444432111 134589999998765 11 2357899999999
Q ss_pred HHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhh
Q 007491 82 HWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQ 161 (601)
Q Consensus 82 qwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~ 161 (601)
.+|+.|+..| | -.+++||..|++.+.... .....+++|||+|=-+-- +. .. ..+ ++..+
T Consensus 471 ~~L~~l~~gG-g---TpL~~gL~~A~~~~~~~~-------~~~~~~~ivllTDG~~nv-~~--~~----~~~-~~~~~-- 529 (589)
T TIGR02031 471 RRLDVLPGGG-G---TPLAAGLAAAFQTALQAR-------SSGGTPTIVLITDGRGNI-PL--DG----DPE-SIKAD-- 529 (589)
T ss_pred HHHhcCCCCC-C---CcHHHHHHHHHHHHHHhc-------ccCCceEEEEECCCCCCC-CC--Cc----ccc-ccccc--
Confidence 9999998533 3 456677777777664211 013457899999864321 10 00 000 00000
Q ss_pred hcccCCCHHHHHHHhhhcCcEEEEeCCC----cchHHHHHHHHhcCC
Q 007491 162 AESRLSDAETVAKSFVQCSVSLSVICPK----QLPKLTAIYNAAKRN 204 (601)
Q Consensus 162 ~~s~laDaEqVAk~faEk~IsLSIIsPR----klP~Lk~Lfeka~~n 204 (601)
.+....++..+|+.+.+.||.+++|... +...++.|-++.+++
T Consensus 530 ~~~~~~~~~~~a~~~~~~gi~~~vid~~~~~~~~~~~~~lA~~~~g~ 576 (589)
T TIGR02031 530 REQAAEEALALARKIREAGMPALVIDTAMRFVSTGFAQKLARKMGAH 576 (589)
T ss_pred chhHHHHHHHHHHHHHhcCCeEEEEeCCCCCccchHHHHHHHhcCCc
Confidence 0001124578899999999999999432 344588888877665
No 39
>PF00362 Integrin_beta: Integrin, beta chain; InterPro: IPR002369 Integrins are the major metazoan receptors for cell adhesion to extracellular matrix proteins and, in vertebrates, also play important roles in certain cell-cell adhesions, make transmembrane connections to the cytoskeleton and activate many intracellular signalling pathways [, ]. The integrin receptors are composed of alpha and beta subunit heterodimers. Each subunit crosses the membrane once, with most of the polypeptide residing in the extracellular space, and has two short cytoplasmic domains. Some members of this family have EGF repeats at the C terminus and also have a vWA domain inserted within the integrin domain at the N terminus. Most integrins recognise relatively short peptide motifs, and in general require an acidic amino acid to be present. Ligand specificity depends upon both the alpha and beta subunits []. There are at least 18 types of alpha and 8 types of beta subunits recognised in humans []. Each alpha subunit tends to associate only with one type of beta subunit, but there are exceptions to this rule []. Each association of alpha and beta subunits has its own binding specificity and signalling properties. Many integrins require activation on the cell surface before they can bind ligands. Integrins frequently intercommunicate, and binding at one integrin receptor activate or inhibit another. The structure of unliganded alphaV beta3 showed the molecule to be folded, with the head bent over towards the C termini of the legs which would normally be inserted into the membrane []. The head comprises a beta propeller domain at the end terminus of the alphaV subunit and an I/A domain inserted into a loop on the top of the hybrid domain in the beta subunit. The I/A domain consists of a Rossman fold with a core of beta parallel sheets surrounded by amphipathic alpha helices. Integrins are important therapeutic targets in conditions such as atherosclerosis, thrombosis, cancer and asthma []. At the N terminus of the beta subunit is a cysteine-containing domain reminiscent of that found in presenillins and semaphorins, which has hence been termed the PSI domain. C-terminal to the PSI domain is an A-domain, which has been predicted to adopt a Rossmann fold similar to that of the alpha subunit, but with additional loops between the second and third beta strands []. The murine gene Pactolus shares significant similarity with the beta subunit [], but lacks either one or both of the inserted loops. The C-terminal portion of the beta subunit extracellular domain contains an internally disulphide-bonded cysteine-rich region, while the intracellular tail contains putative sites of interaction with a variety of intracellular signalling and cytoskeletal proteins, such as focal adhesion kinase and alpha-actinin respectively []. Integrin cytoplasmic domains are normally less than 50 amino acids in length, with the beta-subunit sequences exhibiting greater homology to each other than the alpha-subunit sequences. This is consistent with current evidence that the beta subunit is the principal site for binding of cytoskeletal and signalling molecules, whereas the alpha subunit has a regulatory role. The first 20 amino acids of the beta-subunit cytoplasmic domain are also alpha helical, but the final 25 residues are disordered and, apart from a turn that follows a conserved NPxY motif, appear to lack defined structure, suggesting that this is adopted on effector binding. The two membrane-proximal helices mediate the link between the subunits via a series of hydrophobic and electrostatic contacts. This entry represents the N-terminal portion of the extracellular region of integrin beta subunits.; GO: 0005488 binding, 0007155 cell adhesion, 0007160 cell-matrix adhesion; PDB: 3VI4_B 3VI3_B 2VDQ_B 3IJE_B 1M1X_B 2VDR_B 3NIF_B 3NID_D 1TYE_F 2Q6W_F ....
Probab=97.91 E-value=0.00013 Score=79.19 Aligned_cols=180 Identities=10% Similarity=0.084 Sum_probs=108.9
Q ss_pred ceEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCC--Ccccc---------------
Q 007491 4 KQLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHG--SYCAC--------------- 66 (601)
Q Consensus 4 ~DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~--P~~e~--------------- 66 (601)
.||.|++|-+.+|+++.+.||+.= ..+.+.++. + ..++|+|+=.|-|.+ ||-..
T Consensus 103 vDLYyLmDlS~Sm~ddl~~l~~lg-~~l~~~~~~--i------t~~~~~GfGsfvdK~~~P~~~~~p~~l~~pc~~~~~~ 173 (426)
T PF00362_consen 103 VDLYYLMDLSYSMKDDLENLKSLG-QDLAEEMRN--I------TSNFRLGFGSFVDKPVMPFVSTTPEKLKNPCPSKNPN 173 (426)
T ss_dssp EEEEEEEE-SGGGHHHHHHHCCCC-HHHHHHHHT--T-------SSEEEEEEEESSSSSTTTST-SSHCHHSTSCCTTS-
T ss_pred eeEEEEeechhhhhhhHHHHHHHH-HHHHHHHHh--c------CccceEechhhcccccCCcccCChhhhcCcccccCCC
Confidence 699999999999999999999763 566666643 1 356899999998876 44211
Q ss_pred ------ceeeccCCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCC
Q 007491 67 ------LVQRSGWTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPL 140 (601)
Q Consensus 67 ------~Vqr~GfTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhpl 140 (601)
|-.....|+|.++|.+.+++..++|+-|.||+-.+||--|..|-+.+. || .+..|..|+.+|+++|-.
T Consensus 174 c~~~~~f~~~l~Lt~~~~~F~~~v~~~~is~n~D~PEgg~dal~Qa~vC~~~ig-Wr-----~~a~~llv~~TD~~fH~a 247 (426)
T PF00362_consen 174 CQPPFSFRHVLSLTDDITEFNEEVNKQKISGNLDAPEGGLDALMQAAVCQEEIG-WR-----NEARRLLVFSTDAGFHFA 247 (426)
T ss_dssp -B---SEEEEEEEES-HHHHHHHHHTS--B--SSSSBSHHHHHHHHHH-HHHHT--------STSEEEEEEEESS-B--T
T ss_pred CCCCeeeEEeecccchHHHHHHhhhhccccCCCCCCccccchheeeeecccccC-cc-----cCceEEEEEEcCCccccc
Confidence 112234599999999999999999999999999999988876653221 11 346778899999999986
Q ss_pred CC----Cccccccccccccchhhhh--hcccCCCHHHHHHHhhhcCcEE-EEeCCCcchHHHHHH
Q 007491 141 PT----PVYRPQMQNLDQNENNEAQ--AESRLSDAETVAKSFVQCSVSL-SVICPKQLPKLTAIY 198 (601)
Q Consensus 141 P~----pVy~P~~~~~~~ge~~~~~--~~s~laDaEqVAk~faEk~IsL-SIIsPRklP~Lk~Lf 198 (601)
.. -...|.+..--+...+.|. .+-++-...||++.+.|++|.. +.|.-+....+++|-
T Consensus 248 gDg~l~gi~~pnd~~Chl~~~~~y~~~~~~DYPSv~ql~~~l~e~~i~~IFAVt~~~~~~Y~~L~ 312 (426)
T PF00362_consen 248 GDGKLAGIVKPNDGKCHLDDNGMYTASTEQDYPSVGQLVRKLSENNINPIFAVTKDVYSIYEELS 312 (426)
T ss_dssp TGGGGGT--S---SS--BSTTSBBGGGGCS----HHHHHHHHHHTTEEEEEEEEGGGHHHHHHHH
T ss_pred cccccceeeecCCCceEECCCCcccccccccCCCHHHHHHHHHHcCCEEEEEEchhhhhHHHHHh
Confidence 63 1112211111111111222 2234556789999999999875 455555555555553
No 40
>cd01462 VWA_YIEM_type VWA YIEM type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=97.85 E-value=0.00056 Score=61.70 Aligned_cols=104 Identities=18% Similarity=0.251 Sum_probs=64.8
Q ss_pred ceEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHHHH
Q 007491 4 KQLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFLHW 83 (601)
Q Consensus 4 ~DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Flqw 83 (601)
+||+||||+++||...=..+.+.-+.-+++.+.. ...+++||.|.+.. . ........+..+++++
T Consensus 1 ~~v~illD~SgSM~~~k~~~a~~~~~~l~~~~~~----------~~~~v~li~F~~~~---~--~~~~~~~~~~~~~~~~ 65 (152)
T cd01462 1 GPVILLVDQSGSMYGAPEEVAKAVALALLRIALA----------ENRDTYLILFDSEF---Q--TKIVDKTDDLEEPVEF 65 (152)
T ss_pred CCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHH----------cCCcEEEEEeCCCc---e--EEecCCcccHHHHHHH
Confidence 5899999999999854112222333344444322 23589999998772 1 1223467888999999
Q ss_pred hcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecC
Q 007491 84 LSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAAS 135 (601)
Q Consensus 84 LdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nS 135 (601)
|..+.. ||| -.++++|..++.++... ....+.+|+|+|=
T Consensus 66 l~~~~~-~gg---T~l~~al~~a~~~l~~~---------~~~~~~ivliTDG 104 (152)
T cd01462 66 LSGVQL-GGG---TDINKALRYALELIERR---------DPRKADIVLITDG 104 (152)
T ss_pred HhcCCC-CCC---cCHHHHHHHHHHHHHhc---------CCCCceEEEECCC
Confidence 988765 333 35667777777766421 0124577888764
No 41
>TIGR02442 Cob-chelat-sub cobaltochelatase subunit. A number of genomes (actinobacteria, cyanobacteria, betaproteobacteria and pseudomonads) which apparently biosynthesize B12, encode a cobN gene but are demonstrably lacking cobS and cobT. These genomes do, however contain a homolog (modelled here) of the magnesium chelatase subunits BchI/BchD family. Aside from the cyanobacteria (which have a separate magnesium chelatase trimer), these species do not make chlorins, so do not have any use for a magnesium chelatase. Furthermore, in nearly all cases the members of this family are proximal to either CobN itself or other genes involved in cobalt transport or B12 biosynthesis.
Probab=97.73 E-value=0.00067 Score=76.44 Aligned_cols=158 Identities=18% Similarity=0.219 Sum_probs=98.9
Q ss_pred cceEEEEEecccccch--hHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHH
Q 007491 3 EKQLIVAVEGTAAMGP--YWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIF 80 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGp--Yi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~F 80 (601)
...||||||+++||+. -|..+|.- +..++.... ..+-+++||.|++.. .. ...++|++...+
T Consensus 465 ~~~vv~vvD~SgSM~~~~rl~~ak~a-~~~ll~~a~----------~~~D~v~lI~F~g~~---a~--~~~p~t~~~~~~ 528 (633)
T TIGR02442 465 GNLVIFVVDASGSMAARGRMAAAKGA-VLSLLRDAY----------QKRDKVALITFRGEE---AE--VLLPPTSSVELA 528 (633)
T ss_pred CceEEEEEECCccCCCccHHHHHHHH-HHHHHHHhh----------cCCCEEEEEEECCCC---ce--EEcCCCCCHHHH
Confidence 4589999999999975 34444432 233332211 135799999998653 11 245789999999
Q ss_pred HHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhh
Q 007491 81 LHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEA 160 (601)
Q Consensus 81 lqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~ 160 (601)
..+|+.++..|| -.+++||..|+..+..... ......+.+|||+|=-+---. .+
T Consensus 529 ~~~L~~l~~gG~----Tpl~~aL~~A~~~l~~~~~-----~~~~~~~~vvliTDG~~n~~~------------~~----- 582 (633)
T TIGR02442 529 ARRLEELPTGGR----TPLAAGLLKAAEVLSNELL-----RDDDGRPLLVVITDGRANVAD------------GG----- 582 (633)
T ss_pred HHHHHhCCCCCC----CCHHHHHHHHHHHHHHhhc-----cCCCCceEEEEECCCCCCCCC------------CC-----
Confidence 999999986433 3466777777766642100 011345788999875332100 00
Q ss_pred hhcccCCCHHHHHHHhhhcCcEEEEeCCC----cchHHHHHHHHhcCC
Q 007491 161 QAESRLSDAETVAKSFVQCSVSLSVICPK----QLPKLTAIYNAAKRN 204 (601)
Q Consensus 161 ~~~s~laDaEqVAk~faEk~IsLSIIsPR----klP~Lk~Lfeka~~n 204 (601)
.-...++..+|+.+.+.+|.+.+|... +...++.|-++.++.
T Consensus 583 --~~~~~~~~~~a~~l~~~~i~~~vIdt~~~~~~~~~~~~lA~~~gg~ 628 (633)
T TIGR02442 583 --EPPTDDARTIAAKLAARGILFVVIDTESGFVRLGLAEDLARALGGE 628 (633)
T ss_pred --CChHHHHHHHHHHHHhcCCeEEEEeCCCCCcchhHHHHHHHhhCCe
Confidence 001225678899999999999999442 346777777766543
No 42
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=97.25 E-value=0.0081 Score=70.61 Aligned_cols=106 Identities=14% Similarity=0.270 Sum_probs=63.8
Q ss_pred cceEEEEEecccccch--hHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCC-----c
Q 007491 3 EKQLIVAVEGTAAMGP--YWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWT-----K 75 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGp--Yi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfT-----s 75 (601)
+++||||||.++||.. -++.+| ..++.|-...+. ...++|||.|.+.. .. ..+++ .
T Consensus 304 ~r~VVLVLDvSGSM~g~dRL~~lk-----qAA~~fL~~~l~------~~DrVGLVtFsssA---~v---l~pLt~Its~~ 366 (863)
T TIGR00868 304 QRIVCLVLDKSGSMTVEDRLKRMN-----QAAKLFLLQTVE------KGSWVGMVTFDSAA---YI---KNELIQITSSA 366 (863)
T ss_pred CceEEEEEECCccccccCHHHHHH-----HHHHHHHHHhCC------CCCEEEEEEECCce---eE---eeccccCCcHH
Confidence 4789999999999974 354444 233333211111 34699999999975 21 12232 2
Q ss_pred CHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCC
Q 007491 76 DVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNP 137 (601)
Q Consensus 76 D~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPP 137 (601)
|.+.+...|. +. .+|| -+|.+||.+|++++..... ....+++||++|--+
T Consensus 367 dr~aL~~~L~-~~-A~GG---T~I~~GL~~Alq~L~~~~~-------~~~~~~IILLTDGed 416 (863)
T TIGR00868 367 ERDALTANLP-TA-ASGG---TSICSGLKAAFQVIKKSYQ-------STDGSEIVLLTDGED 416 (863)
T ss_pred HHHHHHHhhc-cc-cCCC---CcHHHHHHHHHHHHHhccc-------ccCCCEEEEEeCCCC
Confidence 4455554444 22 3343 6788999999988754210 123579999988544
No 43
>cd01460 vWA_midasin VWA_Midasin: Midasin is a member of the AAA ATPase family. The proteins of this family are unified by their common archetectural organization that is based upon a conserved ATPase domain. The AAA domain of midasin contains six tandem AAA protomers. The AAA domains in midasin is followed by a D/E rich domain that is following by a VWA domain. The members of this subgroup have a conserved MIDAS motif. The function of this domain is not exactly known although it has been speculated to play a crucial role in midasin function.
Probab=97.25 E-value=0.0051 Score=63.38 Aligned_cols=133 Identities=13% Similarity=0.131 Sum_probs=84.5
Q ss_pred cceEEEEEecccccchhHHH-HH--H-HHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHH
Q 007491 3 EKQLIVAVEGTAAMGPYWQS-IV--S-DYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVD 78 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGpYi~~-Lk--t-nYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~ 78 (601)
+.||||+||-+.||....-. ++ + ..|..+++.. ..-++|+|.|.... ...++||+|..
T Consensus 60 ~~qIvlaID~S~SM~~~~~~~~aleak~lIs~al~~L------------e~g~vgVv~Fg~~~------~~v~Plt~d~~ 121 (266)
T cd01460 60 DYQILIAIDDSKSMSENNSKKLALESLCLVSKALTLL------------EVGQLGVCSFGEDV------QILHPFDEQFS 121 (266)
T ss_pred CceEEEEEecchhcccccccccHHHHHHHHHHHHHhC------------cCCcEEEEEeCCCc------eEeCCCCCCch
Confidence 57999999999999775221 11 1 1222222222 23689999999864 13367788776
Q ss_pred H--HHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCC--CCCCCCCccccccccccc
Q 007491 79 I--FLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASN--PHPLPTPVYRPQMQNLDQ 154 (601)
Q Consensus 79 ~--FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSP--PhplP~pVy~P~~~~~~~ 154 (601)
. ++..++...|..+| -.|..+|.+|+.+|+... .+. .+....+-+|||+|-- .|.
T Consensus 122 ~~a~~~~l~~~~f~~~~---Tni~~aL~~a~~~f~~~~--~~~-~s~~~~qlilLISDG~~~~~e--------------- 180 (266)
T cd01460 122 SQSGPRILNQFTFQQDK---TDIANLLKFTAQIFEDAR--TQS-SSGSLWQLLLIISDGRGEFSE--------------- 180 (266)
T ss_pred hhHHHHHhCcccCCCCC---CcHHHHHHHHHHHHHhhh--ccc-cccccccEEEEEECCCcccCc---------------
Confidence 4 55556555554433 469999999999996421 000 0111358889997754 222
Q ss_pred cchhhhhhcccCCCHHHHHHHhhhcCcEEEEe
Q 007491 155 NENNEAQAESRLSDAETVAKSFVQCSVSLSVI 186 (601)
Q Consensus 155 ge~~~~~~~s~laDaEqVAk~faEk~IsLSII 186 (601)
.....+++.+.|++|.+.+|
T Consensus 181 ------------~~~~~~~r~a~e~~i~l~~I 200 (266)
T cd01460 181 ------------GAQKVRLREAREQNVFVVFI 200 (266)
T ss_pred ------------cHHHHHHHHHHHcCCeEEEE
Confidence 12345689999999999998
No 44
>PF03731 Ku_N: Ku70/Ku80 N-terminal alpha/beta domain; InterPro: IPR005161 The Ku heterodimer (composed of Ku70 P12956 from SWISSPROT and Ku80 P13010 from SWISSPROT) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the N-terminal alpha/beta domain. This domain only makes a small contribution to the dimer interface. The domain comprises a six stranded beta sheet of the Rossman fold [].; PDB: 1JEQ_A 1JEY_A.
Probab=97.14 E-value=0.016 Score=56.01 Aligned_cols=141 Identities=13% Similarity=0.180 Sum_probs=75.0
Q ss_pred EEEEEecccccc-------hhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCc--------cccce-e
Q 007491 6 LIVAVEGTAAMG-------PYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSY--------CACLV-Q 69 (601)
Q Consensus 6 IVFVIDgTAsMG-------pYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~--------~e~~V-q 69 (601)
+||+||++.+|. .+|..+.+-.-.-+.+.+ +. +...++|||+|++...- ...++ +
T Consensus 2 ~vflID~s~sM~~~~~~~~~~l~~al~~i~~~~~~ki----~~-----~~kD~vgvvl~gt~~t~n~~~~~~~~~i~~l~ 72 (224)
T PF03731_consen 2 TVFLIDVSPSMFEPSSESESPLEEALKAIEDLMQQKI----IS-----SPKDEVGVVLFGTDETNNPDEDSGYENIFVLQ 72 (224)
T ss_dssp EEEEEE-SCGGGS-BTTCS-HHHHHHHHHHHHHHHHH----HT-----T---EEEEEEES-SS-BST-TTT-STTEEEEE
T ss_pred EEEEEECCHHHCCCCCCcchhHHHHHHHHHHHHHHHH----cC-----CCCCeEEEEEEcCCCCCCcccccCCCceEEee
Confidence 799999999999 334444322211222222 11 13468999999987742 33344 2
Q ss_pred eccCCcCHHHHHHHhcCccc------CCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecC-CCCCCCC
Q 007491 70 RSGWTKDVDIFLHWLSTIPF------AGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAAS-NPHPLPT 142 (601)
Q Consensus 70 r~GfTsD~~~FlqwLdsI~F------sGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nS-PPhplP~ 142 (601)
... ..+++.+....+-+.- .+++..+..+.++|-.|+.+|..... ......|+++||+|. .||.-..
T Consensus 73 ~l~-~~~~~~l~~L~~~~~~~~~~~~~~~~~~~~~l~~al~v~~~~~~~~~~-----~~k~~~krI~l~Td~d~p~~~~~ 146 (224)
T PF03731_consen 73 PLD-PPSAERLKELEELLKPGDKFENFFSGSDEGDLSDALWVASDMFRERTC-----KKKKNKKRIFLFTDNDGPHEDDD 146 (224)
T ss_dssp ECC---BHHHHHHHHTTSHHHHHHHHHC-SSS---HHHHHHHHHHHHHCHCT-----TS-ECEEEEEEEES-SSTTT-CC
T ss_pred cCC-ccCHHHHHHHHHhhcccccccccCCCCCccCHHHHHHHHHHHHHHHhh-----cccCCCcEEEEEeCCCCCCCCHH
Confidence 222 2345544444444432 23445567899999999999864111 123468899999965 6665111
Q ss_pred CccccccccccccchhhhhhcccCCCHHHH-----HHHhhhcCcEEEEe
Q 007491 143 PVYRPQMQNLDQNENNEAQAESRLSDAETV-----AKSFVQCSVSLSVI 186 (601)
Q Consensus 143 pVy~P~~~~~~~ge~~~~~~~s~laDaEqV-----Ak~faEk~IsLSII 186 (601)
+.+.+ ++.+.+.+|.|.++
T Consensus 147 -------------------------~~~~~~~~l~~~Dl~~~~i~~~~~ 170 (224)
T PF03731_consen 147 -------------------------ELERIIQKLKAKDLQDNGIEIELF 170 (224)
T ss_dssp -------------------------CHHHHHHHHHHHHHHHHTEEEEEE
T ss_pred -------------------------HHHHHHHhhccccchhcCcceeEe
Confidence 22333 66699999998777
No 45
>TIGR03788 marine_srt_targ marine proteobacterial sortase target protein. Members of this protein family are restricted to the Proteobacteria. Each contains a C-terminal sortase-recognition motif, transmembrane domain, and basic residues cluster at the the C-terminus, and is encoded adjacent to a sortase gene. This protein is frequently the only sortase target in its genome, which is as unusual its occurrence in Gram-negative rather than Gram-positive genomes. Many bacteria with this system are marine. In addition to the LPXTG signal, members carry a vault protein inter-alpha-trypsin inhibitor domain (pfam08487) and a von Willebrand factor type A domain (pfam00092).
Probab=97.14 E-value=0.0027 Score=70.75 Aligned_cols=107 Identities=13% Similarity=0.204 Sum_probs=69.2
Q ss_pred cceEEEEEecccccch-hHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceee-ccC-CcCHHH
Q 007491 3 EKQLIVAVEGTAAMGP-YWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQR-SGW-TKDVDI 79 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGp-Yi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr-~Gf-TsD~~~ 79 (601)
.+++|||||+++||.. .|+.+|.- +.-+++.+. ..-+|+||.|.+.. ..+... ... ..+..+
T Consensus 271 p~~vvfvlD~SgSM~g~~i~~ak~a-l~~~l~~L~-----------~~d~~~ii~F~~~~---~~~~~~~~~~~~~~~~~ 335 (596)
T TIGR03788 271 PRELVFVIDTSGSMAGESIEQAKSA-LLLALDQLR-----------PGDRFNIIQFDSDV---TLLFPVPVPATAHNLAR 335 (596)
T ss_pred CceEEEEEECCCCCCCccHHHHHHH-HHHHHHhCC-----------CCCEEEEEEECCcc---eEeccccccCCHHHHHH
Confidence 3689999999999964 46666643 344455542 24599999998765 222111 122 235677
Q ss_pred HHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCC
Q 007491 80 FLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASN 136 (601)
Q Consensus 80 FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSP 136 (601)
...||+.+...|| -.+..||..|+..+... .....|++|||+|--
T Consensus 336 a~~~i~~l~a~Gg----T~l~~aL~~a~~~~~~~--------~~~~~~~iillTDG~ 380 (596)
T TIGR03788 336 ARQFVAGLQADGG----TEMAGALSAALRDDGPE--------SSGALRQVVFLTDGA 380 (596)
T ss_pred HHHHHhhCCCCCC----ccHHHHHHHHHHhhccc--------CCCceeEEEEEeCCC
Confidence 8889999987644 34777777787653211 123468999998863
No 46
>cd01455 vWA_F11C1-5a_type Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A
Probab=96.85 E-value=0.026 Score=56.03 Aligned_cols=151 Identities=10% Similarity=0.019 Sum_probs=90.9
Q ss_pred ceEEEEEecccccchhHHH----H--HHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceee-ccCCcC
Q 007491 4 KQLIVAVEGTAAMGPYWQS----I--VSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQR-SGWTKD 76 (601)
Q Consensus 4 ~DIVFVIDgTAsMGpYi~~----L--ktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr-~GfTsD 76 (601)
+.|+++||-++||-. |+. + |-+-+...++.|..- ... . ..-++|+ +.+.. .-++|+. +++|.|
T Consensus 1 ~~l~lavDlSgSM~~-~~~~dg~~~~RL~a~k~v~~~f~~f-~~~--r--~~DriG~--~g~~~--~~~~lt~d~p~t~d 70 (191)
T cd01455 1 KRLKLVVDVSGSMYR-FNGYDGRLDRSLEAVVMVMEAFDGF-EDK--I--QYDIIGH--SGDGP--CVPFVKTNHPPKNN 70 (191)
T ss_pred CceEEEEECcHhHHH-HhccCCccccHHHHHHHHHHHHHHH-HHh--C--ccceeee--cCccc--ccCccccccCcccc
Confidence 469999999999986 441 1 112233445556321 011 1 2245562 33332 3556654 577777
Q ss_pred H---HHHHHHhcCcccCCCCCchhhHHHHHHHHHhccc-cCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccc
Q 007491 77 V---DIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFS-VAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNL 152 (601)
Q Consensus 77 ~---~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd-~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~ 152 (601)
. +.+++.|+..++ |.....-+|||..|++.|. .. ...+|.+||++|-+=..-
T Consensus 71 ~~~~~~l~~~l~~~q~---g~ag~~TadAi~~av~rl~~~~---------~a~~kvvILLTDG~n~~~------------ 126 (191)
T cd01455 71 KERLETLKMMHAHSQF---CWSGDHTVEATEFAIKELAAKE---------DFDEAIVIVLSDANLERY------------ 126 (191)
T ss_pred hhHHHHHHHHHHhccc---CccCccHHHHHHHHHHHHHhcC---------cCCCcEEEEEeCCCcCCC------------
Confidence 7 577778888876 2222233399999999885 32 235789999987653221
Q ss_pred cccchhhhhhcccCCCHHHH-HHHhhhcCcEEEEeCCCc--chHHHHHHHHh
Q 007491 153 DQNENNEAQAESRLSDAETV-AKSFVQCSVSLSVICPKQ--LPKLTAIYNAA 201 (601)
Q Consensus 153 ~~ge~~~~~~~s~laDaEqV-Ak~faEk~IsLSIIsPRk--lP~Lk~Lfeka 201 (601)
+ -+++++ |++.++.||+++.|.=-+ -..|+.+-+..
T Consensus 127 ------------~-i~P~~aAa~lA~~~gV~iytIgiG~~d~~~l~~iA~~t 165 (191)
T cd01455 127 ------------G-IQPKKLADALAREPNVNAFVIFIGSLSDEADQLQRELP 165 (191)
T ss_pred ------------C-CChHHHHHHHHHhCCCEEEEEEecCCCHHHHHHHHhCC
Confidence 1 166775 688889999999995322 45566655543
No 47
>cd01457 vWA_ORF176_type VWA ORF176 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most
Probab=96.68 E-value=0.043 Score=52.49 Aligned_cols=111 Identities=12% Similarity=0.204 Sum_probs=65.2
Q ss_pred cceEEEEEecccccchh--------HHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCC
Q 007491 3 EKQLIVAVEGTAAMGPY--------WQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWT 74 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGpY--------i~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfT 74 (601)
.+|++|+||.++||+.. |+..|.- +..+++.... . ..-.++|++|.+.. . ..++.+
T Consensus 2 ~~dvv~~ID~SgSM~~~~~~~~~~k~~~ak~~-~~~l~~~~~~--~-------D~d~i~l~~f~~~~---~---~~~~~~ 65 (199)
T cd01457 2 NRDYTLLIDKSGSMAEADEAKERSRWEEAQES-TRALARKCEE--Y-------DSDGITVYLFSGDF---R---RYDNVN 65 (199)
T ss_pred CcCEEEEEECCCcCCCCCCCCCchHHHHHHHH-HHHHHHHHHh--c-------CCCCeEEEEecCCc---c---ccCCcC
Confidence 47999999999999853 5555533 3444443321 1 12357788886654 1 112333
Q ss_pred cCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhcc-ccCCCCCCCCCCCCCccEEEEeecCCCCC
Q 007491 75 KDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMF-SVAPNGSQNQQNVDGQRHCILVAASNPHP 139 (601)
Q Consensus 75 sD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~f-d~~~~~~q~~~~~~~qKhCILI~nSPPhp 139 (601)
...+.+++.++...|| -.+..+|..|+..+ ...... ......+++|+|+|--|+.
T Consensus 66 --~~~v~~~~~~~~p~G~----T~l~~~l~~a~~~~~~~~~~~----~~~p~~~~vIiiTDG~~~d 121 (199)
T cd01457 66 --SSKVDQLFAENSPDGG----TNLAAVLQDALNNYFQRKENG----ATCPEGETFLVITDGAPDD 121 (199)
T ss_pred --HHHHHHHHhcCCCCCc----CcHHHHHHHHHHHHHHHHhhc----cCCCCceEEEEEcCCCCCc
Confidence 6888889988876544 34556666666422 211000 0111368999999988765
No 48
>PF04056 Ssl1: Ssl1-like; InterPro: IPR007198 Ssl1-like proteins are 40 kDa subunits of the transcription factor II H complex. This domain is often found associated with the C2H2 type Zn-finger (IPR007087 from INTERPRO).; GO: 0008270 zinc ion binding, 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent
Probab=96.48 E-value=0.048 Score=54.16 Aligned_cols=160 Identities=17% Similarity=0.230 Sum_probs=103.9
Q ss_pred EEecccccc--hhHHH---HHHHHHHHHHH-HhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHHH
Q 007491 9 AVEGTAAMG--PYWQS---IVSDYLEKIIR-CFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFLH 82 (601)
Q Consensus 9 VIDgTAsMG--pYi~~---LktnYI~PiLE-yF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Flq 82 (601)
|||.+.+|. +|-+. ..-.|++..++ ||.+.++ .++|+|+.|+.- .. ....++.++.+..+
T Consensus 1 viD~S~~m~~~D~~PtRl~~~~~~l~~Fv~eff~qNPi---------Sqlgii~~~~~~---a~--~ls~lsgn~~~h~~ 66 (193)
T PF04056_consen 1 VIDMSEAMREKDLKPTRLQCVLKALEEFVREFFDQNPI---------SQLGIIVMRDGR---AE--RLSELSGNPQEHIE 66 (193)
T ss_pred CeechHhHHhCcCCccHHHHHHHHHHHHHHHHHhcCCh---------hheeeeeeecce---eE--EeeecCCCHHHHHH
Confidence 578777774 22222 22233333333 3344333 689999999865 22 34567889999999
Q ss_pred HhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhhh
Q 007491 83 WLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQA 162 (601)
Q Consensus 83 wLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~~ 162 (601)
.|+++...+ ..++..+-.||..|...+...+. -..|-+++|-.|- .
T Consensus 67 ~L~~~~~~~-~~G~~SLqN~Le~A~~~L~~~p~--------~~srEIlvi~gSl-------------------------~ 112 (193)
T PF04056_consen 67 ALKKLRKLE-PSGEPSLQNGLEMARSSLKHMPS--------HGSREILVIFGSL-------------------------T 112 (193)
T ss_pred HHHHhccCC-CCCChhHHHHHHHHHHHHhhCcc--------ccceEEEEEEeec-------------------------c
Confidence 888886332 23467788999999988754331 2457777764321 1
Q ss_pred cccCCCHHHHHHHhhhcCcEEEEe-CCCcchHHHHHHHHhcCCCCCCCCCCCCCCCCceEEEecccchhh
Q 007491 163 ESRLSDAETVAKSFVQCSVSLSVI-CPKQLPKLTAIYNAAKRNPRAADPPVDNSKNPHFLVLISENFMEA 231 (601)
Q Consensus 163 ~s~laDaEqVAk~faEk~IsLSII-sPRklP~Lk~Lfeka~~n~~~a~~s~dyAKdP~HLVLL~g~~L~a 231 (601)
.++..|....++.+.+.+|.+||| ---.+-.+|+|.++.++. |.|.|++.-+++
T Consensus 113 t~Dp~di~~ti~~l~~~~IrvsvI~laaEv~I~k~i~~~T~G~---------------y~V~lde~H~~~ 167 (193)
T PF04056_consen 113 TCDPGDIHETIESLKKENIRVSVISLAAEVYICKKICKETGGT---------------YGVILDEDHFKE 167 (193)
T ss_pred cCCchhHHHHHHHHHHcCCEEEEEEEhHHHHHHHHHHHhhCCE---------------EEEecCHHHHHH
Confidence 233347788999999999999999 233588999999988763 566666655544
No 49
>PF04811 Sec23_trunk: Sec23/Sec24 trunk domain; InterPro: IPR006896 COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger (IPR006895 from INTERPRO), an alpha/beta trunk domain, an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes the Sec23/24 alpha/beta trunk domain, which is formed from a single, approximately 250-residue segment plugged into the beta-barrel between strands beta-1 and beta-19. The trunk has an alpha/beta fold with a vWA topology, and it forms the dimer interface, primarily involving strand beta-14 on Sec23 and Sec24; in addition, the trunk domain of Sec23 contacts Sar1.; GO: 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EGD_A 2NUP_A 3EG9_A 3EFO_A 3EGX_A 2NUT_A 1PD0_A 1PD1_A 1M2V_B 1PCX_A ....
Probab=96.41 E-value=0.032 Score=55.20 Aligned_cols=178 Identities=14% Similarity=0.184 Sum_probs=86.0
Q ss_pred eEEEEEeccc--ccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCC---------------------
Q 007491 5 QLIVAVEGTA--AMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHG--------------------- 61 (601)
Q Consensus 5 DIVFVIDgTA--sMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~--------------------- 61 (601)
-.|||||.+- .-..+.+.+++. |..+|+.+. ...+.++|||+|.+.=
T Consensus 5 ~y~FvID~s~~av~~g~~~~~~~s-l~~~l~~l~---------~~~~~~vgiitfd~~V~~y~l~~~~~~~~~~v~~dl~ 74 (243)
T PF04811_consen 5 VYVFVIDVSYEAVQSGLLQSLIES-LKSALDSLP---------GDERTRVGIITFDSSVHFYNLSSSLSQPQMIVVSDLD 74 (243)
T ss_dssp EEEEEEE-SHHHHHHTHHHHHHHH-HHHHGCTSS---------TSTT-EEEEEEESSSEEEEETTTTSSSTEEEEEHHTT
T ss_pred EEEEEEECchhhhhccHHHHHHHH-HHHHHHhcc---------CCCCcEEEEEEeCCEEEEEECCCCcCCCcccchHHHh
Confidence 3699999983 244555555544 334443332 1257999999996521
Q ss_pred ----Cccc-cceeeccCCcCHHHHHHHhcCcccCCC-C-CchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeec
Q 007491 62 ----SYCA-CLVQRSGWTKDVDIFLHWLSTIPFAGG-G-FNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAA 134 (601)
Q Consensus 62 ----P~~e-~~Vqr~GfTsD~~~FlqwLdsI~FsGG-G-fspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~n 134 (601)
|.++ .++.....-..++.+++.|.++ +... + ..+.+...||..|+.++.... .--++++...
T Consensus 75 ~~~~p~~~~llv~~~e~~~~i~~ll~~L~~~-~~~~~~~~~~~c~G~Al~~A~~ll~~~~----------~gGkI~~F~s 143 (243)
T PF04811_consen 75 DPFIPLPDGLLVPLSECRDAIEELLESLPSI-FPETAGKRPERCLGSALSAALSLLSSRN----------TGGKILVFTS 143 (243)
T ss_dssp SHHSSTSSSSSEETTTCHHHHHHHHHHHHHH-STT-TTB-----HHHHHHHHHHHHHHHT----------S-EEEEEEES
T ss_pred hcccCCcccEEEEhHHhHHHHHHHHHHhhhh-cccccccCccccHHHHHHHHHHHHhccc----------cCCEEEEEec
Confidence 1111 1111111112234444444442 2222 2 236788999999999886321 2245666667
Q ss_pred CCCCCCC-C---CccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEe-CCCc---chHHHHHHHHhcCCC
Q 007491 135 SNPHPLP-T---PVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVI-CPKQ---LPKLTAIYNAAKRNP 205 (601)
Q Consensus 135 SPPhplP-~---pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSII-sPRk---lP~Lk~Lfeka~~n~ 205 (601)
++|.-=| - .-..+...+.+ +|...+.+ ..-.-.+.+|+.|.+++|.+.+. ++.. ++.|..|....+++.
T Consensus 144 ~~pt~G~Gg~l~~~~~~~~~~~~-~~~~~~~~-~~~~fY~~la~~~~~~~isvDlf~~~~~~~~l~tl~~l~~~TGG~l 220 (243)
T PF04811_consen 144 GPPTYGPGGSLKKREDSSHYDTE-KEKALLLP-PANEFYKKLAEECSKQGISVDLFVFSSDYVDLATLGPLARYTGGSL 220 (243)
T ss_dssp S---SSSTTSS-SBTTSCCCCHC-TTHHCHSH-SSSHHHHHHHHHHHHCTEEEEEEEECSS--SHHHHTHHHHCTT-EE
T ss_pred cCCCCCCCceecccccccccccc-cchhhhcc-ccchHHHHHHHHHHhcCCEEEEEeecCCCCCcHhHHHHHHhCceeE
Confidence 7774333 1 00001001100 11011111 11112589999999999988776 5554 778888888777764
No 50
>COG4245 TerY Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [General function prediction only]
Probab=96.41 E-value=0.023 Score=56.91 Aligned_cols=155 Identities=21% Similarity=0.223 Sum_probs=96.0
Q ss_pred CCcceEEEEEeccccc-chhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeec-cCCcCHH
Q 007491 1 MSEKQLIVAVEGTAAM-GPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRS-GWTKDVD 78 (601)
Q Consensus 1 ~~~~DIVFVIDgTAsM-GpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~-GfTsD~~ 78 (601)
|.++-++|++|+.++| |.-|+.|. +-|..+++.+.+.+.+. .+.++++|+|..+- ..+ +|| |+.
T Consensus 1 mrRlP~~lllDtSgSM~Ge~IealN-~Glq~m~~~Lkqdp~Al-----e~v~lsIVTF~~~a-------~~~~pf~-~~~ 66 (207)
T COG4245 1 MRRLPCYLLLDTSGSMIGEPIEALN-AGLQMMIDTLKQDPYAL-----ERVELSIVTFGGPA-------RVIQPFT-DAA 66 (207)
T ss_pred CCCCCEEEEEecCcccccccHHHHH-HHHHHHHHHHHhChhhh-----heeEEEEEEecCcc-------eEEechh-hHh
Confidence 6678899999999998 67777776 55899999998866654 57899999998744 222 333 233
Q ss_pred HHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchh
Q 007491 79 IFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENN 158 (601)
Q Consensus 79 ~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~ 158 (601)
.|. .-.| +.+||- .+..||..|+.|.++...........+..-..+||+|..|-. .|+
T Consensus 67 nF~--~p~L-~a~GgT---~lGaAl~~a~d~Ie~~~~~~~a~~kgdyrP~vfLiTDG~PtD----~w~------------ 124 (207)
T COG4245 67 NFN--PPIL-TAQGGT---PLGAALTLALDMIEERKRKYDANGKGDYRPWVFLITDGEPTD----DWQ------------ 124 (207)
T ss_pred hcC--CCce-ecCCCC---chHHHHHHHHHHHHHHHhhcccCCccccceEEEEecCCCcch----HHH------------
Confidence 221 1222 233442 344566777877765322222222234566778999888833 111
Q ss_pred hhhhcccCCCHHHHHHHh----hhcCcEEEEeCCC--cchHHHHHHHHhc
Q 007491 159 EAQAESRLSDAETVAKSF----VQCSVSLSVICPK--QLPKLTAIYNAAK 202 (601)
Q Consensus 159 ~~~~~s~laDaEqVAk~f----aEk~IsLSIIsPR--klP~Lk~Lfeka~ 202 (601)
|.+++-+ ..++|-.+.+-++ .++.|+.|+|+-.
T Consensus 125 -----------~~~~~~~~~~~~~k~v~a~~~G~~~ad~~~L~qit~~V~ 163 (207)
T COG4245 125 -----------AGAALVFQGERRAKSVAAFSVGVQGADNKTLNQITEKVR 163 (207)
T ss_pred -----------hHHHHhhhcccccceEEEEEecccccccHHHHHHHHhhc
Confidence 2222221 2344556666565 5999999998753
No 51
>TIGR00578 ku70 ATP-dependent DNA helicase ii, 70 kDa subunit (ku70). Proteins in this family are involved in non-homologous end joining, a process used for the repair of double stranded DNA breaks. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). Cutoff does not detect the putative ku70 homologs in yeast.
Probab=95.25 E-value=0.3 Score=55.41 Aligned_cols=148 Identities=20% Similarity=0.161 Sum_probs=85.8
Q ss_pred EEEEEecccccchhHHHH----HHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCC-----CccccceeeccCCcC
Q 007491 6 LIVAVEGTAAMGPYWQSI----VSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHG-----SYCACLVQRSGWTKD 76 (601)
Q Consensus 6 IVFVIDgTAsMGpYi~~L----ktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~-----P~~e~~Vqr~GfTsD 76 (601)
|||+||++.+|..=...+ .=+-+..++..+.+..+- .....++|||+|++-. .|..+||-+.==..|
T Consensus 13 ilflIDvs~sM~~~~~~~~~~s~~~~al~~i~~l~q~kIi----s~~~D~vGivlfgT~~t~n~~~~~~i~v~~~L~~p~ 88 (584)
T TIGR00578 13 LIFLVDASKAMFEESQGEDELTPFDMSIQCIQSVYTSKII----SSDKDLLAVVFYGTEKDKNSVNFKNIYVLQELDNPG 88 (584)
T ss_pred EEEEEECCHHHcCCCcCcCcCChHHHHHHHHHHHHHhcCC----CCCCCeEEEEEEeccCCCCccCCCceEEEeeCCCCC
Confidence 899999999998411110 001123445555333331 1256799999999965 367777744211223
Q ss_pred HHHHHHHhcCcccC-C--------CCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecC-CCCCCCCCccc
Q 007491 77 VDIFLHWLSTIPFA-G--------GGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAAS-NPHPLPTPVYR 146 (601)
Q Consensus 77 ~~~FlqwLdsI~Fs-G--------GGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nS-PPhplP~pVy~ 146 (601)
++.+ ..|+.|.-. | |......++++|-.|+.+|.... ....+|+++||+|. .||.-..
T Consensus 89 a~~i-~~L~~l~~~~~~~~~~~~~~~~~~~~l~daL~~~~~~f~~~~-------~k~~~kRI~lfTd~D~P~~~~~---- 156 (584)
T TIGR00578 89 AKRI-LELDQFKGDQGPKKFRDTYGHGSDYSLSEVLWVCANLFSDVQ-------FRMSHKRIMLFTNEDNPHGNDS---- 156 (584)
T ss_pred HHHH-HHHHHHhhccCccchhhccCCCCCCcHHHHHHHHHHHHHhcc-------hhhcCcEEEEECCCCCCCCCch----
Confidence 3322 233333211 1 11123478999999999996422 12357888888776 7776221
Q ss_pred cccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEe
Q 007491 147 PQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVI 186 (601)
Q Consensus 147 P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSII 186 (601)
.... -+.+.|+.+.+.||.|-++
T Consensus 157 --------------~~~~---~a~~~a~dl~~~gi~ielf 179 (584)
T TIGR00578 157 --------------AKAS---RARTKAGDLRDTGIFLDLM 179 (584)
T ss_pred --------------hHHH---HHHHHHHHHHhcCeEEEEE
Confidence 0000 2356899999999999987
No 52
>cd01479 Sec24-like Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 24 is very similar to Sec23. The Sec23 and Sec24
Probab=93.57 E-value=1.1 Score=45.17 Aligned_cols=177 Identities=14% Similarity=0.165 Sum_probs=92.9
Q ss_pred eEEEEEecccccc--hhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCC------C---ccccce-----
Q 007491 5 QLIVAVEGTAAMG--PYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHG------S---YCACLV----- 68 (601)
Q Consensus 5 DIVFVIDgTAsMG--pYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~------P---~~e~~V----- 68 (601)
-.|||||.+..=- .+.+.+++- |..+|+.+-. + -.+.++|||+|.+.= . ...-++
T Consensus 5 ~~~FvIDvs~~a~~~g~~~~~~~s-i~~~L~~lp~------~--~~~~~VgiITfd~~v~~y~l~~~~~~~q~~vv~dl~ 75 (244)
T cd01479 5 VYVFLIDVSYNAIKSGLLATACEA-LLSNLDNLPG------D--DPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDLD 75 (244)
T ss_pred EEEEEEEccHHHHhhChHHHHHHH-HHHHHHhcCC------C--CCCeEEEEEEECCeEEEEECCCCCCCCeEEEeeCcc
Confidence 3699999875221 245555544 3444554421 0 146899999998732 0 011111
Q ss_pred eec-----cC----CcCHHHHHHHhcCcccC--CCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCC
Q 007491 69 QRS-----GW----TKDVDIFLHWLSTIPFA--GGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNP 137 (601)
Q Consensus 69 qr~-----Gf----TsD~~~FlqwLdsI~Fs--GGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPP 137 (601)
..+ +| ..+-+.+...|++|+-. -+.....|...||..|+.++... . -++|+.+.++|
T Consensus 76 d~f~P~~~~~lv~l~e~~~~i~~lL~~L~~~~~~~~~~~~c~G~Al~~A~~lL~~~-----------G-GkIi~f~s~~p 143 (244)
T cd01479 76 DPFLPLPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLKET-----------G-GKIIVFQSSLP 143 (244)
T ss_pred cccCCCCcceeecHHHHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHhc-----------C-CEEEEEeCCCC
Confidence 011 11 12223444566666422 22334688999999999888531 1 24455555555
Q ss_pred C-CCCCCcccc--ccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEe-CCCc---chHHHHHHHHhcCCC
Q 007491 138 H-PLPTPVYRP--QMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVI-CPKQ---LPKLTAIYNAAKRNP 205 (601)
Q Consensus 138 h-plP~pVy~P--~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSII-sPRk---lP~Lk~Lfeka~~n~ 205 (601)
- ....-+-+. .+.+-+ .|. +..+ ..-.-.+.+|..+.+++|.+.+. ++.. +..+..|-+..++..
T Consensus 144 t~GpG~l~~~~~~~~~~~~-~e~-~~~~-p~~~fY~~la~~~~~~~isvDlF~~~~~~~dla~l~~l~~~TGG~v 215 (244)
T cd01479 144 TLGAGKLKSREDPKLLSTD-KEK-QLLQ-PQTDFYKKLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQV 215 (244)
T ss_pred CcCCcccccCccccccCch-hhh-hhcC-cchHHHHHHHHHHHHcCeEEEEEEccCcccChhhhhhhhhhcCceE
Confidence 3 222211110 000000 110 1111 11112478999999999999887 4443 678888888887764
No 53
>PF05762 VWA_CoxE: VWA domain containing CoxE-like protein; InterPro: IPR008912 This group of proteins contains a VWA type domain and the function of this family is unknown. It is found as part of a CO oxidising (Cox) system operon in several bacteria [].
Probab=92.86 E-value=0.87 Score=45.14 Aligned_cols=129 Identities=16% Similarity=0.225 Sum_probs=69.8
Q ss_pred eEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHHHHh
Q 007491 5 QLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFLHWL 84 (601)
Q Consensus 5 DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~FlqwL 84 (601)
.|||++|.++||..|.. ++..++..+.. ..-++-+++|.+.- . -++..--..|+.+.+.-+
T Consensus 59 ~lvvl~DvSGSM~~~s~-----~~l~~~~~l~~----------~~~~~~~f~F~~~l---~-~vT~~l~~~~~~~~l~~~ 119 (222)
T PF05762_consen 59 RLVVLCDVSGSMAGYSE-----FMLAFLYALQR----------QFRRVRVFVFSTRL---T-EVTPLLRRRDPEEALARL 119 (222)
T ss_pred cEEEEEeCCCChHHHHH-----HHHHHHHHHHH----------hCCCEEEEEEeeeh---h-hhhhhhccCCHHHHHHHH
Confidence 79999999999999854 33445555533 11267777777643 1 111111123555444444
Q ss_pred c-CcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhhhc
Q 007491 85 S-TIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQAE 163 (601)
Q Consensus 85 d-sI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~~~ 163 (601)
. .....||| -.|.++|.+++..+... .. .....|+|+|. +. .++
T Consensus 120 ~~~~~~~~Gg---Tdi~~aL~~~~~~~~~~--------~~-~~t~vvIiSDg--~~--------------~~~------- 164 (222)
T PF05762_consen 120 SALVQSFGGG---TDIGQALREFLRQYARP--------DL-RRTTVVIISDG--WD--------------TND------- 164 (222)
T ss_pred HhhccCCCCc---cHHHHHHHHHHHHhhcc--------cc-cCcEEEEEecc--cc--------------cCC-------
Confidence 3 22223333 33556666666655310 01 23445667663 11 011
Q ss_pred ccCCCHHHHHHHhhhcCcEEEEeCCC
Q 007491 164 SRLSDAETVAKSFVQCSVSLSVICPK 189 (601)
Q Consensus 164 s~laDaEqVAk~faEk~IsLSIIsPR 189 (601)
..+.+...+.+.+++-.|..+.|-
T Consensus 165 --~~~~~~~l~~l~~r~~rviwLnP~ 188 (222)
T PF05762_consen 165 --PEPLAEELRRLRRRGRRVIWLNPL 188 (222)
T ss_pred --hHHHHHHHHHHHHhCCEEEEECCc
Confidence 113456667788999999999886
No 54
>cd01468 trunk_domain trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Some members of this family possess a partial MIDAS motif that is a characteristic feature of most vWA domain proteins.
Probab=92.80 E-value=3.2 Score=41.40 Aligned_cols=178 Identities=10% Similarity=0.116 Sum_probs=94.1
Q ss_pred EEEEEecccc--cchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCC------C---ccccceee----
Q 007491 6 LIVAVEGTAA--MGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHG------S---YCACLVQR---- 70 (601)
Q Consensus 6 IVFVIDgTAs--MGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~------P---~~e~~Vqr---- 70 (601)
.|||||.+.. -..+.+.+++- |..+|+.+-. ..+.++|||+|.++= . .+..+|-.
T Consensus 6 ~vFvID~s~~ai~~~~l~~~~~s-l~~~l~~lp~---------~~~~~igiITf~~~V~~~~~~~~~~~~~~~v~~dl~d 75 (239)
T cd01468 6 FVFVIDVSYEAIKEGLLQALKES-LLASLDLLPG---------DPRARVGLITYDSTVHFYNLSSDLAQPKMYVVSDLKD 75 (239)
T ss_pred EEEEEEcchHhccccHHHHHHHH-HHHHHHhCCC---------CCCcEEEEEEeCCeEEEEECCCCCCCCeEEEeCCCcc
Confidence 6999999874 24456777754 4566665521 146899999996421 1 11111110
Q ss_pred ------ccCC----cCHHHHHHHhcCcccCCCC----CchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCC
Q 007491 71 ------SGWT----KDVDIFLHWLSTIPFAGGG----FNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASN 136 (601)
Q Consensus 71 ------~GfT----sD~~~FlqwLdsI~FsGGG----fspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSP 136 (601)
.++- +.-+.+...|++|+-.... -...+...||..|+.++.... .--++++.+.++
T Consensus 76 ~f~p~~~~~l~~~~e~~~~i~~~l~~l~~~~~~~~~~~~~~~~G~Al~~A~~ll~~~~----------~gGkI~~f~sg~ 145 (239)
T cd01468 76 VFLPLPDRFLVPLSECKKVIHDLLEQLPPMFWPVPTHRPERCLGPALQAAFLLLKGTF----------AGGRIIVFQGGL 145 (239)
T ss_pred CcCCCcCceeeeHHHHHHHHHHHHHhhhhhccccCCCCCcccHHHHHHHHHHHHhhcC----------CCceEEEEECCC
Confidence 0111 1224455567777544432 235778889999998885321 112444444555
Q ss_pred CC-CCCCCccccccccc-cccchhhhhhcccCCCHHHHHHHhhhcCcEEEEe-CCCc---chHHHHHHHHhcCCC
Q 007491 137 PH-PLPTPVYRPQMQNL-DQNENNEAQAESRLSDAETVAKSFVQCSVSLSVI-CPKQ---LPKLTAIYNAAKRNP 205 (601)
Q Consensus 137 Ph-plP~pVy~P~~~~~-~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSII-sPRk---lP~Lk~Lfeka~~n~ 205 (601)
|- ....-..+-.-... .+.|. +..+ ..-.-.+.+|..|.+++|.+.+. ++.. +..+..|-+..++..
T Consensus 146 pt~GpG~l~~~~~~~~~~~~~e~-~~~~-~a~~fY~~la~~~~~~~isvdlF~~~~~~~dl~~l~~l~~~TGG~v 218 (239)
T cd01468 146 PTVGPGKLKSREDKEPIRSHDEA-QLLK-PATKFYKSLAKECVKSGICVDLFAFSLDYVDVATLKQLAKSTGGQV 218 (239)
T ss_pred CCCCCCccccCcccccCCCccch-hccc-ccHHHHHHHHHHHHHcCeEEEEEeccccccCHHHhhhhhhcCCceE
Confidence 53 22221100000000 00000 0011 11112478999999999999888 4443 577888877776664
No 55
>KOG1226 consensus Integrin beta subunit (N-terminal portion of extracellular region) [Signal transduction mechanisms; Extracellular structures]
Probab=91.91 E-value=0.9 Score=53.21 Aligned_cols=179 Identities=12% Similarity=0.088 Sum_probs=112.1
Q ss_pred ceEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCC--------------Ccccc--c
Q 007491 4 KQLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHG--------------SYCAC--L 67 (601)
Q Consensus 4 ~DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~--------------P~~e~--~ 67 (601)
.||...+|=+.+|+++++.||+.= ..+.+..+. + .+++|+|.=-|-|.. |.++. .
T Consensus 133 VDLYyLMDlS~SM~DDl~~l~~LG-~~L~~~m~~--l------T~nfrlGFGSFVDK~v~P~i~~~pekl~npc~~~~~C 203 (783)
T KOG1226|consen 133 VDLYYLMDLSYSMKDDLENLKSLG-TDLAREMRK--L------TSNFRLGFGSFVDKTVSPYISTTPEKLRNPCPNYKNC 203 (783)
T ss_pred eeEEEEeecchhhhhhHHHHHHHH-HHHHHHHHH--H------hccCCccccchhccccccccccCcHHhcCCCCCcccC
Confidence 699999999999999999999753 445444422 1 123454444443333 22221 1
Q ss_pred eeeccC------CcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCC
Q 007491 68 VQRSGW------TKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLP 141 (601)
Q Consensus 68 Vqr~Gf------TsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP 141 (601)
.-.|+| |.|+++|...+++-+.+|-=|.||-=.+|+=-|..+-+.+- || .++.|..|+..++-+|-..
T Consensus 204 ~ppfgfkhvLsLT~~~~~F~~~V~~q~ISgNlDaPEGGfDAimQaavC~~~IG-WR-----~~a~~lLVF~td~~~H~a~ 277 (783)
T KOG1226|consen 204 APPFGFKHVLSLTNDAEEFNEEVGKQRISGNLDAPEGGFDAIMQAAVCTEKIG-WR-----NDATRLLVFSTDAGFHFAG 277 (783)
T ss_pred CCCcccceeeecCCChHHHHHHHhhceeccCCCCCCchHHHHHhhhhcccccc-cc-----ccceeEEEEEcCcceeeec
Confidence 123444 99999999999999999988887664444444443332111 11 3467888999999999876
Q ss_pred C----Cccccccccccccchhhhhhcc--cCCCHHHHHHHhhhcCcEEEEe-CCCcchHHHHH
Q 007491 142 T----PVYRPQMQNLDQNENNEAQAES--RLSDAETVAKSFVQCSVSLSVI-CPKQLPKLTAI 197 (601)
Q Consensus 142 ~----pVy~P~~~~~~~ge~~~~~~~s--~laDaEqVAk~faEk~IsLSII-sPRklP~Lk~L 197 (601)
. -...|-+..=..++.++|..+. ++-...++++.+.+++|++..- .=.+...+++|
T Consensus 278 DgkLaGiv~pnDG~CHL~~~g~Yt~S~~qdyPSia~l~~kl~~~ni~~IFAVt~~~~~~Y~~l 340 (783)
T KOG1226|consen 278 DGKLAGIVQPNDGQCHLDKNGEYTQSTTQDYPSIAQLAQKLADNNINTIFAVTKNSQSLYEEL 340 (783)
T ss_pred ccceeeEecCCCCccccCCCCccceecCCCCCcHHHHHHHHhhhcchhHHHHhhhhhhHHHhh
Confidence 4 2223333333445555555432 4446689999999999986543 55555555544
No 56
>PRK10997 yieM hypothetical protein; Provisional
Probab=91.78 E-value=2.9 Score=47.13 Aligned_cols=88 Identities=19% Similarity=0.261 Sum_probs=48.8
Q ss_pred cceEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHHH
Q 007491 3 EKQLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFLH 82 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Flq 82 (601)
+..+|+|||+++||..+ +..|..-++-.+.--.+. .+-+++++.|.+.. .++ ...-.+....++.
T Consensus 323 kGpiII~VDtSGSM~G~----ke~~AkalAaAL~~iAl~------q~dr~~li~Fs~~i---~~~--~l~~~~gl~~ll~ 387 (487)
T PRK10997 323 RGPFIVCVDTSGSMGGF----NEQCAKAFCLALMRIALA------ENRRCYIMLFSTEV---VTY--ELTGPDGLEQAIR 387 (487)
T ss_pred CCcEEEEEECCCCCCCC----HHHHHHHHHHHHHHHHHh------cCCCEEEEEecCCc---eee--ccCCccCHHHHHH
Confidence 35799999999999855 334333332222110111 23478899998854 221 1223445677788
Q ss_pred HhcCcccCCCCCchhhHHHHHHHHHhcc
Q 007491 83 WLSTIPFAGGGFNDAAIAEGLSEALMMF 110 (601)
Q Consensus 83 wLdsI~FsGGGfspaAVAEGLAeAL~~f 110 (601)
.|+.. | |||- .++.+|.+++...
T Consensus 388 fL~~~-f-~GGT---Dl~~aL~~al~~l 410 (487)
T PRK10997 388 FLSQS-F-RGGT---DLAPCLRAIIEKM 410 (487)
T ss_pred HHHHh-c-CCCC---cHHHHHHHHHHHH
Confidence 88654 4 4442 2445555555554
No 57
>PF06707 DUF1194: Protein of unknown function (DUF1194); InterPro: IPR010607 This family consists of several hypothetical Rhizobiales specific proteins of around 270 residues in length. The function of this family is unknown.
Probab=91.60 E-value=4.1 Score=41.25 Aligned_cols=157 Identities=19% Similarity=0.257 Sum_probs=95.8
Q ss_pred cceEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCC--CcceEEEEEeecCCCccccceeeccC-----Cc
Q 007491 3 EKQLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSA--SNVELSLVTFNTHGSYCACLVQRSGW-----TK 75 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~--~s~eyGLVvYrd~~P~~e~~Vqr~Gf-----Ts 75 (601)
.++||+.||.++|+-.-=-.|.. +-+.+-|.+..+.+.=..+ .+..+++|.+-+.. ... ...+| ..
T Consensus 3 dlaLvLavDvS~SVD~~E~~lQ~---~G~A~Al~dp~V~~Ai~~g~~g~Iav~~~eWsg~~--~q~--~~v~Wt~i~~~~ 75 (205)
T PF06707_consen 3 DLALVLAVDVSGSVDADEYRLQR---EGYAAALRDPEVIAAILSGPIGRIAVAVVEWSGPG--RQR--VVVPWTRIDSPA 75 (205)
T ss_pred cceeeeeeeccCCCCHHHHHHHH---HHHHHHHCCHHHHHHHhcCCCCeEEEEEEEecCCC--Cce--EEeCCEEeCCHH
Confidence 46899999999999876444442 3445555443332211122 34455555554432 222 23467 46
Q ss_pred CHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCcccccccccccc
Q 007491 76 DVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQN 155 (601)
Q Consensus 76 D~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~g 155 (601)
|...|-..|+..+-..+++ -+|.++|..|+.+|+..+ ..+.|+.|.|.-- |
T Consensus 76 da~a~A~~l~~~~r~~~~~--Taig~Al~~a~~ll~~~~--------~~~~RrVIDvSGD-------------------G 126 (205)
T PF06707_consen 76 DAEAFAARLRAAPRRFGGR--TAIGSALDFAAALLAQNP--------FECWRRVIDVSGD-------------------G 126 (205)
T ss_pred HHHHHHHHHHhCCCCCCCC--chHHHHHHHHHHHHHhCC--------CCCceEEEEECCC-------------------C
Confidence 8888999999998777666 899999999999997532 2478899988432 2
Q ss_pred chhhhhhcccCCCHH-HHHHHhhhcCcEEEEe-CCCcch----HHHHHHHHh
Q 007491 156 ENNEAQAESRLSDAE-TVAKSFVQCSVSLSVI-CPKQLP----KLTAIYNAA 201 (601)
Q Consensus 156 e~~~~~~~s~laDaE-qVAk~faEk~IsLSII-sPRklP----~Lk~Lfeka 201 (601)
.++. ...+. .+-+.+..+||.+.-+ --..-+ .|..-|+..
T Consensus 127 ~~N~------G~~p~~~ard~~~~~GitINgL~I~~~~~~~~~~L~~yy~~~ 172 (205)
T PF06707_consen 127 PNNQ------GPRPVTSARDAAVAAGITINGLAILDDDPFGGADLDAYYRRC 172 (205)
T ss_pred CCCC------CCCccHHHHHHHHHCCeEEeeeEecCCCCCccccHHHHHhhh
Confidence 2111 11223 5556777889988766 222222 566666643
No 58
>PF11235 Med25_SD1: Mediator complex subunit 25 synapsin 1; InterPro: IPR021397 The overall function of the full-length Med25 is efficiently to coordinate the transcriptional activation of RAR/RXR (retinoic acid receptor/retinoic X receptor) in higher eukaryotic cells. Human Med25 consists of several domains with different binding properties, the N-terminal, VWA, domain, this SD1 - synapsin 1 - domain from residues 229-381, a PTOV(B) or ACID domain from 395-545, an SD2 domain from residues 564-645 and a C-terminal NR box-containing domain (646-650) from 646-747. This The function of the SD domains is unclear [].
Probab=88.98 E-value=0.7 Score=45.06 Aligned_cols=83 Identities=22% Similarity=0.264 Sum_probs=61.5
Q ss_pred CCCCCCCCCCCCcccccCCCcc-ccCCCCCCCCCCCCCCCCCccc-cc--ccc----cceeeccccccccccCCCCCCCC
Q 007491 238 PGVANLAPNQNPVKMDIAPVTS-VTGPAPTSIPSVNGINRPPISV-GN--VPT----ATVKVEPSTVTSMVNGPTFPHIP 309 (601)
Q Consensus 238 ~~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~psvng~~r~~~~v-g~--~~t----a~vkve~~~~~~m~~~P~f~~~~ 309 (601)
...+.|+++|.+.....+++.. ..++|++.+|.|..+ -|+| |+ +.+ |...||..+-.+...++||+.+.
T Consensus 7 S~pg~lq~KQ~Vp~pP~p~~~~~l~aaP~q~LPpVs~p---q~qvp~n~tLsaAq~AAq~AVEAAknQKaglg~RF~pi~ 83 (168)
T PF11235_consen 7 SVPGPLQPKQPVPLPPAPVSPAQLSAAPQQPLPPVSPP---QYQVPGNSTLSAAQVAAQNAVEAAKNQKAGLGPRFSPIN 83 (168)
T ss_pred CCCCCCCCCCcccCCCCCCCccccccCCcCCCCCCCcc---ccccCccccchHHHHHHHHHHHHHHhccccCCCCCCCCC
Confidence 3456899999998888776664 469999999999666 3554 44 555 36678999989999999999999
Q ss_pred CCCCCCCCCCCCcc
Q 007491 310 SVPRPASQGVPSLQ 323 (601)
Q Consensus 310 ~~~~~~s~~~~~~q 323 (601)
++.+..--|-|--|
T Consensus 84 p~~~a~~vgpPFsQ 97 (168)
T PF11235_consen 84 PLQQAPGVGPPFSQ 97 (168)
T ss_pred cccCCCCCCCCCcC
Confidence 87775433434333
No 59
>KOG2807 consensus RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription; Replication, recombination and repair]
Probab=87.98 E-value=13 Score=40.56 Aligned_cols=150 Identities=17% Similarity=0.257 Sum_probs=90.9
Q ss_pred ceEEEEEecccccch-----hHHHHHHHHHHHHH-HHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCH
Q 007491 4 KQLIVAVEGTAAMGP-----YWQSIVSDYLEKII-RCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDV 77 (601)
Q Consensus 4 ~DIVFVIDgTAsMGp-----Yi~~LktnYI~PiL-EyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~ 77 (601)
+-+++|||++-+|-. -.-.+...|++-.+ |+|.+.++ .++|||..||-- .. .....|.++
T Consensus 61 Rhl~iviD~S~am~e~Df~P~r~a~~~K~le~Fv~eFFdQNPi---------SQigii~~k~g~---A~--~lt~ltgnp 126 (378)
T KOG2807|consen 61 RHLYIVIDCSRAMEEKDFRPSRFANVIKYLEGFVPEFFDQNPI---------SQIGIISIKDGK---AD--RLTDLTGNP 126 (378)
T ss_pred eeEEEEEEhhhhhhhccCCchHHHHHHHHHHHHHHHHhccCch---------hheeEEEEecch---hh--HHHHhcCCH
Confidence 458999999988742 22223333333333 33344344 589999999765 22 123458899
Q ss_pred HHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccch
Q 007491 78 DIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNEN 157 (601)
Q Consensus 78 ~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~ 157 (601)
..-.+.|.++.+-+|-|+ +--||.-|...+..++. -..|-+++|-.|
T Consensus 127 ~~hI~aL~~~~~~~g~fS---LqNaLe~a~~~Lk~~p~--------H~sREVLii~ss---------------------- 173 (378)
T KOG2807|consen 127 RIHIHALKGLTECSGDFS---LQNALELAREVLKHMPG--------HVSREVLIIFSS---------------------- 173 (378)
T ss_pred HHHHHHHhcccccCCChH---HHHHHHHHHHHhcCCCc--------ccceEEEEEEee----------------------
Confidence 999999999996655565 33444555544432221 133444444221
Q ss_pred hhhhhcccCCCHHHHHHHhhhcCcEEEEe-CCCcchHHHHHHHHhcC
Q 007491 158 NEAQAESRLSDAETVAKSFVQCSVSLSVI-CPKQLPKLTAIYNAAKR 203 (601)
Q Consensus 158 ~~~~~~s~laDaEqVAk~faEk~IsLSII-sPRklP~Lk~Lfeka~~ 203 (601)
...++.+|..++.+.+..-+|.++|| .--.+-..|.|..+.++
T Consensus 174 ---lsT~DPgdi~~tI~~lk~~kIRvsvIgLsaEv~icK~l~kaT~G 217 (378)
T KOG2807|consen 174 ---LSTCDPGDIYETIDKLKAYKIRVSVIGLSAEVFICKELCKATGG 217 (378)
T ss_pred ---ecccCcccHHHHHHHHHhhCeEEEEEeechhHHHHHHHHHhhCC
Confidence 11234457788888899999999999 33446677777776654
No 60
>KOG2884 consensus 26S proteasome regulatory complex, subunit RPN10/PSMD4 [Posttranslational modification, protein turnover, chaperones]
Probab=84.11 E-value=32 Score=36.01 Aligned_cols=179 Identities=17% Similarity=0.261 Sum_probs=105.5
Q ss_pred EEEEEeccccc--chhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHHHH
Q 007491 6 LIVAVEGTAAM--GPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFLHW 83 (601)
Q Consensus 6 IVFVIDgTAsM--GpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Flqw 83 (601)
.++|||-.--| |+|+++=..-- ...++.+|+-.+... .-.-+||+.--.+.|. . -+.+|+|.-+++..
T Consensus 6 tmi~iDNse~mrNgDy~PtRf~aQ-~daVn~v~~~K~~sn----pEntvGiitla~a~~~--v---LsT~T~d~gkils~ 75 (259)
T KOG2884|consen 6 TMICIDNSEYMRNGDYLPTRFQAQ-KDAVNLVCQAKLRSN----PENTVGIITLANASVQ--V---LSTLTSDRGKILSK 75 (259)
T ss_pred EEEEEeChHHhhcCCCChHHHHHH-HHHHHHHHHhhhcCC----cccceeeEeccCCCce--e---eeeccccchHHHHH
Confidence 57888887654 78877644332 344666666555322 2245899988776621 1 24689999999999
Q ss_pred hcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhhhc
Q 007491 84 LSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQAE 163 (601)
Q Consensus 84 LdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~~~ 163 (601)
|..|+..| .+.+..||.-|+-.|.. || +.+.+.++|+...||= . |+.+
T Consensus 76 lh~i~~~g----~~~~~~~i~iA~lalkh----Rq---nk~~~~riVvFvGSpi-~----------------e~ek---- 123 (259)
T KOG2884|consen 76 LHGIQPHG----KANFMTGIQIAQLALKH----RQ---NKNQKQRIVVFVGSPI-E----------------ESEK---- 123 (259)
T ss_pred hcCCCcCC----cccHHHHHHHHHHHHHh----hc---CCCcceEEEEEecCcc-h----------------hhHH----
Confidence 99999887 35567788888777642 21 2122222333333321 1 1111
Q ss_pred ccCCCHHHHHHHhhhcCcEEEEeCCCcchH-HHHHHHHhcCCCCCCCCCCCCCCCCceEEEecccchhhhhhccCC
Q 007491 164 SRLSDAETVAKSFVQCSVSLSVICPKQLPK-LTAIYNAAKRNPRAADPPVDNSKNPHFLVLISENFMEARAALSRP 238 (601)
Q Consensus 164 s~laDaEqVAk~faEk~IsLSIIsPRklP~-Lk~Lfeka~~n~~~a~~s~dyAKdP~HLVLL~g~~L~ar~~l~~~ 238 (601)
+...+|+-+.+.+|.+.||-=--... -.+||++-+ ..|..-|--|||-+.-..+-.-+.++.|
T Consensus 124 ----eLv~~akrlkk~~Vaidii~FGE~~~~~e~l~~fid--------a~N~~~~gshlv~Vppg~~L~d~l~ssP 187 (259)
T KOG2884|consen 124 ----ELVKLAKRLKKNKVAIDIINFGEAENNTEKLFEFID--------ALNGKGDGSHLVSVPPGPLLSDALLSSP 187 (259)
T ss_pred ----HHHHHHHHHHhcCeeEEEEEeccccccHHHHHHHHH--------HhcCCCCCceEEEeCCCccHHHHhhcCc
Confidence 56889999999999999994222111 123332210 1122245569998876665555555555
No 61
>KOG2353 consensus L-type voltage-dependent Ca2+ channel, alpha2/delta subunit [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=81.69 E-value=7.7 Score=47.71 Aligned_cols=190 Identities=18% Similarity=0.168 Sum_probs=116.8
Q ss_pred CcceEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCC-Ccc----ccceeeccCCcC
Q 007491 2 SEKQLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHG-SYC----ACLVQRSGWTKD 76 (601)
Q Consensus 2 ~~~DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~-P~~----e~~Vqr~GfTsD 76 (601)
+.|||||.+|..++|...-=+|-+.-+.-||+.+.. .--+-++.|...- +.. ..+|+ +=..+
T Consensus 224 ~pKdiviLlD~SgSm~g~~~~lak~tv~~iLdtLs~-----------~Dfvni~tf~~~~~~v~pc~~~~lvq--At~~n 290 (1104)
T KOG2353|consen 224 SPKDIVILLDVSGSMSGLRLDLAKQTVNEILDTLSD-----------NDFVNILTFNSEVNPVSPCFNGTLVQ--ATMRN 290 (1104)
T ss_pred CccceEEEEeccccccchhhHHHHHHHHHHHHhccc-----------CCeEEEEeeccccCcccccccCceee--cchHH
Confidence 359999999999999887766766667888888843 1236677777665 111 11222 33678
Q ss_pred HHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccc
Q 007491 77 VDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNE 156 (601)
Q Consensus 77 ~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge 156 (601)
.++|...++.+.+-| -+...+||..|+.++-+...++....+..+.+-++||++..+-..- +
T Consensus 291 k~~~~~~i~~l~~k~----~a~~~~~~e~aF~lL~~~n~s~~~~~~~~C~~~iml~tdG~~~~~~--------------~ 352 (1104)
T KOG2353|consen 291 KKVFKEAIETLDAKG----IANYTAALEYAFSLLRDYNDSRANTQRSPCNQAIMLITDGVDENAK--------------E 352 (1104)
T ss_pred HHHHHHHHhhhcccc----ccchhhhHHHHHHHHHHhccccccccccccceeeEEeecCCcccHH--------------H
Confidence 889999999998554 4556778888887665444454444445588999999877654421 1
Q ss_pred hhhhhhcccCCCHHHHHHHhhhcCcEEEEeC-CCcchHHHHHHHHhcCCCCCCCCCCCCCCCCceEEEecccchhhhhhc
Q 007491 157 NNEAQAESRLSDAETVAKSFVQCSVSLSVIC-PKQLPKLTAIYNAAKRNPRAADPPVDNSKNPHFLVLISENFMEARAAL 235 (601)
Q Consensus 157 ~~~~~~~s~laDaEqVAk~faEk~IsLSIIs-PRklP~Lk~Lfeka~~n~~~a~~s~dyAKdP~HLVLL~g~~L~ar~~l 235 (601)
+++ ++-|. .+.|.++-.. =++--.+.+|+.- +--++.|-++--++.=++++.++.-.+|
T Consensus 353 If~--~yn~~-----------~~~Vrvftflig~~~~~~~~~~wm-------ac~n~gyy~~I~~~~~v~~~~~~y~~vl 412 (1104)
T KOG2353|consen 353 IFE--KYNWP-----------DKKVRVFTFLIGDEVYDLDEIQWM-------ACANKGYYVHIISIADVRENVLEYLDVL 412 (1104)
T ss_pred HHH--hhccC-----------CCceEEEEEEecccccccccchhh-------hhhCCCceEeccchhhcChHhhhhhhhh
Confidence 222 11110 3445444331 1211122222221 2235567777777777888888888888
Q ss_pred cCCCCCC
Q 007491 236 SRPGVAN 242 (601)
Q Consensus 236 ~~~~~~~ 242 (601)
+||-+-.
T Consensus 413 sRp~vl~ 419 (1104)
T KOG2353|consen 413 SRPLVLQ 419 (1104)
T ss_pred ccceeec
Confidence 8885443
No 62
>cd01478 Sec23-like Sec23-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 23 is very similar to Sec24. The Sec23 and Sec24
Probab=81.58 E-value=33 Score=35.57 Aligned_cols=117 Identities=13% Similarity=0.128 Sum_probs=60.6
Q ss_pred HHHHhcCcccC-----CCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCC-CCccccccccc-
Q 007491 80 FLHWLSTIPFA-----GGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLP-TPVYRPQMQNL- 152 (601)
Q Consensus 80 FlqwLdsI~Fs-----GGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP-~pVy~P~~~~~- 152 (601)
+...|+.|+-. -+.....|...||..|..+++-.. ...--++++....||--=| .-+.+..-+..
T Consensus 119 i~~lLe~L~~~~~~~~~~~r~~r~~G~Al~~A~~ll~~~~--------~~~gGki~~F~sg~pT~GpG~l~~r~~~~~~r 190 (267)
T cd01478 119 LTDLLEQLQPDPWPVPAGHRPLRCTGVALSIAVGLLEACF--------PNTGARIMLFAGGPCTVGPGAVVSTELKDPIR 190 (267)
T ss_pred HHHHHHhCcccccccCCCCCCCCchHHHHHHHHHHHHhhc--------CCCCcEEEEEECCCCCCCCceeeccccccccc
Confidence 33466666443 222335778888888888775211 0122355555566663322 22221100000
Q ss_pred cccch----hhhhhcccCCCHHHHHHHhhhcCcEEEEe--CCCc--chHHHHHHHHhcCCC
Q 007491 153 DQNEN----NEAQAESRLSDAETVAKSFVQCSVSLSVI--CPKQ--LPKLTAIYNAAKRNP 205 (601)
Q Consensus 153 ~~ge~----~~~~~~s~laDaEqVAk~faEk~IsLSII--sPRk--lP~Lk~Lfeka~~n~ 205 (601)
.+.++ .++.+. .-.-...+|..+.+++|.+.+. +..+ +..++.|.++.++.-
T Consensus 191 ~~~d~~~~~~~~~~~-a~~fY~~la~~~~~~~vsvDlF~~s~d~vglaem~~l~~~TGG~v 250 (267)
T cd01478 191 SHHDIDKDNAKYYKK-AVKFYDSLAKRLAANGHAVDIFAGCLDQVGLLEMKVLVNSTGGHV 250 (267)
T ss_pred cccccccchhhhhhh-HHHHHHHHHHHHHhCCeEEEEEeccccccCHHHHHHHHHhcCcEE
Confidence 01110 011111 1112367999999999888887 3443 678888888887764
No 63
>PF10138 vWA-TerF-like: vWA found in TerF C terminus ; InterPro: IPR019303 This entry represents the N-terminal domain of a family of proteins that confer resistance to the metalloid element tellurium and its salts.
Probab=81.23 E-value=41 Score=34.15 Aligned_cols=145 Identities=13% Similarity=0.230 Sum_probs=84.5
Q ss_pred cceEEEEEecccccchhHHH-HHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHH
Q 007491 3 EKQLIVAVEGTAAMGPYWQS-IVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFL 81 (601)
Q Consensus 3 ~~DIVFVIDgTAsMGpYi~~-LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Fl 81 (601)
++.|++|||.++||.++++. -..+.++.++..=.+ ++ ..-++=+++|.+.. .+.+ .=..+.|.
T Consensus 1 rArV~LVLD~SGSM~~~yk~G~vQ~~~Er~lalA~~--~D------dDG~i~v~~Fs~~~-------~~~~-~vt~~~~~ 64 (200)
T PF10138_consen 1 RARVYLVLDISGSMRPLYKDGTVQRVVERILALAAQ--FD------DDGEIDVWFFSTEF-------DRLP-DVTLDNYE 64 (200)
T ss_pred CcEEEEEEeCCCCCchhhhCccHHHHHHHHHHHHhh--cC------CCCceEEEEeCCCC-------CcCC-CcCHHHHH
Confidence 46899999999999998876 333444444433322 11 12356677776654 2211 12346677
Q ss_pred HHhcCcccC----C--CCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCcccccccccccc
Q 007491 82 HWLSTIPFA----G--GGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQN 155 (601)
Q Consensus 82 qwLdsI~Fs----G--GGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~g 155 (601)
+|++++... + || ..-+..|.+.+..+.... .....-+++.|+|=.|+.-.
T Consensus 65 ~~v~~~~~~~~~~~~~G~---t~y~~vm~~v~~~y~~~~-------~~~~P~~VlFiTDG~~~~~~-------------- 120 (200)
T PF10138_consen 65 GYVDELHAGLPDWGRMGG---TNYAPVMEDVLDHYFKRE-------PSDAPALVLFITDGGPDDRR-------------- 120 (200)
T ss_pred HHHHHHhccccccCCCCC---cchHHHHHHHHHHHhhcC-------CCCCCeEEEEEecCCccchH--------------
Confidence 777776432 2 33 222334555665553211 11234578888887776521
Q ss_pred chhhhhhcccCCCHHHHHHHhhhcCcEEEEe--CCCcchHHHHHHH
Q 007491 156 ENNEAQAESRLSDAETVAKSFVQCSVSLSVI--CPKQLPKLTAIYN 199 (601)
Q Consensus 156 e~~~~~~~s~laDaEqVAk~faEk~IsLSII--sPRklP~Lk~Lfe 199 (601)
.++++.+..+...|-..+| .=..+-+|++|=+
T Consensus 121 ------------~~~~~i~~as~~pifwqFVgiG~~~f~fL~kLD~ 154 (200)
T PF10138_consen 121 ------------AIEKLIREASDEPIFWQFVGIGDSNFGFLEKLDD 154 (200)
T ss_pred ------------HHHHHHHhccCCCeeEEEEEecCCcchHHHHhhc
Confidence 2467777778888888887 3334888888865
No 64
>COG5148 RPN10 26S proteasome regulatory complex, subunit RPN10/PSMD4 [Posttranslational modification, protein turnover, chaperones]
Probab=79.89 E-value=36 Score=34.99 Aligned_cols=164 Identities=16% Similarity=0.243 Sum_probs=100.1
Q ss_pred EEEEEecccc--cchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHHHH
Q 007491 6 LIVAVEGTAA--MGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFLHW 83 (601)
Q Consensus 6 IVFVIDgTAs--MGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~Flqw 83 (601)
.|++||..-. +|+|+++=..-- ...+|.+-+..++.. .-.-+||+.-.++.|.. -+.||++.-+|+..
T Consensus 6 tvvliDNse~s~NgDy~ptRFeAQ-kd~ve~if~~K~ndn----pEntiGli~~~~a~p~v-----lsT~T~~~gkilt~ 75 (243)
T COG5148 6 TVVLIDNSEASQNGDYLPTRFEAQ-KDAVESIFSKKFNDN----PENTIGLIPLVQAQPNV-----LSTPTKQRGKILTF 75 (243)
T ss_pred EEEEEeChhhhhcCCCCcHHHHHH-HHHHHHHHHHHhcCC----ccceeeeeecccCCcch-----hccchhhhhHHHHH
Confidence 5788887754 578877544332 234555544344322 23468999888877432 24679999999999
Q ss_pred hcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhhhc
Q 007491 84 LSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQAE 163 (601)
Q Consensus 84 LdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~~~ 163 (601)
|..|+..||- .++.+|.-|+-.|.... .-...+|.+..+| ||=. |+.|
T Consensus 76 lhd~~~~g~a----~~~~~lqiaql~lkhR~------nk~q~qriVaFvg-Spi~-----------------esed---- 123 (243)
T COG5148 76 LHDIRLHGGA----DIMRCLQIAQLILKHRD------NKGQRQRIVAFVG-SPIQ-----------------ESED---- 123 (243)
T ss_pred hccccccCcc----hHHHHHHHHHHHHhccc------CCccceEEEEEec-Cccc-----------------ccHH----
Confidence 9999999974 46677777877774321 1111234444443 2211 1111
Q ss_pred ccCCCHHHHHHHhhhcCcEEEEeCCCcchHHHHHHHHhcCCCCCCCCCCCCCCCCceEEEe
Q 007491 164 SRLSDAETVAKSFVQCSVSLSVICPKQLPKLTAIYNAAKRNPRAADPPVDNSKNPHFLVLI 224 (601)
Q Consensus 164 s~laDaEqVAk~faEk~IsLSIIsPRklP~Lk~Lfeka~~n~~~a~~s~dyAKdP~HLVLL 224 (601)
..-.+|+.+.+.||-+.||-=-.+.-...|||+-+. .| ..|.-||+.+
T Consensus 124 ----eLirlak~lkknnVAidii~fGE~~n~~~l~efIda--------~N-~~dsshl~~~ 171 (243)
T COG5148 124 ----ELIRLAKQLKKNNVAIDIIFFGEAANMAGLFEFIDA--------TN-FSDSSHLEVK 171 (243)
T ss_pred ----HHHHHHHHHHhcCeeEEEEehhhhhhhhHHHHHHHh--------hc-cccceeeEec
Confidence 346789999999999999955555555566665421 12 2455688776
No 65
>KOG1984 consensus Vesicle coat complex COPII, subunit SFB3 [Intracellular trafficking, secretion, and vesicular transport]
Probab=79.74 E-value=24 Score=42.79 Aligned_cols=163 Identities=16% Similarity=0.232 Sum_probs=91.6
Q ss_pred ceEEEEEecc--cccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCC------C------------c
Q 007491 4 KQLIVAVEGT--AAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHG------S------------Y 63 (601)
Q Consensus 4 ~DIVFVIDgT--AsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~------P------------~ 63 (601)
.-.||.||-+ |++..-+..+.++ |..+|++|-. . ..+.|+|+|+|.+.= + .
T Consensus 418 pafvFmIDVSy~Ai~~G~~~a~ce~-ik~~l~~lp~---~-----~p~~~Vgivtfd~tvhFfnl~s~L~qp~mliVsdv 488 (1007)
T KOG1984|consen 418 PAFVFMIDVSYNAISNGAVKAACEA-IKSVLEDLPR---E-----EPNIRVGIVTFDKTVHFFNLSSNLAQPQMLIVSDV 488 (1007)
T ss_pred ceEEEEEEeehhhhhcchHHHHHHH-HHHHHhhcCc---c-----CCceEEEEEEecceeEeeccCccccCceEEEeecc
Confidence 3469999986 4555555666655 5777787733 1 267999999997643 1 1
Q ss_pred ccccee-ecc----CCcCHHHHHHHhcCcccC--CCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCC
Q 007491 64 CACLVQ-RSG----WTKDVDIFLHWLSTIPFA--GGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASN 136 (601)
Q Consensus 64 ~e~~Vq-r~G----fTsD~~~FlqwLdsI~Fs--GGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSP 136 (601)
++.||= ..| ...+...+...||.|.-. +-+.-+.++..+|.+|+..+... +.-|.++....-|
T Consensus 489 ~dvfvPf~~g~~V~~~es~~~i~~lLd~Ip~mf~~sk~pes~~g~alqaa~lalk~~----------~gGKl~vF~s~Lp 558 (1007)
T KOG1984|consen 489 DDVFVPFLDGLFVNPNESRKVIELLLDSIPTMFQDSKIPESVFGSALQAAKLALKAA----------DGGKLFVFHSVLP 558 (1007)
T ss_pred cccccccccCeeccchHHHHHHHHHHHHhhhhhccCCCCchhHHHHHHHHHHHHhcc----------CCceEEEEecccc
Confidence 111221 012 145556666789999633 33444677888888887766432 1335555554444
Q ss_pred CCCCCCCccccccccccccchhhhhhc---c--cCCCHHHHHHHhhhcCcEEEEe-CCCc
Q 007491 137 PHPLPTPVYRPQMQNLDQNENNEAQAE---S--RLSDAETVAKSFVQCSVSLSVI-CPKQ 190 (601)
Q Consensus 137 PhplP~pVy~P~~~~~~~ge~~~~~~~---s--~laDaEqVAk~faEk~IsLSII-sPRk 190 (601)
=+..+- ++.+-+++.-+.-.|+ + .-++...||++|.|.+|.|.+. .|..
T Consensus 559 t~g~g~-----kl~~r~D~~l~~t~kek~l~~pq~~~y~~LA~e~v~~g~svDlF~t~~a 613 (1007)
T KOG1984|consen 559 TAGAGG-----KLSNRDDRRLIGTDKEKNLLQPQDKTYTTLAKEFVESGCSVDLFLTPNA 613 (1007)
T ss_pred cccCcc-----cccccchhhhhcccchhhccCcchhHHHHHHHHHHHhCceEEEEEcccc
Confidence 444441 1222222221111111 1 1123468999999999988876 4443
No 66
>TIGR00627 tfb4 transcription factor tfb4. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=79.11 E-value=38 Score=35.80 Aligned_cols=84 Identities=13% Similarity=0.137 Sum_probs=56.8
Q ss_pred chhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhhhcccCCCHHHHHH
Q 007491 95 NDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQAESRLSDAETVAK 174 (601)
Q Consensus 95 spaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk 174 (601)
....++.||.-||+...+.. +..+.....+.++++|..|+-+.. -|.| ....+.
T Consensus 117 ~~s~lagals~ALcyinr~~--~~~~~~~~~~~RIlii~~s~~~~~---qYi~---------------------~mn~If 170 (279)
T TIGR00627 117 SRTVLAGALSDALGYINRSE--QSETASEKLKSRILVISITPDMAL---QYIP---------------------LMNCIF 170 (279)
T ss_pred ccccchhHHHhhhhhhcccc--cccccCcCCcceEEEEECCCCchH---HHHH---------------------HHHHHH
Confidence 45668899999997765431 111122346777777777754332 1322 367788
Q ss_pred HhhhcCcEEEEe--CCC-cchHHHHHHHHhcCC
Q 007491 175 SFVQCSVSLSVI--CPK-QLPKLTAIYNAAKRN 204 (601)
Q Consensus 175 ~faEk~IsLSII--sPR-klP~Lk~Lfeka~~n 204 (601)
.+++.+|.++++ +.. ....|+.+-+..+|.
T Consensus 171 aaqk~~I~Idv~~L~~e~~~~~lqQa~~~TgG~ 203 (279)
T TIGR00627 171 SAQKQNIPIDVVSIGGDFTSGFLQQAADITGGS 203 (279)
T ss_pred HHHHcCceEEEEEeCCccccHHHHHHHHHhCCE
Confidence 889999999998 443 489999999988774
No 67
>PF09967 DUF2201: VWA-like domain (DUF2201); InterPro: IPR018698 This family of various hypothetical bacterial proteins has no known function.
Probab=77.83 E-value=7.8 Score=35.64 Aligned_cols=77 Identities=16% Similarity=0.224 Sum_probs=40.2
Q ss_pred EEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHHHHhc
Q 007491 6 LIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFLHWLS 85 (601)
Q Consensus 6 IVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~FlqwLd 85 (601)
|+++||+++||.. +.|+ .++..+..-+.. ...++-|+ +-|+.-+....+ ++ ....+.
T Consensus 1 i~vaiDtSGSis~--~~l~-~fl~ev~~i~~~----------~~~~v~vi-~~D~~v~~~~~~------~~---~~~~~~ 57 (126)
T PF09967_consen 1 IVVAIDTSGSISD--EELR-RFLSEVAGILRR----------FPAEVHVI-QFDAEVQDVQVF------RS---LEDELR 57 (126)
T ss_pred CEEEEECCCCCCH--HHHH-HHHHHHHHHHHh----------CCCCEEEE-EECCEeeeeeEE------ec---cccccc
Confidence 6899999999986 2333 233333332221 02244333 444443332222 22 333566
Q ss_pred CcccCCCCCc-hhhHHHHHHH
Q 007491 86 TIPFAGGGFN-DAAIAEGLSE 105 (601)
Q Consensus 86 sI~FsGGGfs-paAVAEGLAe 105 (601)
.+++.|||-. ...+.|-+.+
T Consensus 58 ~~~~~GgGGTdf~pvf~~~~~ 78 (126)
T PF09967_consen 58 DIKLKGGGGTDFRPVFEYLEE 78 (126)
T ss_pred ccccCCCCCCcchHHHHHHHh
Confidence 7888888754 4666666543
No 68
>COG2425 Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [General function prediction only]
Probab=75.00 E-value=23 Score=39.69 Aligned_cols=83 Identities=18% Similarity=0.274 Sum_probs=53.6
Q ss_pred eEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHHHHHHHh
Q 007491 5 QLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVDIFLHWL 84 (601)
Q Consensus 5 DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~~FlqwL 84 (601)
.++++||.++||+-+=..+-+-...-+++-- +. .+-++.+.+|.+ . -.=+.-+....|..+++..|
T Consensus 274 pvilllD~SGSM~G~~e~~AKAvalAl~~~a----la------enR~~~~~lF~s-~---~~~~el~~k~~~~~e~i~fL 339 (437)
T COG2425 274 PVILLLDKSGSMSGFKEQWAKAVALALMRIA----LA------ENRDCYVILFDS-E---VIEYELYEKKIDIEELIEFL 339 (437)
T ss_pred CEEEEEeCCCCcCCcHHHHHHHHHHHHHHHH----HH------hccceEEEEecc-c---ceeeeecCCccCHHHHHHHH
Confidence 5899999999999887766655322222222 11 345688889988 1 22224578889999999999
Q ss_pred cCcccCCCC-CchhhHHHHH
Q 007491 85 STIPFAGGG-FNDAAIAEGL 103 (601)
Q Consensus 85 dsI~FsGGG-fspaAVAEGL 103 (601)
..+. ||| |-..|+..||
T Consensus 340 ~~~f--~GGTD~~~~l~~al 357 (437)
T COG2425 340 SYVF--GGGTDITKALRSAL 357 (437)
T ss_pred hhhc--CCCCChHHHHHHHH
Confidence 8764 455 3334444333
No 69
>KOG3768 consensus DEAD box RNA helicase [General function prediction only]
Probab=72.67 E-value=4.1 Score=47.21 Aligned_cols=87 Identities=14% Similarity=0.126 Sum_probs=61.6
Q ss_pred EEEEEecccccc-------hhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcCHH
Q 007491 6 LIVAVEGTAAMG-------PYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKDVD 78 (601)
Q Consensus 6 IVFVIDgTAsMG-------pYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD~~ 78 (601)
+.|+||++++|. .|++..| ..|+-.|++= .+.-.+ ..-||-|++|..++ +-.+.+|-+.=.
T Consensus 4 ~lFllDTS~SM~qrah~~~tylD~AK-gaVEtFiK~R-~r~~~~-----~gdryml~TfeepP-----~~vk~~~~~~~a 71 (888)
T KOG3768|consen 4 FLFLLDTSGSMSQRAHPQFTYLDLAK-GAVETFIKQR-TRVGRE-----TGDRYMLTTFEEPP-----KNVKVACEKLGA 71 (888)
T ss_pred EEEEEecccchhhhccCCchhhHHHH-HHHHHHHHHH-hccccc-----cCceEEEEecccCc-----hhhhhHHhhccc
Confidence 689999999995 4666555 3334444433 212222 34689999998877 224568999999
Q ss_pred HHHHHhcCcccCCCC-CchhhHHHHHH
Q 007491 79 IFLHWLSTIPFAGGG-FNDAAIAEGLS 104 (601)
Q Consensus 79 ~FlqwLdsI~FsGGG-fspaAVAEGLA 104 (601)
.|..-|+++.+-+|- -.+.++.||+.
T Consensus 72 ~~~~eik~l~a~~~s~~~~~~~t~AFd 98 (888)
T KOG3768|consen 72 VVIEEIKKLHAPYGSCQLHHAITEAFD 98 (888)
T ss_pred HHHHHHHhhcCccchhhhhHHHHHHhh
Confidence 999999999999884 56888888654
No 70
>PTZ00395 Sec24-related protein; Provisional
Probab=70.08 E-value=34 Score=43.40 Aligned_cols=160 Identities=13% Similarity=0.135 Sum_probs=84.0
Q ss_pred eEEEEEecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCC------Cc---------------
Q 007491 5 QLIVAVEGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHG------SY--------------- 63 (601)
Q Consensus 5 DIVFVIDgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~------P~--------------- 63 (601)
-.|||||.+. ..+++.++.-++|.+.. .|+.-. ..++++|+++|-..= +-
T Consensus 954 ~YvFLIDVS~------~AVkSGLl~tacesIK~-sLDsL~--dpRTRVGIITFDSsLHFYNLks~l~~~~~~~~~~~~l~ 1024 (1560)
T PTZ00395 954 YFVFVVECSY------NAIYNNITYTILEGIRY-AVQNVK--CPQTKIAIITFNSSIYFYHCKGGKGVSGEEGDGGGGSG 1024 (1560)
T ss_pred EEEEEEECCH------HHHhhChHHHHHHHHHH-HHhcCC--CCCcEEEEEEecCcEEEEecCcccccccccccccccCC
Confidence 4799999985 56666655555555532 221111 257899999885421 10
Q ss_pred -cccce-----eec-c---------CCcCHHHHHHHhcCcccCC--CCCchhhHHHHHHHHHhccccCCCCCCCCCCCCC
Q 007491 64 -CACLV-----QRS-G---------WTKDVDIFLHWLSTIPFAG--GGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDG 125 (601)
Q Consensus 64 -~e~~V-----qr~-G---------fTsD~~~FlqwLdsI~FsG--GGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~ 125 (601)
+.-+| +.| + ...+-+.+...||.|+-.. -...+.|+.-||.+|+.++.... -.
T Consensus 1025 qPQMLVVSDLDDPFLPlP~ddLLVnL~ESRevIe~LLDkLPemFt~t~~~esCLGSALqAA~~aLk~~G---------GG 1095 (1560)
T PTZ00395 1025 NHQVIVMSDVDDPFLPLPLEDLFFGCVEEIDKINTLIDTIKSVSTTMQSYGSCGNSALKIAMDMLKERN---------GL 1095 (1560)
T ss_pred CceEEeecCCccCcCCCCccCeeechHHHHHHHHHHHHHHHHHhhccCCCcccHHHHHHHHHHHHHhcC---------CC
Confidence 00011 111 1 1234455666777775322 23457889999999999886321 12
Q ss_pred ccEEEEeecCCCCCCCCCccccccccccccch-hhh-hhcccCCCHHHHHHHhhhcCcEEEEe-CCCc
Q 007491 126 QRHCILVAASNPHPLPTPVYRPQMQNLDQNEN-NEA-QAESRLSDAETVAKSFVQCSVSLSVI-CPKQ 190 (601)
Q Consensus 126 qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~-~~~-~~~s~laDaEqVAk~faEk~IsLSII-sPRk 190 (601)
-|.|++.+.-|=+....--.+.. . ..|. ... ..++ ...+|.+|.+.+|.|.+. ++.+
T Consensus 1096 GKIiVF~SSLPniGpGaLK~Re~--~--~KEk~Ll~pqd~F----YK~LA~ECsk~qISVDLFLfSsq 1155 (1560)
T PTZ00395 1096 GSICMFYTTTPNCGIGAIKELKK--D--LQENFLEVKQKIF----YDSLLLDLYAFNISVDIFIISSN 1155 (1560)
T ss_pred ceEEEEEcCCCCCCCCccccccc--c--cccccccccchHH----HHHHHHHHHhcCCceEEEEccCc
Confidence 36666666555232222100100 0 0111 000 1111 268999999999988776 4543
No 71
>PLN00162 transport protein sec23; Provisional
Probab=69.64 E-value=72 Score=37.96 Aligned_cols=38 Identities=18% Similarity=0.128 Sum_probs=30.6
Q ss_pred HHHHHHHhhhcCcEEEEe--CCCc--chHHHHHHHHhcCCCC
Q 007491 169 AETVAKSFVQCSVSLSVI--CPKQ--LPKLTAIYNAAKRNPR 206 (601)
Q Consensus 169 aEqVAk~faEk~IsLSII--sPRk--lP~Lk~Lfeka~~n~~ 206 (601)
.+.+|+.+.+.+|.+.+. +-.+ +..++.|.++.++...
T Consensus 331 Y~~la~~~~~~gisvDlF~~s~dqvglaem~~l~~~TGG~v~ 372 (761)
T PLN00162 331 YEGLAKQLVAQGHVLDVFACSLDQVGVAEMKVAVERTGGLVV 372 (761)
T ss_pred HHHHHHHHHHcCceEEEEEccccccCHHHHhhhHhhcCcEEE
Confidence 478999999999998888 3443 7788999988887753
No 72
>TIGR00670 asp_carb_tr aspartate carbamoyltransferase. Ornithine carbamoyltransferases are in the same superfamily and form an outgroup.
Probab=61.78 E-value=35 Score=36.10 Aligned_cols=93 Identities=15% Similarity=0.215 Sum_probs=66.0
Q ss_pred CccccceeeccCCcCHHHHHHHhcCcccCCCCCc-hhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCC
Q 007491 62 SYCACLVQRSGWTKDVDIFLHWLSTIPFAGGGFN-DAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPL 140 (601)
Q Consensus 62 P~~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfs-paAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhpl 140 (601)
.|.+.+|-|+.....+.+|.+| ..++++=+|++ .++=.++|+-.+++.+.... ...+++.+|+|--+.
T Consensus 94 ~y~D~iviR~~~~~~~~~~a~~-s~vPVINa~~g~~~HPtQ~LaDl~Ti~e~~g~--------l~g~~va~vGD~~~~-- 162 (301)
T TIGR00670 94 GYSDAIVIRHPLEGAARLAAEV-SEVPVINAGDGSNQHPTQTLLDLYTIYEEFGR--------LDGLKIALVGDLKYG-- 162 (301)
T ss_pred HhCCEEEEECCchhHHHHHHhh-CCCCEEeCCCCCCCCcHHHHHHHHHHHHHhCC--------CCCCEEEEEccCCCC--
Confidence 4666777888778888888888 47888888874 68888999998887754321 122444445432111
Q ss_pred CCCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcc
Q 007491 141 PTPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQL 191 (601)
Q Consensus 141 P~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRkl 191 (601)
..+-.++.+++..|+++.+++|+.+
T Consensus 163 --------------------------~v~~Sl~~~~a~~g~~v~~~~P~~~ 187 (301)
T TIGR00670 163 --------------------------RTVHSLAEALTRFGVEVYLISPEEL 187 (301)
T ss_pred --------------------------cHHHHHHHHHHHcCCEEEEECCccc
Confidence 1456778888999999999999986
No 73
>cd01459 vWA_copine_like VWA Copine: Copines are phospholipid-binding proteins originally identified in paramecium. They are found in human and orthologues have been found in C. elegans and Arabidopsis Thaliana. None have been found in D. Melanogaster or S. Cereviciae. Phylogenetic distribution suggests that copines have been lost in some eukaryotes. No functional properties have been assigned to the VWA domains present in copines. The members of this subgroup contain a functional MIDAS motif based on their preferential binding to magnesium and manganese. However, the MIDAS motif is not totally conserved, in most cases the MIDAS consists of the sequence DxTxS instead of the motif DxSxS that is found in most cases. The C2 domains present in copines mediate phospholipid binding.
Probab=60.76 E-value=89 Score=32.56 Aligned_cols=91 Identities=18% Similarity=0.270 Sum_probs=48.3
Q ss_pred cceEEEEEecccccc-----------------hhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCC---
Q 007491 3 EKQLIVAVEGTAAMG-----------------PYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGS--- 62 (601)
Q Consensus 3 ~~DIVFVIDgTAsMG-----------------pYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P--- 62 (601)
|..++|.||-|+++| +|-..|+. |.++|+.|.. .-+|.+--|.+-.+
T Consensus 31 ~~nl~vaIDfT~SNg~p~~~~SLHy~~~~~~N~Yq~aI~~--vg~il~~yD~-----------D~~ip~~GFGa~~~~~~ 97 (254)
T cd01459 31 ESNLIVAIDFTKSNGWPGEKRSLHYISPGRLNPYQKAIRI--VGEVLQPYDS-----------DKLIPAFGFGAIVTKDQ 97 (254)
T ss_pred eeeEEEEEEeCCCCCCCCCCCCcccCCCCCccHHHHHHHH--HHHHHHhcCC-----------CCceeeEeecccCCCCC
Confidence 568999999999975 33333332 3566777743 22444444544221
Q ss_pred -ccccce------eeccCCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhcc
Q 007491 63 -YCACLV------QRSGWTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMF 110 (601)
Q Consensus 63 -~~e~~V------qr~GfTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~f 110 (601)
...++- .|.|...=++.....|..++++|+- ..+.-+.+|.++.
T Consensus 98 ~v~~~f~~~~~~p~~~Gi~gvl~aY~~~l~~v~lsGpT----~fapvI~~a~~~a 148 (254)
T cd01459 98 SVFSFFPGYSESPECQGFEGVLRAYREALPNVSLSGPT----NFAPVIRAAANIA 148 (254)
T ss_pred ccccccCCCCCCCcccCHHHHHHHHHHHhceeeecCcc----hHHHHHHHHHHHH
Confidence 111110 1223333344555678889998643 3334455555544
No 74
>KOG1781 consensus Small Nuclear ribonucleoprotein splicing factor [RNA processing and modification]
Probab=60.36 E-value=13 Score=34.22 Aligned_cols=49 Identities=14% Similarity=0.200 Sum_probs=36.1
Q ss_pred ecccccchhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCC
Q 007491 11 EGTAAMGPYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHG 61 (601)
Q Consensus 11 DgTAsMGpYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~ 61 (601)
|+|+.++.| +.|.+..++.++||+++.+- ..+....+-++||||+|.-.
T Consensus 39 ~~sGiLkGy-DqLlNlVLDd~vEylrdpdd-~~~~~~~tR~LGLvV~RGTa 87 (108)
T KOG1781|consen 39 EASGILKGY-DQLLNLVLDDTVEYLRDPDD-PYKLTDETRKLGLVVCRGTA 87 (108)
T ss_pred eeeeehhhH-HHHHHHHHHHHHHHhcCCCC-ccchhhhhheeeeEEEcccE
Confidence 568888887 88888999999999976322 22223355679999998643
No 75
>PRK02102 ornithine carbamoyltransferase; Validated
Probab=56.35 E-value=44 Score=35.96 Aligned_cols=93 Identities=12% Similarity=0.069 Sum_probs=65.9
Q ss_pred CccccceeeccCCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCC
Q 007491 62 SYCACLVQRSGWTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLP 141 (601)
Q Consensus 62 P~~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP 141 (601)
.|.+++|-|+.-...+.+|-.+. .++.+=+|+..++=..+|+-.+++.++... . ..+++.+|+|-- +
T Consensus 100 ~y~D~iviR~~~~~~~~~~a~~~-~vPVINa~~~~~HPtQaLaDl~Ti~e~~g~-------l-~g~~va~vGd~~-~--- 166 (331)
T PRK02102 100 RMYDGIEYRGFKQEIVEELAKYS-GVPVWNGLTDEWHPTQMLADFMTMKEHFGP-------L-KGLKLAYVGDGR-N--- 166 (331)
T ss_pred hcCCEEEEECCchHHHHHHHHhC-CCCEEECCCCCCChHHHHHHHHHHHHHhCC-------C-CCCEEEEECCCc-c---
Confidence 67777888887777888888775 577776676678888999999988765321 1 224444444321 1
Q ss_pred CCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcch
Q 007491 142 TPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQLP 192 (601)
Q Consensus 142 ~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRklP 192 (601)
..+..++.++...|++|.+++|+.+-
T Consensus 167 -------------------------~v~~Sl~~~~~~~g~~v~~~~P~~~~ 192 (331)
T PRK02102 167 -------------------------NMANSLMVGGAKLGMDVRICAPKELW 192 (331)
T ss_pred -------------------------cHHHHHHHHHHHcCCEEEEECCcccc
Confidence 14567788888999999999999853
No 76
>PRK13814 pyrB aspartate carbamoyltransferase catalytic subunit; Provisional
Probab=50.11 E-value=54 Score=34.97 Aligned_cols=92 Identities=12% Similarity=0.155 Sum_probs=62.3
Q ss_pred cccceeeccCCcCHHHHHHHhcCcccCCCCCc-hhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCC
Q 007491 64 CACLVQRSGWTKDVDIFLHWLSTIPFAGGGFN-DAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPT 142 (601)
Q Consensus 64 ~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfs-paAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~ 142 (601)
.+.+|-|+.-...+.+|..|..+++++=+|++ .++=..+|+-.+++.+... . . ...++..|+|--+.
T Consensus 102 ~D~iv~R~~~~~~~~~~a~~~~~vPvINag~g~~~HPtQaLaDl~Ti~e~~g--~-----l-~g~~va~vGD~~~~---- 169 (310)
T PRK13814 102 VYFFIVRHSENETPEQIAKQLSSGVVINAGDGNHQHPSQALIDLMTIKQHKP--H-----W-NKLCVTIIGDIRHS---- 169 (310)
T ss_pred CCEEEEeCCchhHHHHHHHhCCCCCeEECCcCCCCCchHHHHHHHHHHHHhC--C-----c-CCcEEEEECCCCCC----
Confidence 35555666556677788888778888888765 5778888999888876532 1 1 12344444332211
Q ss_pred CccccccccccccchhhhhhcccCCCHHHHHHHhhhcCc-EEEEeCCCcc
Q 007491 143 PVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSV-SLSVICPKQL 191 (601)
Q Consensus 143 pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~I-sLSIIsPRkl 191 (601)
..+-.++.+++..|+ ++.+++|+.+
T Consensus 170 ------------------------rv~~Sl~~~~a~~g~~~v~~~~P~~~ 195 (310)
T PRK13814 170 ------------------------RVANSLMDGLVTMGVPEIRLVGPSSL 195 (310)
T ss_pred ------------------------cHHHHHHHHHHHcCCCEEEEeCCccc
Confidence 146778888999998 9999999874
No 77
>PRK14804 ornithine carbamoyltransferase; Provisional
Probab=49.60 E-value=94 Score=33.07 Aligned_cols=103 Identities=12% Similarity=0.154 Sum_probs=65.5
Q ss_pred CccccceeeccCCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCC
Q 007491 62 SYCACLVQRSGWTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLP 141 (601)
Q Consensus 62 P~~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP 141 (601)
.|.+.+|-|+.-..++.+|.+|. .++.+=+|+..++=..+|+-.+++.+..... .. ..+++..|++ |.
T Consensus 96 ~~~D~iv~R~~~~~~~~~~a~~~-~vPVINag~~~~HPtQaL~Dl~Ti~e~~g~~-----~l-~g~~va~vGd---~~-- 163 (311)
T PRK14804 96 RNVSVIMARLKKHEDLLVMKNGS-QVPVINGCDNMFHPCQSLADIMTIALDSPEI-----PL-NQKQLTYIGV---HN-- 163 (311)
T ss_pred hcCCEEEEeCCChHHHHHHHHHC-CCCEEECCCCCCChHHHHHHHHHHHHHhCCC-----CC-CCCEEEEECC---CC--
Confidence 46667777776666777777664 5777666665688888999888887643200 01 1233334443 21
Q ss_pred CCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcchH--HHHHHHHh
Q 007491 142 TPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQLPK--LTAIYNAA 201 (601)
Q Consensus 142 ~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRklP~--Lk~Lfeka 201 (601)
. .+-.++.+++..|+.|.+++|+.++. +...++.+
T Consensus 164 r-------------------------v~~Sl~~~~~~~G~~v~~~~P~~~~~~~~~~~~~~~ 200 (311)
T PRK14804 164 N-------------------------VVNSLIGITAALGIHLTLVTPIAAKENIHAQTVERA 200 (311)
T ss_pred c-------------------------HHHHHHHHHHHcCCEEEEECCCCccHHHHHHHHHHH
Confidence 1 35667778888999999999999752 34444433
No 78
>PRK00779 ornithine carbamoyltransferase; Provisional
Probab=49.27 E-value=72 Score=33.78 Aligned_cols=92 Identities=14% Similarity=0.164 Sum_probs=62.1
Q ss_pred CccccceeeccCCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCC
Q 007491 62 SYCACLVQRSGWTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLP 141 (601)
Q Consensus 62 P~~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP 141 (601)
+|.+.+|-|+.--..+.+|.++. .++++=+|+..++=.++|+-.+.+.+.... . ..+++..|+| +.
T Consensus 97 ~~~D~iv~R~~~~~~~~~~a~~~-~vPVINag~~~~HPtQaL~Dl~Ti~e~~g~-------l-~gl~i~~vGd---~~-- 162 (304)
T PRK00779 97 RYVDAIMIRTFEHETLEELAEYS-TVPVINGLTDLSHPCQILADLLTIYEHRGS-------L-KGLKVAWVGD---GN-- 162 (304)
T ss_pred HhCCEEEEcCCChhHHHHHHHhC-CCCEEeCCCCCCChHHHHHHHHHHHHHhCC-------c-CCcEEEEEeC---CC--
Confidence 45666777776666777777775 466665555678888999998888765321 1 2244455555 11
Q ss_pred CCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcch
Q 007491 142 TPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQLP 192 (601)
Q Consensus 142 ~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRklP 192 (601)
. .+-.+++++...|+.+.+++|+.+-
T Consensus 163 ~-------------------------v~~Sl~~~l~~~g~~v~~~~P~~~~ 188 (304)
T PRK00779 163 N-------------------------VANSLLLAAALLGFDLRVATPKGYE 188 (304)
T ss_pred c-------------------------cHHHHHHHHHHcCCEEEEECCcccC
Confidence 1 3466778888899999999999854
No 79
>PRK00856 pyrB aspartate carbamoyltransferase catalytic subunit; Provisional
Probab=49.20 E-value=61 Score=34.41 Aligned_cols=93 Identities=13% Similarity=0.147 Sum_probs=65.2
Q ss_pred Cc-cccceeeccCCcCHHHHHHHhcCcccCCCCCc-hhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCC
Q 007491 62 SY-CACLVQRSGWTKDVDIFLHWLSTIPFAGGGFN-DAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHP 139 (601)
Q Consensus 62 P~-~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfs-paAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhp 139 (601)
.| .+++|-|+.--..+.+|.+| .+++++=+|++ .++=..+|+-.+++.+... . ...+++.+|+|-...
T Consensus 99 ~y~~D~iv~R~~~~~~~~~~a~~-~~vPVINa~~g~~~HPtQ~LaDl~Ti~e~~G--~------l~g~kv~~vGD~~~~- 168 (305)
T PRK00856 99 AMGADAIVIRHPQSGAARLLAES-SDVPVINAGDGSHQHPTQALLDLLTIREEFG--R------LEGLKVAIVGDIKHS- 168 (305)
T ss_pred hcCCCEEEEeCCChHHHHHHHHH-CCCCEEECCCCCCCCcHHHHHHHHHHHHHhC--C------CCCCEEEEECCCCCC-
Confidence 56 77777887766777888777 67888888874 6788888998888876432 1 122455555432111
Q ss_pred CCCCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcc
Q 007491 140 LPTPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQL 191 (601)
Q Consensus 140 lP~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRkl 191 (601)
..+-.++.+++..|+.|.+++|+.+
T Consensus 169 ---------------------------~v~~Sl~~~~~~~g~~~~~~~P~~~ 193 (305)
T PRK00856 169 ---------------------------RVARSNIQALTRLGAEVRLIAPPTL 193 (305)
T ss_pred ---------------------------cHHHHHHHHHHHcCCEEEEECCccc
Confidence 1456778888889999999999975
No 80
>PLN02342 ornithine carbamoyltransferase
Probab=47.41 E-value=69 Score=34.84 Aligned_cols=101 Identities=15% Similarity=0.173 Sum_probs=66.6
Q ss_pred CccccceeeccCCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCC
Q 007491 62 SYCACLVQRSGWTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLP 141 (601)
Q Consensus 62 P~~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP 141 (601)
.|.+++|-|+.-...+.+|-+|. .++++=+|+..++=..+|+..+++.+... ..+. +++.+|+|-+
T Consensus 139 ~y~D~IviR~~~~~~~~~la~~~-~vPVINA~~~~~HPtQaLaDl~Ti~e~~G-------~l~g-lkva~vGD~~----- 204 (348)
T PLN02342 139 RYNDIIMARVFAHQDVLDLAEYS-SVPVINGLTDYNHPCQIMADALTIIEHIG-------RLEG-TKVVYVGDGN----- 204 (348)
T ss_pred HhCCEEEEeCCChHHHHHHHHhC-CCCEEECCCCCCChHHHHHHHHHHHHHhC-------CcCC-CEEEEECCCc-----
Confidence 46666777776666777777775 57766666556788889998888876532 1122 3344444411
Q ss_pred CCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcchHHHHHHHHh
Q 007491 142 TPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQLPKLTAIYNAA 201 (601)
Q Consensus 142 ~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRklP~Lk~Lfeka 201 (601)
..+-.++.++...|+++.+++|+.+..-..+.+++
T Consensus 205 -------------------------nva~Sli~~~~~~G~~v~~~~P~~~~~~~~~~~~a 239 (348)
T PLN02342 205 -------------------------NIVHSWLLLAAVLPFHFVCACPKGYEPDAKTVEKA 239 (348)
T ss_pred -------------------------hhHHHHHHHHHHcCCEEEEECCcccccCHHHHHHH
Confidence 13456778888899999999999975445555444
No 81
>PRK14805 ornithine carbamoyltransferase; Provisional
Probab=46.94 E-value=83 Score=33.36 Aligned_cols=97 Identities=13% Similarity=0.145 Sum_probs=64.9
Q ss_pred CccccceeeccCCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCC
Q 007491 62 SYCACLVQRSGWTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLP 141 (601)
Q Consensus 62 P~~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP 141 (601)
.|.+++|-|+.-...+.+|-+|. .++.+=+|+..++=..+|+-.+++.+.... . ..+++..|+|.+
T Consensus 92 ~y~D~iviR~~~~~~~~~~a~~~-~vPVINa~~~~~HPtQaL~Dl~Ti~e~~g~-------l-~g~kva~vGD~~----- 157 (302)
T PRK14805 92 CWADAIVARVFSHSTIEQLAEHG-SVPVINALCDLYHPCQALADFLTLAEQFGD-------V-SKVKLAYVGDGN----- 157 (302)
T ss_pred HhCCEEEEeCCChhHHHHHHHhC-CCCEEECCCCCCChHHHHHHHHHHHHHhCC-------c-CCcEEEEEcCCC-----
Confidence 46666777777777788888875 577776776668888899988888754321 1 224444455410
Q ss_pred CCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcchHHHHH
Q 007491 142 TPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQLPKLTAI 197 (601)
Q Consensus 142 ~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRklP~Lk~L 197 (601)
. .+-.++.+++..|+.+.+++|+.+..-..+
T Consensus 158 ~-------------------------v~~S~~~~~~~~g~~v~~~~P~~~~~~~~~ 188 (302)
T PRK14805 158 N-------------------------VTHSLMYGAAILGATMTVICPPGHFPDGQI 188 (302)
T ss_pred c-------------------------cHHHHHHHHHHcCCEEEEECCchhcCCHHH
Confidence 0 245667788888999999999986433333
No 82
>COG1721 Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]
Probab=45.37 E-value=2e+02 Score=31.31 Aligned_cols=74 Identities=8% Similarity=0.128 Sum_probs=51.1
Q ss_pred cceEEEEEecccccc------hhHHHHHHHHHHHHHHHhhcccccCccCCCCcceEEEEEeecCCCccccceeeccCCcC
Q 007491 3 EKQLIVAVEGTAAMG------PYWQSIVSDYLEKIIRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQRSGWTKD 76 (601)
Q Consensus 3 ~~DIVFVIDgTAsMG------pYi~~LktnYI~PiLEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr~GfTsD 76 (601)
+.+|++++|+..+|. ..|+....-- .-+..+... .+.++|+.+|.+.. ....+..+.
T Consensus 224 ~~~v~l~lD~~~~m~~~~~~~~~~e~av~~a-~~la~~~l~----------~gd~vg~~~~~~~~------~~~~~p~~G 286 (416)
T COG1721 224 GRTVVLVLDASRSMLFGSGVASKFEEAVRAA-ASLAYAALK----------NGDRVGLLIFGGGG------PKWIPPSRG 286 (416)
T ss_pred CceEEEEEeCCccccCCCCCccHHHHHHHHH-HHHHHHHHh----------CCCeeEEEEECCCc------ceeeCCCcc
Confidence 589999999999998 5777666442 333333322 45799999998765 134566677
Q ss_pred HHHHHHHhcCcccCCCC
Q 007491 77 VDIFLHWLSTIPFAGGG 93 (601)
Q Consensus 77 ~~~FlqwLdsI~FsGGG 93 (601)
-+.|.+.|+.+......
T Consensus 287 ~~~l~~~l~~l~~~~~~ 303 (416)
T COG1721 287 RRHLARILKALALLRPA 303 (416)
T ss_pred hHHHHHHHHHhhccCCC
Confidence 77888887777766654
No 83
>PRK04284 ornithine carbamoyltransferase; Provisional
Probab=45.34 E-value=90 Score=33.58 Aligned_cols=100 Identities=12% Similarity=0.140 Sum_probs=63.1
Q ss_pred CccccceeeccCCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCC
Q 007491 62 SYCACLVQRSGWTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLP 141 (601)
Q Consensus 62 P~~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP 141 (601)
.|.+.+|-|+.-...+.+|-++. .++.+=+|...++=..+|+-.+++.+.+. + . ...+++.+|+|- -+.
T Consensus 99 ~y~D~iviR~~~~~~~~~~a~~s-~vPVINa~~~~~HPtQaL~Dl~Ti~e~~~-g-----~-l~g~kia~vGD~-~~~-- 167 (332)
T PRK04284 99 GMYDGIEYRGFSQRTVETLAEYS-GVPVWNGLTDEDHPTQVLADFLTAKEHLK-K-----P-YKDIKFTYVGDG-RNN-- 167 (332)
T ss_pred HhCCEEEEecCchHHHHHHHHhC-CCCEEECCCCCCChHHHHHHHHHHHHHhc-C-----C-cCCcEEEEecCC-Ccc--
Confidence 45566677766667778887775 46665555556777888888888776411 1 1 123445555542 011
Q ss_pred CCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcchHHHHHH
Q 007491 142 TPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQLPKLTAIY 198 (601)
Q Consensus 142 ~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRklP~Lk~Lf 198 (601)
.+-.++.++...|+.|.+++|+.+..-..+.
T Consensus 168 --------------------------v~~Sl~~~~~~~g~~v~~~~P~~~~~~~~~~ 198 (332)
T PRK04284 168 --------------------------VANALMQGAAIMGMDFHLVCPKELNPDDELL 198 (332)
T ss_pred --------------------------hHHHHHHHHHHcCCEEEEECCccccCCHHHH
Confidence 3456677788889999999999754333333
No 84
>TIGR01651 CobT cobaltochelatase, CobT subunit. This model describes the aerobic cobalamin pathway Pseudomonas denitrificans CobT gene product, which is a cobalt chelatase subunit, with a MW ~70 kDa. The aerobic pathway cobalt chelatase is a heterotrimeric, ATP-dependent enzyme that catalyzes cobalt insertion during cobalamin biosynthesis. The other two subunits are the P. denitrificans CobS (TIGR01650) and CobN (pfam02514 CobN/Magnesium Chelatase) proteins. To avoid potential confusion with the nonhomologous Salmonella typhimurium/E.coli cobT gene product, the P. denitrificans gene symbol is not used in the name of this model.
Probab=43.67 E-value=2.6e+02 Score=32.95 Aligned_cols=64 Identities=19% Similarity=0.160 Sum_probs=37.9
Q ss_pred HHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccccccchhhhhhcccCCCHHHHHHHhhhc
Q 007491 100 AEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQC 179 (601)
Q Consensus 100 AEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk 179 (601)
.|||.-|...+-- ....+|.+|+|+|--|..-.+ +.-|.. .... ++.-+|.+....+
T Consensus 499 GeAl~wa~~rL~~---------R~e~rKiL~ViSDG~P~D~~T---------lsvN~~---~~l~--~hLr~vi~~~e~~ 555 (600)
T TIGR01651 499 GEALMWAHQRLIA---------RPEQRRILMMISDGAPVDDST---------LSVNPG---NYLE--RHLRAVIEEIETR 555 (600)
T ss_pred hHHHHHHHHHHhc---------CcccceEEEEEeCCCcCCccc---------cccCch---hHHH--HHHHHHHHHHhcc
Confidence 3666666654421 234678889999999987554 111111 0101 1345677777775
Q ss_pred -CcEEEEe
Q 007491 180 -SVSLSVI 186 (601)
Q Consensus 180 -~IsLSII 186 (601)
+|.|-.|
T Consensus 556 ~~vel~ai 563 (600)
T TIGR01651 556 SPVELLAI 563 (600)
T ss_pred CCceEEEe
Confidence 7998877
No 85
>TIGR00658 orni_carb_tr ornithine carbamoyltransferase. Most OTCases are homotrimers, but the homotrimers are organized into dodecamers built from four trimers in at least two species; the catabolic OTCase of Pseudomonas aeruginosa is allosterically regulated, while OTCase of the extreme thermophile Pyrococcus furiosus shows both allostery and thermophily.
Probab=43.05 E-value=96 Score=32.81 Aligned_cols=92 Identities=13% Similarity=0.178 Sum_probs=60.9
Q ss_pred CccccceeeccCCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCC
Q 007491 62 SYCACLVQRSGWTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLP 141 (601)
Q Consensus 62 P~~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP 141 (601)
.|.+.+|-|+.-...+++|.++.+ ++++=+|...++=..+|+-.+++.++... . ...++..|+|- .
T Consensus 93 ~y~D~iv~R~~~~~~~~~~a~~~~-vPVINa~~~~~HPtQaL~Dl~Ti~e~~g~-------l-~g~~v~~vGd~---~-- 158 (304)
T TIGR00658 93 RYVDGIMARVYKHEDVEELAKYAS-VPVINGLTDLFHPCQALADLLTIIEHFGK-------L-KGVKVVYVGDG---N-- 158 (304)
T ss_pred HhCCEEEEECCChHHHHHHHHhCC-CCEEECCCCCCChHHHHHHHHHHHHHhCC-------C-CCcEEEEEeCC---C--
Confidence 455667777766667788877764 55554455568888889888887754321 1 22344444442 0
Q ss_pred CCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcch
Q 007491 142 TPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQLP 192 (601)
Q Consensus 142 ~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRklP 192 (601)
..+-.++.++...|+++.+++|+.+-
T Consensus 159 -------------------------~v~~Sl~~~l~~~g~~v~~~~P~~~~ 184 (304)
T TIGR00658 159 -------------------------NVCNSLMLAGAKLGMDVVVATPEGYE 184 (304)
T ss_pred -------------------------chHHHHHHHHHHcCCEEEEECCchhc
Confidence 13567778888999999999999854
No 86
>PRK01713 ornithine carbamoyltransferase; Provisional
Probab=41.16 E-value=1e+02 Score=33.15 Aligned_cols=94 Identities=11% Similarity=0.133 Sum_probs=61.2
Q ss_pred CccccceeeccCCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCC
Q 007491 62 SYCACLVQRSGWTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLP 141 (601)
Q Consensus 62 P~~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP 141 (601)
.|.+.+|-|+.-...+.+|-+|. .++.+=++...++=..+|+-.+++.++.. . .. ..+++.+|+|-. |.
T Consensus 100 ~y~D~iv~R~~~~~~~~~~a~~~-~vPVINa~~~~~HPtQaL~Dl~Ti~e~~g--~----~l-~gl~ia~vGD~~-~~-- 168 (334)
T PRK01713 100 RMYDAIEYRGFKQSIVNELAEYA-GVPVFNGLTDEFHPTQMLADVLTMIENCD--K----PL-SEISYVYIGDAR-NN-- 168 (334)
T ss_pred HhCCEEEEEcCchHHHHHHHHhC-CCCEEECCCCCCChHHHHHHHHHHHHHcC--C----Cc-CCcEEEEECCCc-cC--
Confidence 45566777776667777777775 46666555566777888888888876421 0 01 123444444421 21
Q ss_pred CCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcch
Q 007491 142 TPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQLP 192 (601)
Q Consensus 142 ~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRklP 192 (601)
.+-.++.+++..|+.|.+++|+.+-
T Consensus 169 --------------------------v~~Sl~~~~~~~g~~v~~~~P~~~~ 193 (334)
T PRK01713 169 --------------------------MGNSLLLIGAKLGMDVRICAPKALL 193 (334)
T ss_pred --------------------------HHHHHHHHHHHcCCEEEEECCchhc
Confidence 3456777888889999999999853
No 87
>TIGR03316 ygeW probable carbamoyltransferase YgeW. Members of this protein family include the ygeW gene product of Escherichia coli. The function is unknown. Members show homology to ornithine carbamoyltransferase (TIGR00658) and aspartate carbamoyltransferase (carbamoyltransferase), and therefore may belong to the carbamoyltransferases in function. Members often are found in a large, conserved genomic region associated with purine catabolism.
Probab=37.59 E-value=1.5e+02 Score=32.49 Aligned_cols=111 Identities=9% Similarity=0.100 Sum_probs=65.2
Q ss_pred CccccceeeccCC-----cCHHHHHHHh----------cCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCc
Q 007491 62 SYCACLVQRSGWT-----KDVDIFLHWL----------STIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQ 126 (601)
Q Consensus 62 P~~e~~Vqr~GfT-----sD~~~FlqwL----------dsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~q 126 (601)
.|.+.+|-|+.+. ..+.+|.++. .+++++=+|...++=..+|+-.+++.+.+... ...+ .
T Consensus 96 ~y~D~Iv~R~~~~~~~~~~~l~~~a~~~~~~~~~~~~~s~vPVINa~~~~~HPtQaLaDl~Ti~e~~G~~----~~l~-g 170 (357)
T TIGR03316 96 FFADGIGIRDDMYIGVGNAYMREVAKYVQEGYKDGVLEQRPPLVNLQCDIDHPTQAMADIMTLQEKFGGI----ENLK-G 170 (357)
T ss_pred HhCcEEEEeCCCccccccHHHHHHHHhhhhccccccccCCCCEEECCCCCCCchHHHHHHHHHHHHhCCc----cccC-C
Confidence 5667778887542 3346777885 57888777766678888899888887643210 0011 1
Q ss_pred cEEEEeec-CCCCCCCCCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcchHHHHHHHHh
Q 007491 127 RHCILVAA-SNPHPLPTPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQLPKLTAIYNAA 201 (601)
Q Consensus 127 KhCILI~n-SPPhplP~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRklP~Lk~Lfeka 201 (601)
+++.++.+ ..-|.... ..+-.++.+++..|+++.+++|+.+.....+.+++
T Consensus 171 ~kvai~~~~d~~~gr~~------------------------~v~~Sl~~~~~~~G~~v~~~~P~~~~~~~~~~~~a 222 (357)
T TIGR03316 171 KKFAMTWAYSPSYGKPL------------------------SVPQGIIGLMTRFGMDVTLAHPEGYHLLPEVIEVA 222 (357)
T ss_pred CEEEEEeccccccCccc------------------------hHHHHHHHHHHHcCCEEEEECCCcccCCHHHHHHH
Confidence 22222211 11122111 12345667788889999999999765555555543
No 88
>PRK03515 ornithine carbamoyltransferase subunit I; Provisional
Probab=36.69 E-value=72 Score=34.45 Aligned_cols=95 Identities=13% Similarity=0.120 Sum_probs=60.5
Q ss_pred CccccceeeccCCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCC
Q 007491 62 SYCACLVQRSGWTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLP 141 (601)
Q Consensus 62 P~~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP 141 (601)
.|.+.+|-|+.-...+.+|-.+. .++.+=++...++=..+|+..+++.+....+ ..+. .++-+|+|- .+.
T Consensus 99 ~y~D~Iv~R~~~~~~~~~~a~~~-~vPVINa~~~~~HPtQaLaDl~Ti~e~~g~~-----~l~g-~~ia~vGD~-~~~-- 168 (336)
T PRK03515 99 RMYDGIQYRGYGQEIVETLAEYA-GVPVWNGLTNEFHPTQLLADLLTMQEHLPGK-----AFNE-MTLAYAGDA-RNN-- 168 (336)
T ss_pred HhCcEEEEEeCChHHHHHHHHhC-CCCEEECCCCCCChHHHHHHHHHHHHHhCCC-----CcCC-CEEEEeCCC-cCc--
Confidence 45666777776677788887775 4666555555677788898888886542100 0111 233334431 111
Q ss_pred CCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcch
Q 007491 142 TPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQLP 192 (601)
Q Consensus 142 ~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRklP 192 (601)
.+-.++.+++..|++|.+++|+.+-
T Consensus 169 --------------------------v~~Sl~~~~~~~g~~v~~~~P~~~~ 193 (336)
T PRK03515 169 --------------------------MGNSLLEAAALTGLDLRLVAPKACW 193 (336)
T ss_pred --------------------------HHHHHHHHHHHcCCEEEEECCchhc
Confidence 3456777788889999999999853
No 89
>PLN02527 aspartate carbamoyltransferase
Probab=35.82 E-value=1.7e+02 Score=31.17 Aligned_cols=93 Identities=13% Similarity=0.108 Sum_probs=61.5
Q ss_pred CccccceeeccCCcCHHHHHHHhcCcccCCCCCc-hhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCC
Q 007491 62 SYCACLVQRSGWTKDVDIFLHWLSTIPFAGGGFN-DAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPL 140 (601)
Q Consensus 62 P~~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfs-paAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhpl 140 (601)
.|.+++|-|+.-...+.+|.+|. .++++=+|++ .++=.++|+-.+++.+.+.+ . ..+++.+|+|-- |.
T Consensus 95 ~y~D~iviR~~~~~~~~~~a~~~-~vPVINa~~g~~~HPtQ~LaDl~Ti~e~~g~-------l-~g~kva~vGD~~-~~- 163 (306)
T PLN02527 95 GYSDIIVLRHFESGAARRAAATA-EIPVINAGDGPGQHPTQALLDVYTIQREIGR-------L-DGIKVGLVGDLA-NG- 163 (306)
T ss_pred HhCcEEEEECCChhHHHHHHHhC-CCCEEECCCCCCCChHHHHHHHHHHHHHhCC-------c-CCCEEEEECCCC-CC-
Confidence 45566777776666788887774 6888888874 68888899988888764321 1 223444444321 10
Q ss_pred CCCccccccccccccchhhhhhcccCCCHHHHHHHhhhc-CcEEEEeCCCcc
Q 007491 141 PTPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQC-SVSLSVICPKQL 191 (601)
Q Consensus 141 P~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk-~IsLSIIsPRkl 191 (601)
..+-.++.++... |+.|.++||+.+
T Consensus 164 --------------------------rv~~Sl~~~~~~~~g~~v~~~~P~~~ 189 (306)
T PLN02527 164 --------------------------RTVRSLAYLLAKYEDVKIYFVAPDVV 189 (306)
T ss_pred --------------------------hhHHHHHHHHHhcCCCEEEEECCCcc
Confidence 1345667777776 899999999986
No 90
>PRK12562 ornithine carbamoyltransferase subunit F; Provisional
Probab=34.39 E-value=88 Score=33.82 Aligned_cols=95 Identities=12% Similarity=0.146 Sum_probs=60.8
Q ss_pred CccccceeeccCCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCC
Q 007491 62 SYCACLVQRSGWTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLP 141 (601)
Q Consensus 62 P~~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP 141 (601)
.|.+.+|-|+.-...+.+|-++. .++.+=+|...++=..+|+..+++.+....+ .. ..+++..|+|-. +.
T Consensus 99 ~y~D~iviR~~~~~~~~~~a~~~-~vPVINa~~~~~HPtQaLaDl~Ti~e~~g~~-----~l-~gl~va~vGD~~-~~-- 168 (334)
T PRK12562 99 RMYDGIQYRGHGQEVVETLAEYA-GVPVWNGLTNEFHPTQLLADLLTMQEHLPGK-----AF-NEMTLVYAGDAR-NN-- 168 (334)
T ss_pred HhCCEEEEECCchHHHHHHHHhC-CCCEEECCCCCCChHHHHHHHHHHHHHhCCC-----Cc-CCcEEEEECCCC-CC--
Confidence 45566777766666777777775 4666655555688889999999887653100 01 123334444321 11
Q ss_pred CCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcch
Q 007491 142 TPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQLP 192 (601)
Q Consensus 142 ~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRklP 192 (601)
.+-.++.+++..|++|.+++|+.+.
T Consensus 169 --------------------------v~~S~~~~~~~~G~~v~~~~P~~~~ 193 (334)
T PRK12562 169 --------------------------MGNSMLEAAALTGLDLRLVAPQACW 193 (334)
T ss_pred --------------------------HHHHHHHHHHHcCCEEEEECCcccC
Confidence 3456777788899999999999853
No 91
>PRK02255 putrescine carbamoyltransferase; Provisional
Probab=33.53 E-value=97 Score=33.47 Aligned_cols=94 Identities=13% Similarity=0.211 Sum_probs=61.6
Q ss_pred CccccceeeccCCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCC
Q 007491 62 SYCACLVQRSGWTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLP 141 (601)
Q Consensus 62 P~~e~~Vqr~GfTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP 141 (601)
.|.+.++-|+.-..++.+|.++ .+++++=||...++=..+|+-.+++.++...+. .. ..+++.+|+|-+
T Consensus 96 ~y~D~iviR~~~~~~~~~~a~~-~~vPVINa~~~~~HPtQaLaDl~Ti~e~~g~g~----~l-~glkv~~vGD~~----- 164 (338)
T PRK02255 96 RLVDIIMARVDRHQTVVELAKY-ATVPVINGMSDYNHPTQELGDLFTMIEHLPEGK----KL-EDCKVVFVGDAT----- 164 (338)
T ss_pred HhCcEEEEecCChHHHHHHHHh-CCCCEEECCCCCCChHHHHHHHHHHHHHhCCCC----CC-CCCEEEEECCCc-----
Confidence 4666676676556677887776 457777666666788889998888766532110 11 224445554410
Q ss_pred CCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcc
Q 007491 142 TPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQL 191 (601)
Q Consensus 142 ~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRkl 191 (601)
..+-.++.++...|+.+.+++|+.+
T Consensus 165 -------------------------~v~~Sl~~~~~~~g~~v~~~~P~~~ 189 (338)
T PRK02255 165 -------------------------QVCVSLMFIATKMGMDFVHFGPKGY 189 (338)
T ss_pred -------------------------hHHHHHHHHHHhCCCEEEEECCCcc
Confidence 1345677888889999999999975
No 92
>PF03850 Tfb4: Transcription factor Tfb4; InterPro: IPR004600 Members of this family are part of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. The core-TFIIH basal transcription factor complex has six subunits, this is the p34 subunit.; GO: 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent, 0000439 core TFIIH complex
Probab=31.24 E-value=6.6e+02 Score=26.61 Aligned_cols=83 Identities=16% Similarity=0.079 Sum_probs=50.5
Q ss_pred hhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEE-eecCCCCCCCCCccccccccccccchhhhhhcccCCCHHHHHHH
Q 007491 97 AAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCIL-VAASNPHPLPTPVYRPQMQNLDQNENNEAQAESRLSDAETVAKS 175 (601)
Q Consensus 97 aAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCIL-I~nSPPhplP~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~ 175 (601)
..++-||+.||+..-++..... ......+.++++ |..|+-+... |.|. +-.+=+
T Consensus 116 s~LagALS~ALCyINR~~~~~~-~~~~~~~~RILv~~s~s~d~~~Q---Yi~~---------------------MN~iFa 170 (276)
T PF03850_consen 116 SLLAGALSMALCYINRISRESP-SGGTSLKSRILVIVSGSPDSSSQ---YIPL---------------------MNCIFA 170 (276)
T ss_pred hhhHHHHHHHHHHHhhhhhccc-CCCCCcCccEEEEEecCCCccHH---HHHH---------------------HHHHHH
Confidence 7899999999987765421110 001234444544 7777766643 3332 344556
Q ss_pred hhhcCcEEEEeCCC--cchHHHHHHHHhcCC
Q 007491 176 FVQCSVSLSVICPK--QLPKLTAIYNAAKRN 204 (601)
Q Consensus 176 faEk~IsLSIIsPR--klP~Lk~Lfeka~~n 204 (601)
.++.+|.+.+..-. ....|+-.-+..+|-
T Consensus 171 Aqk~~v~IDv~~L~~~~s~fLqQa~d~T~G~ 201 (276)
T PF03850_consen 171 AQKQKVPIDVCKLGGKDSTFLQQASDITGGI 201 (276)
T ss_pred HhcCCceeEEEEecCCchHHHHHHHHHhCce
Confidence 67888888888444 477787777666554
No 93
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=29.93 E-value=8.2e+02 Score=27.30 Aligned_cols=188 Identities=18% Similarity=0.270 Sum_probs=103.0
Q ss_pred ceEEEEEecccccc--hhHHHHH---HHHHHHH-HHHhhcccccCccCCCCcceEEEEEeecCCCccccceee-ccCCcC
Q 007491 4 KQLIVAVEGTAAMG--PYWQSIV---SDYLEKI-IRCFCVNELAGQKTSASNVELSLVTFNTHGSYCACLVQR-SGWTKD 76 (601)
Q Consensus 4 ~DIVFVIDgTAsMG--pYi~~Lk---tnYI~Pi-LEyF~~g~l~e~~~g~~s~eyGLVvYrd~~P~~e~~Vqr-~GfTsD 76 (601)
+-+++++|.+.+|- +|.+.-+ ..|.+.. .|+|.+.++ ..++++..||.- +++ +..--|
T Consensus 88 Rhl~l~lD~Seam~e~Df~p~r~a~vikya~~Fv~eFf~qNPi---------Sqlsii~irdg~------a~~~s~~~gn 152 (421)
T COG5151 88 RHLHLILDVSEAMDESDFLPTRRANVIKYAEGFVPEFFSQNPI---------SQLSIISIRDGC------AKYTSSMDGN 152 (421)
T ss_pred heeEEEEEhhhhhhhhhccchHHHHHHHHHHHHhHHHhccCCc---------hheeeeehhhhH------HHHhhhcCCC
Confidence 35899999999985 2333333 2333332 444555444 468888877643 222 234568
Q ss_pred HHHHHHHhcCcccCCCCCchhhHHHHHHHH-HhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCcccccccccccc
Q 007491 77 VDIFLHWLSTIPFAGGGFNDAAIAEGLSEA-LMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNLDQN 155 (601)
Q Consensus 77 ~~~FlqwLdsI~FsGGGfspaAVAEGLAeA-L~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~~~g 155 (601)
++.++..|+++.---|-|+ +-.||.-| ..+... + .-.-|-+.+|-.|--
T Consensus 153 pq~hi~~lkS~rd~~gnfS---LqNaLEmar~~l~~~-~--------~H~trEvLiifgS~s------------------ 202 (421)
T COG5151 153 PQAHIGQLKSKRDCSGNFS---LQNALEMARIELMKN-T--------MHGTREVLIIFGSTS------------------ 202 (421)
T ss_pred HHHHHHHhhcccccCCChh---HHhHHHHhhhhhccc-c--------cccceEEEEEEeecc------------------
Confidence 8999999999984444454 22333333 112111 0 012233433432211
Q ss_pred chhhhhhcccCCCHHHHHHHhhhcCcEEEEe-CCCcchHHHHHHHHhcCCCCCCCCCCCCCCCCceEEEecccchhhh-h
Q 007491 156 ENNEAQAESRLSDAETVAKSFVQCSVSLSVI-CPKQLPKLTAIYNAAKRNPRAADPPVDNSKNPHFLVLISENFMEAR-A 233 (601)
Q Consensus 156 e~~~~~~~s~laDaEqVAk~faEk~IsLSII-sPRklP~Lk~Lfeka~~n~~~a~~s~dyAKdP~HLVLL~g~~L~ar-~ 233 (601)
..+.+|..+....+..-+|.+-+| .--..-..|.|..+ .||...-++-|+|.+--|+|- .
T Consensus 203 -------t~DPgdi~~tid~Lv~~~IrV~~igL~aevaicKeicka-----------Tn~~~e~~y~v~vde~Hl~el~~ 264 (421)
T COG5151 203 -------TRDPGDIAETIDKLVAYNIRVHFIGLCAEVAICKEICKA-----------TNSSTEGRYYVPVDEGHLSELMR 264 (421)
T ss_pred -------cCCCccHHHHHHHHHhhceEEEEEeehhHHHHHHHHHhh-----------cCcCcCceeEeeecHHHHHHHHH
Confidence 122335666666777779999999 34446677777764 345666688899888766542 1
Q ss_pred hccCCCCCCCC-CCCCCccccc
Q 007491 234 ALSRPGVANLA-PNQNPVKMDI 254 (601)
Q Consensus 234 ~l~~~~~~~l~-~~~~~~~~~~ 254 (601)
-+.+|...+-. ..-+.+||--
T Consensus 265 E~~~P~~~n~~k~~~sLVkmGF 286 (421)
T COG5151 265 ELSHPTDFNGTKTDLSLVKMGF 286 (421)
T ss_pred hcCCCCCCCccccCceEEEecC
Confidence 23334333322 2234555543
No 94
>PRK07200 aspartate/ornithine carbamoyltransferase family protein; Validated
Probab=29.15 E-value=2.2e+02 Score=31.73 Aligned_cols=108 Identities=6% Similarity=0.026 Sum_probs=59.5
Q ss_pred CccccceeeccC-----CcCHHHHHHHhc---------Cc-c-cCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCC
Q 007491 62 SYCACLVQRSGW-----TKDVDIFLHWLS---------TI-P-FAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDG 125 (601)
Q Consensus 62 P~~e~~Vqr~Gf-----TsD~~~FlqwLd---------sI-~-FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~ 125 (601)
.|.+++|-|+++ ...++++-.+.+ .+ + ..+|| ..++=..+|+-.+++.+... .. ... .
T Consensus 113 ~y~D~IviR~~~~~g~~~~~~~ela~~~~~~~~~~~~~~~pPVINa~~-~~~HPtQaLaDl~TI~E~~G--~~--~~l-~ 186 (395)
T PRK07200 113 FMADVIGIRDDMYIGKGNAYMREVGAAVDDGYKQGVLPQRPTLVNLQC-DIDHPTQSMADLLHLIEHFG--GL--ENL-K 186 (395)
T ss_pred HhCCEEEEecCcccccccHHHHHHHHHhhhhcccccccCCCeEEECCC-CCCCcHHHHHHHHHHHHHhC--CC--ccc-C
Confidence 577778888643 233566666652 33 3 44443 35777788888888876532 10 011 1
Q ss_pred ccEEEEeec-CCCCCCCCCccccccccccccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCCCcchHHHHHHH
Q 007491 126 QRHCILVAA-SNPHPLPTPVYRPQMQNLDQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICPKQLPKLTAIYN 199 (601)
Q Consensus 126 qKhCILI~n-SPPhplP~pVy~P~~~~~~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsPRklP~Lk~Lfe 199 (601)
.+++.+|.+ ...|..+.. .+-.++.++...|+.+.+++|+.+.....+.+
T Consensus 187 g~kVaivg~~~~~~g~~~~------------------------Va~Sl~~~~~~lG~~v~~~~P~~~~~~~~i~~ 237 (395)
T PRK07200 187 GKKIAMTWAYSPSYGKPLS------------------------VPQGIIGLMTRFGMDVTLAHPEGYDLMPEVVE 237 (395)
T ss_pred CCEEEEEeccccccCCcch------------------------HHHHHHHHHHHcCCEEEEECCCccCCCHHHHH
Confidence 123333322 222222211 23566777788899999999997533334443
No 95
>TIGR01285 nifN nitrogenase molybdenum-iron cofactor biosynthesis protein NifN. This protein forms a complex with NifE, and appears as a NifEN in some species. NifEN is a required for producing the molybdenum-iron cofactor of molybdenum-requiring nitrogenases. NifN is closely related to the nitrogenase molybdenum-iron protein beta chain NifK. This model describes most examples of NifN but excludes some cases, such as the putative NifN of Chlorobium tepidum, for which a separate model may be created.
Probab=24.47 E-value=3.2e+02 Score=30.21 Aligned_cols=88 Identities=14% Similarity=0.174 Sum_probs=55.4
Q ss_pred CCcCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCCCCccEEEEeecCCCCCCCCCccccccccc
Q 007491 73 WTKDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNVDGQRHCILVAASNPHPLPTPVYRPQMQNL 152 (601)
Q Consensus 73 fTsD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~~~qKhCILI~nSPPhplP~pVy~P~~~~~ 152 (601)
|-++...+-..|+....+-||+ +.+.|+|.+++..+. .+ .|.|+.+-...+
T Consensus 55 ~~~~~~~~sT~l~E~d~VfGg~--~~L~~~I~~~~~~~~--------------P~-~I~V~ttC~~ei------------ 105 (432)
T TIGR01285 55 FREPIPLQTTAMDEVSTILGGD--EHIEEAIDTLCQRNK--------------PK-AIGLLSTGLTET------------ 105 (432)
T ss_pred ccCCccceeecCCCCceEECcH--HHHHHHHHHHHHhcC--------------CC-EEEEeCCCcccc------------
Confidence 3344556666888888888876 678888888887653 13 455544433221
Q ss_pred cccchhhhhhcccCCCHHHHHHHhhhc-----CcEEEEe-CCCcc--------hHHHHHHHHh
Q 007491 153 DQNENNEAQAESRLSDAETVAKSFVQC-----SVSLSVI-CPKQL--------PKLTAIYNAA 201 (601)
Q Consensus 153 ~~ge~~~~~~~s~laDaEqVAk~faEk-----~IsLSII-sPRkl--------P~Lk~Lfeka 201 (601)
++| |.+.+++.+.++ ++.+-.| +|--. -+++.|+++-
T Consensus 106 ----IGD--------Di~~v~~~~~~e~p~~~~~pvi~v~tpgf~g~~~~G~~~a~~al~~~~ 156 (432)
T TIGR01285 106 ----RGE--------DIARVVRQFREKHPQHKGTAVVTVNTPDFKGSLEDGYAAAVESIIEAW 156 (432)
T ss_pred ----ccc--------CHHHHHHHHHhhcccccCCeEEEecCCCcCCchHHHHHHHHHHHHHHH
Confidence 112 899999999875 7776666 55322 1566676643
No 96
>TIGR02886 spore_II_AA anti-sigma F factor antagonist. The anti-sigma F factor antagonist, also called stage II sporulation protein AA, is a protein universal among endospore-forming bacteria, all of which belong to the Firmcutes
Probab=21.52 E-value=1e+02 Score=26.41 Aligned_cols=41 Identities=12% Similarity=0.010 Sum_probs=33.5
Q ss_pred hcccCCCHHHHHHHhhhcCcEEEEeCCCcchHHHHHHHHhcCC
Q 007491 162 AESRLSDAETVAKSFVQCSVSLSVICPKQLPKLTAIYNAAKRN 204 (601)
Q Consensus 162 ~~s~laDaEqVAk~faEk~IsLSIIsPRklP~Lk~Lfeka~~n 204 (601)
+.+++.....+.+.+.++|+.|.+..++ +.++++|++.+.+
T Consensus 53 Dssgi~~L~~~~~~~~~~g~~l~l~~~~--~~v~~~l~~~gl~ 93 (106)
T TIGR02886 53 DSSGLGVILGRYKKIKNEGGEVIVCNVS--PAVKRLFELSGLF 93 (106)
T ss_pred cchHHHHHHHHHHHHHHcCCEEEEEeCC--HHHHHHHHHhCCc
Confidence 3344555688999999999999999888 7899999988766
No 97
>PRK04523 N-acetylornithine carbamoyltransferase; Reviewed
Probab=20.49 E-value=3.4e+02 Score=29.41 Aligned_cols=82 Identities=15% Similarity=0.246 Sum_probs=50.7
Q ss_pred cCHHHHHHHhcCcccCCCCCchhhHHHHHHHHHhccccCCCCCCCCCCC-CCccEEEE-eecCCCCCCCCCccccccccc
Q 007491 75 KDVDIFLHWLSTIPFAGGGFNDAAIAEGLSEALMMFSVAPNGSQNQQNV-DGQRHCIL-VAASNPHPLPTPVYRPQMQNL 152 (601)
Q Consensus 75 sD~~~FlqwLdsI~FsGGGfspaAVAEGLAeAL~~fd~~~~~~q~~~~~-~~qKhCIL-I~nSPPhplP~pVy~P~~~~~ 152 (601)
..+++|.++ ..++.+=+|+. ++=..+|+-.+++.+... .. +.-|.+|. +.+..+-
T Consensus 127 ~~~~~~a~~-s~vPVINa~~~-~HPtQaLaDl~Ti~e~~g-------~~~~g~ki~i~~~gd~~~~-------------- 183 (335)
T PRK04523 127 QVLNSFAKY-STVPVINMETI-THPCQELAHALALQEHFG-------TTLRGKKYVLTWTYHPKPL-------------- 183 (335)
T ss_pred HHHHHHHHh-CCCCEEECCCC-CChHHHHHHHHHHHHHhC-------CccCCCEEEEEEeccCccc--------------
Confidence 345666666 46888777777 888899999998876532 11 12233321 1121000
Q ss_pred cccchhhhhhcccCCCHHHHHHHhhhcCcEEEEeCC-Ccc
Q 007491 153 DQNENNEAQAESRLSDAETVAKSFVQCSVSLSVICP-KQL 191 (601)
Q Consensus 153 ~~ge~~~~~~~s~laDaEqVAk~faEk~IsLSIIsP-Rkl 191 (601)
.-..+-.++.++...|+++.+++| +.+
T Consensus 184 ------------~~~v~~S~~~~~~~~g~~v~~~~P~~~~ 211 (335)
T PRK04523 184 ------------NTAVANSALLIATRLGMDVTLLCPTPDY 211 (335)
T ss_pred ------------ccHHHHHHHHHHHHcCCEEEEECCchhh
Confidence 001355677788889999999999 765
No 98
>cd06844 STAS Sulphate Transporter and Anti-Sigma factor antagonist domain found in the C-terminal region of sulphate transporters as well as in bacterial and archaeal proteins involved in the regulation of sigma factors. The STAS (Sulphate Transporter and Anti-Sigma factor antagonist) domain is found in the C-terminal region of sulphate transporters as well as in bacterial and archaeal proteins involved in the regulation of sigma factors, like anti-anti-sigma factors and "stressosome" components. The sigma factor regulators are involved in protein-protein interaction which is regulated by phosphorylation.
Probab=20.45 E-value=1e+02 Score=26.33 Aligned_cols=42 Identities=5% Similarity=-0.070 Sum_probs=34.0
Q ss_pred hcccCCCHHHHHHHhhhcCcEEEEeCCCcchHHHHHHHHhcCCC
Q 007491 162 AESRLSDAETVAKSFVQCSVSLSVICPKQLPKLTAIYNAAKRNP 205 (601)
Q Consensus 162 ~~s~laDaEqVAk~faEk~IsLSIIsPRklP~Lk~Lfeka~~n~ 205 (601)
+.+++...+.+++.+.++|+.|.++.++ +.++++|+.++-+.
T Consensus 53 Dssgl~~L~~l~~~~~~~g~~l~l~~~~--~~v~~~l~~~gl~~ 94 (100)
T cd06844 53 DSSGTGVLLERSRLAEAVGGQFVLTGIS--PAVRITLTESGLDK 94 (100)
T ss_pred cHHHHHHHHHHHHHHHHcCCEEEEECCC--HHHHHHHHHhCchh
Confidence 3445556799999999999999999887 78889998876553
No 99
>PF12637 TSCPD: TSCPD domain; InterPro: IPR024434 The domain is found in isolation in many proteins where it has a conserved C-terminal motif TSCPD, after which the domain is named. Most copies of the domain possess 4 conserved cysteines that may be part of an Iron-sulphur cluster. This domain is found at the C terminus of some ribonucleoside-diphosphate reductase enzymes.
Probab=20.27 E-value=1.3e+02 Score=26.78 Aligned_cols=32 Identities=25% Similarity=0.246 Sum_probs=22.4
Q ss_pred cCHHHHHHHhcCcccCCCCC-------c-hhhHHHHHHHH
Q 007491 75 KDVDIFLHWLSTIPFAGGGF-------N-DAAIAEGLSEA 106 (601)
Q Consensus 75 sD~~~FlqwLdsI~FsGGGf-------s-paAVAEGLAeA 106 (601)
-+++++.+-|++|.+...+. + +.+|+.+|++.
T Consensus 55 ~~~~~ii~~L~gi~~~~~~~~~~~~~~S~~D~Ia~~L~~~ 94 (95)
T PF12637_consen 55 VPPEEIIDQLRGIRCGPSGTVGGSRVTSCPDAIAKALEEH 94 (95)
T ss_pred CCHHHHHHHhcCCCCCCCCccCCCccCcHHHHHHHHHHHh
Confidence 45788888999998888775 4 45555555554
No 100
>PF07611 DUF1574: Protein of unknown function (DUF1574); InterPro: IPR011468 This is a family of hypothetical proteins found in Leptospira interrogans and other bacteria.
Probab=20.06 E-value=1.1e+02 Score=33.48 Aligned_cols=35 Identities=20% Similarity=0.261 Sum_probs=32.1
Q ss_pred HHHHHHHhhhcCcEEEEeCCCcchHHHHHHHHhcC
Q 007491 169 AETVAKSFVQCSVSLSVICPKQLPKLTAIYNAAKR 203 (601)
Q Consensus 169 aEqVAk~faEk~IsLSIIsPRklP~Lk~Lfeka~~ 203 (601)
-|+..+.+.|.||.+-+|+|+-.+.+++++++.+.
T Consensus 254 ~e~~L~~ake~~I~~vl~~P~V~~~~~~~~~~~~~ 288 (345)
T PF07611_consen 254 LEKFLKLAKENGIPVVLWWPKVSPPYEKLYKELKV 288 (345)
T ss_pred HHHHHHHHHHcCCcEEEEEeccCHHHHHHHHhhch
Confidence 47888999999999999999999999999988766
Done!