Query 027721
Match_columns 219
No_of_seqs 109 out of 217
Neff 6.2
Searched_HMMs 46136
Date Fri Mar 29 14:10:30 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/027721.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/027721hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF03850 Tfb4: Transcription f 100.0 5.2E-64 1.1E-68 444.0 19.6 198 13-215 1-224 (276)
2 TIGR00627 tfb4 transcription f 100.0 1.7E-57 3.7E-62 402.3 20.1 199 12-215 1-226 (279)
3 KOG2487 RNA polymerase II tran 100.0 4.8E-57 1E-61 391.5 17.2 205 4-215 14-244 (314)
4 COG5242 TFB4 RNA polymerase II 100.0 7.2E-52 1.6E-56 352.0 16.8 198 7-216 14-232 (296)
5 cd01452 VWA_26S_proteasome_sub 97.9 0.001 2.2E-08 56.3 16.7 140 15-185 5-147 (187)
6 cd01453 vWA_transcription_fact 97.8 0.0039 8.5E-08 51.9 18.2 160 16-208 6-177 (183)
7 cd01465 vWA_subgroup VWA subgr 97.2 0.077 1.7E-06 42.2 17.0 156 16-205 3-169 (170)
8 smart00327 VWA von Willebrand 96.9 0.12 2.6E-06 40.6 15.6 151 15-193 3-154 (177)
9 cd01482 vWA_collagen_alphaI-XI 96.6 0.23 5E-06 39.9 15.8 147 16-193 3-149 (164)
10 cd01480 vWA_collagen_alpha_1-V 96.6 0.33 7.1E-06 40.0 17.0 149 16-193 5-159 (186)
11 PF04056 Ssl1: Ssl1-like; Int 96.5 0.23 4.9E-06 42.4 15.3 149 31-210 15-172 (193)
12 cd01461 vWA_interalpha_trypsin 96.2 0.48 1E-05 37.5 17.2 159 14-207 3-170 (171)
13 cd00198 vWFA Von Willebrand fa 96.0 0.44 9.6E-06 36.1 15.9 147 16-192 3-150 (161)
14 PF13519 VWA_2: von Willebrand 96.0 0.52 1.1E-05 36.8 15.6 155 16-206 2-169 (172)
15 cd01472 vWA_collagen von Wille 96.0 0.62 1.3E-05 37.2 15.7 147 16-193 3-149 (164)
16 PRK13685 hypothetical protein; 95.9 1.2 2.5E-05 40.4 18.1 164 10-203 85-280 (326)
17 cd01451 vWA_Magnesium_chelatas 95.3 1.3 2.8E-05 36.1 17.4 147 15-193 2-153 (178)
18 cd01467 vWA_BatA_type VWA BatA 95.1 1.4 2.9E-05 35.5 14.7 136 15-184 4-142 (180)
19 cd01466 vWA_C3HC4_type VWA C3H 95.0 1.1 2.3E-05 35.8 13.0 140 16-193 3-143 (155)
20 cd01469 vWA_integrins_alpha_su 94.5 2.2 4.9E-05 34.8 15.2 140 16-184 3-142 (177)
21 cd01458 vWA_ku Ku70/Ku80 N-ter 94.3 2.8 6.1E-05 35.3 16.7 155 14-185 2-173 (218)
22 PF13768 VWA_3: von Willebrand 93.9 2.5 5.5E-05 33.2 14.6 141 16-193 3-145 (155)
23 TIGR03436 acidobact_VWFA VWFA- 92.6 6.8 0.00015 34.5 17.9 146 12-183 52-203 (296)
24 PRK13406 bchD magnesium chelat 92.5 12 0.00025 37.1 17.3 150 10-193 398-556 (584)
25 cd01476 VWA_integrin_invertebr 91.9 5.2 0.00011 31.6 16.1 137 16-184 3-141 (163)
26 cd01475 vWA_Matrilin VWA_Matri 91.7 7.5 0.00016 32.9 16.8 147 16-193 5-154 (224)
27 cd01450 vWFA_subfamily_ECM Von 91.3 5.5 0.00012 30.6 15.3 148 16-192 3-150 (161)
28 cd01474 vWA_ATR ATR (Anthrax T 91.0 7.5 0.00016 31.7 15.5 148 15-193 6-153 (185)
29 PF03731 Ku_N: Ku70/Ku80 N-ter 90.2 8.5 0.00018 32.3 12.3 158 16-183 2-173 (224)
30 cd01463 vWA_VGCC_like VWA Volt 89.7 10 0.00022 31.1 14.9 153 11-193 11-175 (190)
31 cd01462 VWA_YIEM_type VWA YIEM 88.7 9.8 0.00021 29.7 16.8 138 16-191 3-141 (152)
32 cd01471 vWA_micronemal_protein 86.3 16 0.00035 29.5 15.8 73 118-193 84-157 (186)
33 TIGR00868 hCaCC calcium-activa 84.1 57 0.0012 33.9 17.5 151 15-200 306-464 (863)
34 TIGR02031 BchD-ChlD magnesium 83.6 22 0.00049 35.0 12.6 109 11-153 405-513 (589)
35 cd01464 vWA_subfamily VWA subf 80.7 28 0.0006 28.0 13.4 144 16-193 6-157 (176)
36 cd01456 vWA_ywmD_type VWA ywmD 78.7 36 0.00078 28.1 16.8 160 8-193 15-188 (206)
37 PF00092 VWA: von Willebrand f 78.4 30 0.00064 27.0 15.1 148 16-193 2-152 (178)
38 cd01470 vWA_complement_factors 77.0 39 0.00086 27.6 17.2 168 16-204 3-195 (198)
39 COG1240 ChlD Mg-chelatase subu 76.7 19 0.0004 32.2 8.5 111 11-152 76-186 (261)
40 TIGR03788 marine_srt_targ mari 66.3 1.4E+02 0.003 29.3 20.3 148 11-193 269-417 (596)
41 TIGR02442 Cob-chelat-sub cobal 62.5 1.7E+02 0.0037 29.1 17.6 112 13-155 465-576 (633)
42 KOG2884 26S proteasome regulat 58.6 1.3E+02 0.0028 26.5 13.5 140 16-185 6-147 (259)
43 cd01473 vWA_CTRP CTRP for CS 56.0 1.2E+02 0.0025 25.2 15.7 154 16-193 3-158 (192)
44 PF04811 Sec23_trunk: Sec23/Se 55.4 1.3E+02 0.0029 25.6 12.2 154 13-185 3-202 (243)
45 KOG2807 RNA polymerase II tran 54.2 1.9E+02 0.0041 27.0 13.1 168 15-214 62-239 (378)
46 cd01477 vWA_F09G8-8_type VWA F 50.6 1.5E+02 0.0032 24.8 16.6 153 13-193 19-181 (193)
47 PTZ00441 sporozoite surface pr 47.2 3.1E+02 0.0066 27.4 17.3 72 118-193 126-198 (576)
48 cd01454 vWA_norD_type norD typ 43.7 1.6E+02 0.0036 23.3 14.9 138 16-185 3-153 (174)
49 COG5148 RPN10 26S proteasome r 34.6 3.1E+02 0.0066 23.8 12.3 139 15-184 5-146 (243)
50 cd00529 RuvC_resolvase Hollida 28.0 35 0.00076 27.4 1.5 15 120-134 139-153 (154)
51 COG4867 Uncharacterized protei 27.5 2.2E+02 0.0048 27.7 6.8 92 117-213 531-649 (652)
52 COG4245 TerY Uncharacterized p 25.9 4.3E+02 0.0094 22.8 8.7 97 118-215 78-190 (207)
53 TIGR00228 ruvC crossover junct 24.3 40 0.00088 27.7 1.2 17 121-137 137-153 (156)
54 PF11373 DUF3175: Protein of u 22.1 1.4E+02 0.0031 22.2 3.6 22 117-138 43-64 (86)
No 1
>PF03850 Tfb4: Transcription factor Tfb4; InterPro: IPR004600 Members of this family are part of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. The core-TFIIH basal transcription factor complex has six subunits, this is the p34 subunit.; GO: 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent, 0000439 core TFIIH complex
Probab=100.00 E-value=5.2e-64 Score=444.01 Aligned_cols=198 Identities=42% Similarity=0.645 Sum_probs=171.3
Q ss_pred CCeEEEEEeCCcccccc--CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCC---CC----C--
Q 027721 13 VSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGN---QS----V-- 81 (219)
Q Consensus 13 ~slLviIlD~np~~W~~--~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~---~~----~-- 81 (219)
||||+||||+||.+|+. +.++|++++++++||+||||++++.|+|||||+|+++++||||++.... .. .
T Consensus 1 pSLLvIILD~nP~~W~~~~~~~~l~~~l~~llvFlNahL~l~~~N~vaVIAs~~~~s~~LYP~~~~~~~~~~~~~~~~~~ 80 (276)
T PF03850_consen 1 PSLLVIILDTNPLAWGQLSDQLSLSQFLDSLLVFLNAHLALNHSNQVAVIASHSNSSKFLYPSPSSSESSNSGDVEMNSS 80 (276)
T ss_pred CcEEEEEEECCHHHHhhccccccHHHHHHHHHHHHHHHHhhCccCCEEEEEEcCCccEEEeCCCccccccCCCccccccc
Confidence 79999999999999998 5699999999999999999999999999999999999999999988321 10 0
Q ss_pred --CCCCchhHH-HHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCC---CCCCceEEE-EecCC
Q 027721 82 --GNGRMPSLC-ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGL---LHPQPRILC-LQGSP 154 (219)
Q Consensus 82 --~~~~~~~~~-~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~---~~~~~RILv-is~S~ 154 (219)
+..+.+..+ +.|.++++++++++...... +.+++|+||||||||||||+.|+.. .++++|||| +++|+
T Consensus 81 ~~~~y~~f~~v~~~v~~~l~~l~~~~~~~~~~-----~~~s~LagALS~ALCyINR~~~~~~~~~~~~~~RILv~~s~s~ 155 (276)
T PF03850_consen 81 DSNKYRQFRNVDETVLEELKKLMSETSESSDS-----TTSSLLAGALSMALCYINRISRESPSGGTSLKSRILVIVSGSP 155 (276)
T ss_pred ccchhHHHHHHHHHHHHHHHHHHhhccccccc-----ccchhhHHHHHHHHHHHhhhhhcccCCCCCcCccEEEEEecCC
Confidence 011122223 45889999999986554432 2248999999999999999988764 589999999 99999
Q ss_pred CCchhhhHHHHHHHHhhhCCceeeEeeeCCCCchhhhhh--------ccccchhHHHHHHHHhcCCCCC
Q 027721 155 DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQC--------MSDRLLTLLVEYIISLNNWMGS 215 (219)
Q Consensus 155 d~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~~s~fLQQA--------~~~~~~~gLlQyLl~~~~~~~~ 215 (219)
|.++||||+|||||||||++|+||||+|+.++++||||| +++++++||+|||||.|+++++
T Consensus 156 d~~~QYi~~MN~iFaAqk~~v~IDv~~L~~~~s~fLqQa~d~T~G~y~~~~~~~~l~q~L~~~fl~~~~ 224 (276)
T PF03850_consen 156 DSSSQYIPLMNCIFAAQKQKVPIDVCKLGGKDSTFLQQASDITGGIYLKVSKPEGLLQYLLTAFLPDPS 224 (276)
T ss_pred CccHHHHHHHHHHHHHhcCCceeEEEEecCCchHHHHHHHHHhCceeeccCccccHHHHHHHhhcCCHH
Confidence 999999999999999999999999999987799999999 9999999999999999999864
No 2
>TIGR00627 tfb4 transcription factor tfb4. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=100.00 E-value=1.7e-57 Score=402.28 Aligned_cols=199 Identities=31% Similarity=0.447 Sum_probs=168.8
Q ss_pred CCCeEEEEEeCCcccccc-----CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCC-----
Q 027721 12 DVSLVVVLLDTNPFFWSS-----SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSV----- 81 (219)
Q Consensus 12 ~~slLviIlD~np~~W~~-----~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~----- 81 (219)
++|||+||||+||.+|+. ++++|++++++++||+||||+++++|+|||||+|+++++||||++.++++..
T Consensus 1 ~~slL~vvlD~np~~W~~~~~~~~~~~l~~~l~sllvF~NahL~l~~~N~vaVIAs~~~~~~~LYps~~~~~~~~~~~~~ 80 (279)
T TIGR00627 1 NPSLLVVIIEANPCSWGMLALAHGKRTISKVLRAIVVFLNAHLAFNANNKLAVIASHSQDNKYLYPSTRCEDRNASELDP 80 (279)
T ss_pred CCcEEEEEEeCCHHHHHHHhhccCCCcHHHHHHHHHHHHHHHHhcCccCCEEEEEecCCcceEEecCCcccccccccccc
Confidence 479999999999999954 4789999999999999999999999999999999999999999875432111
Q ss_pred -----CCCCchhHH-HHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccC--CCCCCceEEEEecC
Q 027721 82 -----GNGRMPSLC-ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSG--LLHPQPRILCLQGS 153 (219)
Q Consensus 82 -----~~~~~~~~~-~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~--~~~~~~RILvis~S 153 (219)
+..+.+..+ +.|.++|++++.++...... ..+++|+||||||||||||+.++. .+++++|||||++|
T Consensus 81 ~~~~~~~y~~f~~v~~~v~~~l~~l~~~~~~~~~~-----~~~s~lagals~ALcyinr~~~~~~~~~~~~~RIlii~~s 155 (279)
T TIGR00627 81 KRLRELLYRDFRTVDETIVEEIKPLMAHADKHMKK-----DSRTVLAGALSDALGYINRSEQSETASEKLKSRILVISIT 155 (279)
T ss_pred ccccchhccchhHHHHHHHHHHHHHHhhchhcccc-----cccccchhHHHhhhhhhcccccccccCcCCcceEEEEECC
Confidence 111112223 35899999999876543321 257899999999999999997753 36899999999999
Q ss_pred CCCchhhhHHHHHHHHhhhCCceeeEeeeCCC-Cchhhhhh--------ccccchhHHHHHHHHhcCCCCC
Q 027721 154 PDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQC--------MSDRLLTLLVEYIISLNNWMGS 215 (219)
Q Consensus 154 ~d~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~-~s~fLQQA--------~~~~~~~gLlQyLl~~~~~~~~ 215 (219)
+|.++||||+||+||+|||++|+||||+|+.+ ++.||||| +++++++||+||||+.|+++++
T Consensus 156 ~~~~~qYi~~mn~Ifaaqk~~I~Idv~~L~~e~~~~~lqQa~~~TgG~Y~~~~~~~~L~q~L~~~~~pp~~ 226 (279)
T TIGR00627 156 PDMALQYIPLMNCIFSAQKQNIPIDVVSIGGDFTSGFLQQAADITGGSYLHVKKPQGLLQYLMTNMLPDPT 226 (279)
T ss_pred CCchHHHHHHHHHHHHHHHcCceEEEEEeCCccccHHHHHHHHHhCCEEeccCCHhHHHHHHHHhcCCChh
Confidence 99999999999999999999999999999754 69999999 7789999999999999998864
No 3
>KOG2487 consensus RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB4 [Transcription; Replication, recombination and repair]
Probab=100.00 E-value=4.8e-57 Score=391.50 Aligned_cols=205 Identities=30% Similarity=0.421 Sum_probs=175.9
Q ss_pred CCCCCCCCCCCeEEEEEeCCcccccc-----CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCC
Q 027721 4 APSKLYSDDVSLVVVLLDTNPFFWSS-----SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGN 78 (219)
Q Consensus 4 ~~~~~~~~~~slLviIlD~np~~W~~-----~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~ 78 (219)
+.+..+++.||||+||||+||..|+. +..++.+++++|+||+||||++|+.|+|||||+|+++.+||||++...+
T Consensus 14 ~~~~~dn~~~slL~vlId~~p~~Wg~~as~~~~~ti~kvl~aivVFlNAHL~~~~~NrvaViA~~~q~~~~lyp~st~~e 93 (314)
T KOG2487|consen 14 PASHRDNLNPSLLVVLIDANPCSWGMLASAENWETISKVLNAIVVFLNAHLAFSRNNRVAVIASHSQVDNYLYPSSTRCE 93 (314)
T ss_pred cCcccccCCceeEEEEEecCcchhhhhhhhcCceeHHHHHHHHHHHHHHHHhhccCCcEEEEEecccccceeccccccCC
Confidence 34556677899999999999999986 5669999999999999999999999999999999999999999876533
Q ss_pred CCC-----CC-CCchhH------HHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccC-CCCCCc
Q 027721 79 QSV-----GN-GRMPSL------CATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSG-LLHPQP 145 (219)
Q Consensus 79 ~~~-----~~-~~~~~~------~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~-~~~~~~ 145 (219)
+.. .. ...+.+ -+.|.++|.++|.....-... ..+.+||||+.|||||||+.++. ++++++
T Consensus 94 ~~n~~~~~~t~~~~~~y~~~~~~d~tiv~ei~~lm~~~~~~~~~------~rt~lagals~~L~yi~~~~ke~~~~~lkS 167 (314)
T KOG2487|consen 94 DRNASELDPTRLVLFDYSEFRTVDDTIVEEIYRLMEHPDKYDVG------DRTVLAGALSDALGYINRLHKEEASEKLKS 167 (314)
T ss_pred ccCccccCchhhhcchhhhhcccchHHHHHHHHHHhCccccccc------cceeeccchhhccchHhhhhhhhhhhhhhc
Confidence 110 00 111111 235899999999976543331 37899999999999999998874 578999
Q ss_pred eEEEEecCCCCchhhhHHHHHHHHhhhCCceeeEeeeCCCCchhhhhh--------ccccchhHHHHHHHHhcCCCCC
Q 027721 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQC--------MSDRLLTLLVEYIISLNNWMGS 215 (219)
Q Consensus 146 RILvis~S~d~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~~s~fLQQA--------~~~~~~~gLlQyLl~~~~~~~~ 215 (219)
||||++.++|.+.||||+|||||+|||++|+||||+|| ++++||||| +|+++|+||||||||.|+++++
T Consensus 168 RilV~t~t~d~~~qyi~~MNciFaAqKq~I~Idv~~l~-~~s~~LqQa~D~TGG~YL~v~~~~gLLqyLlt~~~~D~~ 244 (314)
T KOG2487|consen 168 RILVFTLTRDRALQYIPYMNCIFAAQKQNIPIDVVSLG-GDSGFLQQACDITGGDYLHVEKPDGLLQYLLTLLLTDPE 244 (314)
T ss_pred eEEEEEechHHHhhhhhHHHHHHHHHhcCceeEEEEec-CCchHHHHHHhhcCCeeEecCCcchHHHHHHHHhcCCcc
Confidence 99999999999999999999999999999999999995 779999999 9999999999999999999986
No 4
>COG5242 TFB4 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB4 [Transcription / DNA replication, recombination, and repair]
Probab=100.00 E-value=7.2e-52 Score=352.03 Aligned_cols=198 Identities=27% Similarity=0.378 Sum_probs=172.4
Q ss_pred CCCCCCCCeEEEEEeCCcccccc--CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCC---CCC
Q 027721 7 KLYSDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGN---QSV 81 (219)
Q Consensus 7 ~~~~~~~slLviIlD~np~~W~~--~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~---~~~ 81 (219)
+-.||.||||+||||.+|..|+. ...+..++|+.++||+||||++++.|+|||+|+++.+.+||||+..+.. +.+
T Consensus 14 ~vteespslL~viid~~p~~W~~~~ek~~~~kvl~di~VFLNAhlaf~~~NrVaVva~~s~~~~yLypss~s~~k~se~e 93 (296)
T COG5242 14 QVTEESPSLLFVIIDLEPENWELTTEKGSRDKVLNDIVVFLNAHLAFSRNNRVAVVAGYSQGKTYLYPSSESALKASESE 93 (296)
T ss_pred cccccCCceEEEEEecChhhcccccccccHHHHHHHHHHHHHHHHhhccCCeEEEEEeccCceEEeccCcchhhhhhccc
Confidence 35678899999999999999998 5688999999999999999999999999999999999999999987644 223
Q ss_pred CCCCchhH-------HHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCC
Q 027721 82 GNGRMPSL-------CATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSP 154 (219)
Q Consensus 82 ~~~~~~~~-------~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~ 154 (219)
+. +..++ -+.++++|.+++..+.+. .+.+.++|||+.||.|+||+.++ .++++||||++.|.
T Consensus 94 ~t-r~sd~yrrfr~vde~~i~eiyrl~e~~~k~--------sqr~~v~gams~glay~n~~~~e--~slkSriliftlsG 162 (296)
T COG5242 94 NT-RNSDMYRRFRNVDETDITEIYRLIEHPHKN--------SQRYDVGGAMSLGLAYCNHRDEE--TSLKSRILIFTLSG 162 (296)
T ss_pred Cc-cchhhhhhhcccchHHHHHHHHHHhCcccc--------cceeehhhhhhhhHHHHhhhccc--ccccceEEEEEecC
Confidence 32 22222 234789999999876542 24688999999999999999876 68999999999977
Q ss_pred -CCchhhhHHHHHHHHhhhCCceeeEeeeCCCCchhhhhh--------ccccchhHHHHHHHHhcCCCCCC
Q 027721 155 -DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQC--------MSDRLLTLLVEYIISLNNWMGSF 216 (219)
Q Consensus 155 -d~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~~s~fLQQA--------~~~~~~~gLlQyLl~~~~~~~~~ 216 (219)
|...||||+|||||+|||++||||||.|+ +.+.||||+ +++++++|||||||+.+.++++-
T Consensus 163 ~d~~~qYip~mnCiF~Aqk~~ipI~v~~i~-g~s~fl~Q~~daTgG~Yl~ve~~eGllqyL~~~lf~d~~l 232 (296)
T COG5242 163 RDRKDQYIPYMNCIFAAQKFGIPISVFSIF-GNSKFLLQCCDATGGDYLTVEDTEGLLQYLLSLLFTDGEL 232 (296)
T ss_pred chhhhhhchhhhheeehhhcCCceEEEEec-CccHHHHHHhhccCCeeEeecCchhHHHHHHHHhcCCCCc
Confidence 99999999999999999999999999995 779999999 99999999999999999998863
No 5
>cd01452 VWA_26S_proteasome_subunit 26S proteasome plays a major role in eukaryotic protein breakdown, especially for ubiquitin-tagged proteins. It is an ATP-dependent protease responsible for the bulk of non-lysosomal proteolysis in eukaryotes, often using covalent modification of proteins by ubiquitylation. It consists of a 20S proteolytic core particle (CP) and a 19S regulatory particle (RP). The CP is an ATP independent peptidase consisting of hydrolyzing activities. One or both ends of CP carry the RP that confers both ubiquitin and ATP dependence to the 26S proteosome. The RP's proposed functions include recognition of substrates and translocation of these to CP for proteolysis. The RP can dissociate into a stable lid and base subcomplexes. The base is composed of three non-ATPase subunits (Rpn 1, 2 and 10). A single residue in the vWA domain of Rpn10 has been implicated to be responsible for stabilizing the lid-base association.
Probab=97.95 E-value=0.001 Score=56.28 Aligned_cols=140 Identities=14% Similarity=0.156 Sum_probs=99.7
Q ss_pred eEEEEEeCCcccccc--CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHH
Q 027721 15 LVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCAT 92 (219)
Q Consensus 15 lLviIlD~np~~W~~--~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~ 92 (219)
-.+|+||.+..--.. .|-=|....+.+-.|++.|.-.+..+++.+|+...+.+..+-|-..+
T Consensus 5 a~vi~lD~S~sM~a~D~~PnRL~aak~~i~~~~~~f~~~np~~~vGlv~fag~~a~v~~plT~D---------------- 68 (187)
T cd01452 5 ATMICIDNSEYMRNGDYPPTRFQAQADAVNLICQAKTRSNPENNVGLMTMAGNSPEVLVTLTND---------------- 68 (187)
T ss_pred EEEEEEECCHHHHcCCCCCCHHHHHHHHHHHHHHHHHhcCCCccEEEEEecCCceEEEECCCCC----------------
Confidence 468999997665444 34457888888999999999999999999999988778777775543
Q ss_pred HHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCC-CCchhhhHHHHHHHHhh
Q 027721 93 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSP-DGPEQYVAIMNAIFSAQ 171 (219)
Q Consensus 93 v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~-d~~~QYi~lMN~iFaAq 171 (219)
...+..++.+-.. ..++.|..||.+|+-...... ...-+.||+|+-+|+ +....- +...+-.++
T Consensus 69 -~~~~~~~L~~i~~---------~g~~~l~~AL~~A~~~L~~~~---~~~~~~rivi~v~S~~~~d~~~--i~~~~~~lk 133 (187)
T cd01452 69 -QGKILSKLHDVQP---------KGKANFITGIQIAQLALKHRQ---NKNQKQRIVAFVGSPIEEDEKD--LVKLAKRLK 133 (187)
T ss_pred -HHHHHHHHHhCCC---------CCcchHHHHHHHHHHHHhcCC---CcCCcceEEEEEecCCcCCHHH--HHHHHHHHH
Confidence 2334444443211 124568899999986665431 122346999988887 332222 568999999
Q ss_pred hCCceeeEeeeCCC
Q 027721 172 RSMVPIDSCYLGAQ 185 (219)
Q Consensus 172 K~~V~IDvc~L~~~ 185 (219)
|.+|.||+..+|+.
T Consensus 134 k~~I~v~vI~~G~~ 147 (187)
T cd01452 134 KNNVSVDIINFGEI 147 (187)
T ss_pred HcCCeEEEEEeCCC
Confidence 99999999999854
No 6
>cd01453 vWA_transcription_factor_IIH_type Transcription factors IIH type: TFIIH is a multiprotein complex that is one of the five general transcription factors that binds RNA polymerase II holoenzyme. Orthologues of these genes are found in all completed eukaryotic genomes and all these proteins contain a VWA domain. The p44 subunit of TFIIH functions as a DNA helicase in RNA polymerase II transcription initiation and DNA repair, and its transcriptional activity is dependent on its C-terminal Zn-binding domains. The function of the vWA domain is unclear, but may be involved in complex assembly. The MIDAS motif is not conserved in this sub-group.
Probab=97.83 E-value=0.0039 Score=51.94 Aligned_cols=160 Identities=13% Similarity=0.143 Sum_probs=108.4
Q ss_pred EEEEEeCCcccccc--CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHH
Q 027721 16 VVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATL 93 (219)
Q Consensus 16 LviIlD~np~~W~~--~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v 93 (219)
++|+||+++-=-.. .+--|+.+-..+.-|++.++..+...++.+|+...+.++.+-|-....
T Consensus 6 ivi~lD~S~SM~a~D~~ptRl~~ak~~~~~fi~~~~~~~~~~~vglv~f~~~~a~~~~PlT~D~---------------- 69 (183)
T cd01453 6 LIIVIDCSRSMEEQDLKPSRLAVVLKLLELFIEEFFDQNPISQLGIISIKNGRAEKLTDLTGNP---------------- 69 (183)
T ss_pred EEEEEECcHHHhcCCCCchHHHHHHHHHHHHHHHHhhcCccccEEEEEEcCCccEEEECCCCCH----------------
Confidence 78999998774433 345588889999999999999999999999999777788887764331
Q ss_pred HHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecC-CCC-chhhhHHHHHHHHhh
Q 027721 94 LQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGS-PDG-PEQYVAIMNAIFSAQ 171 (219)
Q Consensus 94 ~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S-~d~-~~QYi~lMN~iFaAq 171 (219)
+.+...+... ... ...+.+..||.+|+-+..+.. ..-+.||+|+..+ .+. +.++ -...=.++
T Consensus 70 -~~~~~~L~~~--~~~------~G~t~l~~aL~~A~~~l~~~~----~~~~~~iiil~sd~~~~~~~~~---~~~~~~l~ 133 (183)
T cd01453 70 -RKHIQALKTA--REC------SGEPSLQNGLEMALESLKHMP----SHGSREVLIIFSSLSTCDPGNI---YETIDKLK 133 (183)
T ss_pred -HHHHHHhhcc--cCC------CCchhHHHHHHHHHHHHhcCC----ccCceEEEEEEcCCCcCChhhH---HHHHHHHH
Confidence 1222222211 011 124679999999988886431 1224567766443 222 2223 34555678
Q ss_pred hCCceeeEeeeCCCCchhhhhh--------ccccchhHHHHHHHH
Q 027721 172 RSMVPIDSCYLGAQNSAFLQQC--------MSDRLLTLLVEYIIS 208 (219)
Q Consensus 172 K~~V~IDvc~L~~~~s~fLQQA--------~~~~~~~gLlQyLl~ 208 (219)
|.+|.|++..+| .+..+||+. +.+.+++.|-+-+.-
T Consensus 134 ~~~I~v~~IgiG-~~~~~L~~ia~~tgG~~~~~~~~~~l~~~~~~ 177 (183)
T cd01453 134 KENIRVSVIGLS-AEMHICKEICKATNGTYKVILDETHLKELLLE 177 (183)
T ss_pred HcCcEEEEEEec-hHHHHHHHHHHHhCCeeEeeCCHHHHHHHHHh
Confidence 899999999997 556678887 777888888876654
No 7
>cd01465 vWA_subgroup VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if n
Probab=97.16 E-value=0.077 Score=42.22 Aligned_cols=156 Identities=8% Similarity=0.053 Sum_probs=94.3
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ 95 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~ 95 (219)
++++||++.---.. .+..+...+..++ ..+....++.|+..+.. .+.++|..... + .+
T Consensus 3 ~~~vlD~S~SM~~~---~~~~~k~a~~~~~---~~l~~~~~v~li~f~~~-~~~~~~~~~~~----~-----------~~ 60 (170)
T cd01465 3 LVFVIDRSGSMDGP---KLPLVKSALKLLV---DQLRPDDRLAIVTYDGA-AETVLPATPVR----D-----------KA 60 (170)
T ss_pred EEEEEECCCCCCCh---hHHHHHHHHHHHH---HhCCCCCEEEEEEecCC-ccEEecCcccc----h-----------HH
Confidence 67899998754322 2444444444333 35567789999998765 56666642211 0 23
Q ss_pred HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCch--hhhHHHHHHHHhhhC
Q 027721 96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPE--QYVAIMNAIFSAQRS 173 (219)
Q Consensus 96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~--QYi~lMN~iFaAqK~ 173 (219)
.+.+.+..-.. ...+.+..||..|+-.+.+..+. .-..+|++++-+.+... .+-.+-.+.=.+++.
T Consensus 61 ~l~~~l~~~~~---------~g~T~~~~al~~a~~~~~~~~~~---~~~~~ivl~TDG~~~~~~~~~~~~~~~~~~~~~~ 128 (170)
T cd01465 61 AILAAIDRLTA---------GGSTAGGAGIQLGYQEAQKHFVP---GGVNRILLATDGDFNVGETDPDELARLVAQKRES 128 (170)
T ss_pred HHHHHHHcCCC---------CCCCCHHHHHHHHHHHHHhhcCC---CCeeEEEEEeCCCCCCCCCCHHHHHHHHHHhhcC
Confidence 34444443211 12467889999999887664321 22367888887654321 122344455456788
Q ss_pred CceeeEeeeCCC-Cchhhhhh--------ccccchhHHHHH
Q 027721 174 MVPIDSCYLGAQ-NSAFLQQC--------MSDRLLTLLVEY 205 (219)
Q Consensus 174 ~V~IDvc~L~~~-~s~fLQQA--------~~~~~~~gLlQy 205 (219)
+|.|.++.+|.. +..+|++. ..+++++.+-++
T Consensus 129 ~v~i~~i~~g~~~~~~~l~~ia~~~~g~~~~~~~~~~~~~~ 169 (170)
T cd01465 129 GITLSTLGFGDNYNEDLMEAIADAGNGNTAYIDNLAEARKV 169 (170)
T ss_pred CeEEEEEEeCCCcCHHHHHHHHhcCCceEEEeCCHHHHHhh
Confidence 999999999843 56788886 667777776554
No 8
>smart00327 VWA von Willebrand factor (vWF) type A domain. VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.
Probab=96.87 E-value=0.12 Score=40.60 Aligned_cols=151 Identities=14% Similarity=0.063 Sum_probs=92.1
Q ss_pred eEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHH
Q 027721 15 LVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLL 94 (219)
Q Consensus 15 lLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~ 94 (219)
-+++++|++.--- ...+..+...+..|+...-..+...++.|+..+.+... ..|..... + .
T Consensus 3 ~v~l~vD~S~SM~---~~~~~~~~~~~~~~~~~~~~~~~~~~i~ii~f~~~~~~-~~~~~~~~-------~--------~ 63 (177)
T smart00327 3 DVVFLLDGSGSMG---PNRFEKAKEFVLKLVEQLDIGPDGDRVGLVTFSDDATV-LFPLNDSR-------S--------K 63 (177)
T ss_pred cEEEEEeCCCccc---hHHHHHHHHHHHHHHHhcCCCCCCcEEEEEEeCCCceE-EEcccccC-------C--------H
Confidence 3688999976542 12356666666777777766666889999998875433 33322110 0 3
Q ss_pred HHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCC
Q 027721 95 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 174 (219)
Q Consensus 95 ~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~ 174 (219)
+.+.+.+..-... . ...+.+..||-.|+-.+.+........-+..|++++-..+... -.+..++..+++.+
T Consensus 64 ~~~~~~i~~~~~~-~------~~~~~~~~al~~~~~~~~~~~~~~~~~~~~~iviitDg~~~~~--~~~~~~~~~~~~~~ 134 (177)
T smart00327 64 DALLEALASLSYK-L------GGGTNLGAALQYALENLFSKSAGSRRGAPKVLILITDGESNDG--GDLLKAAKELKRSG 134 (177)
T ss_pred HHHHHHHHhcCCC-C------CCCchHHHHHHHHHHHhcCcCCCCCCCCCeEEEEEcCCCCCCC--ccHHHHHHHHHHCC
Confidence 3344444432110 1 1245678888887766642221111122567888876555433 35588889999999
Q ss_pred ceeeEeeeCCC-Cchhhhhh
Q 027721 175 VPIDSCYLGAQ-NSAFLQQC 193 (219)
Q Consensus 175 V~IDvc~L~~~-~s~fLQQA 193 (219)
|.|.++.+|.. +..+|++.
T Consensus 135 i~i~~i~~~~~~~~~~l~~~ 154 (177)
T smart00327 135 VKVFVVGVGNDVDEEELKKL 154 (177)
T ss_pred CEEEEEEccCccCHHHHHHH
Confidence 99999999865 67778777
No 9
>cd01482 vWA_collagen_alphaI-XII-like Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=96.63 E-value=0.23 Score=39.89 Aligned_cols=147 Identities=8% Similarity=-0.044 Sum_probs=83.6
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ 95 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~ 95 (219)
++++||.+.--- ...|.++.+.+..+++.+-......++++|..+.. .+..+|-.... . .+
T Consensus 3 v~~vlD~S~Sm~---~~~~~~~k~~~~~l~~~~~~~~~~~rvgli~fs~~-~~~~~~l~~~~-------~--------~~ 63 (164)
T cd01482 3 IVFLVDGSWSIG---RSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDD-PRTEFDLNAYT-------S--------KE 63 (164)
T ss_pred EEEEEeCCCCcC---hhhHHHHHHHHHHHHhheeeCCCceEEEEEEECCC-eeEEEecCCCC-------C--------HH
Confidence 678999954332 22466666666677776534456789999998776 34344422111 0 22
Q ss_pred HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCc
Q 027721 96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175 (219)
Q Consensus 96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V 175 (219)
.+.+.+.+..... ..+.+..||-.|...+-+..+...+..+..|++++-+... . .+......+++.+|
T Consensus 64 ~l~~~l~~~~~~~--------g~T~~~~aL~~a~~~~~~~~~~~r~~~~k~iillTDG~~~-~---~~~~~a~~lk~~gi 131 (164)
T cd01482 64 DVLAAIKNLPYKG--------GNTRTGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQ-D---DVELPARVLRNLGV 131 (164)
T ss_pred HHHHHHHhCcCCC--------CCChHHHHHHHHHHHhcccccCCCCCCCEEEEEEcCCCCC-c---hHHHHHHHHHHCCC
Confidence 2333333211111 2456888888877655442211223344556667643321 1 34677888999999
Q ss_pred eeeEeeeCCCCchhhhhh
Q 027721 176 PIDSCYLGAQNSAFLQQC 193 (219)
Q Consensus 176 ~IDvc~L~~~~s~fLQQA 193 (219)
.|-++.+|..+-..|+|.
T Consensus 132 ~i~~ig~g~~~~~~L~~i 149 (164)
T cd01482 132 NVFAVGVKDADESELKMI 149 (164)
T ss_pred EEEEEecCcCCHHHHHHH
Confidence 999999975444455554
No 10
>cd01480 vWA_collagen_alpha_1-VI-type VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=96.58 E-value=0.33 Score=40.01 Aligned_cols=149 Identities=15% Similarity=0.061 Sum_probs=85.1
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhc------CCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTL------NQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSL 89 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~------~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~ 89 (219)
++++||++.---.. .|....+.+..+++..+.. ....+++||..+.. .+..+|-..... +
T Consensus 5 vv~vlD~S~Sm~~~---~~~~~k~~~~~~~~~l~~~~~~~i~~~~~rvglv~fs~~-~~~~~~l~~~~~------~---- 70 (186)
T cd01480 5 ITFVLDSSESVGLQ---NFDITKNFVKRVAERFLKDYYRKDPAGSWRVGVVQYSDQ-QEVEAGFLRDIR------N---- 70 (186)
T ss_pred EEEEEeCCCccchh---hHHHHHHHHHHHHHHHhhhhccCCCCCceEEEEEEecCC-ceeeEecccccC------C----
Confidence 78999995543222 3444444455555555432 34578999998755 455665432100 0
Q ss_pred HHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHH
Q 027721 90 CATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFS 169 (219)
Q Consensus 90 ~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFa 169 (219)
.+.+.+.+..-.... ..+.+..||-.|.-.+.+- ..+..+..|++++-+......-..+-++.=.
T Consensus 71 ----~~~l~~~i~~l~~~g--------g~T~~~~AL~~a~~~l~~~---~~~~~~~~iillTDG~~~~~~~~~~~~~~~~ 135 (186)
T cd01480 71 ----YTSLKEAVDNLEYIG--------GGTFTDCALKYATEQLLEG---SHQKENKFLLVITDGHSDGSPDGGIEKAVNE 135 (186)
T ss_pred ----HHHHHHHHHhCccCC--------CCccHHHHHHHHHHHHhcc---CCCCCceEEEEEeCCCcCCCcchhHHHHHHH
Confidence 334444444321111 2456777887777666541 1234566777787654322222334566677
Q ss_pred hhhCCceeeEeeeCCCCchhhhhh
Q 027721 170 AQRSMVPIDSCYLGAQNSAFLQQC 193 (219)
Q Consensus 170 AqK~~V~IDvc~L~~~~s~fLQQA 193 (219)
+++.+|.|=++.+|..+..-|++.
T Consensus 136 ~~~~gi~i~~vgig~~~~~~L~~I 159 (186)
T cd01480 136 ADHLGIKIFFVAVGSQNEEPLSRI 159 (186)
T ss_pred HHHCCCEEEEEecCccchHHHHHH
Confidence 899999999999986555556655
No 11
>PF04056 Ssl1: Ssl1-like; InterPro: IPR007198 Ssl1-like proteins are 40 kDa subunits of the transcription factor II H complex. This domain is often found associated with the C2H2 type Zn-finger (IPR007087 from INTERPRO).; GO: 0008270 zinc ion binding, 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent
Probab=96.48 E-value=0.23 Score=42.35 Aligned_cols=149 Identities=15% Similarity=0.152 Sum_probs=100.4
Q ss_pred cccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHHHHHHHHHhhhhcCCC
Q 027721 31 SLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQ 110 (219)
Q Consensus 31 ~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~~l~~li~~~~~~~~~ 110 (219)
+.-+.-.++.+-.|+.-|+-+|.-.|+.||+...+.++.+-+-..+. ....+.|.++.+. ..
T Consensus 15 PtRl~~~~~~l~~Fv~eff~qNPiSqlgii~~~~~~a~~ls~lsgn~-------------~~h~~~L~~~~~~----~~- 76 (193)
T PF04056_consen 15 PTRLQCVLKALEEFVREFFDQNPISQLGIIVMRDGRAERLSELSGNP-------------QEHIEALKKLRKL----EP- 76 (193)
T ss_pred ccHHHHHHHHHHHHHHHHHhcCChhheeeeeeecceeEEeeecCCCH-------------HHHHHHHHHhccC----CC-
Confidence 44577789999999999999999999999999888888887755432 1123333333221 11
Q ss_pred CCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCC--CchhhhHHHHHHHHhhhCCceeeEeeeCCCCch
Q 027721 111 EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPD--GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSA 188 (219)
Q Consensus 111 ~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d--~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~~s~ 188 (219)
.....|--||-+|......+. .+-..+||+|-+|-- .+. .+-..|=..+|.+|.++|..|+ .+-.
T Consensus 77 -----~G~~SLqN~Le~A~~~L~~~p----~~~srEIlvi~gSl~t~Dp~---di~~ti~~l~~~~IrvsvI~la-aEv~ 143 (193)
T PF04056_consen 77 -----SGEPSLQNGLEMARSSLKHMP----SHGSREILVIFGSLTTCDPG---DIHETIESLKKENIRVSVISLA-AEVY 143 (193)
T ss_pred -----CCChhHHHHHHHHHHHHhhCc----cccceEEEEEEeecccCCch---hHHHHHHHHHHcCCEEEEEEEh-HHHH
Confidence 124557788888877776542 345568999886542 233 3467788889999999999995 4555
Q ss_pred hhhhh-------ccccchhHHHHHHHHhc
Q 027721 189 FLQQC-------MSDRLLTLLVEYIISLN 210 (219)
Q Consensus 189 fLQQA-------~~~~~~~gLlQyLl~~~ 210 (219)
.++.. +.+.--+.-+.-||+.+
T Consensus 144 I~k~i~~~T~G~y~V~lde~H~~~lL~~~ 172 (193)
T PF04056_consen 144 ICKKICKETGGTYGVILDEDHFKELLMEH 172 (193)
T ss_pred HHHHHHHhhCCEEEEecCHHHHHHHHHhh
Confidence 66665 55544444455555543
No 12
>cd01461 vWA_interalpha_trypsin_inhibitor vWA_interalpha trypsin inhibitor (ITI): ITI is a glycoprotein composed of three polypeptides- two heavy chains and one light chain (bikunin). Bikunin confers the protease-inhibitor function while the heavy chains are involved in rendering stability to the extracellular matrix by binding to hyaluronic acid. The heavy chains carry the VWA domain with a conserved MIDAS motif. Although the exact role of the VWA domains remains unknown, it has been speculated to be involved in mediating protein-protein interactions with the components of the extracellular matrix.
Probab=96.16 E-value=0.48 Score=37.50 Aligned_cols=159 Identities=16% Similarity=0.140 Sum_probs=96.6
Q ss_pred CeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHH
Q 027721 14 SLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATL 93 (219)
Q Consensus 14 slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v 93 (219)
.=++++||++.--.+. .+..+-..+..|++. +....++.|++.+.. .+.++|..... + ..-
T Consensus 3 ~~v~~vlD~S~SM~~~---~~~~~~~al~~~l~~---l~~~~~~~l~~Fs~~-~~~~~~~~~~~----~--------~~~ 63 (171)
T cd01461 3 KEVVFVIDTSGSMSGT---KIEQTKEALLTALKD---LPPGDYFNIIGFSDT-VEEFSPSSVSA----T--------AEN 63 (171)
T ss_pred ceEEEEEECCCCCCCh---hHHHHHHHHHHHHHh---CCCCCEEEEEEeCCC-ceeecCcceeC----C--------HHH
Confidence 3478999998877543 366666666665543 456678998887655 34445432110 0 001
Q ss_pred HHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhC
Q 027721 94 LQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRS 173 (219)
Q Consensus 94 ~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~ 173 (219)
++++.+++..-.. . ..+.+..||..|+-.+.+ .+.....|+++|-......+. +...+-.+.+.
T Consensus 64 ~~~~~~~l~~~~~---~------g~T~l~~al~~a~~~l~~-----~~~~~~~iillTDG~~~~~~~--~~~~~~~~~~~ 127 (171)
T cd01461 64 VAAAIEYVNRLQA---L------GGTNMNDALEAALELLNS-----SPGSVPQIILLTDGEVTNESQ--ILKNVREALSG 127 (171)
T ss_pred HHHHHHHHHhcCC---C------CCcCHHHHHHHHHHhhcc-----CCCCccEEEEEeCCCCCCHHH--HHHHHHHhcCC
Confidence 2333344433211 1 245677777777655543 123456788888665433332 23444456667
Q ss_pred CceeeEeeeCC-CCchhhhhh--------ccccchhHHHHHHH
Q 027721 174 MVPIDSCYLGA-QNSAFLQQC--------MSDRLLTLLVEYII 207 (219)
Q Consensus 174 ~V~IDvc~L~~-~~s~fLQQA--------~~~~~~~gLlQyLl 207 (219)
+|+|-++.+|. .+..+|++. +++++.+.+.++|+
T Consensus 128 ~i~i~~i~~g~~~~~~~l~~ia~~~gG~~~~~~~~~~~~~~~~ 170 (171)
T cd01461 128 RIRLFTFGIGSDVNTYLLERLAREGRGIARRIYETDDIESQLL 170 (171)
T ss_pred CceEEEEEeCCccCHHHHHHHHHcCCCeEEEecChHHHHHHhc
Confidence 99999999975 367899998 88999999988874
No 13
>cd00198 vWFA Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.
Probab=96.05 E-value=0.44 Score=36.07 Aligned_cols=147 Identities=14% Similarity=0.140 Sum_probs=86.8
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ 95 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~ 95 (219)
+++++|++..-. ...+..+...+..++..........++.|++.+.. .+..+|..... . .+
T Consensus 3 v~~viD~S~Sm~---~~~~~~~~~~~~~~~~~~~~~~~~~~i~v~~f~~~-~~~~~~~~~~~-------~--------~~ 63 (161)
T cd00198 3 IVFLLDVSGSMG---GEKLDKAKEALKALVSSLSASPPGDRVGLVTFGSN-ARVVLPLTTDT-------D--------KA 63 (161)
T ss_pred EEEEEeCCCCcC---cchHHHHHHHHHHHHHhcccCCCCcEEEEEEecCc-cceeecccccC-------C--------HH
Confidence 678999977651 12355666666666666666567889999998764 34444432210 0 23
Q ss_pred HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCc
Q 027721 96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175 (219)
Q Consensus 96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V 175 (219)
.+.+.++.-.. .. ...+.+..|+..|+-...+.. ......+|++++-....... -.+--++-.+.+.+|
T Consensus 64 ~~~~~~~~~~~-~~------~~~t~~~~al~~~~~~~~~~~---~~~~~~~lvvitDg~~~~~~-~~~~~~~~~~~~~~v 132 (161)
T cd00198 64 DLLEAIDALKK-GL------GGGTNIGAALRLALELLKSAK---RPNARRVIILLTDGEPNDGP-ELLAEAARELRKLGI 132 (161)
T ss_pred HHHHHHHhccc-CC------CCCccHHHHHHHHHHHhcccC---CCCCceEEEEEeCCCCCCCc-chhHHHHHHHHHcCC
Confidence 33444433211 01 134557777777777666543 12345667777765544332 244556777888899
Q ss_pred eeeEeeeCC-CCchhhhh
Q 027721 176 PIDSCYLGA-QNSAFLQQ 192 (219)
Q Consensus 176 ~IDvc~L~~-~~s~fLQQ 192 (219)
.|.+..+|+ ....+|+.
T Consensus 133 ~v~~v~~g~~~~~~~l~~ 150 (161)
T cd00198 133 TVYTIGIGDDANEDELKE 150 (161)
T ss_pred EEEEEEcCCCCCHHHHHH
Confidence 999999975 44555554
No 14
>PF13519 VWA_2: von Willebrand factor type A domain; PDB: 3IBS_B 3RAG_B 2X5N_A.
Probab=96.04 E-value=0.52 Score=36.77 Aligned_cols=155 Identities=15% Similarity=0.115 Sum_probs=88.2
Q ss_pred EEEEEeCCcccccc--CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHH
Q 027721 16 VVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATL 93 (219)
Q Consensus 16 LviIlD~np~~W~~--~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v 93 (219)
+++++|.+.--... ..--+..+...+..+++.+ ..++++|+..+.. .....|-...
T Consensus 2 vv~v~D~SgSM~~~~~~~~~~~~~~~~~~~~~~~~----~~~~v~l~~f~~~-~~~~~~~t~~----------------- 59 (172)
T PF13519_consen 2 VVFVLDNSGSMNGYDGNRTRIDQAKDALNELLANL----PGDRVGLVSFSDS-SRTLSPLTSD----------------- 59 (172)
T ss_dssp EEEEEE-SGGGGTTTSSS-HHHHHHHHHHHHHHHH----TTSEEEEEEESTS-CEEEEEEESS-----------------
T ss_pred EEEEEECCcccCCCCCCCcHHHHHHHHHHHHHHHC----CCCEEEEEEeccc-cccccccccc-----------------
Confidence 68999998775544 2234666666666677663 4669999998775 3445553211
Q ss_pred HHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhC
Q 027721 94 LQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRS 173 (219)
Q Consensus 94 ~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~ 173 (219)
.+.+.+.+.+-...... ...+.+..||..|.-++.+.. .-+..|++|+-..+. ..+.. .+-.+++.
T Consensus 60 ~~~~~~~l~~~~~~~~~-----~~~t~~~~al~~a~~~~~~~~-----~~~~~iv~iTDG~~~-~~~~~---~~~~~~~~ 125 (172)
T PF13519_consen 60 KDELKNALNKLSPQGMP-----GGGTNLYDALQEAAKMLASSD-----NRRRAIVLITDGEDN-SSDIE---AAKALKQQ 125 (172)
T ss_dssp HHHHHHHHHTHHHHG-------SSS--HHHHHHHHHHHHHC-S-----SEEEEEEEEES-TTH-CHHHH---HHHHHHCT
T ss_pred HHHHHHHhhcccccccC-----ccCCcHHHHHHHHHHHHHhCC-----CCceEEEEecCCCCC-cchhH---HHHHHHHc
Confidence 23333333332211010 135678888888887776542 234455566654433 33333 55577899
Q ss_pred CceeeEeeeCCCCc--hhhhhh--------ccc-cchhHHHHHH
Q 027721 174 MVPIDSCYLGAQNS--AFLQQC--------MSD-RLLTLLVEYI 206 (219)
Q Consensus 174 ~V~IDvc~L~~~~s--~fLQQA--------~~~-~~~~gLlQyL 206 (219)
+|.|.++.+|.... ..||+. +.+ ..++.|-+-|
T Consensus 126 ~i~i~~v~~~~~~~~~~~l~~la~~tgG~~~~~~~~~~~l~~~~ 169 (172)
T PF13519_consen 126 GITIYTVGIGSDSDANEFLQRLAEATGGRYFHVDNDPEDLDDAF 169 (172)
T ss_dssp TEEEEEEEES-TT-EHHHHHHHHHHTEEEEEEE-SSSHHHHHHH
T ss_pred CCeEEEEEECCCccHHHHHHHHHHhcCCEEEEecCCHHHHHHHH
Confidence 99999999986544 478887 555 4666665543
No 15
>cd01472 vWA_collagen von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.
Probab=95.98 E-value=0.62 Score=37.18 Aligned_cols=147 Identities=9% Similarity=0.011 Sum_probs=81.1
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ 95 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~ 95 (219)
++++||.+.--+.. .+..+-+.+..+++.+.......++.+|..+.. .+...|-.... + .+
T Consensus 3 vv~vlD~SgSm~~~---~~~~~k~~~~~~~~~l~~~~~~~~~giv~Fs~~-~~~~~~~~~~~-------~--------~~ 63 (164)
T cd01472 3 IVFLVDGSESIGLS---NFNLVKDFVKRVVERLDIGPDGVRVGVVQYSDD-PRTEFYLNTYR-------S--------KD 63 (164)
T ss_pred EEEEEeCCCCCCHH---HHHHHHHHHHHHHhhcccCCCCeEEEEEEEcCc-eeEEEecCCCC-------C--------HH
Confidence 57999998766553 244444444445544333344568999887644 45555533211 0 22
Q ss_pred HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCc
Q 027721 96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175 (219)
Q Consensus 96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V 175 (219)
.+.+.++.-.... ..+.+..||..|...+.+............|++++-+.... . ...+...+.+.+|
T Consensus 64 ~~~~~l~~l~~~~--------g~T~~~~al~~a~~~l~~~~~~~~~~~~~~iiliTDG~~~~-~---~~~~~~~l~~~gv 131 (164)
T cd01472 64 DVLEAVKNLRYIG--------GGTNTGKALKYVRENLFTEASGSREGVPKVLVVITDGKSQD-D---VEEPAVELKQAGI 131 (164)
T ss_pred HHHHHHHhCcCCC--------CCchHHHHHHHHHHHhCCcccCCCCCCCEEEEEEcCCCCCc-h---HHHHHHHHHHCCC
Confidence 2333333211111 24568888888888776432111123344456666443221 1 2334556777899
Q ss_pred eeeEeeeCCCCchhhhhh
Q 027721 176 PIDSCYLGAQNSAFLQQC 193 (219)
Q Consensus 176 ~IDvc~L~~~~s~fLQQA 193 (219)
.|-+..+|..+...|++.
T Consensus 132 ~i~~ig~g~~~~~~L~~i 149 (164)
T cd01472 132 EVFAVGVKNADEEELKQI 149 (164)
T ss_pred EEEEEECCcCCHHHHHHH
Confidence 999999986677777765
No 16
>PRK13685 hypothetical protein; Provisional
Probab=95.95 E-value=1.2 Score=40.40 Aligned_cols=164 Identities=15% Similarity=0.145 Sum_probs=100.5
Q ss_pred CCCCCeEEEEEeCCccccccC--cccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCch
Q 027721 10 SDDVSLVVVLLDTNPFFWSSS--SLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMP 87 (219)
Q Consensus 10 ~~~~slLviIlD~np~~W~~~--~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~ 87 (219)
...+.-++++||++.-=...+ +--+..+-..+..|++. +....++.+|+.+.+ +..+.|...+
T Consensus 85 ~~~~~~vvlvlD~S~SM~~~D~~p~RL~~ak~~~~~~l~~---l~~~d~vglv~Fa~~-a~~~~p~t~d----------- 149 (326)
T PRK13685 85 PRNRAVVMLVIDVSQSMRATDVEPNRLAAAQEAAKQFADE---LTPGINLGLIAFAGT-ATVLVSPTTN----------- 149 (326)
T ss_pred CCCCceEEEEEECCccccCCCCCCCHHHHHHHHHHHHHHh---CCCCCeEEEEEEcCc-eeecCCCCCC-----------
Confidence 334557899999988766542 33477777777778775 356778999998665 4555553211
Q ss_pred hHHHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhc---cCCCCCCceEEEEecCCCCchh----h
Q 027721 88 SLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR---SGLLHPQPRILCLQGSPDGPEQ----Y 160 (219)
Q Consensus 88 ~~~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~---~~~~~~~~RILvis~S~d~~~Q----Y 160 (219)
.+.+.+.++.-... ..+.+..+|..|+-...+..+ ...+....+|+++|-+.+.... -
T Consensus 150 ------~~~l~~~l~~l~~~---------~~T~~g~al~~A~~~l~~~~~~~~~~~~~~~~~IILlTDG~~~~~~~~~~~ 214 (326)
T PRK13685 150 ------REATKNAIDKLQLA---------DRTATGEAIFTALQAIATVGAVIGGGDTPPPARIVLMSDGKETVPTNPDNP 214 (326)
T ss_pred ------HHHHHHHHHhCCCC---------CCcchHHHHHHHHHHHHhhhcccccccCCCCCEEEEEcCCCCCCCCCCCCc
Confidence 33444444432111 234567788888877654311 1112345689999866554321 1
Q ss_pred hHHHHHHHHhhhCCceeeEeeeCCC---------------Cchhhhhh--------ccccchhHHH
Q 027721 161 VAIMNAIFSAQRSMVPIDSCYLGAQ---------------NSAFLQQC--------MSDRLLTLLV 203 (219)
Q Consensus 161 i~lMN~iFaAqK~~V~IDvc~L~~~---------------~s~fLQQA--------~~~~~~~gLl 203 (219)
...+.+.-.|++.+|+|.+..+|.+ +...||+. +++.+++.|-
T Consensus 215 ~~~~~aa~~a~~~gi~i~~Ig~G~~~g~~~~~g~~~~~~~d~~~L~~iA~~tgG~~~~~~~~~~L~ 280 (326)
T PRK13685 215 RGAYTAARTAKDQGVPISTISFGTPYGSVEINGQRQPVPVDDESLKKIAQLSGGEFYTAASLEELR 280 (326)
T ss_pred ccHHHHHHHHHHcCCeEEEEEECCCCCCcCcCCceeeecCCHHHHHHHHHhcCCEEEEcCCHHHHH
Confidence 2235667778899999999999853 34577776 5566655543
No 17
>cd01451 vWA_Magnesium_chelatase Magnesium chelatase: Mg-chelatase catalyses the insertion of Mg into protoporphyrin IX (Proto). In chlorophyll biosynthesis, insertion of Mg2+ into protoporphyrin IX is catalysed by magnesium chelatase in an ATP-dependent reaction. Magnesium chelatase is a three sub-unit (BchI, BchD and BchH) enzyme with a novel arrangement of domains: the C-terminal helical domain is located behind the nucleotide binding site. The BchD domain contains a AAA domain at its N-terminus and a VWA domain at its C-terminus. The VWA domain has been speculated to be involved in mediating protein-protein interactions.
Probab=95.30 E-value=1.3 Score=36.11 Aligned_cols=147 Identities=11% Similarity=0.077 Sum_probs=85.8
Q ss_pred eEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHH
Q 027721 15 LVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLL 94 (219)
Q Consensus 15 lLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~ 94 (219)
+++++||++.--... -.+..+-..+..|+.. .+....++.+|+...+....+.|.... .
T Consensus 2 ~v~lvlD~SgSM~~~--~rl~~ak~a~~~~~~~--~~~~~d~v~lv~F~~~~~~~~~~~t~~-----------------~ 60 (178)
T cd01451 2 LVIFVVDASGSMAAR--HRMAAAKGAVLSLLRD--AYQRRDKVALIAFRGTEAEVLLPPTRS-----------------V 60 (178)
T ss_pred eEEEEEECCccCCCc--cHHHHHHHHHHHHHHH--hhcCCCEEEEEEECCCCceEEeCCCCC-----------------H
Confidence 678999998755433 1244455555555432 345778999999876656666664321 2
Q ss_pred HHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhh---hHHHHHHHHhh
Q 027721 95 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQY---VAIMNAIFSAQ 171 (219)
Q Consensus 95 ~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QY---i~lMN~iFaAq 171 (219)
+.+.+.+..-. . ...+.|..||..|+-.+++..+. ..-+.+|+++|-..+...+. .....+.-.+.
T Consensus 61 ~~~~~~l~~l~---~------~G~T~l~~aL~~a~~~l~~~~~~--~~~~~~ivliTDG~~~~g~~~~~~~~~~~~~~l~ 129 (178)
T cd01451 61 ELAKRRLARLP---T------GGGTPLAAGLLAAYELAAEQARD--PGQRPLIVVITDGRANVGPDPTADRALAAARKLR 129 (178)
T ss_pred HHHHHHHHhCC---C------CCCCcHHHHHHHHHHHHHHHhcC--CCCceEEEEECCCCCCCCCCchhHHHHHHHHHHH
Confidence 22333443321 1 12567999999999887322111 12357888888655432221 11245566677
Q ss_pred hCCceeeEeeeCCC--Cchhhhhh
Q 027721 172 RSMVPIDSCYLGAQ--NSAFLQQC 193 (219)
Q Consensus 172 K~~V~IDvc~L~~~--~s~fLQQA 193 (219)
+.+|.|.+.-.+.. +..+|++.
T Consensus 130 ~~gi~v~~I~~~~~~~~~~~l~~i 153 (178)
T cd01451 130 ARGISALVIDTEGRPVRRGLAKDL 153 (178)
T ss_pred hcCCcEEEEeCCCCccCccHHHHH
Confidence 88999877776532 34577776
No 18
>cd01467 vWA_BatA_type VWA BatA type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=95.15 E-value=1.4 Score=35.48 Aligned_cols=136 Identities=10% Similarity=0.052 Sum_probs=70.9
Q ss_pred eEEEEEeCCccccccC---cccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHH
Q 027721 15 LVVVLLDTNPFFWSSS---SLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCA 91 (219)
Q Consensus 15 lLviIlD~np~~W~~~---~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~ 91 (219)
-++++||++.--...+ ...+.. ....+..++......++.++..+.. .+.+.|-...
T Consensus 4 ~vv~vlD~S~SM~~~~~~~~~r~~~----a~~~~~~~~~~~~~~~v~lv~f~~~-~~~~~~~~~~--------------- 63 (180)
T cd01467 4 DIMIALDVSGSMLAQDFVKPSRLEA----AKEVLSDFIDRRENDRIGLVVFAGA-AFTQAPLTLD--------------- 63 (180)
T ss_pred eEEEEEECCcccccccCCCCCHHHH----HHHHHHHHHHhCCCCeEEEEEEcCC-eeeccCCCcc---------------
Confidence 4789999988643221 111222 2223445555566788999988554 3333332111
Q ss_pred HHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhh
Q 027721 92 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ 171 (219)
Q Consensus 92 ~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAq 171 (219)
.+.+.+++..-..... ...+.+..||..|+-.+.+. ..-+..|++++-..+... +.....+.=.++
T Consensus 64 --~~~~~~~l~~l~~~~~------~g~T~l~~al~~a~~~l~~~-----~~~~~~iiliTDG~~~~g-~~~~~~~~~~~~ 129 (180)
T cd01467 64 --RESLKELLEDIKIGLA------GQGTAIGDAIGLAIKRLKNS-----EAKERVIVLLTDGENNAG-EIDPATAAELAK 129 (180)
T ss_pred --HHHHHHHHHHhhhccc------CCCCcHHHHHHHHHHHHHhc-----CCCCCEEEEEeCCCCCCC-CCCHHHHHHHHH
Confidence 1112222221111000 12455777777777665443 123467888886544322 222233344567
Q ss_pred hCCceeeEeeeCC
Q 027721 172 RSMVPIDSCYLGA 184 (219)
Q Consensus 172 K~~V~IDvc~L~~ 184 (219)
+.+|+|.++.+|.
T Consensus 130 ~~gi~i~~i~ig~ 142 (180)
T cd01467 130 NKGVRIYTIGVGK 142 (180)
T ss_pred HCCCEEEEEEecC
Confidence 7899999999985
No 19
>cd01466 vWA_C3HC4_type VWA C3HC4-type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most,
Probab=95.00 E-value=1.1 Score=35.84 Aligned_cols=140 Identities=15% Similarity=0.182 Sum_probs=82.2
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ 95 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~ 95 (219)
+++|||+++--.+. .+..+-.++. ++... +...+++.||..+.. .+.++|-..... .-++
T Consensus 3 v~~vlD~S~SM~~~---rl~~ak~a~~-~l~~~--l~~~~~~~li~F~~~-~~~~~~~~~~~~-------------~~~~ 62 (155)
T cd01466 3 LVAVLDVSGSMAGD---KLQLVKHALR-FVISS--LGDADRLSIVTFSTS-AKRLSPLRRMTA-------------KGKR 62 (155)
T ss_pred EEEEEECCCCCCcH---HHHHHHHHHH-HHHHh--CCCcceEEEEEecCC-ccccCCCcccCH-------------HHHH
Confidence 67899998866543 2444433333 33322 345678999997664 566666321110 0123
Q ss_pred HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCc
Q 027721 96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175 (219)
Q Consensus 96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V 175 (219)
.+.+.+..-.. ...+.+..||..|.-.+.+.. ......+|+++|-..+... .....+.+.+|
T Consensus 63 ~~~~~i~~~~~---------~g~T~~~~al~~a~~~~~~~~---~~~~~~~iillTDG~~~~~------~~~~~~~~~~v 124 (155)
T cd01466 63 SAKRVVDGLQA---------GGGTNVVGGLKKALKVLGDRR---QKNPVASIMLLSDGQDNHG------AVVLRADNAPI 124 (155)
T ss_pred HHHHHHHhccC---------CCCccHHHHHHHHHHHHhhcc---cCCCceEEEEEcCCCCCcc------hhhhcccCCCc
Confidence 44444443211 135678899999987765431 1234578999996544322 23456777899
Q ss_pred eeeEeeeCC-CCchhhhhh
Q 027721 176 PIDSCYLGA-QNSAFLQQC 193 (219)
Q Consensus 176 ~IDvc~L~~-~~s~fLQQA 193 (219)
+|-+.-+|. .+..+||+.
T Consensus 125 ~v~~igig~~~~~~~l~~i 143 (155)
T cd01466 125 PIHTFGLGASHDPALLAFI 143 (155)
T ss_pred eEEEEecCCCCCHHHHHHH
Confidence 999999973 355667664
No 20
>cd01469 vWA_integrins_alpha_subunit Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.
Probab=94.46 E-value=2.2 Score=34.77 Aligned_cols=140 Identities=8% Similarity=0.035 Sum_probs=72.0
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ 95 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~ 95 (219)
++++||.+--- + +-.|.++..-+.-|++.+-.-....+|+||..+.. .+..+|-..... .+
T Consensus 3 i~fvlD~S~S~-~--~~~f~~~k~fi~~~i~~l~~~~~~~rvgvv~fs~~-~~~~~~l~~~~~---------------~~ 63 (177)
T cd01469 3 IVFVLDGSGSI-Y--PDDFQKVKNFLSTVMKKLDIGPTKTQFGLVQYSES-FRTEFTLNEYRT---------------KE 63 (177)
T ss_pred EEEEEeCCCCC-C--HHHHHHHHHHHHHHHHHcCcCCCCcEEEEEEECCc-eeEEEecCccCC---------------HH
Confidence 68899985432 1 12344444445555553333344789999999776 455565332110 11
Q ss_pred HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCc
Q 027721 96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175 (219)
Q Consensus 96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V 175 (219)
.+.+.++.-.... ..+.+..||..|.-.+-...+...+..+.-+++++-+.+...... .-++-.|++.+|
T Consensus 64 ~~~~~i~~~~~~~--------g~T~~~~AL~~a~~~l~~~~~g~R~~~~kv~illTDG~~~~~~~~--~~~~~~~k~~gv 133 (177)
T cd01469 64 EPLSLVKHISQLL--------GLTNTATAIQYVVTELFSESNGARKDATKVLVVITDGESHDDPLL--KDVIPQAEREGI 133 (177)
T ss_pred HHHHHHHhCccCC--------CCccHHHHHHHHHHHhcCcccCCCCCCCeEEEEEeCCCCCCcccc--HHHHHHHHHCCc
Confidence 2223333211111 135566777666655422111111223334555665544333321 335567889999
Q ss_pred eeeEeeeCC
Q 027721 176 PIDSCYLGA 184 (219)
Q Consensus 176 ~IDvc~L~~ 184 (219)
.|-+..+|.
T Consensus 134 ~v~~Vgvg~ 142 (177)
T cd01469 134 IRYAIGVGG 142 (177)
T ss_pred EEEEEEecc
Confidence 999999974
No 21
>cd01458 vWA_ku Ku70/Ku80 N-terminal domain. The Ku78 heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks (DSB) in a preferred orientation. DSB's are repaired by either homologues recombination or non-homologues end joining and facilitate repair by the non-homologous end-joining pathway (NHEJ). The Ku heterodimer is required for accurate process that tends to preserve the sequence at the junction. Ku78 is found in all three kingdoms of life. However, only the eukaryotic proteins have a vWA domain fused to them at their N-termini. The vWA domain is not involved in DNA binding but may very likey mediate Ku78's interactions with other proteins. Members of this subgroup lack the conserved MIDAS motif.
Probab=94.32 E-value=2.8 Score=35.35 Aligned_cols=155 Identities=15% Similarity=0.151 Sum_probs=90.5
Q ss_pred CeEEEEEeCCcccccc----CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcce---------EEecCCCCCCCC
Q 027721 14 SLVVVLLDTNPFFWSS----SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD---------YVYDSSSTGNQS 80 (219)
Q Consensus 14 slLviIlD~np~~W~~----~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~---------~LYp~~~~~~~~ 80 (219)
.-++++||++|.=-.. .+..|..++..+.-|+..=+..+...++.+|+.++.... .++|-...
T Consensus 2 e~ivf~iDvS~SM~~~~~~~~~s~l~~a~~~i~~~~~~ki~~~~~D~vGlilf~t~~~~~~~~~~~i~v~~~l~~~---- 77 (218)
T cd01458 2 ESVVFLVDVSPSMFESKDGEYESPFEEALKCIRQLMKSKIISSPKDLVGVVFYGTEESKNPVGYENIYVLLDLDTP---- 77 (218)
T ss_pred cEEEEEEeCCHHHcCCCCCCCCChHHHHHHHHHHHHHhceeCCCCCeEEEEEEcccCCCCcCCCCceEEeecCCCC----
Confidence 3578999999854322 245688888888877777666788899999998776432 12221110
Q ss_pred CCCCCchhHHHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCc---
Q 027721 81 VGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP--- 157 (219)
Q Consensus 81 ~~~~~~~~~~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~--- 157 (219)
....++.+.+++.......... ......+.+..||-.|+=.+.+.. ....+.||+++|-.++..
T Consensus 78 ---------~~~~l~~l~~~~~~~~~~~~~~-~~~~~~~~l~~aL~~a~~~~~~~~---~~~~~k~IvL~TDg~~p~~~~ 144 (218)
T cd01458 78 ---------GAERVEDLKELIEPGGLSFAGQ-VGDSGQVSLSDALWVCLDLFSKGK---KKKSHKRIFLFTNNDDPHGGD 144 (218)
T ss_pred ---------CHHHHHHHHHHhhcchhhhccc-CCCCCCccHHHHHHHHHHHHHhcc---ccccccEEEEECCCCCCCCCC
Confidence 0123444444443211000000 001246778889988888877621 124567899999766542
Q ss_pred -hhhhHHHHHHHHhhhCCceeeEeeeCCC
Q 027721 158 -EQYVAIMNAIFSAQRSMVPIDSCYLGAQ 185 (219)
Q Consensus 158 -~QYi~lMN~iFaAqK~~V~IDvc~L~~~ 185 (219)
...-.+-..+=..++.+|.|.++.++..
T Consensus 145 ~~~~~~~~~~a~~l~~~gI~i~~i~i~~~ 173 (218)
T cd01458 145 SIKDSQAAVKAEDLKDKGIELELFPLSSP 173 (218)
T ss_pred HHHHHHHHHHHHHHHhCCcEEEEEecCCC
Confidence 1222223333345567999999999754
No 22
>PF13768 VWA_3: von Willebrand factor type A domain
Probab=93.90 E-value=2.5 Score=33.22 Aligned_cols=141 Identities=18% Similarity=0.189 Sum_probs=82.4
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ 95 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~ 95 (219)
+++|||++..--+. ...+-+.+..+++. +...-++.||+.+.+ .+.++|.... ..+.-++
T Consensus 3 vvilvD~S~Sm~g~----~~~~k~al~~~l~~---L~~~d~fnii~f~~~-~~~~~~~~~~------------~~~~~~~ 62 (155)
T PF13768_consen 3 VVILVDTSGSMSGE----KELVKDALRAILRS---LPPGDRFNIIAFGSS-VRPLFPGLVP------------ATEENRQ 62 (155)
T ss_pred EEEEEeCCCCCCCc----HHHHHHHHHHHHHh---CCCCCEEEEEEeCCE-eeEcchhHHH------------HhHHHHH
Confidence 68999997765443 14444455555555 667779999999774 5667776321 1112245
Q ss_pred HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEec-CCCCchhhhHHHHHHHHhhhCC
Q 027721 96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG-SPDGPEQYVAIMNAIFSAQRSM 174 (219)
Q Consensus 96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~-S~d~~~QYi~lMN~iFaAqK~~ 174 (219)
+..++++.-... ...+-+.+||-.|+--. .......+|++++- .++...+.+ ...+=. .+.+
T Consensus 63 ~a~~~I~~~~~~--------~G~t~l~~aL~~a~~~~------~~~~~~~~IilltDG~~~~~~~~i--~~~v~~-~~~~ 125 (155)
T PF13768_consen 63 EALQWIKSLEAN--------SGGTDLLAALRAALALL------QRPGCVRAIILLTDGQPVSGEEEI--LDLVRR-ARGH 125 (155)
T ss_pred HHHHHHHHhccc--------CCCccHHHHHHHHHHhc------ccCCCccEEEEEEeccCCCCHHHH--HHHHHh-cCCC
Confidence 555666652221 12466778877776432 12345677888883 334444332 333322 3367
Q ss_pred ceeeEeeeCCC-Cchhhhhh
Q 027721 175 VPIDSCYLGAQ-NSAFLQQC 193 (219)
Q Consensus 175 V~IDvc~L~~~-~s~fLQQA 193 (219)
++|.++.+|.. +..+|++.
T Consensus 126 ~~i~~~~~g~~~~~~~L~~L 145 (155)
T PF13768_consen 126 IRIFTFGIGSDADADFLREL 145 (155)
T ss_pred ceEEEEEECChhHHHHHHHH
Confidence 99999999863 55577664
No 23
>TIGR03436 acidobact_VWFA VWFA-related Acidobacterial domain. Members of this family are bacterial domains that include a region related to the von Willebrand factor type A (VWFA) domain (pfam00092). These domains are restricted to, and have undergone a large paralogous family expansion in, the Acidobacteria, including Solibacter usitatus and Acidobacterium capsulatum ATCC 51196.
Probab=92.59 E-value=6.8 Score=34.45 Aligned_cols=146 Identities=16% Similarity=0.119 Sum_probs=81.9
Q ss_pred CCCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHH
Q 027721 12 DVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCA 91 (219)
Q Consensus 12 ~~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~ 91 (219)
.|--++++||++.---. .+..+.+++..|++..+ ....+++|+..+. +.+.+.|-... .+
T Consensus 52 ~p~~vvlvlD~SgSM~~----~~~~a~~a~~~~l~~~l--~~~d~v~lv~f~~-~~~~~~~~t~~-------------~~ 111 (296)
T TIGR03436 52 LPLTVGLVIDTSGSMRN----DLDRARAAAIRFLKTVL--RPNDRVFVVTFNT-RLRLLQDFTSD-------------PR 111 (296)
T ss_pred CCceEEEEEECCCCchH----HHHHHHHHHHHHHHhhC--CCCCEEEEEEeCC-ceeEeecCCCC-------------HH
Confidence 36678999999754322 26667778888887644 5788999999875 35555543211 11
Q ss_pred HHHHHHHHHHHhhhhcCCC---C---CCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHH
Q 027721 92 TLLQNLEEFMNKDEQLGKQ---E---PEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMN 165 (219)
Q Consensus 92 ~v~~~l~~li~~~~~~~~~---~---~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN 165 (219)
.+.+.|.++ .....+ . .......+.|..||.+|.-...+..... ..-+..|++++-+.|..+++ .+--
T Consensus 112 ~l~~~l~~l----~~~~~~~~~~~~~~~~~~g~T~l~~al~~aa~~~~~~~~~~-~p~rk~iIllTDG~~~~~~~-~~~~ 185 (296)
T TIGR03436 112 LLEAALNRL----KPPLRTDYNSSGAFVRDGGGTALYDAITLAALEQLANALAG-IPGRKALIVISDGGDNRSRD-TLER 185 (296)
T ss_pred HHHHHHHhc----cCCCccccccccccccCCCcchhHHHHHHHHHHHHHHhhcC-CCCCeEEEEEecCCCcchHH-HHHH
Confidence 223333332 110000 0 0000134567777766643332221111 11234588888777765544 4455
Q ss_pred HHHHhhhCCceeeEeeeC
Q 027721 166 AIFSAQRSMVPIDSCYLG 183 (219)
Q Consensus 166 ~iFaAqK~~V~IDvc~L~ 183 (219)
++-.|++.+|+|-+.-++
T Consensus 186 ~~~~~~~~~v~vy~I~~~ 203 (296)
T TIGR03436 186 AIDAAQRADVAIYSIDAR 203 (296)
T ss_pred HHHHHHHcCCEEEEeccC
Confidence 666789999999999886
No 24
>PRK13406 bchD magnesium chelatase subunit D; Provisional
Probab=92.49 E-value=12 Score=37.13 Aligned_cols=150 Identities=11% Similarity=0.047 Sum_probs=91.4
Q ss_pred CCCCCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhH
Q 027721 10 SDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSL 89 (219)
Q Consensus 10 ~~~~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~ 89 (219)
...+.+++++||++---... -+..+-.++..|+.. +.....+|+||+.......++.|...+
T Consensus 398 ~~~~~~vvfvvD~SGSM~~~---rl~~aK~a~~~ll~~--ay~~rD~v~lI~F~g~~a~~~lppT~~------------- 459 (584)
T PRK13406 398 QRSETTTIFVVDASGSAALH---RLAEAKGAVELLLAE--AYVRRDQVALVAFRGRGAELLLPPTRS------------- 459 (584)
T ss_pred ccCCccEEEEEECCCCCcHh---HHHHHHHHHHHHHHh--hcCCCCEEEEEEECCCceeEEcCCCcC-------------
Confidence 34568999999998754222 355555555555543 345788999999988888999885542
Q ss_pred HHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCc---------hhh
Q 027721 90 CATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP---------EQY 160 (219)
Q Consensus 90 ~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~---------~QY 160 (219)
.+.+.+.+..- .. ...+.|+.||.+|+-...+..+. ...++|+++|-+.... ...
T Consensus 460 ----~~~~~~~L~~l---~~------gGgTpL~~gL~~A~~~l~~~~~~---~~~~~iVLlTDG~~n~~~~~~~~~~~~~ 523 (584)
T PRK13406 460 ----LVRAKRSLAGL---PG------GGGTPLAAGLDAAAALALQVRRK---GMTPTVVLLTDGRANIARDGTAGRAQAE 523 (584)
T ss_pred ----HHHHHHHHhcC---CC------CCCChHHHHHHHHHHHHHHhccC---CCceEEEEEeCCCCCCCccccccccchh
Confidence 22333333321 11 13688999999999887776432 3467888888544321 112
Q ss_pred hHHHHHHHHhhhCCceeeEeeeCCCCchhhhhh
Q 027721 161 VAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQC 193 (219)
Q Consensus 161 i~lMN~iFaAqK~~V~IDvc~L~~~~s~fLQQA 193 (219)
.....+.-.+.+.+|++=+.-.+.....++++.
T Consensus 524 ~~~~~~a~~~~~~gi~~~vId~g~~~~~~~~~L 556 (584)
T PRK13406 524 EDALAAARALRAAGLPALVIDTSPRPQPQARAL 556 (584)
T ss_pred hHHHHHHHHHHhcCCeEEEEecCCCCcHHHHHH
Confidence 233455566666777665555543343456555
No 25
>cd01476 VWA_integrin_invertebrates VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.
Probab=91.92 E-value=5.2 Score=31.55 Aligned_cols=137 Identities=9% Similarity=0.061 Sum_probs=73.8
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCC-cceEEecCCCCCCCCCCCCCchhHHHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN-SCDYVYDSSSTGNQSVGNGRMPSLCATLL 94 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~-~~~~LYp~~~~~~~~~~~~~~~~~~~~v~ 94 (219)
++++||++.-- .. .+.+..+.+..+++.........+++||..+.. .....+|-..... .
T Consensus 3 v~~llD~S~Sm-~~---~~~~~~~~~~~~~~~l~~~~~~~~v~lv~f~~~~~~~~~~~l~~~~~---------------~ 63 (163)
T cd01476 3 LLFVLDSSGSV-RG---KFEKYKKYIERIVEGLEIGPTATRVALITYSGRGRQRVRFNLPKHND---------------G 63 (163)
T ss_pred EEEEEeCCcch-hh---hHHHHHHHHHHHHHhcCCCCCCcEEEEEEEcCCCceEEEecCCCCCC---------------H
Confidence 57899996533 21 244455555555655444455799999998664 4566666432210 2
Q ss_pred HHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhh-C
Q 027721 95 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR-S 173 (219)
Q Consensus 95 ~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK-~ 173 (219)
+.+.+.+..-.... ..+.+..||..|.-.+.+.. ...+.....|++++-..... . .....-.+++ .
T Consensus 64 ~~l~~~i~~l~~~g--------g~T~l~~aL~~a~~~l~~~~-~~r~~~~~~villTDG~~~~-~---~~~~~~~l~~~~ 130 (163)
T cd01476 64 EELLEKVDNLRFIG--------GTTATGAAIEVALQQLDPSE-GRREGIPKVVVVLTDGRSHD-D---PEKQARILRAVP 130 (163)
T ss_pred HHHHHHHHhCccCC--------CCccHHHHHHHHHHHhcccc-CCCCCCCeEEEEECCCCCCC-c---hHHHHHHHhhcC
Confidence 23333333211111 24678888888887775321 11123445666676432211 1 1233344555 8
Q ss_pred CceeeEeeeCC
Q 027721 174 MVPIDSCYLGA 184 (219)
Q Consensus 174 ~V~IDvc~L~~ 184 (219)
+|.|-++.+|+
T Consensus 131 ~v~v~~vg~g~ 141 (163)
T cd01476 131 NIETFAVGTGD 141 (163)
T ss_pred CCEEEEEECCC
Confidence 88888888864
No 26
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=91.65 E-value=7.5 Score=32.89 Aligned_cols=147 Identities=10% Similarity=0.091 Sum_probs=73.8
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ 95 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~ 95 (219)
|+++||++---... .|.++..-+..+++..-.-...-+|.||..+.. .+..+|-..... .+
T Consensus 5 lvfllD~S~Sm~~~---~~~~~k~f~~~l~~~l~~~~~~~rvglv~fs~~-~~~~~~l~~~~~---------------~~ 65 (224)
T cd01475 5 LVFLIDSSRSVRPE---NFELVKQFLNQIIDSLDVGPDATRVGLVQYSST-VKQEFPLGRFKS---------------KA 65 (224)
T ss_pred EEEEEeCCCCCCHH---HHHHHHHHHHHHHHhcccCCCccEEEEEEecCc-eeEEecccccCC---------------HH
Confidence 78999996654322 355555555555555433344679999997665 555666432110 12
Q ss_pred HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCC--CceEE-EEecCCCCchhhhHHHHHHHHhhh
Q 027721 96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPRIL-CLQGSPDGPEQYVAIMNAIFSAQR 172 (219)
Q Consensus 96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~--~~RIL-vis~S~d~~~QYi~lMN~iFaAqK 172 (219)
.+.+.+..-.... ..+....||-.|+-..-...+...+.. -+|++ +++-+... . .+...+-.+++
T Consensus 66 ~l~~~i~~i~~~~--------~~t~tg~AL~~a~~~~~~~~~g~r~~~~~~~kvvillTDG~s~-~---~~~~~a~~lk~ 133 (224)
T cd01475 66 DLKRAVRRMEYLE--------TGTMTGLAIQYAMNNAFSEAEGARPGSERVPRVGIVVTDGRPQ-D---DVSEVAAKARA 133 (224)
T ss_pred HHHHHHHhCcCCC--------CCChHHHHHHHHHHHhCChhcCCCCCCCCCCeEEEEEcCCCCc-c---cHHHHHHHHHH
Confidence 2333333211111 122233344444322111000011111 14654 45433221 2 23455667889
Q ss_pred CCceeeEeeeCCCCchhhhhh
Q 027721 173 SMVPIDSCYLGAQNSAFLQQC 193 (219)
Q Consensus 173 ~~V~IDvc~L~~~~s~fLQQA 193 (219)
.+|.|-++-+|.-+...|++.
T Consensus 134 ~gv~i~~VgvG~~~~~~L~~i 154 (224)
T cd01475 134 LGIEMFAVGVGRADEEELREI 154 (224)
T ss_pred CCcEEEEEeCCcCCHHHHHHH
Confidence 999999999986566666665
No 27
>cd01450 vWFA_subfamily_ECM Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A
Probab=91.32 E-value=5.5 Score=30.64 Aligned_cols=148 Identities=11% Similarity=0.068 Sum_probs=80.7
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ 95 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~ 95 (219)
+++|||++.---.. .+..+.+.+-.|++........-++.++..+.+ .+...|..... + .+
T Consensus 3 i~~llD~S~Sm~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~li~f~~~-~~~~~~~~~~~-------~--------~~ 63 (161)
T cd01450 3 IVFLLDGSESVGPE---NFEKVKDFIEKLVEKLDIGPDKTRVGLVQYSDD-VRVEFSLNDYK-------S--------KD 63 (161)
T ss_pred EEEEEeCCCCcCHH---HHHHHHHHHHHHHHheeeCCCceEEEEEEEcCC-ceEEEECCCCC-------C--------HH
Confidence 56899986542111 233333334444444444445778888887664 44455543221 0 22
Q ss_pred HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCc
Q 027721 96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175 (219)
Q Consensus 96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V 175 (219)
++.+.+..-..... ..+.+..||..|.-++.+... .....+..|++++-+.+.... ..=..+-..++.+|
T Consensus 64 ~~~~~i~~~~~~~~-------~~t~~~~al~~a~~~~~~~~~-~~~~~~~~iiliTDG~~~~~~--~~~~~~~~~~~~~v 133 (161)
T cd01450 64 DLLKAVKNLKYLGG-------GGTNTGKALQYALEQLFSESN-ARENVPKVIIVLTDGRSDDGG--DPKEAAAKLKDEGI 133 (161)
T ss_pred HHHHHHHhcccCCC-------CCccHHHHHHHHHHHhccccc-ccCCCCeEEEEECCCCCCCCc--chHHHHHHHHHCCC
Confidence 33333332111100 146677888877777766532 122445667777766554332 34566777788899
Q ss_pred eeeEeeeCCCCchhhhh
Q 027721 176 PIDSCYLGAQNSAFLQQ 192 (219)
Q Consensus 176 ~IDvc~L~~~~s~fLQQ 192 (219)
+|.++.+|..+-..|.+
T Consensus 134 ~v~~i~~g~~~~~~l~~ 150 (161)
T cd01450 134 KVFVVGVGPADEEELRE 150 (161)
T ss_pred EEEEEeccccCHHHHHH
Confidence 99999997534444443
No 28
>cd01474 vWA_ATR ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.
Probab=91.03 E-value=7.5 Score=31.71 Aligned_cols=148 Identities=9% Similarity=-0.015 Sum_probs=77.8
Q ss_pred eEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHH
Q 027721 15 LVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLL 94 (219)
Q Consensus 15 lLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~ 94 (219)
=++++||++.---.. +.+.++-+..++..++ +...++.||..+.. .+.++|-.... +.+.
T Consensus 6 Dvv~llD~SgSm~~~----~~~~~~~~~~l~~~~~--~~~~rvglv~Fs~~-~~~~~~l~~~~-------------~~~~ 65 (185)
T cd01474 6 DLYFVLDKSGSVAAN----WIEIYDFVEQLVDRFN--SPGLRFSFITFSTR-ATKILPLTDDS-------------SAII 65 (185)
T ss_pred eEEEEEeCcCchhhh----HHHHHHHHHHHHHHcC--CCCcEEEEEEecCC-ceEEEeccccH-------------HHHH
Confidence 478999986543111 2222222333333332 35689999998754 56677743211 1122
Q ss_pred HHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCC
Q 027721 95 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 174 (219)
Q Consensus 95 ~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~ 174 (219)
+.+.. +..- .. ...+.+..||..|.-.+.+..+.. +.-...|++++-+.+............-.+++.+
T Consensus 66 ~~l~~-l~~~---~~------~g~T~~~~aL~~a~~~l~~~~~~~-r~~~~~villTDG~~~~~~~~~~~~~a~~l~~~g 134 (185)
T cd01474 66 KGLEV-LKKV---TP------SGQTYIHEGLENANEQIFNRNGGG-RETVSVIIALTDGQLLLNGHKYPEHEAKLSRKLG 134 (185)
T ss_pred HHHHH-Hhcc---CC------CCCCcHHHHHHHHHHHHHhhccCC-CCCCeEEEEEcCCCcCCCCCcchHHHHHHHHHcC
Confidence 22221 1110 00 125678888888876664322211 1122456667644432112222355666788899
Q ss_pred ceeeEeeeCCCCchhhhhh
Q 027721 175 VPIDSCYLGAQNSAFLQQC 193 (219)
Q Consensus 175 V~IDvc~L~~~~s~fLQQA 193 (219)
|.|-+..+++-+...|++.
T Consensus 135 v~i~~vgv~~~~~~~L~~i 153 (185)
T cd01474 135 AIVYCVGVTDFLKSQLINI 153 (185)
T ss_pred CEEEEEeechhhHHHHHHH
Confidence 9999998864455566666
No 29
>PF03731 Ku_N: Ku70/Ku80 N-terminal alpha/beta domain; InterPro: IPR005161 The Ku heterodimer (composed of Ku70 P12956 from SWISSPROT and Ku80 P13010 from SWISSPROT) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the N-terminal alpha/beta domain. This domain only makes a small contribution to the dimer interface. The domain comprises a six stranded beta sheet of the Rossman fold [].; PDB: 1JEQ_A 1JEY_A.
Probab=90.24 E-value=8.5 Score=32.34 Aligned_cols=158 Identities=15% Similarity=0.133 Sum_probs=83.5
Q ss_pred EEEEEeCCcccccc---CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHH
Q 027721 16 VVVLLDTNPFFWSS---SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCAT 92 (219)
Q Consensus 16 LviIlD~np~~W~~---~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~ 92 (219)
++++||++|.=-.. ....|..+++.+--++---...+....|.||.+++.++.=- + ...+-++......+...
T Consensus 2 ~vflID~s~sM~~~~~~~~~~l~~al~~i~~~~~~ki~~~~kD~vgvvl~gt~~t~n~-~---~~~~~~~i~~l~~l~~~ 77 (224)
T PF03731_consen 2 TVFLIDVSPSMFEPSSESESPLEEALKAIEDLMQQKIISSPKDEVGVVLFGTDETNNP-D---EDSGYENIFVLQPLDPP 77 (224)
T ss_dssp EEEEEE-SCGGGS-BTTCS-HHHHHHHHHHHHHHHHHHTT---EEEEEEES-SS-BST-----TTT-STTEEEEEECC--
T ss_pred EEEEEECCHHHCCCCCCcchhHHHHHHHHHHHHHHHHcCCCCCeEEEEEEcCCCCCCc-c---cccCCCceEEeecCCcc
Confidence 68999999986644 22279999999999999999999999999999987765310 0 00000000000011001
Q ss_pred HHHHHHHHHHhhhh------cCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCC---CchhhhHH
Q 027721 93 LLQNLEEFMNKDEQ------LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPD---GPEQYVAI 163 (219)
Q Consensus 93 v~~~l~~li~~~~~------~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d---~~~QYi~l 163 (219)
-.+.|+++.+.... ... ......+..||-.|.-.+.+.. ......+.||++++-.++ ..+++-.+
T Consensus 78 ~~~~l~~L~~~~~~~~~~~~~~~-----~~~~~~l~~al~v~~~~~~~~~-~~~k~~~krI~l~Td~d~p~~~~~~~~~~ 151 (224)
T PF03731_consen 78 SAERLKELEELLKPGDKFENFFS-----GSDEGDLSDALWVASDMFRERT-CKKKKNKKRIFLFTDNDGPHEDDDELERI 151 (224)
T ss_dssp BHHHHHHHHTTSHHHHHHHHHC------SSS---HHHHHHHHHHHHHCHC-TTS-ECEEEEEEEES-SSTTT-CCCHHHH
T ss_pred CHHHHHHHHHhhcccccccccCC-----CCCccCHHHHHHHHHHHHHHHh-hcccCCCcEEEEEeCCCCCCCCHHHHHHH
Confidence 12223333222111 001 1234567777777777665521 122356789999995433 34555444
Q ss_pred HHH--HHHhhhCCceeeEeeeC
Q 027721 164 MNA--IFSAQRSMVPIDSCYLG 183 (219)
Q Consensus 164 MN~--iFaAqK~~V~IDvc~L~ 183 (219)
..- +--.+..+|.|+++.|+
T Consensus 152 ~~~l~~~Dl~~~~i~~~~~~l~ 173 (224)
T PF03731_consen 152 IQKLKAKDLQDNGIEIELFFLP 173 (224)
T ss_dssp HHHHHHHHHHHHTEEEEEEECT
T ss_pred HHhhccccchhcCcceeEeecC
Confidence 444 55678899999999993
No 30
>cd01463 vWA_VGCC_like VWA Voltage gated Calcium channel like: Voltage-gated calcium channels are a complex of five proteins: alpha 1, beta 1, gamma, alpha 2 and delta. The alpha 2 and delta subunits result from proteolytic processing of a single gene product and carries at its N-terminus the VWA and cache domains, The alpha 2 delta gene family has orthologues in D. melanogaster and C. elegans but none have been detected in aither A. thaliana or yeast. The exact biochemical function of the VWA domain is not known but the alpha 2 delta complex has been shown to regulate various functional properties of the channel complex.
Probab=89.65 E-value=10 Score=31.06 Aligned_cols=153 Identities=14% Similarity=0.110 Sum_probs=81.7
Q ss_pred CCCCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHH
Q 027721 11 DDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLC 90 (219)
Q Consensus 11 ~~~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~ 90 (219)
..|.-++++||++.--.+. .++.+-..+..+++ .+....++.|++.+.. .+.++|.......... ..-
T Consensus 11 ~~p~~vv~llD~SgSM~~~---~l~~ak~~~~~ll~---~l~~~d~v~lv~F~~~-~~~~~~~~~~~~~~~~----~~~- 78 (190)
T cd01463 11 TSPKDIVILLDVSGSMTGQ---RLHLAKQTVSSILD---TLSDNDFFNIITFSNE-VNPVVPCFNDTLVQAT----TSN- 78 (190)
T ss_pred cCCceEEEEEECCCCCCcH---HHHHHHHHHHHHHH---hCCCCCEEEEEEeCCC-eeEEeeecccceEecC----HHH-
Confidence 4578899999998754332 24444444444443 4566779999987665 5666664322100000 000
Q ss_pred HHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhh---c-cCCCCCCceEEEEecCCCCchhhhHHHHH
Q 027721 91 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVF---R-SGLLHPQPRILCLQGSPDGPEQYVAIMNA 166 (219)
Q Consensus 91 ~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~---~-~~~~~~~~RILvis~S~d~~~QYi~lMN~ 166 (219)
.+.+.+.+..- ... ..+.+..||-.|+-.+.+.. . .........|++++-+.+....-
T Consensus 79 ---~~~~~~~l~~l---~~~------G~T~~~~al~~a~~~l~~~~~~~~~~~~~~~~~~iillTDG~~~~~~~------ 140 (190)
T cd01463 79 ---KKVLKEALDML---EAK------GIANYTKALEFAFSLLLKNLQSNHSGSRSQCNQAIMLITDGVPENYKE------ 140 (190)
T ss_pred ---HHHHHHHHhhC---CCC------CcchHHHHHHHHHHHHHHhhhcccccccCCceeEEEEEeCCCCCcHhH------
Confidence 22333333221 111 24567888888887666521 1 11123445788888554432211
Q ss_pred HHHhh---h---CCceeeEeeeCCC--Cchhhhhh
Q 027721 167 IFSAQ---R---SMVPIDSCYLGAQ--NSAFLQQC 193 (219)
Q Consensus 167 iFaAq---K---~~V~IDvc~L~~~--~s~fLQQA 193 (219)
++.+. + .+|.|-++.+|.+ +..+||+.
T Consensus 141 ~~~~~~~~~~~~~~v~i~tigiG~~~~d~~~L~~l 175 (190)
T cd01463 141 IFDKYNWDKNSEIPVRVFTYLIGREVTDRREIQWM 175 (190)
T ss_pred HHHHhcccccCCCcEEEEEEecCCccccchHHHHH
Confidence 22222 2 2588888888854 46788876
No 31
>cd01462 VWA_YIEM_type VWA YIEM type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=88.71 E-value=9.8 Score=29.67 Aligned_cols=138 Identities=16% Similarity=0.169 Sum_probs=74.3
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ 95 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~ 95 (219)
++|+||+++---+. .+...-..+..++. .......++.||..+.......++.... ..
T Consensus 3 v~illD~SgSM~~~---k~~~a~~~~~~l~~--~~~~~~~~v~li~F~~~~~~~~~~~~~~-----------------~~ 60 (152)
T cd01462 3 VILLVDQSGSMYGA---PEEVAKAVALALLR--IALAENRDTYLILFDSEFQTKIVDKTDD-----------------LE 60 (152)
T ss_pred EEEEEECCCCCCCC---HHHHHHHHHHHHHH--HHHHcCCcEEEEEeCCCceEEecCCccc-----------------HH
Confidence 68899998754322 12222222333333 2233567899998876632222221110 12
Q ss_pred HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCC-CCchhhhHHHHHHHHhhhCC
Q 027721 96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSP-DGPEQYVAIMNAIFSAQRSM 174 (219)
Q Consensus 96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~-d~~~QYi~lMN~iFaAqK~~ 174 (219)
++.+.+... . . ...+.+..||..++-.+.+.. .-+.+|+++|-.. +..... ++.....+++.+
T Consensus 61 ~~~~~l~~~-~--~------~ggT~l~~al~~a~~~l~~~~-----~~~~~ivliTDG~~~~~~~~--~~~~~~~~~~~~ 124 (152)
T cd01462 61 EPVEFLSGV-Q--L------GGGTDINKALRYALELIERRD-----PRKADIVLITDGYEGGVSDE--LLREVELKRSRV 124 (152)
T ss_pred HHHHHHhcC-C--C------CCCcCHHHHHHHHHHHHHhcC-----CCCceEEEECCCCCCCCCHH--HHHHHHHHHhcC
Confidence 222222211 1 1 124668888888876665531 2346899998663 322222 235567778889
Q ss_pred ceeeEeeeCCCCchhhh
Q 027721 175 VPIDSCYLGAQNSAFLQ 191 (219)
Q Consensus 175 V~IDvc~L~~~~s~fLQ 191 (219)
+.|-+..+|...-..|.
T Consensus 125 ~~v~~~~~g~~~~~~~~ 141 (152)
T cd01462 125 ARFVALALGDHGNPGYD 141 (152)
T ss_pred cEEEEEEecCCCCchHH
Confidence 99999999865444443
No 32
>cd01471 vWA_micronemal_protein Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.
Probab=86.33 E-value=16 Score=29.53 Aligned_cols=73 Identities=8% Similarity=-0.012 Sum_probs=44.1
Q ss_pred ccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCceeeEeeeCCC-Cchhhhhh
Q 027721 118 CSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQC 193 (219)
Q Consensus 118 ~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~-~s~fLQQA 193 (219)
.+.++.||-.|.-.+.+. +...+....+|++++-+...... ......-.+++.+|.|-++.+|.. +..+|+..
T Consensus 84 ~T~l~~aL~~a~~~l~~~-~~~r~~~~~~villTDG~~~~~~--~~~~~a~~l~~~gv~v~~igiG~~~d~~~l~~i 157 (186)
T cd01471 84 STNTTSALLVVEKHLFDT-RGNRENAPQLVIIMTDGIPDSKF--RTLKEARKLRERGVIIAVLGVGQGVNHEENRSL 157 (186)
T ss_pred CccHHHHHHHHHHHhhcc-CCCcccCceEEEEEccCCCCCCc--chhHHHHHHHHCCCEEEEEEeehhhCHHHHHHh
Confidence 567888888887666543 22223445677778754432211 223445566677898888888743 55667654
No 33
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=84.07 E-value=57 Score=33.95 Aligned_cols=151 Identities=12% Similarity=0.095 Sum_probs=91.2
Q ss_pred eEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHH
Q 027721 15 LVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLL 94 (219)
Q Consensus 15 lLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~ 94 (219)
.+++|||++.-=-+.+ .+..+-.++..|+-.- +...-++.||.-+.. +..+.|-..-.. ...+
T Consensus 306 ~VVLVLDvSGSM~g~d--RL~~lkqAA~~fL~~~--l~~~DrVGLVtFsss-A~vl~pLt~Its------------~~dr 368 (863)
T TIGR00868 306 IVCLVLDKSGSMTVED--RLKRMNQAAKLFLLQT--VEKGSWVGMVTFDSA-AYIKNELIQITS------------SAER 368 (863)
T ss_pred eEEEEEECCccccccC--HHHHHHHHHHHHHHHh--CCCCCEEEEEEECCc-eeEeeccccCCc------------HHHH
Confidence 5889999976532221 2444444455554332 456789999998765 445555322110 0113
Q ss_pred HHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCC
Q 027721 95 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 174 (219)
Q Consensus 95 ~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~ 174 (219)
+.+.+.+... . ...+.|..||.+|+-.+++..+ .....+|++++-+.|... -.|+=.+++.+
T Consensus 369 ~aL~~~L~~~---A-------~GGT~I~~GL~~Alq~L~~~~~---~~~~~~IILLTDGedn~~-----~~~l~~lk~~g 430 (863)
T TIGR00868 369 DALTANLPTA---A-------SGGTSICSGLKAAFQVIKKSYQ---STDGSEIVLLTDGEDNTI-----SSCFEEVKQSG 430 (863)
T ss_pred HHHHHhhccc---c-------CCCCcHHHHHHHHHHHHHhccc---ccCCCEEEEEeCCCCCCH-----HHHHHHHHHcC
Confidence 3333333211 1 1357799999999988877532 123467888887666432 24555677889
Q ss_pred ceeeEeeeCCCCchhhhhh--------ccccchh
Q 027721 175 VPIDSCYLGAQNSAFLQQC--------MSDRLLT 200 (219)
Q Consensus 175 V~IDvc~L~~~~s~fLQQA--------~~~~~~~ 200 (219)
|+|.+..+|......|++. +.+.+.+
T Consensus 431 VtI~TIg~G~dad~~L~~IA~~TGG~~f~asd~~ 464 (863)
T TIGR00868 431 AIIHTIALGPSAAKELEELSDMTGGLRFYASDQA 464 (863)
T ss_pred CEEEEEEeCCChHHHHHHHHHhcCCEEEEeCCHH
Confidence 9999999997666678877 6666554
No 34
>TIGR02031 BchD-ChlD magnesium chelatase ATPase subunit D. This model represents one of two ATPase subunits of the trimeric magnesium chelatase responsible for insertion of magnesium ion into protoporphyrin IX. This is an essential step in the biosynthesis of both chlorophyll and bacteriochlorophyll. This subunit is found in green plants, photosynthetic algae, cyanobacteria and other photosynthetic bacteria. Unlike subunit I (TIGR02030), this subunit is not found in archaea.
Probab=83.60 E-value=22 Score=35.03 Aligned_cols=109 Identities=15% Similarity=0.117 Sum_probs=66.1
Q ss_pred CCCCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHH
Q 027721 11 DDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLC 90 (219)
Q Consensus 11 ~~~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~ 90 (219)
....+++++||++--- . .-.+..+-..+..|++ .+.....+|++|+.......+++|...+
T Consensus 405 ~~~~~v~fvvD~SGSM-~--~~rl~~aK~av~~Ll~--~~~~~~D~v~Li~F~~~~a~~~lp~t~~-------------- 465 (589)
T TIGR02031 405 KSGRLLIFVVDASGSA-A--VARMSEAKGAVELLLG--EAYVHRDQVSLIAFRGTAAEVLLPPSRS-------------- 465 (589)
T ss_pred ccCceEEEEEECCCCC-C--hHHHHHHHHHHHHHHH--hhccCCCEEEEEEECCCCceEECCCCCC--------------
Confidence 3345678999998733 1 1135555555555543 2344667999999977767777775322
Q ss_pred HHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecC
Q 027721 91 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGS 153 (219)
Q Consensus 91 ~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S 153 (219)
.+.+.+.+..-. . ...+.|+.||.+|+-.+.+..+. ..+++|+++|-+
T Consensus 466 ---~~~~~~~L~~l~---~------gGgTpL~~gL~~A~~~~~~~~~~---~~~~~ivllTDG 513 (589)
T TIGR02031 466 ---VEQAKRRLDVLP---G------GGGTPLAAGLAAAFQTALQARSS---GGTPTIVLITDG 513 (589)
T ss_pred ---HHHHHHHHhcCC---C------CCCCcHHHHHHHHHHHHHHhccc---CCceEEEEECCC
Confidence 222333333211 1 13578999999999887765332 456788888843
No 35
>cd01464 vWA_subfamily VWA subfamily: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=80.75 E-value=28 Score=28.00 Aligned_cols=144 Identities=16% Similarity=0.151 Sum_probs=77.9
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHh---cCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILT---LNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCAT 92 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~---~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~ 92 (219)
++++||++.---+. .+..+-.++-.|+..... ....-++.||..+. +++.+.|-......
T Consensus 6 v~~llD~SgSM~~~---~~~~~k~a~~~~~~~l~~~~~~~~~~~v~ii~F~~-~a~~~~~l~~~~~~------------- 68 (176)
T cd01464 6 IYLLLDTSGSMAGE---PIEALNQGLQMLQSELRQDPYALESVEISVITFDS-AARVIVPLTPLESF------------- 68 (176)
T ss_pred EEEEEECCCCCCCh---HHHHHHHHHHHHHHHHhcChhhccccEEEEEEecC-CceEecCCccHHhc-------------
Confidence 57899997643222 244444444444443221 12355899998866 46777774321100
Q ss_pred HHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhcc----CCCCCCceEEEEecCCCCchhhhHHHHHHH
Q 027721 93 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS----GLLHPQPRILCLQGSPDGPEQYVAIMNAIF 168 (219)
Q Consensus 93 v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~----~~~~~~~RILvis~S~d~~~QYi~lMN~iF 168 (219)
... .... ...+.+..||-.|+-.+.+.... .....+++|+++|-+.+....- ...-.+=
T Consensus 69 ---~~~-------~l~~------~GgT~l~~aL~~a~~~l~~~~~~~~~~~~~~~~~~iillTDG~~~~~~~-~~~~~~~ 131 (176)
T cd01464 69 ---QPP-------RLTA------SGGTSMGAALELALDCIDRRVQRYRADQKGDWRPWVFLLTDGEPTDDLT-AAIERIK 131 (176)
T ss_pred ---CCC-------cccC------CCCCcHHHHHHHHHHHHHHHHHHhcccCcCCcCcEEEEEcCCCCCchHH-HHHHHHH
Confidence 000 0000 12466888888888777553211 1123456788888655432211 1123444
Q ss_pred HhhhCCceeeEeeeCC-CCchhhhhh
Q 027721 169 SAQRSMVPIDSCYLGA-QNSAFLQQC 193 (219)
Q Consensus 169 aAqK~~V~IDvc~L~~-~~s~fLQQA 193 (219)
.+++.++.|-++-+|. -+..+|++.
T Consensus 132 ~~~~~~~~i~~igiG~~~~~~~L~~i 157 (176)
T cd01464 132 EARDSKGRIVACAVGPKADLDTLKQI 157 (176)
T ss_pred hhcccCCcEEEEEeccccCHHHHHHH
Confidence 5566689999999984 356677766
No 36
>cd01456 vWA_ywmD_type VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=78.72 E-value=36 Score=28.13 Aligned_cols=160 Identities=12% Similarity=0.075 Sum_probs=85.9
Q ss_pred CCCCCCCeEEEEEeCCcccccc---CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCc-----ceEEecCCCCCCC
Q 027721 8 LYSDDVSLVVVLLDTNPFFWSS---SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS-----CDYVYDSSSTGNQ 79 (219)
Q Consensus 8 ~~~~~~slLviIlD~np~~W~~---~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~-----~~~LYp~~~~~~~ 79 (219)
.....+.-+++|||++.-=... ..--+..+-..+..|++. +...+++.++..+... .+.+.|.......
T Consensus 15 ~~~~~~~~vv~vlD~SgSM~~~~~~~~~rl~~ak~a~~~~l~~---l~~~~~v~lv~F~~~~~~~~~~~~~~p~~~~~~~ 91 (206)
T cd01456 15 TEPQLPPNVAIVLDNSGSMREVDGGGETRLDNAKAALDETANA---LPDGTRLGLWTFSGDGDNPLDVRVLVPKGCLTAP 91 (206)
T ss_pred cccCCCCcEEEEEeCCCCCcCCCCCcchHHHHHHHHHHHHHHh---CCCCceEEEEEecCCCCCCccccccccccccccc
Confidence 4455677899999997764421 123466666666666664 5678899999976531 2233321100000
Q ss_pred CCCCCCchhHHHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchh
Q 027721 80 SVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQ 159 (219)
Q Consensus 80 ~~~~~~~~~~~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~Q 159 (219)
..+.. ..- .+.+.+.+..-.. . ...+.|..||..|+-++. ..-..+|++||-+.+...+
T Consensus 92 ~~~~~--~~~----~~~l~~~i~~i~~--~------~G~T~l~~aL~~a~~~l~-------~~~~~~iillTDG~~~~~~ 150 (206)
T cd01456 92 VNGFP--SAQ----RSALDAALNSLQT--P------TGWTPLAAALAEAAAYVD-------PGRVNVVVLITDGEDTCGP 150 (206)
T ss_pred cCCCC--ccc----HHHHHHHHHhhcC--C------CCcChHHHHHHHHHHHhC-------CCCcceEEEEcCCCccCCC
Confidence 00000 001 2333333333210 1 135678899988887764 1112688888866554322
Q ss_pred -hhHHHHHHHHhhh----CCceeeEeeeCCC-Cchhhhhh
Q 027721 160 -YVAIMNAIFSAQR----SMVPIDSCYLGAQ-NSAFLQQC 193 (219)
Q Consensus 160 -Yi~lMN~iFaAqK----~~V~IDvc~L~~~-~s~fLQQA 193 (219)
...... --+++ .+|+|.++.+|.+ +..+|++.
T Consensus 151 ~~~~~~~--~~~~~~~~~~~i~i~~igiG~~~~~~~l~~i 188 (206)
T cd01456 151 DPCEVAR--ELAKRRTPAPPIKVNVIDFGGDADRAELEAI 188 (206)
T ss_pred CHHHHHH--HHHHhcCCCCCceEEEEEecCcccHHHHHHH
Confidence 222222 22333 5999999999854 35566665
No 37
>PF00092 VWA: von Willebrand factor type A domain; InterPro: IPR002035 The von Willebrand factor is a large multimeric glycoprotein found in blood plasma. Mutant forms are involved in the aetiology of bleeding disorders []. In von Willebrand factor, the type A domain (vWF) is the prototype for a protein superfamily. The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. Fold recognition algorithms were used to score sequence compatibility with a library of known structures: the vWF domain fold was predicted to be a doubly-wound, open, twisted beta-sheet flanked by alpha-helices []. 3D structures have been determined for the I-domains of integrins CD11b (with bound magnesium) [] and CD11a (with bound manganese) []. The domain adopts a classic alpha/beta Rossmann fold and contains an unusual metal ion coordination site at its surface. It has been suggested that this site represents a general metal ion-dependent adhesion site (MIDAS) for binding protein ligands []. The residues constituting the MIDAS motif in the CD11b and CD11a I-domains are completely conserved, but the manner in which the metal ion is coordinated differs slightly [].; GO: 0005515 protein binding; PDB: 2XGG_B 3ZQK_B 3GXB_A 3PPV_A 3PPX_A 3PPW_A 3PPY_A 1CQP_B 3TCX_B 2ICA_A ....
Probab=78.39 E-value=30 Score=26.97 Aligned_cols=148 Identities=11% Similarity=0.099 Sum_probs=77.0
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ 95 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~ 95 (219)
++++||.+..--.. .+.++.+.+..|++.+......-+++|+..+... +.+++-..... .+
T Consensus 2 ivflvD~S~sm~~~---~~~~~~~~v~~~i~~~~~~~~~~rv~iv~f~~~~-~~~~~~~~~~~---------------~~ 62 (178)
T PF00092_consen 2 IVFLVDTSGSMSGD---NFEKAKQFVKSIISRLSISNNGTRVGIVTFSDSA-RVLFSLTDYQS---------------KN 62 (178)
T ss_dssp EEEEEE-STTSCHH---HHHHHHHHHHHHHHHSTBSTTSEEEEEEEESSSE-EEEEETTSHSS---------------HH
T ss_pred EEEEEeCCCCCchH---HHHHHHHHHHHHHHhhhccccccccceeeeeccc-ccccccccccc---------------cc
Confidence 58899986543222 3666677777777755577888899998886663 45555332210 22
Q ss_pred HHHHHH-HhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCC
Q 027721 96 NLEEFM-NKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 174 (219)
Q Consensus 96 ~l~~li-~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~ 174 (219)
++.+.+ ..... . ...+.++.||-.|..++....+...+..+.-|++++-+.......... .-...+..
T Consensus 63 ~~~~~i~~~~~~-~-------~g~t~~~~aL~~a~~~l~~~~~~~r~~~~~~iiliTDG~~~~~~~~~~---~~~~~~~~ 131 (178)
T PF00092_consen 63 DLLNAINDSIPS-S-------GGGTNLGAALKFAREQLFSSNNGGRPNSPKVIILITDGNSNDSDSPSE---EAANLKKS 131 (178)
T ss_dssp HHHHHHHTTGGC-C-------BSSB-HHHHHHHHHHHTTSGGGTTGTTSEEEEEEEESSSSSSHSGHHH---HHHHHHHH
T ss_pred cccccccccccc-c-------chhhhHHHHHhhhhhcccccccccccccccceEEEEeecccCCcchHH---HHHHHHHh
Confidence 233322 21111 1 235668899999888877652222223333455577666555433221 11111111
Q ss_pred ceeeEeeeCC--CCchhhhhh
Q 027721 175 VPIDSCYLGA--QNSAFLQQC 193 (219)
Q Consensus 175 V~IDvc~L~~--~~s~fLQQA 193 (219)
.-|-++.+|. .+...|+..
T Consensus 132 ~~i~~~~ig~~~~~~~~l~~l 152 (178)
T PF00092_consen 132 NGIKVIAIGIDNADNEELREL 152 (178)
T ss_dssp CTEEEEEEEESCCHHHHHHHH
T ss_pred cCcEEEEEecCcCCHHHHHHH
Confidence 4555666653 344555554
No 38
>cd01470 vWA_complement_factors Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.
Probab=76.98 E-value=39 Score=27.65 Aligned_cols=168 Identities=8% Similarity=0.059 Sum_probs=86.0
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ 95 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~ 95 (219)
++++||++--- . .-.|..+-..+..|++........-++.||..+.. .+.++|-..... .+ .+
T Consensus 3 i~~vlD~SgSM-~--~~~~~~~k~~~~~l~~~l~~~~~~~~v~li~Fs~~-~~~~~~~~~~~~-----~~--------~~ 65 (198)
T cd01470 3 IYIALDASDSI-G--EEDFDEAKNAIKTLIEKISSYEVSPRYEIISYASD-PKEIVSIRDFNS-----ND--------AD 65 (198)
T ss_pred EEEEEECCCCc-c--HHHHHHHHHHHHHHHHHccccCCCceEEEEEecCC-ceEEEecccCCC-----CC--------HH
Confidence 67899995432 2 12455566666666665433334568888887665 455666432110 00 12
Q ss_pred HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhcc---CCCCCCceEEEEecCCCC----chhhhHHHHHHH
Q 027721 96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS---GLLHPQPRILCLQGSPDG----PEQYVAIMNAIF 168 (219)
Q Consensus 96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~---~~~~~~~RILvis~S~d~----~~QYi~lMN~iF 168 (219)
.+.+.+..-...... ....+.+..||.+|+-.+.+.... .....+.+|+++|-+.+. +.+....+--..
T Consensus 66 ~~~~~l~~~~~~~~~----~~ggT~~~~Al~~~~~~l~~~~~~~~~~~~~~~~~iillTDG~~~~g~~~~~~~~~~~~~~ 141 (198)
T cd01470 66 DVIKRLEDFNYDDHG----DKTGTNTAAALKKVYERMALEKVRNKEAFNETRHVIILFTDGKSNMGGSPLPTVDKIKNLV 141 (198)
T ss_pred HHHHHHHhCCccccc----CccchhHHHHHHHHHHHHHHHHhcCccchhhcceEEEEEcCCCcCCCCChhHHHHHHHHHH
Confidence 233333321110000 012456788887776555332111 112356678888865432 222222222111
Q ss_pred -------HhhhCCceeeEeeeCCC-Cchhhhhh----------ccccchhHHHH
Q 027721 169 -------SAQRSMVPIDSCYLGAQ-NSAFLQQC----------MSDRLLTLLVE 204 (219)
Q Consensus 169 -------aAqK~~V~IDvc~L~~~-~s~fLQQA----------~~~~~~~gLlQ 204 (219)
.+++.+|.|-++-+|.. +...|++. +.+.+++.|-+
T Consensus 142 ~~~~~~~~~~~~~v~i~~iGvG~~~~~~~L~~iA~~~~g~~~~f~~~~~~~l~~ 195 (198)
T cd01470 142 YKNNKSDNPREDYLDVYVFGVGDDVNKEELNDLASKKDNERHFFKLKDYEDLQE 195 (198)
T ss_pred hcccccccchhcceeEEEEecCcccCHHHHHHHhcCCCCCceEEEeCCHHHHHH
Confidence 22556899999999854 66788776 44566665543
No 39
>COG1240 ChlD Mg-chelatase subunit ChlD [Coenzyme metabolism]
Probab=76.71 E-value=19 Score=32.21 Aligned_cols=111 Identities=16% Similarity=0.186 Sum_probs=71.3
Q ss_pred CCCCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHH
Q 027721 11 DDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLC 90 (219)
Q Consensus 11 ~~~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~ 90 (219)
-...|.++|+|++--=-..+ -+..+=-.++.|+ |=+-.+.++|+||+...++++++.|...+
T Consensus 76 r~g~lvvfvVDASgSM~~~~--Rm~aaKG~~~~lL--~dAYq~RdkvavI~F~G~~A~lll~pT~s-------------- 137 (261)
T COG1240 76 RAGNLIVFVVDASGSMAARR--RMAAAKGAALSLL--RDAYQRRDKVAVIAFRGEKAELLLPPTSS-------------- 137 (261)
T ss_pred CcCCcEEEEEeCcccchhHH--HHHHHHHHHHHHH--HHHHHccceEEEEEecCCcceEEeCCccc--------------
Confidence 34568889999964322111 1222222333333 23557899999999999999999997654
Q ss_pred HHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEec
Q 027721 91 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 152 (219)
Q Consensus 91 ~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~ 152 (219)
++.+++.+.+-.. -..++|+-||-+|.--+-|..+.. +...+.++|||-
T Consensus 138 ---v~~~~~~L~~l~~---------GG~TPL~~aL~~a~ev~~r~~r~~-p~~~~~~vviTD 186 (261)
T COG1240 138 ---VELAERALERLPT---------GGKTPLADALRQAYEVLAREKRRG-PDRRPVMVVITD 186 (261)
T ss_pred ---HHHHHHHHHhCCC---------CCCCchHHHHHHHHHHHHHhhccC-CCcceEEEEEeC
Confidence 3344444443211 136789999999999998877642 356788888883
No 40
>TIGR03788 marine_srt_targ marine proteobacterial sortase target protein. Members of this protein family are restricted to the Proteobacteria. Each contains a C-terminal sortase-recognition motif, transmembrane domain, and basic residues cluster at the the C-terminus, and is encoded adjacent to a sortase gene. This protein is frequently the only sortase target in its genome, which is as unusual its occurrence in Gram-negative rather than Gram-positive genomes. Many bacteria with this system are marine. In addition to the LPXTG signal, members carry a vault protein inter-alpha-trypsin inhibitor domain (pfam08487) and a von Willebrand factor type A domain (pfam00092).
Probab=66.26 E-value=1.4e+02 Score=29.34 Aligned_cols=148 Identities=17% Similarity=0.172 Sum_probs=83.7
Q ss_pred CCCCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHH
Q 027721 11 DDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLC 90 (219)
Q Consensus 11 ~~~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~ 90 (219)
..|.-+++|||++----+. .+..+-.++..+++ .+...-++.||+.+.. .+.++|...... ..
T Consensus 269 ~~p~~vvfvlD~SgSM~g~---~i~~ak~al~~~l~---~L~~~d~~~ii~F~~~-~~~~~~~~~~~~--------~~-- 331 (596)
T TIGR03788 269 VLPRELVFVIDTSGSMAGE---SIEQAKSALLLALD---QLRPGDRFNIIQFDSD-VTLLFPVPVPAT--------AH-- 331 (596)
T ss_pred CCCceEEEEEECCCCCCCc---cHHHHHHHHHHHHH---hCCCCCEEEEEEECCc-ceEeccccccCC--------HH--
Confidence 3455688999997654433 35555554444443 4566779999997555 567787543211 00
Q ss_pred HHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHh
Q 027721 91 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSA 170 (219)
Q Consensus 91 ~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaA 170 (219)
=++++.+++..-... ..+.+..||-.|+-.. .........+|+++|-+.....+ .++..+ .+
T Consensus 332 --~~~~a~~~i~~l~a~---------GgT~l~~aL~~a~~~~----~~~~~~~~~~iillTDG~~~~~~--~~~~~~-~~ 393 (596)
T TIGR03788 332 --NLARARQFVAGLQAD---------GGTEMAGALSAALRDD----GPESSGALRQVVFLTDGAVGNED--ALFQLI-RT 393 (596)
T ss_pred --HHHHHHHHHhhCCCC---------CCccHHHHHHHHHHhh----cccCCCceeEEEEEeCCCCCCHH--HHHHHH-HH
Confidence 133444444432111 2456888888877431 11112334568888876543221 223322 23
Q ss_pred hhCCceeeEeeeCCC-Cchhhhhh
Q 027721 171 QRSMVPIDSCYLGAQ-NSAFLQQC 193 (219)
Q Consensus 171 qK~~V~IDvc~L~~~-~s~fLQQA 193 (219)
+..+++|.++-+|.. +..+|++.
T Consensus 394 ~~~~~ri~tvGiG~~~n~~lL~~l 417 (596)
T TIGR03788 394 KLGDSRLFTVGIGSAPNSYFMRKA 417 (596)
T ss_pred hcCCceEEEEEeCCCcCHHHHHHH
Confidence 345789999999864 67889886
No 41
>TIGR02442 Cob-chelat-sub cobaltochelatase subunit. A number of genomes (actinobacteria, cyanobacteria, betaproteobacteria and pseudomonads) which apparently biosynthesize B12, encode a cobN gene but are demonstrably lacking cobS and cobT. These genomes do, however contain a homolog (modelled here) of the magnesium chelatase subunits BchI/BchD family. Aside from the cyanobacteria (which have a separate magnesium chelatase trimer), these species do not make chlorins, so do not have any use for a magnesium chelatase. Furthermore, in nearly all cases the members of this family are proximal to either CobN itself or other genes involved in cobalt transport or B12 biosynthesis.
Probab=62.50 E-value=1.7e+02 Score=29.11 Aligned_cols=112 Identities=12% Similarity=0.095 Sum_probs=67.6
Q ss_pred CCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHH
Q 027721 13 VSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCAT 92 (219)
Q Consensus 13 ~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~ 92 (219)
..+++++||++.--... --+..+-..+..|+. .+.....+|.||+.......++.|...+
T Consensus 465 ~~~vv~vvD~SgSM~~~--~rl~~ak~a~~~ll~--~a~~~~D~v~lI~F~g~~a~~~~p~t~~---------------- 524 (633)
T TIGR02442 465 GNLVIFVVDASGSMAAR--GRMAAAKGAVLSLLR--DAYQKRDKVALITFRGEEAEVLLPPTSS---------------- 524 (633)
T ss_pred CceEEEEEECCccCCCc--cHHHHHHHHHHHHHH--HhhcCCCEEEEEEECCCCceEEcCCCCC----------------
Confidence 45888999998754432 124444445555544 2346689999999987667778775422
Q ss_pred HHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCC
Q 027721 93 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPD 155 (219)
Q Consensus 93 v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d 155 (219)
.+.+.+.+..- . . ...+.|+.||..|.-.+.+..+. ....++.|++||-..+
T Consensus 525 -~~~~~~~L~~l-~--~------gG~Tpl~~aL~~A~~~l~~~~~~-~~~~~~~vvliTDG~~ 576 (633)
T TIGR02442 525 -VELAARRLEEL-P--T------GGRTPLAAGLLKAAEVLSNELLR-DDDGRPLLVVITDGRA 576 (633)
T ss_pred -HHHHHHHHHhC-C--C------CCCCCHHHHHHHHHHHHHHhhcc-CCCCceEEEEECCCCC
Confidence 22222333321 1 1 13578999999999887754322 1245677888885443
No 42
>KOG2884 consensus 26S proteasome regulatory complex, subunit RPN10/PSMD4 [Posttranslational modification, protein turnover, chaperones]
Probab=58.57 E-value=1.3e+02 Score=26.54 Aligned_cols=140 Identities=14% Similarity=0.173 Sum_probs=89.5
Q ss_pred EEEEEeCCccccccC--cccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHH
Q 027721 16 VVVLLDTNPFFWSSS--SLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATL 93 (219)
Q Consensus 16 LviIlD~np~~W~~~--~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v 93 (219)
-+|.+|-.-+...-+ +.-|..=-++|=..+++-+--|..|.|-+|.-...++..|-... .+
T Consensus 6 tmi~iDNse~mrNgDy~PtRf~aQ~daVn~v~~~K~~snpEntvGiitla~a~~~vLsT~T-~d---------------- 68 (259)
T KOG2884|consen 6 TMICIDNSEYMRNGDYLPTRFQAQKDAVNLVCQAKLRSNPENTVGIITLANASVQVLSTLT-SD---------------- 68 (259)
T ss_pred EEEEEeChHHhhcCCCChHHHHHHHHHHHHHHHhhhcCCcccceeeEeccCCCceeeeecc-cc----------------
Confidence 356677543222222 33355555667677889999999999999987665565443211 11
Q ss_pred HHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhC
Q 027721 94 LQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRS 173 (219)
Q Consensus 94 ~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~ 173 (219)
.-+|...+- ..... ...-+..+|..|.--.+.+.. ..-+-||+++-+||-.-++- .+.-.+.-=+|.
T Consensus 69 ~gkils~lh---~i~~~------g~~~~~~~i~iA~lalkhRqn---k~~~~riVvFvGSpi~e~ek-eLv~~akrlkk~ 135 (259)
T KOG2884|consen 69 RGKILSKLH---GIQPH------GKANFMTGIQIAQLALKHRQN---KNQKQRIVVFVGSPIEESEK-ELVKLAKRLKKN 135 (259)
T ss_pred chHHHHHhc---CCCcC------CcccHHHHHHHHHHHHHhhcC---CCcceEEEEEecCcchhhHH-HHHHHHHHHHhc
Confidence 111211111 11111 245678888888776665532 34578999999999766555 777888888999
Q ss_pred CceeeEeeeCCC
Q 027721 174 MVPIDSCYLGAQ 185 (219)
Q Consensus 174 ~V~IDvc~L~~~ 185 (219)
+|.||+...|+.
T Consensus 136 ~Vaidii~FGE~ 147 (259)
T KOG2884|consen 136 KVAIDIINFGEA 147 (259)
T ss_pred CeeEEEEEeccc
Confidence 999999999853
No 43
>cd01473 vWA_CTRP CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60 amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.
Probab=56.04 E-value=1.2e+02 Score=25.17 Aligned_cols=154 Identities=11% Similarity=0.106 Sum_probs=73.6
Q ss_pred EEEEEeCCccccccCcccHHH-HHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQ-FLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLL 94 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~-~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~ 94 (219)
++++||.+--- +.-.+.+ +.+-+-.+++....-...-+|.|+..+.. .+..+|-..... .+ ...+.
T Consensus 3 i~fllD~S~Si---~~~~f~~~~~~f~~~lv~~l~i~~~~~rvgvv~fs~~-~~~~~~~~~~~~-----~~----~~~l~ 69 (192)
T cd01473 3 LTLILDESASI---GYSNWRKDVIPFTEKIINNLNISKDKVHVGILLFAEK-NRDVVPFSDEER-----YD----KNELL 69 (192)
T ss_pred EEEEEeCCCcc---cHHHHHHHHHHHHHHHHHhCccCCCccEEEEEEecCC-ceeEEecCcccc-----cC----HHHHH
Confidence 68889984332 1112321 22333333444434456778999887654 344545322110 00 11234
Q ss_pred HHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEE-EecCCCCchhhhHHHHHHHHhhhC
Q 027721 95 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILC-LQGSPDGPEQYVAIMNAIFSAQRS 173 (219)
Q Consensus 95 ~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILv-is~S~d~~~QYi~lMN~iFaAqK~ 173 (219)
+.+.++...-. .. ..+.+..||.+|+-++-+. ...+.-.+||+| ++-+.+....--.+.-..=.++++
T Consensus 70 ~~i~~l~~~~~--~~-------g~T~~~~AL~~a~~~~~~~--~~~r~~~~kv~IllTDG~s~~~~~~~~~~~a~~lk~~ 138 (192)
T cd01473 70 KKINDLKNSYR--SG-------GETYIVEALKYGLKNYTKH--GNRRKDAPKVTMLFTDGNDTSASKKELQDISLLYKEE 138 (192)
T ss_pred HHHHHHHhccC--CC-------CcCcHHHHHHHHHHHhccC--CCCcccCCeEEEEEecCCCCCcchhhHHHHHHHHHHC
Confidence 44444332110 01 2455677777765444221 111122255555 555544321111223344467889
Q ss_pred CceeeEeeeCCCCchhhhhh
Q 027721 174 MVPIDSCYLGAQNSAFLQQC 193 (219)
Q Consensus 174 ~V~IDvc~L~~~~s~fLQQA 193 (219)
+|.|-+.-+|..+..-|+..
T Consensus 139 gV~i~~vGiG~~~~~el~~i 158 (192)
T cd01473 139 NVKLLVVGVGAASENKLKLL 158 (192)
T ss_pred CCEEEEEEeccccHHHHHHh
Confidence 99999999985544444444
No 44
>PF04811 Sec23_trunk: Sec23/Sec24 trunk domain; InterPro: IPR006896 COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger (IPR006895 from INTERPRO), an alpha/beta trunk domain, an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes the Sec23/24 alpha/beta trunk domain, which is formed from a single, approximately 250-residue segment plugged into the beta-barrel between strands beta-1 and beta-19. The trunk has an alpha/beta fold with a vWA topology, and it forms the dimer interface, primarily involving strand beta-14 on Sec23 and Sec24; in addition, the trunk domain of Sec23 contacts Sar1.; GO: 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EGD_A 2NUP_A 3EG9_A 3EFO_A 3EGX_A 2NUT_A 1PD0_A 1PD1_A 1M2V_B 1PCX_A ....
Probab=55.38 E-value=1.3e+02 Score=25.59 Aligned_cols=154 Identities=17% Similarity=0.203 Sum_probs=81.3
Q ss_pred CCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCC--------CC--
Q 027721 13 VSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQS--------VG-- 82 (219)
Q Consensus 13 ~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~--------~~-- 82 (219)
|-..+.+||++..+-.. .-+..+.++|...++. +-....-+|++|..+. +-.+|.-....... ++
T Consensus 3 pp~y~FvID~s~~av~~--g~~~~~~~sl~~~l~~-l~~~~~~~vgiitfd~--~V~~y~l~~~~~~~~~~v~~dl~~~~ 77 (243)
T PF04811_consen 3 PPVYVFVIDVSYEAVQS--GLLQSLIESLKSALDS-LPGDERTRVGIITFDS--SVHFYNLSSSLSQPQMIVVSDLDDPF 77 (243)
T ss_dssp --EEEEEEE-SHHHHHH--THHHHHHHHHHHHGCT-SSTSTT-EEEEEEESS--SEEEEETTTTSSSTEEEEEHHTTSHH
T ss_pred CCEEEEEEECchhhhhc--cHHHHHHHHHHHHHHh-ccCCCCcEEEEEEeCC--EEEEEECCCCcCCCcccchHHHhhcc
Confidence 44678999998664333 2466667777666622 2234777888888843 35567543322100 00
Q ss_pred ---CC-------CchhHHHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEec
Q 027721 83 ---NG-------RMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 152 (219)
Q Consensus 83 ---~~-------~~~~~~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~ 152 (219)
.. +.......++++|.+...+.. +.....-+..||..|.-.+... +..+||+++..
T Consensus 78 ~p~~~~llv~~~e~~~~i~~ll~~L~~~~~~~~--------~~~~~~c~G~Al~~A~~ll~~~------~~gGkI~~F~s 143 (243)
T PF04811_consen 78 IPLPDGLLVPLSECRDAIEELLESLPSIFPETA--------GKRPERCLGSALSAALSLLSSR------NTGGKILVFTS 143 (243)
T ss_dssp SSTSSSSSEETTTCHHHHHHHHHHHHHHSTT-T--------TB-----HHHHHHHHHHHHHHH------TS-EEEEEEES
T ss_pred cCCcccEEEEhHHhHHHHHHHHHHhhhhccccc--------ccCccccHHHHHHHHHHHHhcc------ccCCEEEEEec
Confidence 00 001112223333333222110 0134567889999999998865 25689999876
Q ss_pred CCC--Cc-------------------hhhh-----HHHHHHHHhhhCCceeeEeeeCCC
Q 027721 153 SPD--GP-------------------EQYV-----AIMNAIFSAQRSMVPIDSCYLGAQ 185 (219)
Q Consensus 153 S~d--~~-------------------~QYi-----~lMN~iFaAqK~~V~IDvc~L~~~ 185 (219)
++- .+ ..|. -+-+....+.+++|.||++..+.+
T Consensus 144 ~~pt~G~Gg~l~~~~~~~~~~~~~~~~~~~~~~~~fY~~la~~~~~~~isvDlf~~~~~ 202 (243)
T PF04811_consen 144 GPPTYGPGGSLKKREDSSHYDTEKEKALLLPPANEFYKKLAEECSKQGISVDLFVFSSD 202 (243)
T ss_dssp S---SSSTTSS-SBTTSCCCCHCTTHHCHSHSSSHHHHHHHHHHHHCTEEEEEEEECSS
T ss_pred cCCCCCCCceecccccccccccccchhhhccccchHHHHHHHHHHhcCCEEEEEeecCC
Confidence 541 00 1111 356788899999999999999643
No 45
>KOG2807 consensus RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription; Replication, recombination and repair]
Probab=54.24 E-value=1.9e+02 Score=27.01 Aligned_cols=168 Identities=13% Similarity=0.097 Sum_probs=106.8
Q ss_pred eEEEEEeCCcccccc--CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHH
Q 027721 15 LVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCAT 92 (219)
Q Consensus 15 lLviIlD~np~~W~~--~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~ 92 (219)
.|+||||.+-.-=.. .+--+..+++.+-.|+--|.-+|.--|+-+|..-.+.+.-+=.-..+
T Consensus 62 hl~iviD~S~am~e~Df~P~r~a~~~K~le~Fv~eFFdQNPiSQigii~~k~g~A~~lt~ltgn---------------- 125 (378)
T KOG2807|consen 62 HLYIVIDCSRAMEEKDFRPSRFANVIKYLEGFVPEFFDQNPISQIGIISIKDGKADRLTDLTGN---------------- 125 (378)
T ss_pred eEEEEEEhhhhhhhccCCchHHHHHHHHHHHHHHHHhccCchhheeEEEEecchhhHHHHhcCC----------------
Confidence 589999986543333 34457888999999999999999999999999866655432221111
Q ss_pred HHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecC--CCCchhhhHHHHHHHHh
Q 027721 93 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGS--PDGPEQYVAIMNAIFSA 170 (219)
Q Consensus 93 v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S--~d~~~QYi~lMN~iFaA 170 (219)
....+..|...... .....|--||.+|--..+.+ +.+....+|||=+| .-.+..| |-.|=+-
T Consensus 126 p~~hI~aL~~~~~~---------~g~fSLqNaLe~a~~~Lk~~----p~H~sREVLii~sslsT~DPgdi---~~tI~~l 189 (378)
T KOG2807|consen 126 PRIHIHALKGLTEC---------SGDFSLQNALELAREVLKHM----PGHVSREVLIIFSSLSTCDPGDI---YETIDKL 189 (378)
T ss_pred HHHHHHHHhccccc---------CCChHHHHHHHHHHHHhcCC----CcccceEEEEEEeeecccCcccH---HHHHHHH
Confidence 12233333332211 12445777888887777665 24566677776543 3345544 7888899
Q ss_pred hhCCceeeEeeeCCCCch--hhhhh----ccccchhHHHHHHHHhcCCCC
Q 027721 171 QRSMVPIDSCYLGAQNSA--FLQQC----MSDRLLTLLVEYIISLNNWMG 214 (219)
Q Consensus 171 qK~~V~IDvc~L~~~~s~--fLQQA----~~~~~~~gLlQyLl~~~~~~~ 214 (219)
.+.+|.+-+.-|.++=.. =|-+| +++.--++=|+-||.--..-+
T Consensus 190 k~~kIRvsvIgLsaEv~icK~l~kaT~G~Y~V~lDe~HlkeLl~e~~~Pp 239 (378)
T KOG2807|consen 190 KAYKIRVSVIGLSAEVFICKELCKATGGRYSVALDEGHLKELLLEHTHPP 239 (378)
T ss_pred HhhCeEEEEEeechhHHHHHHHHHhhCCeEEEEeCHHHHHHHHHhcCCCC
Confidence 999999999999643111 12233 677666777777776443333
No 46
>cd01477 vWA_F09G8-8_type VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of mo
Probab=50.62 E-value=1.5e+02 Score=24.75 Aligned_cols=153 Identities=11% Similarity=0.059 Sum_probs=76.2
Q ss_pred CCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhc------CCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCc
Q 027721 13 VSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTL------NQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRM 86 (219)
Q Consensus 13 ~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~------~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~ 86 (219)
.-=++++||.+---= +..|.++..-+..+++..--. ...-+|.||..+.. .+..+|-..-.
T Consensus 19 ~~DivfvlD~S~Sm~---~~~f~~~k~fi~~~~~~~~~~~~~~~~~~~~rVGlV~fs~~-a~~~~~L~d~~--------- 85 (193)
T cd01477 19 WLDIVFVVDNSKGMT---QGGLWQVRATISSLFGSSSQIGTDYDDPRSTRVGLVTYNSN-ATVVADLNDLQ--------- 85 (193)
T ss_pred eeeEEEEEeCCCCcc---hhhHHHHHHHHHHHHhhccccccccCCCCCcEEEEEEccCc-eEEEEeccccc---------
Confidence 344788999854321 112444433333333333211 22468999888665 67788753211
Q ss_pred hhHHHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEE-Ee-cCCCCc-hhhhHH
Q 027721 87 PSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILC-LQ-GSPDGP-EQYVAI 163 (219)
Q Consensus 87 ~~~~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILv-is-~S~d~~-~QYi~l 163 (219)
-...+.+.+...+... .. ...+.+..||.+|+-....-.+ ..+.-.+|++| ++ +..+.. .+...
T Consensus 86 --~~~~~~~ai~~~~~~~---~~------~ggT~ig~aL~~A~~~l~~~~~-~~R~~v~kvvIllTDg~~~~~~~~~~~- 152 (193)
T cd01477 86 --SFDDLYSQIQGSLTDV---SS------TNASYLDTGLQAAEQMLAAGKR-TSRENYKKVVIVFASDYNDEGSNDPRP- 152 (193)
T ss_pred --CHHHHHHHHHHHhhcc---cc------CCcchHHHHHHHHHHHHHhhhc-cccCCCCeEEEEEecCccCCCCCCHHH-
Confidence 0111233333221111 00 0245678888888665542211 11233577544 54 322222 22222
Q ss_pred HHHHHHhhhCCceeeEeeeCCC-Cchhhhhh
Q 027721 164 MNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQC 193 (219)
Q Consensus 164 MN~iFaAqK~~V~IDvc~L~~~-~s~fLQQA 193 (219)
..=.+++.+|.|-+.-+|.. |...++|.
T Consensus 153 --~a~~l~~~GI~i~tVGiG~~~d~~~~~~L 181 (193)
T cd01477 153 --IAARLKSTGIAIITVAFTQDESSNLLDKL 181 (193)
T ss_pred --HHHHHHHCCCEEEEEEeCCCCCHHHHHHH
Confidence 23347889999999999854 33556664
No 47
>PTZ00441 sporozoite surface protein 2 (SSP2); Provisional
Probab=47.17 E-value=3.1e+02 Score=27.41 Aligned_cols=72 Identities=8% Similarity=0.050 Sum_probs=42.5
Q ss_pred ccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCceeeEeeeCCC-Cchhhhhh
Q 027721 118 CSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQC 193 (219)
Q Consensus 118 ~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~-~s~fLQQA 193 (219)
.+.+..||..|.-++.+.. .......-|+|+|-..+... + ..+..+-..++.+|.|=++-+|.. +..+|++.
T Consensus 126 gTnig~AL~~Aae~L~sr~--~R~nvpKVVILLTDG~sns~-~-dvleaAq~LR~~GVeI~vIGVG~g~n~e~LrlI 198 (576)
T PTZ00441 126 KTNMTDALLEVRKHLNDRV--NRENAIQLVILMTDGIPNSK-Y-RALEESRKLKDRNVKLAVIGIGQGINHQFNRLL 198 (576)
T ss_pred CccHHHHHHHHHHHHhhcc--cccCCceEEEEEecCCCCCc-c-cHHHHHHHHHHCCCEEEEEEeCCCcCHHHHHHH
Confidence 4668888888776665431 11233344555665544221 1 233455567788999999999753 45577643
No 48
>cd01454 vWA_norD_type norD type: Denitrifying bacteria contain both membrane bound and periplasmic nitrate reductases. Denitrification plays a major role in completing the nitrogen cycle by converting nitrate or nitrite to nitrogen gas. The pathway for microbial denitrification has been established as NO3- ------ NO2- ------ NO ------- N2O --------- N2. This reaction generally occurs under oxygen limiting conditions. Genetic and biochemical studies have shown that the first srep of the biochemical pathway is catalyzed by periplasmic nitrate reductases. This family is widely present in proteobacteria and firmicutes. This version of the domain is also present in some archaeal members. The function of the vWA domain in this sub-group is not known. Members of this subgroup have a conserved MIDAS motif.
Probab=43.69 E-value=1.6e+02 Score=23.30 Aligned_cols=138 Identities=9% Similarity=0.045 Sum_probs=73.5
Q ss_pred EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCc-----ceEEecCCCCCCCCCCCCCchhHH
Q 027721 16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS-----CDYVYDSSSTGNQSVGNGRMPSLC 90 (219)
Q Consensus 16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~-----~~~LYp~~~~~~~~~~~~~~~~~~ 90 (219)
++++||++.--... ..++.+-..+..|+.+... ...++.|++-.... ..++-+.+-..
T Consensus 3 v~~llD~SgSM~~~--~kl~~ak~a~~~l~~~l~~--~~d~~~l~~F~~~~~~~~~~~~~~~~~~~~------------- 65 (174)
T cd01454 3 VTLLLDLSGSMRSD--RRIDVAKKAAVLLAEALEA--CGVPHAILGFTTDAGGRERVRWIKIKDFDE------------- 65 (174)
T ss_pred EEEEEECCCCCCCC--cHHHHHHHHHHHHHHHHHH--cCCcEEEEEecCCCCCccceEEEEecCccc-------------
Confidence 57889997753321 2355555555555444333 56788888875541 23333211110
Q ss_pred HHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCc-hhh----hHHHH
Q 027721 91 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP-EQY----VAIMN 165 (219)
Q Consensus 91 ~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~-~QY----i~lMN 165 (219)
....++.+.+..-.. ...+.+..||-.|+-.+.+.. .-+..|++||-..+.. ..| ..+-.
T Consensus 66 -~~~~~~~~~l~~~~~---------~g~T~~~~al~~a~~~l~~~~-----~~~~~iiliTDG~~~~~~~~~~~~~~~~~ 130 (174)
T cd01454 66 -SLHERARKRLAALSP---------GGNTRDGAAIRHAAERLLARP-----EKRKILLVISDGEPNDLDYYEGNVFATED 130 (174)
T ss_pred -ccchhHHHHHHccCC---------CCCCcHHHHHHHHHHHHhcCC-----CcCcEEEEEeCCCcCcccccCcchhHHHH
Confidence 001122222221101 124668888888877776531 2234466677544432 222 13334
Q ss_pred H---HHHhhhCCceeeEeeeCCC
Q 027721 166 A---IFSAQRSMVPIDSCYLGAQ 185 (219)
Q Consensus 166 ~---iFaAqK~~V~IDvc~L~~~ 185 (219)
+ +..|++.+|.+-+.-+|..
T Consensus 131 ~~~~~~~~~~~gi~v~~igig~~ 153 (174)
T cd01454 131 ALRAVIEARKLGIEVFGITIDRD 153 (174)
T ss_pred HHHHHHHHHhCCcEEEEEEecCc
Confidence 4 7888999999999999743
No 49
>COG5148 RPN10 26S proteasome regulatory complex, subunit RPN10/PSMD4 [Posttranslational modification, protein turnover, chaperones]
Probab=34.59 E-value=3.1e+02 Score=23.79 Aligned_cols=139 Identities=15% Similarity=0.150 Sum_probs=85.2
Q ss_pred eEEEEEeCCccccccC--cccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEe-cCCCCCCCCCCCCCchhHHH
Q 027721 15 LVVVLLDTNPFFWSSS--SLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY-DSSSTGNQSVGNGRMPSLCA 91 (219)
Q Consensus 15 lLviIlD~np~~W~~~--~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LY-p~~~~~~~~~~~~~~~~~~~ 91 (219)
.-||+||-.-.....+ +.-|..-.+++-...+.-.--+..|.+-+|.-.......|- |+..
T Consensus 5 atvvliDNse~s~NgDy~ptRFeAQkd~ve~if~~K~ndnpEntiGli~~~~a~p~vlsT~T~~---------------- 68 (243)
T COG5148 5 ATVVLIDNSEASQNGDYLPTRFEAQKDAVESIFSKKFNDNPENTIGLIPLVQAQPNVLSTPTKQ---------------- 68 (243)
T ss_pred eEEEEEeChhhhhcCCCCcHHHHHHHHHHHHHHHHHhcCCccceeeeeecccCCcchhccchhh----------------
Confidence 3577777643322222 33466666777777888888899999988865433322111 1110
Q ss_pred HHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhh
Q 027721 92 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ 171 (219)
Q Consensus 92 ~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAq 171 (219)
.-++..++... ..+ ...-++-+|..|+--.+.+. +..-+-||.++-+||-..+. =.+.-.+..-+
T Consensus 69 --~gkilt~lhd~---~~~------g~a~~~~~lqiaql~lkhR~---nk~q~qriVaFvgSpi~ese-deLirlak~lk 133 (243)
T COG5148 69 --RGKILTFLHDI---RLH------GGADIMRCLQIAQLILKHRD---NKGQRQRIVAFVGSPIQESE-DELIRLAKQLK 133 (243)
T ss_pred --hhHHHHHhccc---ccc------CcchHHHHHHHHHHHHhccc---CCccceEEEEEecCcccccH-HHHHHHHHHHH
Confidence 11222333221 111 13446778888876665542 23456899999999976655 45666778889
Q ss_pred hCCceeeEeeeCC
Q 027721 172 RSMVPIDSCYLGA 184 (219)
Q Consensus 172 K~~V~IDvc~L~~ 184 (219)
|.+|.||....|+
T Consensus 134 knnVAidii~fGE 146 (243)
T COG5148 134 KNNVAIDIIFFGE 146 (243)
T ss_pred hcCeeEEEEehhh
Confidence 9999999999984
No 50
>cd00529 RuvC_resolvase Holliday junction resolvases (HJRs) are endonucleases that specifically resolve Holliday junction DNA intermediates during homologous recombination. HJR's occur in archaea, bacteria, and in the mitochondria of certain fungi, however this CD includes only the bacterial and mitochondrial HJR's. These are referred to as the RuvC family of Holliday junction resolvases, RuvC being the E.coli HJR. RuvC and its orthologs are homodimers and are structurely similar to RNase H and Hsp70.
Probab=28.02 E-value=35 Score=27.44 Aligned_cols=15 Identities=13% Similarity=0.116 Sum_probs=13.0
Q ss_pred chhhHHHHHHHHHhh
Q 027721 120 LLSGSLSMALCYIQR 134 (219)
Q Consensus 120 ~LagALs~ALCyINR 134 (219)
-.|=||+.|+||++|
T Consensus 139 d~aDAlaiA~~~~~~ 153 (154)
T cd00529 139 DAADALAVAITHAHW 153 (154)
T ss_pred CHHHHHHHHHHHHhh
Confidence 367799999999987
No 51
>COG4867 Uncharacterized protein with a von Willebrand factor type A (vWA) domain [General function prediction only]
Probab=27.50 E-value=2.2e+02 Score=27.71 Aligned_cols=92 Identities=20% Similarity=0.156 Sum_probs=64.5
Q ss_pred cccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCC--------CC---chhhhHHHHHHH-------HhhhCCceee
Q 027721 117 ACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSP--------DG---PEQYVAIMNAIF-------SAQRSMVPID 178 (219)
Q Consensus 117 ~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~--------d~---~~QYi~lMN~iF-------aAqK~~V~ID 178 (219)
..+.+.+||.+|=-+..|. .+.++-||||+-.. |. -..|-+==-.+| -..|.++-|.
T Consensus 531 qgTNlhhaL~LA~r~l~Rh-----~~~~~~il~vTDGePtAhle~~DG~~~~f~yp~DP~t~~~Tvr~~d~~~r~G~q~t 605 (652)
T COG4867 531 QGTNLHHALALAGRHLRRH-----AGAQPVVLVVTDGEPTAHLEDGDGTSVFFDYPPDPRTIAHTVRGFDDMARLGAQVT 605 (652)
T ss_pred cccchHHHHHHHHHHHHhC-----cccCceEEEEeCCCccccccCCCCceEecCCCCChhHHHHHHHHHHHHHhccceee
Confidence 4677999999998888776 47788999998533 21 122211111222 2368999999
Q ss_pred EeeeCCCC--chhhhhh-------ccccchhHHHHHHHHhcCCC
Q 027721 179 SCYLGAQN--SAFLQQC-------MSDRLLTLLVEYIISLNNWM 213 (219)
Q Consensus 179 vc~L~~~~--s~fLQQA-------~~~~~~~gLlQyLl~~~~~~ 213 (219)
.+.||... -.|+||. ...++++||-||.+.-|+..
T Consensus 606 ~FrLg~DpgL~~Fv~qva~rv~G~vv~pdldglGaaVvgdylr~ 649 (652)
T COG4867 606 IFRLGSDPGLARFIDQVARRVQGRVVVPDLDGLGAAVVGDYLRF 649 (652)
T ss_pred EEeecCCHhHHHHHHHHHHHhCCeEEecCcchhhHHHHHHHHhh
Confidence 99997432 3499998 77899999999998877654
No 52
>COG4245 TerY Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [General function prediction only]
Probab=25.92 E-value=4.3e+02 Score=22.79 Aligned_cols=97 Identities=18% Similarity=0.260 Sum_probs=58.9
Q ss_pred ccchhhHHHHHHHHHhhhhcc----CCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCceeeEeeeCCC--Cchhhh
Q 027721 118 CSLLSGSLSMALCYIQRVFRS----GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ--NSAFLQ 191 (219)
Q Consensus 118 ~s~LagALs~ALCyINR~~~~----~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~--~s~fLQ 191 (219)
.+.|..||.+|+=.|-+..++ .....+|=+++++-+.- ...|-.=-+-.|.-.+-+..+=.|.+|.+ +-.-|+
T Consensus 78 gT~lGaAl~~a~d~Ie~~~~~~~a~~kgdyrP~vfLiTDG~P-tD~w~~~~~~~~~~~~~~k~v~a~~~G~~~ad~~~L~ 156 (207)
T COG4245 78 GTPLGAALTLALDMIEERKRKYDANGKGDYRPWVFLITDGEP-TDDWQAGAALVFQGERRAKSVAAFSVGVQGADNKTLN 156 (207)
T ss_pred CCchHHHHHHHHHHHHHHHhhcccCCccccceEEEEecCCCc-chHHHhHHHHhhhcccccceEEEEEecccccccHHHH
Confidence 677999999999888776443 22457888887774332 44443333444444444455557888744 678899
Q ss_pred hh---------ccccchhHHHHHHH-HhcCCCCC
Q 027721 192 QC---------MSDRLLTLLVEYII-SLNNWMGS 215 (219)
Q Consensus 192 QA---------~~~~~~~gLlQyLl-~~~~~~~~ 215 (219)
|. ++..+..+++.++- ++-..|+|
T Consensus 157 qit~~V~~~~t~d~~~f~~fFkW~SaSisagS~S 190 (207)
T COG4245 157 QITEKVRQFLTLDGLQFREFFKWLSASISAGSRS 190 (207)
T ss_pred HHHHhhccccccchHHHHHHHHHHHHHhhccccc
Confidence 98 33344455666653 34444544
No 53
>TIGR00228 ruvC crossover junction endodeoxyribonuclease RuvC. Endonuclease that resolves Holliday junction intermediates in genetic recombination. The active form of the protein is a dimer. Structure studies reveals that the catalytic center, comprised of four acidic residues, lies at the bottom of a cleft that fits a DNA duplex. The model hits a single Synechocystis PCC6803 protein at a score of 30, below the trusted cutoff, that appears orthologous and may act as authentic RuvC.
Probab=24.29 E-value=40 Score=27.73 Aligned_cols=17 Identities=12% Similarity=0.126 Sum_probs=14.2
Q ss_pred hhhHHHHHHHHHhhhhc
Q 027721 121 LSGSLSMALCYIQRVFR 137 (219)
Q Consensus 121 LagALs~ALCyINR~~~ 137 (219)
-|=||+.|+||.+|..+
T Consensus 137 aaDALAiAi~h~~~~~~ 153 (156)
T TIGR00228 137 AADALAIAITHAHVSQN 153 (156)
T ss_pred HHHHHHHHHHHhcccCc
Confidence 56799999999999753
No 54
>PF11373 DUF3175: Protein of unknown function (DUF3175); InterPro: IPR021513 This entry is represented by Ralstonia phage RSL1, Orf186. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family of proteins with unknown function appears to be restricted to Proteobacteria.
Probab=22.12 E-value=1.4e+02 Score=22.19 Aligned_cols=22 Identities=23% Similarity=0.344 Sum_probs=19.4
Q ss_pred cccchhhHHHHHHHHHhhhhcc
Q 027721 117 ACSLLSGSLSMALCYIQRVFRS 138 (219)
Q Consensus 117 ~~s~LagALs~ALCyINR~~~~ 138 (219)
+.++.-+||||==-||||..+.
T Consensus 43 k~~pf~SAMSML~FYINRAGk~ 64 (86)
T PF11373_consen 43 KATPFRSAMSMLNFYINRAGKN 64 (86)
T ss_pred CCCchHHHHHHHHHHHhccccc
Confidence 5778999999999999998765
Done!