Query         027721
Match_columns 219
No_of_seqs    109 out of 217
Neff          6.2 
Searched_HMMs 46136
Date          Fri Mar 29 14:10:30 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/027721.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/027721hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF03850 Tfb4:  Transcription f 100.0 5.2E-64 1.1E-68  444.0  19.6  198   13-215     1-224 (276)
  2 TIGR00627 tfb4 transcription f 100.0 1.7E-57 3.7E-62  402.3  20.1  199   12-215     1-226 (279)
  3 KOG2487 RNA polymerase II tran 100.0 4.8E-57   1E-61  391.5  17.2  205    4-215    14-244 (314)
  4 COG5242 TFB4 RNA polymerase II 100.0 7.2E-52 1.6E-56  352.0  16.8  198    7-216    14-232 (296)
  5 cd01452 VWA_26S_proteasome_sub  97.9   0.001 2.2E-08   56.3  16.7  140   15-185     5-147 (187)
  6 cd01453 vWA_transcription_fact  97.8  0.0039 8.5E-08   51.9  18.2  160   16-208     6-177 (183)
  7 cd01465 vWA_subgroup VWA subgr  97.2   0.077 1.7E-06   42.2  17.0  156   16-205     3-169 (170)
  8 smart00327 VWA von Willebrand   96.9    0.12 2.6E-06   40.6  15.6  151   15-193     3-154 (177)
  9 cd01482 vWA_collagen_alphaI-XI  96.6    0.23   5E-06   39.9  15.8  147   16-193     3-149 (164)
 10 cd01480 vWA_collagen_alpha_1-V  96.6    0.33 7.1E-06   40.0  17.0  149   16-193     5-159 (186)
 11 PF04056 Ssl1:  Ssl1-like;  Int  96.5    0.23 4.9E-06   42.4  15.3  149   31-210    15-172 (193)
 12 cd01461 vWA_interalpha_trypsin  96.2    0.48   1E-05   37.5  17.2  159   14-207     3-170 (171)
 13 cd00198 vWFA Von Willebrand fa  96.0    0.44 9.6E-06   36.1  15.9  147   16-192     3-150 (161)
 14 PF13519 VWA_2:  von Willebrand  96.0    0.52 1.1E-05   36.8  15.6  155   16-206     2-169 (172)
 15 cd01472 vWA_collagen von Wille  96.0    0.62 1.3E-05   37.2  15.7  147   16-193     3-149 (164)
 16 PRK13685 hypothetical protein;  95.9     1.2 2.5E-05   40.4  18.1  164   10-203    85-280 (326)
 17 cd01451 vWA_Magnesium_chelatas  95.3     1.3 2.8E-05   36.1  17.4  147   15-193     2-153 (178)
 18 cd01467 vWA_BatA_type VWA BatA  95.1     1.4 2.9E-05   35.5  14.7  136   15-184     4-142 (180)
 19 cd01466 vWA_C3HC4_type VWA C3H  95.0     1.1 2.3E-05   35.8  13.0  140   16-193     3-143 (155)
 20 cd01469 vWA_integrins_alpha_su  94.5     2.2 4.9E-05   34.8  15.2  140   16-184     3-142 (177)
 21 cd01458 vWA_ku Ku70/Ku80 N-ter  94.3     2.8 6.1E-05   35.3  16.7  155   14-185     2-173 (218)
 22 PF13768 VWA_3:  von Willebrand  93.9     2.5 5.5E-05   33.2  14.6  141   16-193     3-145 (155)
 23 TIGR03436 acidobact_VWFA VWFA-  92.6     6.8 0.00015   34.5  17.9  146   12-183    52-203 (296)
 24 PRK13406 bchD magnesium chelat  92.5      12 0.00025   37.1  17.3  150   10-193   398-556 (584)
 25 cd01476 VWA_integrin_invertebr  91.9     5.2 0.00011   31.6  16.1  137   16-184     3-141 (163)
 26 cd01475 vWA_Matrilin VWA_Matri  91.7     7.5 0.00016   32.9  16.8  147   16-193     5-154 (224)
 27 cd01450 vWFA_subfamily_ECM Von  91.3     5.5 0.00012   30.6  15.3  148   16-192     3-150 (161)
 28 cd01474 vWA_ATR ATR (Anthrax T  91.0     7.5 0.00016   31.7  15.5  148   15-193     6-153 (185)
 29 PF03731 Ku_N:  Ku70/Ku80 N-ter  90.2     8.5 0.00018   32.3  12.3  158   16-183     2-173 (224)
 30 cd01463 vWA_VGCC_like VWA Volt  89.7      10 0.00022   31.1  14.9  153   11-193    11-175 (190)
 31 cd01462 VWA_YIEM_type VWA YIEM  88.7     9.8 0.00021   29.7  16.8  138   16-191     3-141 (152)
 32 cd01471 vWA_micronemal_protein  86.3      16 0.00035   29.5  15.8   73  118-193    84-157 (186)
 33 TIGR00868 hCaCC calcium-activa  84.1      57  0.0012   33.9  17.5  151   15-200   306-464 (863)
 34 TIGR02031 BchD-ChlD magnesium   83.6      22 0.00049   35.0  12.6  109   11-153   405-513 (589)
 35 cd01464 vWA_subfamily VWA subf  80.7      28  0.0006   28.0  13.4  144   16-193     6-157 (176)
 36 cd01456 vWA_ywmD_type VWA ywmD  78.7      36 0.00078   28.1  16.8  160    8-193    15-188 (206)
 37 PF00092 VWA:  von Willebrand f  78.4      30 0.00064   27.0  15.1  148   16-193     2-152 (178)
 38 cd01470 vWA_complement_factors  77.0      39 0.00086   27.6  17.2  168   16-204     3-195 (198)
 39 COG1240 ChlD Mg-chelatase subu  76.7      19  0.0004   32.2   8.5  111   11-152    76-186 (261)
 40 TIGR03788 marine_srt_targ mari  66.3 1.4E+02   0.003   29.3  20.3  148   11-193   269-417 (596)
 41 TIGR02442 Cob-chelat-sub cobal  62.5 1.7E+02  0.0037   29.1  17.6  112   13-155   465-576 (633)
 42 KOG2884 26S proteasome regulat  58.6 1.3E+02  0.0028   26.5  13.5  140   16-185     6-147 (259)
 43 cd01473 vWA_CTRP CTRP for  CS   56.0 1.2E+02  0.0025   25.2  15.7  154   16-193     3-158 (192)
 44 PF04811 Sec23_trunk:  Sec23/Se  55.4 1.3E+02  0.0029   25.6  12.2  154   13-185     3-202 (243)
 45 KOG2807 RNA polymerase II tran  54.2 1.9E+02  0.0041   27.0  13.1  168   15-214    62-239 (378)
 46 cd01477 vWA_F09G8-8_type VWA F  50.6 1.5E+02  0.0032   24.8  16.6  153   13-193    19-181 (193)
 47 PTZ00441 sporozoite surface pr  47.2 3.1E+02  0.0066   27.4  17.3   72  118-193   126-198 (576)
 48 cd01454 vWA_norD_type norD typ  43.7 1.6E+02  0.0036   23.3  14.9  138   16-185     3-153 (174)
 49 COG5148 RPN10 26S proteasome r  34.6 3.1E+02  0.0066   23.8  12.3  139   15-184     5-146 (243)
 50 cd00529 RuvC_resolvase Hollida  28.0      35 0.00076   27.4   1.5   15  120-134   139-153 (154)
 51 COG4867 Uncharacterized protei  27.5 2.2E+02  0.0048   27.7   6.8   92  117-213   531-649 (652)
 52 COG4245 TerY Uncharacterized p  25.9 4.3E+02  0.0094   22.8   8.7   97  118-215    78-190 (207)
 53 TIGR00228 ruvC crossover junct  24.3      40 0.00088   27.7   1.2   17  121-137   137-153 (156)
 54 PF11373 DUF3175:  Protein of u  22.1 1.4E+02  0.0031   22.2   3.6   22  117-138    43-64  (86)

No 1  
>PF03850 Tfb4:  Transcription factor Tfb4;  InterPro: IPR004600 Members of this family are part of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. The core-TFIIH basal transcription factor complex has six subunits, this is the p34 subunit.; GO: 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent, 0000439 core TFIIH complex
Probab=100.00  E-value=5.2e-64  Score=444.01  Aligned_cols=198  Identities=42%  Similarity=0.645  Sum_probs=171.3

Q ss_pred             CCeEEEEEeCCcccccc--CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCC---CC----C--
Q 027721           13 VSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGN---QS----V--   81 (219)
Q Consensus        13 ~slLviIlD~np~~W~~--~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~---~~----~--   81 (219)
                      ||||+||||+||.+|+.  +.++|++++++++||+||||++++.|+|||||+|+++++||||++....   ..    .  
T Consensus         1 pSLLvIILD~nP~~W~~~~~~~~l~~~l~~llvFlNahL~l~~~N~vaVIAs~~~~s~~LYP~~~~~~~~~~~~~~~~~~   80 (276)
T PF03850_consen    1 PSLLVIILDTNPLAWGQLSDQLSLSQFLDSLLVFLNAHLALNHSNQVAVIASHSNSSKFLYPSPSSSESSNSGDVEMNSS   80 (276)
T ss_pred             CcEEEEEEECCHHHHhhccccccHHHHHHHHHHHHHHHHhhCccCCEEEEEEcCCccEEEeCCCccccccCCCccccccc
Confidence            79999999999999998  5699999999999999999999999999999999999999999988321   10    0  


Q ss_pred             --CCCCchhHH-HHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCC---CCCCceEEE-EecCC
Q 027721           82 --GNGRMPSLC-ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGL---LHPQPRILC-LQGSP  154 (219)
Q Consensus        82 --~~~~~~~~~-~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~---~~~~~RILv-is~S~  154 (219)
                        +..+.+..+ +.|.++++++++++......     +.+++|+||||||||||||+.|+..   .++++|||| +++|+
T Consensus        81 ~~~~y~~f~~v~~~v~~~l~~l~~~~~~~~~~-----~~~s~LagALS~ALCyINR~~~~~~~~~~~~~~RILv~~s~s~  155 (276)
T PF03850_consen   81 DSNKYRQFRNVDETVLEELKKLMSETSESSDS-----TTSSLLAGALSMALCYINRISRESPSGGTSLKSRILVIVSGSP  155 (276)
T ss_pred             ccchhHHHHHHHHHHHHHHHHHHhhccccccc-----ccchhhHHHHHHHHHHHhhhhhcccCCCCCcCccEEEEEecCC
Confidence              011122223 45889999999986554432     2248999999999999999988764   589999999 99999


Q ss_pred             CCchhhhHHHHHHHHhhhCCceeeEeeeCCCCchhhhhh--------ccccchhHHHHHHHHhcCCCCC
Q 027721          155 DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQC--------MSDRLLTLLVEYIISLNNWMGS  215 (219)
Q Consensus       155 d~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~~s~fLQQA--------~~~~~~~gLlQyLl~~~~~~~~  215 (219)
                      |.++||||+|||||||||++|+||||+|+.++++|||||        +++++++||+|||||.|+++++
T Consensus       156 d~~~QYi~~MN~iFaAqk~~v~IDv~~L~~~~s~fLqQa~d~T~G~y~~~~~~~~l~q~L~~~fl~~~~  224 (276)
T PF03850_consen  156 DSSSQYIPLMNCIFAAQKQKVPIDVCKLGGKDSTFLQQASDITGGIYLKVSKPEGLLQYLLTAFLPDPS  224 (276)
T ss_pred             CccHHHHHHHHHHHHHhcCCceeEEEEecCCchHHHHHHHHHhCceeeccCccccHHHHHHHhhcCCHH
Confidence            999999999999999999999999999987799999999        9999999999999999999864


No 2  
>TIGR00627 tfb4 transcription factor tfb4. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=100.00  E-value=1.7e-57  Score=402.28  Aligned_cols=199  Identities=31%  Similarity=0.447  Sum_probs=168.8

Q ss_pred             CCCeEEEEEeCCcccccc-----CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCC-----
Q 027721           12 DVSLVVVLLDTNPFFWSS-----SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSV-----   81 (219)
Q Consensus        12 ~~slLviIlD~np~~W~~-----~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~-----   81 (219)
                      ++|||+||||+||.+|+.     ++++|++++++++||+||||+++++|+|||||+|+++++||||++.++++..     
T Consensus         1 ~~slL~vvlD~np~~W~~~~~~~~~~~l~~~l~sllvF~NahL~l~~~N~vaVIAs~~~~~~~LYps~~~~~~~~~~~~~   80 (279)
T TIGR00627         1 NPSLLVVIIEANPCSWGMLALAHGKRTISKVLRAIVVFLNAHLAFNANNKLAVIASHSQDNKYLYPSTRCEDRNASELDP   80 (279)
T ss_pred             CCcEEEEEEeCCHHHHHHHhhccCCCcHHHHHHHHHHHHHHHHhcCccCCEEEEEecCCcceEEecCCcccccccccccc
Confidence            479999999999999954     4789999999999999999999999999999999999999999875432111     


Q ss_pred             -----CCCCchhHH-HHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccC--CCCCCceEEEEecC
Q 027721           82 -----GNGRMPSLC-ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSG--LLHPQPRILCLQGS  153 (219)
Q Consensus        82 -----~~~~~~~~~-~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~--~~~~~~RILvis~S  153 (219)
                           +..+.+..+ +.|.++|++++.++......     ..+++|+||||||||||||+.++.  .+++++|||||++|
T Consensus        81 ~~~~~~~y~~f~~v~~~v~~~l~~l~~~~~~~~~~-----~~~s~lagals~ALcyinr~~~~~~~~~~~~~RIlii~~s  155 (279)
T TIGR00627        81 KRLRELLYRDFRTVDETIVEEIKPLMAHADKHMKK-----DSRTVLAGALSDALGYINRSEQSETASEKLKSRILVISIT  155 (279)
T ss_pred             ccccchhccchhHHHHHHHHHHHHHHhhchhcccc-----cccccchhHHHhhhhhhcccccccccCcCCcceEEEEECC
Confidence                 111112223 35899999999876543321     257899999999999999997753  36899999999999


Q ss_pred             CCCchhhhHHHHHHHHhhhCCceeeEeeeCCC-Cchhhhhh--------ccccchhHHHHHHHHhcCCCCC
Q 027721          154 PDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQC--------MSDRLLTLLVEYIISLNNWMGS  215 (219)
Q Consensus       154 ~d~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~-~s~fLQQA--------~~~~~~~gLlQyLl~~~~~~~~  215 (219)
                      +|.++||||+||+||+|||++|+||||+|+.+ ++.|||||        +++++++||+||||+.|+++++
T Consensus       156 ~~~~~qYi~~mn~Ifaaqk~~I~Idv~~L~~e~~~~~lqQa~~~TgG~Y~~~~~~~~L~q~L~~~~~pp~~  226 (279)
T TIGR00627       156 PDMALQYIPLMNCIFSAQKQNIPIDVVSIGGDFTSGFLQQAADITGGSYLHVKKPQGLLQYLMTNMLPDPT  226 (279)
T ss_pred             CCchHHHHHHHHHHHHHHHcCceEEEEEeCCccccHHHHHHHHHhCCEEeccCCHhHHHHHHHHhcCCChh
Confidence            99999999999999999999999999999754 69999999        7789999999999999998864


No 3  
>KOG2487 consensus RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB4 [Transcription; Replication, recombination and repair]
Probab=100.00  E-value=4.8e-57  Score=391.50  Aligned_cols=205  Identities=30%  Similarity=0.421  Sum_probs=175.9

Q ss_pred             CCCCCCCCCCCeEEEEEeCCcccccc-----CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCC
Q 027721            4 APSKLYSDDVSLVVVLLDTNPFFWSS-----SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGN   78 (219)
Q Consensus         4 ~~~~~~~~~~slLviIlD~np~~W~~-----~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~   78 (219)
                      +.+..+++.||||+||||+||..|+.     +..++.+++++|+||+||||++|+.|+|||||+|+++.+||||++...+
T Consensus        14 ~~~~~dn~~~slL~vlId~~p~~Wg~~as~~~~~ti~kvl~aivVFlNAHL~~~~~NrvaViA~~~q~~~~lyp~st~~e   93 (314)
T KOG2487|consen   14 PASHRDNLNPSLLVVLIDANPCSWGMLASAENWETISKVLNAIVVFLNAHLAFSRNNRVAVIASHSQVDNYLYPSSTRCE   93 (314)
T ss_pred             cCcccccCCceeEEEEEecCcchhhhhhhhcCceeHHHHHHHHHHHHHHHHhhccCCcEEEEEecccccceeccccccCC
Confidence            34556677899999999999999986     5669999999999999999999999999999999999999999876533


Q ss_pred             CCC-----CC-CCchhH------HHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccC-CCCCCc
Q 027721           79 QSV-----GN-GRMPSL------CATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSG-LLHPQP  145 (219)
Q Consensus        79 ~~~-----~~-~~~~~~------~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~-~~~~~~  145 (219)
                      +..     .. ...+.+      -+.|.++|.++|.....-...      ..+.+||||+.|||||||+.++. ++++++
T Consensus        94 ~~n~~~~~~t~~~~~~y~~~~~~d~tiv~ei~~lm~~~~~~~~~------~rt~lagals~~L~yi~~~~ke~~~~~lkS  167 (314)
T KOG2487|consen   94 DRNASELDPTRLVLFDYSEFRTVDDTIVEEIYRLMEHPDKYDVG------DRTVLAGALSDALGYINRLHKEEASEKLKS  167 (314)
T ss_pred             ccCccccCchhhhcchhhhhcccchHHHHHHHHHHhCccccccc------cceeeccchhhccchHhhhhhhhhhhhhhc
Confidence            110     00 111111      235899999999976543331      37899999999999999998874 578999


Q ss_pred             eEEEEecCCCCchhhhHHHHHHHHhhhCCceeeEeeeCCCCchhhhhh--------ccccchhHHHHHHHHhcCCCCC
Q 027721          146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQC--------MSDRLLTLLVEYIISLNNWMGS  215 (219)
Q Consensus       146 RILvis~S~d~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~~s~fLQQA--------~~~~~~~gLlQyLl~~~~~~~~  215 (219)
                      ||||++.++|.+.||||+|||||+|||++|+||||+|| ++++|||||        +|+++|+||||||||.|+++++
T Consensus       168 RilV~t~t~d~~~qyi~~MNciFaAqKq~I~Idv~~l~-~~s~~LqQa~D~TGG~YL~v~~~~gLLqyLlt~~~~D~~  244 (314)
T KOG2487|consen  168 RILVFTLTRDRALQYIPYMNCIFAAQKQNIPIDVVSLG-GDSGFLQQACDITGGDYLHVEKPDGLLQYLLTLLLTDPE  244 (314)
T ss_pred             eEEEEEechHHHhhhhhHHHHHHHHHhcCceeEEEEec-CCchHHHHHHhhcCCeeEecCCcchHHHHHHHHhcCCcc
Confidence            99999999999999999999999999999999999995 779999999        9999999999999999999986


No 4  
>COG5242 TFB4 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB4 [Transcription / DNA replication, recombination, and repair]
Probab=100.00  E-value=7.2e-52  Score=352.03  Aligned_cols=198  Identities=27%  Similarity=0.378  Sum_probs=172.4

Q ss_pred             CCCCCCCCeEEEEEeCCcccccc--CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCC---CCC
Q 027721            7 KLYSDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGN---QSV   81 (219)
Q Consensus         7 ~~~~~~~slLviIlD~np~~W~~--~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~---~~~   81 (219)
                      +-.||.||||+||||.+|..|+.  ...+..++|+.++||+||||++++.|+|||+|+++.+.+||||+..+..   +.+
T Consensus        14 ~vteespslL~viid~~p~~W~~~~ek~~~~kvl~di~VFLNAhlaf~~~NrVaVva~~s~~~~yLypss~s~~k~se~e   93 (296)
T COG5242          14 QVTEESPSLLFVIIDLEPENWELTTEKGSRDKVLNDIVVFLNAHLAFSRNNRVAVVAGYSQGKTYLYPSSESALKASESE   93 (296)
T ss_pred             cccccCCceEEEEEecChhhcccccccccHHHHHHHHHHHHHHHHhhccCCeEEEEEeccCceEEeccCcchhhhhhccc
Confidence            35678899999999999999998  5688999999999999999999999999999999999999999987644   223


Q ss_pred             CCCCchhH-------HHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCC
Q 027721           82 GNGRMPSL-------CATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSP  154 (219)
Q Consensus        82 ~~~~~~~~-------~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~  154 (219)
                      +. +..++       -+.++++|.+++..+.+.        .+.+.++|||+.||.|+||+.++  .++++||||++.|.
T Consensus        94 ~t-r~sd~yrrfr~vde~~i~eiyrl~e~~~k~--------sqr~~v~gams~glay~n~~~~e--~slkSriliftlsG  162 (296)
T COG5242          94 NT-RNSDMYRRFRNVDETDITEIYRLIEHPHKN--------SQRYDVGGAMSLGLAYCNHRDEE--TSLKSRILIFTLSG  162 (296)
T ss_pred             Cc-cchhhhhhhcccchHHHHHHHHHHhCcccc--------cceeehhhhhhhhHHHHhhhccc--ccccceEEEEEecC
Confidence            32 22222       234789999999876542        24688999999999999999876  68999999999977


Q ss_pred             -CCchhhhHHHHHHHHhhhCCceeeEeeeCCCCchhhhhh--------ccccchhHHHHHHHHhcCCCCCC
Q 027721          155 -DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQC--------MSDRLLTLLVEYIISLNNWMGSF  216 (219)
Q Consensus       155 -d~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~~s~fLQQA--------~~~~~~~gLlQyLl~~~~~~~~~  216 (219)
                       |...||||+|||||+|||++||||||.|+ +.+.||||+        +++++++|||||||+.+.++++-
T Consensus       163 ~d~~~qYip~mnCiF~Aqk~~ipI~v~~i~-g~s~fl~Q~~daTgG~Yl~ve~~eGllqyL~~~lf~d~~l  232 (296)
T COG5242         163 RDRKDQYIPYMNCIFAAQKFGIPISVFSIF-GNSKFLLQCCDATGGDYLTVEDTEGLLQYLLSLLFTDGEL  232 (296)
T ss_pred             chhhhhhchhhhheeehhhcCCceEEEEec-CccHHHHHHhhccCCeeEeecCchhHHHHHHHHhcCCCCc
Confidence             99999999999999999999999999995 779999999        99999999999999999998863


No 5  
>cd01452 VWA_26S_proteasome_subunit 26S proteasome plays a major role in eukaryotic protein breakdown, especially for ubiquitin-tagged proteins. It is an ATP-dependent protease responsible for the bulk of non-lysosomal proteolysis in eukaryotes, often using covalent modification of proteins by ubiquitylation. It consists of a 20S proteolytic core particle (CP) and a 19S regulatory particle (RP). The CP is an ATP independent peptidase consisting of hydrolyzing activities. One or both ends of CP carry the RP that confers both ubiquitin and ATP dependence to the 26S proteosome. The RP's  proposed functions include recognition of substrates and translocation of these to CP for proteolysis. The RP can dissociate into a stable lid and base subcomplexes. The base is composed of three non-ATPase subunits (Rpn 1, 2 and 10). A single residue in the vWA domain of Rpn10 has been implicated to be responsible for stabilizing the lid-base association.
Probab=97.95  E-value=0.001  Score=56.28  Aligned_cols=140  Identities=14%  Similarity=0.156  Sum_probs=99.7

Q ss_pred             eEEEEEeCCcccccc--CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHH
Q 027721           15 LVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCAT   92 (219)
Q Consensus        15 lLviIlD~np~~W~~--~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~   92 (219)
                      -.+|+||.+..--..  .|-=|....+.+-.|++.|.-.+..+++.+|+...+.+..+-|-..+                
T Consensus         5 a~vi~lD~S~sM~a~D~~PnRL~aak~~i~~~~~~f~~~np~~~vGlv~fag~~a~v~~plT~D----------------   68 (187)
T cd01452           5 ATMICIDNSEYMRNGDYPPTRFQAQADAVNLICQAKTRSNPENNVGLMTMAGNSPEVLVTLTND----------------   68 (187)
T ss_pred             EEEEEEECCHHHHcCCCCCCHHHHHHHHHHHHHHHHHhcCCCccEEEEEecCCceEEEECCCCC----------------
Confidence            468999997665444  34457888888999999999999999999999988778777775543                


Q ss_pred             HHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCC-CCchhhhHHHHHHHHhh
Q 027721           93 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSP-DGPEQYVAIMNAIFSAQ  171 (219)
Q Consensus        93 v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~-d~~~QYi~lMN~iFaAq  171 (219)
                       ...+..++.+-..         ..++.|..||.+|+-......   ...-+.||+|+-+|+ +....-  +...+-.++
T Consensus        69 -~~~~~~~L~~i~~---------~g~~~l~~AL~~A~~~L~~~~---~~~~~~rivi~v~S~~~~d~~~--i~~~~~~lk  133 (187)
T cd01452          69 -QGKILSKLHDVQP---------KGKANFITGIQIAQLALKHRQ---NKNQKQRIVAFVGSPIEEDEKD--LVKLAKRLK  133 (187)
T ss_pred             -HHHHHHHHHhCCC---------CCcchHHHHHHHHHHHHhcCC---CcCCcceEEEEEecCCcCCHHH--HHHHHHHHH
Confidence             2334444443211         124568899999986665431   122346999988887 332222  568999999


Q ss_pred             hCCceeeEeeeCCC
Q 027721          172 RSMVPIDSCYLGAQ  185 (219)
Q Consensus       172 K~~V~IDvc~L~~~  185 (219)
                      |.+|.||+..+|+.
T Consensus       134 k~~I~v~vI~~G~~  147 (187)
T cd01452         134 KNNVSVDIINFGEI  147 (187)
T ss_pred             HcCCeEEEEEeCCC
Confidence            99999999999854


No 6  
>cd01453 vWA_transcription_factor_IIH_type Transcription factors IIH type: TFIIH is a multiprotein complex that is one of the five general transcription factors that binds RNA polymerase II holoenzyme. Orthologues of these genes are found in all completed eukaryotic genomes and all these proteins contain a VWA domain. The p44 subunit of TFIIH functions as a DNA helicase in RNA polymerase II transcription initiation and DNA repair, and its transcriptional activity is dependent on its C-terminal Zn-binding domains. The function of the vWA domain is unclear, but may be involved in complex assembly. The MIDAS motif is not conserved in this sub-group.
Probab=97.83  E-value=0.0039  Score=51.94  Aligned_cols=160  Identities=13%  Similarity=0.143  Sum_probs=108.4

Q ss_pred             EEEEEeCCcccccc--CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHH
Q 027721           16 VVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATL   93 (219)
Q Consensus        16 LviIlD~np~~W~~--~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v   93 (219)
                      ++|+||+++-=-..  .+--|+.+-..+.-|++.++..+...++.+|+...+.++.+-|-....                
T Consensus         6 ivi~lD~S~SM~a~D~~ptRl~~ak~~~~~fi~~~~~~~~~~~vglv~f~~~~a~~~~PlT~D~----------------   69 (183)
T cd01453           6 LIIVIDCSRSMEEQDLKPSRLAVVLKLLELFIEEFFDQNPISQLGIISIKNGRAEKLTDLTGNP----------------   69 (183)
T ss_pred             EEEEEECcHHHhcCCCCchHHHHHHHHHHHHHHHHhhcCccccEEEEEEcCCccEEEECCCCCH----------------
Confidence            78999998774433  345588889999999999999999999999999777788887764331                


Q ss_pred             HHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecC-CCC-chhhhHHHHHHHHhh
Q 027721           94 LQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGS-PDG-PEQYVAIMNAIFSAQ  171 (219)
Q Consensus        94 ~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S-~d~-~~QYi~lMN~iFaAq  171 (219)
                       +.+...+...  ...      ...+.+..||.+|+-+..+..    ..-+.||+|+..+ .+. +.++   -...=.++
T Consensus        70 -~~~~~~L~~~--~~~------~G~t~l~~aL~~A~~~l~~~~----~~~~~~iiil~sd~~~~~~~~~---~~~~~~l~  133 (183)
T cd01453          70 -RKHIQALKTA--REC------SGEPSLQNGLEMALESLKHMP----SHGSREVLIIFSSLSTCDPGNI---YETIDKLK  133 (183)
T ss_pred             -HHHHHHhhcc--cCC------CCchhHHHHHHHHHHHHhcCC----ccCceEEEEEEcCCCcCChhhH---HHHHHHHH
Confidence             1222222211  011      124679999999988886431    1224567766443 222 2223   34555678


Q ss_pred             hCCceeeEeeeCCCCchhhhhh--------ccccchhHHHHHHHH
Q 027721          172 RSMVPIDSCYLGAQNSAFLQQC--------MSDRLLTLLVEYIIS  208 (219)
Q Consensus       172 K~~V~IDvc~L~~~~s~fLQQA--------~~~~~~~gLlQyLl~  208 (219)
                      |.+|.|++..+| .+..+||+.        +.+.+++.|-+-+.-
T Consensus       134 ~~~I~v~~IgiG-~~~~~L~~ia~~tgG~~~~~~~~~~l~~~~~~  177 (183)
T cd01453         134 KENIRVSVIGLS-AEMHICKEICKATNGTYKVILDETHLKELLLE  177 (183)
T ss_pred             HcCcEEEEEEec-hHHHHHHHHHHHhCCeeEeeCCHHHHHHHHHh
Confidence            899999999997 556678887        777888888876654


No 7  
>cd01465 vWA_subgroup VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if n
Probab=97.16  E-value=0.077  Score=42.22  Aligned_cols=156  Identities=8%  Similarity=0.053  Sum_probs=94.3

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ   95 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~   95 (219)
                      ++++||++.---..   .+..+...+..++   ..+....++.|+..+.. .+.++|.....    +           .+
T Consensus         3 ~~~vlD~S~SM~~~---~~~~~k~a~~~~~---~~l~~~~~v~li~f~~~-~~~~~~~~~~~----~-----------~~   60 (170)
T cd01465           3 LVFVIDRSGSMDGP---KLPLVKSALKLLV---DQLRPDDRLAIVTYDGA-AETVLPATPVR----D-----------KA   60 (170)
T ss_pred             EEEEEECCCCCCCh---hHHHHHHHHHHHH---HhCCCCCEEEEEEecCC-ccEEecCcccc----h-----------HH
Confidence            67899998754322   2444444444333   35567789999998765 56666642211    0           23


Q ss_pred             HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCch--hhhHHHHHHHHhhhC
Q 027721           96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPE--QYVAIMNAIFSAQRS  173 (219)
Q Consensus        96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~--QYi~lMN~iFaAqK~  173 (219)
                      .+.+.+..-..         ...+.+..||..|+-.+.+..+.   .-..+|++++-+.+...  .+-.+-.+.=.+++.
T Consensus        61 ~l~~~l~~~~~---------~g~T~~~~al~~a~~~~~~~~~~---~~~~~ivl~TDG~~~~~~~~~~~~~~~~~~~~~~  128 (170)
T cd01465          61 AILAAIDRLTA---------GGSTAGGAGIQLGYQEAQKHFVP---GGVNRILLATDGDFNVGETDPDELARLVAQKRES  128 (170)
T ss_pred             HHHHHHHcCCC---------CCCCCHHHHHHHHHHHHHhhcCC---CCeeEEEEEeCCCCCCCCCCHHHHHHHHHHhhcC
Confidence            34444443211         12467889999999887664321   22367888887654321  122344455456788


Q ss_pred             CceeeEeeeCCC-Cchhhhhh--------ccccchhHHHHH
Q 027721          174 MVPIDSCYLGAQ-NSAFLQQC--------MSDRLLTLLVEY  205 (219)
Q Consensus       174 ~V~IDvc~L~~~-~s~fLQQA--------~~~~~~~gLlQy  205 (219)
                      +|.|.++.+|.. +..+|++.        ..+++++.+-++
T Consensus       129 ~v~i~~i~~g~~~~~~~l~~ia~~~~g~~~~~~~~~~~~~~  169 (170)
T cd01465         129 GITLSTLGFGDNYNEDLMEAIADAGNGNTAYIDNLAEARKV  169 (170)
T ss_pred             CeEEEEEEeCCCcCHHHHHHHHhcCCceEEEeCCHHHHHhh
Confidence            999999999843 56788886        667777776554


No 8  
>smart00327 VWA von Willebrand factor (vWF) type A domain. VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.
Probab=96.87  E-value=0.12  Score=40.60  Aligned_cols=151  Identities=14%  Similarity=0.063  Sum_probs=92.1

Q ss_pred             eEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHH
Q 027721           15 LVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLL   94 (219)
Q Consensus        15 lLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~   94 (219)
                      -+++++|++.---   ...+..+...+..|+...-..+...++.|+..+.+... ..|.....       +        .
T Consensus         3 ~v~l~vD~S~SM~---~~~~~~~~~~~~~~~~~~~~~~~~~~i~ii~f~~~~~~-~~~~~~~~-------~--------~   63 (177)
T smart00327        3 DVVFLLDGSGSMG---PNRFEKAKEFVLKLVEQLDIGPDGDRVGLVTFSDDATV-LFPLNDSR-------S--------K   63 (177)
T ss_pred             cEEEEEeCCCccc---hHHHHHHHHHHHHHHHhcCCCCCCcEEEEEEeCCCceE-EEcccccC-------C--------H
Confidence            3688999976542   12356666666777777766666889999998875433 33322110       0        3


Q ss_pred             HHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCC
Q 027721           95 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM  174 (219)
Q Consensus        95 ~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~  174 (219)
                      +.+.+.+..-... .      ...+.+..||-.|+-.+.+........-+..|++++-..+...  -.+..++..+++.+
T Consensus        64 ~~~~~~i~~~~~~-~------~~~~~~~~al~~~~~~~~~~~~~~~~~~~~~iviitDg~~~~~--~~~~~~~~~~~~~~  134 (177)
T smart00327       64 DALLEALASLSYK-L------GGGTNLGAALQYALENLFSKSAGSRRGAPKVLILITDGESNDG--GDLLKAAKELKRSG  134 (177)
T ss_pred             HHHHHHHHhcCCC-C------CCCchHHHHHHHHHHHhcCcCCCCCCCCCeEEEEEcCCCCCCC--ccHHHHHHHHHHCC
Confidence            3344444432110 1      1245678888887766642221111122567888876555433  35588889999999


Q ss_pred             ceeeEeeeCCC-Cchhhhhh
Q 027721          175 VPIDSCYLGAQ-NSAFLQQC  193 (219)
Q Consensus       175 V~IDvc~L~~~-~s~fLQQA  193 (219)
                      |.|.++.+|.. +..+|++.
T Consensus       135 i~i~~i~~~~~~~~~~l~~~  154 (177)
T smart00327      135 VKVFVVGVGNDVDEEELKKL  154 (177)
T ss_pred             CEEEEEEccCccCHHHHHHH
Confidence            99999999865 67778777


No 9  
>cd01482 vWA_collagen_alphaI-XII-like Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=96.63  E-value=0.23  Score=39.89  Aligned_cols=147  Identities=8%  Similarity=-0.044  Sum_probs=83.6

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ   95 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~   95 (219)
                      ++++||.+.---   ...|.++.+.+..+++.+-......++++|..+.. .+..+|-....       .        .+
T Consensus         3 v~~vlD~S~Sm~---~~~~~~~k~~~~~l~~~~~~~~~~~rvgli~fs~~-~~~~~~l~~~~-------~--------~~   63 (164)
T cd01482           3 IVFLVDGSWSIG---RSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDD-PRTEFDLNAYT-------S--------KE   63 (164)
T ss_pred             EEEEEeCCCCcC---hhhHHHHHHHHHHHHhheeeCCCceEEEEEEECCC-eeEEEecCCCC-------C--------HH
Confidence            678999954332   22466666666677776534456789999998776 34344422111       0        22


Q ss_pred             HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCc
Q 027721           96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV  175 (219)
Q Consensus        96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V  175 (219)
                      .+.+.+.+.....        ..+.+..||-.|...+-+..+...+..+..|++++-+... .   .+......+++.+|
T Consensus        64 ~l~~~l~~~~~~~--------g~T~~~~aL~~a~~~~~~~~~~~r~~~~k~iillTDG~~~-~---~~~~~a~~lk~~gi  131 (164)
T cd01482          64 DVLAAIKNLPYKG--------GNTRTGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQ-D---DVELPARVLRNLGV  131 (164)
T ss_pred             HHHHHHHhCcCCC--------CCChHHHHHHHHHHHhcccccCCCCCCCEEEEEEcCCCCC-c---hHHHHHHHHHHCCC
Confidence            2333333211111        2456888888877655442211223344556667643321 1   34677888999999


Q ss_pred             eeeEeeeCCCCchhhhhh
Q 027721          176 PIDSCYLGAQNSAFLQQC  193 (219)
Q Consensus       176 ~IDvc~L~~~~s~fLQQA  193 (219)
                      .|-++.+|..+-..|+|.
T Consensus       132 ~i~~ig~g~~~~~~L~~i  149 (164)
T cd01482         132 NVFAVGVKDADESELKMI  149 (164)
T ss_pred             EEEEEecCcCCHHHHHHH
Confidence            999999975444455554


No 10 
>cd01480 vWA_collagen_alpha_1-VI-type VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far.  Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=96.58  E-value=0.33  Score=40.01  Aligned_cols=149  Identities=15%  Similarity=0.061  Sum_probs=85.1

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhc------CCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTL------NQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSL   89 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~------~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~   89 (219)
                      ++++||++.---..   .|....+.+..+++..+..      ....+++||..+.. .+..+|-.....      +    
T Consensus         5 vv~vlD~S~Sm~~~---~~~~~k~~~~~~~~~l~~~~~~~i~~~~~rvglv~fs~~-~~~~~~l~~~~~------~----   70 (186)
T cd01480           5 ITFVLDSSESVGLQ---NFDITKNFVKRVAERFLKDYYRKDPAGSWRVGVVQYSDQ-QEVEAGFLRDIR------N----   70 (186)
T ss_pred             EEEEEeCCCccchh---hHHHHHHHHHHHHHHHhhhhccCCCCCceEEEEEEecCC-ceeeEecccccC------C----
Confidence            78999995543222   3444444455555555432      34578999998755 455665432100      0    


Q ss_pred             HHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHH
Q 027721           90 CATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFS  169 (219)
Q Consensus        90 ~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFa  169 (219)
                          .+.+.+.+..-....        ..+.+..||-.|.-.+.+-   ..+..+..|++++-+......-..+-++.=.
T Consensus        71 ----~~~l~~~i~~l~~~g--------g~T~~~~AL~~a~~~l~~~---~~~~~~~~iillTDG~~~~~~~~~~~~~~~~  135 (186)
T cd01480          71 ----YTSLKEAVDNLEYIG--------GGTFTDCALKYATEQLLEG---SHQKENKFLLVITDGHSDGSPDGGIEKAVNE  135 (186)
T ss_pred             ----HHHHHHHHHhCccCC--------CCccHHHHHHHHHHHHhcc---CCCCCceEEEEEeCCCcCCCcchhHHHHHHH
Confidence                334444444321111        2456777887777666541   1234566777787654322222334566677


Q ss_pred             hhhCCceeeEeeeCCCCchhhhhh
Q 027721          170 AQRSMVPIDSCYLGAQNSAFLQQC  193 (219)
Q Consensus       170 AqK~~V~IDvc~L~~~~s~fLQQA  193 (219)
                      +++.+|.|=++.+|..+..-|++.
T Consensus       136 ~~~~gi~i~~vgig~~~~~~L~~I  159 (186)
T cd01480         136 ADHLGIKIFFVAVGSQNEEPLSRI  159 (186)
T ss_pred             HHHCCCEEEEEecCccchHHHHHH
Confidence            899999999999986555556655


No 11 
>PF04056 Ssl1:  Ssl1-like;  InterPro: IPR007198 Ssl1-like proteins are 40 kDa subunits of the transcription factor II H complex. This domain is often found associated with the C2H2 type Zn-finger (IPR007087 from INTERPRO).; GO: 0008270 zinc ion binding, 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent
Probab=96.48  E-value=0.23  Score=42.35  Aligned_cols=149  Identities=15%  Similarity=0.152  Sum_probs=100.4

Q ss_pred             cccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHHHHHHHHHhhhhcCCC
Q 027721           31 SLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQ  110 (219)
Q Consensus        31 ~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~~l~~li~~~~~~~~~  110 (219)
                      +.-+.-.++.+-.|+.-|+-+|.-.|+.||+...+.++.+-+-..+.             ....+.|.++.+.    .. 
T Consensus        15 PtRl~~~~~~l~~Fv~eff~qNPiSqlgii~~~~~~a~~ls~lsgn~-------------~~h~~~L~~~~~~----~~-   76 (193)
T PF04056_consen   15 PTRLQCVLKALEEFVREFFDQNPISQLGIIVMRDGRAERLSELSGNP-------------QEHIEALKKLRKL----EP-   76 (193)
T ss_pred             ccHHHHHHHHHHHHHHHHHhcCChhheeeeeeecceeEEeeecCCCH-------------HHHHHHHHHhccC----CC-
Confidence            44577789999999999999999999999999888888887755432             1123333333221    11 


Q ss_pred             CCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCC--CchhhhHHHHHHHHhhhCCceeeEeeeCCCCch
Q 027721          111 EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPD--GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSA  188 (219)
Q Consensus       111 ~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d--~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~~s~  188 (219)
                           .....|--||-+|......+.    .+-..+||+|-+|--  .+.   .+-..|=..+|.+|.++|..|+ .+-.
T Consensus        77 -----~G~~SLqN~Le~A~~~L~~~p----~~~srEIlvi~gSl~t~Dp~---di~~ti~~l~~~~IrvsvI~la-aEv~  143 (193)
T PF04056_consen   77 -----SGEPSLQNGLEMARSSLKHMP----SHGSREILVIFGSLTTCDPG---DIHETIESLKKENIRVSVISLA-AEVY  143 (193)
T ss_pred             -----CCChhHHHHHHHHHHHHhhCc----cccceEEEEEEeecccCCch---hHHHHHHHHHHcCCEEEEEEEh-HHHH
Confidence                 124557788888877776542    345568999886542  233   3467788889999999999995 4555


Q ss_pred             hhhhh-------ccccchhHHHHHHHHhc
Q 027721          189 FLQQC-------MSDRLLTLLVEYIISLN  210 (219)
Q Consensus       189 fLQQA-------~~~~~~~gLlQyLl~~~  210 (219)
                      .++..       +.+.--+.-+.-||+.+
T Consensus       144 I~k~i~~~T~G~y~V~lde~H~~~lL~~~  172 (193)
T PF04056_consen  144 ICKKICKETGGTYGVILDEDHFKELLMEH  172 (193)
T ss_pred             HHHHHHHhhCCEEEEecCHHHHHHHHHhh
Confidence            66665       55544444455555543


No 12 
>cd01461 vWA_interalpha_trypsin_inhibitor vWA_interalpha trypsin inhibitor (ITI): ITI is a glycoprotein composed of three polypeptides- two heavy chains and one light chain (bikunin). Bikunin confers the protease-inhibitor function while the heavy chains are involved in rendering stability to the extracellular matrix by binding to hyaluronic acid. The heavy chains carry the VWA domain with a conserved MIDAS motif. Although the exact role of the VWA domains remains unknown, it has been speculated to be involved in mediating protein-protein interactions with the components of the extracellular matrix.
Probab=96.16  E-value=0.48  Score=37.50  Aligned_cols=159  Identities=16%  Similarity=0.140  Sum_probs=96.6

Q ss_pred             CeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHH
Q 027721           14 SLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATL   93 (219)
Q Consensus        14 slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v   93 (219)
                      .=++++||++.--.+.   .+..+-..+..|++.   +....++.|++.+.. .+.++|.....    +        ..-
T Consensus         3 ~~v~~vlD~S~SM~~~---~~~~~~~al~~~l~~---l~~~~~~~l~~Fs~~-~~~~~~~~~~~----~--------~~~   63 (171)
T cd01461           3 KEVVFVIDTSGSMSGT---KIEQTKEALLTALKD---LPPGDYFNIIGFSDT-VEEFSPSSVSA----T--------AEN   63 (171)
T ss_pred             ceEEEEEECCCCCCCh---hHHHHHHHHHHHHHh---CCCCCEEEEEEeCCC-ceeecCcceeC----C--------HHH
Confidence            3478999998877543   366666666665543   456678998887655 34445432110    0        001


Q ss_pred             HHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhC
Q 027721           94 LQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRS  173 (219)
Q Consensus        94 ~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~  173 (219)
                      ++++.+++..-..   .      ..+.+..||..|+-.+.+     .+.....|+++|-......+.  +...+-.+.+.
T Consensus        64 ~~~~~~~l~~~~~---~------g~T~l~~al~~a~~~l~~-----~~~~~~~iillTDG~~~~~~~--~~~~~~~~~~~  127 (171)
T cd01461          64 VAAAIEYVNRLQA---L------GGTNMNDALEAALELLNS-----SPGSVPQIILLTDGEVTNESQ--ILKNVREALSG  127 (171)
T ss_pred             HHHHHHHHHhcCC---C------CCcCHHHHHHHHHHhhcc-----CCCCccEEEEEeCCCCCCHHH--HHHHHHHhcCC
Confidence            2333344433211   1      245677777777655543     123456788888665433332  23444456667


Q ss_pred             CceeeEeeeCC-CCchhhhhh--------ccccchhHHHHHHH
Q 027721          174 MVPIDSCYLGA-QNSAFLQQC--------MSDRLLTLLVEYII  207 (219)
Q Consensus       174 ~V~IDvc~L~~-~~s~fLQQA--------~~~~~~~gLlQyLl  207 (219)
                      +|+|-++.+|. .+..+|++.        +++++.+.+.++|+
T Consensus       128 ~i~i~~i~~g~~~~~~~l~~ia~~~gG~~~~~~~~~~~~~~~~  170 (171)
T cd01461         128 RIRLFTFGIGSDVNTYLLERLAREGRGIARRIYETDDIESQLL  170 (171)
T ss_pred             CceEEEEEeCCccCHHHHHHHHHcCCCeEEEecChHHHHHHhc
Confidence            99999999975 367899998        88999999988874


No 13 
>cd00198 vWFA Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.
Probab=96.05  E-value=0.44  Score=36.07  Aligned_cols=147  Identities=14%  Similarity=0.140  Sum_probs=86.8

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ   95 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~   95 (219)
                      +++++|++..-.   ...+..+...+..++..........++.|++.+.. .+..+|.....       .        .+
T Consensus         3 v~~viD~S~Sm~---~~~~~~~~~~~~~~~~~~~~~~~~~~i~v~~f~~~-~~~~~~~~~~~-------~--------~~   63 (161)
T cd00198           3 IVFLLDVSGSMG---GEKLDKAKEALKALVSSLSASPPGDRVGLVTFGSN-ARVVLPLTTDT-------D--------KA   63 (161)
T ss_pred             EEEEEeCCCCcC---cchHHHHHHHHHHHHHhcccCCCCcEEEEEEecCc-cceeecccccC-------C--------HH
Confidence            678999977651   12355666666666666666567889999998764 34444432210       0        23


Q ss_pred             HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCc
Q 027721           96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV  175 (219)
Q Consensus        96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V  175 (219)
                      .+.+.++.-.. ..      ...+.+..|+..|+-...+..   ......+|++++-....... -.+--++-.+.+.+|
T Consensus        64 ~~~~~~~~~~~-~~------~~~t~~~~al~~~~~~~~~~~---~~~~~~~lvvitDg~~~~~~-~~~~~~~~~~~~~~v  132 (161)
T cd00198          64 DLLEAIDALKK-GL------GGGTNIGAALRLALELLKSAK---RPNARRVIILLTDGEPNDGP-ELLAEAARELRKLGI  132 (161)
T ss_pred             HHHHHHHhccc-CC------CCCccHHHHHHHHHHHhcccC---CCCCceEEEEEeCCCCCCCc-chhHHHHHHHHHcCC
Confidence            33444433211 01      134557777777777666543   12345667777765544332 244556777888899


Q ss_pred             eeeEeeeCC-CCchhhhh
Q 027721          176 PIDSCYLGA-QNSAFLQQ  192 (219)
Q Consensus       176 ~IDvc~L~~-~~s~fLQQ  192 (219)
                      .|.+..+|+ ....+|+.
T Consensus       133 ~v~~v~~g~~~~~~~l~~  150 (161)
T cd00198         133 TVYTIGIGDDANEDELKE  150 (161)
T ss_pred             EEEEEEcCCCCCHHHHHH
Confidence            999999975 44555554


No 14 
>PF13519 VWA_2:  von Willebrand factor type A domain; PDB: 3IBS_B 3RAG_B 2X5N_A.
Probab=96.04  E-value=0.52  Score=36.77  Aligned_cols=155  Identities=15%  Similarity=0.115  Sum_probs=88.2

Q ss_pred             EEEEEeCCcccccc--CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHH
Q 027721           16 VVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATL   93 (219)
Q Consensus        16 LviIlD~np~~W~~--~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v   93 (219)
                      +++++|.+.--...  ..--+..+...+..+++.+    ..++++|+..+.. .....|-...                 
T Consensus         2 vv~v~D~SgSM~~~~~~~~~~~~~~~~~~~~~~~~----~~~~v~l~~f~~~-~~~~~~~t~~-----------------   59 (172)
T PF13519_consen    2 VVFVLDNSGSMNGYDGNRTRIDQAKDALNELLANL----PGDRVGLVSFSDS-SRTLSPLTSD-----------------   59 (172)
T ss_dssp             EEEEEE-SGGGGTTTSSS-HHHHHHHHHHHHHHHH----TTSEEEEEEESTS-CEEEEEEESS-----------------
T ss_pred             EEEEEECCcccCCCCCCCcHHHHHHHHHHHHHHHC----CCCEEEEEEeccc-cccccccccc-----------------
Confidence            68999998775544  2234666666666677663    4669999998775 3445553211                 


Q ss_pred             HHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhC
Q 027721           94 LQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRS  173 (219)
Q Consensus        94 ~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~  173 (219)
                      .+.+.+.+.+-......     ...+.+..||..|.-++.+..     .-+..|++|+-..+. ..+..   .+-.+++.
T Consensus        60 ~~~~~~~l~~~~~~~~~-----~~~t~~~~al~~a~~~~~~~~-----~~~~~iv~iTDG~~~-~~~~~---~~~~~~~~  125 (172)
T PF13519_consen   60 KDELKNALNKLSPQGMP-----GGGTNLYDALQEAAKMLASSD-----NRRRAIVLITDGEDN-SSDIE---AAKALKQQ  125 (172)
T ss_dssp             HHHHHHHHHTHHHHG-------SSS--HHHHHHHHHHHHHC-S-----SEEEEEEEEES-TTH-CHHHH---HHHHHHCT
T ss_pred             HHHHHHHhhcccccccC-----ccCCcHHHHHHHHHHHHHhCC-----CCceEEEEecCCCCC-cchhH---HHHHHHHc
Confidence            23333333332211010     135678888888887776542     234455566654433 33333   55577899


Q ss_pred             CceeeEeeeCCCCc--hhhhhh--------ccc-cchhHHHHHH
Q 027721          174 MVPIDSCYLGAQNS--AFLQQC--------MSD-RLLTLLVEYI  206 (219)
Q Consensus       174 ~V~IDvc~L~~~~s--~fLQQA--------~~~-~~~~gLlQyL  206 (219)
                      +|.|.++.+|....  ..||+.        +.+ ..++.|-+-|
T Consensus       126 ~i~i~~v~~~~~~~~~~~l~~la~~tgG~~~~~~~~~~~l~~~~  169 (172)
T PF13519_consen  126 GITIYTVGIGSDSDANEFLQRLAEATGGRYFHVDNDPEDLDDAF  169 (172)
T ss_dssp             TEEEEEEEES-TT-EHHHHHHHHHHTEEEEEEE-SSSHHHHHHH
T ss_pred             CCeEEEEEECCCccHHHHHHHHHHhcCCEEEEecCCHHHHHHHH
Confidence            99999999986544  478887        555 4666665543


No 15 
>cd01472 vWA_collagen von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins.  This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via  the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.
Probab=95.98  E-value=0.62  Score=37.18  Aligned_cols=147  Identities=9%  Similarity=0.011  Sum_probs=81.1

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ   95 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~   95 (219)
                      ++++||.+.--+..   .+..+-+.+..+++.+.......++.+|..+.. .+...|-....       +        .+
T Consensus         3 vv~vlD~SgSm~~~---~~~~~k~~~~~~~~~l~~~~~~~~~giv~Fs~~-~~~~~~~~~~~-------~--------~~   63 (164)
T cd01472           3 IVFLVDGSESIGLS---NFNLVKDFVKRVVERLDIGPDGVRVGVVQYSDD-PRTEFYLNTYR-------S--------KD   63 (164)
T ss_pred             EEEEEeCCCCCCHH---HHHHHHHHHHHHHhhcccCCCCeEEEEEEEcCc-eeEEEecCCCC-------C--------HH
Confidence            57999998766553   244444444445544333344568999887644 45555533211       0        22


Q ss_pred             HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCc
Q 027721           96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV  175 (219)
Q Consensus        96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V  175 (219)
                      .+.+.++.-....        ..+.+..||..|...+.+............|++++-+.... .   ...+...+.+.+|
T Consensus        64 ~~~~~l~~l~~~~--------g~T~~~~al~~a~~~l~~~~~~~~~~~~~~iiliTDG~~~~-~---~~~~~~~l~~~gv  131 (164)
T cd01472          64 DVLEAVKNLRYIG--------GGTNTGKALKYVRENLFTEASGSREGVPKVLVVITDGKSQD-D---VEEPAVELKQAGI  131 (164)
T ss_pred             HHHHHHHhCcCCC--------CCchHHHHHHHHHHHhCCcccCCCCCCCEEEEEEcCCCCCc-h---HHHHHHHHHHCCC
Confidence            2333333211111        24568888888888776432111123344456666443221 1   2334556777899


Q ss_pred             eeeEeeeCCCCchhhhhh
Q 027721          176 PIDSCYLGAQNSAFLQQC  193 (219)
Q Consensus       176 ~IDvc~L~~~~s~fLQQA  193 (219)
                      .|-+..+|..+...|++.
T Consensus       132 ~i~~ig~g~~~~~~L~~i  149 (164)
T cd01472         132 EVFAVGVKNADEEELKQI  149 (164)
T ss_pred             EEEEEECCcCCHHHHHHH
Confidence            999999986677777765


No 16 
>PRK13685 hypothetical protein; Provisional
Probab=95.95  E-value=1.2  Score=40.40  Aligned_cols=164  Identities=15%  Similarity=0.145  Sum_probs=100.5

Q ss_pred             CCCCCeEEEEEeCCccccccC--cccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCch
Q 027721           10 SDDVSLVVVLLDTNPFFWSSS--SLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMP   87 (219)
Q Consensus        10 ~~~~slLviIlD~np~~W~~~--~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~   87 (219)
                      ...+.-++++||++.-=...+  +--+..+-..+..|++.   +....++.+|+.+.+ +..+.|...+           
T Consensus        85 ~~~~~~vvlvlD~S~SM~~~D~~p~RL~~ak~~~~~~l~~---l~~~d~vglv~Fa~~-a~~~~p~t~d-----------  149 (326)
T PRK13685         85 PRNRAVVMLVIDVSQSMRATDVEPNRLAAAQEAAKQFADE---LTPGINLGLIAFAGT-ATVLVSPTTN-----------  149 (326)
T ss_pred             CCCCceEEEEEECCccccCCCCCCCHHHHHHHHHHHHHHh---CCCCCeEEEEEEcCc-eeecCCCCCC-----------
Confidence            334557899999988766542  33477777777778775   356778999998665 4555553211           


Q ss_pred             hHHHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhc---cCCCCCCceEEEEecCCCCchh----h
Q 027721           88 SLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR---SGLLHPQPRILCLQGSPDGPEQ----Y  160 (219)
Q Consensus        88 ~~~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~---~~~~~~~~RILvis~S~d~~~Q----Y  160 (219)
                            .+.+.+.++.-...         ..+.+..+|..|+-...+..+   ...+....+|+++|-+.+....    -
T Consensus       150 ------~~~l~~~l~~l~~~---------~~T~~g~al~~A~~~l~~~~~~~~~~~~~~~~~IILlTDG~~~~~~~~~~~  214 (326)
T PRK13685        150 ------REATKNAIDKLQLA---------DRTATGEAIFTALQAIATVGAVIGGGDTPPPARIVLMSDGKETVPTNPDNP  214 (326)
T ss_pred             ------HHHHHHHHHhCCCC---------CCcchHHHHHHHHHHHHhhhcccccccCCCCCEEEEEcCCCCCCCCCCCCc
Confidence                  33444444432111         234567788888877654311   1112345689999866554321    1


Q ss_pred             hHHHHHHHHhhhCCceeeEeeeCCC---------------Cchhhhhh--------ccccchhHHH
Q 027721          161 VAIMNAIFSAQRSMVPIDSCYLGAQ---------------NSAFLQQC--------MSDRLLTLLV  203 (219)
Q Consensus       161 i~lMN~iFaAqK~~V~IDvc~L~~~---------------~s~fLQQA--------~~~~~~~gLl  203 (219)
                      ...+.+.-.|++.+|+|.+..+|.+               +...||+.        +++.+++.|-
T Consensus       215 ~~~~~aa~~a~~~gi~i~~Ig~G~~~g~~~~~g~~~~~~~d~~~L~~iA~~tgG~~~~~~~~~~L~  280 (326)
T PRK13685        215 RGAYTAARTAKDQGVPISTISFGTPYGSVEINGQRQPVPVDDESLKKIAQLSGGEFYTAASLEELR  280 (326)
T ss_pred             ccHHHHHHHHHHcCCeEEEEEECCCCCCcCcCCceeeecCCHHHHHHHHHhcCCEEEEcCCHHHHH
Confidence            2235667778899999999999853               34577776        5566655543


No 17 
>cd01451 vWA_Magnesium_chelatase Magnesium chelatase: Mg-chelatase catalyses the insertion of Mg into protoporphyrin IX (Proto). In chlorophyll biosynthesis, insertion of Mg2+ into protoporphyrin IX is catalysed by magnesium chelatase in an ATP-dependent reaction. Magnesium chelatase is a three sub-unit (BchI, BchD and BchH) enzyme with a novel arrangement of domains: the C-terminal helical domain is located behind the nucleotide binding site. The BchD domain contains a AAA domain at its N-terminus and a VWA domain at its C-terminus. The VWA domain has been speculated to be involved in mediating protein-protein interactions.
Probab=95.30  E-value=1.3  Score=36.11  Aligned_cols=147  Identities=11%  Similarity=0.077  Sum_probs=85.8

Q ss_pred             eEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHH
Q 027721           15 LVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLL   94 (219)
Q Consensus        15 lLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~   94 (219)
                      +++++||++.--...  -.+..+-..+..|+..  .+....++.+|+...+....+.|....                 .
T Consensus         2 ~v~lvlD~SgSM~~~--~rl~~ak~a~~~~~~~--~~~~~d~v~lv~F~~~~~~~~~~~t~~-----------------~   60 (178)
T cd01451           2 LVIFVVDASGSMAAR--HRMAAAKGAVLSLLRD--AYQRRDKVALIAFRGTEAEVLLPPTRS-----------------V   60 (178)
T ss_pred             eEEEEEECCccCCCc--cHHHHHHHHHHHHHHH--hhcCCCEEEEEEECCCCceEEeCCCCC-----------------H
Confidence            678999998755433  1244455555555432  345778999999876656666664321                 2


Q ss_pred             HHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhh---hHHHHHHHHhh
Q 027721           95 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQY---VAIMNAIFSAQ  171 (219)
Q Consensus        95 ~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QY---i~lMN~iFaAq  171 (219)
                      +.+.+.+..-.   .      ...+.|..||..|+-.+++..+.  ..-+.+|+++|-..+...+.   .....+.-.+.
T Consensus        61 ~~~~~~l~~l~---~------~G~T~l~~aL~~a~~~l~~~~~~--~~~~~~ivliTDG~~~~g~~~~~~~~~~~~~~l~  129 (178)
T cd01451          61 ELAKRRLARLP---T------GGGTPLAAGLLAAYELAAEQARD--PGQRPLIVVITDGRANVGPDPTADRALAAARKLR  129 (178)
T ss_pred             HHHHHHHHhCC---C------CCCCcHHHHHHHHHHHHHHHhcC--CCCceEEEEECCCCCCCCCCchhHHHHHHHHHHH
Confidence            22333443321   1      12567999999999887322111  12357888888655432221   11245566677


Q ss_pred             hCCceeeEeeeCCC--Cchhhhhh
Q 027721          172 RSMVPIDSCYLGAQ--NSAFLQQC  193 (219)
Q Consensus       172 K~~V~IDvc~L~~~--~s~fLQQA  193 (219)
                      +.+|.|.+.-.+..  +..+|++.
T Consensus       130 ~~gi~v~~I~~~~~~~~~~~l~~i  153 (178)
T cd01451         130 ARGISALVIDTEGRPVRRGLAKDL  153 (178)
T ss_pred             hcCCcEEEEeCCCCccCccHHHHH
Confidence            88999877776532  34577776


No 18 
>cd01467 vWA_BatA_type VWA BatA type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=95.15  E-value=1.4  Score=35.48  Aligned_cols=136  Identities=10%  Similarity=0.052  Sum_probs=70.9

Q ss_pred             eEEEEEeCCccccccC---cccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHH
Q 027721           15 LVVVLLDTNPFFWSSS---SLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCA   91 (219)
Q Consensus        15 lLviIlD~np~~W~~~---~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~   91 (219)
                      -++++||++.--...+   ...+..    ....+..++......++.++..+.. .+.+.|-...               
T Consensus         4 ~vv~vlD~S~SM~~~~~~~~~r~~~----a~~~~~~~~~~~~~~~v~lv~f~~~-~~~~~~~~~~---------------   63 (180)
T cd01467           4 DIMIALDVSGSMLAQDFVKPSRLEA----AKEVLSDFIDRRENDRIGLVVFAGA-AFTQAPLTLD---------------   63 (180)
T ss_pred             eEEEEEECCcccccccCCCCCHHHH----HHHHHHHHHHhCCCCeEEEEEEcCC-eeeccCCCcc---------------
Confidence            4789999988643221   111222    2223445555566788999988554 3333332111               


Q ss_pred             HHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhh
Q 027721           92 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ  171 (219)
Q Consensus        92 ~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAq  171 (219)
                        .+.+.+++..-.....      ...+.+..||..|+-.+.+.     ..-+..|++++-..+... +.....+.=.++
T Consensus        64 --~~~~~~~l~~l~~~~~------~g~T~l~~al~~a~~~l~~~-----~~~~~~iiliTDG~~~~g-~~~~~~~~~~~~  129 (180)
T cd01467          64 --RESLKELLEDIKIGLA------GQGTAIGDAIGLAIKRLKNS-----EAKERVIVLLTDGENNAG-EIDPATAAELAK  129 (180)
T ss_pred             --HHHHHHHHHHhhhccc------CCCCcHHHHHHHHHHHHHhc-----CCCCCEEEEEeCCCCCCC-CCCHHHHHHHHH
Confidence              1112222221111000      12455777777777665443     123467888886544322 222233344567


Q ss_pred             hCCceeeEeeeCC
Q 027721          172 RSMVPIDSCYLGA  184 (219)
Q Consensus       172 K~~V~IDvc~L~~  184 (219)
                      +.+|+|.++.+|.
T Consensus       130 ~~gi~i~~i~ig~  142 (180)
T cd01467         130 NKGVRIYTIGVGK  142 (180)
T ss_pred             HCCCEEEEEEecC
Confidence            7899999999985


No 19 
>cd01466 vWA_C3HC4_type VWA C3HC4-type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, 
Probab=95.00  E-value=1.1  Score=35.84  Aligned_cols=140  Identities=15%  Similarity=0.182  Sum_probs=82.2

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ   95 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~   95 (219)
                      +++|||+++--.+.   .+..+-.++. ++...  +...+++.||..+.. .+.++|-.....             .-++
T Consensus         3 v~~vlD~S~SM~~~---rl~~ak~a~~-~l~~~--l~~~~~~~li~F~~~-~~~~~~~~~~~~-------------~~~~   62 (155)
T cd01466           3 LVAVLDVSGSMAGD---KLQLVKHALR-FVISS--LGDADRLSIVTFSTS-AKRLSPLRRMTA-------------KGKR   62 (155)
T ss_pred             EEEEEECCCCCCcH---HHHHHHHHHH-HHHHh--CCCcceEEEEEecCC-ccccCCCcccCH-------------HHHH
Confidence            67899998866543   2444433333 33322  345678999997664 566666321110             0123


Q ss_pred             HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCc
Q 027721           96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV  175 (219)
Q Consensus        96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V  175 (219)
                      .+.+.+..-..         ...+.+..||..|.-.+.+..   ......+|+++|-..+...      .....+.+.+|
T Consensus        63 ~~~~~i~~~~~---------~g~T~~~~al~~a~~~~~~~~---~~~~~~~iillTDG~~~~~------~~~~~~~~~~v  124 (155)
T cd01466          63 SAKRVVDGLQA---------GGGTNVVGGLKKALKVLGDRR---QKNPVASIMLLSDGQDNHG------AVVLRADNAPI  124 (155)
T ss_pred             HHHHHHHhccC---------CCCccHHHHHHHHHHHHhhcc---cCCCceEEEEEcCCCCCcc------hhhhcccCCCc
Confidence            44444443211         135678899999987765431   1234578999996544322      23456777899


Q ss_pred             eeeEeeeCC-CCchhhhhh
Q 027721          176 PIDSCYLGA-QNSAFLQQC  193 (219)
Q Consensus       176 ~IDvc~L~~-~~s~fLQQA  193 (219)
                      +|-+.-+|. .+..+||+.
T Consensus       125 ~v~~igig~~~~~~~l~~i  143 (155)
T cd01466         125 PIHTFGLGASHDPALLAFI  143 (155)
T ss_pred             eEEEEecCCCCCHHHHHHH
Confidence            999999973 355667664


No 20 
>cd01469 vWA_integrins_alpha_subunit Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.
Probab=94.46  E-value=2.2  Score=34.77  Aligned_cols=140  Identities=8%  Similarity=0.035  Sum_probs=72.0

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ   95 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~   95 (219)
                      ++++||.+--- +  +-.|.++..-+.-|++.+-.-....+|+||..+.. .+..+|-.....               .+
T Consensus         3 i~fvlD~S~S~-~--~~~f~~~k~fi~~~i~~l~~~~~~~rvgvv~fs~~-~~~~~~l~~~~~---------------~~   63 (177)
T cd01469           3 IVFVLDGSGSI-Y--PDDFQKVKNFLSTVMKKLDIGPTKTQFGLVQYSES-FRTEFTLNEYRT---------------KE   63 (177)
T ss_pred             EEEEEeCCCCC-C--HHHHHHHHHHHHHHHHHcCcCCCCcEEEEEEECCc-eeEEEecCccCC---------------HH
Confidence            68899985432 1  12344444445555553333344789999999776 455565332110               11


Q ss_pred             HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCc
Q 027721           96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV  175 (219)
Q Consensus        96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V  175 (219)
                      .+.+.++.-....        ..+.+..||..|.-.+-...+...+..+.-+++++-+.+......  .-++-.|++.+|
T Consensus        64 ~~~~~i~~~~~~~--------g~T~~~~AL~~a~~~l~~~~~g~R~~~~kv~illTDG~~~~~~~~--~~~~~~~k~~gv  133 (177)
T cd01469          64 EPLSLVKHISQLL--------GLTNTATAIQYVVTELFSESNGARKDATKVLVVITDGESHDDPLL--KDVIPQAEREGI  133 (177)
T ss_pred             HHHHHHHhCccCC--------CCccHHHHHHHHHHHhcCcccCCCCCCCeEEEEEeCCCCCCcccc--HHHHHHHHHCCc
Confidence            2223333211111        135566777666655422111111223334555665544333321  335567889999


Q ss_pred             eeeEeeeCC
Q 027721          176 PIDSCYLGA  184 (219)
Q Consensus       176 ~IDvc~L~~  184 (219)
                      .|-+..+|.
T Consensus       134 ~v~~Vgvg~  142 (177)
T cd01469         134 IRYAIGVGG  142 (177)
T ss_pred             EEEEEEecc
Confidence            999999974


No 21 
>cd01458 vWA_ku Ku70/Ku80 N-terminal domain. The Ku78 heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks (DSB) in a preferred orientation. DSB's are repaired by either homologues recombination or non-homologues end joining and facilitate repair by the non-homologous end-joining pathway (NHEJ). The Ku heterodimer is required for accurate process that tends to preserve the sequence at the junction. Ku78 is found in all three kingdoms of life. However, only the eukaryotic proteins have a vWA domain fused to them at their N-termini. The vWA domain is not involved in DNA binding but may very likey mediate Ku78's interactions with other proteins. Members of this subgroup lack the conserved MIDAS motif.
Probab=94.32  E-value=2.8  Score=35.35  Aligned_cols=155  Identities=15%  Similarity=0.151  Sum_probs=90.5

Q ss_pred             CeEEEEEeCCcccccc----CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcce---------EEecCCCCCCCC
Q 027721           14 SLVVVLLDTNPFFWSS----SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD---------YVYDSSSTGNQS   80 (219)
Q Consensus        14 slLviIlD~np~~W~~----~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~---------~LYp~~~~~~~~   80 (219)
                      .-++++||++|.=-..    .+..|..++..+.-|+..=+..+...++.+|+.++....         .++|-...    
T Consensus         2 e~ivf~iDvS~SM~~~~~~~~~s~l~~a~~~i~~~~~~ki~~~~~D~vGlilf~t~~~~~~~~~~~i~v~~~l~~~----   77 (218)
T cd01458           2 ESVVFLVDVSPSMFESKDGEYESPFEEALKCIRQLMKSKIISSPKDLVGVVFYGTEESKNPVGYENIYVLLDLDTP----   77 (218)
T ss_pred             cEEEEEEeCCHHHcCCCCCCCCChHHHHHHHHHHHHHhceeCCCCCeEEEEEEcccCCCCcCCCCceEEeecCCCC----
Confidence            3578999999854322    245688888888877777666788899999998776432         12221110    


Q ss_pred             CCCCCchhHHHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCc---
Q 027721           81 VGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP---  157 (219)
Q Consensus        81 ~~~~~~~~~~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~---  157 (219)
                               ....++.+.+++.......... ......+.+..||-.|+=.+.+..   ....+.||+++|-.++..   
T Consensus        78 ---------~~~~l~~l~~~~~~~~~~~~~~-~~~~~~~~l~~aL~~a~~~~~~~~---~~~~~k~IvL~TDg~~p~~~~  144 (218)
T cd01458          78 ---------GAERVEDLKELIEPGGLSFAGQ-VGDSGQVSLSDALWVCLDLFSKGK---KKKSHKRIFLFTNNDDPHGGD  144 (218)
T ss_pred             ---------CHHHHHHHHHHhhcchhhhccc-CCCCCCccHHHHHHHHHHHHHhcc---ccccccEEEEECCCCCCCCCC
Confidence                     0123444444443211000000 001246778889988888877621   124567899999766542   


Q ss_pred             -hhhhHHHHHHHHhhhCCceeeEeeeCCC
Q 027721          158 -EQYVAIMNAIFSAQRSMVPIDSCYLGAQ  185 (219)
Q Consensus       158 -~QYi~lMN~iFaAqK~~V~IDvc~L~~~  185 (219)
                       ...-.+-..+=..++.+|.|.++.++..
T Consensus       145 ~~~~~~~~~~a~~l~~~gI~i~~i~i~~~  173 (218)
T cd01458         145 SIKDSQAAVKAEDLKDKGIELELFPLSSP  173 (218)
T ss_pred             HHHHHHHHHHHHHHHhCCcEEEEEecCCC
Confidence             1222223333345567999999999754


No 22 
>PF13768 VWA_3:  von Willebrand factor type A domain
Probab=93.90  E-value=2.5  Score=33.22  Aligned_cols=141  Identities=18%  Similarity=0.189  Sum_probs=82.4

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ   95 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~   95 (219)
                      +++|||++..--+.    ...+-+.+..+++.   +...-++.||+.+.+ .+.++|....            ..+.-++
T Consensus         3 vvilvD~S~Sm~g~----~~~~k~al~~~l~~---L~~~d~fnii~f~~~-~~~~~~~~~~------------~~~~~~~   62 (155)
T PF13768_consen    3 VVILVDTSGSMSGE----KELVKDALRAILRS---LPPGDRFNIIAFGSS-VRPLFPGLVP------------ATEENRQ   62 (155)
T ss_pred             EEEEEeCCCCCCCc----HHHHHHHHHHHHHh---CCCCCEEEEEEeCCE-eeEcchhHHH------------HhHHHHH
Confidence            68999997765443    14444455555555   667779999999774 5667776321            1112245


Q ss_pred             HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEec-CCCCchhhhHHHHHHHHhhhCC
Q 027721           96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG-SPDGPEQYVAIMNAIFSAQRSM  174 (219)
Q Consensus        96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~-S~d~~~QYi~lMN~iFaAqK~~  174 (219)
                      +..++++.-...        ...+-+.+||-.|+--.      .......+|++++- .++...+.+  ...+=. .+.+
T Consensus        63 ~a~~~I~~~~~~--------~G~t~l~~aL~~a~~~~------~~~~~~~~IilltDG~~~~~~~~i--~~~v~~-~~~~  125 (155)
T PF13768_consen   63 EALQWIKSLEAN--------SGGTDLLAALRAALALL------QRPGCVRAIILLTDGQPVSGEEEI--LDLVRR-ARGH  125 (155)
T ss_pred             HHHHHHHHhccc--------CCCccHHHHHHHHHHhc------ccCCCccEEEEEEeccCCCCHHHH--HHHHHh-cCCC
Confidence            555666652221        12466778877776432      12345677888883 334444332  333322 3367


Q ss_pred             ceeeEeeeCCC-Cchhhhhh
Q 027721          175 VPIDSCYLGAQ-NSAFLQQC  193 (219)
Q Consensus       175 V~IDvc~L~~~-~s~fLQQA  193 (219)
                      ++|.++.+|.. +..+|++.
T Consensus       126 ~~i~~~~~g~~~~~~~L~~L  145 (155)
T PF13768_consen  126 IRIFTFGIGSDADADFLREL  145 (155)
T ss_pred             ceEEEEEECChhHHHHHHHH
Confidence            99999999863 55577664


No 23 
>TIGR03436 acidobact_VWFA VWFA-related Acidobacterial domain. Members of this family are bacterial domains that include a region related to the von Willebrand factor type A (VWFA) domain (pfam00092). These domains are restricted to, and have undergone a large paralogous family expansion in, the Acidobacteria, including Solibacter usitatus and Acidobacterium capsulatum ATCC 51196.
Probab=92.59  E-value=6.8  Score=34.45  Aligned_cols=146  Identities=16%  Similarity=0.119  Sum_probs=81.9

Q ss_pred             CCCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHH
Q 027721           12 DVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCA   91 (219)
Q Consensus        12 ~~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~   91 (219)
                      .|--++++||++.---.    .+..+.+++..|++..+  ....+++|+..+. +.+.+.|-...             .+
T Consensus        52 ~p~~vvlvlD~SgSM~~----~~~~a~~a~~~~l~~~l--~~~d~v~lv~f~~-~~~~~~~~t~~-------------~~  111 (296)
T TIGR03436        52 LPLTVGLVIDTSGSMRN----DLDRARAAAIRFLKTVL--RPNDRVFVVTFNT-RLRLLQDFTSD-------------PR  111 (296)
T ss_pred             CCceEEEEEECCCCchH----HHHHHHHHHHHHHHhhC--CCCCEEEEEEeCC-ceeEeecCCCC-------------HH
Confidence            36678999999754322    26667778888887644  5788999999875 35555543211             11


Q ss_pred             HHHHHHHHHHHhhhhcCCC---C---CCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHH
Q 027721           92 TLLQNLEEFMNKDEQLGKQ---E---PEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMN  165 (219)
Q Consensus        92 ~v~~~l~~li~~~~~~~~~---~---~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN  165 (219)
                      .+.+.|.++    .....+   .   .......+.|..||.+|.-...+..... ..-+..|++++-+.|..+++ .+--
T Consensus       112 ~l~~~l~~l----~~~~~~~~~~~~~~~~~~g~T~l~~al~~aa~~~~~~~~~~-~p~rk~iIllTDG~~~~~~~-~~~~  185 (296)
T TIGR03436       112 LLEAALNRL----KPPLRTDYNSSGAFVRDGGGTALYDAITLAALEQLANALAG-IPGRKALIVISDGGDNRSRD-TLER  185 (296)
T ss_pred             HHHHHHHhc----cCCCccccccccccccCCCcchhHHHHHHHHHHHHHHhhcC-CCCCeEEEEEecCCCcchHH-HHHH
Confidence            223333332    110000   0   0000134567777766643332221111 11234588888777765544 4455


Q ss_pred             HHHHhhhCCceeeEeeeC
Q 027721          166 AIFSAQRSMVPIDSCYLG  183 (219)
Q Consensus       166 ~iFaAqK~~V~IDvc~L~  183 (219)
                      ++-.|++.+|+|-+.-++
T Consensus       186 ~~~~~~~~~v~vy~I~~~  203 (296)
T TIGR03436       186 AIDAAQRADVAIYSIDAR  203 (296)
T ss_pred             HHHHHHHcCCEEEEeccC
Confidence            666789999999999886


No 24 
>PRK13406 bchD magnesium chelatase subunit D; Provisional
Probab=92.49  E-value=12  Score=37.13  Aligned_cols=150  Identities=11%  Similarity=0.047  Sum_probs=91.4

Q ss_pred             CCCCCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhH
Q 027721           10 SDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSL   89 (219)
Q Consensus        10 ~~~~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~   89 (219)
                      ...+.+++++||++---...   -+..+-.++..|+..  +.....+|+||+.......++.|...+             
T Consensus       398 ~~~~~~vvfvvD~SGSM~~~---rl~~aK~a~~~ll~~--ay~~rD~v~lI~F~g~~a~~~lppT~~-------------  459 (584)
T PRK13406        398 QRSETTTIFVVDASGSAALH---RLAEAKGAVELLLAE--AYVRRDQVALVAFRGRGAELLLPPTRS-------------  459 (584)
T ss_pred             ccCCccEEEEEECCCCCcHh---HHHHHHHHHHHHHHh--hcCCCCEEEEEEECCCceeEEcCCCcC-------------
Confidence            34568999999998754222   355555555555543  345788999999988888999885542             


Q ss_pred             HHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCc---------hhh
Q 027721           90 CATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP---------EQY  160 (219)
Q Consensus        90 ~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~---------~QY  160 (219)
                          .+.+.+.+..-   ..      ...+.|+.||.+|+-...+..+.   ...++|+++|-+....         ...
T Consensus       460 ----~~~~~~~L~~l---~~------gGgTpL~~gL~~A~~~l~~~~~~---~~~~~iVLlTDG~~n~~~~~~~~~~~~~  523 (584)
T PRK13406        460 ----LVRAKRSLAGL---PG------GGGTPLAAGLDAAAALALQVRRK---GMTPTVVLLTDGRANIARDGTAGRAQAE  523 (584)
T ss_pred             ----HHHHHHHHhcC---CC------CCCChHHHHHHHHHHHHHHhccC---CCceEEEEEeCCCCCCCccccccccchh
Confidence                22333333321   11      13688999999999887776432   3467888888544321         112


Q ss_pred             hHHHHHHHHhhhCCceeeEeeeCCCCchhhhhh
Q 027721          161 VAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQC  193 (219)
Q Consensus       161 i~lMN~iFaAqK~~V~IDvc~L~~~~s~fLQQA  193 (219)
                      .....+.-.+.+.+|++=+.-.+.....++++.
T Consensus       524 ~~~~~~a~~~~~~gi~~~vId~g~~~~~~~~~L  556 (584)
T PRK13406        524 EDALAAARALRAAGLPALVIDTSPRPQPQARAL  556 (584)
T ss_pred             hHHHHHHHHHHhcCCeEEEEecCCCCcHHHHHH
Confidence            233455566666777665555543343456555


No 25 
>cd01476 VWA_integrin_invertebrates VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in  cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest  any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.
Probab=91.92  E-value=5.2  Score=31.55  Aligned_cols=137  Identities=9%  Similarity=0.061  Sum_probs=73.8

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCC-cceEEecCCCCCCCCCCCCCchhHHHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN-SCDYVYDSSSTGNQSVGNGRMPSLCATLL   94 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~-~~~~LYp~~~~~~~~~~~~~~~~~~~~v~   94 (219)
                      ++++||++.-- ..   .+.+..+.+..+++.........+++||..+.. .....+|-.....               .
T Consensus         3 v~~llD~S~Sm-~~---~~~~~~~~~~~~~~~l~~~~~~~~v~lv~f~~~~~~~~~~~l~~~~~---------------~   63 (163)
T cd01476           3 LLFVLDSSGSV-RG---KFEKYKKYIERIVEGLEIGPTATRVALITYSGRGRQRVRFNLPKHND---------------G   63 (163)
T ss_pred             EEEEEeCCcch-hh---hHHHHHHHHHHHHHhcCCCCCCcEEEEEEEcCCCceEEEecCCCCCC---------------H
Confidence            57899996533 21   244455555555655444455799999998664 4566666432210               2


Q ss_pred             HHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhh-C
Q 027721           95 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR-S  173 (219)
Q Consensus        95 ~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK-~  173 (219)
                      +.+.+.+..-....        ..+.+..||..|.-.+.+.. ...+.....|++++-..... .   .....-.+++ .
T Consensus        64 ~~l~~~i~~l~~~g--------g~T~l~~aL~~a~~~l~~~~-~~r~~~~~~villTDG~~~~-~---~~~~~~~l~~~~  130 (163)
T cd01476          64 EELLEKVDNLRFIG--------GTTATGAAIEVALQQLDPSE-GRREGIPKVVVVLTDGRSHD-D---PEKQARILRAVP  130 (163)
T ss_pred             HHHHHHHHhCccCC--------CCccHHHHHHHHHHHhcccc-CCCCCCCeEEEEECCCCCCC-c---hHHHHHHHhhcC
Confidence            23333333211111        24678888888887775321 11123445666676432211 1   1233344555 8


Q ss_pred             CceeeEeeeCC
Q 027721          174 MVPIDSCYLGA  184 (219)
Q Consensus       174 ~V~IDvc~L~~  184 (219)
                      +|.|-++.+|+
T Consensus       131 ~v~v~~vg~g~  141 (163)
T cd01476         131 NIETFAVGTGD  141 (163)
T ss_pred             CCEEEEEECCC
Confidence            88888888864


No 26 
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=91.65  E-value=7.5  Score=32.89  Aligned_cols=147  Identities=10%  Similarity=0.091  Sum_probs=73.8

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ   95 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~   95 (219)
                      |+++||++---...   .|.++..-+..+++..-.-...-+|.||..+.. .+..+|-.....               .+
T Consensus         5 lvfllD~S~Sm~~~---~~~~~k~f~~~l~~~l~~~~~~~rvglv~fs~~-~~~~~~l~~~~~---------------~~   65 (224)
T cd01475           5 LVFLIDSSRSVRPE---NFELVKQFLNQIIDSLDVGPDATRVGLVQYSST-VKQEFPLGRFKS---------------KA   65 (224)
T ss_pred             EEEEEeCCCCCCHH---HHHHHHHHHHHHHHhcccCCCccEEEEEEecCc-eeEEecccccCC---------------HH
Confidence            78999996654322   355555555555555433344679999997665 555666432110               12


Q ss_pred             HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCC--CceEE-EEecCCCCchhhhHHHHHHHHhhh
Q 027721           96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPRIL-CLQGSPDGPEQYVAIMNAIFSAQR  172 (219)
Q Consensus        96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~--~~RIL-vis~S~d~~~QYi~lMN~iFaAqK  172 (219)
                      .+.+.+..-....        ..+....||-.|+-..-...+...+..  -+|++ +++-+... .   .+...+-.+++
T Consensus        66 ~l~~~i~~i~~~~--------~~t~tg~AL~~a~~~~~~~~~g~r~~~~~~~kvvillTDG~s~-~---~~~~~a~~lk~  133 (224)
T cd01475          66 DLKRAVRRMEYLE--------TGTMTGLAIQYAMNNAFSEAEGARPGSERVPRVGIVVTDGRPQ-D---DVSEVAAKARA  133 (224)
T ss_pred             HHHHHHHhCcCCC--------CCChHHHHHHHHHHHhCChhcCCCCCCCCCCeEEEEEcCCCCc-c---cHHHHHHHHHH
Confidence            2333333211111        122233344444322111000011111  14654 45433221 2   23455667889


Q ss_pred             CCceeeEeeeCCCCchhhhhh
Q 027721          173 SMVPIDSCYLGAQNSAFLQQC  193 (219)
Q Consensus       173 ~~V~IDvc~L~~~~s~fLQQA  193 (219)
                      .+|.|-++-+|.-+...|++.
T Consensus       134 ~gv~i~~VgvG~~~~~~L~~i  154 (224)
T cd01475         134 LGIEMFAVGVGRADEEELREI  154 (224)
T ss_pred             CCcEEEEEeCCcCCHHHHHHH
Confidence            999999999986566666665


No 27 
>cd01450 vWFA_subfamily_ECM Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A
Probab=91.32  E-value=5.5  Score=30.64  Aligned_cols=148  Identities=11%  Similarity=0.068  Sum_probs=80.7

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ   95 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~   95 (219)
                      +++|||++.---..   .+..+.+.+-.|++........-++.++..+.+ .+...|.....       +        .+
T Consensus         3 i~~llD~S~Sm~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~li~f~~~-~~~~~~~~~~~-------~--------~~   63 (161)
T cd01450           3 IVFLLDGSESVGPE---NFEKVKDFIEKLVEKLDIGPDKTRVGLVQYSDD-VRVEFSLNDYK-------S--------KD   63 (161)
T ss_pred             EEEEEeCCCCcCHH---HHHHHHHHHHHHHHheeeCCCceEEEEEEEcCC-ceEEEECCCCC-------C--------HH
Confidence            56899986542111   233333334444444444445778888887664 44455543221       0        22


Q ss_pred             HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCc
Q 027721           96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV  175 (219)
Q Consensus        96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V  175 (219)
                      ++.+.+..-.....       ..+.+..||..|.-++.+... .....+..|++++-+.+....  ..=..+-..++.+|
T Consensus        64 ~~~~~i~~~~~~~~-------~~t~~~~al~~a~~~~~~~~~-~~~~~~~~iiliTDG~~~~~~--~~~~~~~~~~~~~v  133 (161)
T cd01450          64 DLLKAVKNLKYLGG-------GGTNTGKALQYALEQLFSESN-ARENVPKVIIVLTDGRSDDGG--DPKEAAAKLKDEGI  133 (161)
T ss_pred             HHHHHHHhcccCCC-------CCccHHHHHHHHHHHhccccc-ccCCCCeEEEEECCCCCCCCc--chHHHHHHHHHCCC
Confidence            33333332111100       146677888877777766532 122445667777766554332  34566777788899


Q ss_pred             eeeEeeeCCCCchhhhh
Q 027721          176 PIDSCYLGAQNSAFLQQ  192 (219)
Q Consensus       176 ~IDvc~L~~~~s~fLQQ  192 (219)
                      +|.++.+|..+-..|.+
T Consensus       134 ~v~~i~~g~~~~~~l~~  150 (161)
T cd01450         134 KVFVVGVGPADEEELRE  150 (161)
T ss_pred             EEEEEeccccCHHHHHH
Confidence            99999997534444443


No 28 
>cd01474 vWA_ATR ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.
Probab=91.03  E-value=7.5  Score=31.71  Aligned_cols=148  Identities=9%  Similarity=-0.015  Sum_probs=77.8

Q ss_pred             eEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHH
Q 027721           15 LVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLL   94 (219)
Q Consensus        15 lLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~   94 (219)
                      =++++||++.---..    +.+.++-+..++..++  +...++.||..+.. .+.++|-....             +.+.
T Consensus         6 Dvv~llD~SgSm~~~----~~~~~~~~~~l~~~~~--~~~~rvglv~Fs~~-~~~~~~l~~~~-------------~~~~   65 (185)
T cd01474           6 DLYFVLDKSGSVAAN----WIEIYDFVEQLVDRFN--SPGLRFSFITFSTR-ATKILPLTDDS-------------SAII   65 (185)
T ss_pred             eEEEEEeCcCchhhh----HHHHHHHHHHHHHHcC--CCCcEEEEEEecCC-ceEEEeccccH-------------HHHH
Confidence            478999986543111    2222222333333332  35689999998754 56677743211             1122


Q ss_pred             HHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCC
Q 027721           95 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM  174 (219)
Q Consensus        95 ~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~  174 (219)
                      +.+.. +..-   ..      ...+.+..||..|.-.+.+..+.. +.-...|++++-+.+............-.+++.+
T Consensus        66 ~~l~~-l~~~---~~------~g~T~~~~aL~~a~~~l~~~~~~~-r~~~~~villTDG~~~~~~~~~~~~~a~~l~~~g  134 (185)
T cd01474          66 KGLEV-LKKV---TP------SGQTYIHEGLENANEQIFNRNGGG-RETVSVIIALTDGQLLLNGHKYPEHEAKLSRKLG  134 (185)
T ss_pred             HHHHH-Hhcc---CC------CCCCcHHHHHHHHHHHHHhhccCC-CCCCeEEEEEcCCCcCCCCCcchHHHHHHHHHcC
Confidence            22221 1110   00      125678888888876664322211 1122456667644432112222355666788899


Q ss_pred             ceeeEeeeCCCCchhhhhh
Q 027721          175 VPIDSCYLGAQNSAFLQQC  193 (219)
Q Consensus       175 V~IDvc~L~~~~s~fLQQA  193 (219)
                      |.|-+..+++-+...|++.
T Consensus       135 v~i~~vgv~~~~~~~L~~i  153 (185)
T cd01474         135 AIVYCVGVTDFLKSQLINI  153 (185)
T ss_pred             CEEEEEeechhhHHHHHHH
Confidence            9999998864455566666


No 29 
>PF03731 Ku_N:  Ku70/Ku80 N-terminal alpha/beta domain;  InterPro: IPR005161 The Ku heterodimer (composed of Ku70 P12956 from SWISSPROT and Ku80 P13010 from SWISSPROT) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the N-terminal alpha/beta domain. This domain only makes a small contribution to the dimer interface. The domain comprises a six stranded beta sheet of the Rossman fold [].; PDB: 1JEQ_A 1JEY_A.
Probab=90.24  E-value=8.5  Score=32.34  Aligned_cols=158  Identities=15%  Similarity=0.133  Sum_probs=83.5

Q ss_pred             EEEEEeCCcccccc---CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHH
Q 027721           16 VVVLLDTNPFFWSS---SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCAT   92 (219)
Q Consensus        16 LviIlD~np~~W~~---~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~   92 (219)
                      ++++||++|.=-..   ....|..+++.+--++---...+....|.||.+++.++.=- +   ...+-++......+...
T Consensus         2 ~vflID~s~sM~~~~~~~~~~l~~al~~i~~~~~~ki~~~~kD~vgvvl~gt~~t~n~-~---~~~~~~~i~~l~~l~~~   77 (224)
T PF03731_consen    2 TVFLIDVSPSMFEPSSESESPLEEALKAIEDLMQQKIISSPKDEVGVVLFGTDETNNP-D---EDSGYENIFVLQPLDPP   77 (224)
T ss_dssp             EEEEEE-SCGGGS-BTTCS-HHHHHHHHHHHHHHHHHHTT---EEEEEEES-SS-BST-----TTT-STTEEEEEECC--
T ss_pred             EEEEEECCHHHCCCCCCcchhHHHHHHHHHHHHHHHHcCCCCCeEEEEEEcCCCCCCc-c---cccCCCceEEeecCCcc
Confidence            68999999986644   22279999999999999999999999999999987765310 0   00000000000011001


Q ss_pred             HHHHHHHHHHhhhh------cCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCC---CchhhhHH
Q 027721           93 LLQNLEEFMNKDEQ------LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPD---GPEQYVAI  163 (219)
Q Consensus        93 v~~~l~~li~~~~~------~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d---~~~QYi~l  163 (219)
                      -.+.|+++.+....      ...     ......+..||-.|.-.+.+.. ......+.||++++-.++   ..+++-.+
T Consensus        78 ~~~~l~~L~~~~~~~~~~~~~~~-----~~~~~~l~~al~v~~~~~~~~~-~~~k~~~krI~l~Td~d~p~~~~~~~~~~  151 (224)
T PF03731_consen   78 SAERLKELEELLKPGDKFENFFS-----GSDEGDLSDALWVASDMFRERT-CKKKKNKKRIFLFTDNDGPHEDDDELERI  151 (224)
T ss_dssp             BHHHHHHHHTTSHHHHHHHHHC------SSS---HHHHHHHHHHHHHCHC-TTS-ECEEEEEEEES-SSTTT-CCCHHHH
T ss_pred             CHHHHHHHHHhhcccccccccCC-----CCCccCHHHHHHHHHHHHHHHh-hcccCCCcEEEEEeCCCCCCCCHHHHHHH
Confidence            12223333222111      001     1234567777777777665521 122356789999995433   34555444


Q ss_pred             HHH--HHHhhhCCceeeEeeeC
Q 027721          164 MNA--IFSAQRSMVPIDSCYLG  183 (219)
Q Consensus       164 MN~--iFaAqK~~V~IDvc~L~  183 (219)
                      ..-  +--.+..+|.|+++.|+
T Consensus       152 ~~~l~~~Dl~~~~i~~~~~~l~  173 (224)
T PF03731_consen  152 IQKLKAKDLQDNGIEIELFFLP  173 (224)
T ss_dssp             HHHHHHHHHHHHTEEEEEEECT
T ss_pred             HHhhccccchhcCcceeEeecC
Confidence            444  55678899999999993


No 30 
>cd01463 vWA_VGCC_like VWA Voltage gated Calcium channel like: Voltage-gated calcium channels are a complex of five proteins: alpha 1, beta 1, gamma, alpha 2 and delta. The alpha 2 and delta subunits result from proteolytic processing of a single gene product and carries at its N-terminus the VWA and cache domains, The alpha 2 delta gene family has orthologues in D. melanogaster and C. elegans but none have been detected in aither A. thaliana or yeast. The exact biochemical function of the VWA domain  is not known but the alpha 2 delta complex has been shown to regulate various functional properties of the channel complex.
Probab=89.65  E-value=10  Score=31.06  Aligned_cols=153  Identities=14%  Similarity=0.110  Sum_probs=81.7

Q ss_pred             CCCCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHH
Q 027721           11 DDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLC   90 (219)
Q Consensus        11 ~~~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~   90 (219)
                      ..|.-++++||++.--.+.   .++.+-..+..+++   .+....++.|++.+.. .+.++|..........    ..- 
T Consensus        11 ~~p~~vv~llD~SgSM~~~---~l~~ak~~~~~ll~---~l~~~d~v~lv~F~~~-~~~~~~~~~~~~~~~~----~~~-   78 (190)
T cd01463          11 TSPKDIVILLDVSGSMTGQ---RLHLAKQTVSSILD---TLSDNDFFNIITFSNE-VNPVVPCFNDTLVQAT----TSN-   78 (190)
T ss_pred             cCCceEEEEEECCCCCCcH---HHHHHHHHHHHHHH---hCCCCCEEEEEEeCCC-eeEEeeecccceEecC----HHH-
Confidence            4578899999998754332   24444444444443   4566779999987665 5666664322100000    000 


Q ss_pred             HHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhh---c-cCCCCCCceEEEEecCCCCchhhhHHHHH
Q 027721           91 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVF---R-SGLLHPQPRILCLQGSPDGPEQYVAIMNA  166 (219)
Q Consensus        91 ~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~---~-~~~~~~~~RILvis~S~d~~~QYi~lMN~  166 (219)
                         .+.+.+.+..-   ...      ..+.+..||-.|+-.+.+..   . .........|++++-+.+....-      
T Consensus        79 ---~~~~~~~l~~l---~~~------G~T~~~~al~~a~~~l~~~~~~~~~~~~~~~~~~iillTDG~~~~~~~------  140 (190)
T cd01463          79 ---KKVLKEALDML---EAK------GIANYTKALEFAFSLLLKNLQSNHSGSRSQCNQAIMLITDGVPENYKE------  140 (190)
T ss_pred             ---HHHHHHHHhhC---CCC------CcchHHHHHHHHHHHHHHhhhcccccccCCceeEEEEEeCCCCCcHhH------
Confidence               22333333221   111      24567888888887666521   1 11123445788888554432211      


Q ss_pred             HHHhh---h---CCceeeEeeeCCC--Cchhhhhh
Q 027721          167 IFSAQ---R---SMVPIDSCYLGAQ--NSAFLQQC  193 (219)
Q Consensus       167 iFaAq---K---~~V~IDvc~L~~~--~s~fLQQA  193 (219)
                      ++.+.   +   .+|.|-++.+|.+  +..+||+.
T Consensus       141 ~~~~~~~~~~~~~~v~i~tigiG~~~~d~~~L~~l  175 (190)
T cd01463         141 IFDKYNWDKNSEIPVRVFTYLIGREVTDRREIQWM  175 (190)
T ss_pred             HHHHhcccccCCCcEEEEEEecCCccccchHHHHH
Confidence            22222   2   2588888888854  46788876


No 31 
>cd01462 VWA_YIEM_type VWA YIEM type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=88.71  E-value=9.8  Score=29.67  Aligned_cols=138  Identities=16%  Similarity=0.169  Sum_probs=74.3

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ   95 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~   95 (219)
                      ++|+||+++---+.   .+...-..+..++.  .......++.||..+.......++....                 ..
T Consensus         3 v~illD~SgSM~~~---k~~~a~~~~~~l~~--~~~~~~~~v~li~F~~~~~~~~~~~~~~-----------------~~   60 (152)
T cd01462           3 VILLVDQSGSMYGA---PEEVAKAVALALLR--IALAENRDTYLILFDSEFQTKIVDKTDD-----------------LE   60 (152)
T ss_pred             EEEEEECCCCCCCC---HHHHHHHHHHHHHH--HHHHcCCcEEEEEeCCCceEEecCCccc-----------------HH
Confidence            68899998754322   12222222333333  2233567899998876632222221110                 12


Q ss_pred             HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCC-CCchhhhHHHHHHHHhhhCC
Q 027721           96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSP-DGPEQYVAIMNAIFSAQRSM  174 (219)
Q Consensus        96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~-d~~~QYi~lMN~iFaAqK~~  174 (219)
                      ++.+.+... .  .      ...+.+..||..++-.+.+..     .-+.+|+++|-.. +.....  ++.....+++.+
T Consensus        61 ~~~~~l~~~-~--~------~ggT~l~~al~~a~~~l~~~~-----~~~~~ivliTDG~~~~~~~~--~~~~~~~~~~~~  124 (152)
T cd01462          61 EPVEFLSGV-Q--L------GGGTDINKALRYALELIERRD-----PRKADIVLITDGYEGGVSDE--LLREVELKRSRV  124 (152)
T ss_pred             HHHHHHhcC-C--C------CCCcCHHHHHHHHHHHHHhcC-----CCCceEEEECCCCCCCCCHH--HHHHHHHHHhcC
Confidence            222222211 1  1      124668888888876665531     2346899998663 322222  235567778889


Q ss_pred             ceeeEeeeCCCCchhhh
Q 027721          175 VPIDSCYLGAQNSAFLQ  191 (219)
Q Consensus       175 V~IDvc~L~~~~s~fLQ  191 (219)
                      +.|-+..+|...-..|.
T Consensus       125 ~~v~~~~~g~~~~~~~~  141 (152)
T cd01462         125 ARFVALALGDHGNPGYD  141 (152)
T ss_pred             cEEEEEEecCCCCchHH
Confidence            99999999865444443


No 32 
>cd01471 vWA_micronemal_protein Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.
Probab=86.33  E-value=16  Score=29.53  Aligned_cols=73  Identities=8%  Similarity=-0.012  Sum_probs=44.1

Q ss_pred             ccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCceeeEeeeCCC-Cchhhhhh
Q 027721          118 CSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQC  193 (219)
Q Consensus       118 ~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~-~s~fLQQA  193 (219)
                      .+.++.||-.|.-.+.+. +...+....+|++++-+......  ......-.+++.+|.|-++.+|.. +..+|+..
T Consensus        84 ~T~l~~aL~~a~~~l~~~-~~~r~~~~~~villTDG~~~~~~--~~~~~a~~l~~~gv~v~~igiG~~~d~~~l~~i  157 (186)
T cd01471          84 STNTTSALLVVEKHLFDT-RGNRENAPQLVIIMTDGIPDSKF--RTLKEARKLRERGVIIAVLGVGQGVNHEENRSL  157 (186)
T ss_pred             CccHHHHHHHHHHHhhcc-CCCcccCceEEEEEccCCCCCCc--chhHHHHHHHHCCCEEEEEEeehhhCHHHHHHh
Confidence            567888888887666543 22223445677778754432211  223445566677898888888743 55667654


No 33 
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=84.07  E-value=57  Score=33.95  Aligned_cols=151  Identities=12%  Similarity=0.095  Sum_probs=91.2

Q ss_pred             eEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHH
Q 027721           15 LVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLL   94 (219)
Q Consensus        15 lLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~   94 (219)
                      .+++|||++.-=-+.+  .+..+-.++..|+-.-  +...-++.||.-+.. +..+.|-..-..            ...+
T Consensus       306 ~VVLVLDvSGSM~g~d--RL~~lkqAA~~fL~~~--l~~~DrVGLVtFsss-A~vl~pLt~Its------------~~dr  368 (863)
T TIGR00868       306 IVCLVLDKSGSMTVED--RLKRMNQAAKLFLLQT--VEKGSWVGMVTFDSA-AYIKNELIQITS------------SAER  368 (863)
T ss_pred             eEEEEEECCccccccC--HHHHHHHHHHHHHHHh--CCCCCEEEEEEECCc-eeEeeccccCCc------------HHHH
Confidence            5889999976532221  2444444455554332  456789999998765 445555322110            0113


Q ss_pred             HHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCC
Q 027721           95 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM  174 (219)
Q Consensus        95 ~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~  174 (219)
                      +.+.+.+...   .       ...+.|..||.+|+-.+++..+   .....+|++++-+.|...     -.|+=.+++.+
T Consensus       369 ~aL~~~L~~~---A-------~GGT~I~~GL~~Alq~L~~~~~---~~~~~~IILLTDGedn~~-----~~~l~~lk~~g  430 (863)
T TIGR00868       369 DALTANLPTA---A-------SGGTSICSGLKAAFQVIKKSYQ---STDGSEIVLLTDGEDNTI-----SSCFEEVKQSG  430 (863)
T ss_pred             HHHHHhhccc---c-------CCCCcHHHHHHHHHHHHHhccc---ccCCCEEEEEeCCCCCCH-----HHHHHHHHHcC
Confidence            3333333211   1       1357799999999988877532   123467888887666432     24555677889


Q ss_pred             ceeeEeeeCCCCchhhhhh--------ccccchh
Q 027721          175 VPIDSCYLGAQNSAFLQQC--------MSDRLLT  200 (219)
Q Consensus       175 V~IDvc~L~~~~s~fLQQA--------~~~~~~~  200 (219)
                      |+|.+..+|......|++.        +.+.+.+
T Consensus       431 VtI~TIg~G~dad~~L~~IA~~TGG~~f~asd~~  464 (863)
T TIGR00868       431 AIIHTIALGPSAAKELEELSDMTGGLRFYASDQA  464 (863)
T ss_pred             CEEEEEEeCCChHHHHHHHHHhcCCEEEEeCCHH
Confidence            9999999997666678877        6666554


No 34 
>TIGR02031 BchD-ChlD magnesium chelatase ATPase subunit D. This model represents one of two ATPase subunits of the trimeric magnesium chelatase responsible for insertion of magnesium ion into protoporphyrin IX. This is an essential step in the biosynthesis of both chlorophyll and bacteriochlorophyll. This subunit is found in green plants, photosynthetic algae, cyanobacteria and other photosynthetic bacteria. Unlike subunit I (TIGR02030), this subunit is not found in archaea.
Probab=83.60  E-value=22  Score=35.03  Aligned_cols=109  Identities=15%  Similarity=0.117  Sum_probs=66.1

Q ss_pred             CCCCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHH
Q 027721           11 DDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLC   90 (219)
Q Consensus        11 ~~~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~   90 (219)
                      ....+++++||++--- .  .-.+..+-..+..|++  .+.....+|++|+.......+++|...+              
T Consensus       405 ~~~~~v~fvvD~SGSM-~--~~rl~~aK~av~~Ll~--~~~~~~D~v~Li~F~~~~a~~~lp~t~~--------------  465 (589)
T TIGR02031       405 KSGRLLIFVVDASGSA-A--VARMSEAKGAVELLLG--EAYVHRDQVSLIAFRGTAAEVLLPPSRS--------------  465 (589)
T ss_pred             ccCceEEEEEECCCCC-C--hHHHHHHHHHHHHHHH--hhccCCCEEEEEEECCCCceEECCCCCC--------------
Confidence            3345678999998733 1  1135555555555543  2344667999999977767777775322              


Q ss_pred             HHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecC
Q 027721           91 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGS  153 (219)
Q Consensus        91 ~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S  153 (219)
                         .+.+.+.+..-.   .      ...+.|+.||.+|+-.+.+..+.   ..+++|+++|-+
T Consensus       466 ---~~~~~~~L~~l~---~------gGgTpL~~gL~~A~~~~~~~~~~---~~~~~ivllTDG  513 (589)
T TIGR02031       466 ---VEQAKRRLDVLP---G------GGGTPLAAGLAAAFQTALQARSS---GGTPTIVLITDG  513 (589)
T ss_pred             ---HHHHHHHHhcCC---C------CCCCcHHHHHHHHHHHHHHhccc---CCceEEEEECCC
Confidence               222333333211   1      13578999999999887765332   456788888843


No 35 
>cd01464 vWA_subfamily VWA subfamily: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if
Probab=80.75  E-value=28  Score=28.00  Aligned_cols=144  Identities=16%  Similarity=0.151  Sum_probs=77.9

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHh---cCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILT---LNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCAT   92 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~---~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~   92 (219)
                      ++++||++.---+.   .+..+-.++-.|+.....   ....-++.||..+. +++.+.|-......             
T Consensus         6 v~~llD~SgSM~~~---~~~~~k~a~~~~~~~l~~~~~~~~~~~v~ii~F~~-~a~~~~~l~~~~~~-------------   68 (176)
T cd01464           6 IYLLLDTSGSMAGE---PIEALNQGLQMLQSELRQDPYALESVEISVITFDS-AARVIVPLTPLESF-------------   68 (176)
T ss_pred             EEEEEECCCCCCCh---HHHHHHHHHHHHHHHHhcChhhccccEEEEEEecC-CceEecCCccHHhc-------------
Confidence            57899997643222   244444444444443221   12355899998866 46777774321100             


Q ss_pred             HHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhcc----CCCCCCceEEEEecCCCCchhhhHHHHHHH
Q 027721           93 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS----GLLHPQPRILCLQGSPDGPEQYVAIMNAIF  168 (219)
Q Consensus        93 v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~----~~~~~~~RILvis~S~d~~~QYi~lMN~iF  168 (219)
                         ...       ....      ...+.+..||-.|+-.+.+....    .....+++|+++|-+.+....- ...-.+=
T Consensus        69 ---~~~-------~l~~------~GgT~l~~aL~~a~~~l~~~~~~~~~~~~~~~~~~iillTDG~~~~~~~-~~~~~~~  131 (176)
T cd01464          69 ---QPP-------RLTA------SGGTSMGAALELALDCIDRRVQRYRADQKGDWRPWVFLLTDGEPTDDLT-AAIERIK  131 (176)
T ss_pred             ---CCC-------cccC------CCCCcHHHHHHHHHHHHHHHHHHhcccCcCCcCcEEEEEcCCCCCchHH-HHHHHHH
Confidence               000       0000      12466888888888777553211    1123456788888655432211 1123444


Q ss_pred             HhhhCCceeeEeeeCC-CCchhhhhh
Q 027721          169 SAQRSMVPIDSCYLGA-QNSAFLQQC  193 (219)
Q Consensus       169 aAqK~~V~IDvc~L~~-~~s~fLQQA  193 (219)
                      .+++.++.|-++-+|. -+..+|++.
T Consensus       132 ~~~~~~~~i~~igiG~~~~~~~L~~i  157 (176)
T cd01464         132 EARDSKGRIVACAVGPKADLDTLKQI  157 (176)
T ss_pred             hhcccCCcEEEEEeccccCHHHHHHH
Confidence            5566689999999984 356677766


No 36 
>cd01456 vWA_ywmD_type VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if 
Probab=78.72  E-value=36  Score=28.13  Aligned_cols=160  Identities=12%  Similarity=0.075  Sum_probs=85.9

Q ss_pred             CCCCCCCeEEEEEeCCcccccc---CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCc-----ceEEecCCCCCCC
Q 027721            8 LYSDDVSLVVVLLDTNPFFWSS---SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS-----CDYVYDSSSTGNQ   79 (219)
Q Consensus         8 ~~~~~~slLviIlD~np~~W~~---~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~-----~~~LYp~~~~~~~   79 (219)
                      .....+.-+++|||++.-=...   ..--+..+-..+..|++.   +...+++.++..+...     .+.+.|.......
T Consensus        15 ~~~~~~~~vv~vlD~SgSM~~~~~~~~~rl~~ak~a~~~~l~~---l~~~~~v~lv~F~~~~~~~~~~~~~~p~~~~~~~   91 (206)
T cd01456          15 TEPQLPPNVAIVLDNSGSMREVDGGGETRLDNAKAALDETANA---LPDGTRLGLWTFSGDGDNPLDVRVLVPKGCLTAP   91 (206)
T ss_pred             cccCCCCcEEEEEeCCCCCcCCCCCcchHHHHHHHHHHHHHHh---CCCCceEEEEEecCCCCCCccccccccccccccc
Confidence            4455677899999997764421   123466666666666664   5678899999976531     2233321100000


Q ss_pred             CCCCCCchhHHHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchh
Q 027721           80 SVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQ  159 (219)
Q Consensus        80 ~~~~~~~~~~~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~Q  159 (219)
                      ..+..  ..-    .+.+.+.+..-..  .      ...+.|..||..|+-++.       ..-..+|++||-+.+...+
T Consensus        92 ~~~~~--~~~----~~~l~~~i~~i~~--~------~G~T~l~~aL~~a~~~l~-------~~~~~~iillTDG~~~~~~  150 (206)
T cd01456          92 VNGFP--SAQ----RSALDAALNSLQT--P------TGWTPLAAALAEAAAYVD-------PGRVNVVVLITDGEDTCGP  150 (206)
T ss_pred             cCCCC--ccc----HHHHHHHHHhhcC--C------CCcChHHHHHHHHHHHhC-------CCCcceEEEEcCCCccCCC
Confidence            00000  001    2333333333210  1      135678899988887764       1112688888866554322


Q ss_pred             -hhHHHHHHHHhhh----CCceeeEeeeCCC-Cchhhhhh
Q 027721          160 -YVAIMNAIFSAQR----SMVPIDSCYLGAQ-NSAFLQQC  193 (219)
Q Consensus       160 -Yi~lMN~iFaAqK----~~V~IDvc~L~~~-~s~fLQQA  193 (219)
                       ......  --+++    .+|+|.++.+|.+ +..+|++.
T Consensus       151 ~~~~~~~--~~~~~~~~~~~i~i~~igiG~~~~~~~l~~i  188 (206)
T cd01456         151 DPCEVAR--ELAKRRTPAPPIKVNVIDFGGDADRAELEAI  188 (206)
T ss_pred             CHHHHHH--HHHHhcCCCCCceEEEEEecCcccHHHHHHH
Confidence             222222  22333    5999999999854 35566665


No 37 
>PF00092 VWA:  von Willebrand factor type A domain;  InterPro: IPR002035 The von Willebrand factor is a large multimeric glycoprotein found in blood plasma. Mutant forms are involved in the aetiology of bleeding disorders []. In von Willebrand factor, the type A domain (vWF) is the prototype for a protein superfamily. The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. Fold recognition algorithms were used to score sequence compatibility with a library of known structures: the vWF domain fold was predicted to be a doubly-wound, open, twisted beta-sheet flanked by alpha-helices []. 3D structures have been determined for the I-domains of integrins CD11b (with bound magnesium) [] and CD11a (with bound manganese) []. The domain adopts a classic alpha/beta Rossmann fold and contains an unusual metal ion coordination site at its surface. It has been suggested that this site represents a general metal ion-dependent adhesion site (MIDAS) for binding protein ligands []. The residues constituting the MIDAS motif in the CD11b and CD11a I-domains are completely conserved, but the manner in which the metal ion is coordinated differs slightly [].; GO: 0005515 protein binding; PDB: 2XGG_B 3ZQK_B 3GXB_A 3PPV_A 3PPX_A 3PPW_A 3PPY_A 1CQP_B 3TCX_B 2ICA_A ....
Probab=78.39  E-value=30  Score=26.97  Aligned_cols=148  Identities=11%  Similarity=0.099  Sum_probs=77.0

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ   95 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~   95 (219)
                      ++++||.+..--..   .+.++.+.+..|++.+......-+++|+..+... +.+++-.....               .+
T Consensus         2 ivflvD~S~sm~~~---~~~~~~~~v~~~i~~~~~~~~~~rv~iv~f~~~~-~~~~~~~~~~~---------------~~   62 (178)
T PF00092_consen    2 IVFLVDTSGSMSGD---NFEKAKQFVKSIISRLSISNNGTRVGIVTFSDSA-RVLFSLTDYQS---------------KN   62 (178)
T ss_dssp             EEEEEE-STTSCHH---HHHHHHHHHHHHHHHSTBSTTSEEEEEEEESSSE-EEEEETTSHSS---------------HH
T ss_pred             EEEEEeCCCCCchH---HHHHHHHHHHHHHHhhhccccccccceeeeeccc-ccccccccccc---------------cc
Confidence            58899986543222   3666677777777755577888899998886663 45555332210               22


Q ss_pred             HHHHHH-HhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCC
Q 027721           96 NLEEFM-NKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM  174 (219)
Q Consensus        96 ~l~~li-~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~  174 (219)
                      ++.+.+ ..... .       ...+.++.||-.|..++....+...+..+.-|++++-+..........   .-...+..
T Consensus        63 ~~~~~i~~~~~~-~-------~g~t~~~~aL~~a~~~l~~~~~~~r~~~~~~iiliTDG~~~~~~~~~~---~~~~~~~~  131 (178)
T PF00092_consen   63 DLLNAINDSIPS-S-------GGGTNLGAALKFAREQLFSSNNGGRPNSPKVIILITDGNSNDSDSPSE---EAANLKKS  131 (178)
T ss_dssp             HHHHHHHTTGGC-C-------BSSB-HHHHHHHHHHHTTSGGGTTGTTSEEEEEEEESSSSSSHSGHHH---HHHHHHHH
T ss_pred             cccccccccccc-c-------chhhhHHHHHhhhhhcccccccccccccccceEEEEeecccCCcchHH---HHHHHHHh
Confidence            233322 21111 1       235668899999888877652222223333455577666555433221   11111111


Q ss_pred             ceeeEeeeCC--CCchhhhhh
Q 027721          175 VPIDSCYLGA--QNSAFLQQC  193 (219)
Q Consensus       175 V~IDvc~L~~--~~s~fLQQA  193 (219)
                      .-|-++.+|.  .+...|+..
T Consensus       132 ~~i~~~~ig~~~~~~~~l~~l  152 (178)
T PF00092_consen  132 NGIKVIAIGIDNADNEELREL  152 (178)
T ss_dssp             CTEEEEEEEESCCHHHHHHHH
T ss_pred             cCcEEEEEecCcCCHHHHHHH
Confidence            4555666653  344555554


No 38 
>cd01470 vWA_complement_factors Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.
Probab=76.98  E-value=39  Score=27.65  Aligned_cols=168  Identities=8%  Similarity=0.059  Sum_probs=86.0

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ   95 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~~   95 (219)
                      ++++||++--- .  .-.|..+-..+..|++........-++.||..+.. .+.++|-.....     .+        .+
T Consensus         3 i~~vlD~SgSM-~--~~~~~~~k~~~~~l~~~l~~~~~~~~v~li~Fs~~-~~~~~~~~~~~~-----~~--------~~   65 (198)
T cd01470           3 IYIALDASDSI-G--EEDFDEAKNAIKTLIEKISSYEVSPRYEIISYASD-PKEIVSIRDFNS-----ND--------AD   65 (198)
T ss_pred             EEEEEECCCCc-c--HHHHHHHHHHHHHHHHHccccCCCceEEEEEecCC-ceEEEecccCCC-----CC--------HH
Confidence            67899995432 2  12455566666666665433334568888887665 455666432110     00        12


Q ss_pred             HHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhcc---CCCCCCceEEEEecCCCC----chhhhHHHHHHH
Q 027721           96 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS---GLLHPQPRILCLQGSPDG----PEQYVAIMNAIF  168 (219)
Q Consensus        96 ~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~---~~~~~~~RILvis~S~d~----~~QYi~lMN~iF  168 (219)
                      .+.+.+..-......    ....+.+..||.+|+-.+.+....   .....+.+|+++|-+.+.    +.+....+--..
T Consensus        66 ~~~~~l~~~~~~~~~----~~ggT~~~~Al~~~~~~l~~~~~~~~~~~~~~~~~iillTDG~~~~g~~~~~~~~~~~~~~  141 (198)
T cd01470          66 DVIKRLEDFNYDDHG----DKTGTNTAAALKKVYERMALEKVRNKEAFNETRHVIILFTDGKSNMGGSPLPTVDKIKNLV  141 (198)
T ss_pred             HHHHHHHhCCccccc----CccchhHHHHHHHHHHHHHHHHhcCccchhhcceEEEEEcCCCcCCCCChhHHHHHHHHHH
Confidence            233333321110000    012456788887776555332111   112356678888865432    222222222111


Q ss_pred             -------HhhhCCceeeEeeeCCC-Cchhhhhh----------ccccchhHHHH
Q 027721          169 -------SAQRSMVPIDSCYLGAQ-NSAFLQQC----------MSDRLLTLLVE  204 (219)
Q Consensus       169 -------aAqK~~V~IDvc~L~~~-~s~fLQQA----------~~~~~~~gLlQ  204 (219)
                             .+++.+|.|-++-+|.. +...|++.          +.+.+++.|-+
T Consensus       142 ~~~~~~~~~~~~~v~i~~iGvG~~~~~~~L~~iA~~~~g~~~~f~~~~~~~l~~  195 (198)
T cd01470         142 YKNNKSDNPREDYLDVYVFGVGDDVNKEELNDLASKKDNERHFFKLKDYEDLQE  195 (198)
T ss_pred             hcccccccchhcceeEEEEecCcccCHHHHHHHhcCCCCCceEEEeCCHHHHHH
Confidence                   22556899999999854 66788776          44566665543


No 39 
>COG1240 ChlD Mg-chelatase subunit ChlD [Coenzyme metabolism]
Probab=76.71  E-value=19  Score=32.21  Aligned_cols=111  Identities=16%  Similarity=0.186  Sum_probs=71.3

Q ss_pred             CCCCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHH
Q 027721           11 DDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLC   90 (219)
Q Consensus        11 ~~~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~   90 (219)
                      -...|.++|+|++--=-..+  -+..+=-.++.|+  |=+-.+.++|+||+...++++++.|...+              
T Consensus        76 r~g~lvvfvVDASgSM~~~~--Rm~aaKG~~~~lL--~dAYq~RdkvavI~F~G~~A~lll~pT~s--------------  137 (261)
T COG1240          76 RAGNLIVFVVDASGSMAARR--RMAAAKGAALSLL--RDAYQRRDKVAVIAFRGEKAELLLPPTSS--------------  137 (261)
T ss_pred             CcCCcEEEEEeCcccchhHH--HHHHHHHHHHHHH--HHHHHccceEEEEEecCCcceEEeCCccc--------------
Confidence            34568889999964322111  1222222333333  23557899999999999999999997654              


Q ss_pred             HHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEec
Q 027721           91 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG  152 (219)
Q Consensus        91 ~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~  152 (219)
                         ++.+++.+.+-..         -..++|+-||-+|.--+-|..+.. +...+.++|||-
T Consensus       138 ---v~~~~~~L~~l~~---------GG~TPL~~aL~~a~ev~~r~~r~~-p~~~~~~vviTD  186 (261)
T COG1240         138 ---VELAERALERLPT---------GGKTPLADALRQAYEVLAREKRRG-PDRRPVMVVITD  186 (261)
T ss_pred             ---HHHHHHHHHhCCC---------CCCCchHHHHHHHHHHHHHhhccC-CCcceEEEEEeC
Confidence               3344444443211         136789999999999998877642 356788888883


No 40 
>TIGR03788 marine_srt_targ marine proteobacterial sortase target protein. Members of this protein family are restricted to the Proteobacteria. Each contains a C-terminal sortase-recognition motif, transmembrane domain, and basic residues cluster at the the C-terminus, and is encoded adjacent to a sortase gene. This protein is frequently the only sortase target in its genome, which is as unusual its occurrence in Gram-negative rather than Gram-positive genomes. Many bacteria with this system are marine. In addition to the LPXTG signal, members carry a vault protein inter-alpha-trypsin inhibitor domain (pfam08487) and a von Willebrand factor type A domain (pfam00092).
Probab=66.26  E-value=1.4e+02  Score=29.34  Aligned_cols=148  Identities=17%  Similarity=0.172  Sum_probs=83.7

Q ss_pred             CCCCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHH
Q 027721           11 DDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLC   90 (219)
Q Consensus        11 ~~~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~   90 (219)
                      ..|.-+++|||++----+.   .+..+-.++..+++   .+...-++.||+.+.. .+.++|......        ..  
T Consensus       269 ~~p~~vvfvlD~SgSM~g~---~i~~ak~al~~~l~---~L~~~d~~~ii~F~~~-~~~~~~~~~~~~--------~~--  331 (596)
T TIGR03788       269 VLPRELVFVIDTSGSMAGE---SIEQAKSALLLALD---QLRPGDRFNIIQFDSD-VTLLFPVPVPAT--------AH--  331 (596)
T ss_pred             CCCceEEEEEECCCCCCCc---cHHHHHHHHHHHHH---hCCCCCEEEEEEECCc-ceEeccccccCC--------HH--
Confidence            3455688999997654433   35555554444443   4566779999997555 567787543211        00  


Q ss_pred             HHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHh
Q 027721           91 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSA  170 (219)
Q Consensus        91 ~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaA  170 (219)
                        =++++.+++..-...         ..+.+..||-.|+-..    .........+|+++|-+.....+  .++..+ .+
T Consensus       332 --~~~~a~~~i~~l~a~---------GgT~l~~aL~~a~~~~----~~~~~~~~~~iillTDG~~~~~~--~~~~~~-~~  393 (596)
T TIGR03788       332 --NLARARQFVAGLQAD---------GGTEMAGALSAALRDD----GPESSGALRQVVFLTDGAVGNED--ALFQLI-RT  393 (596)
T ss_pred             --HHHHHHHHHhhCCCC---------CCccHHHHHHHHHHhh----cccCCCceeEEEEEeCCCCCCHH--HHHHHH-HH
Confidence              133444444432111         2456888888877431    11112334568888876543221  223322 23


Q ss_pred             hhCCceeeEeeeCCC-Cchhhhhh
Q 027721          171 QRSMVPIDSCYLGAQ-NSAFLQQC  193 (219)
Q Consensus       171 qK~~V~IDvc~L~~~-~s~fLQQA  193 (219)
                      +..+++|.++-+|.. +..+|++.
T Consensus       394 ~~~~~ri~tvGiG~~~n~~lL~~l  417 (596)
T TIGR03788       394 KLGDSRLFTVGIGSAPNSYFMRKA  417 (596)
T ss_pred             hcCCceEEEEEeCCCcCHHHHHHH
Confidence            345789999999864 67889886


No 41 
>TIGR02442 Cob-chelat-sub cobaltochelatase subunit. A number of genomes (actinobacteria, cyanobacteria, betaproteobacteria and pseudomonads) which apparently biosynthesize B12, encode a cobN gene but are demonstrably lacking cobS and cobT. These genomes do, however contain a homolog (modelled here) of the magnesium chelatase subunits BchI/BchD family. Aside from the cyanobacteria (which have a separate magnesium chelatase trimer), these species do not make chlorins, so do not have any use for a magnesium chelatase. Furthermore, in nearly all cases the members of this family are proximal to either CobN itself or other genes involved in cobalt transport or B12 biosynthesis.
Probab=62.50  E-value=1.7e+02  Score=29.11  Aligned_cols=112  Identities=12%  Similarity=0.095  Sum_probs=67.6

Q ss_pred             CCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHH
Q 027721           13 VSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCAT   92 (219)
Q Consensus        13 ~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~   92 (219)
                      ..+++++||++.--...  --+..+-..+..|+.  .+.....+|.||+.......++.|...+                
T Consensus       465 ~~~vv~vvD~SgSM~~~--~rl~~ak~a~~~ll~--~a~~~~D~v~lI~F~g~~a~~~~p~t~~----------------  524 (633)
T TIGR02442       465 GNLVIFVVDASGSMAAR--GRMAAAKGAVLSLLR--DAYQKRDKVALITFRGEEAEVLLPPTSS----------------  524 (633)
T ss_pred             CceEEEEEECCccCCCc--cHHHHHHHHHHHHHH--HhhcCCCEEEEEEECCCCceEEcCCCCC----------------
Confidence            45888999998754432  124444445555544  2346689999999987667778775422                


Q ss_pred             HHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCC
Q 027721           93 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPD  155 (219)
Q Consensus        93 v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d  155 (219)
                       .+.+.+.+..- .  .      ...+.|+.||..|.-.+.+..+. ....++.|++||-..+
T Consensus       525 -~~~~~~~L~~l-~--~------gG~Tpl~~aL~~A~~~l~~~~~~-~~~~~~~vvliTDG~~  576 (633)
T TIGR02442       525 -VELAARRLEEL-P--T------GGRTPLAAGLLKAAEVLSNELLR-DDDGRPLLVVITDGRA  576 (633)
T ss_pred             -HHHHHHHHHhC-C--C------CCCCCHHHHHHHHHHHHHHhhcc-CCCCceEEEEECCCCC
Confidence             22222333321 1  1      13578999999999887754322 1245677888885443


No 42 
>KOG2884 consensus 26S proteasome regulatory complex, subunit RPN10/PSMD4 [Posttranslational modification, protein turnover, chaperones]
Probab=58.57  E-value=1.3e+02  Score=26.54  Aligned_cols=140  Identities=14%  Similarity=0.173  Sum_probs=89.5

Q ss_pred             EEEEEeCCccccccC--cccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHH
Q 027721           16 VVVLLDTNPFFWSSS--SLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATL   93 (219)
Q Consensus        16 LviIlD~np~~W~~~--~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v   93 (219)
                      -+|.+|-.-+...-+  +.-|..=-++|=..+++-+--|..|.|-+|.-...++..|-... .+                
T Consensus         6 tmi~iDNse~mrNgDy~PtRf~aQ~daVn~v~~~K~~snpEntvGiitla~a~~~vLsT~T-~d----------------   68 (259)
T KOG2884|consen    6 TMICIDNSEYMRNGDYLPTRFQAQKDAVNLVCQAKLRSNPENTVGIITLANASVQVLSTLT-SD----------------   68 (259)
T ss_pred             EEEEEeChHHhhcCCCChHHHHHHHHHHHHHHHhhhcCCcccceeeEeccCCCceeeeecc-cc----------------
Confidence            356677543222222  33355555667677889999999999999987665565443211 11                


Q ss_pred             HHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhC
Q 027721           94 LQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRS  173 (219)
Q Consensus        94 ~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~  173 (219)
                      .-+|...+-   .....      ...-+..+|..|.--.+.+..   ..-+-||+++-+||-.-++- .+.-.+.-=+|.
T Consensus        69 ~gkils~lh---~i~~~------g~~~~~~~i~iA~lalkhRqn---k~~~~riVvFvGSpi~e~ek-eLv~~akrlkk~  135 (259)
T KOG2884|consen   69 RGKILSKLH---GIQPH------GKANFMTGIQIAQLALKHRQN---KNQKQRIVVFVGSPIEESEK-ELVKLAKRLKKN  135 (259)
T ss_pred             chHHHHHhc---CCCcC------CcccHHHHHHHHHHHHHhhcC---CCcceEEEEEecCcchhhHH-HHHHHHHHHHhc
Confidence            111211111   11111      245678888888776665532   34578999999999766555 777888888999


Q ss_pred             CceeeEeeeCCC
Q 027721          174 MVPIDSCYLGAQ  185 (219)
Q Consensus       174 ~V~IDvc~L~~~  185 (219)
                      +|.||+...|+.
T Consensus       136 ~Vaidii~FGE~  147 (259)
T KOG2884|consen  136 KVAIDIINFGEA  147 (259)
T ss_pred             CeeEEEEEeccc
Confidence            999999999853


No 43 
>cd01473 vWA_CTRP CTRP for  CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60  amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.
Probab=56.04  E-value=1.2e+02  Score=25.17  Aligned_cols=154  Identities=11%  Similarity=0.106  Sum_probs=73.6

Q ss_pred             EEEEEeCCccccccCcccHHH-HHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHHHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQ-FLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLL   94 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~-~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~v~   94 (219)
                      ++++||.+---   +.-.+.+ +.+-+-.+++....-...-+|.|+..+.. .+..+|-.....     .+    ...+.
T Consensus         3 i~fllD~S~Si---~~~~f~~~~~~f~~~lv~~l~i~~~~~rvgvv~fs~~-~~~~~~~~~~~~-----~~----~~~l~   69 (192)
T cd01473           3 LTLILDESASI---GYSNWRKDVIPFTEKIINNLNISKDKVHVGILLFAEK-NRDVVPFSDEER-----YD----KNELL   69 (192)
T ss_pred             EEEEEeCCCcc---cHHHHHHHHHHHHHHHHHhCccCCCccEEEEEEecCC-ceeEEecCcccc-----cC----HHHHH
Confidence            68889984332   1112321 22333333444434456778999887654 344545322110     00    11234


Q ss_pred             HHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEE-EecCCCCchhhhHHHHHHHHhhhC
Q 027721           95 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILC-LQGSPDGPEQYVAIMNAIFSAQRS  173 (219)
Q Consensus        95 ~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILv-is~S~d~~~QYi~lMN~iFaAqK~  173 (219)
                      +.+.++...-.  ..       ..+.+..||.+|+-++-+.  ...+.-.+||+| ++-+.+....--.+.-..=.++++
T Consensus        70 ~~i~~l~~~~~--~~-------g~T~~~~AL~~a~~~~~~~--~~~r~~~~kv~IllTDG~s~~~~~~~~~~~a~~lk~~  138 (192)
T cd01473          70 KKINDLKNSYR--SG-------GETYIVEALKYGLKNYTKH--GNRRKDAPKVTMLFTDGNDTSASKKELQDISLLYKEE  138 (192)
T ss_pred             HHHHHHHhccC--CC-------CcCcHHHHHHHHHHHhccC--CCCcccCCeEEEEEecCCCCCcchhhHHHHHHHHHHC
Confidence            44444332110  01       2455677777765444221  111122255555 555544321111223344467889


Q ss_pred             CceeeEeeeCCCCchhhhhh
Q 027721          174 MVPIDSCYLGAQNSAFLQQC  193 (219)
Q Consensus       174 ~V~IDvc~L~~~~s~fLQQA  193 (219)
                      +|.|-+.-+|..+..-|+..
T Consensus       139 gV~i~~vGiG~~~~~el~~i  158 (192)
T cd01473         139 NVKLLVVGVGAASENKLKLL  158 (192)
T ss_pred             CCEEEEEEeccccHHHHHHh
Confidence            99999999985544444444


No 44 
>PF04811 Sec23_trunk:  Sec23/Sec24 trunk domain;  InterPro: IPR006896 COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation [].  Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger (IPR006895 from INTERPRO), an alpha/beta trunk domain, an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes the Sec23/24 alpha/beta trunk domain, which is formed from a single, approximately 250-residue segment plugged into the beta-barrel between strands beta-1 and beta-19. The trunk has an alpha/beta fold with a vWA topology, and it forms the dimer interface, primarily involving strand beta-14 on Sec23 and Sec24; in addition, the trunk domain of Sec23 contacts Sar1.; GO: 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EGD_A 2NUP_A 3EG9_A 3EFO_A 3EGX_A 2NUT_A 1PD0_A 1PD1_A 1M2V_B 1PCX_A ....
Probab=55.38  E-value=1.3e+02  Score=25.59  Aligned_cols=154  Identities=17%  Similarity=0.203  Sum_probs=81.3

Q ss_pred             CCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCC--------CC--
Q 027721           13 VSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQS--------VG--   82 (219)
Q Consensus        13 ~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~--------~~--   82 (219)
                      |-..+.+||++..+-..  .-+..+.++|...++. +-....-+|++|..+.  +-.+|.-.......        ++  
T Consensus         3 pp~y~FvID~s~~av~~--g~~~~~~~sl~~~l~~-l~~~~~~~vgiitfd~--~V~~y~l~~~~~~~~~~v~~dl~~~~   77 (243)
T PF04811_consen    3 PPVYVFVIDVSYEAVQS--GLLQSLIESLKSALDS-LPGDERTRVGIITFDS--SVHFYNLSSSLSQPQMIVVSDLDDPF   77 (243)
T ss_dssp             --EEEEEEE-SHHHHHH--THHHHHHHHHHHHGCT-SSTSTT-EEEEEEESS--SEEEEETTTTSSSTEEEEEHHTTSHH
T ss_pred             CCEEEEEEECchhhhhc--cHHHHHHHHHHHHHHh-ccCCCCcEEEEEEeCC--EEEEEECCCCcCCCcccchHHHhhcc
Confidence            44678999998664333  2466667777666622 2234777888888843  35567543322100        00  


Q ss_pred             ---CC-------CchhHHHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEec
Q 027721           83 ---NG-------RMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG  152 (219)
Q Consensus        83 ---~~-------~~~~~~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~  152 (219)
                         ..       +.......++++|.+...+..        +.....-+..||..|.-.+...      +..+||+++..
T Consensus        78 ~p~~~~llv~~~e~~~~i~~ll~~L~~~~~~~~--------~~~~~~c~G~Al~~A~~ll~~~------~~gGkI~~F~s  143 (243)
T PF04811_consen   78 IPLPDGLLVPLSECRDAIEELLESLPSIFPETA--------GKRPERCLGSALSAALSLLSSR------NTGGKILVFTS  143 (243)
T ss_dssp             SSTSSSSSEETTTCHHHHHHHHHHHHHHSTT-T--------TB-----HHHHHHHHHHHHHHH------TS-EEEEEEES
T ss_pred             cCCcccEEEEhHHhHHHHHHHHHHhhhhccccc--------ccCccccHHHHHHHHHHHHhcc------ccCCEEEEEec
Confidence               00       001112223333333222110        0134567889999999998865      25689999876


Q ss_pred             CCC--Cc-------------------hhhh-----HHHHHHHHhhhCCceeeEeeeCCC
Q 027721          153 SPD--GP-------------------EQYV-----AIMNAIFSAQRSMVPIDSCYLGAQ  185 (219)
Q Consensus       153 S~d--~~-------------------~QYi-----~lMN~iFaAqK~~V~IDvc~L~~~  185 (219)
                      ++-  .+                   ..|.     -+-+....+.+++|.||++..+.+
T Consensus       144 ~~pt~G~Gg~l~~~~~~~~~~~~~~~~~~~~~~~~fY~~la~~~~~~~isvDlf~~~~~  202 (243)
T PF04811_consen  144 GPPTYGPGGSLKKREDSSHYDTEKEKALLLPPANEFYKKLAEECSKQGISVDLFVFSSD  202 (243)
T ss_dssp             S---SSSTTSS-SBTTSCCCCHCTTHHCHSHSSSHHHHHHHHHHHHCTEEEEEEEECSS
T ss_pred             cCCCCCCCceecccccccccccccchhhhccccchHHHHHHHHHHhcCCEEEEEeecCC
Confidence            541  00                   1111     356788899999999999999643


No 45 
>KOG2807 consensus RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription; Replication, recombination and repair]
Probab=54.24  E-value=1.9e+02  Score=27.01  Aligned_cols=168  Identities=13%  Similarity=0.097  Sum_probs=106.8

Q ss_pred             eEEEEEeCCcccccc--CcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCchhHHHH
Q 027721           15 LVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCAT   92 (219)
Q Consensus        15 lLviIlD~np~~W~~--~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~~~~~~~   92 (219)
                      .|+||||.+-.-=..  .+--+..+++.+-.|+--|.-+|.--|+-+|..-.+.+.-+=.-..+                
T Consensus        62 hl~iviD~S~am~e~Df~P~r~a~~~K~le~Fv~eFFdQNPiSQigii~~k~g~A~~lt~ltgn----------------  125 (378)
T KOG2807|consen   62 HLYIVIDCSRAMEEKDFRPSRFANVIKYLEGFVPEFFDQNPISQIGIISIKDGKADRLTDLTGN----------------  125 (378)
T ss_pred             eEEEEEEhhhhhhhccCCchHHHHHHHHHHHHHHHHhccCchhheeEEEEecchhhHHHHhcCC----------------
Confidence            589999986543333  34457888999999999999999999999999866655432221111                


Q ss_pred             HHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecC--CCCchhhhHHHHHHHHh
Q 027721           93 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGS--PDGPEQYVAIMNAIFSA  170 (219)
Q Consensus        93 v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S--~d~~~QYi~lMN~iFaA  170 (219)
                      ....+..|......         .....|--||.+|--..+.+    +.+....+|||=+|  .-.+..|   |-.|=+-
T Consensus       126 p~~hI~aL~~~~~~---------~g~fSLqNaLe~a~~~Lk~~----p~H~sREVLii~sslsT~DPgdi---~~tI~~l  189 (378)
T KOG2807|consen  126 PRIHIHALKGLTEC---------SGDFSLQNALELAREVLKHM----PGHVSREVLIIFSSLSTCDPGDI---YETIDKL  189 (378)
T ss_pred             HHHHHHHHhccccc---------CCChHHHHHHHHHHHHhcCC----CcccceEEEEEEeeecccCcccH---HHHHHHH
Confidence            12233333332211         12445777888887777665    24566677776543  3345544   7888899


Q ss_pred             hhCCceeeEeeeCCCCch--hhhhh----ccccchhHHHHHHHHhcCCCC
Q 027721          171 QRSMVPIDSCYLGAQNSA--FLQQC----MSDRLLTLLVEYIISLNNWMG  214 (219)
Q Consensus       171 qK~~V~IDvc~L~~~~s~--fLQQA----~~~~~~~gLlQyLl~~~~~~~  214 (219)
                      .+.+|.+-+.-|.++=..  =|-+|    +++.--++=|+-||.--..-+
T Consensus       190 k~~kIRvsvIgLsaEv~icK~l~kaT~G~Y~V~lDe~HlkeLl~e~~~Pp  239 (378)
T KOG2807|consen  190 KAYKIRVSVIGLSAEVFICKELCKATGGRYSVALDEGHLKELLLEHTHPP  239 (378)
T ss_pred             HhhCeEEEEEeechhHHHHHHHHHhhCCeEEEEeCHHHHHHHHHhcCCCC
Confidence            999999999999643111  12233    677666777777776443333


No 46 
>cd01477 vWA_F09G8-8_type VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses  In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of mo
Probab=50.62  E-value=1.5e+02  Score=24.75  Aligned_cols=153  Identities=11%  Similarity=0.059  Sum_probs=76.2

Q ss_pred             CCeEEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhc------CCCCcEEEEEEcCCcceEEecCCCCCCCCCCCCCc
Q 027721           13 VSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTL------NQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRM   86 (219)
Q Consensus        13 ~slLviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~------~~~N~vaVia~~~~~~~~LYp~~~~~~~~~~~~~~   86 (219)
                      .-=++++||.+---=   +..|.++..-+..+++..--.      ...-+|.||..+.. .+..+|-..-.         
T Consensus        19 ~~DivfvlD~S~Sm~---~~~f~~~k~fi~~~~~~~~~~~~~~~~~~~~rVGlV~fs~~-a~~~~~L~d~~---------   85 (193)
T cd01477          19 WLDIVFVVDNSKGMT---QGGLWQVRATISSLFGSSSQIGTDYDDPRSTRVGLVTYNSN-ATVVADLNDLQ---------   85 (193)
T ss_pred             eeeEEEEEeCCCCcc---hhhHHHHHHHHHHHHhhccccccccCCCCCcEEEEEEccCc-eEEEEeccccc---------
Confidence            344788999854321   112444433333333333211      22468999888665 67788753211         


Q ss_pred             hhHHHHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEE-Ee-cCCCCc-hhhhHH
Q 027721           87 PSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILC-LQ-GSPDGP-EQYVAI  163 (219)
Q Consensus        87 ~~~~~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILv-is-~S~d~~-~QYi~l  163 (219)
                        -...+.+.+...+...   ..      ...+.+..||.+|+-....-.+ ..+.-.+|++| ++ +..+.. .+... 
T Consensus        86 --~~~~~~~ai~~~~~~~---~~------~ggT~ig~aL~~A~~~l~~~~~-~~R~~v~kvvIllTDg~~~~~~~~~~~-  152 (193)
T cd01477          86 --SFDDLYSQIQGSLTDV---SS------TNASYLDTGLQAAEQMLAAGKR-TSRENYKKVVIVFASDYNDEGSNDPRP-  152 (193)
T ss_pred             --CHHHHHHHHHHHhhcc---cc------CCcchHHHHHHHHHHHHHhhhc-cccCCCCeEEEEEecCccCCCCCCHHH-
Confidence              0111233333221111   00      0245678888888665542211 11233577544 54 322222 22222 


Q ss_pred             HHHHHHhhhCCceeeEeeeCCC-Cchhhhhh
Q 027721          164 MNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQC  193 (219)
Q Consensus       164 MN~iFaAqK~~V~IDvc~L~~~-~s~fLQQA  193 (219)
                        ..=.+++.+|.|-+.-+|.. |...++|.
T Consensus       153 --~a~~l~~~GI~i~tVGiG~~~d~~~~~~L  181 (193)
T cd01477         153 --IAARLKSTGIAIITVAFTQDESSNLLDKL  181 (193)
T ss_pred             --HHHHHHHCCCEEEEEEeCCCCCHHHHHHH
Confidence              23347889999999999854 33556664


No 47 
>PTZ00441 sporozoite surface protein 2 (SSP2); Provisional
Probab=47.17  E-value=3.1e+02  Score=27.41  Aligned_cols=72  Identities=8%  Similarity=0.050  Sum_probs=42.5

Q ss_pred             ccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCceeeEeeeCCC-Cchhhhhh
Q 027721          118 CSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQC  193 (219)
Q Consensus       118 ~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~-~s~fLQQA  193 (219)
                      .+.+..||..|.-++.+..  .......-|+|+|-..+... + ..+..+-..++.+|.|=++-+|.. +..+|++.
T Consensus       126 gTnig~AL~~Aae~L~sr~--~R~nvpKVVILLTDG~sns~-~-dvleaAq~LR~~GVeI~vIGVG~g~n~e~LrlI  198 (576)
T PTZ00441        126 KTNMTDALLEVRKHLNDRV--NRENAIQLVILMTDGIPNSK-Y-RALEESRKLKDRNVKLAVIGIGQGINHQFNRLL  198 (576)
T ss_pred             CccHHHHHHHHHHHHhhcc--cccCCceEEEEEecCCCCCc-c-cHHHHHHHHHHCCCEEEEEEeCCCcCHHHHHHH
Confidence            4668888888776665431  11233344555665544221 1 233455567788999999999753 45577643


No 48 
>cd01454 vWA_norD_type norD type: Denitrifying bacteria contain both membrane bound and periplasmic nitrate reductases. Denitrification plays a major role  in completing the nitrogen cycle by converting nitrate or nitrite to nitrogen gas. The pathway for microbial denitrification has been established as NO3-  ------ NO2- ------ NO ------- N2O --------- N2. This reaction generally occurs under oxygen limiting conditions. Genetic and biochemical studies have shown that the first srep of the biochemical pathway is catalyzed by periplasmic nitrate reductases. This family is widely present in proteobacteria and firmicutes. This version of the domain is also present in some archaeal members. The function of the vWA domain in this sub-group is not known. Members of this subgroup have a conserved MIDAS motif.
Probab=43.69  E-value=1.6e+02  Score=23.30  Aligned_cols=138  Identities=9%  Similarity=0.045  Sum_probs=73.5

Q ss_pred             EEEEEeCCccccccCcccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCc-----ceEEecCCCCCCCCCCCCCchhHH
Q 027721           16 VVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS-----CDYVYDSSSTGNQSVGNGRMPSLC   90 (219)
Q Consensus        16 LviIlD~np~~W~~~~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~-----~~~LYp~~~~~~~~~~~~~~~~~~   90 (219)
                      ++++||++.--...  ..++.+-..+..|+.+...  ...++.|++-....     ..++-+.+-..             
T Consensus         3 v~~llD~SgSM~~~--~kl~~ak~a~~~l~~~l~~--~~d~~~l~~F~~~~~~~~~~~~~~~~~~~~-------------   65 (174)
T cd01454           3 VTLLLDLSGSMRSD--RRIDVAKKAAVLLAEALEA--CGVPHAILGFTTDAGGRERVRWIKIKDFDE-------------   65 (174)
T ss_pred             EEEEEECCCCCCCC--cHHHHHHHHHHHHHHHHHH--cCCcEEEEEecCCCCCccceEEEEecCccc-------------
Confidence            57889997753321  2355555555555444333  56788888875541     23333211110             


Q ss_pred             HHHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCc-hhh----hHHHH
Q 027721           91 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP-EQY----VAIMN  165 (219)
Q Consensus        91 ~~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~-~QY----i~lMN  165 (219)
                       ....++.+.+..-..         ...+.+..||-.|+-.+.+..     .-+..|++||-..+.. ..|    ..+-.
T Consensus        66 -~~~~~~~~~l~~~~~---------~g~T~~~~al~~a~~~l~~~~-----~~~~~iiliTDG~~~~~~~~~~~~~~~~~  130 (174)
T cd01454          66 -SLHERARKRLAALSP---------GGNTRDGAAIRHAAERLLARP-----EKRKILLVISDGEPNDLDYYEGNVFATED  130 (174)
T ss_pred             -ccchhHHHHHHccCC---------CCCCcHHHHHHHHHHHHhcCC-----CcCcEEEEEeCCCcCcccccCcchhHHHH
Confidence             001122222221101         124668888888877776531     2234466677544432 222    13334


Q ss_pred             H---HHHhhhCCceeeEeeeCCC
Q 027721          166 A---IFSAQRSMVPIDSCYLGAQ  185 (219)
Q Consensus       166 ~---iFaAqK~~V~IDvc~L~~~  185 (219)
                      +   +..|++.+|.+-+.-+|..
T Consensus       131 ~~~~~~~~~~~gi~v~~igig~~  153 (174)
T cd01454         131 ALRAVIEARKLGIEVFGITIDRD  153 (174)
T ss_pred             HHHHHHHHHhCCcEEEEEEecCc
Confidence            4   7888999999999999743


No 49 
>COG5148 RPN10 26S proteasome regulatory complex, subunit RPN10/PSMD4 [Posttranslational modification, protein turnover, chaperones]
Probab=34.59  E-value=3.1e+02  Score=23.79  Aligned_cols=139  Identities=15%  Similarity=0.150  Sum_probs=85.2

Q ss_pred             eEEEEEeCCccccccC--cccHHHHHHHHHHHHHHHHhcCCCCcEEEEEEcCCcceEEe-cCCCCCCCCCCCCCchhHHH
Q 027721           15 LVVVLLDTNPFFWSSS--SLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY-DSSSTGNQSVGNGRMPSLCA   91 (219)
Q Consensus        15 lLviIlD~np~~W~~~--~~~l~~~l~~llvFlNahL~~~~~N~vaVia~~~~~~~~LY-p~~~~~~~~~~~~~~~~~~~   91 (219)
                      .-||+||-.-.....+  +.-|..-.+++-...+.-.--+..|.+-+|.-.......|- |+..                
T Consensus         5 atvvliDNse~s~NgDy~ptRFeAQkd~ve~if~~K~ndnpEntiGli~~~~a~p~vlsT~T~~----------------   68 (243)
T COG5148           5 ATVVLIDNSEASQNGDYLPTRFEAQKDAVESIFSKKFNDNPENTIGLIPLVQAQPNVLSTPTKQ----------------   68 (243)
T ss_pred             eEEEEEeChhhhhcCCCCcHHHHHHHHHHHHHHHHHhcCCccceeeeeecccCCcchhccchhh----------------
Confidence            3577777643322222  33466666777777888888899999988865433322111 1110                


Q ss_pred             HHHHHHHHHHHhhhhcCCCCCCccccccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCCCCchhhhHHHHHHHHhh
Q 027721           92 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ  171 (219)
Q Consensus        92 ~v~~~l~~li~~~~~~~~~~~~~~~~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAq  171 (219)
                        .-++..++...   ..+      ...-++-+|..|+--.+.+.   +..-+-||.++-+||-..+. =.+.-.+..-+
T Consensus        69 --~gkilt~lhd~---~~~------g~a~~~~~lqiaql~lkhR~---nk~q~qriVaFvgSpi~ese-deLirlak~lk  133 (243)
T COG5148          69 --RGKILTFLHDI---RLH------GGADIMRCLQIAQLILKHRD---NKGQRQRIVAFVGSPIQESE-DELIRLAKQLK  133 (243)
T ss_pred             --hhHHHHHhccc---ccc------CcchHHHHHHHHHHHHhccc---CCccceEEEEEecCcccccH-HHHHHHHHHHH
Confidence              11222333221   111      13446778888876665542   23456899999999976655 45666778889


Q ss_pred             hCCceeeEeeeCC
Q 027721          172 RSMVPIDSCYLGA  184 (219)
Q Consensus       172 K~~V~IDvc~L~~  184 (219)
                      |.+|.||....|+
T Consensus       134 knnVAidii~fGE  146 (243)
T COG5148         134 KNNVAIDIIFFGE  146 (243)
T ss_pred             hcCeeEEEEehhh
Confidence            9999999999984


No 50 
>cd00529 RuvC_resolvase Holliday junction resolvases (HJRs) are endonucleases that specifically resolve Holliday junction DNA intermediates during homologous recombination.  HJR's occur in archaea, bacteria, and in the mitochondria of certain fungi, however this CD includes only the bacterial and mitochondrial HJR's.  These are referred to as the RuvC family of Holliday junction resolvases, RuvC being the E.coli HJR.  RuvC and its orthologs are homodimers and are structurely similar to RNase H and Hsp70.
Probab=28.02  E-value=35  Score=27.44  Aligned_cols=15  Identities=13%  Similarity=0.116  Sum_probs=13.0

Q ss_pred             chhhHHHHHHHHHhh
Q 027721          120 LLSGSLSMALCYIQR  134 (219)
Q Consensus       120 ~LagALs~ALCyINR  134 (219)
                      -.|=||+.|+||++|
T Consensus       139 d~aDAlaiA~~~~~~  153 (154)
T cd00529         139 DAADALAVAITHAHW  153 (154)
T ss_pred             CHHHHHHHHHHHHhh
Confidence            367799999999987


No 51 
>COG4867 Uncharacterized protein with a von Willebrand factor type A (vWA) domain [General function prediction only]
Probab=27.50  E-value=2.2e+02  Score=27.71  Aligned_cols=92  Identities=20%  Similarity=0.156  Sum_probs=64.5

Q ss_pred             cccchhhHHHHHHHHHhhhhccCCCCCCceEEEEecCC--------CC---chhhhHHHHHHH-------HhhhCCceee
Q 027721          117 ACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSP--------DG---PEQYVAIMNAIF-------SAQRSMVPID  178 (219)
Q Consensus       117 ~~s~LagALs~ALCyINR~~~~~~~~~~~RILvis~S~--------d~---~~QYi~lMN~iF-------aAqK~~V~ID  178 (219)
                      ..+.+.+||.+|=-+..|.     .+.++-||||+-..        |.   -..|-+==-.+|       -..|.++-|.
T Consensus       531 qgTNlhhaL~LA~r~l~Rh-----~~~~~~il~vTDGePtAhle~~DG~~~~f~yp~DP~t~~~Tvr~~d~~~r~G~q~t  605 (652)
T COG4867         531 QGTNLHHALALAGRHLRRH-----AGAQPVVLVVTDGEPTAHLEDGDGTSVFFDYPPDPRTIAHTVRGFDDMARLGAQVT  605 (652)
T ss_pred             cccchHHHHHHHHHHHHhC-----cccCceEEEEeCCCccccccCCCCceEecCCCCChhHHHHHHHHHHHHHhccceee
Confidence            4677999999998888776     47788999998533        21   122211111222       2368999999


Q ss_pred             EeeeCCCC--chhhhhh-------ccccchhHHHHHHHHhcCCC
Q 027721          179 SCYLGAQN--SAFLQQC-------MSDRLLTLLVEYIISLNNWM  213 (219)
Q Consensus       179 vc~L~~~~--s~fLQQA-------~~~~~~~gLlQyLl~~~~~~  213 (219)
                      .+.||...  -.|+||.       ...++++||-||.+.-|+..
T Consensus       606 ~FrLg~DpgL~~Fv~qva~rv~G~vv~pdldglGaaVvgdylr~  649 (652)
T COG4867         606 IFRLGSDPGLARFIDQVARRVQGRVVVPDLDGLGAAVVGDYLRF  649 (652)
T ss_pred             EEeecCCHhHHHHHHHHHHHhCCeEEecCcchhhHHHHHHHHhh
Confidence            99997432  3499998       77899999999998877654


No 52 
>COG4245 TerY Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [General function prediction only]
Probab=25.92  E-value=4.3e+02  Score=22.79  Aligned_cols=97  Identities=18%  Similarity=0.260  Sum_probs=58.9

Q ss_pred             ccchhhHHHHHHHHHhhhhcc----CCCCCCceEEEEecCCCCchhhhHHHHHHHHhhhCCceeeEeeeCCC--Cchhhh
Q 027721          118 CSLLSGSLSMALCYIQRVFRS----GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ--NSAFLQ  191 (219)
Q Consensus       118 ~s~LagALs~ALCyINR~~~~----~~~~~~~RILvis~S~d~~~QYi~lMN~iFaAqK~~V~IDvc~L~~~--~s~fLQ  191 (219)
                      .+.|..||.+|+=.|-+..++    .....+|=+++++-+.- ...|-.=-+-.|.-.+-+..+=.|.+|.+  +-.-|+
T Consensus        78 gT~lGaAl~~a~d~Ie~~~~~~~a~~kgdyrP~vfLiTDG~P-tD~w~~~~~~~~~~~~~~k~v~a~~~G~~~ad~~~L~  156 (207)
T COG4245          78 GTPLGAALTLALDMIEERKRKYDANGKGDYRPWVFLITDGEP-TDDWQAGAALVFQGERRAKSVAAFSVGVQGADNKTLN  156 (207)
T ss_pred             CCchHHHHHHHHHHHHHHHhhcccCCccccceEEEEecCCCc-chHHHhHHHHhhhcccccceEEEEEecccccccHHHH
Confidence            677999999999888776443    22457888887774332 44443333444444444455557888744  678899


Q ss_pred             hh---------ccccchhHHHHHHH-HhcCCCCC
Q 027721          192 QC---------MSDRLLTLLVEYII-SLNNWMGS  215 (219)
Q Consensus       192 QA---------~~~~~~~gLlQyLl-~~~~~~~~  215 (219)
                      |.         ++..+..+++.++- ++-..|+|
T Consensus       157 qit~~V~~~~t~d~~~f~~fFkW~SaSisagS~S  190 (207)
T COG4245         157 QITEKVRQFLTLDGLQFREFFKWLSASISAGSRS  190 (207)
T ss_pred             HHHHhhccccccchHHHHHHHHHHHHHhhccccc
Confidence            98         33344455666653 34444544


No 53 
>TIGR00228 ruvC crossover junction endodeoxyribonuclease RuvC. Endonuclease that resolves Holliday junction intermediates in genetic recombination. The active form of the protein is a dimer. Structure studies reveals that the catalytic center, comprised of four acidic residues, lies at the bottom of a cleft that fits a DNA duplex. The model hits a single Synechocystis PCC6803 protein at a score of 30, below the trusted cutoff, that appears orthologous and may act as authentic RuvC.
Probab=24.29  E-value=40  Score=27.73  Aligned_cols=17  Identities=12%  Similarity=0.126  Sum_probs=14.2

Q ss_pred             hhhHHHHHHHHHhhhhc
Q 027721          121 LSGSLSMALCYIQRVFR  137 (219)
Q Consensus       121 LagALs~ALCyINR~~~  137 (219)
                      -|=||+.|+||.+|..+
T Consensus       137 aaDALAiAi~h~~~~~~  153 (156)
T TIGR00228       137 AADALAIAITHAHVSQN  153 (156)
T ss_pred             HHHHHHHHHHHhcccCc
Confidence            56799999999999753


No 54 
>PF11373 DUF3175:  Protein of unknown function (DUF3175);  InterPro: IPR021513 This entry is represented by Ralstonia phage RSL1, Orf186. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.  This family of proteins with unknown function appears to be restricted to Proteobacteria. 
Probab=22.12  E-value=1.4e+02  Score=22.19  Aligned_cols=22  Identities=23%  Similarity=0.344  Sum_probs=19.4

Q ss_pred             cccchhhHHHHHHHHHhhhhcc
Q 027721          117 ACSLLSGSLSMALCYIQRVFRS  138 (219)
Q Consensus       117 ~~s~LagALs~ALCyINR~~~~  138 (219)
                      +.++.-+||||==-||||..+.
T Consensus        43 k~~pf~SAMSML~FYINRAGk~   64 (86)
T PF11373_consen   43 KATPFRSAMSMLNFYINRAGKN   64 (86)
T ss_pred             CCCchHHHHHHHHHHHhccccc
Confidence            5778999999999999998765


Done!